1
|
Walker AS, Clardy J. Primed for Discovery. Biochemistry 2024; 63:2705-2713. [PMID: 39497571 PMCID: PMC11542185 DOI: 10.1021/acs.biochem.4c00464] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2024] [Revised: 10/02/2024] [Accepted: 10/03/2024] [Indexed: 11/09/2024]
Abstract
Antibiotics are essential components of current medical practice, but their effectiveness is being eroded by the increasing emergence of antimicrobial-resistant infections. At the same time, the rate of antibiotic discovery has slowed, and our future ability to treat infections is threatened. Among Christopher T. Walsh's many contributions to science was his early recognition of this threat and the potential of biosynthesis─genes and mechanisms─to contribute solutions. Here, we revisit a 2006 review by Walsh and co-workers that highlighted a major challenge in antibiotic natural product discovery: the daunting odds for identifying new naturally occurring antibiotics. The review described strategies to mitigate the odds challenge. These strategies have been used extensively by the natural product discovery community in the years since and have resulted in some promising discoveries. Despite these advances, the rarity of novel antibiotic natural products remains a barrier to discovery. We compare the challenge of discovering natural product antibiotics to the process of identifying new prime numbers, which are also challenging to find and an essential, if underappreciated, element of modern life. We propose that inclusion of filters for functional compounds early in the discovery pipeline is key to natural product antibiotic discovery, review some recent advances that enable this, and discuss some remaining challenges that need to be addressed to make antibiotic discovery sustainable in the future.
Collapse
Affiliation(s)
- Allison S. Walker
- Department
of Chemistry, Vanderbilt University, Nashville, Tennessee 37240, United States
- Department
of Biological Sciences, Vanderbilt University, Nashville, Tennessee 37232, United States
- Department
of Pathology, Microbiology, and Immunology, Vanderbilt University Medical Center, Nashville, Tennessee 37232, United States
| | - Jon Clardy
- Department
of Biological Chemistry and Molecular Pharmacology, Harvard Medical School and Blavatnik Institute, Boston, Massachusetts 02115, United States
| |
Collapse
|
2
|
Zulfiqar M, Singh V, Steinbeck C, Sorokina M. Review on computer-assisted biosynthetic capacities elucidation to assess metabolic interactions and communication within microbial communities. Crit Rev Microbiol 2024; 50:1053-1092. [PMID: 38270170 DOI: 10.1080/1040841x.2024.2306465] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2023] [Revised: 11/17/2023] [Accepted: 01/12/2024] [Indexed: 01/26/2024]
Abstract
Microbial communities thrive through interactions and communication, which are challenging to study as most microorganisms are not cultivable. To address this challenge, researchers focus on the extracellular space where communication events occur. Exometabolomics and interactome analysis provide insights into the molecules involved in communication and the dynamics of their interactions. Advances in sequencing technologies and computational methods enable the reconstruction of taxonomic and functional profiles of microbial communities using high-throughput multi-omics data. Network-based approaches, including community flux balance analysis, aim to model molecular interactions within and between communities. Despite these advances, challenges remain in computer-assisted biosynthetic capacities elucidation, requiring continued innovation and collaboration among diverse scientists. This review provides insights into the current state and future directions of computer-assisted biosynthetic capacities elucidation in studying microbial communities.
Collapse
Affiliation(s)
- Mahnoor Zulfiqar
- Institute for Inorganic and Analytical Chemistry, Friedrich Schiller University, Jena, Germany
- Cluster of Excellence Balance of the Microverse, Friedrich Schiller University Jena, Jena, Germany
| | - Vinay Singh
- Institute for Inorganic and Analytical Chemistry, Friedrich Schiller University, Jena, Germany
| | - Christoph Steinbeck
- Institute for Inorganic and Analytical Chemistry, Friedrich Schiller University, Jena, Germany
- Cluster of Excellence Balance of the Microverse, Friedrich Schiller University Jena, Jena, Germany
| | - Maria Sorokina
- Institute for Inorganic and Analytical Chemistry, Friedrich Schiller University, Jena, Germany
- Data Science and Artificial Intelligence, Research and Development, Pharmaceuticals, Bayer, Berlin, Germany
| |
Collapse
|
3
|
Helf MJ, Buntin K, Klančar A, Rust M, Petersen F, Pistorius D, Weber E, Wong J, Krastel P. Scaling up for success: from bioactive natural products to new medicines. Nat Prod Rep 2024. [PMID: 39129507 DOI: 10.1039/d4np00022f] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/13/2024]
Abstract
Covering 1986 to presentNatural product drug discovery at Novartis has a long and successful history of delivering life saving medicines to millions of patients. In this viewpoint, we are presenting the tools we use and challenges we face as we advance natural products from early research into development and beyond. We are leveraging our collection of 90 000 microbial strains and 20 000 isolated natural products to find new medications in an interdisciplinary approach that requires expertise in microbiology, computational biology, synthetic biology, chemistry, and process development. Technological advances, particularly in genome engineering and data science have transformed our field, accelerating discovery and facilitating sustainable compound supply. Emerging new modalities such as antibody drug conjugates, radioligand therapies and xRNA-based medications offer new opportunities for natural product-derived drugs. By taking advantage of these new modalities and the most recent research technologies, natural products will significantly contribute to the medicines of the future.
Collapse
Affiliation(s)
| | - Kathrin Buntin
- Biomedical Research, Novartis Pharma AG, 4002 Basel, Switzerland.
| | | | - Michael Rust
- Biomedical Research, Novartis Pharma AG, 4002 Basel, Switzerland.
| | - Frank Petersen
- Biomedical Research, Novartis Pharma AG, 4002 Basel, Switzerland.
| | | | - Eric Weber
- Biomedical Research, Novartis Pharma AG, 4002 Basel, Switzerland.
| | - Joanne Wong
- Biomedical Research, Novartis Pharma AG, 4002 Basel, Switzerland.
| | - Philipp Krastel
- Biomedical Research, Novartis Pharma AG, 4002 Basel, Switzerland.
| |
Collapse
|
4
|
Yancey CE, Hart L, Hefferan S, Mohamed OG, Newmister SA, Tripathi A, Sherman DH, Dick GJ. Metabologenomics reveals strain-level genetic and chemical diversity of Microcystis secondary metabolism. mSystems 2024; 9:e0033424. [PMID: 38916306 PMCID: PMC11264947 DOI: 10.1128/msystems.00334-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2024] [Accepted: 04/22/2024] [Indexed: 06/26/2024] Open
Abstract
Microcystis spp. are renowned for producing the hepatotoxin microcystin in freshwater cyanobacterial harmful algal blooms around the world, threatening drinking water supplies and public and environmental health. However, Microcystis genomes also harbor numerous biosynthetic gene clusters (BGCs) encoding the biosynthesis of other secondary metabolites, including many with toxic properties. Most of these BGCs are uncharacterized and currently lack links to biosynthesis products. However, recent field studies show that many of these BGCs are abundant and transcriptionally active in natural communities, suggesting potentially important yet unknown roles in bloom ecology and water quality. Here, we analyzed 21 xenic Microcystis cultures isolated from western Lake Erie to investigate the diversity of the biosynthetic potential of this genus. Through metabologenomic and in silico approaches, we show that these Microcystis strains contain variable BGCs, previously observed in natural populations, and encode distinct metabolomes across cultures. Additionally, we find that the majority of metabolites and gene clusters are uncharacterized, highlighting our limited understanding of the chemical repertoire of Microcystis spp. Due to the complex metabolomes observed in culture, which contain a wealth of diverse congeners as well as unknown metabolites, these results underscore the need to deeply explore and identify secondary metabolites produced by Microcystis beyond microcystins to assess their impacts on human and environmental health.IMPORTANCEThe genus Microcystis forms dense cyanobacterial harmful algal blooms (cyanoHABs) and can produce the toxin microcystin, which has been responsible for drinking water crises around the world. While microcystins are of great concern, Microcystis also produces an abundance of other secondary metabolites that may be of interest due to their potential for toxicity, ecological importance, or pharmaceutical applications. In this study, we combine genomic and metabolomic approaches to study the genes responsible for the biosynthesis of secondary metabolites as well as the chemical diversity of produced metabolites in Microcystis strains from the Western Lake Erie Culture Collection. This unique collection comprises Microcystis strains that were directly isolated from western Lake Erie, which experiences substantial cyanoHAB events annually and has had negative impacts on drinking water, tourism, and industry.
Collapse
Affiliation(s)
- Colleen E. Yancey
- Department of Earth and Environmental Sciences, University of Michigan, Ann Arbor, Michigan, USA
| | - Lauren Hart
- Program in Chemical Biology, University of Michigan, Ann Arbor, Michigan, USA
- Life Sciences Institute, University of Michigan, Ann Arbor, Michigan, USA
| | - Sierra Hefferan
- Program in Chemical Biology, University of Michigan, Ann Arbor, Michigan, USA
- Departments of Medicinal Chemistry, Chemistry, Microbiology, and Immunology, University of Michigan, Ann Arbor, Michigan, USA
| | - Osama G. Mohamed
- Life Sciences Institute, University of Michigan, Ann Arbor, Michigan, USA
- Natural Products Discovery Core, Life Sciences Institute, University of Michigan, Ann Arbor, Michigan, USA
- Pharmacognosy Department, Faculty of Pharmacy, Cairo University, Cairo, Egypt
| | - Sean A. Newmister
- Life Sciences Institute, University of Michigan, Ann Arbor, Michigan, USA
| | - Ashootosh Tripathi
- Life Sciences Institute, University of Michigan, Ann Arbor, Michigan, USA
- Natural Products Discovery Core, Life Sciences Institute, University of Michigan, Ann Arbor, Michigan, USA
- Pharmacognosy Department, Faculty of Pharmacy, Cairo University, Cairo, Egypt
| | - David H. Sherman
- Life Sciences Institute, University of Michigan, Ann Arbor, Michigan, USA
- Departments of Medicinal Chemistry, Chemistry, Microbiology, and Immunology, University of Michigan, Ann Arbor, Michigan, USA
- Pharmacognosy Department, Faculty of Pharmacy, Cairo University, Cairo, Egypt
| | - Gregory J. Dick
- Department of Earth and Environmental Sciences, University of Michigan, Ann Arbor, Michigan, USA
- Program in Chemical Biology, University of Michigan, Ann Arbor, Michigan, USA
- Cooperative Institute for Great Lakes Research (CIGLR), School for Environment and Sustainability, University of Michigan, Ann Arbor, Michigan, USA
| |
Collapse
|
5
|
Yan D, Zhou M, Adduri A, Zhuang Y, Guler M, Liu S, Shin H, Kovach T, Oh G, Liu X, Deng Y, Wang X, Cao L, Sherman DH, Schultz PJ, Kersten RD, Clement JA, Tripathi A, Behsaz B, Mohimani H. Discovering type I cis-AT polyketides through computational mass spectrometry and genome mining with Seq2PKS. Nat Commun 2024; 15:5356. [PMID: 38918378 PMCID: PMC11199612 DOI: 10.1038/s41467-024-49587-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2023] [Accepted: 06/12/2024] [Indexed: 06/27/2024] Open
Abstract
Type 1 polyketides are a major class of natural products used as antiviral, antibiotic, antifungal, antiparasitic, immunosuppressive, and antitumor drugs. Analysis of public microbial genomes leads to the discovery of over sixty thousand type 1 polyketide gene clusters. However, the molecular products of only about a hundred of these clusters are characterized, leaving most metabolites unknown. Characterizing polyketides relies on bioactivity-guided purification, which is expensive and time-consuming. To address this, we present Seq2PKS, a machine learning algorithm that predicts chemical structures derived from Type 1 polyketide synthases. Seq2PKS predicts numerous putative structures for each gene cluster to enhance accuracy. The correct structure is identified using a variable mass spectral database search. Benchmarks show that Seq2PKS outperforms existing methods. Applying Seq2PKS to Actinobacteria datasets, we discover biosynthetic gene clusters for monazomycin, oasomycin A, and 2-aminobenzamide-actiphenol.
Collapse
Affiliation(s)
- Donghui Yan
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA
| | - Muqing Zhou
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA
| | - Abhinav Adduri
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA
| | - Yihao Zhuang
- Natural Products Discovery Core, University of Michigan, Ann Arbor, MI, USA
| | - Mustafa Guler
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA
| | - Sitong Liu
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA
| | - Hyonyoung Shin
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA
| | - Torin Kovach
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA
| | - Gloria Oh
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA
| | - Xiao Liu
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA
| | - Yuting Deng
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA
| | - Xiaofeng Wang
- Department of Medicinal Chemistry, University of Michigan, Ann Arbor, MI, USA
| | - Liu Cao
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA
| | - David H Sherman
- Department of Medicinal Chemistry, University of Michigan, Ann Arbor, MI, USA
- Life Sciences Institute, University of Michigan, Ann Arbor, MI, USA
| | - Pamela J Schultz
- Natural Products Discovery Core, University of Michigan, Ann Arbor, MI, USA
- Life Sciences Institute, University of Michigan, Ann Arbor, MI, USA
| | - Roland D Kersten
- Department of Medicinal Chemistry, University of Michigan, Ann Arbor, MI, USA
| | | | - Ashootosh Tripathi
- Natural Products Discovery Core, University of Michigan, Ann Arbor, MI, USA.
- Department of Medicinal Chemistry, University of Michigan, Ann Arbor, MI, USA.
- Life Sciences Institute, University of Michigan, Ann Arbor, MI, USA.
| | - Bahar Behsaz
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA.
- Chemia Biosciences Inc, Pittsburgh, PA, USA.
| | - Hosein Mohimani
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA.
| |
Collapse
|
6
|
Ferrinho S, Connaris H, Mouncey NJ, Goss RJM. Compendium of Metabolomic and Genomic Datasets for Cyanobacteria: Mined the Gap. WATER RESEARCH 2024; 256:121492. [PMID: 38593604 DOI: 10.1016/j.watres.2024.121492] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/19/2023] [Revised: 03/09/2024] [Accepted: 03/18/2024] [Indexed: 04/11/2024]
Abstract
Cyanobacterial blooms, producing toxic secondary metabolites, are becoming increasingly common phenomena in the face of rising global temperatures. They are the world's most abundant photosynthetic organisms, largely owing their success to a range of highly diverse and complex natural products possessing a broad spectrum of different bioactivities. Over 2600 compounds have been isolated from cyanobacteria thus far, and their characterisation has revealed unusual and useful chemistries and motifs including alkynes, halogens, and non-canonical amino acids. Genome sequencing of cyanobacteria lags behind natural product isolation, with only 19% of cyanobacterial natural products associated with a sequenced organism. Recent advances in meta(genomics) provide promise to narrow this gap and has also facilitated the uprise of combined genomic and metabolomic approaches, heralding a new era of discovery of novel compounds. Analyses of the datasets described within this manuscript reveal the asynchrony of current genomic and metabolomic data, highlight the chemical diversity of cyanobacterial natural products. Linked to this manuscript, we make these manually curated datasets freely accessible for the public to facilitate further research in this important area.
Collapse
Affiliation(s)
- Scarlet Ferrinho
- School of Chemistry, University of St Andrews, North Haugh, St Andrews, Fife, UK
| | - Helen Connaris
- School of Chemistry, University of St Andrews, North Haugh, St Andrews, Fife, UK
| | - Nigel J Mouncey
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA; Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
| | - Rebecca J M Goss
- School of Chemistry, University of St Andrews, North Haugh, St Andrews, Fife, UK.
| |
Collapse
|
7
|
Baunach M, Guljamow A, Miguel-Gordo M, Dittmann E. Harnessing the potential: advances in cyanobacterial natural product research and biotechnology. Nat Prod Rep 2024; 41:347-369. [PMID: 38088806 DOI: 10.1039/d3np00045a] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/21/2024]
Abstract
Covering: 2000 to 2023Cyanobacteria produce a variety of bioactive natural products that can pose a threat to humans and animals as environmental toxins, but also have potential for or inspire pharmaceutical use. As oxygenic phototrophs, cyanobacteria furthermore hold great promise for sustainable biotechnology. Yet, the necessary tools for exploiting their biotechnological potential have so far been established only for a few model strains of cyanobacteria, while large untapped biosynthetic resources are hidden in slow-growing cyanobacterial genera that are difficult to access by genetic techniques. In recent years, several approaches have been developed to circumvent the bottlenecks in cyanobacterial natural product research. Here, we summarize current progress that has been made in unlocking or characterizing cryptic metabolic pathways using integrated omics techniques, orphan gene cluster activation, use of genetic approaches in original producers, heterologous expression and chemo-enzymatic techniques. We are mainly highlighting genomic mining concepts and strategies towards high-titer production of cyanobacterial natural products from the last 10 years and discuss the need for further research developments in this field.
Collapse
Affiliation(s)
- Martin Baunach
- University of Potsdam, Institute of Biochemistry and Biology, Karl-Liebknecht-Str. 24/25, 14476 Potsdam, Germany.
- University of Bonn, Institute of Pharmaceutical Biology, Nußallee 6, 53115 Bonn, Germany
| | - Arthur Guljamow
- University of Potsdam, Institute of Biochemistry and Biology, Karl-Liebknecht-Str. 24/25, 14476 Potsdam, Germany.
| | - María Miguel-Gordo
- University of Potsdam, Institute of Biochemistry and Biology, Karl-Liebknecht-Str. 24/25, 14476 Potsdam, Germany.
| | - Elke Dittmann
- University of Potsdam, Institute of Biochemistry and Biology, Karl-Liebknecht-Str. 24/25, 14476 Potsdam, Germany.
| |
Collapse
|
8
|
Hamamsy T, Barot M, Morton JT, Steinegger M, Bonneau R, Cho K. Learning sequence, structure, and function representations of proteins with language models. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.11.26.568742. [PMID: 38045331 PMCID: PMC10690258 DOI: 10.1101/2023.11.26.568742] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/05/2023]
Abstract
The sequence-structure-function relationships that ultimately generate the diversity of extant observed proteins is complex, as proteins bridge the gap between multiple informational and physical scales involved in nearly all cellular processes. One limitation of existing protein annotation databases such as UniProt is that less than 1% of proteins have experimentally verified functions, and computational methods are needed to fill in the missing information. Here, we demonstrate that a multi-aspect framework based on protein language models can learn sequence-structure-function representations of amino acid sequences, and can provide the foundation for sensitive sequence-structure-function aware protein sequence search and annotation. Based on this model, we introduce a multi-aspect information retrieval system for proteins, Protein-Vec, covering sequence, structure, and function aspects, that enables computational protein annotation and function prediction at tree-of-life scales.
Collapse
|
9
|
Mullowney MW, Duncan KR, Elsayed SS, Garg N, van der Hooft JJJ, Martin NI, Meijer D, Terlouw BR, Biermann F, Blin K, Durairaj J, Gorostiola González M, Helfrich EJN, Huber F, Leopold-Messer S, Rajan K, de Rond T, van Santen JA, Sorokina M, Balunas MJ, Beniddir MA, van Bergeijk DA, Carroll LM, Clark CM, Clevert DA, Dejong CA, Du C, Ferrinho S, Grisoni F, Hofstetter A, Jespers W, Kalinina OV, Kautsar SA, Kim H, Leao TF, Masschelein J, Rees ER, Reher R, Reker D, Schwaller P, Segler M, Skinnider MA, Walker AS, Willighagen EL, Zdrazil B, Ziemert N, Goss RJM, Guyomard P, Volkamer A, Gerwick WH, Kim HU, Müller R, van Wezel GP, van Westen GJP, Hirsch AKH, Linington RG, Robinson SL, Medema MH. Artificial intelligence for natural product drug discovery. Nat Rev Drug Discov 2023; 22:895-916. [PMID: 37697042 DOI: 10.1038/s41573-023-00774-7] [Citation(s) in RCA: 33] [Impact Index Per Article: 33.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/20/2023] [Indexed: 09/13/2023]
Abstract
Developments in computational omics technologies have provided new means to access the hidden diversity of natural products, unearthing new potential for drug discovery. In parallel, artificial intelligence approaches such as machine learning have led to exciting developments in the computational drug design field, facilitating biological activity prediction and de novo drug design for molecular targets of interest. Here, we describe current and future synergies between these developments to effectively identify drug candidates from the plethora of molecules produced by nature. We also discuss how to address key challenges in realizing the potential of these synergies, such as the need for high-quality datasets to train deep learning algorithms and appropriate strategies for algorithm validation.
Collapse
Affiliation(s)
| | - Katherine R Duncan
- Strathclyde Institute of Pharmacy and Biomedical Sciences, University of Strathclyde, Glasgow, UK
| | - Somayah S Elsayed
- Department of Molecular Biotechnology, Institute of Biology, Leiden University, Leiden, The Netherlands
| | - Neha Garg
- School of Chemistry and Biochemistry, Center for Microbial Dynamics and Infection, Georgia Institute of Technology, Atlanta, GA, USA
| | - Justin J J van der Hooft
- Bioinformatics Group, Wageningen University, Wageningen, The Netherlands
- Department of Biochemistry, University of Johannesburg, Johannesburg, South Africa
| | - Nathaniel I Martin
- Biological Chemistry Group, Institute of Biology, Leiden University, Leiden, The Netherlands
| | - David Meijer
- Bioinformatics Group, Wageningen University, Wageningen, The Netherlands
| | - Barbara R Terlouw
- Bioinformatics Group, Wageningen University, Wageningen, The Netherlands
| | - Friederike Biermann
- Bioinformatics Group, Wageningen University, Wageningen, The Netherlands
- Institute of Molecular Bio Science, Goethe-University Frankfurt, Frankfurt am Main, Germany
- LOEWE Center for Translational Biodiversity Genomics (TBG), Frankfurt am Main, Germany
| | - Kai Blin
- The Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kongens Lyngby, Denmark
| | | | - Marina Gorostiola González
- Drug Discovery and Safety, Leiden Academic Centre for Drug Research, Leiden, The Netherlands
- ONCODE institute, Leiden, The Netherlands
| | - Eric J N Helfrich
- Institute of Molecular Bio Science, Goethe-University Frankfurt, Frankfurt am Main, Germany
- LOEWE Center for Translational Biodiversity Genomics (TBG), Frankfurt am Main, Germany
| | - Florian Huber
- Center for Digitalization and Digitality, Hochschule Düsseldorf, Düsseldorf, Germany
| | - Stefan Leopold-Messer
- Institut für Mikrobiologie, Eidgenössische Technische Hochschule (ETH) Zürich, Zürich, Switzerland
| | - Kohulan Rajan
- Institute for Inorganic and Analytical Chemistry, Friedrich-Schiller-University Jena, Jena, Germany
| | - Tristan de Rond
- School of Chemical Sciences, University of Auckland, Auckland, New Zealand
| | - Jeffrey A van Santen
- Department of Chemistry, Simon Fraser University, Burnaby, British Columbia, Canada
| | - Maria Sorokina
- Institute for Inorganic and Analytical Chemistry, Friedrich-Schiller University, Jena, Germany
- Pharmaceuticals R&D, Bayer AG, Berlin, Germany
| | - Marcy J Balunas
- Department of Microbiology and Immunology, University of Michigan, Ann Arbor, MI, USA
- Department of Medicinal Chemistry, University of Michigan, Ann Arbor, MI, USA
| | - Mehdi A Beniddir
- Équipe "Chimie des Substances Naturelles", Université Paris-Saclay, CNRS, BioCIS, Orsay, France
| | - Doris A van Bergeijk
- Department of Molecular Biotechnology, Institute of Biology, Leiden University, Leiden, The Netherlands
| | - Laura M Carroll
- Structural and Computational Biology Unit, EMBL, Heidelberg, Germany
| | - Chase M Clark
- Division of Pharmaceutical Sciences, School of Pharmacy, University of Wisconsin-Madison, Madison, WI, USA
| | | | | | - Chao Du
- Department of Molecular Biotechnology, Institute of Biology, Leiden University, Leiden, The Netherlands
| | | | - Francesca Grisoni
- Institute for Complex Molecular Systems, Department of Biomedical Engineering, Eindhoven University of Technology, Eindhoven, The Netherlands
- Centre for Living Technologies, Alliance TU/e, WUR, UU, UMC Utrecht, Utrecht, The Netherlands
| | | | - Willem Jespers
- Drug Discovery and Safety, Leiden Academic Centre for Drug Research, Leiden, The Netherlands
| | - Olga V Kalinina
- Helmholtz Institute for Pharmaceutical Research Saarland (HIPS), Helmholtz Centre for Infection Research (HZI), Saarbrücken, Germany
- Drug Bioinformatics, Medical Faculty, Saarland University, Homburg, Germany
- Center for Bioinformatics, Saarland University, Saarbrücken, Germany
| | | | - Hyunwoo Kim
- College of Pharmacy and Integrated Research Institute for Drug Development, Dongguk University Seoul, Goyang-si, Republic of Korea
| | - Tiago F Leao
- Center for Nuclear Energy in Agriculture, University of São Paulo, Piracicaba, Brazil
| | - Joleen Masschelein
- Center for Microbiology, VIB-KU Leuven, Heverlee, Belgium
- Department of Biology, KU Leuven, Heverlee, Belgium
| | - Evan R Rees
- Division of Pharmaceutical Sciences, School of Pharmacy, University of Wisconsin-Madison, Madison, WI, USA
| | - Raphael Reher
- Institute of Pharmaceutical Biology and Biotechnology, University of Marburg, Marburg, Germany
- Institute of Pharmacy, Martin-Luther-University Halle-Wittenberg, Halle (Saale), Germany
| | - Daniel Reker
- Department of Biomedical Engineering, Duke University, Durham, NC, USA
- Duke Microbiome Center, Duke University, Durham, NC, USA
| | - Philippe Schwaller
- Laboratory of Artificial Chemical Intelligence, Institut des Sciences et Ingénierie Chimiques, Ecole Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
| | | | - Michael A Skinnider
- Adapsyn Bioscience, Hamilton, Ontario, Canada
- Michael Smith Laboratories, University of British Columbia, Vancouver, British Columbia, Canada
| | - Allison S Walker
- Department of Chemistry, Vanderbilt University, Nashville, TN, USA
- Department of Biological Sciences, Vanderbilt University, Nashville, TN, USA
| | - Egon L Willighagen
- Department of Bioinformatics - BiGCaT, NUTRIM, Maastricht University, Maastricht, The Netherlands
| | - Barbara Zdrazil
- European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Cambridgeshire, UK
| | - Nadine Ziemert
- Interfaculty Institute for Microbiology and Infection Medicine Tuebingen (IMIT), Institute for Bioinformatics and Medical Informatics (IBMI), University of Tuebingen, Tuebingen, Germany
| | | | - Pierre Guyomard
- Bonsai team, CRIStAL - Centre de Recherche en Informatique Signal et Automatique de Lille, Université de Lille, Villeneuve d'Ascq Cedex, France
| | - Andrea Volkamer
- Center for Bioinformatics, Saarland University, Saarbrücken, Germany
- In silico Toxicology and Structural Bioinformatics, Institute of Physiology, Charité - Universitätsmedizin Berlin, Berlin, Germany
| | - William H Gerwick
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA, USA
| | - Hyun Uk Kim
- Department of Chemical and Biomolecular Engineering, Korea Advanced Institute of Science and Technology (KAIST), Daejeon, Korea
| | - Rolf Müller
- Helmholtz Institute for Pharmaceutical Research Saarland (HIPS), Helmholtz Centre for Infection Research (HZI), Saarbrücken, Germany
- Department of Pharmacy, Saarland University, Saarbrücken, Germany
- German Center for infection research (DZIF), Braunschweig, Germany
- Helmholtz International Lab for Anti-Infectives, Saarbrücken, Germany
| | - Gilles P van Wezel
- Department of Molecular Biotechnology, Institute of Biology, Leiden University, Leiden, The Netherlands
- Netherlands Institute of Ecology, NIOO-KNAW, Wageningen, The Netherlands
| | - Gerard J P van Westen
- Drug Discovery and Safety, Leiden Academic Centre for Drug Research, Leiden, The Netherlands.
| | - Anna K H Hirsch
- Helmholtz Institute for Pharmaceutical Research Saarland (HIPS), Helmholtz Centre for Infection Research (HZI), Saarbrücken, Germany.
- Department of Pharmacy, Saarland University, Saarbrücken, Germany.
- German Center for infection research (DZIF), Braunschweig, Germany.
- Helmholtz International Lab for Anti-Infectives, Saarbrücken, Germany.
| | - Roger G Linington
- Department of Chemistry, Simon Fraser University, Burnaby, British Columbia, Canada.
| | - Serina L Robinson
- Department of Environmental Microbiology, Eawag: Swiss Federal Institute for Aquatic Science and Technology, Dübendorf, Switzerland.
| | - Marnix H Medema
- Bioinformatics Group, Wageningen University, Wageningen, The Netherlands.
- Institute of Biology, Leiden University, Leiden, The Netherlands.
| |
Collapse
|
10
|
Zdouc MM, van der Hooft JJJ, Medema MH. Metabolome-guided genome mining of RiPP natural products. Trends Pharmacol Sci 2023; 44:532-541. [PMID: 37391295 DOI: 10.1016/j.tips.2023.06.004] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2023] [Revised: 06/12/2023] [Accepted: 06/12/2023] [Indexed: 07/02/2023]
Abstract
Ribosomally synthesized and post-translationally modified peptides (RiPPs) are a chemically diverse class of metabolites. Many RiPPs show potent biological activities that make them attractive starting points for drug development. A promising approach for the discovery of new classes of RiPPs is genome mining. However, the accuracy of genome mining is hampered by the lack of signature genes shared across different RiPP classes. One way to reduce false-positive predictions is by complementing genomic information with metabolomics data. In recent years, several new approaches addressing such integrative genomics and metabolomics analyses have been developed. In this review, we provide a detailed discussion of RiPP-compatible software tools that integrate paired genomics and metabolomics data. We highlight current challenges in data integration and identify opportunities for further developments targeting new classes of bioactive RiPPs.
Collapse
Affiliation(s)
- Mitja M Zdouc
- Bioinformatics Group, Wageningen University & Research, Droevendaalsesteeg 1, 6708 PB, Wageningen, the Netherlands.
| | - Justin J J van der Hooft
- Bioinformatics Group, Wageningen University & Research, Droevendaalsesteeg 1, 6708 PB, Wageningen, the Netherlands; Department of Biochemistry, University of Johannesburg, Auckland Park, Johannesburg 2006, South Africa.
| | - Marnix H Medema
- Bioinformatics Group, Wageningen University & Research, Droevendaalsesteeg 1, 6708 PB, Wageningen, the Netherlands.
| |
Collapse
|
11
|
Caesar LK, Butun FA, Robey MT, Ayon NJ, Gupta R, Dainko D, Bok JW, Nickles G, Stankey RJ, Johnson D, Mead D, Cank KB, Earp CE, Raja HA, Oberlies NH, Keller NP, Kelleher NL. Correlative metabologenomics of 110 fungi reveals metabolite-gene cluster pairs. Nat Chem Biol 2023; 19:846-854. [PMID: 36879060 PMCID: PMC10313767 DOI: 10.1038/s41589-023-01276-8] [Citation(s) in RCA: 12] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2022] [Accepted: 01/31/2023] [Indexed: 03/08/2023]
Abstract
Natural products research increasingly applies -omics technologies to guide molecular discovery. While the combined analysis of genomic and metabolomic datasets has proved valuable for identifying natural products and their biosynthetic gene clusters (BGCs) in bacteria, this integrated approach lacks application to fungi. Because fungi are hyper-diverse and underexplored for new chemistry and bioactivities, we created a linked genomics-metabolomics dataset for 110 Ascomycetes, and optimized both gene cluster family (GCF) networking parameters and correlation-based scoring for pairing fungal natural products with their BGCs. Using a network of 3,007 GCFs (organized from 7,020 BGCs), we examined 25 known natural products originating from 16 known BGCs and observed statistically significant associations between 21 of these compounds and their validated BGCs. Furthermore, the scalable platform identified the BGC for the pestalamides, demystifying its biogenesis, and revealed more than 200 high-scoring natural product-GCF linkages to direct future discovery.
Collapse
Affiliation(s)
- Lindsay K Caesar
- Department of Chemistry, Northwestern University, Evanston, IL, USA
| | - Fatma A Butun
- Department of Chemistry, Northwestern University, Evanston, IL, USA
| | - Matthew T Robey
- Department of Molecular Biosciences, Northwestern University, Evanston, IL, USA
| | - Navid J Ayon
- Department of Chemistry, Northwestern University, Evanston, IL, USA
| | - Raveena Gupta
- Department of Chemistry, Northwestern University, Evanston, IL, USA
| | - David Dainko
- Department of Chemistry, Northwestern University, Evanston, IL, USA
| | - Jin Woo Bok
- Department of Medical Microbiology and Immunology, University of Wisconsin-Madison, Madison, WI, USA
| | - Grant Nickles
- Department of Medical Microbiology and Immunology, University of Wisconsin-Madison, Madison, WI, USA
| | | | | | | | - Kristof B Cank
- Department of Chemistry and Biochemistry, University of North Carolina at Greensboro, Greensboro, NC, USA
| | - Cody E Earp
- Department of Chemistry and Biochemistry, University of North Carolina at Greensboro, Greensboro, NC, USA
| | - Huzefa A Raja
- Department of Chemistry and Biochemistry, University of North Carolina at Greensboro, Greensboro, NC, USA
| | - Nicholas H Oberlies
- Department of Chemistry and Biochemistry, University of North Carolina at Greensboro, Greensboro, NC, USA
| | - Nancy P Keller
- Department of Medical Microbiology and Immunology, University of Wisconsin-Madison, Madison, WI, USA
- Department of Bacteriology, University of Wisconsin-Madison, Madison, WI, USA
| | - Neil L Kelleher
- Department of Chemistry, Northwestern University, Evanston, IL, USA.
- Department of Molecular Biosciences, Northwestern University, Evanston, IL, USA.
- Proteomics Center of Excellence, Northwestern University, Evanston, IL, USA.
| |
Collapse
|
12
|
Gaudêncio SP, Bayram E, Lukić Bilela L, Cueto M, Díaz-Marrero AR, Haznedaroglu BZ, Jimenez C, Mandalakis M, Pereira F, Reyes F, Tasdemir D. Advanced Methods for Natural Products Discovery: Bioactivity Screening, Dereplication, Metabolomics Profiling, Genomic Sequencing, Databases and Informatic Tools, and Structure Elucidation. Mar Drugs 2023; 21:md21050308. [PMID: 37233502 DOI: 10.3390/md21050308] [Citation(s) in RCA: 20] [Impact Index Per Article: 20.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2023] [Revised: 05/11/2023] [Accepted: 05/12/2023] [Indexed: 05/27/2023] Open
Abstract
Natural Products (NP) are essential for the discovery of novel drugs and products for numerous biotechnological applications. The NP discovery process is expensive and time-consuming, having as major hurdles dereplication (early identification of known compounds) and structure elucidation, particularly the determination of the absolute configuration of metabolites with stereogenic centers. This review comprehensively focuses on recent technological and instrumental advances, highlighting the development of methods that alleviate these obstacles, paving the way for accelerating NP discovery towards biotechnological applications. Herein, we emphasize the most innovative high-throughput tools and methods for advancing bioactivity screening, NP chemical analysis, dereplication, metabolite profiling, metabolomics, genome sequencing and/or genomics approaches, databases, bioinformatics, chemoinformatics, and three-dimensional NP structure elucidation.
Collapse
Affiliation(s)
- Susana P Gaudêncio
- Associate Laboratory i4HB-Institute for Health and Bioeconomy, NOVA School of Science and Technology, NOVA University Lisbon, 2819-516 Caparica, Portugal
- UCIBIO-Applied Molecular Biosciences Unit, Chemistry Department, NOVA School of Science and Technology, NOVA University of Lisbon, 2819-516 Caparica, Portugal
| | - Engin Bayram
- Institute of Environmental Sciences, Room HKC-202, Hisar Campus, Bogazici University, Bebek, Istanbul 34342, Turkey
| | - Lada Lukić Bilela
- Department of Biology, Faculty of Science, University of Sarajevo, 71000 Sarajevo, Bosnia and Herzegovina
| | - Mercedes Cueto
- Instituto de Productos Naturales y Agrobiología-CSIC, 38206 La Laguna, Spain
| | - Ana R Díaz-Marrero
- Instituto de Productos Naturales y Agrobiología-CSIC, 38206 La Laguna, Spain
- Instituto Universitario de Bio-Orgánica (IUBO), Universidad de La Laguna, 38206 La Laguna, Spain
| | - Berat Z Haznedaroglu
- Institute of Environmental Sciences, Room HKC-202, Hisar Campus, Bogazici University, Bebek, Istanbul 34342, Turkey
| | - Carlos Jimenez
- CICA- Centro Interdisciplinar de Química e Bioloxía, Departamento de Química, Facultade de Ciencias, Universidade da Coruña, 15071 A Coruña, Spain
| | - Manolis Mandalakis
- Institute of Marine Biology, Biotechnology and Aquaculture, Hellenic Centre for Marine Research, HCMR Thalassocosmos, 71500 Gournes, Crete, Greece
| | - Florbela Pereira
- LAQV, REQUIMTE, Chemistry Department, NOVA School of Science and Technology, NOVA University of Lisbon, 2819-516 Caparica, Portugal
| | - Fernando Reyes
- Fundación MEDINA, Avda. del Conocimiento 34, 18016 Armilla, Spain
| | - Deniz Tasdemir
- GEOMAR Centre for Marine Biotechnology (GEOMAR-Biotech), Research Unit Marine Natural Products Chemistry, GEOMAR Helmholtz Centre for Ocean Research Kiel, Am Kiel-Kanal 44, 24106 Kiel, Germany
- Faculty of Mathematics and Natural Science, Kiel University, Christian-Albrechts-Platz 4, 24118 Kiel, Germany
| |
Collapse
|
13
|
Kou X, Shi P, Gao C, Ma P, Xing H, Ke Q, Zhang D. Data-Driven Elucidation of Flavor Chemistry. JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY 2023; 71:6789-6802. [PMID: 37102791 PMCID: PMC10176570 DOI: 10.1021/acs.jafc.3c00909] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/11/2023]
Abstract
Flavor molecules are commonly used in the food industry to enhance product quality and consumer experiences but are associated with potential human health risks, highlighting the need for safer alternatives. To address these health-associated challenges and promote reasonable application, several databases for flavor molecules have been constructed. However, no existing studies have comprehensively summarized these data resources according to quality, focused fields, and potential gaps. Here, we systematically summarized 25 flavor molecule databases published within the last 20 years and revealed that data inaccessibility, untimely updates, and nonstandard flavor descriptions are the main limitations of current studies. We examined the development of computational approaches (e.g., machine learning and molecular simulation) for the identification of novel flavor molecules and discussed their major challenges regarding throughput, model interpretability, and the lack of gold-standard data sets for equitable model evaluation. Additionally, we discussed future strategies for the mining and designing of novel flavor molecules based on multi-omics and artificial intelligence to provide a new foundation for flavor science research.
Collapse
Affiliation(s)
- Xingran Kou
- Collaborative Innovation Center of Fragrance Flavour and Cosmetics, School of Perfume and Aroma Technology, Shanghai Institute of Technology, Shanghai 201418, China
| | - Peiqin Shi
- Collaborative Innovation Center of Fragrance Flavour and Cosmetics, School of Perfume and Aroma Technology, Shanghai Institute of Technology, Shanghai 201418, China
| | - Chukun Gao
- Laboratory for Physical Chemistry, ETH Zürich, 8093 Zürich, Switzerland
| | - Peihua Ma
- Department of Nutrition and Food Science, University of Maryland, College Park, Maryland 20742, United States
| | - Huadong Xing
- CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Qinfei Ke
- Collaborative Innovation Center of Fragrance Flavour and Cosmetics, School of Perfume and Aroma Technology, Shanghai Institute of Technology, Shanghai 201418, China
| | - Dachuan Zhang
- National Centre of Competence in Research (NCCR) Catalysis, Institute of Environmental Engineering, ETH Zürich, 8093 Zürich, Switzerland
| |
Collapse
|
14
|
Louwen JJR, Kautsar SA, van der Burg S, Medema MH, van der Hooft JJJ. iPRESTO: Automated discovery of biosynthetic sub-clusters linked to specific natural product substructures. PLoS Comput Biol 2023; 19:e1010462. [PMID: 36758069 PMCID: PMC9946207 DOI: 10.1371/journal.pcbi.1010462] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2022] [Revised: 02/22/2023] [Accepted: 01/24/2023] [Indexed: 02/11/2023] Open
Abstract
Microbial specialised metabolism is full of valuable natural products that are applied clinically, agriculturally, and industrially. The genes that encode their biosynthesis are often physically clustered on the genome in biosynthetic gene clusters (BGCs). Many BGCs consist of multiple groups of co-evolving genes called sub-clusters that are responsible for the biosynthesis of a specific chemical moiety in a natural product. Sub-clusters therefore provide an important link between the structures of a natural product and its BGC, which can be leveraged for predicting natural product structures from sequence, as well as for linking chemical structures and metabolomics-derived mass features to BGCs. While some initial computational methodologies have been devised for sub-cluster detection, current approaches are not scalable, have only been run on small and outdated datasets, or produce an impractically large number of possible sub-clusters to mine through. Here, we constructed a scalable method for unsupervised sub-cluster detection, called iPRESTO, based on topic modelling and statistical analysis of co-occurrence patterns of enzyme-coding protein families. iPRESTO was used to mine sub-clusters across 150,000 prokaryotic BGCs from antiSMASH-DB. After annotating a fraction of the resulting sub-cluster families, we could predict a substructure for 16% of the antiSMASH-DB BGCs. Additionally, our method was able to confirm 83% of the experimentally characterised sub-clusters in MIBiG reference BGCs. Based on iPRESTO-detected sub-clusters, we could correctly identify the BGCs for xenorhabdin and salbostatin biosynthesis (which had not yet been annotated in BGC databases), as well as propose a candidate BGC for akashin biosynthesis. Additionally, we show for a collection of 145 actinobacteria how substructures can aid in linking BGCs to molecules by correlating iPRESTO-detected sub-clusters to MS/MS-derived Mass2Motifs substructure patterns. This work paves the way for deeper functional and structural annotation of microbial BGCs by improved linking of orphan molecules to their cognate gene clusters, thus facilitating accelerated natural product discovery.
Collapse
Affiliation(s)
- Joris J. R. Louwen
- Bioinformatics Group, Wageningen University, Wageningen, the Netherlands
| | - Satria A. Kautsar
- Bioinformatics Group, Wageningen University, Wageningen, the Netherlands
| | | | - Marnix H. Medema
- Bioinformatics Group, Wageningen University, Wageningen, the Netherlands
- * E-mail: (MHM); (JJJvdH)
| | - Justin J. J. van der Hooft
- Bioinformatics Group, Wageningen University, Wageningen, the Netherlands
- Department of Biochemistry, University of Johannesburg, Johannesburg, South Africa
- * E-mail: (MHM); (JJJvdH)
| |
Collapse
|
15
|
Xu Z, Park TJ, Cao H. Advances in mining and expressing microbial biosynthetic gene clusters. Crit Rev Microbiol 2023; 49:18-37. [PMID: 35166616 DOI: 10.1080/1040841x.2022.2036099] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
Natural products (NPs) especially the secondary metabolites originated from microbes exhibit great importance in biomedical, industrial and agricultural applications. However, mining biosynthetic gene clusters (BGCs) to produce novel NPs has been hindered owing that a large population of environmental microbes are unculturable. In the past decade, strategies to explore BGCs directly from (meta)genomes have been established along with the fast development of high-throughput sequencing technologies and the powerful bioinformatics data-processing tools, which greatly expedited the exploitations of novel BGCs from unculturable microbes including the extremophilic microbes. In this review, we firstly summarized the popular bioinformatics tools and databases available to mine novel BGCs from (meta)genomes based on either pure cultures or pristine environmental samples. Noticeably, approaches rooted from machine learning and deep learning with focuses on the prediction of ribosomally synthesized and post-translationally modified peptides (RiPPs) were dramatically increased in recent years. Moreover, synthetic biology techniques to express the novel BGCs in culturable native microbes or heterologous hosts were introduced. This working pipeline including the discovery and biosynthesis of novel NPs will greatly advance the exploitations of the abundant but unexplored microbial BGCs.
Collapse
Affiliation(s)
- Zeling Xu
- Guangdong Province Key Laboratory of Microbial Signals and Disease Control, Integrative Microbiology Research Center, South China Agricultural University, Guangzhou, China
| | - Tae-Jin Park
- HME Healthcare Co., Ltd, Suwon-si, Republic of Korea
| | - Huiluo Cao
- Department of Microbiology, The University of Hong Kong, Hong Kong, China
| |
Collapse
|
16
|
Louwen JJR, Medema MH, van der Hooft JJJ. Enhanced correlation-based linking of biosynthetic gene clusters to their metabolic products through chemical class matching. MICROBIOME 2023; 11:13. [PMID: 36691088 PMCID: PMC9869629 DOI: 10.1186/s40168-022-01444-3] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/17/2022] [Accepted: 12/07/2022] [Indexed: 06/01/2023]
Abstract
BACKGROUND It is well-known that the microbiome produces a myriad of specialised metabolites with diverse functions. To better characterise their structures and identify their producers in complex samples, integrative genome and metabolome mining is becoming increasingly popular. Metabologenomic co-occurrence-based correlation scoring methods facilitate the linking of metabolite mass fragmentation spectra (MS/MS) to their cognate biosynthetic gene clusters (BGCs) based on shared absence/presence patterns of metabolites and BGCs in paired omics datasets of multiple strains. Recently, these methods have been made more readily accessible through the NPLinker platform. However, co-occurrence-based approaches usually result in too many candidate links to manually validate. To address this issue, we introduce a generic feature-based correlation method that matches chemical compound classes between BGCs and MS/MS spectra. RESULTS To automatically reduce the long lists of potential BGC-MS/MS spectrum links, we match natural product (NP) ontologies previously independently developed for genomics and metabolomics and developed NPClassScore: an empirical class matching score that we also implemented in the NPLinker platform. By applying NPClassScore on three paired omics datasets totalling 189 bacterial strains, we show that the number of links is reduced by on average 63% as compared to using a co-occurrence-based strategy alone. We further demonstrate that 96% of experimentally validated links in these datasets are retained and prioritised when using NPClassScore. CONCLUSION The matching genome-metabolome class ontologies provide a starting point for selecting plausible candidates for BGCs and MS/MS spectra based on matching chemical compound class ontologies. NPClassScore expedites genome/metabolome data integration, as relevant BGC-metabolite links are prioritised, and researchers are faced with substantially fewer proposed BGC-MS/MS links to manually inspect. We anticipate that our addition to the NPLinker platform will aid integrative omics mining workflows in discovering novel NPs and understanding complex metabolic interactions in the microbiome. Video Abstract.
Collapse
Affiliation(s)
- Joris J. R. Louwen
- Bioinformatics Group, Wageningen University & Research, 6708 PB Wageningen, the Netherlands
| | - Marnix H. Medema
- Bioinformatics Group, Wageningen University & Research, 6708 PB Wageningen, the Netherlands
| | - Justin J. J. van der Hooft
- Bioinformatics Group, Wageningen University & Research, 6708 PB Wageningen, the Netherlands
- Department of Biochemistry, University of Johannesburg, Johannesburg, 2006 South Africa
| |
Collapse
|
17
|
Gomes PWP, de Tralia Medeiros TC, Maimone NM, Leão TF, de Moraes LAB, Bauermeister A. Microbial Metabolites Annotation by Mass Spectrometry-Based Metabolomics. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2023; 1439:225-248. [PMID: 37843811 DOI: 10.1007/978-3-031-41741-2_9] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/17/2023]
Abstract
Since the discovery of penicillin, microbial metabolites have been extensively investigated for drug discovery purposes. In the last decades, microbial derived compounds have gained increasing attention in different fields from pharmacognosy to industry and agriculture. Microbial metabolites in microbiomes present specific functions and can be associated with the maintenance of the natural ecosystems. These metabolites may exhibit a broad range of biological activities of great interest to human purposes. Samples from either microbial isolated cultures or microbiomes consist of complex mixtures of metabolites and their analysis are not a simple process. Mass spectrometry-based metabolomics encompass a set of analytical methods that have brought several improvements to the microbial natural products field. This analytical tool allows the comprehensively detection of metabolites, and therefore, the access of the chemical profile from those biological samples. These analyses generate thousands of mass spectra which is challenging to analyse. In this context, bioinformatic metabolomics tools have been successfully employed to accelerate and facilitate the investigation of specialized microbial metabolites. Herein, we describe metabolomics tools used to provide chemical information for the metabolites, and furthermore, we discuss how they can improve investigation of microbial cultures and interactions.
Collapse
Affiliation(s)
- Paulo Wender P Gomes
- Collaborative Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, USA
| | - Talita Carla de Tralia Medeiros
- Departamento de Química, Faculdade de Filosofia, Ciências e Letras de Ribeirão Preto, Universidade de São Paulo, Ribeirão Preto, São Paulo, Brazil
| | - Naydja Moralles Maimone
- Departamento de Ciências Exatas, Escola Superior de Agricultura 'Luiz de Queiroz', Universidade de São Paulo, Piracicaba, São Paulo, Brazil
| | - Tiago F Leão
- Centro de Energia Nuclear na Agricultura, Universidade de São Paulo, Piracicaba, São Paulo, Brazil
| | - Luiz Alberto Beraldo de Moraes
- Departamento de Química, Faculdade de Filosofia, Ciências e Letras de Ribeirão Preto, Universidade de São Paulo, Ribeirão Preto, São Paulo, Brazil
| | - Anelize Bauermeister
- Collaborative Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, USA.
- Instituto de Ciências Biomédicas, Universidade de São Paulo, São Paulo, Brazil.
| |
Collapse
|
18
|
Bhosle A, Wang Y, Franzosa EA, Huttenhower C. Progress and opportunities in microbial community metabolomics. Curr Opin Microbiol 2022; 70:102195. [PMID: 36063685 DOI: 10.1016/j.mib.2022.102195] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2022] [Revised: 07/20/2022] [Accepted: 07/21/2022] [Indexed: 01/25/2023]
Abstract
The metabolome lies at the interface of host-microbiome crosstalk. Previous work has established links between chemically diverse microbial metabolites and a myriad of host physiological processes and diseases. Coupled with scalable and cost-effective technologies, metabolomics is thus gaining popularity as a tool for characterization of microbial communities, particularly when combined with metagenomics as a window into microbiome function. A systematic interrogation of microbial community metabolomes can uncover key microbial compounds, metabolic capabilities of the microbiome, and also provide critical mechanistic insights into microbiome-linked host phenotypes. In this review, we discuss methods and accompanying resources that have been developed for these purposes. The accomplishments of these methods demonstrate that metabolomes can be used to functionally characterize microbial communities, and that microbial properties can be used to identify and investigate chemical compounds.
Collapse
Affiliation(s)
- Amrisha Bhosle
- Infectious Disease and Microbiome Program, Broad Institute of MIT and Harvard, Cambridge, MA, USA; Department of Biostatistics, Harvard T. H. Chan School of Public Health, Boston, MA, USA; Harvard Chan Microbiome in Public Health Center, Harvard T. H. Chan School of Public Health, Boston, MA, USA
| | - Ya Wang
- Department of Biostatistics, Harvard T. H. Chan School of Public Health, Boston, MA, USA; Harvard Chan Microbiome in Public Health Center, Harvard T. H. Chan School of Public Health, Boston, MA, USA
| | - Eric A Franzosa
- Infectious Disease and Microbiome Program, Broad Institute of MIT and Harvard, Cambridge, MA, USA; Department of Biostatistics, Harvard T. H. Chan School of Public Health, Boston, MA, USA; Harvard Chan Microbiome in Public Health Center, Harvard T. H. Chan School of Public Health, Boston, MA, USA
| | - Curtis Huttenhower
- Infectious Disease and Microbiome Program, Broad Institute of MIT and Harvard, Cambridge, MA, USA; Department of Biostatistics, Harvard T. H. Chan School of Public Health, Boston, MA, USA; Harvard Chan Microbiome in Public Health Center, Harvard T. H. Chan School of Public Health, Boston, MA, USA.
| |
Collapse
|
19
|
Leão TF, Wang M, da Silva R, Gurevich A, Bauermeister A, Gomes PWP, Brejnrod A, Glukhov E, Aron AT, Louwen JJR, Kim HW, Reher R, Fiore MF, van der Hooft JJJ, Gerwick L, Gerwick WH, Bandeira N, Dorrestein PC. NPOmix: A machine learning classifier to connect mass spectrometry fragmentation data to biosynthetic gene clusters. PNAS NEXUS 2022; 1:pgac257. [PMID: 36712343 PMCID: PMC9802219 DOI: 10.1093/pnasnexus/pgac257] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/11/2022] [Revised: 10/18/2022] [Accepted: 11/25/2022] [Indexed: 06/02/2023]
Abstract
Microbial specialized metabolites are an important source of and inspiration for many pharmaceuticals, biotechnological products and play key roles in ecological processes. Untargeted metabolomics using liquid chromatography coupled with tandem mass spectrometry is an efficient technique to access metabolites from fractions and even environmental crude extracts. Nevertheless, metabolomics is limited in predicting structures or bioactivities for cryptic metabolites. Efficiently linking the biosynthetic potential inferred from (meta)genomics to the specialized metabolome would accelerate drug discovery programs by allowing metabolomics to make use of genetic predictions. Here, we present a k-nearest neighbor classifier to systematically connect mass spectrometry fragmentation spectra to their corresponding biosynthetic gene clusters (independent of their chemical class). Our new pattern-based genome mining pipeline links biosynthetic genes to metabolites that they encode for, as detected via mass spectrometry from bacterial cultures or environmental microbiomes. Using paired datasets that include validated genes-mass spectral links from the Paired Omics Data Platform, we demonstrate this approach by automatically linking 18 previously known mass spectra (17 for which the biosynthesis gene clusters can be found at the MIBiG database plus palmyramide A) to their corresponding previously experimentally validated biosynthetic genes (e.g., via nuclear magnetic resonance or genetic engineering). We illustrated a computational example of how to use our Natural Products Mixed Omics (NPOmix) tool for siderophore mining that can be reproduced by the users. We conclude that NPOmix minimizes the need for culturing (it worked well on microbiomes) and facilitates specialized metabolite prioritization based on integrative omics mining.
Collapse
Affiliation(s)
- Tiago F Leão
- Collaborative Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA 92093, USA
- Center for Nuclear Energy in Agriculture, University of São Paulo, Piracicaba 13400-970, SP, Brazil
| | - Mingxun Wang
- Collaborative Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA 92093, USA
- Center for Computational Mass Spectrometry, University of California San Diego, La Jolla, CA 92093, USA
| | - Ricardo da Silva
- NPPNS, Physic and Chemistry Department, School of Pharmaceutical Sciences of Ribeirão Preto, University of São Paulo, Ribeirão Preto 14040-900, Brazil
| | - Alexey Gurevich
- Center for Algorithmic Biotechnology, St. Petersburg State University, St Petersburg 199004, Russia
| | - Anelize Bauermeister
- Collaborative Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA 92093, USA
| | - Paulo Wender P Gomes
- Collaborative Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA 92093, USA
| | - Asker Brejnrod
- Collaborative Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA 92093, USA
| | - Evgenia Glukhov
- Center for Marine Biotechnology and Biomedicine, Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA 92093, USA
| | - Allegra T Aron
- Collaborative Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA 92093, USA
- Department of Chemistry and Biochemistry, University of Denver, Denver, CO 80210, USA
| | - Joris J R Louwen
- Bioinformatics Group, Wageningen University, 6708 PB Wageningen, The Netherlands
| | - Hyun Woo Kim
- College of Pharmacy and Integrated Research Institute for Drug Development, Dongguk University, Gyeonggi-do 10326, Korea
| | - Raphael Reher
- Institute of Pharmaceutical Biology and Biotechnology, University of Marburg, 35043 Marburg, Germany
| | - Marli F Fiore
- Center for Nuclear Energy in Agriculture, University of São Paulo, Piracicaba 13400-970, SP, Brazil
| | - Justin J J van der Hooft
- Bioinformatics Group, Wageningen University, 6708 PB Wageningen, The Netherlands
- Department of Biochemistry, University of Johannesburg, Auckland Park, Johannesburg 2006, South Africa
| | - Lena Gerwick
- Center for Marine Biotechnology and Biomedicine, Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA 92093, USA
| | - William H Gerwick
- Collaborative Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA 92093, USA
- Center for Marine Biotechnology and Biomedicine, Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA 92093, USA
| | - Nuno Bandeira
- Collaborative Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA 92093, USA
- Center for Computational Mass Spectrometry, University of California San Diego, La Jolla, CA 92093, USA
| | - Pieter C Dorrestein
- Collaborative Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA 92093, USA
- Center for Microbiome Innovation, University of California San Diego, La Jolla, CA 92093, USA
- Departments of Pharmacology and Pediatrics, University of California San Diego, La Jolla, CA 92093, USA
| |
Collapse
|
20
|
Feeney MA, Newitt JT, Addington E, Algora-Gallardo L, Allan C, Balis L, Birke AS, Castaño-Espriu L, Charkoudian LK, Devine R, Gayrard D, Hamilton J, Hennrich O, Hoskisson PA, Keith-Baker M, Klein JG, Kruasuwan W, Mark DR, Mast Y, McHugh RE, McLean TC, Mohit E, Munnoch JT, Murray J, Noble K, Otani H, Parra J, Pereira CF, Perry L, Pintor-Escobar L, Pritchard L, Prudence SMM, Russell AH, Schniete JK, Seipke RF, Sélem-Mojica N, Undabarrena A, Vind K, van Wezel GP, Wilkinson B, Worsley SF, Duncan KR, Fernández-Martínez LT, Hutchings MI. ActinoBase: tools and protocols for researchers working on Streptomyces and other filamentous actinobacteria. Microb Genom 2022; 8. [PMID: 35775972 PMCID: PMC9455695 DOI: 10.1099/mgen.0.000824] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Actinobacteria is an ancient phylum of Gram-positive bacteria with a characteristic high GC content to their DNA. The ActinoBase Wiki is focused on the filamentous actinobacteria, such as Streptomyces species, and the techniques and growth conditions used to study them. These organisms are studied because of their complex developmental life cycles and diverse specialised metabolism which produces many of the antibiotics currently used in the clinic. ActinoBase is a community effort that provides valuable and freely accessible resources, including protocols and practical information about filamentous actinobacteria. It is aimed at enabling knowledge exchange between members of the international research community working with these fascinating bacteria. ActinoBase is an anchor platform that underpins worldwide efforts to understand the ecology, biology and metabolic potential of these organisms. There are two key differences that set ActinoBase apart from other Wiki-based platforms: [1] ActinoBase is specifically aimed at researchers working on filamentous actinobacteria and is tailored to help users overcome challenges working with these bacteria and [2] it provides a freely accessible resource with global networking opportunities for researchers with a broad range of experience in this field.
Collapse
Affiliation(s)
- Morgan Anne Feeney
- University of Strathclyde, Strathclyde Institute of Pharmacy and Biomedical Sciences, Glasgow, G4 0RE, UK
| | - Jake Terry Newitt
- Department of Molecular Microbiology, John Innes Centre, Norwich, NR4 7UH, UK
| | - Emily Addington
- University of Strathclyde, Strathclyde Institute of Pharmacy and Biomedical Sciences, Glasgow, G4 0RE, UK
| | - Lis Algora-Gallardo
- University of Strathclyde, Strathclyde Institute of Pharmacy and Biomedical Sciences, Glasgow, G4 0RE, UK
| | - Craig Allan
- Swansea University Institute of Life Science, College of Medicine, Swansea, Wales, UK
| | - Lucas Balis
- Department of Molecular Microbiology, John Innes Centre, Norwich, NR4 7UH, UK
| | - Anna S Birke
- University of Strathclyde, Strathclyde Institute of Pharmacy and Biomedical Sciences, Glasgow, G4 0RE, UK
| | - Laia Castaño-Espriu
- University of Strathclyde, Strathclyde Institute of Pharmacy and Biomedical Sciences, Glasgow, G4 0RE, UK
| | | | - Rebecca Devine
- Department of Molecular Microbiology, John Innes Centre, Norwich, NR4 7UH, UK
| | - Damien Gayrard
- Department of Molecular Microbiology, John Innes Centre, Norwich, NR4 7UH, UK
| | - Jacob Hamilton
- Department of Molecular Microbiology, John Innes Centre, Norwich, NR4 7UH, UK
| | - Oliver Hennrich
- Leibniz Institute DSMZ - German Collection of Microorganisms and Cell Cultures GmbH Inhoffenstraße 7B, 38124 Braunschweig, Germany
| | - Paul A Hoskisson
- University of Strathclyde, Strathclyde Institute of Pharmacy and Biomedical Sciences, Glasgow, G4 0RE, UK
| | - Molly Keith-Baker
- University of Strathclyde, Strathclyde Institute of Pharmacy and Biomedical Sciences, Glasgow, G4 0RE, UK
| | | | - Worarat Kruasuwan
- Division of Bioinformatics and Data Management for Research, Research Group and Research Network Division, Research Department, Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok, Thailand
| | - David R Mark
- University of Strathclyde, Strathclyde Institute of Pharmacy and Biomedical Sciences, Glasgow, G4 0RE, UK
| | - Yvonne Mast
- Leibniz Institute DSMZ - German Collection of Microorganisms and Cell Cultures GmbH Inhoffenstraße 7B, 38124 Braunschweig, Germany
| | - Rebecca E McHugh
- University of Strathclyde, Strathclyde Institute of Pharmacy and Biomedical Sciences, Glasgow, G4 0RE, UK
| | - Thomas C McLean
- Department of Molecular Microbiology, John Innes Centre, Norwich, NR4 7UH, UK
| | - Elmira Mohit
- University of Strathclyde, Strathclyde Institute of Pharmacy and Biomedical Sciences, Glasgow, G4 0RE, UK
| | - John T Munnoch
- University of Strathclyde, Strathclyde Institute of Pharmacy and Biomedical Sciences, Glasgow, G4 0RE, UK
| | - Jordan Murray
- Department of Physics, SUPA, University of Strathclyde, Glasgow, G4 0NG, UK
| | - Katie Noble
- Department of Molecular Microbiology, John Innes Centre, Norwich, NR4 7UH, UK
| | - Hiroshi Otani
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA.,Lawrence Berkeley National Laboratory, Environmental Genomics and Systems Biology Division, Berkeley, CA 94720, USA
| | - Jonathan Parra
- University of Strathclyde, Strathclyde Institute of Pharmacy and Biomedical Sciences, Glasgow, G4 0RE, UK
| | - Camila F Pereira
- Department of Insect Symbiosis, Max Planck Institute for Chemical Ecology, Hans-Knöll-Straße 8, 07745 Jena, Germany
| | - Louisa Perry
- Department of Molecular Microbiology, John Innes Centre, Norwich, NR4 7UH, UK
| | | | - Leighton Pritchard
- University of Strathclyde, Strathclyde Institute of Pharmacy and Biomedical Sciences, Glasgow, G4 0RE, UK
| | - Samuel M M Prudence
- School of Biological and Behavioral Sciences, Queen Mary University of London, Mile End Road, London, E1 4NS, UK
| | | | - Jana K Schniete
- Biology Department, Edge Hill University, St Helens Road, Ormskirk, L39 4QP, UK
| | - Ryan F Seipke
- Astbury Centre for Structural Molecular Biology, University of Leeds, Leeds, LS2 9JT, UK.,Faculty of Biological Sciences, University of Leeds, Leeds, LS2 9JT, UK
| | - Nelly Sélem-Mojica
- Universidad Nacional Autónoma de México, Centro de Ciencias Matemáticas, en Morelia, Michoacán, Mexico
| | - Agustina Undabarrena
- Departamento de Química & Centro de Biotecnología Daniel Alkalay Lowitt, Universidad Técnica Federico Santa María, Laboratorio de Microbiología Molecular y Biotecnología Ambiental, Valparaíso, 2340000, Chile
| | - Kristiina Vind
- Host-Microbe Interactomics Group, Wageningen University, 6708 WD Wageningen, The Netherlands
| | - Gilles P van Wezel
- Microbial Biotechnology, Institute of Biology, Leiden University, Rapenburg, The Netherlands
| | - Barrie Wilkinson
- Department of Molecular Microbiology, John Innes Centre, Norwich, NR4 7UH, UK
| | - Sarah F Worsley
- School of Biological Sciences, University of East Anglia, Norwich Research Park, Norwich, NR4 7TJ, UK
| | - Katherine R Duncan
- University of Strathclyde, Strathclyde Institute of Pharmacy and Biomedical Sciences, Glasgow, G4 0RE, UK
| | | | - Matthew I Hutchings
- Department of Molecular Microbiology, John Innes Centre, Norwich, NR4 7UH, UK
| |
Collapse
|
21
|
Caesar LK, Montaser R, Keller NP, Kelleher NL. Metabolomics and genomics in natural products research: complementary tools for targeting new chemical entities. Nat Prod Rep 2021; 38:2041-2065. [PMID: 34787623 PMCID: PMC8691422 DOI: 10.1039/d1np00036e] [Citation(s) in RCA: 46] [Impact Index Per Article: 15.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]
Abstract
Covering: 2010 to 2021Organisms in nature have evolved into proficient synthetic chemists, utilizing specialized enzymatic machinery to biosynthesize an inspiring diversity of secondary metabolites. Often serving to boost competitive advantage for their producers, these secondary metabolites have widespread human impacts as antibiotics, anti-inflammatories, and antifungal drugs. The natural products discovery field has begun a shift away from traditional activity-guided approaches and is beginning to take advantage of increasingly available metabolomics and genomics datasets to explore undiscovered chemical space. Major strides have been made and now enable -omics-informed prioritization of chemical structures for discovery, including the prospect of confidently linking metabolites to their biosynthetic pathways. Over the last decade, more integrated strategies now provide researchers with pipelines for simultaneous identification of expressed secondary metabolites and their biosynthetic machinery. However, continuous collaboration by the natural products community will be required to optimize strategies for effective evaluation of natural product biosynthetic gene clusters to accelerate discovery efforts. Here, we provide an evaluative guide to scientific literature as it relates to studying natural product biosynthesis using genomics, metabolomics, and their integrated datasets. Particular emphasis is placed on the unique insights that can be gained from large-scale integrated strategies, and we provide source organism-specific considerations to evaluate the gaps in our current knowledge.
Collapse
Affiliation(s)
- Lindsay K Caesar
- Department of Chemistry, Northwestern University, Evanston, IL, USA.
| | - Rana Montaser
- Department of Chemistry, Northwestern University, Evanston, IL, USA.
| | - Nancy P Keller
- Department of Medical Microbiology and Immunology and Bacteriology, University of Wisconsin-Madison, Madison, WI, USA
| | - Neil L Kelleher
- Department of Chemistry, Northwestern University, Evanston, IL, USA.
- Department of Molecular Biosciences, Northwestern University, Evanston, IL, USA
| |
Collapse
|
22
|
Jarmusch SA, van der Hooft JJJ, Dorrestein PC, Jarmusch AK. Advancements in capturing and mining mass spectrometry data are transforming natural products research. Nat Prod Rep 2021; 38:2066-2082. [PMID: 34612288 PMCID: PMC8667781 DOI: 10.1039/d1np00040c] [Citation(s) in RCA: 29] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Covering: 2016 up to 2021Mass spectrometry (MS) is an essential technology in natural products research with MS fragmentation (MS/MS) approaches becoming a key tool. Recent advancements in MS yield dense metabolomics datasets which have been, conventionally, used by individual labs for individual projects; however, a shift is brewing. The movement towards open MS data (and other structural characterization data) and accessible data mining tools is emerging in natural products research. Over the past 5 years, this movement has rapidly expanded and evolved with no slowdown in sight; the capabilities of today vastly exceed those of 5 years ago. Herein, we address the analysis of individual datasets, a situation we are calling the '2021 status quo', and the emergent framework to systematically capture sample information (metadata) and perform repository-scale analyses. We evaluate public data deposition, discuss the challenges of working in the repository scale, highlight the challenges of metadata capture and provide illustrative examples of the power of utilizing repository data and the tools that enable it. We conclude that the advancements in MS data collection must be met with advancements in how we utilize data; therefore, we argue that open data and data mining is the next evolution in obtaining the maximum potential in natural products research.
Collapse
Affiliation(s)
- Scott A Jarmusch
- Department of Biotechnology and Biomedicine, Technical University of Denmark, Søltofts Plads 221, DK-2800 Kongens Lyngby, Denmark.
| | | | - Pieter C Dorrestein
- Collaborative Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, La Jolla, CA, 92093-0751, USA
| | - Alan K Jarmusch
- Collaborative Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, La Jolla, CA, 92093-0751, USA
- Immunity, Inflammation, and Disease Laboratory, Division of Intramural Research, National Institute of Environmental Health Sciences, National Institutes of Health, Research Triangle Park, NC 27709, USA
| |
Collapse
|
23
|
Beniddir MA, Kang KB, Genta-Jouve G, Huber F, Rogers S, van der Hooft JJJ. Advances in decomposing complex metabolite mixtures using substructure- and network-based computational metabolomics approaches. Nat Prod Rep 2021; 38:1967-1993. [PMID: 34821250 PMCID: PMC8597898 DOI: 10.1039/d1np00023c] [Citation(s) in RCA: 67] [Impact Index Per Article: 22.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2021] [Indexed: 12/13/2022]
Abstract
Covering: up to the end of 2020Recently introduced computational metabolome mining tools have started to positively impact the chemical and biological interpretation of untargeted metabolomics analyses. We believe that these current advances make it possible to start decomposing complex metabolite mixtures into substructure and chemical class information, thereby supporting pivotal tasks in metabolomics analysis including metabolite annotation, the comparison of metabolic profiles, and network analyses. In this review, we highlight and explain key tools and emerging strategies covering 2015 up to the end of 2020. The majority of these tools aim at processing and analyzing liquid chromatography coupled to mass spectrometry fragmentation data. We start with defining what substructures are, how they relate to molecular fingerprints, and how recognizing them helps to decompose complex mixtures. We continue with chemical classes that are based on the presence or absence of particular molecular scaffolds and/or functional groups and are thus intrinsically related to substructures. We discuss novel tools to mine substructures, annotate chemical compound classes, and create mass spectral networks from metabolomics data and demonstrate them using two case studies. We also review and speculate about the opportunities that NMR spectroscopy-based metabolome mining of complex metabolite mixtures offers to discover substructures and chemical classes. Finally, we will describe the main benefits and limitations of the current tools and strategies that rely on them, and our vision on how this exciting field can develop toward repository-scale-sized metabolomics analyses. Complementary sources of structural information from genomics analyses and well-curated taxonomic records are also discussed. Many research fields such as natural products discovery, pharmacokinetic and drug metabolism studies, and environmental metabolomics increasingly rely on untargeted metabolomics to gain biochemical and biological insights. The here described technical advances will benefit all those metabolomics disciplines by transforming spectral data into knowledge that can answer biological questions.
Collapse
Affiliation(s)
- Mehdi A Beniddir
- Université Paris-Saclay, CNRS, BioCIS, 5 rue J.-B Clément, 92290 Châtenay-Malabry, France
| | - Kyo Bin Kang
- Research Institute of Pharmaceutical Sciences, College of Pharmacy, Sookmyung Women's University, Seoul 04310, Republic of Korea
| | - Grégory Genta-Jouve
- Laboratoire de Chimie-Toxicologie Analytique et Cellulaire (C-TAC), UMR CNRS 8038, CiTCoM, Université de Paris, 4, Avenue de l'Observatoire, 75006, Paris, France
- Laboratoire Ecologie, Evolution, Interactions des Systèmes Amazoniens (LEEISA), USR 3456, Université De Guyane, CNRS Guyane, 275 Route de Montabo, 97334 Cayenne, French Guiana, France
| | - Florian Huber
- Netherlands eScience Center, 1098 XG Amsterdam, The Netherlands
| | - Simon Rogers
- School of Computing Science, University of Glasgow, Glasgow G12 8QQ, UK
| | | |
Collapse
|
24
|
Undabarrena A, Pereira CF, Kruasuwan W, Parra J, Sélem-Mojica N, Vind K, Schniete JK. Integrating perspectives in actinomycete research: an ActinoBase review of 2020-21. MICROBIOLOGY (READING, ENGLAND) 2021; 167:001084. [PMID: 34515628 PMCID: PMC8549240 DOI: 10.1099/mic.0.001084] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/10/2021] [Accepted: 07/26/2021] [Indexed: 12/12/2022]
Abstract
Last year ActinoBase, a Wiki-style initiative supported by the UK Microbiology Society, published a review highlighting the research of particular interest to the actinomycete community. Here, we present the second ActinoBase review showcasing selected reports published in 2020 and early 2021, integrating perspectives in the actinomycete field. Actinomycetes are well-known for their unsurpassed ability to produce specialised metabolites, of which many are used as therapeutic agents with antibacterial, antifungal, or immunosuppressive activities. Much research is carried out to understand the purpose of these metabolites in the environment, either within communities or in host interactions. Moreover, many efforts have been placed in developing computational tools to handle big data, simplify experimental design, and find new biosynthetic gene cluster prioritisation strategies. Alongside, synthetic biology has provided advances in tools to elucidate the biosynthesis of these metabolites. Additionally, there are still mysteries to be uncovered in understanding the fundamentals of filamentous actinomycetes' developmental cycle and regulation of their metabolism. This review focuses on research using integrative methodologies and approaches to understand the bigger picture of actinomycete biology, covering four research areas: i) technology and methodology; ii) specialised metabolites; iii) development and regulation; and iv) ecology and host interactions.
Collapse
Affiliation(s)
- Agustina Undabarrena
- Laboratorio de Microbiología Molecular y Biotecnología Ambiental, Departamento de Química & Centro de Biotecnología Daniel Alkalay Lowitt, Universidad Técnica Federico Santa María, Valparaíso 2340000, Chile
| | - Camila F Pereira
- School of Pharmaceutical Sciences of Ribeirão Preto, University of São Paulo, Ribeirão Preto, Brazil
| | - Worarat Kruasuwan
- Microbial Cell Factory Research Team, National Center for Genetic Engineering and Biotechnology, National Science and Technology Development Agency, 113 Thailand Science Park, Khlong Nueng, Khlong Luang, Pathum Thani 12120, Thailand
| | - Jonathan Parra
- Strathclyde Institute of Pharmacy and Biomedical Sciences, University of Strathclyde, Glasgow, G4 0RE, UK
| | - Nelly Sélem-Mojica
- Centro de Ciencias Matemáticas, Antigua Carretera a Pátzcuaro # 8701, Col. Ex Hacienda San José de la Huerta, Morelia C.P. 58089, Michoacán, México
| | - Kristiina Vind
- NAICONS Srl, Viale Ortles 22/4, 20139 Milan (MI), Italy
- Host-Microbe Interactomics Group, Wageningen University, De Elst 1 6708 WD, Wageningen, Netherlands
| | - Jana K. Schniete
- Biology, Edge Hill University, St Helens Road, Ormskirk, L39 4QP, UK
| |
Collapse
|
25
|
Medema MH, de Rond T, Moore BS. Mining genomes to illuminate the specialized chemistry of life. Nat Rev Genet 2021; 22:553-571. [PMID: 34083778 PMCID: PMC8364890 DOI: 10.1038/s41576-021-00363-7] [Citation(s) in RCA: 105] [Impact Index Per Article: 35.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/09/2021] [Indexed: 02/07/2023]
Abstract
All organisms produce specialized organic molecules, ranging from small volatile chemicals to large gene-encoded peptides, that have evolved to provide them with diverse cellular and ecological functions. As natural products, they are broadly applied in medicine, agriculture and nutrition. The rapid accumulation of genomic information has revealed that the metabolic capacity of virtually all organisms is vastly underappreciated. Pioneered mainly in bacteria and fungi, genome mining technologies are accelerating metabolite discovery. Recent efforts are now being expanded to all life forms, including protists, plants and animals, and new integrative omics technologies are enabling the increasingly effective mining of this molecular diversity.
Collapse
Affiliation(s)
- Marnix H Medema
- Bioinformatics Group, Wageningen University, Wageningen, The Netherlands
| | - Tristan de Rond
- Center for Marine Biotechnology and Biomedicine, Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA, USA
| | - Bradley S Moore
- Center for Marine Biotechnology and Biomedicine, Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA, USA.
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, USA.
| |
Collapse
|
26
|
Comprehensive Large-Scale Integrative Analysis of Omics Data To Accelerate Specialized Metabolite Discovery. mSystems 2021; 6:e0072621. [PMID: 34427506 PMCID: PMC8407348 DOI: 10.1128/msystems.00726-21] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Microbial specialized metabolites are key mediators in host-microbiome interactions. Most of the chemical space produced by the microbiome currently remains unexplored and uncharacterized. This situation calls for new and improved methods to exploit the growing publicly available genomic and metabolomic data sets and connect the outcomes to structural and functional knowledge inferred from transcriptomics and proteomics experiments. Here, we first describe currently available approaches that support the comprehensive mining of metabolomics and genomics data. Next, we provide our vision on how to move forward toward the automated linking of omics data of specialized metabolites to their structures, biosynthesis pathways, producers, and functions.
Collapse
|