1
|
Orsi M, Reymond JL. Can large language models predict antimicrobial peptide activity and toxicity? RSC Med Chem 2024; 15:2030-2036. [PMID: 38911166 PMCID: PMC11187562 DOI: 10.1039/d4md00159a] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2024] [Accepted: 04/19/2024] [Indexed: 06/25/2024] Open
Abstract
Antimicrobial peptides (AMPs) are naturally occurring or designed peptides up to a few tens of amino acids which may help address the antimicrobial resistance crisis. However, their clinical development is limited by toxicity to human cells, a parameter which is very difficult to control. Given the similarity between peptide sequences and words, large language models (LLMs) might be able to predict AMP activity and toxicity. To test this hypothesis, we fine-tuned LLMs using data from the Database of Antimicrobial Activity and Structure of Peptides (DBAASP). GPT-3 performed well but not reproducibly for activity prediction and hemolysis, taken as a proxy for toxicity. The later GPT-3.5 performed more poorly and was surpassed by recurrent neural networks (RNN) trained on sequence-activity data or support vector machines (SVM) trained on MAP4C molecular fingerprint-activity data. These simpler models are therefore recommended, although the rapid evolution of LLMs warrants future re-evaluation of their prediction abilities.
Collapse
Affiliation(s)
- Markus Orsi
- Department of Chemistry, Biochemistry and Pharmaceutical Sciences, University of Bern Freiestrasse 3 3012 Bern Switzerland
| | - Jean-Louis Reymond
- Department of Chemistry, Biochemistry and Pharmaceutical Sciences, University of Bern Freiestrasse 3 3012 Bern Switzerland
| |
Collapse
|
2
|
Mei S. A Multi-Label Learning Framework for Predicting Chemical Classes and Biological Activities of Natural Products from Biosynthetic Gene Clusters. J Chem Ecol 2023; 49:681-695. [PMID: 37779180 DOI: 10.1007/s10886-023-01452-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2023] [Revised: 08/28/2023] [Accepted: 09/13/2023] [Indexed: 10/03/2023]
Abstract
Natural products (NP) or secondary metabolites, as a class of small chemical molecules that are naturally synthesized by chromosomally clustered biosynthesis genes (also called biosynthetic gene clusters, BGCs) encoded enzymes or enzyme complexes, mediates the bioecological interactions between host and microbiota and provides a natural reservoir for screening drug-like therapeutic pharmaceuticals. In this work, we propose a multi-label learning framework to functionally annotate natural products or secondary metabolites solely from their catalytical biosynthetic gene clusters without experimentally conducting NP structural resolutions. All chemical classes and bioactivities constitute the label space, and the sequence domains of biosynthetic gene clusters that catalyse the biosynthesis of natural products constitute the feature space. In this multi-label learning framework, a joint representation of features (BGCs domains) and labels (natural products annotations) is efficiently learnt in an integral and low-dimensional space to accurately define the inter-class boundaries and scale to the learning problem of many imbalanced labels. Computational results on experimental data show that the proposed framework achieves satisfactory multi-label learning performance, and the learnt patterns of BGCs domains are transferrable across bacteria, or even across kingdom, for instance, from bacteria to Arabidopsis thaliana. Lastly, take Arabidopsis thaliana and its rhizosphere microbiome for example, we propose a pipeline combining existing BGCs identification tools and this proposed framework to find and functionally annotate novel natural products for downstream bioecological studies in terms of plant-microbiota-soil interactions and plant environmental adaption.
Collapse
Affiliation(s)
- Suyu Mei
- Software College, Shenyang Normal University, Shenyang, 110034, China.
| |
Collapse
|
3
|
Mullowney MW, Duncan KR, Elsayed SS, Garg N, van der Hooft JJJ, Martin NI, Meijer D, Terlouw BR, Biermann F, Blin K, Durairaj J, Gorostiola González M, Helfrich EJN, Huber F, Leopold-Messer S, Rajan K, de Rond T, van Santen JA, Sorokina M, Balunas MJ, Beniddir MA, van Bergeijk DA, Carroll LM, Clark CM, Clevert DA, Dejong CA, Du C, Ferrinho S, Grisoni F, Hofstetter A, Jespers W, Kalinina OV, Kautsar SA, Kim H, Leao TF, Masschelein J, Rees ER, Reher R, Reker D, Schwaller P, Segler M, Skinnider MA, Walker AS, Willighagen EL, Zdrazil B, Ziemert N, Goss RJM, Guyomard P, Volkamer A, Gerwick WH, Kim HU, Müller R, van Wezel GP, van Westen GJP, Hirsch AKH, Linington RG, Robinson SL, Medema MH. Artificial intelligence for natural product drug discovery. Nat Rev Drug Discov 2023; 22:895-916. [PMID: 37697042 DOI: 10.1038/s41573-023-00774-7] [Citation(s) in RCA: 54] [Impact Index Per Article: 54.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/20/2023] [Indexed: 09/13/2023]
Abstract
Developments in computational omics technologies have provided new means to access the hidden diversity of natural products, unearthing new potential for drug discovery. In parallel, artificial intelligence approaches such as machine learning have led to exciting developments in the computational drug design field, facilitating biological activity prediction and de novo drug design for molecular targets of interest. Here, we describe current and future synergies between these developments to effectively identify drug candidates from the plethora of molecules produced by nature. We also discuss how to address key challenges in realizing the potential of these synergies, such as the need for high-quality datasets to train deep learning algorithms and appropriate strategies for algorithm validation.
Collapse
Affiliation(s)
| | - Katherine R Duncan
- Strathclyde Institute of Pharmacy and Biomedical Sciences, University of Strathclyde, Glasgow, UK
| | - Somayah S Elsayed
- Department of Molecular Biotechnology, Institute of Biology, Leiden University, Leiden, The Netherlands
| | - Neha Garg
- School of Chemistry and Biochemistry, Center for Microbial Dynamics and Infection, Georgia Institute of Technology, Atlanta, GA, USA
| | - Justin J J van der Hooft
- Bioinformatics Group, Wageningen University, Wageningen, The Netherlands
- Department of Biochemistry, University of Johannesburg, Johannesburg, South Africa
| | - Nathaniel I Martin
- Biological Chemistry Group, Institute of Biology, Leiden University, Leiden, The Netherlands
| | - David Meijer
- Bioinformatics Group, Wageningen University, Wageningen, The Netherlands
| | - Barbara R Terlouw
- Bioinformatics Group, Wageningen University, Wageningen, The Netherlands
| | - Friederike Biermann
- Bioinformatics Group, Wageningen University, Wageningen, The Netherlands
- Institute of Molecular Bio Science, Goethe-University Frankfurt, Frankfurt am Main, Germany
- LOEWE Center for Translational Biodiversity Genomics (TBG), Frankfurt am Main, Germany
| | - Kai Blin
- The Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kongens Lyngby, Denmark
| | | | - Marina Gorostiola González
- Drug Discovery and Safety, Leiden Academic Centre for Drug Research, Leiden, The Netherlands
- ONCODE institute, Leiden, The Netherlands
| | - Eric J N Helfrich
- Institute of Molecular Bio Science, Goethe-University Frankfurt, Frankfurt am Main, Germany
- LOEWE Center for Translational Biodiversity Genomics (TBG), Frankfurt am Main, Germany
| | - Florian Huber
- Center for Digitalization and Digitality, Hochschule Düsseldorf, Düsseldorf, Germany
| | - Stefan Leopold-Messer
- Institut für Mikrobiologie, Eidgenössische Technische Hochschule (ETH) Zürich, Zürich, Switzerland
| | - Kohulan Rajan
- Institute for Inorganic and Analytical Chemistry, Friedrich-Schiller-University Jena, Jena, Germany
| | - Tristan de Rond
- School of Chemical Sciences, University of Auckland, Auckland, New Zealand
| | - Jeffrey A van Santen
- Department of Chemistry, Simon Fraser University, Burnaby, British Columbia, Canada
| | - Maria Sorokina
- Institute for Inorganic and Analytical Chemistry, Friedrich-Schiller University, Jena, Germany
- Pharmaceuticals R&D, Bayer AG, Berlin, Germany
| | - Marcy J Balunas
- Department of Microbiology and Immunology, University of Michigan, Ann Arbor, MI, USA
- Department of Medicinal Chemistry, University of Michigan, Ann Arbor, MI, USA
| | - Mehdi A Beniddir
- Équipe "Chimie des Substances Naturelles", Université Paris-Saclay, CNRS, BioCIS, Orsay, France
| | - Doris A van Bergeijk
- Department of Molecular Biotechnology, Institute of Biology, Leiden University, Leiden, The Netherlands
| | - Laura M Carroll
- Structural and Computational Biology Unit, EMBL, Heidelberg, Germany
| | - Chase M Clark
- Division of Pharmaceutical Sciences, School of Pharmacy, University of Wisconsin-Madison, Madison, WI, USA
| | | | | | - Chao Du
- Department of Molecular Biotechnology, Institute of Biology, Leiden University, Leiden, The Netherlands
| | | | - Francesca Grisoni
- Institute for Complex Molecular Systems, Department of Biomedical Engineering, Eindhoven University of Technology, Eindhoven, The Netherlands
- Centre for Living Technologies, Alliance TU/e, WUR, UU, UMC Utrecht, Utrecht, The Netherlands
| | | | - Willem Jespers
- Drug Discovery and Safety, Leiden Academic Centre for Drug Research, Leiden, The Netherlands
| | - Olga V Kalinina
- Helmholtz Institute for Pharmaceutical Research Saarland (HIPS), Helmholtz Centre for Infection Research (HZI), Saarbrücken, Germany
- Drug Bioinformatics, Medical Faculty, Saarland University, Homburg, Germany
- Center for Bioinformatics, Saarland University, Saarbrücken, Germany
| | | | - Hyunwoo Kim
- College of Pharmacy and Integrated Research Institute for Drug Development, Dongguk University Seoul, Goyang-si, Republic of Korea
| | - Tiago F Leao
- Center for Nuclear Energy in Agriculture, University of São Paulo, Piracicaba, Brazil
| | - Joleen Masschelein
- Center for Microbiology, VIB-KU Leuven, Heverlee, Belgium
- Department of Biology, KU Leuven, Heverlee, Belgium
| | - Evan R Rees
- Division of Pharmaceutical Sciences, School of Pharmacy, University of Wisconsin-Madison, Madison, WI, USA
| | - Raphael Reher
- Institute of Pharmaceutical Biology and Biotechnology, University of Marburg, Marburg, Germany
- Institute of Pharmacy, Martin-Luther-University Halle-Wittenberg, Halle (Saale), Germany
| | - Daniel Reker
- Department of Biomedical Engineering, Duke University, Durham, NC, USA
- Duke Microbiome Center, Duke University, Durham, NC, USA
| | - Philippe Schwaller
- Laboratory of Artificial Chemical Intelligence, Institut des Sciences et Ingénierie Chimiques, Ecole Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
| | | | - Michael A Skinnider
- Adapsyn Bioscience, Hamilton, Ontario, Canada
- Michael Smith Laboratories, University of British Columbia, Vancouver, British Columbia, Canada
| | - Allison S Walker
- Department of Chemistry, Vanderbilt University, Nashville, TN, USA
- Department of Biological Sciences, Vanderbilt University, Nashville, TN, USA
| | - Egon L Willighagen
- Department of Bioinformatics - BiGCaT, NUTRIM, Maastricht University, Maastricht, The Netherlands
| | - Barbara Zdrazil
- European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Cambridgeshire, UK
| | - Nadine Ziemert
- Interfaculty Institute for Microbiology and Infection Medicine Tuebingen (IMIT), Institute for Bioinformatics and Medical Informatics (IBMI), University of Tuebingen, Tuebingen, Germany
| | | | - Pierre Guyomard
- Bonsai team, CRIStAL - Centre de Recherche en Informatique Signal et Automatique de Lille, Université de Lille, Villeneuve d'Ascq Cedex, France
| | - Andrea Volkamer
- Center for Bioinformatics, Saarland University, Saarbrücken, Germany
- In silico Toxicology and Structural Bioinformatics, Institute of Physiology, Charité - Universitätsmedizin Berlin, Berlin, Germany
| | - William H Gerwick
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA, USA
| | - Hyun Uk Kim
- Department of Chemical and Biomolecular Engineering, Korea Advanced Institute of Science and Technology (KAIST), Daejeon, Korea
| | - Rolf Müller
- Helmholtz Institute for Pharmaceutical Research Saarland (HIPS), Helmholtz Centre for Infection Research (HZI), Saarbrücken, Germany
- Department of Pharmacy, Saarland University, Saarbrücken, Germany
- German Center for infection research (DZIF), Braunschweig, Germany
- Helmholtz International Lab for Anti-Infectives, Saarbrücken, Germany
| | - Gilles P van Wezel
- Department of Molecular Biotechnology, Institute of Biology, Leiden University, Leiden, The Netherlands
- Netherlands Institute of Ecology, NIOO-KNAW, Wageningen, The Netherlands
| | - Gerard J P van Westen
- Drug Discovery and Safety, Leiden Academic Centre for Drug Research, Leiden, The Netherlands.
| | - Anna K H Hirsch
- Helmholtz Institute for Pharmaceutical Research Saarland (HIPS), Helmholtz Centre for Infection Research (HZI), Saarbrücken, Germany.
- Department of Pharmacy, Saarland University, Saarbrücken, Germany.
- German Center for infection research (DZIF), Braunschweig, Germany.
- Helmholtz International Lab for Anti-Infectives, Saarbrücken, Germany.
| | - Roger G Linington
- Department of Chemistry, Simon Fraser University, Burnaby, British Columbia, Canada.
| | - Serina L Robinson
- Department of Environmental Microbiology, Eawag: Swiss Federal Institute for Aquatic Science and Technology, Dübendorf, Switzerland.
| | - Marnix H Medema
- Bioinformatics Group, Wageningen University, Wageningen, The Netherlands.
- Institute of Biology, Leiden University, Leiden, The Netherlands.
| |
Collapse
|
4
|
Akone S, Hug JJ, Kaur A, Garcia R, Müller R. Structure Elucidation and Biosynthesis of Nannosterols A and B, Myxobacterial Sterols from Nannocystis sp. MNa10993. JOURNAL OF NATURAL PRODUCTS 2023; 86:915-923. [PMID: 37011180 PMCID: PMC10152446 DOI: 10.1021/acs.jnatprod.2c01143] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/16/2022] [Indexed: 05/04/2023]
Abstract
Myxobacteria represent an underinvestigated source of chemically diverse and biologically active secondary metabolites. Here, we report the discovery, isolation, structure elucidation, and biological evaluation of two new bacterial sterols, termed nannosterols A and B (1, 2), from the terrestrial myxobacterium Nannocystis sp. (MNa10993). Nannosterols feature a cholestanol core with numerous modifications including a secondary alcohol at position C-15, a terminal vicinal diol side chain at C-24-C-25 (1, 2), and a hydroxy group at the angular methyl group at C-18 (2), which is unprecedented for bacterial sterols. Another rare chemical feature of bacterial triterpenoids is a ketone group at position C-7, which is also displayed by 1 and 2. The combined exploration based on myxobacterial high-resolution secondary metabolome data and genomic in silico investigations exposed the nannosterols as frequently produced sterols within the myxobacterial suborder of Nannocystineae. The discovery of the nannosterols provides insights into the biosynthesis of these new myxobacterial sterols, with implications in understanding the evolution of sterol production by prokaryotes.
Collapse
Affiliation(s)
- Sergi
H. Akone
- Helmholtz-Institute
for Pharmaceutical Research Saarland (HIPS), Helmholtz Centre for
Infection Research (HZI), Department of Microbial Natural Products, Saarland University, Campus E8 1, 66123 Saarbrücken, Germany
- Department
of Pharmacy, Saarland University, Campus E8 1, 66123 Saarbrücken, Germany
- German
Center for Infection Research (DZIF), Partner Site Hannover-Braunschweig, 38124 Braunschweig, Germany
- Helmholtz
International Laboratories, Department of Microbial Natural Products, Saarland University, Campus E8 1, 66123 Saarbrücken, Germany
- Department
of Chemistry, Faculty of Science, University
of Douala, P.O. Box 24157, Douala, Cameroon
| | - Joachim J. Hug
- Helmholtz-Institute
for Pharmaceutical Research Saarland (HIPS), Helmholtz Centre for
Infection Research (HZI), Department of Microbial Natural Products, Saarland University, Campus E8 1, 66123 Saarbrücken, Germany
- Department
of Pharmacy, Saarland University, Campus E8 1, 66123 Saarbrücken, Germany
- German
Center for Infection Research (DZIF), Partner Site Hannover-Braunschweig, 38124 Braunschweig, Germany
- Helmholtz
International Laboratories, Department of Microbial Natural Products, Saarland University, Campus E8 1, 66123 Saarbrücken, Germany
| | - Amninder Kaur
- Helmholtz-Institute
for Pharmaceutical Research Saarland (HIPS), Helmholtz Centre for
Infection Research (HZI), Department of Microbial Natural Products, Saarland University, Campus E8 1, 66123 Saarbrücken, Germany
- Department
of Pharmacy, Saarland University, Campus E8 1, 66123 Saarbrücken, Germany
- German
Center for Infection Research (DZIF), Partner Site Hannover-Braunschweig, 38124 Braunschweig, Germany
- Helmholtz
International Laboratories, Department of Microbial Natural Products, Saarland University, Campus E8 1, 66123 Saarbrücken, Germany
| | - Ronald Garcia
- Helmholtz-Institute
for Pharmaceutical Research Saarland (HIPS), Helmholtz Centre for
Infection Research (HZI), Department of Microbial Natural Products, Saarland University, Campus E8 1, 66123 Saarbrücken, Germany
- Department
of Pharmacy, Saarland University, Campus E8 1, 66123 Saarbrücken, Germany
- German
Center for Infection Research (DZIF), Partner Site Hannover-Braunschweig, 38124 Braunschweig, Germany
- Helmholtz
International Laboratories, Department of Microbial Natural Products, Saarland University, Campus E8 1, 66123 Saarbrücken, Germany
| | - Rolf Müller
- Helmholtz-Institute
for Pharmaceutical Research Saarland (HIPS), Helmholtz Centre for
Infection Research (HZI), Department of Microbial Natural Products, Saarland University, Campus E8 1, 66123 Saarbrücken, Germany
- Department
of Pharmacy, Saarland University, Campus E8 1, 66123 Saarbrücken, Germany
- German
Center for Infection Research (DZIF), Partner Site Hannover-Braunschweig, 38124 Braunschweig, Germany
- Helmholtz
International Laboratories, Department of Microbial Natural Products, Saarland University, Campus E8 1, 66123 Saarbrücken, Germany
| |
Collapse
|
5
|
Morehouse NJ, Clark TN, McMann EJ, van Santen JA, Haeckl FPJ, Gray CA, Linington RG. Annotation of natural product compound families using molecular networking topology and structural similarity fingerprinting. Nat Commun 2023; 14:308. [PMID: 36658161 PMCID: PMC9852437 DOI: 10.1038/s41467-022-35734-z] [Citation(s) in RCA: 21] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2021] [Accepted: 12/20/2022] [Indexed: 01/20/2023] Open
Abstract
Spectral matching of MS2 fragmentation spectra has become a popular method for characterizing natural products libraries but identification remains challenging due to differences in MS2 fragmentation properties between instruments and the low coverage of current spectral reference libraries. To address this bottleneck we present Structural similarity Network Annotation Platform for Mass Spectrometry (SNAP-MS) which matches chemical similarity grouping in the Natural Products Atlas to grouping of mass spectrometry features from molecular networking. This approach assigns compound families to molecular networking subnetworks without the need for experimental or calculated reference spectra. We demonstrate SNAP-MS can accurately annotate subnetworks built from both reference spectra and an in-house microbial extract library, and correctly predict compound families from published molecular networks acquired on a range of MS instrumentation. Compound family annotations for the microbial extract library are validated by co-injection of standards or isolation and spectroscopic analysis. SNAP-MS is freely available at www.npatlas.org/discover/snapms .
Collapse
Affiliation(s)
- Nicholas J Morehouse
- Department of Biological Sciences, University of New Brunswick, Saint John, NB, Canada
| | - Trevor N Clark
- Department of Chemistry, Simon Fraser University, Burnaby, BC, Canada
| | - Emily J McMann
- Department of Chemistry, Simon Fraser University, Burnaby, BC, Canada
| | | | - F P Jake Haeckl
- Department of Chemistry, Simon Fraser University, Burnaby, BC, Canada
| | - Christopher A Gray
- Department of Biological Sciences, University of New Brunswick, Saint John, NB, Canada.,Department of Chemistry, University of New Brunswick, Fredericton, NB, Canada
| | - Roger G Linington
- Department of Chemistry, Simon Fraser University, Burnaby, BC, Canada.
| |
Collapse
|
6
|
de Medeiros LS, de Araújo Júnior MB, Peres EG, da Silva JCI, Bassicheto MC, Di Gioia G, Veiga TAM, Koolen HHF. Discovering New Natural Products Using Metabolomics-Based Approaches. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2023; 1439:185-224. [PMID: 37843810 DOI: 10.1007/978-3-031-41741-2_8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/17/2023]
Abstract
The incessant search for new natural molecules with biological activities has forced researchers in the field of chemistry of natural products to seek different approaches for their prospection studies. In particular, researchers around the world are turning to approaches in metabolomics to avoid high rates of re-isolation of certain compounds, something recurrent in this branch of science. Thanks to the development of new technologies in the analytical instrumentation of spectroscopic and spectrometric techniques, as well as the advance in the computational processing modes of the results, metabolomics has been gaining more and more space in studies that involve the prospection of natural products. Thus, this chapter summarizes the precepts and good practices in the metabolomics of microbial natural products using mass spectrometry and nuclear magnetic resonance spectroscopy, and also summarizes several examples where this approach has been applied in the discovery of bioactive molecules.
Collapse
Affiliation(s)
- Lívia Soman de Medeiros
- Grupo de Pesquisas LaBiORG - Laboratório de Química Bio-orgânica Otto Richard Gottlieb, Universidade Federal de São Paulo, Diadema, Brazil.
| | - Moysés B de Araújo Júnior
- Grupo de Pesquisa em Metabolômica e Espectrometria de Massas, Universidade do Estado do Amazonas, Manaus, Brazil
| | - Eldrinei G Peres
- Grupo de Pesquisa em Metabolômica e Espectrometria de Massas, Universidade do Estado do Amazonas, Manaus, Brazil
| | | | - Milena Costa Bassicheto
- Grupo de Pesquisas LaBiORG - Laboratório de Química Bio-orgânica Otto Richard Gottlieb, Universidade Federal de São Paulo, Diadema, Brazil
| | - Giordanno Di Gioia
- Grupo de Pesquisas LaBiORG - Laboratório de Química Bio-orgânica Otto Richard Gottlieb, Universidade Federal de São Paulo, Diadema, Brazil
| | - Thiago André Moura Veiga
- Grupo de Pesquisas LaBiORG - Laboratório de Química Bio-orgânica Otto Richard Gottlieb, Universidade Federal de São Paulo, Diadema, Brazil
| | | |
Collapse
|
7
|
Ruchawapol C, Fu WW, Xu HX. A review on computational approaches that support the researches on traditional Chinese medicines (TCM) against COVID-19. PHYTOMEDICINE : INTERNATIONAL JOURNAL OF PHYTOTHERAPY AND PHYTOPHARMACOLOGY 2022; 104:154324. [PMID: 35841663 PMCID: PMC9259013 DOI: 10.1016/j.phymed.2022.154324] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/21/2022] [Revised: 06/23/2022] [Accepted: 07/05/2022] [Indexed: 06/15/2023]
Abstract
BACKGROUND COVID-19 highly caused contagious infections and massive deaths worldwide as well as unprecedentedly disrupting global economies and societies, and the urgent development of new antiviral medications are required. Medicinal herbs are promising resources for the discovery of prophylactic candidate against COVID-19. Considerable amounts of experimental efforts have been made on vaccines and direct-acting antiviral agents (DAAs), but neither of them was fast and fully developed. PURPOSE This study examined the computational approaches that have played a significant role in drug discovery and development against COVID-19, and these computational methods and tools will be helpful for the discovery of lead compounds from phytochemicals and understanding the molecular mechanism of action of TCM in the prevention and control of the other diseases. METHODS A search conducting in scientific databases (PubMed, Science Direct, ResearchGate, Google Scholar, and Web of Science) found a total of 2172 articles, which were retrieved via web interface of the following websites. After applying some inclusion and exclusion criteria and full-text screening, only 292 articles were collected as eligible articles. RESULTS In this review, we highlight three main categories of computational approaches including structure-based, knowledge-mining (artificial intelligence) and network-based approaches. The most commonly used database, molecular docking tool, and MD simulation software include TCMSP, AutoDock Vina, and GROMACS, respectively. Network-based approaches were mainly provided to help readers understanding the complex mechanisms of multiple TCM ingredients, targets, diseases, and networks. CONCLUSION Computational approaches have been broadly applied to the research of phytochemicals and TCM against COVID-19, and played a significant role in drug discovery and development in terms of the financial and time saving.
Collapse
Affiliation(s)
- Chattarin Ruchawapol
- School of Pharmacy, Shanghai University of Traditional Chinese Medicine, Cai Lun Lu 1200, Shanghai 201203, China; Engineering Research Centre of Shanghai Colleges for TCM New Drug Discovery, Cai Lun Lu 1200, Shanghai 201203, China
| | - Wen-Wei Fu
- School of Pharmacy, Shanghai University of Traditional Chinese Medicine, Cai Lun Lu 1200, Shanghai 201203, China; Engineering Research Centre of Shanghai Colleges for TCM New Drug Discovery, Cai Lun Lu 1200, Shanghai 201203, China.
| | - Hong-Xi Xu
- School of Pharmacy, Shanghai University of Traditional Chinese Medicine, Cai Lun Lu 1200, Shanghai 201203, China; Engineering Research Centre of Shanghai Colleges for TCM New Drug Discovery, Cai Lun Lu 1200, Shanghai 201203, China.
| |
Collapse
|
8
|
Saldívar-González FI, Medina-Franco JL. Approaches for enhancing the analysis of chemical space for drug discovery. Expert Opin Drug Discov 2022; 17:789-798. [PMID: 35640229 DOI: 10.1080/17460441.2022.2084608] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
INTRODUCTION Chemical space is a powerful, general, and practical conceptual framework in drug discovery and other areas in chemistry that addresses the diversity of molecules and it has various applications. Moreover, chemical space is a cornerstone of chemoinformatics as a scientific discipline. In response to the increase in the set of chemical compounds in databases, generators of chemical structures, and tools to calculate molecular descriptors, novel approaches to generate visual representations of chemical space in low dimensions are emerging and evolving. Such approaches include a wide range of commercial and free applications, software, and open-source methods. AREAS COVERED The current state of chemical space in drug design and discovery is reviewed. The topics discussed herein include advances for efficient navigation in chemical space, the use of this concept in assessing the diversity of different data sets, exploring structure-property/activity relationships for one or multiple endpoints, and compound library design. Recent advances in methodologies for generating visual representations of chemical space have been highlighted, thereby emphasizing open-source methods. EXPERT OPINION Quantitative and qualitative generation and analysis of chemical space require novel approaches for handling the increasing number of molecules and their information available in chemical databases (including emerging ultra-large libraries). In addition, it is of utmost importance to note that chemical space is a conceptual framework that goes beyond visual representation in low dimensions. However, the graphical representation of chemical space has several practical applications in drug discovery and beyond.
Collapse
Affiliation(s)
- Fernanda I Saldívar-González
- DIFACQUIM Research Group, Department of Pharmacy, School of Chemistry, Universidad Nacional Autónoma de México, Avenida Universidad 3000, Mexico City 04510, Mexico
| | - José L Medina-Franco
- DIFACQUIM Research Group, Department of Pharmacy, School of Chemistry, Universidad Nacional Autónoma de México, Avenida Universidad 3000, Mexico City 04510, Mexico
| |
Collapse
|
9
|
Saldívar-González FI, Aldas-Bulos VD, Medina-Franco JL, Plisson F. Natural product drug discovery in the artificial intelligence era. Chem Sci 2022; 13:1526-1546. [PMID: 35282622 PMCID: PMC8827052 DOI: 10.1039/d1sc04471k] [Citation(s) in RCA: 50] [Impact Index Per Article: 25.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2021] [Accepted: 12/10/2021] [Indexed: 12/19/2022] Open
Abstract
Natural products (NPs) are primarily recognized as privileged structures to interact with protein drug targets. Their unique characteristics and structural diversity continue to marvel scientists for developing NP-inspired medicines, even though the pharmaceutical industry has largely given up. High-performance computer hardware, extensive storage, accessible software and affordable online education have democratized the use of artificial intelligence (AI) in many sectors and research areas. The last decades have introduced natural language processing and machine learning algorithms, two subfields of AI, to tackle NP drug discovery challenges and open up opportunities. In this article, we review and discuss the rational applications of AI approaches developed to assist in discovering bioactive NPs and capturing the molecular "patterns" of these privileged structures for combinatorial design or target selectivity.
Collapse
Affiliation(s)
- F I Saldívar-González
- DIFACQUIM Research Group, School of Chemistry, Department of Pharmacy, Universidad Nacional Autónoma de México Avenida Universidad 3000 04510 Mexico Mexico
| | - V D Aldas-Bulos
- Unidad de Genómica Avanzada, Laboratorio Nacional de Genómica para la Biodiversidad (Langebio), Centro de Investigación y de Estudios Avanzados del IPN Irapuato Guanajuato Mexico
| | - J L Medina-Franco
- DIFACQUIM Research Group, School of Chemistry, Department of Pharmacy, Universidad Nacional Autónoma de México Avenida Universidad 3000 04510 Mexico Mexico
| | - F Plisson
- CONACYT - Unidad de Genómica Avanzada, Laboratorio Nacional de Genómica para la Biodiversidad (Langebio), Centro de Investigación y de Estudios Avanzados del IPN Irapuato Guanajuato Mexico
| |
Collapse
|
10
|
Kim HW, Wang M, Leber CA, Nothias LF, Reher R, Kang KB, van der Hooft JJJ, Dorrestein PC, Gerwick WH, Cottrell GW. NPClassifier: A Deep Neural Network-Based Structural Classification Tool for Natural Products. JOURNAL OF NATURAL PRODUCTS 2021; 84:2795-2807. [PMID: 34662515 PMCID: PMC8631337 DOI: 10.1021/acs.jnatprod.1c00399] [Citation(s) in RCA: 143] [Impact Index Per Article: 47.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/23/2021] [Indexed: 05/04/2023]
Abstract
Computational approaches such as genome and metabolome mining are becoming essential to natural products (NPs) research. Consequently, a need exists for an automated structure-type classification system to handle the massive amounts of data appearing for NP structures. An ideal semantic ontology for the classification of NPs should go beyond the simple presence/absence of chemical substructures, but also include the taxonomy of the producing organism, the nature of the biosynthetic pathway, and/or their biological properties. Thus, a holistic and automatic NP classification framework could have considerable value to comprehensively navigate the relatedness of NPs, and especially so when analyzing large numbers of NPs. Here, we introduce NPClassifier, a deep-learning tool for the automated structural classification of NPs from their counted Morgan fingerprints. NPClassifier is expected to accelerate and enhance NP discovery by linking NP structures to their underlying properties.
Collapse
Affiliation(s)
- Hyun Woo Kim
- Center
for Marine Biotechnology and Biomedicine, Scripps Institution of Oceanography, University of California San Diego, La Jolla, California 92093, United States
| | - Mingxun Wang
- Skaggs
School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, California 92093, United States
- Ometa
Laboratories LLC, San Diego, California 92121, United States
| | - Christopher A. Leber
- Center
for Marine Biotechnology and Biomedicine, Scripps Institution of Oceanography, University of California San Diego, La Jolla, California 92093, United States
| | - Louis-Félix Nothias
- Skaggs
School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, California 92093, United States
| | - Raphael Reher
- Center
for Marine Biotechnology and Biomedicine, Scripps Institution of Oceanography, University of California San Diego, La Jolla, California 92093, United States
- Institute
of Pharmacy Martin-Luther-University Halle-Wittenberg, Universitätsplatz 10, 06108 Halle (Saale), Germany
| | - Kyo Bin Kang
- Research
Institute of Pharmaceutical Sciences, College of Pharmacy, Sookmyung Women’s University, Seoul 04310, Korea
| | | | - Pieter C. Dorrestein
- Skaggs
School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, California 92093, United States
| | - William H. Gerwick
- Center
for Marine Biotechnology and Biomedicine, Scripps Institution of Oceanography, University of California San Diego, La Jolla, California 92093, United States
- Skaggs
School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, California 92093, United States
| | - Garrison W. Cottrell
- Department
of Computer Science and Engineering, University
of California, San Diego, La Jolla, California 92093, United States
| |
Collapse
|
11
|
Capecchi A, Reymond JL. Classifying natural products from plants, fungi or bacteria using the COCONUT database and machine learning. J Cheminform 2021; 13:82. [PMID: 34663470 PMCID: PMC8524952 DOI: 10.1186/s13321-021-00559-3] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2021] [Accepted: 10/02/2021] [Indexed: 01/13/2023] Open
Abstract
Natural products (NPs) represent one of the most important resources for discovering new drugs. Here we asked whether NP origin can be assigned from their molecular structure in a subset of 60,171 NPs in the recently reported Collection of Open Natural Products (COCONUT) database assigned to plants, fungi, or bacteria. Visualizing this subset in an interactive tree-map (TMAP) calculated using MAP4 (MinHashed atom pair fingerprint) clustered NPs according to their assigned origin ( https://tm.gdb.tools/map4/coconut_tmap/ ), and a support vector machine (SVM) trained with MAP4 correctly assigned the origin for 94% of plant, 89% of fungal, and 89% of bacterial NPs in this subset. An online tool based on an SVM trained with the entire subset correctly assigned the origin of further NPs with similar performance ( https://np-svm-map4.gdb.tools/ ). Origin information might be useful when searching for biosynthetic genes of NPs isolated from plants but produced by endophytic microorganisms.
Collapse
Affiliation(s)
- Alice Capecchi
- 1 Department of Chemistry, Biochemistry and Pharmaceutical Sciences, University of Bern, Freiestrasse 3, 3012, Bern, Switzerland
| | - Jean-Louis Reymond
- 1 Department of Chemistry, Biochemistry and Pharmaceutical Sciences, University of Bern, Freiestrasse 3, 3012, Bern, Switzerland.
| |
Collapse
|
12
|
Daley SK, Cordell GA. Alkaloids in Contemporary Drug Discovery to Meet Global Disease Needs. Molecules 2021; 26:molecules26133800. [PMID: 34206470 PMCID: PMC8270272 DOI: 10.3390/molecules26133800] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2021] [Revised: 06/05/2021] [Accepted: 06/14/2021] [Indexed: 12/15/2022] Open
Abstract
An overview is presented of the well-established role of alkaloids in drug discovery, the application of more sustainable chemicals, and biological approaches, and the implementation of information systems to address the current challenges faced in meeting global disease needs. The necessity for a new international paradigm for natural product discovery and development for the treatment of multidrug resistant organisms, and rare and neglected tropical diseases in the era of the Fourth Industrial Revolution and the Quintuple Helix is discussed.
Collapse
Affiliation(s)
| | - Geoffrey A. Cordell
- Natural Products Inc., Evanston, IL 60202, USA;
- Department of Pharmaceutics, College of Pharmacy, University of Florida, Gainesville, FL 32610, USA
- Correspondence:
| |
Collapse
|
13
|
Medina-Franco JL, Sánchez-Cruz N, López-López E, Díaz-Eufracio BI. Progress on open chemoinformatic tools for expanding and exploring the chemical space. J Comput Aided Mol Des 2021; 36:341-354. [PMID: 34143323 PMCID: PMC8211976 DOI: 10.1007/s10822-021-00399-1] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2021] [Accepted: 06/14/2021] [Indexed: 01/10/2023]
Abstract
The concept of chemical space is a cornerstone in chemoinformatics, and it has broad conceptual and practical applicability in many areas of chemistry, including drug design and discovery. One of the most considerable impacts is in the study of structure-property relationships where the property can be a biological activity or any other characteristic of interest to a particular chemistry discipline. The chemical space is highly dependent on the molecular representation that is also a cornerstone concept in computational chemistry. Herein, we discuss the recent progress on chemoinformatic tools developed to expand and characterize the chemical space of compound data sets using different types of molecular representations, generate visual representations of such spaces, and explore structure-property relationships in the context of chemical spaces. We emphasize the development of methods and freely available tools focusing on drug discovery applications. We also comment on the general advantages and shortcomings of using freely available and easy-to-use tools and discuss the value of using such open resources for research, education, and scientific dissemination.
Collapse
Affiliation(s)
- José L Medina-Franco
- DIFACQUIM Research Group, Department of Pharmacy, School of Chemistry, Universidad Nacional Autónoma de México, 04510, Mexico City, Mexico.
| | - Norberto Sánchez-Cruz
- DIFACQUIM Research Group, Department of Pharmacy, School of Chemistry, Universidad Nacional Autónoma de México, 04510, Mexico City, Mexico
| | - Edgar López-López
- DIFACQUIM Research Group, Department of Pharmacy, School of Chemistry, Universidad Nacional Autónoma de México, 04510, Mexico City, Mexico.,Departamento de Química y Programa de Posgrado en Farmacología, Centro de Investigación y de Estudios Avanzados del Instituto Politécnico Nacional, Apartado 14-740, 07000, Mexico City, Mexico
| | - Bárbara I Díaz-Eufracio
- DIFACQUIM Research Group, Department of Pharmacy, School of Chemistry, Universidad Nacional Autónoma de México, 04510, Mexico City, Mexico
| |
Collapse
|
14
|
Daley SK, Cordell GA. Natural Products, the Fourth Industrial Revolution, and the Quintuple Helix. Nat Prod Commun 2021. [DOI: 10.1177/1934578x211003029] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023] Open
Abstract
The profound interconnectedness of the sciences and technologies embodied in the Fourth Industrial Revolution is discussed in terms of the global role of natural products, and how that interplays with the development of sustainable and climate-conscious practices of cyberecoethnopharmacolomics within the Quintuple Helix for the promotion of a healthier planet and society.
Collapse
Affiliation(s)
| | - Geoffrey A. Cordell
- Natural Products Inc., Evanston, IL, USA
- Department of Pharmaceutics, College of Pharmacy, University of Florida, Gainesville, FL, USA
| |
Collapse
|
15
|
|