1
|
Zhao L, Xue Q, Zhang H, Hao Y, Yi H, Liu X, Pan W, Fu J, Zhang A. CatNet: Sequence-based deep learning with cross-attention mechanism for identifying endocrine-disrupting chemicals. JOURNAL OF HAZARDOUS MATERIALS 2024; 465:133055. [PMID: 38016311 DOI: 10.1016/j.jhazmat.2023.133055] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/19/2023] [Revised: 11/02/2023] [Accepted: 11/20/2023] [Indexed: 11/30/2023]
Abstract
Endocrine-disrupting chemicals (EDCs) pose significant environmental and health risks due to their potential to interfere with nuclear receptors (NRs), key regulators of physiological processes. Despite the evident risks, the majority of existing research narrows its focus on the interaction between compounds and the individual NR target, neglecting a comprehensive assessment across the entire NR family. In response, this study assembled a comprehensive human NR dataset, capturing 49,244 interactions between 35,467 unique compounds and 42 NRs. We introduced a cross-attention network framework, "CatNet", innovatively integrating compound and protein representations through cross-attention mechanisms. The results showed that CatNet model achieved excellent performance with an area under the receiver operating characteristic curve (AUCROC) = 0.916 on the test set, and exhibited reliable generalization on unseen compound-NR pairs. A distinguishing feature of our research is its capacity to expand to novel targets. Beyond its predictive accuracy, CatNet offers a valuable mechanistic perspective on compound-NR interactions through feature visualization. Augmenting the utility of our research, we have also developed a graphical user interface, empowering researchers to predict chemical binding to diverse NRs. Our model enables the prediction of human NR-related EDCs and shows the potential to identify EDCs related to other targets.
Collapse
Affiliation(s)
- Lu Zhao
- State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, PR China; Sino-Danish College, University of Chinese Academy of Sciences, Beijing 100049, PR China
| | - Qiao Xue
- State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, PR China.
| | - Huazhou Zhang
- State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, PR China; Sino-Danish College, University of Chinese Academy of Sciences, Beijing 100049, PR China
| | - Yuxing Hao
- State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, PR China; Sino-Danish College, University of Chinese Academy of Sciences, Beijing 100049, PR China
| | - Hang Yi
- State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, PR China; College of Resources and Environment, University of Chinese Academy of Sciences, Beijing 100190, PR China
| | - Xian Liu
- State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, PR China
| | - Wenxiao Pan
- State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, PR China
| | - Jianjie Fu
- State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, PR China; College of Resources and Environment, University of Chinese Academy of Sciences, Beijing 100190, PR China; School of Environment, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou 310012, PR China
| | - Aiqian Zhang
- State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, PR China; Sino-Danish College, University of Chinese Academy of Sciences, Beijing 100049, PR China; College of Resources and Environment, University of Chinese Academy of Sciences, Beijing 100190, PR China; School of Environment, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou 310012, PR China.
| |
Collapse
|
2
|
Mullowney MW, Duncan KR, Elsayed SS, Garg N, van der Hooft JJJ, Martin NI, Meijer D, Terlouw BR, Biermann F, Blin K, Durairaj J, Gorostiola González M, Helfrich EJN, Huber F, Leopold-Messer S, Rajan K, de Rond T, van Santen JA, Sorokina M, Balunas MJ, Beniddir MA, van Bergeijk DA, Carroll LM, Clark CM, Clevert DA, Dejong CA, Du C, Ferrinho S, Grisoni F, Hofstetter A, Jespers W, Kalinina OV, Kautsar SA, Kim H, Leao TF, Masschelein J, Rees ER, Reher R, Reker D, Schwaller P, Segler M, Skinnider MA, Walker AS, Willighagen EL, Zdrazil B, Ziemert N, Goss RJM, Guyomard P, Volkamer A, Gerwick WH, Kim HU, Müller R, van Wezel GP, van Westen GJP, Hirsch AKH, Linington RG, Robinson SL, Medema MH. Artificial intelligence for natural product drug discovery. Nat Rev Drug Discov 2023; 22:895-916. [PMID: 37697042 DOI: 10.1038/s41573-023-00774-7] [Citation(s) in RCA: 33] [Impact Index Per Article: 33.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/20/2023] [Indexed: 09/13/2023]
Abstract
Developments in computational omics technologies have provided new means to access the hidden diversity of natural products, unearthing new potential for drug discovery. In parallel, artificial intelligence approaches such as machine learning have led to exciting developments in the computational drug design field, facilitating biological activity prediction and de novo drug design for molecular targets of interest. Here, we describe current and future synergies between these developments to effectively identify drug candidates from the plethora of molecules produced by nature. We also discuss how to address key challenges in realizing the potential of these synergies, such as the need for high-quality datasets to train deep learning algorithms and appropriate strategies for algorithm validation.
Collapse
Affiliation(s)
| | - Katherine R Duncan
- Strathclyde Institute of Pharmacy and Biomedical Sciences, University of Strathclyde, Glasgow, UK
| | - Somayah S Elsayed
- Department of Molecular Biotechnology, Institute of Biology, Leiden University, Leiden, The Netherlands
| | - Neha Garg
- School of Chemistry and Biochemistry, Center for Microbial Dynamics and Infection, Georgia Institute of Technology, Atlanta, GA, USA
| | - Justin J J van der Hooft
- Bioinformatics Group, Wageningen University, Wageningen, The Netherlands
- Department of Biochemistry, University of Johannesburg, Johannesburg, South Africa
| | - Nathaniel I Martin
- Biological Chemistry Group, Institute of Biology, Leiden University, Leiden, The Netherlands
| | - David Meijer
- Bioinformatics Group, Wageningen University, Wageningen, The Netherlands
| | - Barbara R Terlouw
- Bioinformatics Group, Wageningen University, Wageningen, The Netherlands
| | - Friederike Biermann
- Bioinformatics Group, Wageningen University, Wageningen, The Netherlands
- Institute of Molecular Bio Science, Goethe-University Frankfurt, Frankfurt am Main, Germany
- LOEWE Center for Translational Biodiversity Genomics (TBG), Frankfurt am Main, Germany
| | - Kai Blin
- The Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kongens Lyngby, Denmark
| | | | - Marina Gorostiola González
- Drug Discovery and Safety, Leiden Academic Centre for Drug Research, Leiden, The Netherlands
- ONCODE institute, Leiden, The Netherlands
| | - Eric J N Helfrich
- Institute of Molecular Bio Science, Goethe-University Frankfurt, Frankfurt am Main, Germany
- LOEWE Center for Translational Biodiversity Genomics (TBG), Frankfurt am Main, Germany
| | - Florian Huber
- Center for Digitalization and Digitality, Hochschule Düsseldorf, Düsseldorf, Germany
| | - Stefan Leopold-Messer
- Institut für Mikrobiologie, Eidgenössische Technische Hochschule (ETH) Zürich, Zürich, Switzerland
| | - Kohulan Rajan
- Institute for Inorganic and Analytical Chemistry, Friedrich-Schiller-University Jena, Jena, Germany
| | - Tristan de Rond
- School of Chemical Sciences, University of Auckland, Auckland, New Zealand
| | - Jeffrey A van Santen
- Department of Chemistry, Simon Fraser University, Burnaby, British Columbia, Canada
| | - Maria Sorokina
- Institute for Inorganic and Analytical Chemistry, Friedrich-Schiller University, Jena, Germany
- Pharmaceuticals R&D, Bayer AG, Berlin, Germany
| | - Marcy J Balunas
- Department of Microbiology and Immunology, University of Michigan, Ann Arbor, MI, USA
- Department of Medicinal Chemistry, University of Michigan, Ann Arbor, MI, USA
| | - Mehdi A Beniddir
- Équipe "Chimie des Substances Naturelles", Université Paris-Saclay, CNRS, BioCIS, Orsay, France
| | - Doris A van Bergeijk
- Department of Molecular Biotechnology, Institute of Biology, Leiden University, Leiden, The Netherlands
| | - Laura M Carroll
- Structural and Computational Biology Unit, EMBL, Heidelberg, Germany
| | - Chase M Clark
- Division of Pharmaceutical Sciences, School of Pharmacy, University of Wisconsin-Madison, Madison, WI, USA
| | | | | | - Chao Du
- Department of Molecular Biotechnology, Institute of Biology, Leiden University, Leiden, The Netherlands
| | | | - Francesca Grisoni
- Institute for Complex Molecular Systems, Department of Biomedical Engineering, Eindhoven University of Technology, Eindhoven, The Netherlands
- Centre for Living Technologies, Alliance TU/e, WUR, UU, UMC Utrecht, Utrecht, The Netherlands
| | | | - Willem Jespers
- Drug Discovery and Safety, Leiden Academic Centre for Drug Research, Leiden, The Netherlands
| | - Olga V Kalinina
- Helmholtz Institute for Pharmaceutical Research Saarland (HIPS), Helmholtz Centre for Infection Research (HZI), Saarbrücken, Germany
- Drug Bioinformatics, Medical Faculty, Saarland University, Homburg, Germany
- Center for Bioinformatics, Saarland University, Saarbrücken, Germany
| | | | - Hyunwoo Kim
- College of Pharmacy and Integrated Research Institute for Drug Development, Dongguk University Seoul, Goyang-si, Republic of Korea
| | - Tiago F Leao
- Center for Nuclear Energy in Agriculture, University of São Paulo, Piracicaba, Brazil
| | - Joleen Masschelein
- Center for Microbiology, VIB-KU Leuven, Heverlee, Belgium
- Department of Biology, KU Leuven, Heverlee, Belgium
| | - Evan R Rees
- Division of Pharmaceutical Sciences, School of Pharmacy, University of Wisconsin-Madison, Madison, WI, USA
| | - Raphael Reher
- Institute of Pharmaceutical Biology and Biotechnology, University of Marburg, Marburg, Germany
- Institute of Pharmacy, Martin-Luther-University Halle-Wittenberg, Halle (Saale), Germany
| | - Daniel Reker
- Department of Biomedical Engineering, Duke University, Durham, NC, USA
- Duke Microbiome Center, Duke University, Durham, NC, USA
| | - Philippe Schwaller
- Laboratory of Artificial Chemical Intelligence, Institut des Sciences et Ingénierie Chimiques, Ecole Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
| | | | - Michael A Skinnider
- Adapsyn Bioscience, Hamilton, Ontario, Canada
- Michael Smith Laboratories, University of British Columbia, Vancouver, British Columbia, Canada
| | - Allison S Walker
- Department of Chemistry, Vanderbilt University, Nashville, TN, USA
- Department of Biological Sciences, Vanderbilt University, Nashville, TN, USA
| | - Egon L Willighagen
- Department of Bioinformatics - BiGCaT, NUTRIM, Maastricht University, Maastricht, The Netherlands
| | - Barbara Zdrazil
- European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Cambridgeshire, UK
| | - Nadine Ziemert
- Interfaculty Institute for Microbiology and Infection Medicine Tuebingen (IMIT), Institute for Bioinformatics and Medical Informatics (IBMI), University of Tuebingen, Tuebingen, Germany
| | | | - Pierre Guyomard
- Bonsai team, CRIStAL - Centre de Recherche en Informatique Signal et Automatique de Lille, Université de Lille, Villeneuve d'Ascq Cedex, France
| | - Andrea Volkamer
- Center for Bioinformatics, Saarland University, Saarbrücken, Germany
- In silico Toxicology and Structural Bioinformatics, Institute of Physiology, Charité - Universitätsmedizin Berlin, Berlin, Germany
| | - William H Gerwick
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA, USA
| | - Hyun Uk Kim
- Department of Chemical and Biomolecular Engineering, Korea Advanced Institute of Science and Technology (KAIST), Daejeon, Korea
| | - Rolf Müller
- Helmholtz Institute for Pharmaceutical Research Saarland (HIPS), Helmholtz Centre for Infection Research (HZI), Saarbrücken, Germany
- Department of Pharmacy, Saarland University, Saarbrücken, Germany
- German Center for infection research (DZIF), Braunschweig, Germany
- Helmholtz International Lab for Anti-Infectives, Saarbrücken, Germany
| | - Gilles P van Wezel
- Department of Molecular Biotechnology, Institute of Biology, Leiden University, Leiden, The Netherlands
- Netherlands Institute of Ecology, NIOO-KNAW, Wageningen, The Netherlands
| | - Gerard J P van Westen
- Drug Discovery and Safety, Leiden Academic Centre for Drug Research, Leiden, The Netherlands.
| | - Anna K H Hirsch
- Helmholtz Institute for Pharmaceutical Research Saarland (HIPS), Helmholtz Centre for Infection Research (HZI), Saarbrücken, Germany.
- Department of Pharmacy, Saarland University, Saarbrücken, Germany.
- German Center for infection research (DZIF), Braunschweig, Germany.
- Helmholtz International Lab for Anti-Infectives, Saarbrücken, Germany.
| | - Roger G Linington
- Department of Chemistry, Simon Fraser University, Burnaby, British Columbia, Canada.
| | - Serina L Robinson
- Department of Environmental Microbiology, Eawag: Swiss Federal Institute for Aquatic Science and Technology, Dübendorf, Switzerland.
| | - Marnix H Medema
- Bioinformatics Group, Wageningen University, Wageningen, The Netherlands.
- Institute of Biology, Leiden University, Leiden, The Netherlands.
| |
Collapse
|
3
|
TCMSID: a simplified integrated database for drug discovery from traditional chinese medicine. J Cheminform 2022; 14:89. [PMID: 36587232 PMCID: PMC9805110 DOI: 10.1186/s13321-022-00670-z] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2022] [Accepted: 12/14/2022] [Indexed: 01/01/2023] Open
Abstract
Traditional Chinese Medicine (TCM) has been widely used in the treatment of various diseases for millennia. In the modernization process of TCM, TCM ingredient databases are playing more and more important roles. However, most of the existing TCM ingredient databases do not provide simplification function for extracting key ingredients in each herb or formula, which hinders the research on the mechanism of actions of the ingredients in TCM databases. The lack of quality control and standardization of the data in most of these existing databases is also a prominent disadvantage. Therefore, we developed a Traditional Chinese Medicine Simplified Integrated Database (TCMSID) with high storage, high quality and standardization. The database includes 499 herbs registered in the Chinese pharmacopeia with 20,015 ingredients, 3270 targets as well as corresponding detailed information. TCMSID is not only a database of herbal ingredients, but also a TCM simplification platform. Key ingredients from TCM herbs are available to be screened out and regarded as representatives to explore the mechanism of TCM herbs by implementing multi-tool target prediction and multilevel network construction. TCMSID provides abundant data sources and analysis platforms for TCM simplification and drug discovery, which is expected to promote modernization and internationalization of TCM and enhance its international status in the future. TCMSID is freely available at https://tcm.scbdd.com .
Collapse
|
4
|
Erlina L, Paramita RI, Kusuma WA, Fadilah F, Tedjo A, Pratomo IP, Ramadhanti NS, Nasution AK, Surado FK, Fitriawan A, Istiadi KA, Yanuar A. Virtual screening of Indonesian herbal compounds as COVID-19 supportive therapy: machine learning and pharmacophore modeling approaches. BMC Complement Med Ther 2022; 22:207. [PMID: 35922786 PMCID: PMC9347098 DOI: 10.1186/s12906-022-03686-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2021] [Accepted: 07/21/2022] [Indexed: 11/10/2022] Open
Abstract
Background The number of COVID-19 cases continues to grow in Indonesia. This phenomenon motivates researchers to find alternative drugs that function for prevention or treatment. Due to the rich biodiversity of Indonesian medicinal plants, one alternative is to examine the potential of herbal medicines to support COVID therapy. This study aims to identify potential compound candidates in Indonesian herbal using a machine learning and pharmacophore modeling approaches. Methods We used three classification methods that had different decision-making processes: support vector machine (SVM), multilayer perceptron (MLP), and random forest (RF). For the pharmacophore modeling approach, we performed a structure-based analysis on the 3D structure of the main protease SARS-CoV-2 (3CLPro) and repurposed SARS, MERS, and SARS-CoV-2 drugs identified from the literature as datasets in the ligand-based method. Lastly, we used molecular docking to analyze the interactions between the 3CLpro and 14 hit compounds from the Indonesian Herbal Database (HerbalDB), with lopinavir as a positive control. Results From the molecular docking analysis, we found six potential compounds that may act as the main proteases of the SARS-CoV-2 inhibitor: hesperidin, kaempferol-3,4'-di-O-methyl ether (Ermanin); myricetin-3-glucoside, peonidin 3-(4’-arabinosylglucoside); quercetin 3-(2G-rhamnosylrutinoside); and rhamnetin 3-mannosyl-(1-2)-alloside. Conclusions Our layered virtual screening with machine learning and pharmacophore modeling approaches provided a more objective and optimal virtual screening and avoided subjective decision making of the results. Herbal compounds from the screening, i.e. hesperidin, kaempferol-3,4'-di-O-methyl ether (Ermanin); myricetin-3-glucoside, peonidin 3-(4’-arabinosylglucoside); quercetin 3-(2G-rhamnosylrutinoside); and rhamnetin 3-mannosyl-(1-2)-alloside are potential antiviral candidates for SARS-CoV-2. Moringa oleifera and Psidium guajava that consist of those compounds, could be an alternative option as COVID-19 herbal preventions. Supplementary Information The online version contains supplementary material available at 10.1186/s12906-022-03686-y.
Collapse
|
5
|
Three New Stigmatellin Derivatives Reveal Biosynthetic Insights of Its Side Chain Decoration. Molecules 2022; 27:molecules27144656. [PMID: 35889529 PMCID: PMC9317276 DOI: 10.3390/molecules27144656] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2022] [Revised: 07/19/2022] [Accepted: 07/19/2022] [Indexed: 11/23/2022] Open
Abstract
Myxobacteria generate natural products with unique chemical structures, which not only feature remarkable biological functions, but also demonstrate unprecedented biosynthetic assembly strategies. The stigmatellins have been previously described as potent inhibitors of the mitochondrial and photosynthetic respiratory chain and originate from an unusual polyketide synthase assembly line. While previous biosynthetic investigations were focused on the formation of the 5,7-dimethoxy-8-hydroxychromone ring, side chain decoration of the hydrophobic alkenyl chain in position 2 was investigated less thoroughly. We report here the full structure elucidation, as well as cytotoxic and antimicrobial activities of three new stigmatellins isolated from the myxobacterium Vitiosangium cumulatum MCy10943T with side chain decorations distinct from previously characterized members of this compound family. The hydrophobic alkenyl chain in position 2 of the herein described stigmatellins feature a terminal carboxylic acid group (1), a methoxy group at C-12′ (2) or a vicinal diol (3). These findings provide further implications considering the side chain decoration of these aromatic myxobacterial polyketides and their underlying biosynthesis.
Collapse
|
6
|
Ravichandran J, Karthikeyan BS, Jost J, Samal A. An atlas of fragrance chemicals in children's products. THE SCIENCE OF THE TOTAL ENVIRONMENT 2022; 818:151682. [PMID: 34793786 DOI: 10.1016/j.scitotenv.2021.151682] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/18/2021] [Revised: 11/09/2021] [Accepted: 11/10/2021] [Indexed: 06/13/2023]
Abstract
Exposure to environmental chemicals during early childhood is a potential health concern. At a tender age, children are exposed to fragrance chemicals used in toys and child care products. Although there are few initiatives in Europe and United States towards monitoring and regulation of fragrance chemicals in children's products, such efforts are still lacking elsewhere. Besides there has been no systematic effort to create a database compiling the surrounding knowledge on fragrance chemicals used in children's products from published literature. Here, we built a database of Fragrance Chemicals in Children's Products (FCCP) that compiles information on 153 fragrance chemicals from published literature. The fragrance chemicals in FCCP have been classified based on their chemical structure, children's product source, chemical origin and odor profile. Moreover, we have also compiled the physicochemical properties, predicted Absorption, Distribution, Metabolism, Excretion and Toxicity (ADMET) properties, molecular descriptors and human target genes for the fragrance chemicals in FCCP. After building FCCP, we performed multiple analyses of the associated fragrance chemical space. Firstly, we assessed the regulatory status of the fragrance chemicals in FCCP through a comparative analysis with 21 chemical lists reflecting current guidelines or regulations. We find that several fragrance chemicals in children's products are potential carcinogens, endocrine disruptors, neurotoxicants, phytotoxins and skin sensitizers. Secondly, we performed a similarity network based analysis of the fragrance chemicals in children's products to reveal the high structural diversity of the associated chemical space. Lastly, we identified skin sensitizing fragrance chemicals in children's products using ToxCast assays. In a nutshell, we present a comprehensive resource and detailed analysis of fragrance chemicals in children's products highlighting the need for their better risk assessment and regulation to deliver safer products for children. FCCP is accessible at: https://cb.imsc.res.in/fccp.
Collapse
Affiliation(s)
- Janani Ravichandran
- The Institute of Mathematical Sciences (IMSc), Chennai 600113, India; Homi Bhabha National Institute (HBNI), Mumbai 400094, India
| | | | - Jürgen Jost
- Max Planck Institute for Mathematics in the Sciences, Leipzig 04103, Germany; The Santa Fe Institute, Santa Fe, NM 87501, USA
| | - Areejit Samal
- The Institute of Mathematical Sciences (IMSc), Chennai 600113, India; Homi Bhabha National Institute (HBNI), Mumbai 400094, India.
| |
Collapse
|
7
|
Saldívar-González FI, Aldas-Bulos VD, Medina-Franco JL, Plisson F. Natural product drug discovery in the artificial intelligence era. Chem Sci 2022; 13:1526-1546. [PMID: 35282622 PMCID: PMC8827052 DOI: 10.1039/d1sc04471k] [Citation(s) in RCA: 50] [Impact Index Per Article: 25.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2021] [Accepted: 12/10/2021] [Indexed: 12/19/2022] Open
Abstract
Natural products (NPs) are primarily recognized as privileged structures to interact with protein drug targets. Their unique characteristics and structural diversity continue to marvel scientists for developing NP-inspired medicines, even though the pharmaceutical industry has largely given up. High-performance computer hardware, extensive storage, accessible software and affordable online education have democratized the use of artificial intelligence (AI) in many sectors and research areas. The last decades have introduced natural language processing and machine learning algorithms, two subfields of AI, to tackle NP drug discovery challenges and open up opportunities. In this article, we review and discuss the rational applications of AI approaches developed to assist in discovering bioactive NPs and capturing the molecular "patterns" of these privileged structures for combinatorial design or target selectivity.
Collapse
Affiliation(s)
- F I Saldívar-González
- DIFACQUIM Research Group, School of Chemistry, Department of Pharmacy, Universidad Nacional Autónoma de México Avenida Universidad 3000 04510 Mexico Mexico
| | - V D Aldas-Bulos
- Unidad de Genómica Avanzada, Laboratorio Nacional de Genómica para la Biodiversidad (Langebio), Centro de Investigación y de Estudios Avanzados del IPN Irapuato Guanajuato Mexico
| | - J L Medina-Franco
- DIFACQUIM Research Group, School of Chemistry, Department of Pharmacy, Universidad Nacional Autónoma de México Avenida Universidad 3000 04510 Mexico Mexico
| | - F Plisson
- CONACYT - Unidad de Genómica Avanzada, Laboratorio Nacional de Genómica para la Biodiversidad (Langebio), Centro de Investigación y de Estudios Avanzados del IPN Irapuato Guanajuato Mexico
| |
Collapse
|
8
|
Ignacz G, Yang C, Szekely G. Diversity matters: Widening the chemical space in organic solvent nanofiltration. J Memb Sci 2022. [DOI: 10.1016/j.memsci.2021.119929] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
|
9
|
Bayer S, Mayer AI, Borgonovo G, Morini G, Di Pizio A, Bassoli A. Chemoinformatics View on Bitter Taste Receptor Agonists in Food. JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY 2021; 69:13916-13924. [PMID: 34762411 PMCID: PMC8630789 DOI: 10.1021/acs.jafc.1c05057] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/14/2023]
Abstract
Food compounds with a bitter taste have a role in human health, both for their capability to influence food choice and preferences and for their possible systemic effect due to the modulation of extra-oral bitter taste receptors (TAS2Rs). Investigating the interaction of bitter food compounds with TAS2Rs is a key step to unravel their complex effects on health and to pave the way to rationally design new additives for food formulation or drugs. Here, we propose a collection of food bitter compounds, for which in vitro activity data against TAS2Rs are available. The patterns of TAS2R subtype-specific agonists were analyzed using scaffold decomposition and chemical space analysis, providing a detailed characterization of the associations between food bitter tastants and TAS2Rs.
Collapse
Affiliation(s)
- Sebastian Bayer
- Leibniz
Institute for Food Systems Biology at the Technical University of
Munich, Lise-Meitner Str. 34, D-85354 Freising, Germany
- Faculty
of Life Sciences, University of Vienna, Djerassiplatz 1, 1030 Vienna, Austria
| | - Ariane Isabell Mayer
- Department
of Food, Environmental and Nutritional Sciences-DeFENS, University of Milan, via Celoria 2, 20147 Milano, Italy
| | - Gigliola Borgonovo
- Department
of Food, Environmental and Nutritional Sciences-DeFENS, University of Milan, via Celoria 2, 20147 Milano, Italy
| | - Gabriella Morini
- University
of Gastronomic Sciences, piazza Vittorio Emanuele 9, 12042 Pollenzo, (Bra, CN), Italy
| | - Antonella Di Pizio
- Leibniz
Institute for Food Systems Biology at the Technical University of
Munich, Lise-Meitner Str. 34, D-85354 Freising, Germany
- . Phone: +49(0)8161716516
| | - Angela Bassoli
- Department
of Food, Environmental and Nutritional Sciences-DeFENS, University of Milan, via Celoria 2, 20147 Milano, Italy
- . Phone: +39(0)250316815
| |
Collapse
|
10
|
A deep generative model enables automated structure elucidation of novel psychoactive substances. NAT MACH INTELL 2021. [DOI: 10.1038/s42256-021-00407-x] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
11
|
Menke J, Massa J, Koch O. Natural product scores and fingerprints extracted from artificial neural networks. Comput Struct Biotechnol J 2021; 19:4593-4602. [PMID: 34584636 PMCID: PMC8445839 DOI: 10.1016/j.csbj.2021.07.032] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2021] [Revised: 07/26/2021] [Accepted: 07/26/2021] [Indexed: 11/21/2022] Open
Abstract
Due to their desirable properties, natural products are an important ligand class for medicinal chemists. However, due to their structural distinctiveness, traditional cheminformatic approaches, like ligand-based virtual screening, often perform worse for natural products. Based on our recent work, we evaluated the ability of neural networks to generate fingerprints more appropriate for use with natural products. A manually curated dataset of natural products and synthetic decoys was used to train a multi-layer perceptron network and an autoencoder-like network. In-depth analysis showed that the extracted natural product-specific neural fingerprint outperforms traditional as well as natural product-specific fingerprints on three datasets. Further, we explored how the activations from the output layer of a network can work as a novel natural product likeness score. Overall, two natural product-specific datasets were generated, which are publicly available together with the code to create the fingerprints and the novel natural product likeness score.
Collapse
Affiliation(s)
- Janosch Menke
- Institute of Pharmaceutical and Medicinal Chemistry, Westfälische Wilhelms-Universität Münster, Corrensstraße 48, 48149 Münster, Germany
| | - Joana Massa
- Institute of Pharmaceutical and Medicinal Chemistry, Westfälische Wilhelms-Universität Münster, Corrensstraße 48, 48149 Münster, Germany
| | - Oliver Koch
- Institute of Pharmaceutical and Medicinal Chemistry, Westfälische Wilhelms-Universität Münster, Corrensstraße 48, 48149 Münster, Germany
- Center for Multiscale Theory and Computation, Westfälische Wilhelms-Universität Münster, Corrensstraße 48, 48149 Münster, Germany
| |
Collapse
|
12
|
|
13
|
Ekaney LYE, Eni DB, Ntie-Kang F. Chemical similarity methods for analyzing secondary metabolite structures. PHYSICAL SCIENCES REVIEWS 2021. [DOI: 10.1515/psr-2018-0129] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Abstract
The relation that exists between the structure of a compound and its function is an integral part of chemoinformatics. The similarity principle states that “structurally similar molecules tend to have similar properties and similar molecules exert similar biological activities”. The similarity of the molecules can either be studied at the structure level or at the descriptor level (properties level). Generally, the objective of chemical similarity measures is to enhance prediction of the biological activities of molecules. In this article, an overview of various methods used to compare the similarity between metabolite structures has been provided, including two-dimensional (2D) and three-dimensional (3D) approaches. The focus has been on methods description; e.g. fingerprint-based similarity in which the molecules under study are first fragmented and their fingerprints are computed, 2D structural similarity by comparing the Tanimoto coefficients and Euclidean distances, as well as the use of physiochemical properties descriptor-based similarity methods. The similarity between molecules could also be measured by using data mining (clustering) techniques, e.g. by using virtual screening (VS)-based similarity methods. In this approach, the molecules with the desired descriptors or /and structures are screened from large databases. Lastly, SMILES-based chemical similarity search is an important method for studying the exact structure search, substructure search and also descriptor similarity. The use of a particular method depends upon the requirements of the researcher.
Collapse
Affiliation(s)
- Lena Y. E. Ekaney
- Faculty of Science, Department of Chemistry , University of Buea , P.O. Box 63 , Buea , Cameroon
| | - Donatus B. Eni
- Faculty of Science, Department of Chemistry , University of Buea , P.O. Box 63 , Buea , Cameroon
- Department of Inorganic Chemistry, Faculty of Science , University of Yaoundé I , Yaoundé , Cameroon
| | - Fidele Ntie-Kang
- Faculty of Science, Department of Chemistry , University of Buea , P.O. Box 63 , Buea , Cameroon
- Department of Pharmaceutical Chemistry , Martin-Luther University Halle-Wittenberg , Kurt-Mothes-Str. 3 , Halle (Saale) , 06120 Germany
- Department of Informatics and Chemistry , University of Chemistry and Technology Prague , Technická 5 Prague 6 , Dejvice , 166 28 Czech Republic
| |
Collapse
|
14
|
Mitrofanov AA, Matveev PI, Yakubova KV, Korotcov A, Sattarov B, Tkachenko V, Kalmykov SN. Deep Learning Insights into Lanthanides Complexation Chemistry. Molecules 2021; 26:molecules26113237. [PMID: 34072262 PMCID: PMC8198800 DOI: 10.3390/molecules26113237] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2021] [Revised: 05/25/2021] [Accepted: 05/25/2021] [Indexed: 11/16/2022] Open
Abstract
Modern structure-property models are widely used in chemistry; however, in many cases, they are still a kind of a "black box" where there is no clear path from molecule structure to target property. Here we present an example of deep learning usage not only to build a model but also to determine key structural fragments of ligands influencing metal complexation. We have a series of chemically similar lanthanide ions, and we have collected data on complexes' stability, built models, predicting stability constants and decoded the models to obtain key fragments responsible for complexation efficiency. The results are in good correlation with the experimental ones, as well as modern theories of complexation. It was shown that the main influence on the constants had a mutual location of the binding centers.
Collapse
Affiliation(s)
- Artem A. Mitrofanov
- Department of Chemistry, Lomonosov Moscow State University, Leninskie Gory, 1 bld.3, 119991 Moscow, Russia; (P.I.M.); (S.N.K.)
- Science Data Software, 14909 Forest Landing Cir, Rockville, MD 20850, USA; (K.V.Y.); (A.K.); (B.S.); (V.T.)
- Correspondence:
| | - Petr I. Matveev
- Department of Chemistry, Lomonosov Moscow State University, Leninskie Gory, 1 bld.3, 119991 Moscow, Russia; (P.I.M.); (S.N.K.)
| | - Kristina V. Yakubova
- Science Data Software, 14909 Forest Landing Cir, Rockville, MD 20850, USA; (K.V.Y.); (A.K.); (B.S.); (V.T.)
| | - Alexandru Korotcov
- Science Data Software, 14909 Forest Landing Cir, Rockville, MD 20850, USA; (K.V.Y.); (A.K.); (B.S.); (V.T.)
| | - Boris Sattarov
- Science Data Software, 14909 Forest Landing Cir, Rockville, MD 20850, USA; (K.V.Y.); (A.K.); (B.S.); (V.T.)
| | - Valery Tkachenko
- Science Data Software, 14909 Forest Landing Cir, Rockville, MD 20850, USA; (K.V.Y.); (A.K.); (B.S.); (V.T.)
| | - Stepan N. Kalmykov
- Department of Chemistry, Lomonosov Moscow State University, Leninskie Gory, 1 bld.3, 119991 Moscow, Russia; (P.I.M.); (S.N.K.)
| |
Collapse
|
15
|
Reder GK, Young A, Altosaar J, Rajniak J, Elhadad N, Fischbach M, Holmes S. Supervised topic modeling for predicting molecular substructure from mass spectrometry. F1000Res 2021. [DOI: 10.12688/f1000research.52549.1] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/04/2023] Open
Abstract
Small-molecule metabolites are principal actors in myriad phenomena across biochemistry and serve as an important source of biomarkers and drug candidates. Given a sample of unknown composition, identifying the metabolites present is difficult given the large number of small molecules both known and yet to be discovered. Even for biofluids such as human blood, building reliable ways of identifying biomarkers is challenging. A workhorse method for characterizing individual molecules in such untargeted metabolomics studies is tandem mass spectrometry (MS/MS). MS/MS spectra provide rich information about chemical composition. However, structural characterization from spectra corresponding to unknown molecules remains a bottleneck in metabolomics. Current methods often rely on matching to pre-existing databases in one form or another. Here we develop a preprocessing scheme and supervised topic modeling approach to identify modular groups of spectrum fragments and neutral losses corresponding to chemical substructures using labeled latent Dirichlet allocation (LLDA) to map spectrum features to known chemical structures. These structures appear in new unknown spectra and can be predicted. We find that LLDA is an interpretable and reliable method for structure prediction from MS/MS spectra. Specifically, the LLDA approach has the following advantages: (a) molecular topics are interpretable; (b) A practitioner can select any set of chemical structure labels relevant to their problem; (c ) LLDA performs well and can exceed the performance of other methods in predicting substructures in novel contexts.
Collapse
|
16
|
Santana K, do Nascimento LD, Lima e Lima A, Damasceno V, Nahum C, Braga RC, Lameira J. Applications of Virtual Screening in Bioprospecting: Facts, Shifts, and Perspectives to Explore the Chemo-Structural Diversity of Natural Products. Front Chem 2021; 9:662688. [PMID: 33996755 PMCID: PMC8117418 DOI: 10.3389/fchem.2021.662688] [Citation(s) in RCA: 31] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2021] [Accepted: 02/25/2021] [Indexed: 12/22/2022] Open
Abstract
Natural products are continually explored in the development of new bioactive compounds with industrial applications, attracting the attention of scientific research efforts due to their pharmacophore-like structures, pharmacokinetic properties, and unique chemical space. The systematic search for natural sources to obtain valuable molecules to develop products with commercial value and industrial purposes remains the most challenging task in bioprospecting. Virtual screening strategies have innovated the discovery of novel bioactive molecules assessing in silico large compound libraries, favoring the analysis of their chemical space, pharmacodynamics, and their pharmacokinetic properties, thus leading to the reduction of financial efforts, infrastructure, and time involved in the process of discovering new chemical entities. Herein, we discuss the computational approaches and methods developed to explore the chemo-structural diversity of natural products, focusing on the main paradigms involved in the discovery and screening of bioactive compounds from natural sources, placing particular emphasis on artificial intelligence, cheminformatics methods, and big data analyses.
Collapse
Affiliation(s)
- Kauê Santana
- Instituto de Biodiversidade, Universidade Federal do Oeste do Pará, Santarém, Brazil
| | | | - Anderson Lima e Lima
- Instituto de Ciências Exatas e Naturais, Universidade Federal do Pará, Belém, Brazil
| | - Vinícius Damasceno
- Instituto de Ciências Exatas e Naturais, Universidade Federal do Pará, Belém, Brazil
| | - Claudio Nahum
- Instituto de Ciências Exatas e Naturais, Universidade Federal do Pará, Belém, Brazil
| | | | - Jerônimo Lameira
- Instituto de Ciências Biológicas, Universidade Federal do Pará, Belém, Brazil
| |
Collapse
|
17
|
Shen WX, Zeng X, Zhu F, Wang YL, Qin C, Tan Y, Jiang YY, Chen YZ. Out-of-the-box deep learning prediction of pharmaceutical properties by broadly learned knowledge-based molecular representations. NAT MACH INTELL 2021. [DOI: 10.1038/s42256-021-00301-6] [Citation(s) in RCA: 26] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
|
18
|
Skinnider MA, Johnston CW, Gunabalasingam M, Merwin NJ, Kieliszek AM, MacLellan RJ, Li H, Ranieri MRM, Webster ALH, Cao MPT, Pfeifle A, Spencer N, To QH, Wallace DP, Dejong CA, Magarvey NA. Comprehensive prediction of secondary metabolite structure and biological activity from microbial genome sequences. Nat Commun 2020; 11:6058. [PMID: 33247171 PMCID: PMC7699628 DOI: 10.1038/s41467-020-19986-1] [Citation(s) in RCA: 153] [Impact Index Per Article: 38.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2020] [Accepted: 11/05/2020] [Indexed: 11/23/2022] Open
Abstract
Novel antibiotics are urgently needed to address the looming global crisis of antibiotic resistance. Historically, the primary source of clinically used antibiotics has been microbial secondary metabolism. Microbial genome sequencing has revealed a plethora of uncharacterized natural antibiotics that remain to be discovered. However, the isolation of these molecules is hindered by the challenge of linking sequence information to the chemical structures of the encoded molecules. Here, we present PRISM 4, a comprehensive platform for prediction of the chemical structures of genomically encoded antibiotics, including all classes of bacterial antibiotics currently in clinical use. The accuracy of chemical structure prediction enables the development of machine-learning methods to predict the likely biological activity of encoded molecules. We apply PRISM 4 to chart secondary metabolite biosynthesis in a collection of over 10,000 bacterial genomes from both cultured isolates and metagenomic datasets, revealing thousands of encoded antibiotics. PRISM 4 is freely available as an interactive web application at http://prism.adapsyn.com. Large-scale sequencing efforts have uncovered a large number of secondary metabolic pathways, but the chemicals they synthesise remain unknown. Here the authors present PRISM 4, which predicts the chemical structures encoded by microbial genome sequences, including all classes of bacterial antibiotics in clinical use.
Collapse
Affiliation(s)
- Michael A Skinnider
- Department of Biochemistry & Biomedical Sciences, Michael G. DeGroote Institute for Infectious Disease Research, McMaster University, Hamilton, ON, Canada. .,Department of Chemistry & Chemical Biology, Michael G. DeGroote Institute for Infectious Disease Research, McMaster University, Hamilton, ON, Canada. .,Adapsyn Bioscience, Hamilton, ON, Canada. .,Michael Smith Laboratories, University of British Columbia, Vancouver, BC, Canada.
| | - Chad W Johnston
- Department of Biochemistry & Biomedical Sciences, Michael G. DeGroote Institute for Infectious Disease Research, McMaster University, Hamilton, ON, Canada.,Department of Chemistry & Chemical Biology, Michael G. DeGroote Institute for Infectious Disease Research, McMaster University, Hamilton, ON, Canada.,Adapsyn Bioscience, Hamilton, ON, Canada.,Institute for Medical Engineering and Science, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Mathusan Gunabalasingam
- Department of Biochemistry & Biomedical Sciences, Michael G. DeGroote Institute for Infectious Disease Research, McMaster University, Hamilton, ON, Canada.,Department of Chemistry & Chemical Biology, Michael G. DeGroote Institute for Infectious Disease Research, McMaster University, Hamilton, ON, Canada
| | - Nishanth J Merwin
- Department of Biochemistry & Biomedical Sciences, Michael G. DeGroote Institute for Infectious Disease Research, McMaster University, Hamilton, ON, Canada.,Department of Chemistry & Chemical Biology, Michael G. DeGroote Institute for Infectious Disease Research, McMaster University, Hamilton, ON, Canada
| | | | | | - Haoxin Li
- Adapsyn Bioscience, Hamilton, ON, Canada
| | - Michael R M Ranieri
- Department of Biochemistry & Biomedical Sciences, Michael G. DeGroote Institute for Infectious Disease Research, McMaster University, Hamilton, ON, Canada.,Department of Chemistry & Chemical Biology, Michael G. DeGroote Institute for Infectious Disease Research, McMaster University, Hamilton, ON, Canada
| | - Andrew L H Webster
- Department of Biochemistry & Biomedical Sciences, Michael G. DeGroote Institute for Infectious Disease Research, McMaster University, Hamilton, ON, Canada.,Department of Chemistry & Chemical Biology, Michael G. DeGroote Institute for Infectious Disease Research, McMaster University, Hamilton, ON, Canada
| | - My P T Cao
- Department of Biochemistry & Biomedical Sciences, Michael G. DeGroote Institute for Infectious Disease Research, McMaster University, Hamilton, ON, Canada.,Department of Chemistry & Chemical Biology, Michael G. DeGroote Institute for Infectious Disease Research, McMaster University, Hamilton, ON, Canada
| | | | | | - Q Huy To
- Department of Biochemistry & Biomedical Sciences, Michael G. DeGroote Institute for Infectious Disease Research, McMaster University, Hamilton, ON, Canada.,Department of Chemistry & Chemical Biology, Michael G. DeGroote Institute for Infectious Disease Research, McMaster University, Hamilton, ON, Canada
| | | | | | - Nathan A Magarvey
- Department of Biochemistry & Biomedical Sciences, Michael G. DeGroote Institute for Infectious Disease Research, McMaster University, Hamilton, ON, Canada.,Department of Chemistry & Chemical Biology, Michael G. DeGroote Institute for Infectious Disease Research, McMaster University, Hamilton, ON, Canada
| |
Collapse
|
19
|
Huang L, Luo H, Li S, Wu FX, Wang J. Drug-drug similarity measure and its applications. Brief Bioinform 2020; 22:5956929. [PMID: 33152756 DOI: 10.1093/bib/bbaa265] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2020] [Revised: 09/13/2020] [Accepted: 09/14/2020] [Indexed: 02/01/2023] Open
Abstract
Drug similarities play an important role in modern biology and medicine, as they help scientists gain deep insights into drugs' therapeutic mechanisms and conduct wet labs that may significantly improve the efficiency of drug research and development. Nowadays, a number of drug-related databases have been constructed, with which many methods have been developed for computing similarities between drugs for studying associations between drugs, human diseases, proteins (drug targets) and more. In this review, firstly, we briefly introduce the publicly available drug-related databases. Secondly, based on different drug features, interaction relationships and multimodal data, we summarize similarity calculation methods in details. Then, we discuss the applications of drug similarities in various biological and medical areas. Finally, we evaluate drug similarity calculation methods with common evaluation metrics to illustrate the important roles of drug similarity measures on different applications.
Collapse
Affiliation(s)
- Lan Huang
- Hunan Provincial Key Lab of Bioinformatics, School of Computer Science and Engineering at Central South University, Hunan, China
| | - Huimin Luo
- School of Computer and Information Engineering at Henan University, Kaifeng, China
| | - Suning Li
- Hunan Provincial Key Lab of Bioinformatics, School of Computer Science and Engineering, Central South University, Changsha, Hunan, China
| | - Fang-Xiang Wu
- College of Engineering and Department of Computer Sciences, University of Saskatchewan, Saskatoon, Canada
| | - Jianxin Wang
- Hunan Provincial Key Lab of Bioinformatics, School of Computer Science and Engineering at Central South University, Hunan, China
| |
Collapse
|
20
|
Alonzo DA, Schmeing TM. Biosynthesis of depsipeptides, or Depsi: The peptides with varied generations. Protein Sci 2020; 29:2316-2347. [PMID: 33073901 DOI: 10.1002/pro.3979] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2020] [Revised: 10/11/2020] [Accepted: 10/13/2020] [Indexed: 12/11/2022]
Abstract
Depsipeptides are compounds that contain both ester bonds and amide bonds. Important natural product depsipeptides include the piscicide antimycin, the K+ ionophores cereulide and valinomycin, the anticancer agent cryptophycin, and the antimicrobial kutzneride. Furthermore, database searches return hundreds of uncharacterized systems likely to produce novel depsipeptides. These compounds are made by specialized nonribosomal peptide synthetases (NRPSs). NRPSs are biosynthetic megaenzymes that use a module architecture and multi-step catalytic cycle to assemble monomer substrates into peptides, or in the case of specialized depsipeptide synthetases, depsipeptides. Two NRPS domains, the condensation domain and the thioesterase domain, catalyze ester bond formation, and ester bonds are introduced into depsipeptides in several different ways. The two most common occur during cyclization, in a reaction between a hydroxy-containing side chain and the C-terminal amino acid residue in a peptide intermediate, and during incorporation into the growing peptide chain of an α-hydroxy acyl moiety, recruited either by direct selection of an α-hydroxy acid substrate or by selection of an α-keto acid substrate that is reduced in situ. In this article, we discuss how and when these esters are introduced during depsipeptide synthesis, survey notable depsipeptide synthetases, and review insight into bacterial depsipeptide synthetases recently gained from structural studies.
Collapse
Affiliation(s)
- Diego A Alonzo
- Department of Biochemistry and Centre de Recherche en Biologie Structurale, McGill University, Montréal, Quebec, Canada
| | - T Martin Schmeing
- Department of Biochemistry and Centre de Recherche en Biologie Structurale, McGill University, Montréal, Quebec, Canada
| |
Collapse
|
21
|
Li B, Dai C, Wang L, Deng H, Li Y, Guan Z, Ni H. A novel drug repurposing approach for non-small cell lung cancer using deep learning. PLoS One 2020; 15:e0233112. [PMID: 32525938 PMCID: PMC7289363 DOI: 10.1371/journal.pone.0233112] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2019] [Accepted: 04/28/2020] [Indexed: 01/02/2023] Open
Abstract
Drug repurposing is an attractive and pragmatic way offering reduced risks and development time in the complicated process of drug discovery. In the past, drug repurposing has been largely accidental and serendipitous. The most successful examples so far have not involved a systematic approach. Nowadays, remarkable advances in drugs, diseases and bioinformatic knowledge are offering great opportunities for designing novel drug repurposing approach through comprehensive understanding of drug information. In this study, we introduced a novel drug repurposing approach based on transcriptomic data and chemical structures using deep learning. One strong candidate for repurposing has been identified. Pimozide is an anti-dyskinesia agent that is used for the suppression of motor and phonic tics in patients with Tourette's Disorder. However, our pipeline proposed it as a strong candidate for treating non-small cell lung cancer. The cytotoxicity of pimozide against A549 cell lines has been validated.
Collapse
Affiliation(s)
- Bingrui Li
- Beijing Deep Intelligent Pharma Technologies Co., Ltd, Beijing, China
| | - Chan Dai
- Beijing Deep Intelligent Pharma Technologies Co., Ltd, Beijing, China
| | - Lijun Wang
- Beijing Deep Intelligent Pharma Technologies Co., Ltd, Beijing, China
| | - Hailong Deng
- Beijing Deep Intelligent Pharma Technologies Co., Ltd, Beijing, China
| | - Yingying Li
- Beijing Deep Intelligent Pharma Technologies Co., Ltd, Beijing, China
- * E-mail: (YL); (ZG); (HN)
| | - Zheng Guan
- Beijing Deep Intelligent Pharma Technologies Co., Ltd, Beijing, China
- * E-mail: (YL); (ZG); (HN)
| | - Haihong Ni
- Beijing Deep Intelligent Pharma Technologies Co., Ltd, Beijing, China
- * E-mail: (YL); (ZG); (HN)
| |
Collapse
|
22
|
Lautié E, Russo O, Ducrot P, Boutin JA. Unraveling Plant Natural Chemical Diversity for Drug Discovery Purposes. Front Pharmacol 2020; 11:397. [PMID: 32317969 PMCID: PMC7154113 DOI: 10.3389/fphar.2020.00397] [Citation(s) in RCA: 78] [Impact Index Per Article: 19.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2019] [Accepted: 03/16/2020] [Indexed: 12/11/2022] Open
Abstract
The screening and testing of extracts against a variety of pharmacological targets in order to benefit from the immense natural chemical diversity is a concern in many laboratories worldwide. And several successes have been recorded in finding new actives in natural products, some of which have become new drugs or new sources of inspiration for drugs. But in view of the vast amount of research on the subject, it is surprising that not more drug candidates were found. In our view, it is fundamental to reflect upon the approaches of such drug discovery programs and the technical processes that are used, along with their inherent difficulties and biases. Based on an extensive survey of recent publications, we discuss the origin and the variety of natural chemical diversity as well as the strategies to having the potential to embrace this diversity. It seemed to us that some of the difficulties of the area could be related with the technical approaches that are used, so the present review begins with synthetizing some of the more used discovery strategies, exemplifying some key points, in order to address some of their limitations. It appears that one of the challenges of natural product-based drug discovery programs should be an easier access to renewable sources of plant-derived products. Maximizing the use of the data together with the exploration of chemical diversity while working on reasonable supply of natural product-based entities could be a way to answer this challenge. We suggested alternative ways to access and explore part of this chemical diversity with in vitro cultures. We also reinforced how important it was organizing and making available this worldwide knowledge in an "inventory" of natural products and their sources. And finally, we focused on strategies based on synthetic biology and syntheses that allow reaching industrial scale supply. Approaches based on the opportunities lying in untapped natural plant chemical diversity are also considered.
Collapse
Affiliation(s)
- Emmanuelle Lautié
- Centro de Valorização de Compostos Bioativos da Amazônia (CVACBA)-Instituto de Ciências Biológicas, Universidade Federal do Pará (UFPA), Belém, Brazil
| | - Olivier Russo
- Institut de Recherches Internationales SERVIER, Suresnes, France
| | - Pierre Ducrot
- Molecular Modelling Department, 'PEX Biotechnologie, Chimie & Biologie, Institut de Recherches SERVIER, Croissy-sur-Seine, France
| | - Jean A Boutin
- Institut de Recherches Internationales SERVIER, Suresnes, France
| |
Collapse
|
23
|
DeepRiPP integrates multiomics data to automate discovery of novel ribosomally synthesized natural products. Proc Natl Acad Sci U S A 2019; 117:371-380. [PMID: 31871149 DOI: 10.1073/pnas.1901493116] [Citation(s) in RCA: 78] [Impact Index Per Article: 15.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
Microbial natural products represent a rich resource of evolved chemistry that forms the basis for the majority of pharmacotherapeutics. Ribosomally synthesized and posttranslationally modified peptides (RiPPs) are a particularly interesting class of natural products noted for their unique mode of biosynthesis and biological activities. Analyses of sequenced microbial genomes have revealed an enormous number of biosynthetic loci encoding RiPPs but whose products remain cryptic. In parallel, analyses of bacterial metabolomes typically assign chemical structures to only a minority of detected metabolites. Aligning these 2 disparate sources of data could provide a comprehensive strategy for natural product discovery. Here we present DeepRiPP, an integrated genomic and metabolomic platform that employs machine learning to automate the selective discovery and isolation of novel RiPPs. DeepRiPP includes 3 modules. The first, NLPPrecursor, identifies RiPPs independent of genomic context and neighboring biosynthetic genes. The second module, BARLEY, prioritizes loci that encode novel compounds, while the third, CLAMS, automates the isolation of their corresponding products from complex bacterial extracts. DeepRiPP pinpoints target metabolites using large-scale comparative metabolomics analysis across a database of 10,498 extracts generated from 463 strains. We apply the DeepRiPP platform to expand the landscape of novel RiPPs encoded within sequenced genomes and to discover 3 novel RiPPs, whose structures are exactly as predicted by our platform. By building on advances in machine learning technologies, DeepRiPP integrates genomic and metabolomic data to guide the isolation of novel RiPPs in an automated manner.
Collapse
|
24
|
Zeng X, Zhang P, He W, Qin C, Chen S, Tao L, Wang Y, Tan Y, Gao D, Wang B, Chen Z, Chen W, Jiang YY, Chen YZ. NPASS: natural product activity and species source database for natural product research, discovery and tool development. Nucleic Acids Res 2019; 46:D1217-D1222. [PMID: 29106619 PMCID: PMC5753227 DOI: 10.1093/nar/gkx1026] [Citation(s) in RCA: 136] [Impact Index Per Article: 27.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2017] [Accepted: 10/18/2017] [Indexed: 01/22/2023] Open
Abstract
There has been renewed interests in the exploration of natural products (NPs) for drug discovery, and continuous investigations of the therapeutic claims and mechanisms of traditional and herbal medicines. In-silico methods have been employed for facilitating these studies. These studies and the optimization of in-silico algorithms for NP applications can be facilitated by the quantitative activity and species source data of the NPs. A number of databases collectively provide the structural and other information of ∼470 000 NPs, including qualitative activity information for many NPs, but only ∼4000 NPs are with the experimental activity values. There is a need for the activity and species source data of more NPs. We therefore developed a new database, NPASS (Natural Product Activity and Species Source) to complement other databases by providing the experimental activity values and species sources of 35 032 NPs from 25 041 species targeting 5863 targets (2946 proteins, 1352 microbial species and 1227 cell-lines). NPASS contains 446 552 quantitative activity records (e.g. IC50, Ki, EC50, GI50 or MIC mainly in units of nM) of 222 092 NP-target pairs and 288 002 NP-species pairs. NPASS, http://bidd2.nus.edu.sg/NPASS/, is freely accessible with its contents searchable by keywords, physicochemical property range, structural similarity, species and target search facilities.
Collapse
Affiliation(s)
- Xian Zeng
- Breeding Base-Shenzhen Key Laboratory of Chemical Biology, the Graduate School at Shenzhen, Tsinghua University, Shenzhen Kivita Innovative Drug Discovery Institute, Shenzhen 518055, PR China.,Bioinformatics and Drug Design group, Department of Pharmacy, National University of Singapore, Singapore 117543, Singapore
| | - Peng Zhang
- Bioinformatics and Drug Design group, Department of Pharmacy, National University of Singapore, Singapore 117543, Singapore
| | - Weidong He
- Bioinformatics and Drug Design group, Department of Pharmacy, National University of Singapore, Singapore 117543, Singapore
| | - Chu Qin
- Bioinformatics and Drug Design group, Department of Pharmacy, National University of Singapore, Singapore 117543, Singapore
| | - Shangying Chen
- Bioinformatics and Drug Design group, Department of Pharmacy, National University of Singapore, Singapore 117543, Singapore
| | - Lin Tao
- Bioinformatics and Drug Design group, Department of Pharmacy, National University of Singapore, Singapore 117543, Singapore.,Zhejiang Key Laboratory of Gastro-intestinal Pathophysiology, Zhejiang Hospital of Traditional Chinese Medicine, Zhejiang Chinese Medical University, School of Medicine, Hangzhou Normal University, Hangzhou 310006, RP China
| | - Yali Wang
- Bioinformatics and Drug Design group, Department of Pharmacy, National University of Singapore, Singapore 117543, Singapore
| | - Ying Tan
- Breeding Base-Shenzhen Key Laboratory of Chemical Biology, the Graduate School at Shenzhen, Tsinghua University, Shenzhen Kivita Innovative Drug Discovery Institute, Shenzhen 518055, PR China
| | - Dan Gao
- Breeding Base-Shenzhen Key Laboratory of Chemical Biology, the Graduate School at Shenzhen, Tsinghua University, Shenzhen Kivita Innovative Drug Discovery Institute, Shenzhen 518055, PR China
| | - Bohua Wang
- Key Lab of Agricultural Products Processing and Quality Control of Nanchang City, Jiangxi Agricultural University, Nanchang 330045, PR China.,College of Life and Environmental Sciences, Collaborative Innovation Center for Efficient and Health Production of Fisheries in Hunan Province, Hunan University of Arts and Science, Changde, Hunan 415000, PR China
| | - Zhe Chen
- Zhejiang Key Laboratory of Gastro-intestinal Pathophysiology, Zhejiang Hospital of Traditional Chinese Medicine, Zhejiang Chinese Medical University, School of Medicine, Hangzhou Normal University, Hangzhou 310006, RP China
| | - Weiping Chen
- Key Lab of Agricultural Products Processing and Quality Control of Nanchang City, Jiangxi Agricultural University, Nanchang 330045, PR China
| | - Yu Yang Jiang
- Breeding Base-Shenzhen Key Laboratory of Chemical Biology, the Graduate School at Shenzhen, Tsinghua University, Shenzhen Kivita Innovative Drug Discovery Institute, Shenzhen 518055, PR China
| | - Yu Zong Chen
- Breeding Base-Shenzhen Key Laboratory of Chemical Biology, the Graduate School at Shenzhen, Tsinghua University, Shenzhen Kivita Innovative Drug Discovery Institute, Shenzhen 518055, PR China.,Bioinformatics and Drug Design group, Department of Pharmacy, National University of Singapore, Singapore 117543, Singapore
| |
Collapse
|
25
|
Cheminformatics techniques in antimalarial drug discovery and development from natural products 1: basic concepts. PHYSICAL SCIENCES REVIEWS 2019. [DOI: 10.1515/psr-2018-0130] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
Abstract
A large number of natural products, especially those used in ethnomedicine of malaria, have shown varying in vitro antiplasmodial activities. Facilitating antimalarial drug development from this wealth of natural products is an imperative and laudable mission to pursue. However, limited manpower, high research cost coupled with high failure rate during preclinical and clinical studies might militate against the pursuit of this mission. These limitations may be overcome with cheminformatic techniques. Cheminformatics involves the organization, integration, curation, standardization, simulation, mining and transformation of pharmacology data (compounds and bioactivity) into knowledge that can drive rational and viable drug development decisions. This chapter will review the application of cheminformatics techniques (including molecular diversity analysis, quantitative-structure activity/property relationships and Machine learning) to natural products with in vitro and in vivo antiplasmodial activities in order to facilitate their development into antimalarial drug candidates and design of new potential antimalarial compounds.
Collapse
|
26
|
Abstract
We present a bipartite graph-based approach to calculate drug pairwise similarity for identifying potential new indications of approved drugs. Both chemical and molecular features were used in drug similarity calculation. In this paper, we first extracted drug chemical structures and drug-target interactions. Second, we computed chemical structure similarity and drug- target profile similarity. Further, we constructed a bipartite graph model with known relationships between drugs and their target proteins. Finally, we weighted summing drug structure similarity with target profile similarity to derive drug pairwise similarity, so that we can predict potential indication of a drug from its similar drugs. In addition, we summarized some alternative strategies and variations follow-up to each section in the overall analysis.
Collapse
|