1
|
Seal S, Spjuth O, Hosseini-Gerami L, García-Ortegón M, Singh S, Bender A, Carpenter AE. Insights into Drug Cardiotoxicity from Biological and Chemical Data: The First Public Classifiers for FDA Drug-Induced Cardiotoxicity Rank. J Chem Inf Model 2024; 64:1172-1186. [PMID: 38300851 PMCID: PMC10900289 DOI: 10.1021/acs.jcim.3c01834] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2023] [Revised: 01/11/2024] [Accepted: 01/16/2024] [Indexed: 02/03/2024]
Abstract
Drug-induced cardiotoxicity (DICT) is a major concern in drug development, accounting for 10-14% of postmarket withdrawals. In this study, we explored the capabilities of chemical and biological data to predict cardiotoxicity, using the recently released DICTrank data set from the United States FDA. We found that such data, including protein targets, especially those related to ion channels (e.g., hERG), physicochemical properties (e.g., electrotopological state), and peak concentration in plasma offer strong predictive ability for DICT. Compounds annotated with mechanisms of action such as cyclooxygenase inhibition could distinguish between most-concern and no-concern DICT. Cell Painting features for ER stress discerned most-concern cardiotoxic from nontoxic compounds. Models based on physicochemical properties provided substantial predictive accuracy (AUCPR = 0.93). With the availability of omics data in the future, using biological data promises enhanced predictability and deeper mechanistic insights, paving the way for safer drug development. All models from this study are available at https://broad.io/DICTrank_Predictor.
Collapse
Affiliation(s)
- Srijit Seal
- Imaging
Platform, Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, United States
- Yusuf
Hamied Department of Chemistry, University
of Cambridge, Lensfield Road, Cambridge CB2 1EW, U.K.
| | - Ola Spjuth
- Department
of Pharmaceutical Biosciences and Science for Life Laboratory, Uppsala University, Box
591, SE-75124 Uppsala, Sweden
| | - Layla Hosseini-Gerami
- Ignota
Labs, The Bradfield Centre, Cambridge Science Park, County Hall, Westminster Bridge Road, Cambridge CB4 0GA, U.K.
| | - Miguel García-Ortegón
- Yusuf
Hamied Department of Chemistry, University
of Cambridge, Lensfield Road, Cambridge CB2 1EW, U.K.
| | - Shantanu Singh
- Imaging
Platform, Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, United States
| | - Andreas Bender
- Yusuf
Hamied Department of Chemistry, University
of Cambridge, Lensfield Road, Cambridge CB2 1EW, U.K.
| | - Anne E. Carpenter
- Imaging
Platform, Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, United States
| |
Collapse
|
2
|
Seal S, Spjuth O, Hosseini-Gerami L, García-Ortegón M, Singh S, Bender A, Carpenter AE. Insights into Drug Cardiotoxicity from Biological and Chemical Data: The First Public Classifiers for FDA DICTrank. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.10.15.562398. [PMID: 37905146 PMCID: PMC10614794 DOI: 10.1101/2023.10.15.562398] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/02/2023]
Abstract
Drug-induced cardiotoxicity (DICT) is a major concern in drug development, accounting for 10-14% of postmarket withdrawals. In this study, we explored the capabilities of various chemical and biological data to predict cardiotoxicity, using the recently released Drug-Induced Cardiotoxicity Rank (DICTrank) dataset from the United States FDA. We analyzed a diverse set of data sources, including physicochemical properties, annotated mechanisms of action (MOA), Cell Painting, Gene Expression, and more, to identify indications of cardiotoxicity. We found that such data, including protein targets, especially those related to ion channels (such as hERG), physicochemical properties (such as electrotopological state) as well as peak concentration in plasma offer strong predictive ability as well as valuable insights into DICT. We also found compounds annotated with particular mechanisms of action, such as cyclooxygenase inhibition, could distinguish between most-concern and no-concern DICT compounds. Cell Painting features related to ER stress discern the most-concern cardiotoxic compounds from non-toxic compounds. While models based on physicochemical properties currently provide substantial predictive accuracy (AUCPR = 0.93), this study also underscores the potential benefits of incorporating more comprehensive biological data in future DICT predictive models. With the availability of - omics data in the future, using biological data promises enhanced predictability and delivers deeper mechanistic insights, paving the way for safer therapeutic drug development. All models and data used in this study are publicly released at https://broad.io/DICTrank_Predictor.
Collapse
Affiliation(s)
- Srijit Seal
- Imaging Platform, Broad Institute of MIT and Harvard, US
| | - Ola Spjuth
- Department of Pharmaceutical Biosciences, Uppsala University, Sweden
| | | | | | - Shantanu Singh
- Imaging Platform, Broad Institute of MIT and Harvard, US
| | | | | |
Collapse
|
3
|
do Carmo Maquiaveli C, da Silva ER, Hild de Jesus B, Oliveira Monteiro CE, Rodrigues Navarro T, Pereira Branco LO, Souza dos Santos I, Figueiredo Reis N, Portugal AB, Mendes Wanderley JL, Borges Farias A, Correia Romeiro N, de Lima EC. Design and Synthesis of New Anthranyl Phenylhydrazides: Antileishmanial Activity and Structure-Activity Relationship. Pharmaceuticals (Basel) 2023; 16:1120. [PMID: 37631035 PMCID: PMC10458276 DOI: 10.3390/ph16081120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2023] [Revised: 07/26/2023] [Accepted: 08/07/2023] [Indexed: 08/27/2023] Open
Abstract
Leishmaniasis is a neglected tropical disease affecting millions of people worldwide. A centenary approach to antimonial-based drugs was first initiated with the synthesis of urea stibamine by Upendranath Brahmachari in 1922. The need for new drug development led to resistance toward antimoniates. New drug development to treat leishmaniasis is urgently needed. In this way, searching for new substances with antileishmanial activity, we synthesized ten anthranyl phenylhydrazide and three quinazolinone derivatives and evaluated them against promastigotes and the intracellular amastigotes of Leishmania amazonensis. Three compounds showed good activity against promastigotes 1b, 1d, and 1g, with IC50 between 1 and 5 μM. These new phenylhydrazides were tested against Leishmania arginase, but they all failed to inhibit this parasite enzyme, as we have shown in a previous study. To explain the possible mechanism of action, we proposed the enzyme PTR1 as a new target for these compounds based on in silico analysis. In conclusion, the new anthranyl hydrazide derivatives can be a promising scaffold for developing new substances against the protozoa parasite.
Collapse
Affiliation(s)
- Claudia do Carmo Maquiaveli
- Laboratório de Farmacologia e Bioquímica (LFBq), Departamento de Medicina Veterinária, Universidade de São Paulo Faculdade de Zootecnia e Engenharia de Alimentos, Av. Duque de Caxias Norte 225, Pirassununga 13635-900, SP, Brazil
| | - Edson Roberto da Silva
- Laboratório de Farmacologia e Bioquímica (LFBq), Departamento de Medicina Veterinária, Universidade de São Paulo Faculdade de Zootecnia e Engenharia de Alimentos, Av. Duque de Caxias Norte 225, Pirassununga 13635-900, SP, Brazil
| | - Barbara Hild de Jesus
- Laboratório de Farmacologia e Bioquímica (LFBq), Departamento de Medicina Veterinária, Universidade de São Paulo Faculdade de Zootecnia e Engenharia de Alimentos, Av. Duque de Caxias Norte 225, Pirassununga 13635-900, SP, Brazil
| | - Caio Eduardo Oliveira Monteiro
- Laboratório de Farmacologia e Bioquímica (LFBq), Departamento de Medicina Veterinária, Universidade de São Paulo Faculdade de Zootecnia e Engenharia de Alimentos, Av. Duque de Caxias Norte 225, Pirassununga 13635-900, SP, Brazil
| | - Tiago Rodrigues Navarro
- Laboratório de Catálise e Síntese de Substâncias Bioativas, Instituto Multidisciplinar de Química, CM UFRJ-Macaé, Universidade Federal do Rio de Janeiro, Macaé CEP 27971-525, RJ, Brazil
| | - Luiz Octavio Pereira Branco
- Laboratório de Catálise e Síntese de Substâncias Bioativas, Instituto Multidisciplinar de Química, CM UFRJ-Macaé, Universidade Federal do Rio de Janeiro, Macaé CEP 27971-525, RJ, Brazil
| | - Isabela Souza dos Santos
- Laboratório de Catálise e Síntese de Substâncias Bioativas, Instituto Multidisciplinar de Química, CM UFRJ-Macaé, Universidade Federal do Rio de Janeiro, Macaé CEP 27971-525, RJ, Brazil
| | - Nanashara Figueiredo Reis
- Laboratório de Catálise e Síntese de Substâncias Bioativas, Instituto Multidisciplinar de Química, CM UFRJ-Macaé, Universidade Federal do Rio de Janeiro, Macaé CEP 27971-525, RJ, Brazil
| | - Arieli Bernardo Portugal
- Laboratório de Imunoparasitologia, Instituto de Ciências Médicas, Centro Multidisciplinar UFRJ, Macaé CEP 27979-000, RJ, Brazil
- Programa de Pós Graduação em Biociências e Biotecnologia, Universidade Estadual do Norte Fluminense, Campos dos Goytacazes CEP 28013-602, RJ, Brazil
| | - João Luiz Mendes Wanderley
- Laboratório de Imunoparasitologia, Instituto de Ciências Médicas, Centro Multidisciplinar UFRJ, Macaé CEP 27979-000, RJ, Brazil
| | - André Borges Farias
- Unidad Académica de Yucatán, Instituto de Investigaciones en Matemáticas Aplicadas y en Sistemas, Universidad Nacional Autónoma de México, Mérida 97302, Yucatán, Mexico
- Integrated Laboratory of Scientific Computing (LICC), Federal University of Rio de Janeiro (UFRJ)—Campus Macaé, Aluízio Silva Gomes Avenue 50, Granjas Cavaleiros, Macaé 27930-560, RJ, Brazil
| | - Nelilma Correia Romeiro
- Integrated Laboratory of Scientific Computing (LICC), Federal University of Rio de Janeiro (UFRJ)—Campus Macaé, Aluízio Silva Gomes Avenue 50, Granjas Cavaleiros, Macaé 27930-560, RJ, Brazil
| | - Evanoel Crizanto de Lima
- Laboratório de Catálise e Síntese de Substâncias Bioativas, Instituto Multidisciplinar de Química, CM UFRJ-Macaé, Universidade Federal do Rio de Janeiro, Macaé CEP 27971-525, RJ, Brazil
| |
Collapse
|
4
|
Hosseini-Gerami L, Ficulle E, Humphryes-Kirilov N, Airey DC, Scherschel J, Kananathan S, Eastwood BJ, Bose S, Collier DA, Laing E, Evans D, Broughton H, Bender A. Mechanism of action deconvolution of the small-molecule pathological tau aggregation inhibitor Anle138b. Alzheimers Res Ther 2023; 15:52. [PMID: 36918909 PMCID: PMC10012450 DOI: 10.1186/s13195-023-01182-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2022] [Accepted: 02/06/2023] [Indexed: 03/16/2023]
Abstract
BACKGROUND A key histopathological hallmark of Alzheimer's disease (AD) is the presence of neurofibrillary tangles of aggregated microtubule-associated protein tau in neurons. Anle138b is a small molecule which has previously shown efficacy in mice in reducing tau aggregates and rescuing AD disease phenotypes. METHODS In this work, we employed bioinformatics analysis-including pathway enrichment and causal reasoning-of an in vitro tauopathy model. The model consisted of cultured rat cortical neurons either unseeded or seeded with tau aggregates derived from human AD patients, both of which were treated with Anle138b to generate hypotheses for its mode of action. In parallel, we used a collection of human target prediction models to predict direct targets of Anle138b based on its chemical structure. RESULTS Combining the different approaches, we found evidence supporting the hypothesis that the action of Anle138b involves several processes which are key to AD progression, including cholesterol homeostasis and neuroinflammation. On the pathway level, we found significantly enriched pathways related to these two processes including those entitled "Superpathway of cholesterol biosynthesis" and "Granulocyte adhesion and diapedesis". With causal reasoning, we inferred differential activity of SREBF1/2 (involved in cholesterol regulation) and mediators of the inflammatory response such as NFKB1 and RELA. Notably, our findings were also observed in Anle138b-treated unseeded neurons, meaning that the inferred processes are independent of tau pathology and thus represent the direct action of the compound in the cellular system. Through structure-based ligand-target prediction, we predicted the intracellular cholesterol carrier NPC1 as well as NF-κB subunits as potential targets of Anle138b, with structurally similar compounds in the model training set known to target the same proteins. CONCLUSIONS This study has generated feasible hypotheses for the potential mechanism of action of Anle138b, which will enable the development of future molecular interventions aiming to reduce tau pathology in AD patients.
Collapse
Affiliation(s)
- Layla Hosseini-Gerami
- Centre for Molecular Informatics, Yusuf Hamied Department of Chemistry, University of Cambridge, Cambridge, UK.,AbsoluteAi Ltd, London, UK
| | - Elena Ficulle
- Eli Lilly and Company, Windlesham, UK.,Zifo RnD Solutions, London, UK
| | | | - David C Airey
- Eli Lilly and Company, Corporate Centre, Indianapolis, IN, USA
| | | | | | - Brian J Eastwood
- Eli Lilly and Company, Windlesham, UK.,Eli Lilly and Company, Bracknell, UK.,Eli Lilly and Company (Retired), Bracknell, UK
| | - Suchira Bose
- Eli Lilly and Company, Windlesham, UK.,Eli Lilly and Company, Bracknell, UK
| | - David A Collier
- Eli Lilly and Company, Windlesham, UK.,Eli Lilly and Company, Bracknell, UK.,Social, Genetic and Developmental Psychiatry Centre, IoPPN, Kings's College London and Genetic and Genomic Consulting Ltd, Farnham, UK
| | - Emma Laing
- Eli Lilly and Company, Windlesham, UK.,GSK, Stevenage, UK
| | - David Evans
- Eli Lilly and Company, Windlesham, UK.,DeepMind, London, UK
| | | | - Andreas Bender
- Centre for Molecular Informatics, Yusuf Hamied Department of Chemistry, University of Cambridge, Cambridge, UK.
| |
Collapse
|
5
|
Zhu Y, Yang H, Han L, Mervin LH, Hosseini-Gerami L, Li P, Wright P, Trapotsi MA, Liu K, Fan TP, Bender A. In silico prediction and biological assessment of novel angiogenesis modulators from traditional Chinese medicine. Front Pharmacol 2023; 14:1116081. [PMID: 36817116 PMCID: PMC9937659 DOI: 10.3389/fphar.2023.1116081] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2022] [Accepted: 01/20/2023] [Indexed: 02/05/2023] Open
Abstract
Uncontrolled angiogenesis is a common denominator underlying many deadly and debilitating diseases such as myocardial infarction, chronic wounds, cancer, and age-related macular degeneration. As the current range of FDA-approved angiogenesis-based medicines are far from meeting clinical demands, the vast reserve of natural products from traditional Chinese medicine (TCM) offers an alternative source for developing pro-angiogenic or anti-angiogenic modulators. Here, we investigated 100 traditional Chinese medicine-derived individual metabolites which had reported gene expression in MCF7 cell lines in the Gene Expression Omnibus (GSE85871). We extracted literature angiogenic activities for 51 individual metabolites, and subsequently analysed their predicted targets and differentially expressed genes to understand their mechanisms of action. The angiogenesis phenotype was used to generate decision trees for rationalising the poly-pharmacology of known angiogenesis modulators such as ferulic acid and curculigoside and validated by an in vitro endothelial tube formation assay and a zebrafish model of angiogenesis. Moreover, using an in silico model we prospectively examined the angiogenesis-modulating activities of the remaining 49 individual metabolites. In vitro, tetrahydropalmatine and 1 beta-hydroxyalantolactone stimulated, while cinobufotalin and isoalantolactone inhibited endothelial tube formation. In vivo, ginsenosides Rb3 and Rc, 1 beta-hydroxyalantolactone and surprisingly cinobufotalin, restored angiogenesis against PTK787-induced impairment in zebrafish. In the absence of PTK787, deoxycholic acid and ursodeoxycholic acid did not affect angiogenesis. Despite some limitations, these results suggest further refinements of in silico prediction combined with biological assessment will be a valuable platform for accelerating the research and development of natural products from traditional Chinese medicine and understanding their mechanisms of action, and also for other traditional medicines for the prevention and treatment of angiogenic diseases.
Collapse
Affiliation(s)
- Yingli Zhu
- Department of Clinical Chinese Pharmacy, School of Chinese Material Medica, Beijing University of Chinese Medicine, Beijing, China,Department of Chemistry, Center for Molecular Science Informatics, University of Cambridge, Cambridge, United Kingdom,Department of Pharmacology, University of Cambridge, Cambridge, United Kingdom
| | - Hongbin Yang
- Department of Chemistry, Center for Molecular Science Informatics, University of Cambridge, Cambridge, United Kingdom
| | - Liwen Han
- Engineering Research Center of Zebrafish Models for Human Diseases and Drug Screening of Shandong Province, Biology Institute, Qilu University of Technology, Shandong Academy of Sciences, Jinan, China,School of Pharmacy and Pharmaceutical Science, Shandong First Medical University, Shandong Academy of Medical Sciences, Jinan, China
| | - Lewis H. Mervin
- Department of Chemistry, Center for Molecular Science Informatics, University of Cambridge, Cambridge, United Kingdom
| | - Layla Hosseini-Gerami
- Department of Chemistry, Center for Molecular Science Informatics, University of Cambridge, Cambridge, United Kingdom
| | - Peihai Li
- Engineering Research Center of Zebrafish Models for Human Diseases and Drug Screening of Shandong Province, Biology Institute, Qilu University of Technology, Shandong Academy of Sciences, Jinan, China
| | - Peter Wright
- Department of Chemistry, Center for Molecular Science Informatics, University of Cambridge, Cambridge, United Kingdom
| | - Maria-Anna Trapotsi
- Department of Chemistry, Center for Molecular Science Informatics, University of Cambridge, Cambridge, United Kingdom
| | - Kechun Liu
- Engineering Research Center of Zebrafish Models for Human Diseases and Drug Screening of Shandong Province, Biology Institute, Qilu University of Technology, Shandong Academy of Sciences, Jinan, China
| | - Tai-Ping Fan
- Department of Pharmacology, University of Cambridge, Cambridge, United Kingdom,*Correspondence: Tai-Ping Fan, ; Andreas Bender,
| | - Andreas Bender
- Department of Chemistry, Center for Molecular Science Informatics, University of Cambridge, Cambridge, United Kingdom,*Correspondence: Tai-Ping Fan, ; Andreas Bender,
| |
Collapse
|
6
|
Thakur M, Bateman A, Brooksbank C, Freeberg M, Harrison M, Hartley M, Keane T, Kleywegt G, Leach A, Levchenko M, Morgan S, McDonagh E, Orchard S, Papatheodorou I, Velankar S, Vizcaino J, Witham R, Zdrazil B, McEntyre J. EMBL's European Bioinformatics Institute (EMBL-EBI) in 2022. Nucleic Acids Res 2023; 51:D9-D17. [PMID: 36477213 PMCID: PMC9825486 DOI: 10.1093/nar/gkac1098] [Citation(s) in RCA: 17] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2022] [Revised: 10/21/2022] [Accepted: 10/31/2022] [Indexed: 12/13/2022] Open
Abstract
The European Molecular Biology Laboratory's European Bioinformatics Institute (EMBL-EBI) is one of the world's leading sources of public biomolecular data. Based at the Wellcome Genome Campus in Hinxton, UK, EMBL-EBI is one of six sites of the European Molecular Biology Laboratory (EMBL), Europe's only intergovernmental life sciences organisation. This overview summarises the status of services that EMBL-EBI data resources provide to scientific communities globally. The scale, openness, rich metadata and extensive curation of EMBL-EBI added-value databases makes them particularly well-suited as training sets for deep learning, machine learning and artificial intelligence applications, a selection of which are described here. The data resources at EMBL-EBI can catalyse such developments because they offer sustainable, high-quality data, collected in some cases over decades and made openly availability to any researcher, globally. Our aim is for EMBL-EBI data resources to keep providing the foundations for tools and research insights that transform fields across the life sciences.
Collapse
Affiliation(s)
| | - Alex Bateman
- Data Services Teams, EMBL’s European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Cath Brooksbank
- Data Services Teams, EMBL’s European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Mallory Freeberg
- Data Services Teams, EMBL’s European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Melissa Harrison
- Data Services Teams, EMBL’s European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Matthew Hartley
- Data Services Teams, EMBL’s European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Thomas Keane
- Data Services Teams, EMBL’s European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Gerard Kleywegt
- Data Services Teams, EMBL’s European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Andrew Leach
- Data Services Teams, EMBL’s European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Mariia Levchenko
- Data Services Teams, EMBL’s European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Sarah Morgan
- Data Services Teams, EMBL’s European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Ellen M McDonagh
- Data Services Teams, EMBL’s European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton CB10 1SD, UK
- OpenTargets, EMBL’s European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Sandra Orchard
- Data Services Teams, EMBL’s European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Irene Papatheodorou
- Data Services Teams, EMBL’s European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Sameer Velankar
- Data Services Teams, EMBL’s European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Juan Antonio Vizcaino
- Data Services Teams, EMBL’s European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Rick Witham
- Data Services Teams, EMBL’s European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Barbara Zdrazil
- Data Services Teams, EMBL’s European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | | |
Collapse
|
7
|
Trapotsi MA, Hosseini-Gerami L, Bender A. Computational analyses of mechanism of action (MoA): data, methods and integration. RSC Chem Biol 2022; 3:170-200. [PMID: 35360890 PMCID: PMC8827085 DOI: 10.1039/d1cb00069a] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2021] [Accepted: 12/09/2021] [Indexed: 12/15/2022] Open
Abstract
The elucidation of a compound's Mechanism of Action (MoA) is a challenging task in the drug discovery process, but it is important in order to rationalise phenotypic findings and to anticipate potential side-effects. Bioinformatic approaches, advances in machine learning techniques and the increasing deposition of high-throughput data in public databases have significantly contributed to recent advances in the field, but it is not straightforward to decide which data and methods are most suitable to use in a given case. In this review, we focus on these methods and data and their applications in generating MoA hypotheses for subsequent experimental validation. We discuss compound-specific data such as -omics, cell morphology and bioactivity data, as well as commonly used supplementary prior knowledge such as network and pathway data, and provide information on databases where this data can be accessed. In terms of methodologies, we discuss both well-established methods (connectivity mapping, pathway enrichment) as well as more developing methods (neural networks and multi-omics integration). Finally, we review case studies where the MoA of a compound was successfully suggested from computational analysis by incorporating multiple data modalities and/or methodologies. Our aim for this review is to provide researchers with insights into the benefits and drawbacks of both the data and methods in terms of level of understanding, biases and interpretation - and to highlight future avenues of investigation which we foresee will improve the field of MoA elucidation, including greater public access to -omics data and methodologies which are capable of data integration.
Collapse
Affiliation(s)
- Maria-Anna Trapotsi
- Centre for Molecular Informatics, Yusuf Hamied Department of Chemistry, University of Cambridge UK
| | - Layla Hosseini-Gerami
- Centre for Molecular Informatics, Yusuf Hamied Department of Chemistry, University of Cambridge UK
| | - Andreas Bender
- Centre for Molecular Informatics, Yusuf Hamied Department of Chemistry, University of Cambridge UK
| |
Collapse
|
8
|
Mervin LH, Trapotsi MA, Afzal AM, Barrett IP, Bender A, Engkvist O. Probabilistic Random Forest improves bioactivity predictions close to the classification threshold by taking into account experimental uncertainty. J Cheminform 2021; 13:62. [PMID: 34412708 PMCID: PMC8375213 DOI: 10.1186/s13321-021-00539-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2021] [Accepted: 07/30/2021] [Indexed: 11/24/2022] Open
Abstract
Measurements of protein–ligand interactions have reproducibility limits due to experimental errors. Any model based on such assays will consequentially have such unavoidable errors influencing their performance which should ideally be factored into modelling and output predictions, such as the actual standard deviation of experimental measurements (σ) or the associated comparability of activity values between the aggregated heterogenous activity units (i.e., Ki versus IC50 values) during dataset assimilation. However, experimental errors are usually a neglected aspect of model generation. In order to improve upon the current state-of-the-art, we herein present a novel approach toward predicting protein–ligand interactions using a Probabilistic Random Forest (PRF) classifier. The PRF algorithm was applied toward in silico protein target prediction across ~ 550 tasks from ChEMBL and PubChem. Predictions were evaluated by taking into account various scenarios of experimental standard deviations in both training and test sets and performance was assessed using fivefold stratified shuffled splits for validation. The largest benefit in incorporating the experimental deviation in PRF was observed for data points close to the binary threshold boundary, when such information was not considered in any way in the original RF algorithm. For example, in cases when σ ranged between 0.4–0.6 log units and when ideal probability estimates between 0.4–0.6, the PRF outperformed RF with a median absolute error margin of ~ 17%. In comparison, the baseline RF outperformed PRF for cases with high confidence to belong to the active class (far from the binary decision threshold), although the RF models gave errors smaller than the experimental uncertainty, which could indicate that they were overtrained and/or over-confident. Finally, the PRF models trained with putative inactives decreased the performance compared to PRF models without putative inactives and this could be because putative inactives were not assigned an experimental pXC50 value, and therefore they were considered inactives with a low uncertainty (which in practice might not be true). In conclusion, PRF can be useful for target prediction models in particular for data where class boundaries overlap with the measurement uncertainty, and where a substantial part of the training data is located close to the classification threshold.
Collapse
Affiliation(s)
- Lewis H Mervin
- Molecular AI, Discovery Sciences, R&D, AstraZeneca, Cambridge, UK.
| | - Maria-Anna Trapotsi
- Department of Chemistry, Centre for Molecular Informatics, University of Cambridge, Lensfield Road, Cambridge, CB2 1EW, UK
| | - Avid M Afzal
- Data Sciences & Quantitative Biology, Discovery Sciences, R&D, AstraZeneca, Cambridge, UK
| | - Ian P Barrett
- Data Sciences & Quantitative Biology, Discovery Sciences, R&D, AstraZeneca, Cambridge, UK
| | - Andreas Bender
- Department of Chemistry, Centre for Molecular Informatics, University of Cambridge, Lensfield Road, Cambridge, CB2 1EW, UK
| | - Ola Engkvist
- Molecular AI, Discovery Sciences, R&D, AstraZeneca, Gothenburg, Sweden.,Department of Computer Science and Engineering, Chalmers University of Technology, Gothenburg, Sweden
| |
Collapse
|
9
|
Tanoli Z, Seemab U, Scherer A, Wennerberg K, Tang J, Vähä-Koskela M. Exploration of databases and methods supporting drug repurposing: a comprehensive survey. Brief Bioinform 2021; 22:1656-1678. [PMID: 32055842 PMCID: PMC7986597 DOI: 10.1093/bib/bbaa003] [Citation(s) in RCA: 43] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2019] [Revised: 12/09/2019] [Indexed: 02/07/2023] Open
Abstract
Drug development involves a deep understanding of the mechanisms of action and possible side effects of each drug, and sometimes results in the identification of new and unexpected uses for drugs, termed as drug repurposing. Both in case of serendipitous observations and systematic mechanistic explorations, confirmation of new indications for a drug requires hypothesis building around relevant drug-related data, such as molecular targets involved, and patient and cellular responses. These datasets are available in public repositories, but apart from sifting through the sheer amount of data imposing computational bottleneck, a major challenge is the difficulty in selecting which databases to use from an increasingly large number of available databases. The database selection is made harder by the lack of an overview of the types of data offered in each database. In order to alleviate these problems and to guide the end user through the drug repurposing efforts, we provide here a survey of 102 of the most promising and drug-relevant databases reported to date. We summarize the target coverage and types of data available in each database and provide several examples of how multi-database exploration can facilitate drug repurposing.
Collapse
Affiliation(s)
- Ziaurrehman Tanoli
- Institute for Molecular Medicine Finland (FIMM), University of Helsinki, Finland
| | - Umair Seemab
- Haartman Institute, University of Helsinki, Finland
| | - Andreas Scherer
- Institute for Molecular Medicine Finland (FIMM), University of Helsinki, Finland
| | - Krister Wennerberg
- Biotech Research & Innovation Centre (BRIC), University of Copenhagen, Denmark
| | - Jing Tang
- Faculty of medicine, University of Helsinki, Finland
| | - Markus Vähä-Koskela
- Institute for Molecular Medicine Finland (FIMM), University of Helsinki, Finland
| |
Collapse
|
10
|
Tanoli Z, Vähä-Koskela M, Aittokallio T. Artificial intelligence, machine learning, and drug repurposing in cancer. Expert Opin Drug Discov 2021; 16:977-989. [PMID: 33543671 DOI: 10.1080/17460441.2021.1883585] [Citation(s) in RCA: 40] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
Introduction: Drug repurposing provides a cost-effective strategy to re-use approved drugs for new medical indications. Several machine learning (ML) and artificial intelligence (AI) approaches have been developed for systematic identification of drug repurposing leads based on big data resources, hence further accelerating and de-risking the drug development process by computational means.Areas covered: The authors focus on supervised ML and AI methods that make use of publicly available databases and information resources. While most of the example applications are in the field of anticancer drug therapies, the methods and resources reviewed are widely applicable also to other indications including COVID-19 treatment. A particular emphasis is placed on the use of comprehensive target activity profiles that enable a systematic repurposing process by extending the target profile of drugs to include potent off-targets with therapeutic potential for a new indication.Expert opinion: The scarcity of clinical patient data and the current focus on genetic aberrations as primary drug targets may limit the performance of anticancer drug repurposing approaches that rely solely on genomics-based information. Functional testing of cancer patient cells exposed to a large number of targeted therapies and their combinations provides an additional source of repurposing information for tissue-aware AI approaches.
Collapse
Affiliation(s)
- Ziaurrehman Tanoli
- Institute for Molecular Medicine Finland (FIMM), Helsinki Institute of Life Science (HiLife, University of Helsinki, Helsinki, Finland
| | - Markus Vähä-Koskela
- Institute for Molecular Medicine Finland (FIMM), Helsinki Institute of Life Science (HiLife, University of Helsinki, Helsinki, Finland
| | - Tero Aittokallio
- Institute for Molecular Medicine Finland (FIMM), Helsinki Institute of Life Science (HiLife, University of Helsinki, Helsinki, Finland.,Institute for Cancer Research, Department of Cancer Genetics, Oslo University Hospital, Oslo, Norway.,Centre for Biostatistics and Epidemiology (OCBE), Faculty of Medicine, University of Oslo, Oslo, Norway
| |
Collapse
|
11
|
Towards understanding antimicrobial activity, cytotoxicity and the mode of action of dichapetalins A and M using in silico and in vitro studies. Toxicon 2021; 193:28-37. [PMID: 33493498 DOI: 10.1016/j.toxicon.2021.01.002] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2020] [Revised: 01/01/2021] [Accepted: 01/10/2021] [Indexed: 11/22/2022]
Abstract
Dichapetalum madagascariense Poir (Dichapetalaceae) is traditionally used to treat bacterial infections, jaundice, urethritis and viral hepatitis in Africa. Its root contains a broad spectrum of biologically active dichapetalins. To evaluate the plant's effect on human MCF-7 cells and its' antibacterial and antiparasitic potentials, we isolated and identified the known dichapetalins A and M from the roots. Both dichapetalins were tested on six bacterial strains (Shigella flexneri, Bacillus cereus, Salmonella paratyphi B, Listeria monocytogenes, Escherichia coli, Staphylococcus aureus) and two parasite strains; Trypanosoma brucei brucei, and Leishmania donovani using the Alamar Blue assay system. Dichapetalins A and M were more potent against B. cereus with IC50 values of 11.15 and 3.15 μg/ml, respectively, compared to the positive control ampicillin (IC50 = 19.50 μg/ml). Dichapetalins A (IC50 = 74.22 μg/ml) and M (IC50 = 72.34 μg/ml) were less active against T. b. brucei, compared to the standard Suramin (IC50 = 4.96 μg/ml). Dichapetalin M showed moderate activity against L. donovani (Amphotericin B: IC50 = 0.21 μg/ml) with an IC50 of 16.80 μg/ml. In human MCF-7 cells expressing the NR1I2 receptor, the activity of dichapetalin M was higher (IC50 = 4.71 μM and 3.95 μM) for 48 and 72 h of treatment, respectively compared to Curcumin with IC50 of 17.49 μM and 12.53 μM for 48 and 72 h of treatment, respectively. Results from in vitro expression studies with qPCR confirmed an antagonistic effect of dichapetalin M on PXR (NR1I2) signaling; supporting the PXR signaling pathway as a possible mode of action of dichapetalin M as predicted by in silico results. These findings confirm previous studies that D. madagascariense can be a source of potential lead compounds for development of novel antibiotic, antiparasitic and anticancer medicines, and provide further insights into the mechanism of action of the dichapetalins.
Collapse
|
12
|
Smit IA, Afzal AM, Allen CHG, Svensson F, Hanser T, Bender A. Systematic Analysis of Protein Targets Associated with Adverse Events of Drugs from Clinical Trials and Postmarketing Reports. Chem Res Toxicol 2020; 34:365-384. [PMID: 33351593 DOI: 10.1021/acs.chemrestox.0c00294] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Abstract
Adverse drug reactions (ADRs) are undesired effects of medicines that can harm patients and are a significant source of attrition in drug development. ADRs are anticipated by routinely screening drugs against secondary pharmacology protein panels. However, there is still a lack of quantitative information on the links between these off-target proteins and the reporting of ADRs in humans. Here, we present a systematic analysis of associations between measured and predicted in vitro bioactivities of drugs and adverse events (AEs) in humans from two sources of data: the Side Effect Resource, derived from clinical trials, and the Food and Drug Administration Adverse Event Reporting System, derived from postmarketing surveillance. The ratio of a drug's therapeutic unbound plasma concentration over the drug's in vitro potency against a given protein was used to select proteins most likely to be relevant to in vivo effects. In examining individual target bioactivities as predictors of AEs, we found a trade-off between the positive predictive value and the fraction of drugs with AEs that can be detected. However, considering sets of multiple targets for the same AE can help identify a greater fraction of AE-associated drugs. Of the 45 targets with statistically significant associations to AEs, 30 are included on existing safety target panels. The remaining 15 targets include 9 carbonic anhydrases, of which CA5B is significantly associated with cholestatic jaundice. We include the full quantitative data on associations between measured and predicted in vitro bioactivities and AEs in humans in this work, which can be used to make a more informed selection of safety profiling targets.
Collapse
Affiliation(s)
- Ines A Smit
- Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| | - Avid M Afzal
- Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| | - Chad H G Allen
- Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| | - Fredrik Svensson
- Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| | - Thierry Hanser
- Lhasa Limited, Granary Wharf House, 2 Canal Wharf, Leeds LS11 5PS, United Kingdom
| | - Andreas Bender
- Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| |
Collapse
|
13
|
Mervin LH, Johansson S, Semenova E, Giblin KA, Engkvist O. Uncertainty quantification in drug design. Drug Discov Today 2020; 26:474-489. [PMID: 33253918 DOI: 10.1016/j.drudis.2020.11.027] [Citation(s) in RCA: 29] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2020] [Revised: 07/13/2020] [Accepted: 11/23/2020] [Indexed: 01/03/2023]
Abstract
Machine learning and artificial intelligence are increasingly being applied to the drug-design process as a result of the development of novel algorithms, growing access, the falling cost of computation and the development of novel technologies for generating chemically and biologically relevant data. There has been recent progress in fields such as molecular de novo generation, synthetic route prediction and, to some extent, property predictions. Despite this, most research in these fields has focused on improving the accuracy of the technologies, rather than on quantifying the uncertainty in the predictions. Uncertainty quantification will become a key component in autonomous decision making and will be crucial for integrating machine learning and chemistry automation to create an autonomous design-make-test-analyse cycle. This review covers the empirical, frequentist and Bayesian approaches to uncertainty quantification, and outlines how they can be used for drug design. We also outline the impact of uncertainty quantification on decision making.
Collapse
Affiliation(s)
- Lewis H Mervin
- Molecular AI, Discovery Sciences, R&D, AstraZeneca, Cambridge, UK.
| | - Simon Johansson
- Department of Computer Science and Engineering, Chalmers University of Technology, Gothenburg, Sweden; Molecular AI, Discovery Sciences, R&D, AstraZeneca, Gothenburg, Sweden
| | - Elizaveta Semenova
- Data Sciences and Quantitative Biology, Discovery Sciences, R&D, AstraZeneca, Cambridge, UK
| | - Kathryn A Giblin
- Medicinal Chemistry, Research and Early Development, Oncology R&D, AstraZeneca, Cambridge, UK
| | - Ola Engkvist
- Molecular AI, Discovery Sciences, R&D, AstraZeneca, Gothenburg, Sweden
| |
Collapse
|
14
|
Mervin LH, Afzal AM, Engkvist O, Bender A. Comparison of Scaling Methods to Obtain Calibrated Probabilities of Activity for Protein–Ligand Predictions. J Chem Inf Model 2020; 60:4546-4559. [DOI: 10.1021/acs.jcim.0c00476] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Affiliation(s)
- Lewis H. Mervin
- Hit Discovery, Discovery Sciences, R&D, AstraZeneca, Cambridge CB2 0AA, U.K
| | - Avid M. Afzal
- Data Sciences & Quantitative Biology, Discovery Sciences, R&D, AstraZeneca, Cambridge CB2 0AA, U.K
| | - Ola Engkvist
- Hit Discovery, Discovery Sciences, R&D, AstraZeneca, Mölndal SE-431 83, Sweden
| | - Andreas Bender
- Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Cambridge CB2 1TN, U.K
| |
Collapse
|
15
|
Norinder U, Spjuth O, Svensson F. Using Predicted Bioactivity Profiles to Improve Predictive Modeling. J Chem Inf Model 2020; 60:2830-2837. [PMID: 32374618 DOI: 10.1021/acs.jcim.0c00250] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
Predictive modeling is a cornerstone in early drug development. Using information for multiple domains or across prediction tasks has the potential to improve the performance of predictive modeling. However, aggregating data often leads to incomplete data matrices that might be limiting for modeling. In line with previous studies, we show that by generating predicted bioactivity profiles, and using these as additional features, prediction accuracy of biological endpoints can be improved. Using conformal prediction, a type of confidence predictor, we present a robust framework for the calculation of these profiles and the evaluation of their impact. We report on the outcomes from several approaches to generate the predicted profiles on 16 datasets in cytotoxicity and bioactivity and show that efficiency is improved the most when including the p-values from conformal prediction as bioactivity profiles.
Collapse
Affiliation(s)
- Ulf Norinder
- Department of Computer and Systems Sciences, Stockholm University, Box 7003, SE-164 07 Kista, Sweden.,Department of Pharmaceutical Biosciences, Uppsala University, Box 591, SE-75124 Uppsala, Sweden.,MTM Research Centre, School of Science and Technology, Örebro University, SE-70182 Örebro, Sweden
| | - Ola Spjuth
- Department of Pharmaceutical Biosciences, Uppsala University, Box 591, SE-75124 Uppsala, Sweden.,Science for Life Laboratory, Uppsala University, Box 591, SE-75124 Uppsala, Sweden
| | - Fredrik Svensson
- The Alzheimer's Research UK University College London Drug Discovery Institute, The Cruciform Building, Gower Street, WC1E 6BT London, U.K
| |
Collapse
|
16
|
Baldo F. Prediction of modes of action of components of traditional medicinal preparations. PHYSICAL SCIENCES REVIEWS 2020. [DOI: 10.1515/psr-2018-0115] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
AbstractTraditional medicine preparations are used to treat many ailments in multiple regions across the world. Despite their widespread use, the mode of action of these preparations and their constituents are not fully understood. Traditional methods of elucidating the modes of action of these natural products (NPs) can be expensive and time consuming e. g. biochemical methods, bioactivity guided fractionation, etc. In this review, we discuss some methods for the prediction of the modes of action of traditional medicine preparations, both in mixtures and as isolated NPs. These methods are useful to predict targets of NPs before they are experimentally validated. Case studies of the applications of these methods are also provided herein.
Collapse
|
17
|
Sundar V, Colwell L. The Effect of Debiasing Protein-Ligand Binding Data on Generalization. J Chem Inf Model 2019; 60:56-62. [PMID: 31825609 DOI: 10.1021/acs.jcim.9b00415] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
The structured nature of chemical data means machine-learning models trained to predict protein-ligand binding risk overfitting the data, impairing their ability to generalize and make accurate predictions for novel candidate ligands. Data debiasing algorithms, which systematically partition the data to reduce bias and provide a more accurate metric of model performance, have the potential to address this issue. When models are trained using debiased data splits, the reward for simply memorizing the training data is reduced, suggesting that the ability of the model to make accurate predictions for novel candidate ligands will improve. To test this hypothesis, we use distance-based data splits to measure how well a model can generalize. We first confirm that models perform better for randomly split held-out sets than for distant held-out sets. We then debias the data and find, surprisingly, that debiasing typically reduces the ability of models to make accurate predictions for distant held-out test sets and that model performance measured after debiasing is not representative of the ability of a model to generalize. These results suggest that debiasing reduces the information available to a model, impairing its ability to generalize.
Collapse
Affiliation(s)
- Vikram Sundar
- Department of Chemistry , University of Cambridge , Lensfield Road , Cambridge CB2 1EW , U.K
| | - Lucy Colwell
- Department of Chemistry , University of Cambridge , Lensfield Road , Cambridge CB2 1EW , U.K
| |
Collapse
|
18
|
Allen CHG, Mervin LH, Mahmoud SY, Bender A. Leveraging heterogeneous data from GHS toxicity annotations, molecular and protein target descriptors and Tox21 assay readouts to predict and rationalise acute toxicity. J Cheminform 2019; 11:36. [PMID: 31152262 PMCID: PMC6544914 DOI: 10.1186/s13321-019-0356-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2018] [Accepted: 05/15/2019] [Indexed: 01/06/2023] Open
Abstract
Despite the increasing knowledge in both the chemical and biological domains the assimilation and exploration of heterogeneous datasets, encoding information about the chemical, bioactivity and phenotypic properties of compounds, remains a challenge due to requirement for overlap between chemicals assayed across the spaces. Here, we have constructed a novel dataset, larger than we have used in prior work, comprising 579 acute oral toxic compounds and 1427 non-toxic compounds derived from regulatory GHS information, along with their corresponding molecular and protein target descriptors and qHTS in vitro assay readouts from the Tox21 project. We found no clear association between the results of a FAFDrugs4 toxicophore screen and the acute oral toxicity classifications for our compound set; and a screen using a subset of the ToxAlerts toxicophores was also of limited utility, with only slight enrichment toward the toxic set (odds ratio of 1.48). We then investigated to what degree toxic and non-toxic compounds could be separated in each of the spaces, to compare their potential contribution to further analyses. Using an LDA projection, we found the largest degree of separation using chemical descriptors (Cohen’s d of 1.95) and the lowest degree of separation between toxicity classes using qHTS descriptors (Cohen’s d of 0.67). To compare the predictivity of the feature spaces for the toxicity endpoint, we next trained Random Forest (RF) acute oral toxicity classifiers on either molecular, protein target and qHTS descriptors. RFs trained on molecular and protein target descriptors were most predictive, with ROC AUC values of 0.80–0.92 and 0.70–0.85, respectively, across three test sets. RFs trained on both chemical and protein target descriptors combined exhibited similar predictive performance to the single-domain models (ROC AUC of 0.80–0.91). Model interpretability was improved by the inclusion of protein target descriptors, which allow the identification of specific targets (e.g. Retinal dehydrogenase) with literature links to toxic modes of action (e.g. oxidative stress). The dataset compiled in this study has been made available for future application.
Collapse
Affiliation(s)
- Chad H G Allen
- Department of Chemistry, Centre for Molecular Informatics, Lensfield Road, Cambridge, CB2 1EW, UK
| | - Lewis H Mervin
- Department of Chemistry, Centre for Molecular Informatics, Lensfield Road, Cambridge, CB2 1EW, UK
| | - Samar Y Mahmoud
- Department of Chemistry, Centre for Molecular Informatics, Lensfield Road, Cambridge, CB2 1EW, UK
| | - Andreas Bender
- Department of Chemistry, Centre for Molecular Informatics, Lensfield Road, Cambridge, CB2 1EW, UK.
| |
Collapse
|
19
|
Worrell BL, Brown AM, Santos WL, Bevan DR. In Silico Characterization of Structural Distinctions between Isoforms of Human and Mouse Sphingosine Kinases for Accelerating Drug Discovery. J Chem Inf Model 2019; 59:2339-2351. [PMID: 30844267 DOI: 10.1021/acs.jcim.8b00931] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
Alterations in cellular signaling pathways are associated with multiple disease states including cancers and fibrosis. Current research efforts to attenuate cancers, specifically lymphatic cancer, focus on inhibition of two sphingosine kinase isoforms, sphingosine kinase 1 (SphK1) and sphingosine kinase 2 (SphK2). Determining differences in structural and physicochemical binding site properties of SphKs is attractive to refine inhibitor potency and isoform selectivity. This study utilizes a predictive in silico approach to determine key differences in binding sites in SphK isoforms in human and mouse species. Homology modeling, molecular docking of inhibitors, analysis of binding pocket residue positions, development of pharmacophore models, and analysis of binding cavity volume were performed to determine isoform- and species-selective characteristics of the binding site and generate a system to rank potential inhibitors. Interestingly, docking studies showed compounds bound to mouse SphK1 in a manner more similar to human SphK2 than to human SphK1, indicating that SphKs in mice have structural properties distinct from humans that confounds prediction of ligand selectivity in mice. Our studies aid in the development and production of new compound classes by highlighting structural distinctions and identifying the role of key residues that cause observable, functional differences in isoforms and between orthologues.
Collapse
Affiliation(s)
- Brittney L Worrell
- Department of Biochemistry , Virginia Tech , 201 Engel Hall (0308) 340 West Campus Drive , Blacksburg , Virginia 24061 , United States
| | - Anne M Brown
- Department of Biochemistry , Virginia Tech , 201 Engel Hall (0308) 340 West Campus Drive , Blacksburg , Virginia 24061 , United States.,University Libraries , Virginia Tech , Blacksburg , Virginia 24061 , United States.,Virginia Tech Center for Drug Discovery , Virginia Tech , Blacksburg , Virginia 24061 , United States
| | - Webster L Santos
- Department of Chemistry , Virginia Tech , Blacksburg , Virginia 24061 , United States , and.,Virginia Tech Center for Drug Discovery , Virginia Tech , Blacksburg , Virginia 24061 , United States
| | - David R Bevan
- Department of Biochemistry , Virginia Tech , 201 Engel Hall (0308) 340 West Campus Drive , Blacksburg , Virginia 24061 , United States.,Virginia Tech Center for Drug Discovery , Virginia Tech , Blacksburg , Virginia 24061 , United States
| |
Collapse
|
20
|
Alberga D, Trisciuzzi D, Montaruli M, Leonetti F, Mangiatordi GF, Nicolotti O. A New Approach for Drug Target and Bioactivity Prediction: The Multifingerprint Similarity Search Algorithm (MuSSeL). J Chem Inf Model 2018; 59:586-596. [PMID: 30485097 DOI: 10.1021/acs.jcim.8b00698] [Citation(s) in RCA: 48] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
Abstract
We present MuSSeL, a multifingerprint similarity search algorithm, able to predict putative drug targets for a given query small molecule as well as to return a quantitative assessment of its bioactivity in terms of Ki or IC50 values. Predictions are automatically made exploiting a large collection of high quality experimental bioactivity data available from ChEMBL (version 22.1) combining, in a consensus-like approach, predictions resulting from a similarity search performed using 13 different fingerprint definitions. Importantly, the herein proposed algorithm is also effective in detecting and handling activity cliffs. A calibration set including small molecules present in the last updated version of ChEMBL (version 23) was employed to properly tune the algorithm parameters. Three randomly built external sets were instead challenged for model performances. The potential use of MuSSeL was also challenged by a prospective exercise for the prediction of five bioactive compounds taken from articles published in the Journal of Medicinal Chemistry just few months ago. The paper emphasizes the importance of implementing multifingerprint consensus strategies to increase the confidence in prediction of similarity search algorithms and provides a fast and easy-to-run tool for drug target and bioactivity prediction.
Collapse
Affiliation(s)
- Domenico Alberga
- Dipartimento di Farmacia-Scienze del Farmaco , Università degli Studi di Bari "Aldo Moro" , Via E. Orabona, 4 , I-70126 Bari , Italy
| | - Daniela Trisciuzzi
- Dipartimento di Farmacia-Scienze del Farmaco , Università degli Studi di Bari "Aldo Moro" , Via E. Orabona, 4 , I-70126 Bari , Italy
| | - Michele Montaruli
- Dipartimento di Farmacia-Scienze del Farmaco , Università degli Studi di Bari "Aldo Moro" , Via E. Orabona, 4 , I-70126 Bari , Italy
| | - Francesco Leonetti
- Dipartimento di Farmacia-Scienze del Farmaco , Università degli Studi di Bari "Aldo Moro" , Via E. Orabona, 4 , I-70126 Bari , Italy
| | - Giuseppe Felice Mangiatordi
- Dipartimento di Farmacia-Scienze del Farmaco , Università degli Studi di Bari "Aldo Moro" , Via E. Orabona, 4 , I-70126 Bari , Italy
| | - Orazio Nicolotti
- Dipartimento di Farmacia-Scienze del Farmaco , Università degli Studi di Bari "Aldo Moro" , Via E. Orabona, 4 , I-70126 Bari , Italy
| |
Collapse
|
21
|
Noori HR, Mervin LH, Bokharaie V, Durmus Ö, Egenrieder L, Fritze S, Gruhlke B, Reinhardt G, Schabel HH, Staudenmaier S, Logothetis NK, Bender A, Spanagel R. Systemic neurotransmitter responses to clinically approved and experimental neuropsychiatric drugs. Nat Commun 2018; 9:4699. [PMID: 30410047 PMCID: PMC6224407 DOI: 10.1038/s41467-018-07239-1] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2018] [Accepted: 10/19/2018] [Indexed: 01/04/2023] Open
Abstract
Neuropsychiatric disorders are the third leading cause of global disease burden. Current pharmacological treatment for these disorders is inadequate, with often insufficient efficacy and undesirable side effects. One reason for this is that the links between molecular drug action and neurobehavioral drug effects are elusive. We use a big data approach from the neurotransmitter response patterns of 258 different neuropsychiatric drugs in rats to address this question. Data from experiments comprising 110,674 rats are presented in the Syphad database [ www.syphad.org ]. Chemoinformatics analyses of the neurotransmitter responses suggest a mismatch between the current classification of neuropsychiatric drugs and spatiotemporal neurostransmitter response patterns at the systems level. In contrast, predicted drug-target interactions reflect more appropriately brain region related neurotransmitter response. In conclusion the neurobiological mechanism of neuropsychiatric drugs are not well reflected by their current classification or their chemical similarity, but can be better captured by molecular drug-target interactions.
Collapse
Affiliation(s)
- Hamid R Noori
- Institute of Psychopharmacology, Central Institute of Mental Health, Medical Faculty Mannheim, University of Heidelberg, J5 68159, Mannheim, Germany.
- Max Planck Institute for Biological Cybernetics, Max Planck Ring 8, 72076, Tübingen, Germany.
- Courant Institute for Mathematical Sciences, New York University, 251 Mercer Street, New York, NY, 10012, USA.
- Neuronal Convergence Group, Max Planck Institute for Biological Cybernetics, Max Planck Ring 8, 72076, Tübingen, Germany.
| | - Lewis H Mervin
- Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge, CB2 1EW, UK
| | - Vahid Bokharaie
- Max Planck Institute for Biological Cybernetics, Max Planck Ring 8, 72076, Tübingen, Germany
| | - Özlem Durmus
- Institute of Psychopharmacology, Central Institute of Mental Health, Medical Faculty Mannheim, University of Heidelberg, J5 68159, Mannheim, Germany
| | - Lisamon Egenrieder
- Institute of Psychopharmacology, Central Institute of Mental Health, Medical Faculty Mannheim, University of Heidelberg, J5 68159, Mannheim, Germany
| | - Stefan Fritze
- Institute of Psychopharmacology, Central Institute of Mental Health, Medical Faculty Mannheim, University of Heidelberg, J5 68159, Mannheim, Germany
| | - Britta Gruhlke
- Institute of Psychopharmacology, Central Institute of Mental Health, Medical Faculty Mannheim, University of Heidelberg, J5 68159, Mannheim, Germany
| | - Giulia Reinhardt
- Institute of Psychopharmacology, Central Institute of Mental Health, Medical Faculty Mannheim, University of Heidelberg, J5 68159, Mannheim, Germany
| | - Hans-Hendrik Schabel
- Institute of Psychopharmacology, Central Institute of Mental Health, Medical Faculty Mannheim, University of Heidelberg, J5 68159, Mannheim, Germany
| | - Sabine Staudenmaier
- Institute of Psychopharmacology, Central Institute of Mental Health, Medical Faculty Mannheim, University of Heidelberg, J5 68159, Mannheim, Germany
| | - Nikos K Logothetis
- Max Planck Institute for Biological Cybernetics, Max Planck Ring 8, 72076, Tübingen, Germany
| | - Andreas Bender
- Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge, CB2 1EW, UK
| | - Rainer Spanagel
- Institute of Psychopharmacology, Central Institute of Mental Health, Medical Faculty Mannheim, University of Heidelberg, J5 68159, Mannheim, Germany
| |
Collapse
|
22
|
Mason DJ, Eastman RT, Lewis RPI, Stott IP, Guha R, Bender A. Using Machine Learning to Predict Synergistic Antimalarial Compound Combinations With Novel Structures. Front Pharmacol 2018; 9:1096. [PMID: 30333748 PMCID: PMC6176478 DOI: 10.3389/fphar.2018.01096] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2018] [Accepted: 09/07/2018] [Indexed: 01/28/2023] Open
Abstract
The parasite Plasmodium falciparum is the most lethal species of Plasmodium to cause serious malaria infection in humans, and with resistance developing rapidly novel treatment modalities are currently being sought, one of which being combinations of existing compounds. The discovery of combinations of antimalarial drugs that act synergistically with one another is hence of great importance; however an exhaustive experimental screen of large drug space in a pairwise manner is not an option. In this study we apply our machine learning approach, Combination Synergy Estimation (CoSynE), which can predict novel synergistic drug interactions using only prior experimental combination screening data and knowledge of compound molecular structures, to a dataset of 1,540 antimalarial drug combinations in which 22.2% were synergistic. Cross validation of our model showed that synergistic CoSynE predictions are enriched 2.74 × compared to random selection when both compounds in a predicted combination are known from other combinations among the training data, 2.36 × when only one compound is known from the training data, and 1.5 × for entirely novel combinations. We prospectively validated our model by making predictions for 185 combinations of 23 entirely novel compounds. CoSynE predicted 20 combinations to be synergistic, which was experimentally validated for nine of them (45%), corresponding to an enrichment of 1.70 × compared to random selection from this prospective data set. Such enrichment corresponds to a 41% reduction in experimental effort. Interestingly, we found that pairwise screening of the compounds CoSynE individually predicted to be synergistic would result in an enrichment of 1.36 × compared to random selection, indicating that synergy among compound combinations is not a random event. The nine novel and correctly predicted synergistic compound combinations mainly (where sufficient bioactivity information is available) consist of efflux or transporter inhibitors (such as hydroxyzine), combined with compounds exhibiting antimalarial activity alone (such as sorafenib, apicidin, or dihydroergotamine). However, not all compound synergies could be rationalized easily in this way. Overall, this study highlights the potential for predictive modeling to expedite the discovery of novel drug combinations in fight against antimalarial resistance, while the underlying approach is also generally applicable.
Collapse
Affiliation(s)
- Daniel J Mason
- Department of Chemistry, Centre for Molecular Informatics, University of Cambridge, Cambridge, United Kingdom.,Healx Ltd., Cambridge, United Kingdom
| | - Richard T Eastman
- Division of Preclinical Innovation, National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, MD, United States
| | - Richard P I Lewis
- Department of Chemistry, Centre for Molecular Informatics, University of Cambridge, Cambridge, United Kingdom
| | - Ian P Stott
- Unilever Research and Development, Wirral, United Kingdom
| | - Rajarshi Guha
- Division of Preclinical Innovation, National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, MD, United States
| | - Andreas Bender
- Department of Chemistry, Centre for Molecular Informatics, University of Cambridge, Cambridge, United Kingdom
| |
Collapse
|