1
|
Wang T, Pulkkinen OI, Aittokallio T. Target-specific compound selectivity for multi-target drug discovery and repurposing. Front Pharmacol 2022; 13:1003480. [PMID: 36225560 PMCID: PMC9549418 DOI: 10.3389/fphar.2022.1003480] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2022] [Accepted: 08/15/2022] [Indexed: 11/13/2022] Open
Abstract
Most drug molecules modulate multiple target proteins, leading either to therapeutic effects or unwanted side effects. Such target promiscuity partly contributes to high attrition rates and leads to wasted costs and time in the current drug discovery process, and makes the assessment of compound selectivity an important factor in drug development and repurposing efforts. Traditionally, selectivity of a compound is characterized in terms of its target activity profile (wide or narrow), which can be quantified using various statistical and information theoretic metrics. Even though the existing selectivity metrics are widely used for characterizing the overall selectivity of a compound, they fall short in quantifying how selective the compound is against a particular target protein (e.g., disease target of interest). We therefore extended the concept of compound selectivity towards target-specific selectivity, defined as the potency of a compound to bind to the particular protein in comparison to the other potential targets. We decompose the target-specific selectivity into two components: 1) the compound’s potency against the target of interest (absolute potency), and 2) the compound’s potency against the other targets (relative potency). The maximally selective compound-target pairs are then identified as a solution of a bi-objective optimization problem that simultaneously optimizes these two potency metrics. In computational experiments carried out using large-scale kinase inhibitor dataset, which represents a wide range of polypharmacological activities, we show how the optimization-based selectivity scoring offers a systematic approach to finding both potent and selective compounds against given kinase targets. Compared to the existing selectivity metrics, we show how the target-specific selectivity provides additional insights into the target selectivity and promiscuity of multi-targeting kinase inhibitors. Even though the selectivity score is shown to be relatively robust against both missing bioactivity values and the dataset size, we further developed a permutation-based procedure to calculate empirical p-values to assess the statistical significance of the observed selectivity of a compound-target pair in the given bioactivity dataset. We present several case studies that show how the target-specific selectivity can distinguish between highly selective and broadly-active kinase inhibitors, hence facilitating the discovery or repurposing of multi-targeting drugs.
Collapse
Affiliation(s)
- Tianduanyi Wang
- Institute for Molecular Medicine Finland (FIMM), University of Helsinki, Helsinki, Finland
- Department of Computer Science, Aalto University, Espoo, Finland
| | - Otto I. Pulkkinen
- Institute for Molecular Medicine Finland (FIMM), University of Helsinki, Helsinki, Finland
- Helsinki Institute for Information Technology (HIIT), Department of Computer Science, University of Helsinki, Helsinki, Finland
- Department of Mathematics and Statistics and InFLAMES Research Flagship, University of Turku, Turku, Finland
| | - Tero Aittokallio
- Institute for Molecular Medicine Finland (FIMM), University of Helsinki, Helsinki, Finland
- Helsinki Institute for Information Technology (HIIT), Department of Computer Science, University of Helsinki, Helsinki, Finland
- Department of Mathematics and Statistics and InFLAMES Research Flagship, University of Turku, Turku, Finland
- Institute for Cancer Research, Department of Cancer Genetics, Oslo University Hospital, Oslo, Norway
- Oslo Centre for Biostatistics and Epidemiology (OCBE), Faculty of Medicine, University of Oslo, Oslo, Norway
- *Correspondence: Tero Aittokallio,
| |
Collapse
|
2
|
Matharu K, Chana K, Ferro CJ, Jones AM. Polypharmacology of clinical sodium glucose co-transport protein 2 inhibitors and relationship to suspected adverse drug reactions. Pharmacol Res Perspect 2021; 9:e00867. [PMID: 34586753 PMCID: PMC8480305 DOI: 10.1002/prp2.867] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2021] [Revised: 08/23/2021] [Accepted: 08/24/2021] [Indexed: 12/19/2022] Open
Abstract
Sodium glucose co-transporter 2 inhibitors (SGLT2i) are a promising second-line treatment strategy for type 2 diabetes mellitus (T2DM) with a developing landscape of both beneficial cardio- and nephroprotective properties and emerging adverse drug reactions (ADRs) including diabetic ketoacidosis (DKA), genetic mycotic infections, and amputations among others. A national register study (MHRA Yellow Card, UK) was used to quantify the SGLT2i's suspected ADRs relative to their Rx rate (OpenPrescribing, UK). The polypharmacology profiles of SGLT2i were data-mined (ChEMBL) for the first time. The ADR reports (n = 3629) and prescribing numbers (Rx n = 5,813,325) for each SGLT2i in the United Kingdom (from launch date to the beginning December 2019) were determined. Empagliflozin possesses the most selective SGLT2/SGLT1 inhibition profile at ~2500-fold, ~10-fold more selective than cangliflozin (~260-fold). Canagliflozin was found to also inhibit CYP at clinically achievable concentrations. We find that for overall ADR rates, empagliflozin versus dapagliflozin and empagliflozin versus canagliflozin are statistically significant (χ2 , p < .05), while dapagliflozin versus canagliflozin is not. In terms of overall ADRs, there is a greater relative rate for canagliflozin > dapagliflozin > empagliflozin. For fatalities, there is a greater relative rate for dapagliflozin > canagliflozin > empagliflozin. An organ classification that resulted in a statistically significant difference between SGLT2i was suspected infection/infestation ADRs between empagliflozin and dapagliflozin. Our findings at this stage of SGLT2i usage in the United Kingdom suggest that empagliflozin, the most selective SGLT2i, had the lowest suspected ADR incident rate (relative to prescribing) and in all reported classes of ADRs identified including infections, amputations, and DKA.
Collapse
Affiliation(s)
- Karan Matharu
- School of PharmacyInstitute of Clinical SciencesCollege of Medical and Dental SciencesUniversity of BirminghamBirminghamUnited Kingdom
| | - Kiran Chana
- School of PharmacyInstitute of Clinical SciencesCollege of Medical and Dental SciencesUniversity of BirminghamBirminghamUnited Kingdom
| | - Charles J. Ferro
- Birmingham Cardio‐Renal GroupInstitute of Cardiovascular SciencesCollege of Medical and Dental SciencesUniversity of BirminghamBirminghamUnited Kingdom
| | - Alan M. Jones
- School of PharmacyInstitute of Clinical SciencesCollege of Medical and Dental SciencesUniversity of BirminghamBirminghamUnited Kingdom
| |
Collapse
|
3
|
Born J, Manica M, Cadow J, Markert G, Mill NA, Filipavicius M, Janakarajan N, Cardinale A, Laino T, Rodríguez Martínez M. Data-driven molecular design for discovery and synthesis of novel ligands: a case study on SARS-CoV-2. MACHINE LEARNING: SCIENCE AND TECHNOLOGY 2021. [DOI: 10.1088/2632-2153/abe808] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023] Open
Abstract
Abstract
Bridging systems biology and drug design, we propose a deep learning framework for de novo discovery of molecules tailored to bind with given protein targets. Our methodology is exemplified by the task of designing antiviral candidates to target SARS-CoV-2 related proteins. Crucially, our framework does not require fine-tuning for specific proteins but is demonstrated to generalize in proposing ligands with high predicted binding affinities against unseen targets. Coupling our framework with the automatic retrosynthesis prediction of IBM RXN for Chemistry, we demonstrate the feasibility of swift chemical synthesis of molecules with potential antiviral properties that were designed against a specific protein target. In particular, we synthesize an antiviral candidate designed against the host protein angiotensin converting enzyme 2 (ACE2); a surface receptor on human respiratory epithelial cells that facilitates SARS-CoV-2 cell entry through its spike glycoprotein.
This is achieved as follows. First, we train a multimodal ligand–protein binding affinity model on predicting affinities of bioactive compounds to target proteins and couple this model with pharmacological toxicity predictors. Exploiting this multi-objective as a reward function of a conditional molecular generator that consists of two variational autoencoders (VAE), our framework steers the generation toward regions of the chemical space with high-reward molecules. Specifically, we explore a challenging setting of generating ligands against unseen protein targets by performing a leave-one-out-cross-validation on 41 SARS-CoV-2-related target proteins. Using deep reinforcement learning, it is demonstrated that in 35 out of 41 cases, the generation is biased towards sampling binding ligands, with an average increase of 83% comparing to an unbiased VAE. The generated molecules exhibit favorable properties in terms of target binding affinity, selectivity and drug-likeness. We use molecular retrosynthetic models to provide a synthetic accessibility assessment of the best generated hit molecules. Finally, with this end-to-end framework, we synthesize 3-Bromobenzylamine, a potential inhibitor of the host ACE2 protein, solely based on the recommendations of a molecular retrosynthesis model and a synthesis protocol prediction model. We hope that our framework can contribute towards swift discovery of de novo molecules with desired pharmacological properties.
Collapse
|
4
|
Abstract
One of the grand challenges in contemporary chemical biology is the generation of a probe for every member of the human proteome. Probe selection and optimization strategies typically rely on experimental bioactivity data to determine the potency and selectivity of candidate molecules. However, this approach is profoundly limited by the sparsity of the known data, the annotation bias often found in the literature, and the cost of physical screening. Recent advancements in predictive pharmacology, such as the application of multitask and transfer learning, as well as the use of biologically motivated, structure-agnostic features to characterize molecules, should serve to mitigate these issues. Computational modeling likely offers the only cost-effective approach to substantially increasing the bioactivity annotation density both on the local and global scale and thus, we argue, will need to make a substantial contribution if the ambitious goals of probing the human proteome are to be realized in the foreseeable future.
Collapse
Affiliation(s)
- Tim James
- Evotec (U.K.) Ltd. 114 Innovation Drive, Milton Park, Abingdon, Oxfordshire OX14 4RZ, U.K
| | - Adam Sardar
- Evotec (U.K.) Ltd. 114 Innovation Drive, Milton Park, Abingdon, Oxfordshire OX14 4RZ, U.K
| | - Andrew Anighoro
- Evotec (U.K.) Ltd. 114 Innovation Drive, Milton Park, Abingdon, Oxfordshire OX14 4RZ, U.K
| |
Collapse
|
5
|
Ferro CJ, Solkhon F, Jalal Z, Al‐Hamid AM, Jones AM. Relevance of physicochemical properties and functional pharmacology data to predict the clinical safety profile of direct oral anticoagulants. Pharmacol Res Perspect 2020; 8:e00603. [PMID: 32500654 PMCID: PMC7272392 DOI: 10.1002/prp2.603] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2020] [Accepted: 05/09/2020] [Indexed: 12/16/2022] Open
Abstract
Direct oral anticoagulants (DOACs) have rapidly become the drug class of choice for anticoagulation therapy in secondary care. It is known that gastrointestinal hemorrhage are potential side effects of the DOAC drug class. In this study we have investigated the relevance of molecular structure and on/off-target pharmacology as a predictor of adverse drug reactions (ADRs) for the DOAC drug class. Use of the Reaxys MedChem module allowed for data mining of all possible reported off-target effects of the DOAC class members. For the first time, the MHRA Yellow card database in combination with prescribing rates in the United Kingdom (data for n = 30 566 936 DOAC Rx (up to 2017) and ADR data n = 22 275 (up to 2018)) were used for our data comparison of DOACs. From the underlying reported data, we were able to rank the DOACs in terms of the likely adverse events we would expect to observe. We identified potential risks of ADRs based on the DOACs pharmacology including the expected GI hemorrhage, but also the unexpected risk of stroke, pulmonary embolism and kidney injury. Statistically significant (P < .001) differences were found between all DOACs and their total number of ADRs. Although the risks are small, strong statistical correlation between observed pharmacology and national ADR data is observed in three out of the five areas of concern.
Collapse
Affiliation(s)
- Charles J. Ferro
- Queen Elizabeth HospitalUniversity Hospitals Birmingham NHS Foundation TrustBirminghamUK
| | - Fay Solkhon
- School of PharmacyUniversity of BirminghamBirminghamUK
| | - Zahraa Jalal
- School of PharmacyUniversity of BirminghamBirminghamUK
| | | | - Alan M. Jones
- School of PharmacyUniversity of BirminghamBirminghamUK
| |
Collapse
|
6
|
Data structures for computational compound promiscuity analysis and exemplary applications to inhibitors of the human kinome. J Comput Aided Mol Des 2019; 34:1-10. [DOI: 10.1007/s10822-019-00266-0] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2019] [Accepted: 11/26/2019] [Indexed: 02/05/2023]
|
7
|
Dantas RF, Evangelista TCS, Neves BJ, Senger MR, Andrade CH, Ferreira SB, Silva-Junior FP. Dealing with frequent hitters in drug discovery: a multidisciplinary view on the issue of filtering compounds on biological screenings. Expert Opin Drug Discov 2019; 14:1269-1282. [DOI: 10.1080/17460441.2019.1654453] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Affiliation(s)
- Rafael Ferreira Dantas
- LaBECFar – Laboratório de Bioquímica Experimental e Computacional de Fármacos, Instituto Oswaldo Cruz, Fundação Oswaldo Cruz, Rio de Janeiro, Brazil
| | - Tereza Cristina Santos Evangelista
- LaSOPB – Laboratório de Síntese Orgânica e Prospecção Biológica, Instituto de Química, Universidade Federal do Rio de Janeiro, Rio de Janeiro, Brazil
| | - Bruno Junior Neves
- LabChem – Laboratory of Cheminformatics, Centro Universitário de Anápolis, UniEVANGÉLICA, Anápolis, Brazil
| | - Mario Roberto Senger
- LaBECFar – Laboratório de Bioquímica Experimental e Computacional de Fármacos, Instituto Oswaldo Cruz, Fundação Oswaldo Cruz, Rio de Janeiro, Brazil
| | - Carolina Horta Andrade
- LabMol – Laboratory for Molecular Modeling and Drug Design, Faculdade de Farmácia, Universidade Federal de Goiás, Goiânia, Brazil
| | - Sabrina Baptista Ferreira
- LaSOPB – Laboratório de Síntese Orgânica e Prospecção Biológica, Instituto de Química, Universidade Federal do Rio de Janeiro, Rio de Janeiro, Brazil
| | - Floriano Paes Silva-Junior
- LaBECFar – Laboratório de Bioquímica Experimental e Computacional de Fármacos, Instituto Oswaldo Cruz, Fundação Oswaldo Cruz, Rio de Janeiro, Brazil
| |
Collapse
|
8
|
Systematic computational identification of promiscuity cliff pathways formed by inhibitors of the human kinome. J Comput Aided Mol Des 2019; 33:559-572. [DOI: 10.1007/s10822-019-00198-9] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2018] [Accepted: 03/12/2019] [Indexed: 11/26/2022]
|