1
|
Miljković F, Bajorath J. Kinase Drug Discovery: Impact of Open Science and Artificial Intelligence. Mol Pharm 2024; 21:4849-4859. [PMID: 39240193 DOI: 10.1021/acs.molpharmaceut.4c00659] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/07/2024]
Abstract
Given their central role in signal transduction, protein kinases (PKs) were first implicated in cancer development, caused by aberrant intracellular signaling events. Since then, PKs have become major targets in different therapeutic areas. The preferred approach to therapeutic intervention of PK-dependent diseases is the use of small molecules to inhibit their catalytic phosphate group transfer activity. PK inhibitors (PKIs) are among the most intensely pursued drug candidates, with currently 80 approved compounds and several hundred in clinical trials. Following the elucidation of the human kinome and development of robust PK expression systems and high-throughput assays, large volumes of PK/PKI data have been produced in industrial and academic environments, more so than for many other pharmaceutical targets. In addition, hundreds of X-ray structures of PKs and their complexes with PKIs have been reported. Substantial amounts of PK/PKI data have been made publicly available in part as a result of open science initiatives. PK drug discovery is further supported through the incorporation of data science approaches, including the development of various specialized databases and online resources. Compound and activity data wealth compared to other targets has also made PKs a focal point for the application of artificial intelligence (AI) in pharmaceutical research. Herein, we discuss the interplay of open and data science in PK drug discovery and review exemplary studies that have substantially contributed to its development, including kinome profiling or the analysis of PKI promiscuity versus selectivity. We also take a close look at how AI approaches are beginning to impact PK drug discovery in light of their increasing data orientation.
Collapse
Affiliation(s)
- Filip Miljković
- Medicinal Chemistry, Research and Early Development, Cardiovascular, Renal and Metabolism (CVRM), BioPharmaceuticals R&D, AstraZeneca, Pepparedsleden 1, SE-43183 Gothenburg, Sweden
| | - Jürgen Bajorath
- Department of Life Science Informatics and Data Science, B-IT, Lamarr Institute for Machine Learning and Artificial Intelligence, LIMES Program Chemical Biology and Medicinal Chemistry, Rheinische Friedrich-Wilhelms-Universität, Friedrich-Hirzebruch-Allee 5/6, 53115 Bonn, Germany
| |
Collapse
|
2
|
Wu J, Chen Y, Wu J, Zhao D, Huang J, Lin M, Wang L. Large-scale comparison of machine learning methods for profiling prediction of kinase inhibitors. J Cheminform 2024; 16:13. [PMID: 38291477 PMCID: PMC10829268 DOI: 10.1186/s13321-023-00799-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2023] [Accepted: 12/22/2023] [Indexed: 02/01/2024] Open
Abstract
Conventional machine learning (ML) and deep learning (DL) play a key role in the selectivity prediction of kinase inhibitors. A number of models based on available datasets can be used to predict the kinase profile of compounds, but there is still controversy about the advantages and disadvantages of ML and DL for such tasks. In this study, we constructed a comprehensive benchmark dataset of kinase inhibitors, involving in 141,086 unique compounds and 216,823 well-defined bioassay data points for 354 kinases. We then systematically compared the performance of 12 ML and DL methods on the kinase profiling prediction task. Extensive experimental results reveal that (1) Descriptor-based ML models generally slightly outperform fingerprint-based ML models in terms of predictive performance. RF as an ensemble learning approach displays the overall best predictive performance. (2) Single-task graph-based DL models are generally inferior to conventional descriptor- and fingerprint-based ML models, however, the corresponding multi-task models generally improves the average accuracy of kinase profile prediction. For example, the multi-task FP-GNN model outperforms the conventional descriptor- and fingerprint-based ML models with an average AUC of 0.807. (3) Fusion models based on voting and stacking methods can further improve the performance of the kinase profiling prediction task, specifically, RF::AtomPairs + FP2 + RDKitDes fusion model performs best with the highest average AUC value of 0.825 on the test sets. These findings provide useful information for guiding choices of the ML and DL methods for the kinase profiling prediction tasks. Finally, an online platform called KIPP ( https://kipp.idruglab.cn ) and python software are developed based on the best models to support the kinase profiling prediction, as well as various kinase inhibitor identification tasks including virtual screening, compound repositioning and target fishing.
Collapse
Affiliation(s)
- Jiangxia Wu
- Guangdong Provincial Key Laboratory of Fermentation and Enzyme Engineering, Joint International Research Laboratory of Synthetic Biology and Medicine, Guangdong Provincial Engineering and Technology Research Center of Biopharmaceuticals, School of Biology and Biological Engineering, South China University of Technology, Guangzhou, 510006, China
| | - Yihao Chen
- Guangdong Provincial Key Laboratory of Fermentation and Enzyme Engineering, Joint International Research Laboratory of Synthetic Biology and Medicine, Guangdong Provincial Engineering and Technology Research Center of Biopharmaceuticals, School of Biology and Biological Engineering, South China University of Technology, Guangzhou, 510006, China
| | - Jingxing Wu
- Guangdong Provincial Key Laboratory of Fermentation and Enzyme Engineering, Joint International Research Laboratory of Synthetic Biology and Medicine, Guangdong Provincial Engineering and Technology Research Center of Biopharmaceuticals, School of Biology and Biological Engineering, South China University of Technology, Guangzhou, 510006, China
| | - Duancheng Zhao
- Guangdong Provincial Key Laboratory of Fermentation and Enzyme Engineering, Joint International Research Laboratory of Synthetic Biology and Medicine, Guangdong Provincial Engineering and Technology Research Center of Biopharmaceuticals, School of Biology and Biological Engineering, South China University of Technology, Guangzhou, 510006, China
| | - Jindi Huang
- Guangdong Provincial Key Laboratory of Fermentation and Enzyme Engineering, Joint International Research Laboratory of Synthetic Biology and Medicine, Guangdong Provincial Engineering and Technology Research Center of Biopharmaceuticals, School of Biology and Biological Engineering, South China University of Technology, Guangzhou, 510006, China
| | - MuJie Lin
- Guangdong Provincial Key Laboratory of Fermentation and Enzyme Engineering, Joint International Research Laboratory of Synthetic Biology and Medicine, Guangdong Provincial Engineering and Technology Research Center of Biopharmaceuticals, School of Biology and Biological Engineering, South China University of Technology, Guangzhou, 510006, China
| | - Ling Wang
- Guangdong Provincial Key Laboratory of Fermentation and Enzyme Engineering, Joint International Research Laboratory of Synthetic Biology and Medicine, Guangdong Provincial Engineering and Technology Research Center of Biopharmaceuticals, School of Biology and Biological Engineering, South China University of Technology, Guangzhou, 510006, China.
| |
Collapse
|
3
|
Zhou Y, Al‐Jarf R, Alavi A, Nguyen TB, Rodrigues CHM, Pires DEV, Ascher DB. kinCSM: Using graph-based signatures to predict small molecule CDK2 inhibitors. Protein Sci 2022; 31:e4453. [PMID: 36305769 PMCID: PMC9597374 DOI: 10.1002/pro.4453] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2022] [Revised: 09/14/2022] [Accepted: 09/15/2022] [Indexed: 11/20/2022]
Abstract
Protein phosphorylation acts as an essential on/off switch in many cellular signaling pathways. This has led to ongoing interest in targeting kinases for therapeutic intervention. Computer-aided drug discovery has been proven a useful and cost-effective approach for facilitating prioritization and enrichment of screening libraries, but limited effort has been devoted providing insights on what makes a potent kinase inhibitor. To fill this gap, here we developed kinCSM, an integrative computational tool capable of accurately identifying potent cyclin-dependent kinase 2 (CDK2) inhibitors, quantitatively predicting CDK2 ligand-kinase inhibition constants (pKi ) and classifying different types of inhibitors based on their favorable binding modes. kinCSM predictive models were built using supervised learning and leveraged the concept of graph-based signatures to capture both physicochemical properties and geometry properties of small molecules. CDK2 inhibitors were accurately identified with Matthew's Correlation Coefficients (MCC) of up to 0.74, and inhibition constants predicted with Pearson's correlation of up to 0.76, both with consistent performances of 0.66 and 0.68 on a nonredundant blind test, respectively. kinCSM was also able to identify the potential type of inhibition for a given molecule, achieving MCC of up to 0.80 on cross-validation and 0.73 on the blind test. Analyzing the molecular composition of revealed enriched chemical fragments in CDK2 inhibitors and different types of inhibitors, which provides insights into the molecular mechanisms behind ligand-kinase interactions. kinCSM will be an invaluable tool to guide future kinase drug discovery. To aid the fast and accurate screening of CDK2 inhibitors, kinCSM is freely available at https://biosig.lab.uq.edu.au/kin_csm/.
Collapse
Affiliation(s)
- Yunzhuo Zhou
- School of Chemistry and Molecular BiosciencesUniversity of QueenslandBrisbaneQueenslandAustralia
- Structural Biology and Bioinformatics, Department of BiochemistryUniversity of MelbourneMelbourneVictoriaAustralia
- Systems and Computational Biology, Bio21 InstituteUniversity of MelbourneMelbourneVictoriaAustralia
- Computational Biology and Clinical InformaticsBaker Heart and Diabetes InstituteMelbourneVictoriaAustralia
| | - Raghad Al‐Jarf
- Structural Biology and Bioinformatics, Department of BiochemistryUniversity of MelbourneMelbourneVictoriaAustralia
- Systems and Computational Biology, Bio21 InstituteUniversity of MelbourneMelbourneVictoriaAustralia
- Computational Biology and Clinical InformaticsBaker Heart and Diabetes InstituteMelbourneVictoriaAustralia
| | - Azadeh Alavi
- Structural Biology and Bioinformatics, Department of BiochemistryUniversity of MelbourneMelbourneVictoriaAustralia
- Systems and Computational Biology, Bio21 InstituteUniversity of MelbourneMelbourneVictoriaAustralia
- Computational Biology and Clinical InformaticsBaker Heart and Diabetes InstituteMelbourneVictoriaAustralia
| | - Thanh Binh Nguyen
- School of Chemistry and Molecular BiosciencesUniversity of QueenslandBrisbaneQueenslandAustralia
- Structural Biology and Bioinformatics, Department of BiochemistryUniversity of MelbourneMelbourneVictoriaAustralia
- Systems and Computational Biology, Bio21 InstituteUniversity of MelbourneMelbourneVictoriaAustralia
- Computational Biology and Clinical InformaticsBaker Heart and Diabetes InstituteMelbourneVictoriaAustralia
| | - Carlos H. M. Rodrigues
- School of Chemistry and Molecular BiosciencesUniversity of QueenslandBrisbaneQueenslandAustralia
- Structural Biology and Bioinformatics, Department of BiochemistryUniversity of MelbourneMelbourneVictoriaAustralia
- Systems and Computational Biology, Bio21 InstituteUniversity of MelbourneMelbourneVictoriaAustralia
- Computational Biology and Clinical InformaticsBaker Heart and Diabetes InstituteMelbourneVictoriaAustralia
| | - Douglas E. V. Pires
- School of Chemistry and Molecular BiosciencesUniversity of QueenslandBrisbaneQueenslandAustralia
- Structural Biology and Bioinformatics, Department of BiochemistryUniversity of MelbourneMelbourneVictoriaAustralia
- Systems and Computational Biology, Bio21 InstituteUniversity of MelbourneMelbourneVictoriaAustralia
- Computational Biology and Clinical InformaticsBaker Heart and Diabetes InstituteMelbourneVictoriaAustralia
- School of Computing and Information SystemsUniversity of MelbourneMelbourneVictoriaAustralia
| | - David B. Ascher
- School of Chemistry and Molecular BiosciencesUniversity of QueenslandBrisbaneQueenslandAustralia
- Structural Biology and Bioinformatics, Department of BiochemistryUniversity of MelbourneMelbourneVictoriaAustralia
- Systems and Computational Biology, Bio21 InstituteUniversity of MelbourneMelbourneVictoriaAustralia
- Computational Biology and Clinical InformaticsBaker Heart and Diabetes InstituteMelbourneVictoriaAustralia
| |
Collapse
|
4
|
Pereira Moreira B, Weber MHW, Haeberlein S, Mokosch AS, Spengler B, Grevelding CG, Falcone FH. Drug Repurposing and De Novo Drug Discovery of Protein Kinase Inhibitors as New Drugs against Schistosomiasis. Molecules 2022; 27:molecules27041414. [PMID: 35209202 PMCID: PMC8879451 DOI: 10.3390/molecules27041414] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2021] [Revised: 02/09/2022] [Accepted: 02/16/2022] [Indexed: 02/01/2023] Open
Abstract
Schistosomiasis is a neglected tropical disease affecting more than 200 million people worldwide. Chemotherapy relies on one single drug, praziquantel, which is safe but ineffective at killing larval stages of this parasite. Furthermore, concerns have been expressed about the rise in resistance against this drug. In the absence of an antischistosomal vaccine, it is, therefore, necessary to develop new drugs against the different species of schistosomes. Protein kinases are important molecules involved in key cellular processes such as signaling, growth, and differentiation. The kinome of schistosomes has been studied and the suitability of schistosomal protein kinases as targets demonstrated by RNA interference studies. Although protein kinase inhibitors are mostly used in cancer therapy, e.g., for the treatment of chronic myeloid leukemia or melanoma, they are now being increasingly explored for the treatment of non-oncological conditions, including schistosomiasis. Here, we discuss the various approaches including screening of natural and synthetic compounds, de novo drug development, and drug repurposing in the context of the search for protein kinase inhibitors against schistosomiasis. We discuss the status quo of the development of kinase inhibitors against schistosomal serine/threonine kinases such as polo-like kinases (PLKs) and mitogen-activated protein kinases (MAP kinases), as well as protein tyrosine kinases (PTKs).
Collapse
Affiliation(s)
- Bernardo Pereira Moreira
- Biomedical Research Center Seltersberg (BFS), Institute of Parasitology, Justus Liebig University Giessen, 35392 Giessen, Germany; (B.P.M.); (M.H.W.W.); (S.H.); (C.G.G.)
| | - Michael H. W. Weber
- Biomedical Research Center Seltersberg (BFS), Institute of Parasitology, Justus Liebig University Giessen, 35392 Giessen, Germany; (B.P.M.); (M.H.W.W.); (S.H.); (C.G.G.)
| | - Simone Haeberlein
- Biomedical Research Center Seltersberg (BFS), Institute of Parasitology, Justus Liebig University Giessen, 35392 Giessen, Germany; (B.P.M.); (M.H.W.W.); (S.H.); (C.G.G.)
| | - Annika S. Mokosch
- Institute of Inorganic and Analytical Chemistry, Justus Liebig University Giessen, 35392 Giessen, Germany; (A.S.M.); (B.S.)
| | - Bernhard Spengler
- Institute of Inorganic and Analytical Chemistry, Justus Liebig University Giessen, 35392 Giessen, Germany; (A.S.M.); (B.S.)
| | - Christoph G. Grevelding
- Biomedical Research Center Seltersberg (BFS), Institute of Parasitology, Justus Liebig University Giessen, 35392 Giessen, Germany; (B.P.M.); (M.H.W.W.); (S.H.); (C.G.G.)
| | - Franco H. Falcone
- Biomedical Research Center Seltersberg (BFS), Institute of Parasitology, Justus Liebig University Giessen, 35392 Giessen, Germany; (B.P.M.); (M.H.W.W.); (S.H.); (C.G.G.)
- Correspondence:
| |
Collapse
|
5
|
Mi X, Shukla D. Predicting the Activities of Drug Excipients on Biological Targets using One-Shot Learning. J Phys Chem B 2022; 126:1492-1503. [PMID: 35142529 DOI: 10.1021/acs.jpcb.1c10574] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Excipients are major components of drugs and are used to improve drug attributes such as stability and appearance. Excipients approved by the U.S. Food and Drug Administration (FDA) are regarded as safe for humans in allowed concentrations, but their potential interactions with drug targets have not been investigated systematically, which might influence a drug's efficacy. Deep learning models have been used for the identification of ligands that could bind to the drug targets. However, due to the limited available data, it is challenging to reliably estimate the likelihood of a ligand-protein interaction. One-shot learning techniques provide a potential approach to address this low data problem as these techniques require only one or a few examples to classify the new data. In this study, we apply one-shot learning models to data sets that include ligands binding to G-protein-coupled receptors (GPCRs) and kinases. The predicted results suggest that one-shot learning could be used for predicting ligand-protein interactions, and the models attain better performance when protein targets contain conserved binding pockets. The trained models are also used to predict interactions between excipients and drug targets, which provides a potential efficient strategy to explore the activities of drug excipients. We find that a large number of drug excipients could interact with biological targets and influence their function. The results demonstrate how one-shot learning can be used to make accurate predictions for excipient-protein interactions, and these methods could be used for selecting excipients with limited drug-protein interactions.
Collapse
Affiliation(s)
- Xuenan Mi
- Center for Biophysics and Quantitative Biology, University of Illinois Urbana-Champaign, Urbana, Illinois 61801, United States
| | - Diwakar Shukla
- Center for Biophysics and Quantitative Biology, University of Illinois Urbana-Champaign, Urbana, Illinois 61801, United States.,Department of Chemical and Biomolecular Engineering, University of Illinois Urbana-Champaign, Urbana, Illinois 61801, United States.,Beckman Institute for Advanced Science and Technology, University of Illinois Urbana-Champaign, Urbana, Illinois 61801, United States.,Cancer Center at Illinois, University of Illinois Urbana-Champaign, Urbana, Illinois 61801, United States.,Center for Digital Agriculture, University of Illinois Urbana-Champaign, Urbana, Illinois 61801, United States.,Department of Plant Biology, University of Illinois Urbana-Champaign, Urbana, Illinois 61801, United States.,National Center for Supercomputing Applications, University of Illinois Urbana-Champaign, Urbana, Illinois 61801, United States
| |
Collapse
|