1
|
Duo L, Liu Y, Ren J, Tang B, Hirst JD. Artificial intelligence for small molecule anticancer drug discovery. Expert Opin Drug Discov 2024; 19:933-948. [PMID: 39074493 DOI: 10.1080/17460441.2024.2367014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2024] [Accepted: 06/07/2024] [Indexed: 07/31/2024]
Abstract
INTRODUCTION The transition from conventional cytotoxic chemotherapy to targeted cancer therapy with small-molecule anticancer drugs has enhanced treatment outcomes. This approach, which now dominates cancer treatment, has its advantages. Despite the regulatory approval of several targeted molecules for clinical use, challenges such as low response rates and drug resistance still persist. Conventional drug discovery methods are costly and time-consuming, necessitating more efficient approaches. The rise of artificial intelligence (AI) and access to large-scale datasets have revolutionized the field of small-molecule cancer drug discovery. Machine learning (ML), particularly deep learning (DL) techniques, enables the rapid identification and development of novel anticancer agents by analyzing vast amounts of genomic, proteomic, and imaging data to uncover hidden patterns and relationships. AREA COVERED In this review, the authors explore the important landmarks in the history of AI-driven drug discovery. They also highlight various applications in small-molecule cancer drug discovery, outline the challenges faced, and provide insights for future research. EXPERT OPINION The advent of big data has allowed AI to penetrate and enable innovations in almost every stage of medicine discovery, transforming the landscape of oncology research through the development of state-of-the-art algorithms and models. Despite challenges in data quality, model interpretability, and technical limitations, advancements promise breakthroughs in personalized and precision oncology, revolutionizing future cancer management.
Collapse
Affiliation(s)
- Lihui Duo
- Faculty of Science and Engineering, University of Nottingham Ningbo China, Ningbo, China
| | - Yu Liu
- Faculty of Science and Engineering, University of Nottingham Ningbo China, Ningbo, China
| | - Jianfeng Ren
- Faculty of Science and Engineering, University of Nottingham Ningbo China, Ningbo, China
| | - Bencan Tang
- Faculty of Science and Engineering, University of Nottingham Ningbo China, Ningbo, China
| | - Jonathan D Hirst
- School of Chemistry, University of Nottingham University Park, Nottingham, UK
| |
Collapse
|
2
|
Jana A, Naga R, Saha S, Banerjee DR. 3D QSAR pharmacophore based lead identification of G9a lysine methyltransferase towards epigenetic therapeutics. J Biomol Struct Dyn 2023; 41:8635-8653. [PMID: 36264111 DOI: 10.1080/07391102.2022.2135600] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2022] [Accepted: 10/08/2022] [Indexed: 10/24/2022]
Abstract
The G9a, Lysine Methyltransferase that methylates the histone 3 lysine 9 (H3K9) of the nucleosome, is an excellent epigenetic target having no clinically passed inhibitor currently owing to adverse in vivo ADMET toxicities. In this work, we have carried out detailed computational investigations to find novel and safer lead against the target using advanced 3 D QSAR pharmacophore screening of databases containing more than 400000 entrees of natural compounds. The screening was conducted at different levels at increasing stringencies by employing pharmacophore mapping, druglikenesses and interaction profiles of the selected to identify potential hit compounds. The potential hits were further screened by advanced flexible docking, ADME and toxicity analysis to eight hit compounds. Based on the comparative analysis of the hits with the reference inhibitor, we identified one lead inhibitor against the G9a, having better binding efficacy and a safer ADMET profile than the reference inhibitor. Finally, the results were further verified using robust molecular dynamics simulation and MM-GBSA binding energy calculation. The natural compounds are generally considered benign due to their long human uses and this is the first attempt of in silico screening of a large natural compound library against G9a to our best knowledge. Therefore, the finding of this study may add value towards the development of epigenetic therapeutics against the G9a.Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Abhisek Jana
- Department of Chemistry, National Institute of technology Durgapur, Durgapur, India
| | - Rahul Naga
- Department of Biotechnology, National Institute of technology Durgapur, Durgapur, India
| | - Sougata Saha
- Department of Biotechnology, National Institute of technology Durgapur, Durgapur, India
| | - Deb Ranjan Banerjee
- Department of Chemistry, National Institute of technology Durgapur, Durgapur, India
| |
Collapse
|
3
|
Artificial intelligence in virtual screening: models versus experiments. Drug Discov Today 2022; 27:1913-1923. [PMID: 35597513 DOI: 10.1016/j.drudis.2022.05.013] [Citation(s) in RCA: 25] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2021] [Revised: 05/08/2022] [Accepted: 05/12/2022] [Indexed: 12/22/2022]
Abstract
A typical drug discovery project involves identifying active compounds with significant binding potential for selected disease-specific targets. Experimental high-throughput screening (HTS) is a traditional approach to drug discovery, but is expensive and time-consuming when dealing with huge chemical libraries with billions of compounds. The search space can be narrowed down with the use of reliable computational screening approaches. In this review, we focus on various machine-learning (ML) and deep-learning (DL)-based scoring functions developed for solving classification and ranking problems in drug discovery. We highlight studies in which ML and DL models were successfully deployed to identify lead compounds for which the experimental validations are available from bioassay studies.
Collapse
|
4
|
Carracedo-Reboredo P, Liñares-Blanco J, Rodríguez-Fernández N, Cedrón F, Novoa FJ, Carballal A, Maojo V, Pazos A, Fernandez-Lozano C. A review on machine learning approaches and trends in drug discovery. Comput Struct Biotechnol J 2021; 19:4538-4558. [PMID: 34471498 PMCID: PMC8387781 DOI: 10.1016/j.csbj.2021.08.011] [Citation(s) in RCA: 127] [Impact Index Per Article: 42.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2021] [Revised: 08/06/2021] [Accepted: 08/06/2021] [Indexed: 12/30/2022] Open
Abstract
Drug discovery aims at finding new compounds with specific chemical properties for the treatment of diseases. In the last years, the approach used in this search presents an important component in computer science with the skyrocketing of machine learning techniques due to its democratization. With the objectives set by the Precision Medicine initiative and the new challenges generated, it is necessary to establish robust, standard and reproducible computational methodologies to achieve the objectives set. Currently, predictive models based on Machine Learning have gained great importance in the step prior to preclinical studies. This stage manages to drastically reduce costs and research times in the discovery of new drugs. This review article focuses on how these new methodologies are being used in recent years of research. Analyzing the state of the art in this field will give us an idea of where cheminformatics will be developed in the short term, the limitations it presents and the positive results it has achieved. This review will focus mainly on the methods used to model the molecular data, as well as the biological problems addressed and the Machine Learning algorithms used for drug discovery in recent years.
Collapse
Key Words
- ADMET, Absorption, distribution, metabolism, elimination and toxicity
- ADR, Adverse Drug Reaction
- AI, Artificial Intelligence
- ANN, Artificial Neural Networks
- APFP, Atom Pairs 2d FingerPrint
- AUC, Area under the Curve
- BBB, Blood–Brain barrier
- CDK, Chemical Development Kit
- CNN, Convolutional Neural Networks
- CNS, Central Nervous System
- CPI, Compound-protein interaction
- CV, Cross Validation
- Cheminformatics
- DL, Deep Learning
- DNA, Deoxyribonucleic acid
- Deep Learning
- Drug Discovery
- ECFP, Extended Connectivity Fingerprints
- FDA, Food and Drug Administration
- FNN, Fully Connected Neural Networks
- FP, Fringerprints
- FS, Feature Selection
- GCN, Graph Convolutional Networks
- GEO, Gene Expression Omnibus
- GNN, Graph Neural Networks
- GO, Gene Ontology
- KEGG, Kyoto Encyclopedia of Genes and Genomes
- MACCS, Molecular ACCess System
- MCC, Matthews correlation coefficient
- MD, Molecular Descriptors
- MKL, Multiple Kernel Learning
- ML, Machine Learning
- Machine Learning
- Molecular Descriptors
- NB, Naive Bayes
- OOB, Out of Bag
- PCA, Principal Component Analyisis
- QSAR
- QSAR, Quantitative structure–activity relationship
- RF, Random Forest
- RNA, Ribonucleic Acid
- SMILES, simplified molecular-input line-entry system
- SVM, Support Vector Machines
- TCGA, The Cancer Genome Atlas
- WHO, World Health Organization
- t-SNE, t-Distributed Stochastic Neighbor Embedding
Collapse
Affiliation(s)
- Paula Carracedo-Reboredo
- Department of Computer Science and Information Technologies, Faculty of Computer Science, Universidade da Coruna, Campus Elviña s/n, A Coruña 15071, Spain
| | - Jose Liñares-Blanco
- Department of Computer Science and Information Technologies, Faculty of Computer Science, Universidade da Coruna, Campus Elviña s/n, A Coruña 15071, Spain
- CITIC-Research Center of Information and Communication Technologies, Universidade da Coruna, A Coruña 15071, Spain
| | - Nereida Rodríguez-Fernández
- CITIC-Research Center of Information and Communication Technologies, Universidade da Coruna, A Coruña 15071, Spain
- Department of Computer Science and Information Technologies, Faculty of Communication Science, Universidade da Coruna, Campus Elviña s/n, A Coruña 15071, Spain
| | - Francisco Cedrón
- Department of Computer Science and Information Technologies, Faculty of Computer Science, Universidade da Coruna, Campus Elviña s/n, A Coruña 15071, Spain
| | - Francisco J. Novoa
- Department of Computer Science and Information Technologies, Faculty of Computer Science, Universidade da Coruna, Campus Elviña s/n, A Coruña 15071, Spain
| | - Adrian Carballal
- Department of Computer Science and Information Technologies, Faculty of Computer Science, Universidade da Coruna, Campus Elviña s/n, A Coruña 15071, Spain
- CITIC-Research Center of Information and Communication Technologies, Universidade da Coruna, A Coruña 15071, Spain
- Department of Computer Science and Information Technologies, Faculty of Communication Science, Universidade da Coruna, Campus Elviña s/n, A Coruña 15071, Spain
| | - Victor Maojo
- Biomedical Informatics Group, Artificial Intelligence Department, Polytechnic University of Madrid, Calle de los Ciruelos, Boadilla del Monte, Madrid 28660, Spain
| | - Alejandro Pazos
- Department of Computer Science and Information Technologies, Faculty of Computer Science, Universidade da Coruna, Campus Elviña s/n, A Coruña 15071, Spain
- CITIC-Research Center of Information and Communication Technologies, Universidade da Coruna, A Coruña 15071, Spain
- Grupo de Redes de Neuronas Artificiales y Sistemas Adaptativos. Imagen Médica y Diagnóstico Radiológico (RNASA-IMEDIR), Complexo Hospitalario Universitario de A Coruña (CHUAC), SERGAS, Universidade da Coruña, Instituto de Investigación Biomédica de A Coruña (INIBIC), A Coruña, Spain
| | - Carlos Fernandez-Lozano
- Department of Computer Science and Information Technologies, Faculty of Computer Science, Universidade da Coruna, Campus Elviña s/n, A Coruña 15071, Spain
- CITIC-Research Center of Information and Communication Technologies, Universidade da Coruna, A Coruña 15071, Spain
- Grupo de Redes de Neuronas Artificiales y Sistemas Adaptativos. Imagen Médica y Diagnóstico Radiológico (RNASA-IMEDIR), Complexo Hospitalario Universitario de A Coruña (CHUAC), SERGAS, Universidade da Coruña, Instituto de Investigación Biomédica de A Coruña (INIBIC), A Coruña, Spain
| |
Collapse
|
5
|
Bitencourt-Ferreira G, Duarte da Silva A, Filgueira de Azevedo W. Application of Machine Learning Techniques to Predict Binding Affinity for Drug Targets: A Study of Cyclin-Dependent Kinase 2. Curr Med Chem 2021; 28:253-265. [PMID: 31729287 DOI: 10.2174/2213275912666191102162959] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2019] [Revised: 08/22/2019] [Accepted: 09/24/2019] [Indexed: 11/22/2022]
Abstract
BACKGROUND The elucidation of the structure of cyclin-dependent kinase 2 (CDK2) made it possible to develop targeted scoring functions for virtual screening aimed to identify new inhibitors for this enzyme. CDK2 is a protein target for the development of drugs intended to modulate cellcycle progression and control. Such drugs have potential anticancer activities. OBJECTIVE Our goal here is to review recent applications of machine learning methods to predict ligand- binding affinity for protein targets. To assess the predictive performance of classical scoring functions and targeted scoring functions, we focused our analysis on CDK2 structures. METHODS We have experimental structural data for hundreds of binary complexes of CDK2 with different ligands, many of them with inhibition constant information. We investigate here computational methods to calculate the binding affinity of CDK2 through classical scoring functions and machine- learning models. RESULTS Analysis of the predictive performance of classical scoring functions available in docking programs such as Molegro Virtual Docker, AutoDock4, and Autodock Vina indicated that these methods failed to predict binding affinity with significant correlation with experimental data. Targeted scoring functions developed through supervised machine learning techniques showed a significant correlation with experimental data. CONCLUSION Here, we described the application of supervised machine learning techniques to generate a scoring function to predict binding affinity. Machine learning models showed superior predictive performance when compared with classical scoring functions. Analysis of the computational models obtained through machine learning could capture essential structural features responsible for binding affinity against CDK2.
Collapse
Affiliation(s)
- Gabriela Bitencourt-Ferreira
- Laboratory of Computational Systems Biology. Pontifical Catholic University of Rio Grande do Sul (PUCRS). Av. Ipiranga, 6681 Porto Alegre/RS 90619-900 , Brazil
| | - Amauri Duarte da Silva
- Specialization Program in Bioinformatics. Pontifical Catholic University of Rio Grande do Sul (PUCRS). Av. Ipiranga, 6681 Porto Alegre/RS 90619-900, Brazil
| | - Walter Filgueira de Azevedo
- Laboratory of Computational Systems Biology. Pontifical Catholic University of Rio Grande do Sul (PUCRS). Av. Ipiranga, 6681 Porto Alegre/RS 90619-900 , Brazil
| |
Collapse
|
6
|
Zhu J, Wu Y, Wang M, Li K, Xu L, Chen Y, Cai Y, Jin J. Integrating Machine Learning-Based Virtual Screening With Multiple Protein Structures and Bio-Assay Evaluation for Discovery of Novel GSK3β Inhibitors. Front Pharmacol 2020; 11:566058. [PMID: 33041806 PMCID: PMC7517831 DOI: 10.3389/fphar.2020.566058] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2020] [Accepted: 08/14/2020] [Indexed: 02/04/2023] Open
Abstract
Glycogen synthase kinase-3β (GSK3β) is associated with various key biological processes, and it has been considered as a critical therapeutic target for the treatment of many diseases. However, it is a big challenge to develop ATP-competition GSK3β inhibitors because of the high sequence homology with other kinases. In this work, a novel parallel virtual screening strategy based on multiple GSK3β protein structures, integrating molecular docking, complex-based pharmacophore, and naive Bayesian classification, was developed to screen a large chemical database, the 50 compounds with top-scores then underwent a luminescent kinase assay, which led to the discovery of two GSK3β inhibitor hits. The high screening enrichment rate indicates the reliability and practicability of the integrated protocol. Finally, molecular docking and molecular dynamics simulation were employed to investigate the binding modes of the GSK3β inhibitors, and some “hot residues” critical to GSK3β affinity were highlighted. The present study may provide some valuable guidance for the development of novel GSK3β inhibitors.
Collapse
Affiliation(s)
- Jingyu Zhu
- School of Pharmaceutical Sciences, Jiangnan University, Wuxi, China
| | - Yuanqing Wu
- School of Pharmaceutical Sciences, Jiangnan University, Wuxi, China
| | - Man Wang
- Jiangsu Institute of Hematology, National Clinical Research Center for Hematologic Diseases, NHC Key Laboratory of Thrombosis and Hemostasis, The First Affiliated Hospital of Soochow University, Suzhou, China
| | - Kan Li
- School of Pharmaceutical Sciences, Jiangnan University, Wuxi, China
| | - Lei Xu
- School of Electrical and Information Engineering, Institute of Bioinformatics and Medical Engineering, Jiangsu University of Technology, Changzhou, China
| | - Yun Chen
- School of Pharmaceutical Sciences, Jiangnan University, Wuxi, China
| | - Yanfei Cai
- School of Pharmaceutical Sciences, Jiangnan University, Wuxi, China
| | - Jian Jin
- School of Pharmaceutical Sciences, Jiangnan University, Wuxi, China
| |
Collapse
|
7
|
Zhou L, Xu X, Liu H, Hu X, Zhang W, Ye M, Zhu X. Prognosis Analysis of Histone Deacetylases mRNA Expression in Ovarian Cancer Patients. J Cancer 2018; 9:4547-4555. [PMID: 30519361 PMCID: PMC6277648 DOI: 10.7150/jca.26780] [Citation(s) in RCA: 26] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2018] [Accepted: 09/09/2018] [Indexed: 12/14/2022] Open
Abstract
Histone deacetylases modulate the dynamic balance of histone acetylation and deacetylation in cells, which participate in epigenetic regulations. Accumulated evidence has demonstrated that histone deacetylases are associated with angiogenesis, cell proliferation and survival in a variety of human cancers. However, the expression and distinct prognostic value of histone deacetylases in ovarian cancer have not been well elucidated. In the present study, we collected the overall survival (OS), progress free survival (PFS), and histone deacetylases (HDAC1-11) mRNA expression in ovarian cancer from the Kaplan-Meier plotter online database. We investigated the relationship between histone deacetylases mRNA level and the clinicopathological parameters of the ovarian cancer patients, such as histology subtypes, clinical stages, grades and TP53 mutation. Our analysis data showed that over-expression of HDAC1, HDAC2, HDAC4, HDAC5 and HDAC11 were correlated to poor overall survival and unfavorable progress free survival in all ovarian cancer patients. Notably, the higher level of HDAC11 was associated with the worse OS and PFS for serous/ stage III+IV/ grade III/ TP53 mutation ovarian cancer patients. In conclusion, HDACs may play a crucial role in the prognosis of ovarian cancer, but it is worth noting that HDAC11 may be a biomarker for poor prognosis in ovarian cancer patients.
Collapse
Affiliation(s)
- Lulu Zhou
- Department of Obstetrics and Gynecology, the Second Affiliated Hospital of Wenzhou Medical University, Wenzhou 325027, China
| | - Xiaohui Xu
- Department of Obstetrics and Gynecology, the Second Affiliated Hospital of Wenzhou Medical University, Wenzhou 325027, China
| | - Hailing Liu
- Department of Obstetrics and Gynecology, the Second Affiliated Hospital of Wenzhou Medical University, Wenzhou 325027, China
| | - Xiaoli Hu
- Department of Obstetrics and Gynecology, the Second Affiliated Hospital of Wenzhou Medical University, Wenzhou 325027, China
| | - Wenwen Zhang
- Department of Obstetrics and Gynecology, the Second Affiliated Hospital of Wenzhou Medical University, Wenzhou 325027, China
| | - Miaomiao Ye
- Department of Obstetrics and Gynecology, the Second Affiliated Hospital of Wenzhou Medical University, Wenzhou 325027, China
| | - Xueqiong Zhu
- Department of Obstetrics and Gynecology, the Second Affiliated Hospital of Wenzhou Medical University, Wenzhou 325027, China
| |
Collapse
|
8
|
de Ávila MB, Xavier MM, Pintro VO, de Azevedo WF. Supervised machine learning techniques to predict binding affinity. A study for cyclin-dependent kinase 2. Biochem Biophys Res Commun 2017; 494:305-310. [DOI: 10.1016/j.bbrc.2017.10.035] [Citation(s) in RCA: 46] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2017] [Accepted: 10/06/2017] [Indexed: 02/01/2023]
|