1
|
Diéguez-Santana K, Casañola-Martin GM, Torres R, Rasulev B, Green JR, González-Díaz H. Machine Learning Study of Metabolic Networks vs ChEMBL Data of Antibacterial Compounds. Mol Pharm 2022; 19:2151-2163. [PMID: 35671399 PMCID: PMC9986951 DOI: 10.1021/acs.molpharmaceut.2c00029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Antibacterial drugs (AD) change the metabolic status of bacteria, contributing to bacterial death. However, antibiotic resistance and the emergence of multidrug-resistant bacteria increase interest in understanding metabolic network (MN) mutations and the interaction of AD vs MN. In this study, we employed the IFPTML = Information Fusion (IF) + Perturbation Theory (PT) + Machine Learning (ML) algorithm on a huge dataset from the ChEMBL database, which contains >155,000 AD assays vs >40 MNs of multiple bacteria species. We built a linear discriminant analysis (LDA) and 17 ML models centered on the linear index and based on atoms to predict antibacterial compounds. The IFPTML-LDA model presented the following results for the training subset: specificity (Sp) = 76% out of 70,000 cases, sensitivity (Sn) = 70%, and Accuracy (Acc) = 73%. The same model also presented the following results for the validation subsets: Sp = 76%, Sn = 70%, and Acc = 73.1%. Among the IFPTML nonlinear models, the k nearest neighbors (KNN) showed the best results with Sn = 99.2%, Sp = 95.5%, Acc = 97.4%, and Area Under Receiver Operating Characteristic (AUROC) = 0.998 in training sets. In the validation series, the Random Forest had the best results: Sn = 93.96% and Sp = 87.02% (AUROC = 0.945). The IFPTML linear and nonlinear models regarding the ADs vs MNs have good statistical parameters, and they could contribute toward finding new metabolic mutations in antibiotic resistance and reducing time/costs in antibacterial drug research.
Collapse
Affiliation(s)
- Karel Diéguez-Santana
- Department of Organic and Inorganic Chemistry, University of Basque Country UPV/EHU, 48940 Leioa, Spain.,Universidad Regional Amazónica IKIAM, Tena, Napo 150150, Ecuador
| | - Gerardo M Casañola-Martin
- Department of Coatings and Polymeric Materials, North Dakota State University, Fargo, North Dakota 58102, United States.,Department of Systems and Computer Engineering, Carleton University, K1S5B6 Ottawa, Ontario, Canada
| | - Roldan Torres
- Universidad Regional Amazónica IKIAM, Tena, Napo 150150, Ecuador
| | - Bakhtiyor Rasulev
- Department of Coatings and Polymeric Materials, North Dakota State University, Fargo, North Dakota 58102, United States
| | - James R Green
- Department of Systems and Computer Engineering, Carleton University, K1S5B6 Ottawa, Ontario, Canada
| | - Humbert González-Díaz
- Department of Organic and Inorganic Chemistry, University of Basque Country UPV/EHU, 48940 Leioa, Spain.,BIOFISIKA, Basque Center for Biophysics CSIC-UPVEH, 48940 Leioa, Spain.,IKERBASQUE, Basque Foundation for Science, 48011 Bilbao, Biscay, Spain
| |
Collapse
|
2
|
Halder AK, Moura AS, Cordeiro MNDS. Moving Average-Based Multitasking In Silico Classification Modeling: Where Do We Stand and What Is Next? Int J Mol Sci 2022; 23:ijms23094937. [PMID: 35563327 PMCID: PMC9099502 DOI: 10.3390/ijms23094937] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2022] [Revised: 04/24/2022] [Accepted: 04/28/2022] [Indexed: 01/27/2023] Open
Abstract
Conventional in silico modeling is often viewed as 'one-target' or 'single-task' computer-aided modeling since it mainly relies on forecasting an endpoint of interest from similar input data. Multitasking or multitarget in silico modeling, in contrast, embraces a set of computational techniques that efficiently integrate multiple types of input data for setting up unique in silico models able to predict the outcome(s) relating to various experimental and/or theoretical conditions. The latter, specifically, based upon the Box-Jenkins moving average approach, has been applied in the last decade to several research fields including drug and materials design, environmental sciences, and nanotechnology. The present review discusses the current status of multitasking computer-aided modeling efforts, meanwhile describing both the existing challenges and future opportunities of its underlying techniques. Some important applications are also discussed to exemplify the ability of multitasking modeling in deriving holistic and reliable in silico classification-based models as well as in designing new chemical entities, either through fragment-based design or virtual screening. Focus will also be given to some software recently developed to automate and accelerate such types of modeling. Overall, this review may serve as a guideline for researchers to grasp the scope of multitasking computer-aided modeling as a promising in silico tool.
Collapse
Affiliation(s)
- Amit Kumar Halder
- LAQV@REQUIMTE, Faculty of Sciences, University of Porto, 4169-007 Porto, Portugal; (A.K.H.); (A.S.M.)
- Dr. B. C. Roy College of Pharmacy and Allied Health Sciences, Dr. Meghnad Saha Sarani, Bidhannagar, Durgapur 713212, West Bengal, India
| | - Ana S. Moura
- LAQV@REQUIMTE, Faculty of Sciences, University of Porto, 4169-007 Porto, Portugal; (A.K.H.); (A.S.M.)
| | - Maria Natália D. S. Cordeiro
- LAQV@REQUIMTE, Faculty of Sciences, University of Porto, 4169-007 Porto, Portugal; (A.K.H.); (A.S.M.)
- Correspondence: ; Tel.: +35-12-2040-2502
| |
Collapse
|
3
|
Multi-Target In Silico Prediction of Inhibitors for Mitogen-Activated Protein Kinase-Interacting Kinases. Biomolecules 2021; 11:biom11111670. [PMID: 34827668 PMCID: PMC8615736 DOI: 10.3390/biom11111670] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2021] [Revised: 11/05/2021] [Accepted: 11/08/2021] [Indexed: 11/26/2022] Open
Abstract
The inhibitors of two isoforms of mitogen-activated protein kinase-interacting kinases (i.e., MNK-1 and MNK-2) are implicated in the treatment of a number of diseases including cancer. This work reports, for the first time, a multi-target (or multi-tasking) in silico modeling approach (mt-QSAR) for probing the inhibitory potential of these isoforms against MNKs. Linear and non-linear mt-QSAR classification models were set up from a large dataset of 1892 chemicals tested under a variety of assay conditions, based on the Box–Jenkins moving average approach, along with a range of feature selection algorithms and machine learning tools, out of which the most predictive one (>90% overall accuracy) was used for mechanistic interpretation of the likely inhibition of MNK-1 and MNK-2. Considering that the latter model is suitable for virtual screening of chemical libraries—i.e., commercial, non-commercial and in-house sets, it was made publicly accessible as a ready-to-use FLASK-based application. Additionally, this work employed a focused kinase library for virtual screening using an mt-QSAR model. The virtual hits identified in this process were further filtered by using a similarity search, in silico prediction of drug-likeness, and ADME profiles as well as synthetic accessibility tools. Finally, molecular dynamic simulations were carried out to identify and select the most promising virtual hits. The information gathered from this work can supply important guidelines for the discovery of novel MNK-1/2 inhibitors as potential therapeutic agents.
Collapse
|
4
|
Abstract
Introduction: Artificial Intelligence (AI) has become a component of our everyday lives, with applications ranging from recommendations on what to buy to the analysis of radiology images. Many of the techniques originally developed for other fields such as language translation and computer vision are now being applied in drug discovery. AI has enabled multiple aspects of drug discovery including the analysis of high content screening data, and the design and synthesis of new molecules.Areas covered: This perspective provides an overview of the application of AI in several areas relevant to drug discovery including property prediction, molecule generation, image analysis, and organic synthesis planning.Expert opinion: While a variety of machine learning methods are now being routinely used to predict biological activity and ADME properties, methods of representing molecules continue to evolve. Molecule generation methods are relatively new and unproven but hold the potential to access new, unexplored areas of chemical space. The application of AI in drug discovery will continue to benefit from dedicated research, as well as AI developments in other fields. With this pairing algorithmic advancements and high-quality data, the impact of AI in drug discovery will continue to grow in the coming years.
Collapse
Affiliation(s)
| | - Regina Barzilay
- Computer Science and Artificial Intelligence Laboratory, MIT, Cambridge, MA, USA
| |
Collapse
|
5
|
Jiménez-Luna J, Grisoni F, Weskamp N, Schneider G. Artificial intelligence in drug discovery: recent advances and future perspectives. Expert Opin Drug Discov 2021; 16:949-959. [PMID: 33779453 DOI: 10.1080/17460441.2021.1909567] [Citation(s) in RCA: 83] [Impact Index Per Article: 27.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Introduction: Artificial intelligence (AI) has inspired computer-aided drug discovery. The widespread adoption of machine learning, in particular deep learning, in multiple scientific disciplines, and the advances in computing hardware and software, among other factors, continue to fuel this development. Much of the initial skepticism regarding applications of AI in pharmaceutical discovery has started to vanish, consequently benefitting medicinal chemistry.Areas covered: The current status of AI in chemoinformatics is reviewed. The topics discussed herein include quantitative structure-activity/property relationship and structure-based modeling, de novo molecular design, and chemical synthesis prediction. Advantages and limitations of current deep learning applications are highlighted, together with a perspective on next-generation AI for drug discovery.Expert opinion: Deep learning-based approaches have only begun to address some fundamental problems in drug discovery. Certain methodological advances, such as message-passing models, spatial-symmetry-preserving networks, hybrid de novo design, and other innovative machine learning paradigms, will likely become commonplace and help address some of the most challenging questions. Open data sharing and model development will play a central role in the advancement of drug discovery with AI.
Collapse
Affiliation(s)
- José Jiménez-Luna
- Department of Chemistry and Applied Biosciences, ETH Zurich, Zurich, Switzerland
| | - Francesca Grisoni
- Department of Chemistry and Applied Biosciences, ETH Zurich, Zurich, Switzerland
| | - Nils Weskamp
- Boehringer Ingelheim Pharma GmbH & Co. KG, Biberach an Der Riss, Germany
| | - Gisbert Schneider
- Department of Chemistry and Applied Biosciences, ETH Zurich, Zurich, Switzerland
| |
Collapse
|
6
|
Bugeac CA, Ancuceanu R, Dinu M. QSAR Models for Active Substances against Pseudomonas aeruginosa Using Disk-Diffusion Test Data. Molecules 2021; 26:molecules26061734. [PMID: 33808845 PMCID: PMC8003670 DOI: 10.3390/molecules26061734] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2021] [Revised: 03/14/2021] [Accepted: 03/15/2021] [Indexed: 12/02/2022] Open
Abstract
Pseudomonas aeruginosa is a Gram-negative bacillus included among the six “ESKAPE” microbial species with an outstanding ability to “escape” currently used antibiotics and developing new antibiotics against it is of the highest priority. Whereas minimum inhibitory concentration (MIC) values against Pseudomonas aeruginosa have been used previously for QSAR model development, disk diffusion results (inhibition zones) have not been apparently used for this purpose in the literature and we decided to explore their use in this sense. We developed multiple QSAR methods using several machine learning algorithms (support vector classifier, K nearest neighbors, random forest classifier, decision tree classifier, AdaBoost classifier, logistic regression and naïve Bayes classifier). We used four sets of molecular descriptors and fingerprints and three different methods of data balancing, together with the “native” data set. In total, 32 models were built for each set of descriptors or fingerprint and balancing method, of which 28 were selected and stacked to create meta-models. In terms of balanced accuracy, the best performance was provided by KNN, logistic regression and decision tree classifier, but the ensemble method had slightly superior results in nested cross-validation.
Collapse
Affiliation(s)
- Cosmin Alexandru Bugeac
- Faculty of Pharmacy, Carol Davila University of Medicine and Pharmacy, 6 Traian Vuia Street, Sector 2, 020956 Bucharest, Romania;
| | - Robert Ancuceanu
- Department of Pharmaceutical Botany and Cell Biology, Faculty of Pharmacy, Carol Davila University of Medicine and Pharmacy, 6 Traian Vuia Street, Sector 2, 020956 Bucharest, Romania;
- Correspondence:
| | - Mihaela Dinu
- Department of Pharmaceutical Botany and Cell Biology, Faculty of Pharmacy, Carol Davila University of Medicine and Pharmacy, 6 Traian Vuia Street, Sector 2, 020956 Bucharest, Romania;
| |
Collapse
|
7
|
Tree-Based QSAR Model for Drug Repurposing in the Discovery of New Antibacterial Compounds Against Escherichia coli. Pharmaceuticals (Basel) 2020; 13:ph13120431. [PMID: 33260726 PMCID: PMC7760995 DOI: 10.3390/ph13120431] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2020] [Revised: 11/23/2020] [Accepted: 11/26/2020] [Indexed: 01/31/2023] Open
Abstract
Drug repurposing appears as an increasing popular tool in the search of new treatment options against bacteria. In this paper, a tree-based classification method using Linear Discriminant Analysis (LDA) and discrete indexes was used to create a QSAR (Quantitative Structure-Activity Relationship) model to predict antibacterial activity against Escherichia coli. The model consists on a hierarchical decision tree in which a discrete index is used to divide compounds into groups according to their values for said index in order to construct probability spaces. The second step consists in the calculation of a discriminant function which determines the prediction of the model. The model was used to screen the DrugBank database, identifying 134 drugs as possible antibacterial candidates. Out of these 134 drugs, 8 were antibacterial drugs, 67 were drugs approved for different pathologies and 55 were drugs in experimental stages. This methodology has proven to be a viable alternative to the traditional methods used to obtain prediction models based on LDA and its application provides interesting new drug candidates to be studied as repurposed antibacterial treatments. Furthermore, the topological indexes Nclass and Numhba have proven to have the ability to group active compounds effectively, which suggests a close relationship between them and the antibacterial activity of compounds against E. coli.
Collapse
|
8
|
Kleandrova VV, Speck-Planche A. PTML Modeling for Alzheimer’s Disease: Design and Prediction of Virtual Multi-Target Inhibitors of GSK3B, HDAC1, and HDAC6. Curr Top Med Chem 2020; 20:1661-1676. [DOI: 10.2174/1568026620666200607190951] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2019] [Revised: 12/12/2019] [Accepted: 01/05/2020] [Indexed: 01/23/2023]
Abstract
Background:
Alzheimer’s disease is characterized by a progressive pattern of cognitive and
functional impairment, which ultimately leads to death. Computational approaches have played an important
role in the context of drug discovery for anti-Alzheimer's therapies. However, most of the computational
models reported to date have been focused on only one protein associated with Alzheimer's,
while relying on small datasets of structurally related molecules.
Objective:
We introduce the first model combining perturbation theory and machine learning based on
artificial neural networks (PTML-ANN) for simultaneous prediction and design of inhibitors of three
Alzheimer’s disease-related proteins, namely glycogen synthase kinase 3 beta (GSK3B), histone deacetylase
1 (HDAC1), and histone deacetylase 6 (HDAC6).
Methods:
The PTML-ANN model was obtained from a dataset retrieved from ChEMBL, and it relied on
a classification approach to predict chemicals as active or inactive.
Results:
The PTML-ANN model displayed sensitivity and specificity higher than 85% in both training
and test sets. The physicochemical and structural interpretation of the molecular descriptors in the model
permitted the direct extraction of fragments suggested to favorably contribute to enhancing the multitarget
inhibitory activity. Based on this information, we assembled ten molecules from several fragments
with positive contributions. Seven of these molecules were predicted as triple target inhibitors while the
remaining three were predicted as dual-target inhibitors. The estimated physicochemical properties of
the designed molecules complied with Lipinski’s rule of five and its variants.
Conclusion:
This work opens new horizons toward the design of multi-target inhibitors for anti- Alzheimer's
therapies.
Collapse
Affiliation(s)
- Valeria V. Kleandrova
- Laboratory of Fundamental and Applied Research of Quality and Technology of Food Production, Moscow State University of Food Production, Volokolamskoe Shosse 11, 125080, Moscow, Russian Federation
| | - Alejandro Speck-Planche
- Programa Institucional de Fomento a la Investigacion, Desarrollo e Innovacion, Universidad Tecnologica Metropolitana, Ignacio Valdivieso 2409, P.O. Box 8940577, San Joaquin, Santiago, Chile
| |
Collapse
|
9
|
Nocedo-Mena D, Cornelio C, Camacho-Corona MDR, Garza-González E, Waksman de Torres N, Arrasate S, Sotomayor N, Lete E, González-Díaz H. Modeling Antibacterial Activity with Machine Learning and Fusion of Chemical Structure Information with Microorganism Metabolic Networks. J Chem Inf Model 2019; 59:1109-1120. [PMID: 30802402 DOI: 10.1021/acs.jcim.9b00034] [Citation(s) in RCA: 29] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
Predicting the activity of new chemical compounds over pathogenic microorganisms with different metabolic reaction networks (MRN s) is an important goal due to the different susceptibility to antibiotics. The ChEMBL database contains >160 000 outcomes of preclinical assays of antimicrobial activity for 55 931 compounds with >365 parameters of activity (MIC, IC50, etc.) and >90 bacteria strains of >25 bacterial species. In addition, the Leong and Barabàsi data set includes >40 MRNs of microorganisms. However, there are no models able to predict antibacterial activity for multiple assays considering both drug and MRN structures at the same time. In this work, we combined perturbation theory, machine learning, and information fusion techniques to develop the first PTMLIF model. The best linear model found presented values of specificity = 90.31/90.40 and sensitivity = 88.14/88.07 in training/validation series. We carried out a comparison to nonlinear artificial neural network (ANN) techniques and previous models from the literature. Next, we illustrated the practical use of the model with an experimental case of study. We reported for the first time the isolation and characterization of terpenes from the plant Cissus incisa. The antibacterial activity of the terpenes was experimentally determined. The more active compounds were phytol and α-amyrin, with MIC = 100 μg/mL for Vancomycin-resistant Enterococcus faecium and Acinetobacter baumannii resistant to carbapenems. These compounds are already known from other sources. However, they have been isolated and evaluated for the first time here against several strains of multidrug-resistant bacteria including World Health Organization (WHO) priority pathogens. Last, we used the model to predict the activity of these compounds versus other microorganisms with different MRNs in order to find other potential targets.
Collapse
Affiliation(s)
- Deyani Nocedo-Mena
- Department of Organic Chemistry II , University of the Basque Country UPV/EHU , 48940 Leioa , Spain.,Facultad de Ciencias Químicas , Universidad Autónoma de Nuevo León , CP 66455 San Nicolás de los Garza , Nuevo León , México
| | - Carlos Cornelio
- Department of Organic Chemistry II , University of the Basque Country UPV/EHU , 48940 Leioa , Spain
| | - María Del Rayo Camacho-Corona
- Facultad de Ciencias Químicas , Universidad Autónoma de Nuevo León , CP 66455 San Nicolás de los Garza , Nuevo León , México
| | - Elvira Garza-González
- Servicio de Gastroenterología, Hospital Universitario, Dr. Eleuterio González , Universidad Autónoma de Nuevo León , CP 64460 Monterrey , Nuevo León , México
| | - Noemi Waksman de Torres
- Facultad de Medicina , Universidad Autónoma de Nuevo León , CP 64460 Monterrey , Nuevo León , México
| | - Sonia Arrasate
- Department of Organic Chemistry II , University of the Basque Country UPV/EHU , 48940 Leioa , Spain
| | - Nuria Sotomayor
- Department of Organic Chemistry II , University of the Basque Country UPV/EHU , 48940 Leioa , Spain
| | - Esther Lete
- Department of Organic Chemistry II , University of the Basque Country UPV/EHU , 48940 Leioa , Spain
| | - Humbert González-Díaz
- Department of Organic Chemistry II , University of the Basque Country UPV/EHU , 48940 Leioa , Spain.,IKERBASQUE, Basque Foundation for Science , 48011 Bilbao , Biscay , Spain
| |
Collapse
|
10
|
Khan K, Kar S, Sanderson H, Roy K, Leszczynski J. Ecotoxicological Modeling, Ranking and Prioritization of Pharmaceuticals Using QSTR and i‐QSTTR Approaches: Application of 2D and Fragment Based Descriptors. Mol Inform 2018; 38:e1800078. [DOI: 10.1002/minf.201800078] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2018] [Accepted: 11/01/2018] [Indexed: 12/22/2022]
Affiliation(s)
- Kabiruddin Khan
- Drug Theoretics and Cheminformatics Laboratory Department of Pharmaceutical Technology Jadavpur University Kolkata 700032 India
| | - Supratik Kar
- Interdisciplinary Center for Nanotoxicity Department of Chemistry, Physics and Atmospheric Sciences Jackson State University Jackson MS-39217 USA
| | - Hans Sanderson
- Department of Environmental Science, Section for Toxicology and Chemistry Aarhus University Frederiksborgvej 399 DK-4000 Roskilde Denmark
| | - Kunal Roy
- Drug Theoretics and Cheminformatics Laboratory Department of Pharmaceutical Technology Jadavpur University Kolkata 700032 India
| | - Jerzy Leszczynski
- Interdisciplinary Center for Nanotoxicity Department of Chemistry, Physics and Atmospheric Sciences Jackson State University Jackson MS-39217 USA
| |
Collapse
|
11
|
Antanasijević D, Antanasijević J, Trišović N, Ušćumlić G, Pocajt V. From Classification to Regression Multitasking QSAR Modeling Using a Novel Modular Neural Network: Simultaneous Prediction of Anticonvulsant Activity and Neurotoxicity of Succinimides. Mol Pharm 2017; 14:4476-4484. [PMID: 29130688 DOI: 10.1021/acs.molpharmaceut.7b00582] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Succinimides, which contain a pharmacophore responsible for anticonvulsant activity, are frequently used antiepileptic drugs and the synthesis of their new derivatives with improved efficacy and tolerability presents an important task. Nowadays, multitarget/tasking methodologies focused on quantitative-structure activity relationships (mt-QSAR/mtk-QSAR) have an important role in the rational design of drugs since they enable simultaneous prediction of several standard measures of biological activities at diverse experimental conditions and against different biological targets. Relating to this very topic, the mt-QSAR/mtk-QSAR methodology can give only binary classification models, and as such, in this study a regression mtk-QSAR (rmtk-QSAR) model based on a novel modular neural network (MNN) has been proposed. The MNN uses standard classification mtk-QSAR models as input modules, while the regression is performed by the output module. The rmtk-QSAR model has been successfully developed for the simultaneous prediction of anticonvulsant activity and neurotoxicity of succinimides, with a satisfactory accuracy in testing (R2 = 0.87). Thus, the proposed mtk-QSAR regression method can be regarded as a viable alternative to the standard QSAR methodology.
Collapse
Affiliation(s)
- Davor Antanasijević
- Innovation Center of the Faculty of Technology and Metallurgy and ‡Faculty of Technology and Metallurgy, University of Belgrade , Karnegijeva 4, Belgrade 11120, Serbia
| | - Jelena Antanasijević
- Innovation Center of the Faculty of Technology and Metallurgy and ‡Faculty of Technology and Metallurgy, University of Belgrade , Karnegijeva 4, Belgrade 11120, Serbia
| | - Nemanja Trišović
- Innovation Center of the Faculty of Technology and Metallurgy and ‡Faculty of Technology and Metallurgy, University of Belgrade , Karnegijeva 4, Belgrade 11120, Serbia
| | - Gordana Ušćumlić
- Innovation Center of the Faculty of Technology and Metallurgy and ‡Faculty of Technology and Metallurgy, University of Belgrade , Karnegijeva 4, Belgrade 11120, Serbia
| | - Viktor Pocajt
- Innovation Center of the Faculty of Technology and Metallurgy and ‡Faculty of Technology and Metallurgy, University of Belgrade , Karnegijeva 4, Belgrade 11120, Serbia
| |
Collapse
|
12
|
Two-class support vector machine with new kernel function based on paths of features for predicting chemical activity. Inf Sci (N Y) 2017. [DOI: 10.1016/j.ins.2017.04.003] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
|
13
|
Dobchev D, Karelson M. Have artificial neural networks met expectations in drug discovery as implemented in QSAR framework? Expert Opin Drug Discov 2016; 11:627-39. [PMID: 27149299 DOI: 10.1080/17460441.2016.1186876] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
Abstract
INTRODUCTION Artificial neural networks (ANNs) are highly adaptive nonlinear optimization algorithms that have been applied in many diverse scientific endeavors, ranging from economics, engineering, physics, and chemistry to medical science. Notably, in the past two decades, ANNs have been used widely in the process of drug discovery. AREAS COVERED In this review, the authors discuss advantages and disadvantages of ANNs in drug discovery as incorporated into the quantitative structure-activity relationships (QSAR) framework. Furthermore, the authors examine the recent studies, which span over a broad area with various diseases in drug discovery. In addition, the authors attempt to answer the question about the expectations of the ANNs in drug discovery and discuss the trends in this field. EXPERT OPINION The old pitfalls of overtraining and interpretability are still present with ANNs. However, despite these pitfalls, the authors believe that ANNs have likely met many of the expectations of researchers and are still considered as excellent tools for nonlinear data modeling in QSAR. It is likely that ANNs will continue to be used in drug development in the future.
Collapse
Affiliation(s)
- Dimitar Dobchev
- a Department of Chemistry , Tallinn University of Technology , Tallinn , Estonia
| | - Mati Karelson
- b Institute of Chemistry , University of Tartu , Tartu , Estonia
| |
Collapse
|
14
|
Lei T, Li Y, Song Y, Li D, Sun H, Hou T. ADMET evaluation in drug discovery: 15. Accurate prediction of rat oral acute toxicity using relevance vector machine and consensus modeling. J Cheminform 2016; 8:6. [PMID: 26839598 PMCID: PMC4736633 DOI: 10.1186/s13321-016-0117-7] [Citation(s) in RCA: 79] [Impact Index Per Article: 9.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2015] [Accepted: 01/20/2016] [Indexed: 01/31/2023] Open
Abstract
Background
Determination of acute toxicity, expressed as median lethal dose (LD50), is one of the most important steps in drug discovery pipeline. Because in vivo assays for oral acute toxicity in mammals are time-consuming and costly, there is thus an urgent need to develop in silico prediction models of oral acute toxicity.
Results In this study, based on a comprehensive data set containing 7314 diverse chemicals with rat oral LD50 values, relevance vector machine (RVM) technique was employed to build the regression models for the prediction of oral acute toxicity in rate, which were compared with those built using other six machine learning approaches, including k-nearest-neighbor regression, random forest (RF), support vector machine, local approximate Gaussian process, multilayer perceptron ensemble, and eXtreme gradient boosting. A subset of the original molecular descriptors and structural fingerprints (PubChem or SubFP) was chosen by the Chi squared statistics. The prediction capabilities of individual QSAR models, measured by qext2 for the test set containing 2376 molecules, ranged from 0.572 to 0.659. Conclusion Considering the overall prediction accuracy for the test set, RVM with Laplacian kernel and RF were recommended to build in silico models with better predictivity for rat oral acute toxicity. By combining the predictions from individual models, four consensus models were developed, yielding better prediction capabilities for the test set (qext2 = 0.669–0.689). Finally, some essential descriptors and substructures relevant to oral acute toxicity were identified and analyzed, and they may be served as property or substructure alerts to avoid toxicity. We believe that the best consensus model with high prediction accuracy can be used as a reliable virtual screening tool to filter out compounds with high rat oral acute toxicity.
Workflow of combinatorial QSAR modelling to predict rat oral acute toxicity ![]()
Collapse
Affiliation(s)
- Tailong Lei
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058 Zhejiang People's Republic of China
| | - Youyong Li
- Institute of Functional Nano and Soft Materials (FUNSOM), Soochow University, Suzhou, 215123 Jiangsu People's Republic of China
| | - Yunlong Song
- Department of Medicinal Chemistry, School of Pharmacy, Second Military Medical University, Shanghai, 200433 People's Republic of China
| | - Dan Li
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058 Zhejiang People's Republic of China
| | - Huiyong Sun
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058 Zhejiang People's Republic of China
| | - Tingjun Hou
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058 Zhejiang People's Republic of China ; State Key Lab of CAD&CG, Zhejiang University, Hangzhou, 310058 Zhejiang People's Republic of China
| |
Collapse
|
15
|
Durrant JD, Amaro RE. Machine-learning techniques applied to antibacterial drug discovery. Chem Biol Drug Des 2015; 85:14-21. [PMID: 25521642 DOI: 10.1111/cbdd.12423] [Citation(s) in RCA: 37] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2014] [Revised: 08/25/2014] [Accepted: 08/26/2014] [Indexed: 12/01/2022]
Abstract
The emergence of drug-resistant bacteria threatens to revert humanity back to the preantibiotic era. Even now, multidrug-resistant bacterial infections annually result in millions of hospital days, billions in healthcare costs, and, most importantly, tens of thousands of lives lost. As many pharmaceutical companies have abandoned antibiotic development in search of more lucrative therapeutics, academic researchers are uniquely positioned to fill the pipeline. Traditional high-throughput screens and lead-optimization efforts are expensive and labor intensive. Computer-aided drug-discovery techniques, which are cheaper and faster, can accelerate the identification of novel antibiotics, leading to improved hit rates and faster transitions to preclinical and clinical testing. The current review describes two machine-learning techniques, neural networks and decision trees, that have been used to identify experimentally validated antibiotics. We conclude by describing the future directions of this exciting field.
Collapse
Affiliation(s)
- Jacob D Durrant
- Department of Chemistry & Biochemistry and the National Biomedical Computation Resource, University of California, San Diego, La Jolla, CA, 92093, USA
| | | |
Collapse
|
16
|
Casañola-Martin GM, Le-Thi-Thu H, Pérez-Giménez F, Marrero-Ponce Y, Merino-Sanjuán M, Abad C, González-Díaz H. Multi-output model with Box–Jenkins operators of linear indices to predict multi-target inhibitors of ubiquitin–proteasome pathway. Mol Divers 2015; 19:347-56. [DOI: 10.1007/s11030-015-9571-9] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2014] [Accepted: 02/14/2015] [Indexed: 12/29/2022]
|
17
|
Speck-Planche A, Cordeiro MNDS. Multitasking models for quantitative structure–biological effect relationships: current status and future perspectives to speed up drug discovery. Expert Opin Drug Discov 2015; 10:245-56. [DOI: 10.1517/17460441.2015.1006195] [Citation(s) in RCA: 40] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023]
|
18
|
Speck-Planche A, Cordeiro MNDS. A general ANN-based multitasking model for the discovery of potent and safer antibacterial agents. Methods Mol Biol 2015; 1260:45-64. [PMID: 25502375 DOI: 10.1007/978-1-4939-2239-0_4] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/04/2023]
Abstract
Bacteria have been one of the world's most dangerous and deadliest pathogens for mankind, nowadays giving rise to significant public health concerns. Given the prevalence of these microbial pathogens and their increasing resistance to existing antibiotics, there is a pressing need for new antibacterial drugs. However, development of a successful drug is a complex, costly, and time-consuming process. Quantitative Structure-Activity Relationships (QSAR)-based approaches are valuable tools for shortening the time of lead compound identification but also for focusing and limiting time-costly synthetic activities and in vitro/vivo evaluations. QSAR-based approaches, supported by powerful statistical techniques such as artificial neural networks (ANNs), have evolved to the point of integrating dissimilar types of chemical and biological data. This chapter reports an overview of the current research and potential applications of QSAR modeling tools toward the rational design of more efficient antibacterial agents. Particular emphasis is given to the setup of multitasking models along with ANNs aimed at jointly predicting different antibacterial activities and safety profiles of drugs/chemicals under diverse experimental conditions.
Collapse
Affiliation(s)
- A Speck-Planche
- Department of Chemistry and Biochemistry, Faculty of Sciences, University of Porto, 4169-007, Porto, Portugal
| | | |
Collapse
|
19
|
Chemoinformatics for medicinal chemistry: in silico model to enable the discovery of potent and safer anti-cocci agents. Future Med Chem 2014; 6:2013-28. [DOI: 10.4155/fmc.14.136] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Abstract
Background: Gram-positive cocci are increasingly antibiotic-resistant bacteria responsible for causing serious diseases. Chemoinformatics can help to rationalize the discovery of more potent and safer antibacterial drugs. We have developed a chemoinformatic model for simultaneous prediction of anti-cocci activities, and profiles involving absorption, distribution, metabolism, elimination and toxicity (ADMET). Results: A dataset containing 48,874 cases from many different chemicals assayed under dissimilar experimental conditions was created. The best model displayed accuracies around 93% in both training and prediction (test) sets. Quantitative contributions of several fragments to the biological effects were calculated and analyzed. Multiple biological effects of the investigational drug JNJ-Q2 were correctly predicted. Conclusion: Our chemoinformatic model can be used as powerful tool for virtual screening of promising anti-cocci agents.
Collapse
|
20
|
Prediction of multi-target networks of neuroprotective compounds with entropy indices and synthesis, assay, and theoretical study of new asymmetric 1,2-rasagiline carbamates. Int J Mol Sci 2014; 15:17035-64. [PMID: 25255029 PMCID: PMC4200850 DOI: 10.3390/ijms150917035] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2014] [Revised: 08/19/2014] [Accepted: 08/21/2014] [Indexed: 11/25/2022] Open
Abstract
In a multi-target complex network, the links (Lij) represent the interactions between the drug (di) and the target (tj), characterized by different experimental measures (Ki, Km, IC50, etc.) obtained in pharmacological assays under diverse boundary conditions (cj). In this work, we handle Shannon entropy measures for developing a model encompassing a multi-target network of neuroprotective/neurotoxic compounds reported in the CHEMBL database. The model predicts correctly >8300 experimental outcomes with Accuracy, Specificity, and Sensitivity above 80%–90% on training and external validation series. Indeed, the model can calculate different outcomes for >30 experimental measures in >400 different experimental protocolsin relation with >150 molecular and cellular targets on 11 different organisms (including human). Hereafter, we reported by the first time the synthesis, characterization, and experimental assays of a new series of chiral 1,2-rasagiline carbamate derivatives not reported in previous works. The experimental tests included: (1) assay in absence of neurotoxic agents; (2) in the presence of glutamate; and (3) in the presence of H2O2. Lastly, we used the new Assessing Links with Moving Averages (ALMA)-entropy model to predict possible outcomes for the new compounds in a high number of pharmacological tests not carried out experimentally.
Collapse
|
21
|
González-Díaz H, Herrera-Ibatá DM, Duardo-Sánchez A, Munteanu CR, Orbegozo-Medina RA, Pazos A. ANN Multiscale Model of Anti-HIV Drugs Activity vs AIDS Prevalence in the US at County Level Based on Information Indices of Molecular Graphs and Social Networks. J Chem Inf Model 2014; 54:744-55. [DOI: 10.1021/ci400716y] [Citation(s) in RCA: 48] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Affiliation(s)
- Humberto González-Díaz
- Department
of Organic Chemistry II, Faculty of Science and Technology, University of the Basque Country UPV/EHU, 48940, Leioa, Vizcaya, Spain
- IKERBASQUE, Basque
Foundation for Science, 48011, Bilbao, Vizcaya, Spain
| | - Diana María Herrera-Ibatá
- Department of Information and Communication Technologies, University of A Coruña UDC, 15071, A Coruña, A Coruña, Spain
| | - Aliuska Duardo-Sánchez
- Department of Information and Communication Technologies, University of A Coruña UDC, 15071, A Coruña, A Coruña, Spain
| | - Cristian R. Munteanu
- Department of Information and Communication Technologies, University of A Coruña UDC, 15071, A Coruña, A Coruña, Spain
| | - Ricardo Alfredo Orbegozo-Medina
- Department
of Microbiology and Parasitology, University of Santiago de Compostela (USC), 15782, Santiago de Compostela, A Coruña, Spain
| | - Alejandro Pazos
- Department of Information and Communication Technologies, University of A Coruña UDC, 15071, A Coruña, A Coruña, Spain
| |
Collapse
|
22
|
Alonso N, Caamaño O, Romero-Duran FJ, Luan F, D. S. Cordeiro MN, Yañez M, González-Díaz H, García-Mera X. Model for high-throughput screening of multitarget drugs in chemical neurosciences: synthesis, assay, and theoretic study of rasagiline carbamates. ACS Chem Neurosci 2013; 4:1393-403. [PMID: 23855599 DOI: 10.1021/cn400111n] [Citation(s) in RCA: 47] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
The disappointing results obtained in recent clinical trials renew the interest in experimental/computational techniques for the discovery of neuroprotective drugs. In this context, multitarget or multiplexing QSAR models (mt-QSAR/mx-QSAR) may help to predict neurotoxicity/neuroprotective effects of drugs in multiple assays, on drug targets, and in model organisms. In this work, we study a data set downloaded from CHEMBL; each data point (>8000) contains the values of one out of 37 possible measures of activity, 493 assays, 169 molecular or cellular targets, and 11 different organisms (including human) for a given compound. In this work, we introduce the first mx-QSAR model for neurotoxicity/neuroprotective effects of drugs based on the MARCH-INSIDE (MI) method. First, we used MI to calculate the stochastic spectral moments (structural descriptors) of all compounds. Next, we found a model that classified correctly 2955 out of 3548 total cases in the training and validation series with Accuracy, Sensitivity, and Specificity values>80%. The model also showed excellent results in Computational-Chemistry simulations of High-Throughput Screening (CCHTS) experiments, with accuracy=90.6% for 4671 positive cases. Next, we reported the synthesis, characterization, and experimental assays of new rasagiline derivatives. We carried out three different experimental tests: assay (1) in the absence of neurotoxic agents, assay (2) in the presence of glutamate, and assay (3) in the presence of H2O2. Compounds 11 with 27.4%, 8 with 11.6%, and 9 with 15.4% showed the highest neuroprotective effects in assays (1), (2), and (3), respectively. After that, we used the mx-QSAR model to carry out a CCHTS of the new compounds in >400 unique pharmacological tests not carried out experimentally. Consequently, this model may become a promising auxiliary tool for the discovery of new drugs for the treatment of neurodegenerative diseases.
Collapse
Affiliation(s)
- Nerea Alonso
- Department of Organic Chemistry,
Faculty of Pharmacy, University of Santiago de Compostela (USC), 15782, Santiago de Compostela, Spain
| | - Olga Caamaño
- Department of Organic Chemistry,
Faculty of Pharmacy, University of Santiago de Compostela (USC), 15782, Santiago de Compostela, Spain
| | - Francisco J. Romero-Duran
- Department of Organic Chemistry,
Faculty of Pharmacy, University of Santiago de Compostela (USC), 15782, Santiago de Compostela, Spain
| | - Feng Luan
- REQUIMTE/Department of Chemistry
and Biochemistry, University of Porto,
4169-007, Porto, Portugal
- Department of Applied Chemistry, Yantai University, Yantai 264005, People’s Republic
of China
| | | | - Matilde Yañez
- Department of
Pharmacology,
Faculty of Pharmacy, USC, 15782, Santiago
de Compostela, Spain
| | - Humberto González-Díaz
- Departament
of Organic Chemistry
II, University of the Basque Country UPV/EHU, 48940, Leioa, Spain
- IKERBASQUE, Basque Foundation for Science, 48011, Bilbao, Spain
| | - Xerardo García-Mera
- Department of Organic Chemistry,
Faculty of Pharmacy, University of Santiago de Compostela (USC), 15782, Santiago de Compostela, Spain
| |
Collapse
|