1
|
Baltasar-Marchueta M, Llona L, M-Alicante S, Barbolla I, Ibarluzea MG, Ramis R, Salomon AM, Fundora B, Araujo A, Muguruza-Montero A, Nuñez E, Pérez-Olea S, Villanueva C, Leonardo A, Arrasate S, Sotomayor N, Villarroel A, Bergara A, Lete E, González-Díaz H. Identification of Riluzole derivatives as novel calmodulin inhibitors with neuroprotective activity by a joint synthesis, biosensor, and computational guided strategy. Biomed Pharmacother 2024; 174:116602. [PMID: 38636396 DOI: 10.1016/j.biopha.2024.116602] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2024] [Revised: 04/10/2024] [Accepted: 04/11/2024] [Indexed: 04/20/2024] Open
Abstract
The development of new molecules for the treatment of calmodulin related cardiovascular or neurodegenerative diseases is an interesting goal. In this work, we introduce a novel strategy with four main steps: (1) chemical synthesis of target molecules, (2) Förster Resonance Energy Transfer (FRET) biosensor development and in vitro biological assay of new derivatives, (3) Cheminformatics models development and in vivo activity prediction, and (4) Docking studies. This strategy is illustrated with a case study. Firstly, a series of 4-substituted Riluzole derivatives 1-3 were synthetized through a strategy that involves the construction of the 4-bromoriluzole framework and its further functionalization via palladium catalysis or organolithium chemistry. Next, a FRET biosensor for monitoring Ca2+-dependent CaM-ligands interactions has been developed and used for the in vitro assay of Riluzole derivatives. In particular, the best inhibition (80%) was observed for 4-methoxyphenylriluzole 2b. Besides, we trained and validated a new Networks Invariant, Information Fusion, Perturbation Theory, and Machine Learning (NIFPTML) model for predicting probability profiles of in vivo biological activity parameters in different regions of the brain. Next, we used this model to predict the in vivo activity of the compounds experimentally studied in vitro. Last, docking study conducted on Riluzole and its derivatives has provided valuable insights into their binding conformations with the target protein, involving calmodulin and the SK4 channel. This new combined strategy may be useful to reduce assay costs (animals, materials, time, and human resources) in the drug discovery process of calmodulin inhibitors.
Collapse
Affiliation(s)
- Maider Baltasar-Marchueta
- Department of Organic and Inorganic Chemistry, University of the Basque Country UPV/EHU, Leioa 48940, Spain
| | - Leire Llona
- Department of Organic and Inorganic Chemistry, University of the Basque Country UPV/EHU, Leioa 48940, Spain
| | | | - Iratxe Barbolla
- Department of Organic and Inorganic Chemistry, University of the Basque Country UPV/EHU, Leioa 48940, Spain
| | - Markel Garcia Ibarluzea
- Donostia International Physics Center, Donostia, Spain; Departament of Physics, University of the Basque Country, UPV/EHU, Leioa, Spain
| | - Rafael Ramis
- Donostia International Physics Center, Donostia, Spain; Departament of Physics, University of the Basque Country, UPV/EHU, Leioa, Spain
| | - Ane Miren Salomon
- Department of Organic and Inorganic Chemistry, University of the Basque Country UPV/EHU, Leioa 48940, Spain
| | - Brenda Fundora
- Department of Organic and Inorganic Chemistry, University of the Basque Country UPV/EHU, Leioa 48940, Spain
| | - Ariane Araujo
- Biofisika Institute, CSIC-UPV/EHU, Leioa 48940, Spain
| | | | - Eider Nuñez
- Biofisika Institute, CSIC-UPV/EHU, Leioa 48940, Spain
| | - Scarlett Pérez-Olea
- Department of Organic and Inorganic Chemistry, University of the Basque Country UPV/EHU, Leioa 48940, Spain
| | - Christian Villanueva
- Department of Organic and Inorganic Chemistry, University of the Basque Country UPV/EHU, Leioa 48940, Spain
| | - Aritz Leonardo
- Donostia International Physics Center, Donostia, Spain; Departament of Physics, University of the Basque Country, UPV/EHU, Leioa, Spain
| | - Sonia Arrasate
- Department of Organic and Inorganic Chemistry, University of the Basque Country UPV/EHU, Leioa 48940, Spain
| | - Nuria Sotomayor
- Department of Organic and Inorganic Chemistry, University of the Basque Country UPV/EHU, Leioa 48940, Spain
| | | | - Aitor Bergara
- Donostia International Physics Center, Donostia, Spain; Departament of Physics, University of the Basque Country, UPV/EHU, Leioa, Spain.
| | - Esther Lete
- Department of Organic and Inorganic Chemistry, University of the Basque Country UPV/EHU, Leioa 48940, Spain.
| | - Humberto González-Díaz
- Department of Organic and Inorganic Chemistry, University of the Basque Country UPV/EHU, Leioa 48940, Spain; Biofisika Institute, CSIC-UPV/EHU, Leioa 48940, Spain; IKERBASQUE, Basque Foundation for Science, Bilbao 48011, Spain.
| |
Collapse
|
2
|
Zhuang J, Midgley AC, Wei Y, Liu Q, Kong D, Huang X. Machine-Learning-Assisted Nanozyme Design: Lessons from Materials and Engineered Enzymes. ADVANCED MATERIALS (DEERFIELD BEACH, FLA.) 2024; 36:e2210848. [PMID: 36701424 DOI: 10.1002/adma.202210848] [Citation(s) in RCA: 14] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/21/2022] [Revised: 01/03/2023] [Indexed: 05/11/2023]
Abstract
Nanozymes are nanomaterials that exhibit enzyme-like biomimicry. In combination with intrinsic characteristics of nanomaterials, nanozymes have broad applicability in materials science, chemical engineering, bioengineering, biochemistry, and disease theranostics. Recently, the heterogeneity of published results has highlighted the complexity and diversity of nanozymes in terms of consistency of catalytic capacity. Machine learning (ML) shows promising potential for discovering new materials, yet it remains challenging for the design of new nanozymes based on ML approaches. Alternatively, ML is employed to promote optimization of intelligent design and application of catalytic materials and engineered enzymes. Incorporation of the successful ML algorithms used in the intelligent design of catalytic materials and engineered enzymes can concomitantly facilitate the guided development of next-generation nanozymes with desirable properties. Here, recent progress in ML, its utilization in the design of catalytic materials and enzymes, and how emergent ML applications serve as promising strategies to circumvent challenges associated with time-expensive and laborious testing in nanozyme research and development are summarized. The potential applications of successful examples of ML-aided catalytic materials and engineered enzymes in nanozyme design are also highlighted, with special focus on the unified aims in enhancing design and recapitulation of substrate selectivity and catalytic activity.
Collapse
Affiliation(s)
- Jie Zhuang
- School of Medicine, and State, Key Laboratory of Medicinal Chemical Biology, Nankai University, Tianjin, 300071, China
| | - Adam C Midgley
- Key Laboratory of Bioactive Materials for the Ministry of Education, College of Life Sciences, State Key Laboratory of Medicinal Chemical Biology, and Frontiers, Science Center for Cell Responses, Nankai University, Tianjin, 300071, China
| | - Yonghua Wei
- Key Laboratory of Bioactive Materials for the Ministry of Education, College of Life Sciences, State Key Laboratory of Medicinal Chemical Biology, and Frontiers, Science Center for Cell Responses, Nankai University, Tianjin, 300071, China
| | - Qiqi Liu
- Key Laboratory of Bioactive Materials for the Ministry of Education, College of Life Sciences, State Key Laboratory of Medicinal Chemical Biology, and Frontiers, Science Center for Cell Responses, Nankai University, Tianjin, 300071, China
| | - Deling Kong
- Key Laboratory of Bioactive Materials for the Ministry of Education, College of Life Sciences, State Key Laboratory of Medicinal Chemical Biology, and Frontiers, Science Center for Cell Responses, Nankai University, Tianjin, 300071, China
| | - Xinglu Huang
- Key Laboratory of Bioactive Materials for the Ministry of Education, College of Life Sciences, State Key Laboratory of Medicinal Chemical Biology, and Frontiers, Science Center for Cell Responses, Nankai University, Tianjin, 300071, China
| |
Collapse
|
3
|
Mubashir M, Ahmad T, Liu X, Rehman LM, de Levay JPBB, Al Nuaimi R, Thankamony R, Lai Z. Artificial intelligence and structural design of inorganic hollow fiber membranes: Materials chemistry. CHEMOSPHERE 2023; 338:139525. [PMID: 37467860 DOI: 10.1016/j.chemosphere.2023.139525] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/29/2023] [Revised: 07/02/2023] [Accepted: 07/14/2023] [Indexed: 07/21/2023]
Abstract
A key challenge is to produce the uniform morphology and regular pore design of inorganic hollow fiber membranes (HFMs) due to involvement of multiple parameters including, fabrication process and materials chemistry. Inorganic HFMs required technical innovations via novel structural design and artificial intelligence (AI) to produce the uniform structure and regular pore design. Therefore, this review aims at critical analysis on the most recent and relevant approaches to tackle the issues related to tune the morphology and pore design of inorganic HFMs. Structural design and evaluation of routes towards the dope suspension, spinning, and sintering of inorganic HFMs are critically analysed. AI, driving forces and challenges involved for harnessing of materials are revealed in this review. AI programs used for the prediction of pore design and performance of HFMs have also been explained in this review. Overall, this review will provide the understanding to build the equilibrium in spinning and sintering processes to control the design of micro-channels, and structural properties of inorganic HFMs. This review has great significance to control the new design of membranes via AI programs. This review also explain the inorganic membrane efficiency as algal-bioreactor.
Collapse
Affiliation(s)
- Muhammad Mubashir
- Physical Science and Engineering Division, Advanced Membranes and Porous Materials Center, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia.
| | - Tausif Ahmad
- Physical Science and Engineering Division, Advanced Membranes and Porous Materials Center, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
| | - Xiaowei Liu
- Physical Science and Engineering Division, Advanced Membranes and Porous Materials Center, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
| | - Lubna Muzamil Rehman
- Physical Science and Engineering Division, Advanced Membranes and Porous Materials Center, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
| | - Jean-Pierre Benjamin Boross de Levay
- Physical Science and Engineering Division, Advanced Membranes and Porous Materials Center, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
| | - Reham Al Nuaimi
- Physical Science and Engineering Division, Advanced Membranes and Porous Materials Center, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
| | - Roshni Thankamony
- Physical Science and Engineering Division, Advanced Membranes and Porous Materials Center, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
| | - Zhiping Lai
- Physical Science and Engineering Division, Advanced Membranes and Porous Materials Center, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia.
| |
Collapse
|
4
|
Diéguez-Santana K, Casañola-Martin GM, Torres R, Rasulev B, Green JR, González-Díaz H. Machine Learning Study of Metabolic Networks vs ChEMBL Data of Antibacterial Compounds. Mol Pharm 2022; 19:2151-2163. [PMID: 35671399 PMCID: PMC9986951 DOI: 10.1021/acs.molpharmaceut.2c00029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Antibacterial drugs (AD) change the metabolic status of bacteria, contributing to bacterial death. However, antibiotic resistance and the emergence of multidrug-resistant bacteria increase interest in understanding metabolic network (MN) mutations and the interaction of AD vs MN. In this study, we employed the IFPTML = Information Fusion (IF) + Perturbation Theory (PT) + Machine Learning (ML) algorithm on a huge dataset from the ChEMBL database, which contains >155,000 AD assays vs >40 MNs of multiple bacteria species. We built a linear discriminant analysis (LDA) and 17 ML models centered on the linear index and based on atoms to predict antibacterial compounds. The IFPTML-LDA model presented the following results for the training subset: specificity (Sp) = 76% out of 70,000 cases, sensitivity (Sn) = 70%, and Accuracy (Acc) = 73%. The same model also presented the following results for the validation subsets: Sp = 76%, Sn = 70%, and Acc = 73.1%. Among the IFPTML nonlinear models, the k nearest neighbors (KNN) showed the best results with Sn = 99.2%, Sp = 95.5%, Acc = 97.4%, and Area Under Receiver Operating Characteristic (AUROC) = 0.998 in training sets. In the validation series, the Random Forest had the best results: Sn = 93.96% and Sp = 87.02% (AUROC = 0.945). The IFPTML linear and nonlinear models regarding the ADs vs MNs have good statistical parameters, and they could contribute toward finding new metabolic mutations in antibiotic resistance and reducing time/costs in antibacterial drug research.
Collapse
Affiliation(s)
- Karel Diéguez-Santana
- Department of Organic and Inorganic Chemistry, University of Basque Country UPV/EHU, 48940 Leioa, Spain.,Universidad Regional Amazónica IKIAM, Tena, Napo 150150, Ecuador
| | - Gerardo M Casañola-Martin
- Department of Coatings and Polymeric Materials, North Dakota State University, Fargo, North Dakota 58102, United States.,Department of Systems and Computer Engineering, Carleton University, K1S5B6 Ottawa, Ontario, Canada
| | - Roldan Torres
- Universidad Regional Amazónica IKIAM, Tena, Napo 150150, Ecuador
| | - Bakhtiyor Rasulev
- Department of Coatings and Polymeric Materials, North Dakota State University, Fargo, North Dakota 58102, United States
| | - James R Green
- Department of Systems and Computer Engineering, Carleton University, K1S5B6 Ottawa, Ontario, Canada
| | - Humbert González-Díaz
- Department of Organic and Inorganic Chemistry, University of Basque Country UPV/EHU, 48940 Leioa, Spain.,BIOFISIKA, Basque Center for Biophysics CSIC-UPVEH, 48940 Leioa, Spain.,IKERBASQUE, Basque Foundation for Science, 48011 Bilbao, Biscay, Spain
| |
Collapse
|
5
|
Ortega-Tenezaca B, Quevedo-Tumailli V, Bediaga H, Collados J, Arrasate S, Madariaga G, Munteanu CR, Cordeiro MND, González-Díaz H. PTML Multi-Label Algorithms: Models, Software, and Applications. Curr Top Med Chem 2020; 20:2326-2337. [DOI: 10.2174/1568026620666200916122616] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2020] [Revised: 07/19/2020] [Accepted: 07/20/2020] [Indexed: 12/17/2022]
Abstract
By combining Machine Learning (ML) methods with Perturbation Theory (PT), it is possible
to develop predictive models for a variety of response targets. Such combination often known as
Perturbation Theory Machine Learning (PTML) modeling comprises a set of techniques that can handle
various physical, and chemical properties of different organisms, complex biological or material
systems under multiple input conditions. In so doing, these techniques effectively integrate a manifold
of diverse chemical and biological data into a single computational framework that can then be applied
for screening lead chemicals as well as to find clues for improving the targeted response(s).
PTML models have thus been extremely helpful in drug or material design efforts and found to be
predictive and applicable across a broad space of systems. After a brief outline of the applied methodology,
this work reviews the different uses of PTML in Medicinal Chemistry, as well as in other
applications. Finally, we cover the development of software available nowadays for setting up PTML
models from large datasets.
Collapse
Affiliation(s)
| | | | - Harbil Bediaga
- Department of Organic and Inorganic Chemistry, University of Basque Country UPV/EHU, 48940 Leioa, Spain
| | - Jon Collados
- Department of Organic and Inorganic Chemistry, University of Basque Country UPV/EHU, 48940 Leioa, Spain
| | - Sonia Arrasate
- Department of Organic and Inorganic Chemistry, University of Basque Country UPV/EHU, 48940 Leioa, Spain
| | - Gotzon Madariaga
- Department of Condensed Matter Physics, University of Basque Country UPV/EHU, 48940 Leioa, Spain
| | - Cristian R Munteanu
- RNASA-IMEDIR, Computer Science Faculty, University of A Coruna, 15071 A Coruna, Spain
| | - M. Natália D.S. Cordeiro
- LAQV@REQUIMTE, Department of Chemistry and Biochemistry, University of Porto, 4169-007 Porto, Portugal
| | - Humbert González-Díaz
- Department of Organic and Inorganic Chemistry, University of Basque Country UPV/EHU, 48940 Leioa, Spain
| |
Collapse
|
6
|
Jafari K, Fatemi MH, Toropova AP, Toropov AA. Correlation Intensity Index (CII) as a criterion of predictive potential: Applying to model thermal conductivity of metal oxide-based ethylene glycol nanofluids. Chem Phys Lett 2020. [DOI: 10.1016/j.cplett.2020.137614] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]
|
7
|
Santana R, Zuluaga R, Gañán P, Arrasate S, Onieva E, Montemore MM, González-Díaz H. PTML Model for Selection of Nanoparticles, Anticancer Drugs, and Vitamins in the Design of Drug-Vitamin Nanoparticle Release Systems for Cancer Cotherapy. Mol Pharm 2020; 17:2612-2627. [PMID: 32459098 DOI: 10.1021/acs.molpharmaceut.0c00308] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
Nanosystems are gaining momentum in pharmaceutical sciences because of the wide variety of possibilities for designing these systems to have specific functions. Specifically, studies of new cancer cotherapy drug-vitamin release nanosystems (DVRNs) including anticancer compounds and vitamins or vitamin derivatives have revealed encouraging results. However, the number of possible combinations of design and synthesis conditions is remarkably high. In addition, a large number of anticancer and vitamin derivatives have been already assayed, but a notably less number of cases of DVRNs were assayed as a whole (with the anticancer compound and the vitamin linked to them). Our approach combines with the perturbation theory and machine learning (PTML) model to predict the probability of obtaining an interesting DVRN by changing the anticancer compound and/or the vitamin present in a DVRN that is already tested for other anticancer compounds or vitamins that have not been tested yet as part of a DVRN. In a previous work, we built a linear PTML model useful for the design of these nanosystems. In doing so, we used information fusion (IF) techniques to carry out data enrichment of DVRN data compiled from the literature with the data for preclinical assays of vitamins from the ChEMBL database. The design features of DVRNs and the assay conditions of nanoparticles (NPs) and vitamins were included as multiplicative PT operators (PTOs) to the system, which indicates the importance of these variables. However, the previous work omitted experiments with nonlinear ML techniques and different types of PTOs such as metric-based PTOs. More importantly, the previous work does not consider the structure of the anticancer drug to be included in the new DVRNs. In this work, we are going to accomplish three main objectives (tasks). In the first task, we found a new model, alternative to the one published before, for the rational design of DVRNs using metric-based PTOs. The most accurate PTML model was the artificial neural network model, which showed values of specificity, sensitivity, and accuracy in the range of 90-95% in training and external validation series for more than 130,000 cases (DVRNs vs ChEMBL assays). Furthermore, in the second task, we used IF techniques to carry out data enrichment of our previous data set. In doing so, we constructed a new working data set of >970,000 cases with the data of preclinical assays of DVRNs, vitamins, and anticancer compounds from the ChEMBL database. All these assays have multiple continuous variables or descriptors dk and categorical variables cj (conditions of the assays) for drugs (dack, cacj), vitamins (dvk, cvj), and NPs (dnk, cnj). These data include >20,000 potential anticancer compounds with >270 protein targets (cac1), >580 assay cell organisms (cac2), and so forth. Furthermore, we include >36,000 assay vitamin derivatives in >6200 types of cells (c2vit), >120 assay organisms (c3vit), >60 assay strains (c4vit), and so forth. The enriched data set also contains >20 types of DVRNs (c5n) with 9 NP core materials (c4n), 8 synthesis methods (c7n), and so forth. We expressed all this information with PTOs and developed a qualitatively new PTML model that incorporates information of the anticancer drugs. This new model presents 96-97% of accuracy for training and external validation subsets. In the last task, we carried out a comparative study of ML and/or PTML models published and described how the models we are presenting cover the gap of knowledge in terms of drug delivery. In conclusion, we present here for the first time a multipurpose PTML model that is able to select NPs, anticancer compounds, and vitamins and their conditions of assay for DVRN design.
Collapse
Affiliation(s)
- Ricardo Santana
- Department of Chemical and Biomolecular Engineering, Tulane University, 6823 St Charles Avenue, New Orleans, Louisiana 70118, United States.,University of Deusto, Avda. Universidades, 24, 48007 Bilbao, Spain.,Grupo de Investigación Sobre Nuevos Materiales, Facultad de Ingeniería Química, Universidad Pontificia Bolivariana, Circular 1 No. 70-01, 050031 Medellín, Colombia
| | - Robin Zuluaga
- Facultad de Ingeniería Agroindustrial, Universidad Pontificia Bolivariana, Circular 1 No. 70-01, 050031 Medellín, Colombia
| | - Piedad Gañán
- Grupo de Investigación Sobre Nuevos Materiales, Facultad de Ingeniería Química, Universidad Pontificia Bolivariana, Circular 1 No. 70-01, 050031 Medellín, Colombia
| | - Sonia Arrasate
- Department of Organic Chemistry II, University of Basque Country UPV/EHU, 48940 Leioa, Basque Country, Spain
| | - Enrique Onieva
- University of Deusto, Avda. Universidades, 24, 48007 Bilbao, Spain
| | - Matthew M Montemore
- Department of Chemical and Biomolecular Engineering, Tulane University, 6823 St Charles Avenue, New Orleans, Louisiana 70118, United States
| | - Humbert González-Díaz
- Department of Organic Chemistry II, University of Basque Country UPV/EHU, 48940 Leioa, Basque Country, Spain.,Basque Center for Biophysics, Spanish National Research Council (CSIC)-University of Basque Country UPV/EHU, 48940 Leioa, Basque Country, Spain.,Ikerbasque, Basque Foundation for Science, 48013 Bilbao, Basque Country, Spain
| |
Collapse
|
8
|
Santana R, Zuluaga R, Gañán P, Arrasate S, Onieva Caracuel E, González-Díaz H. PTML Model of ChEMBL Compounds Assays for Vitamin Derivatives. ACS COMBINATORIAL SCIENCE 2020; 22:129-141. [PMID: 32011854 DOI: 10.1021/acscombsci.9b00166] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
Determining the biological activity of vitamin derivatives is needed given that organic synthesis of analogs of vitamins is an active field of interest for medicinal chemistry, pharmaceuticals, and food additives. Accordingly, scientists from different disciplines perform preclinical assays (nij) with a considerable combination of assay conditions (cj). Indeed, the ChEMBL platform contains a database that includes results from 36 220 different biological activity bioassays of 21 240 different vitamins and vitamin derivatives. These assays present are heterogeneous in terms of assay combinations of cj. They are focused on >500 different biological activity parameters (c0), >340 different targets (c1), >6200 types of cell (c2), >120 organisms of assay (c3), and >60 assay strains (c4). It includes a total of >1850 niacin assays, >1580 tretinoin assays, >1580 retinol assays, 857 ascorbic acid assays, etc. Given the complexity of this combinatorial data in terms of being assimilated by researchers, we propose to build a model by combining perturbation theory (PT) and machine learning (ML). Through this study, we propose a PTML (PT + ML) combinatorial model for ChEMBL results on biological activity of vitamins and vitamins derivatives. The linear discriminant analysis (LDA) model presented the following results for training subset a: specificity (%) = 90.38, sensitivity (%) = 87.51, and accuracy (%) = 89.89. The model showed the following results for the external validation subset: specificity (%) = 90.58, sensitivity (%) = 87.72, and accuracy (%) = 90.09. Different types of linear and nonlinear PTML models, such as logistic regression (LR), classification tree (CT), näive Bayes (NB), and random Forest (RF), were applied to contrast the capacity of prediction. The PTML-LDA model predicts with more accuracy by applying combinatorial descriptors. In addition, a PCA experiment with chemical structure descriptors allowed us to characterize the high structural diversity of the chemical space studied. In any case, PTML models using chemical structure descriptors do not improve the performance of the PTML-LDA model based on ALOGP and PSA. We can conclude that the three variable PTML-LDA model is a simplified and adaptable tool for the prediction, for different experiment combinations, the biological activity of derivative vitamins.
Collapse
Affiliation(s)
- Ricardo Santana
- DeustoTech-Fundación Deusto, Avda. Universidades, 24, 48007 Bilbao, Spain
- Grupo de Investigación sobre Nuevos Materiales, Universidad Pontificia Bolivariana UPB, 050031, Medellín, Colombia
| | - Robin Zuluaga
- Facultad de Ingeniería Agroindustrial, Universidad Pontificia Bolivariana UPB, 050031, Medellín, Colombia
| | - Piedad Gañán
- Facultad de Ingeniería Química, Universidad Pontificia Bolivariana UPB, 050031, Medellín, Colombia
| | - Sonia Arrasate
- Department of Organic Chemistry II, University of Basque Country UPV/EHU, 48940, Leioa, Spain
| | | | - Humbert González-Díaz
- Department of Organic Chemistry II, University of Basque Country UPV/EHU, 48940, Leioa, Spain
- IKERBASQUE, Basque Foundation for Science, 48011, Bilbao, Spain
| |
Collapse
|
9
|
Montes-Bageneta I, Akesolo U, López S, Merino M, Anakabe E, Arrasate S. Pollutants in Organic Chemistry and Medicinal Chemistry Education Laboratory. Experimental and Machine Learning Studies. Curr Top Med Chem 2020; 20:720-730. [PMID: 32066360 DOI: 10.2174/1568026620666200211110043] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2019] [Revised: 12/27/2019] [Accepted: 12/27/2019] [Indexed: 11/22/2022]
Abstract
AIMS Computational modelling may help us to detect the more important factors governing this process in order to optimize it. BACKGROUND The generation of hazardous organic waste in teaching and research laboratories poses a big problem that universities have to manage. METHODS In this work, we report on the experimental measurement of waste generation on the chemical education laboratories within our department. We measured the waste generated in the teaching laboratories of the Organic Chemistry Department II (UPV/EHU), in the second semester of the 2017/2018 academic year. Likewise, to know the anthropogenic and social factors related to the generation of waste, a questionnaire has been utilized. We focused on all students of Experimentation in Organic Chemistry (EOC) and Organic Chemistry II (OC2) subjects. It helped us to know their prior knowledge about waste, awareness of the problem of separate organic waste and the correct use of the containers. These results, together with the volumetric data, have been analyzed with statistical analysis software. We obtained two Perturbation-Theory Machine Learning (PTML) models including chemical, operational, and academic factors. The dataset analyzed included 6050 cases of laboratory practices vs. practices of reference. RESULTS These models predict the values of acetone waste with R2 = 0.88 and non-halogenated waste with R2 = 0.91. CONCLUSION This work opens a new gate to the implementation of more sustainable techniques and a circular economy with the aim of improving the quality of university education processes.
Collapse
Affiliation(s)
- Iker Montes-Bageneta
- Department of Organic Chemistry II, University of Basque Country UPV/EHU, 48940, Leioa, Spain
| | - Urtzi Akesolo
- Department of Organic Chemistry II, University of Basque Country UPV/EHU, 48940, Leioa, Spain
| | - Sara López
- Faculty of Science and Technology, University of Basque Country UPV/EHU, 48940, Leioa, Spain
| | - Maria Merino
- Department of Applied Mathematics, Statistics, and Operational Research, University of Basque Country UPV/EHU, 48940, Leioa, Spain
| | - Eneritz Anakabe
- Department of Organic Chemistry II, University of Basque Country UPV/EHU, 48940, Leioa, Spain
| | - Sonia Arrasate
- Department of Organic Chemistry II, University of Basque Country UPV/EHU, 48940, Leioa, Spain
| |
Collapse
|
10
|
Toyao T, Maeno Z, Takakusagi S, Kamachi T, Takigawa I, Shimizu KI. Machine Learning for Catalysis Informatics: Recent Applications and Prospects. ACS Catal 2019. [DOI: 10.1021/acscatal.9b04186] [Citation(s) in RCA: 189] [Impact Index Per Article: 37.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Affiliation(s)
- Takashi Toyao
- Institute for Catalysis, Hokkaido University, N-21, W-10, Sapporo 001-0021, Japan
- Elements Strategy Initiative for Catalysts and Batteries, Kyoto University, Katsura, Kyoto 615-8520, Japan
| | - Zen Maeno
- Institute for Catalysis, Hokkaido University, N-21, W-10, Sapporo 001-0021, Japan
| | - Satoru Takakusagi
- Institute for Catalysis, Hokkaido University, N-21, W-10, Sapporo 001-0021, Japan
| | - Takashi Kamachi
- Elements Strategy Initiative for Catalysts and Batteries, Kyoto University, Katsura, Kyoto 615-8520, Japan
- Department of Life, Environment and Materials Science, Fukuoka Institute of Technology, 3-30-1Wajiro-Higashi, Higashi-ku, Fukuoka 811-0295, Japan
| | - Ichigaku Takigawa
- RIKEN Center for Advanced Intelligence Project, 1-4-1 Nihonbashi, Chuo-ku, Tokyo 103-0027, Japan
- Institute for Chemical Reaction Design and Discovery (WPI-ICReDD), Hokkaido University, Kita 21 Nishi 10, Kita-ku, Sapporo, Hokkaido 001-0021, Japan
| | - Ken-ichi Shimizu
- Institute for Catalysis, Hokkaido University, N-21, W-10, Sapporo 001-0021, Japan
- Elements Strategy Initiative for Catalysts and Batteries, Kyoto University, Katsura, Kyoto 615-8520, Japan
| |
Collapse
|
11
|
Santana R, Zuluaga R, Gañán P, Arrasate S, Onieva E, González-Díaz H. Designing nanoparticle release systems for drug-vitamin cancer co-therapy with multiplicative perturbation-theory machine learning (PTML) models. NANOSCALE 2019; 11:21811-21823. [PMID: 31691701 DOI: 10.1039/c9nr05070a] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Nano-systems for cancer co-therapy including vitamins or vitamin derivatives have showed adequate results to continue with further research studies to better understand them. However, the number of different combinations of drugs, vitamins, nanoparticle types, coating agents, synthesis conditions, and system types (nanocapsules, micelles, etc.) to be tested is very large generating a high cost in experimentations. In this context, there are reports of large datasets of preclinical assays of compounds (like in the ChEMBL database) and increasing but yet limited reports of experimental measurements of nano-systems per se. On the other hand, Machine Learning is gaining momentum in Nanotechnology and Pharmaceutical Sciences as a tool for rational design of new drugs and drug-release nano-systems. In this work, we propose to combine Perturbation Theory principles and Machine Learning to develop a PTML model for rational selection of the components of cancer co-therapy drug-vitamin release nano-systems (DVRNs). In doing so, we apply information fusion techniques with 2 data sets: (1) a large ChEMBL dataset of >36 000 preclinical assays of vitamin derivatives and a new dataset of >1000 outcomes of DVRNs, collected herein from the literature for the first time. The ChEMBL dataset used covers a considerable number of assay conditions (cjvit) each one with multiple levels. These conditions included >504 biological activity parameters (c0vit), >340 types of proteins (c1vit), >650 types of cells (c2vit), >120 assay organisms (c3vit), >60 assay strains (c4vit). Regarding the DVRNs, there are 25 different types of nano-systems (njn), with up to 16 conditions (cjn) including also different levels such as 8 biological activity parameters (c0n), 9 raw nanomaterials (c4n), 15 assay cells (c11n), etc. In the first stage, we used Moving Average operators to quantify the perturbations (deviations) in all input variables with respect to the conditions. After that, we used multiplicative PT operators to carry out data fusion, and dimension reduction, and Linear Discriminant Analysis (LDA) to seek the PTML model. The best PTML model found showed values of specificity, sensitivity, and accuracy in the range of 83-88% in training and external validation series for >130 000 cases (DVRNs vs. ChEMBL data pairs) formed after data fusion. To the best of our knowledge, this is the first general purpose model for the rational design of DVRNs for cancer co-therapy.
Collapse
|
12
|
Diez-Alarcia R, Yáñez-Pérez V, Muneta-Arrate I, Arrasate S, Lete E, Meana JJ, González-Díaz H. Big Data Challenges Targeting Proteins in GPCR Signaling Pathways; Combining PTML-ChEMBL Models and [ 35S]GTPγS Binding Assays. ACS Chem Neurosci 2019; 10:4476-4491. [PMID: 31618004 DOI: 10.1021/acschemneuro.9b00302] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
G-protein-coupled receptors (GPCRs), also known as 7-transmembrane receptors, are the single largest class of drug targets. Consequently, a large amount of preclinical assays having GPCRs as molecular targets has been released to public sources like the Chemical European Molecular Biology Laboratory (ChEMBL) database. These data are also very complex covering changes in drug chemical structure and assay conditions like c0 = activity parameter (Ki, IC50, etc.), c1 = target protein, c2 = cell line, c3 = assay organism, etc., making difficult the analysis of these databases that are placed in the borders of a Big Data challenge. One of the aims of this work is to develop a computational model able to predict new GPCRs targeting drugs taking into consideration multiple conditions of assay. Another objective is to perform new predictive and experimental studies of selective 5-HTA2 receptor agonist, antagonist, or inverse agonist in human comparing the results with those from the literature. In this work, we combined Perturbation Theory (PT) and Machine Learning (ML) to seek a general PTML model for this data set. We analyzed 343 738 unique compounds with 812 072 end points (assay outcomes), with 185 different experimental parameters, 592 protein targets, 51 cell lines, and/or 55 organisms (species). The best PTML linear model found has three input variables only and predicted 56 202/58 653 positive outcomes (sensitivity = 95.8%) and 470 230/550 401 control cases (specificity = 85.4%) in training series. The model also predicted correctly 18 732/19 549 (95.8%) of positive outcomes and 156 739/183 469 (85.4%) of cases in external validation series. To illustrate its practical use, we used the model to predict the outcomes of six different 5-HT2A receptor drugs, namely, TCB-2, DOI, DOB, altanserin, pimavanserin, and nelotanserin, in a very large number of different pharmacological assays. 5-HT2A receptors are altered in schizophrenia and represent drug target for antipsychotic therapeutic activity. The model correctly predicted 93.83% (76 of 86) experimental results for these compounds reported in ChEMBL. Moreover, [35S]GTPγS binding assays were performed experimentally with the same six drugs with the aim of determining their potency and efficacy in the modulation of G-proteins in human brain tissue. The antagonist ketanserin was included as inactive drug with demonstrated affinity for 5-HT2A/C receptors. Our results demonstrate that some of these drugs, previously described as serotonin 5-HT2A receptor agonists, antagonists, or inverse agonists, are not so specific and show different intrinsic activity to that previously reported. Overall, this work opens a new gate for the prediction of GPCRs targeting compounds.
Collapse
Affiliation(s)
- Rebeca Diez-Alarcia
- Centro de Investigación Biomédica en Red en Salud Mental, 48940 Leioa, Spain
| | | | | | | | | | - J. Javier Meana
- Centro de Investigación Biomédica en Red en Salud Mental, 48940 Leioa, Spain
| | - Humbert González-Díaz
- Biophysics Institute, CSIC-UPV/EHU, University of the Basque Country UPV/EHU, Leioa, 48940, Spain
- IKERBASQUE, Basque Foundation for Science, 48011 Bilbao, Spain
| |
Collapse
|
13
|
Affiliation(s)
- Hanoch Senderowitz
- Department of Chemistry , Bar Ilan University , Ramat-Gan 5290002 , Israel
| | - Alexander Tropsha
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy , University of North Carolina , Chapel Hill , North Carolina 27599 , United States
| |
Collapse
|
14
|
Pérez-Parras Toledano J, García-Pedrajas N, Cerruela-García G. Multilabel and Missing Label Methods for Binary Quantitative Structure-Activity Relationship Models: An Application for the Prediction of Adverse Drug Reactions. J Chem Inf Model 2019; 59:4120-4130. [PMID: 31514503 DOI: 10.1021/acs.jcim.9b00611] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
The prediction of adverse drug reactions in the discovery of new medicines is highly challenging. In the task of predicting the adverse reactions of chemical compounds, information about different targets is often available. Although we can focus on every adverse drug reaction prediction separately, multilabel approaches have been proven useful in many research areas for taking advantage of the relationship among the targets. However, when approaching the prediction problem from a multilabel point of view, we have to deal with the lack of information for some labels. This missing labels problem is a relevant issue in the field of cheminformatics approaches. This paper aims to predict the adverse drug reaction of commercial drugs using a multilabel approach where the possible presence of missing labels is also taken into consideration. We propose the use of multilabel methods to deal with the prediction of a large set of 27 different adverse reaction targets. We also propose the use of multilabel methods specifically designed to deal with the missing labels problem to test their ability to solve this difficult problem. The results show the validity of the proposed approach, demonstrating a superior performance of the multilabel method compared with the single-label approach in addressing the problem of adverse drug reaction prediction.
Collapse
Affiliation(s)
- José Pérez-Parras Toledano
- University of Córdoba , Department of Computing and Numerical Analysis, Campus de Rabanales , Albert Einstein Building , E-14071 Córdoba , Spain
| | - Nicolás García-Pedrajas
- University of Córdoba , Department of Computing and Numerical Analysis, Campus de Rabanales , Albert Einstein Building , E-14071 Córdoba , Spain
| | - Gonzalo Cerruela-García
- University of Córdoba , Department of Computing and Numerical Analysis, Campus de Rabanales , Albert Einstein Building , E-14071 Córdoba , Spain
| |
Collapse
|
15
|
Ma R, Liu Z, Zhang Q, Liu Z, Luo T. Evaluating Polymer Representations via Quantifying Structure-Property Relationships. J Chem Inf Model 2019; 59:3110-3119. [PMID: 31268306 DOI: 10.1021/acs.jcim.9b00358] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Machine learning techniques are being applied in quantifying structure-property relationships for a wide variety of materials, where the properly represented materials play key roles. Although algorithms for representation learning are extensively studied, their applications to domain-specific areas, such as polymers, are limited largely due to the lack of benchmark databases. In this work, we investigate different types of polymer representations, including Morgan fingerprint (MF), molecular embedding (ME), and molecular graph (MG), based on the benchmark database from a subset of the well-known web-based polymer databases, PolyInfo. We evaluate the quality of different polymer representations via quantifying the relationships between the representations and polymer properties, including density, melting temperature, and glass transition temperature. Different representation learning schemes for MEs, such as supervised learning, semisupervised learning, and transfer learning, are investigated. In supervised learning, only labeled molecules in our benchmark database are used for representation learning, in semisupervised learning, both labeled and unlabeled molecules in our benchmark database are used, and in transfer learning, molecules from an external database that is different from the benchmark database are used for representation learning. It is found that ME (with the R2 of 0.724 in the density case, 0.684 in the melting temperature case, and 0.865 in the glass transition temperature case) outperforms the other representations for structure-property relationship quantification in all cases studied, and MG (with the R2 of 0.260 in the density case, -0.149 in the melting temperature case, and 0.711 in the glass transition case) is shown to be much inferior to ME and MF (with the R2 of 0.562 in the density case, 0.645 in the melting temperature case, and 0.849 in the glass transition case), likely due to the relatively small volumes of training data available. For MEs, it is found that the similarities of substructure MEs under different learning schemes (e.g., SL, SSL, and TL) are differently estimated, thus leading to different performance scores in structure-property relation quantification. Combinations of MEs show little effect on predictive performance when comparing to the single MEs in the corresponding regression tasks, proving no information gain in mixing MEs.
Collapse
Affiliation(s)
- Ruimin Ma
- Department of Aerospace and Mechanical Engineering , University of Notre Dame , Notre Dame , Indiana 46556 , United States
| | - Zeyu Liu
- Department of Aerospace and Mechanical Engineering , University of Notre Dame , Notre Dame , Indiana 46556 , United States
| | - Quanwei Zhang
- Department of Aerospace and Mechanical Engineering , University of Notre Dame , Notre Dame , Indiana 46556 , United States
| | - Zhiyu Liu
- Department of Aerospace and Mechanical Engineering , University of Notre Dame , Notre Dame , Indiana 46556 , United States
| | - Tengfei Luo
- Department of Aerospace and Mechanical Engineering , University of Notre Dame , Notre Dame , Indiana 46556 , United States.,Department of Chemical and Biomolecular Engineering , University of Notre Dame , Notre Dame , Indiana 46556 , United States
| |
Collapse
|
16
|
Speck-Planche A. Combining Ensemble Learning with a Fragment-Based Topological Approach To Generate New Molecular Diversity in Drug Discovery: In Silico Design of Hsp90 Inhibitors. ACS OMEGA 2018; 3:14704-14716. [PMID: 30555986 PMCID: PMC6289491 DOI: 10.1021/acsomega.8b02419] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/18/2018] [Accepted: 10/23/2018] [Indexed: 05/05/2023]
Abstract
Machine learning methods have revolutionized modern science, providing fast and accurate solutions to multiple problems. However, they are commonly treated as "black boxes". Therefore, in important scientific fields such as medicinal chemistry and drug discovery, machine learning methods are restricted almost exclusively to the task of performing predictions of large and heterogeneous data sets of chemicals. The lack of interpretability prevents the full exploitation of the machine learning models as generators of new chemical knowledge. This work focuses on the development of an ensemble learning model for the prediction and design of potent dual heat shock protein 90 (Hsp90) inhibitors. The model displays accuracy higher than 80% in both training and test sets. To use the ensemble model as a generator of new chemical knowledge, three steps were followed. First, a physicochemical and/or structural interpretation was provided for each molecular descriptor present in the ensemble learning model. Second, the term "pseudolinear equation" was introduced within the context of machine learning to calculate the relative quantitative contributions of different molecular fragments to the inhibitory activity against the two Hsp90 isoforms studied here. Finally, by assembling the fragments with positive contributions, new molecules were designed, being predicted as potent Hsp90 inhibitors. According to Lipinski's rule of five, the designed molecules were found to exhibit potentially good oral bioavailability, a primordial property that chemicals must have to pass early stages in drug discovery. The present approach based on the combination of ensemble learning and fragment-based topological design holds great promise in drug discovery, and it can be adapted and applied to many different scientific disciplines.
Collapse
|