1
|
Cysewski P, Jeliński T, Przybyłek M. Exploration of the Solubility Hyperspace of Selected Active Pharmaceutical Ingredients in Choline- and Betaine-Based Deep Eutectic Solvents: Machine Learning Modeling and Experimental Validation. Molecules 2024; 29:4894. [PMID: 39459262 PMCID: PMC11510433 DOI: 10.3390/molecules29204894] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2024] [Revised: 10/13/2024] [Accepted: 10/14/2024] [Indexed: 10/28/2024] Open
Abstract
Deep eutectic solvents (DESs) are popular green media used for various industrial, pharmaceutical, and biomedical applications. However, the possible compositions of eutectic systems are so numerous that it is impossible to study all of them experimentally. To remedy this limitation, the solubility landscape of selected active pharmaceutical ingredients (APIs) in choline chloride- and betaine-based deep eutectic solvents was explored using theoretical models based on machine learning. The available solubility data for the selected APIs, comprising a total of 8014 data points, were collected for the available neat solvents, binary solvent mixtures, and DESs. This set was augmented with new measurements for the popular sulfa drugs in dry DESs. The descriptors used in the machine learning protocol were obtained from the σ-profiles of the considered molecules computed within the COSMO-RS framework. A combination of six sets of descriptors and 36 regressors were tested. Taking into account both accuracy and generalization, it was concluded that the best regressor is nuSVR regressor-based predictive models trained using the relative intermolecular interactions and a twelve-step averaged simplification of the relative σ-profiles.
Collapse
Affiliation(s)
- Piotr Cysewski
- Department of Physical Chemistry, Pharmacy Faculty, Collegium Medicum of Bydgoszcz, Nicolaus Copernicus University in Toruń, Kurpińskiego 5, 85-096 Bydgoszcz, Poland; (T.J.); (M.P.)
| | | | | |
Collapse
|
2
|
Jeliński T, Przybyłek M, Różalski R, Romanek K, Wielewski D, Cysewski P. Tuning Ferulic Acid Solubility in Choline-Chloride- and Betaine-Based Deep Eutectic Solvents: Experimental Determination and Machine Learning Modeling. Molecules 2024; 29:3841. [PMID: 39202918 PMCID: PMC11357058 DOI: 10.3390/molecules29163841] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2024] [Revised: 08/04/2024] [Accepted: 08/09/2024] [Indexed: 09/03/2024] Open
Abstract
Deep eutectic solvents (DES) represent a promising class of green solvents, offering particular utility in the extraction and development of new formulations of natural compounds such as ferulic acid (FA). The experimental phase of the study undertook a systematic investigation of the solubility of FA in DES, comprising choline chloride or betaine as hydrogen bond acceptors and six different polyols as hydrogen bond donors. The results demonstrated that solvents based on choline chloride were more effective than those based on betaine. The optimal ratio of hydrogen bond acceptors to donors was found to be 1:2 molar. The addition of water to the DES resulted in a notable enhancement in the solubility of FA. Among the polyols tested, triethylene glycol was the most effective. Hence, DES composed of choline chloride and triethylene glycol (TEG) (1:2) with added water in a 0.3 molar ration is suggested as an efficient alternative to traditional organic solvents like DMSO. In the second part of this report, the affinities of FA in saturated solutions were computed for solute-solute and all solute-solvent pairs. It was found that self-association of FA leads to a cyclic structure of the C28 type, common among carboxylic acids, which is the strongest type of FA affinity. On the other hand, among all hetero-molecular bi-complexes, the most stable is the FA-TEG pair, which is an interesting congruency with the high solubility of FA in TEG containing liquids. Finally, this work combined COSMO-RS modeling with machine learning for the development of a model predicting ferulic acid solubility in a wide range of solvents, including not only DES but also classical neat and binary mixtures. A machine learning protocol developed a highly accurate model for predicting FA solubility, significantly outperforming the COSMO-RS approach. Based on the obtained results, it is recommended to use the support vector regressor (SVR) for screening new dissolution media as it is not only accurate but also has sound generalization to new systems.
Collapse
Affiliation(s)
- Tomasz Jeliński
- Department of Physical Chemistry, Faculty of Pharmacy, Collegium Medicum in Bydgoszcz, Nicolaus Copernicus University in Toruń, Kurpińskiego 5, 85-096 Bydgoszcz, Poland; (T.J.); (M.P.)
| | - Maciej Przybyłek
- Department of Physical Chemistry, Faculty of Pharmacy, Collegium Medicum in Bydgoszcz, Nicolaus Copernicus University in Toruń, Kurpińskiego 5, 85-096 Bydgoszcz, Poland; (T.J.); (M.P.)
| | - Rafał Różalski
- Department of Clinical Biochemistry, Faculty of Pharmacy, Collegium Medicum in Bydgoszcz, Nicolaus Copernicus University in Toruń, Karłowicza 24, 85-950 Bydgoszcz, Poland;
| | - Karolina Romanek
- Department of Physical Chemistry, Faculty of Pharmacy, Collegium Medicum in Bydgoszcz, Nicolaus Copernicus University in Toruń, Kurpińskiego 5, 85-096 Bydgoszcz, Poland; (T.J.); (M.P.)
| | - Daniel Wielewski
- Department of Physical Chemistry, Faculty of Pharmacy, Collegium Medicum in Bydgoszcz, Nicolaus Copernicus University in Toruń, Kurpińskiego 5, 85-096 Bydgoszcz, Poland; (T.J.); (M.P.)
| | - Piotr Cysewski
- Department of Physical Chemistry, Faculty of Pharmacy, Collegium Medicum in Bydgoszcz, Nicolaus Copernicus University in Toruń, Kurpińskiego 5, 85-096 Bydgoszcz, Poland; (T.J.); (M.P.)
| |
Collapse
|
3
|
Ramani V, Karmakar T. Graph Neural Networks for Predicting Solubility in Diverse Solvents Using MolMerger Incorporating Solute-Solvent Interactions. J Chem Theory Comput 2024. [PMID: 39041858 DOI: 10.1021/acs.jctc.4c00382] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/24/2024]
Abstract
The prediction of solubility is a complex and challenging physicochemical problem that has tremendous implications for the chemical and pharmaceutical industry. Recent advancements in machine learning methods have provided a great scope for predicting the reliable solubility of a large number of molecular systems. However, most of these methods rely on using physical properties obtained from experiments and expensive quantum chemical calculations. Here, we developed a method that utilizes a graphical representation of solute-solvent interactions using "MolMerger," which captures the strongest polar interactions between molecules using Gasteiger charges and creates a graph incorporating the true nature of the system. Using these graphs as input, a neural network learns the correlation between the structural properties of a molecule in the form of node embedding and its physicochemical properties as the output. This approach has been used to calculate molecular solubility by predicting the Log solubility values of various organic molecules and pharmaceuticals in diverse sets of solvents.
Collapse
Affiliation(s)
- Vansh Ramani
- Department of Chemical Engineering, Indian Institute of Technology, Delhi, Hauz Khas, New Delhi 110016, India
| | - Tarak Karmakar
- Department of Chemistry, Indian Institute of Technology, Delhi, Hauz Khas, New Delhi 110016, India
| |
Collapse
|
4
|
Cysewski P, Jeliński T, Przybyłek M, Mai A, Kułak J. Experimental and Machine-Learning-Assisted Design of Pharmaceutically Acceptable Deep Eutectic Solvents for the Solubility Improvement of Non-Selective COX Inhibitors Ibuprofen and Ketoprofen. Molecules 2024; 29:2296. [PMID: 38792157 PMCID: PMC11124057 DOI: 10.3390/molecules29102296] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2024] [Revised: 05/09/2024] [Accepted: 05/12/2024] [Indexed: 05/26/2024] Open
Abstract
Deep eutectic solvents (DESs) are commonly used in pharmaceutical applications as excellent solubilizers of active substances. This study investigated the tuning of ibuprofen and ketoprofen solubility utilizing DESs containing choline chloride or betaine as hydrogen bond acceptors and various polyols (ethylene glycol, diethylene glycol, triethylene glycol, glycerol, 1,2-propanediol, 1,3-butanediol) as hydrogen bond donors. Experimental solubility data were collected for all DES systems. A machine learning model was developed using COSMO-RS molecular descriptors to predict solubility. All studied DESs exhibited a cosolvency effect, increasing drug solubility at modest concentrations of water. The model accurately predicted solubility for ibuprofen, ketoprofen, and related analogs (flurbiprofen, felbinac, phenylacetic acid, diphenylacetic acid). A machine learning approach utilizing COSMO-RS descriptors enables the rational design and solubility prediction of DES formulations for improved pharmaceutical applications.
Collapse
Affiliation(s)
- Piotr Cysewski
- Department of Physical Chemistry, Pharmacy Faculty, Collegium Medicum of Bydgoszcz, Nicolaus Copernicus University in Toruń, Kurpińskiego 5, 85-096 Bydgoszcz, Poland; (T.J.); (M.P.)
| | | | | | | | | |
Collapse
|
5
|
Kumar N, Acharya V. Advances in machine intelligence-driven virtual screening approaches for big-data. Med Res Rev 2024; 44:939-974. [PMID: 38129992 DOI: 10.1002/med.21995] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2022] [Revised: 07/15/2023] [Accepted: 10/29/2023] [Indexed: 12/23/2023]
Abstract
Virtual screening (VS) is an integral and ever-evolving domain of drug discovery framework. The VS is traditionally classified into ligand-based (LB) and structure-based (SB) approaches. Machine intelligence or artificial intelligence has wide applications in the drug discovery domain to reduce time and resource consumption. In combination with machine intelligence algorithms, VS has emerged into revolutionarily progressive technology that learns within robust decision orders for data curation and hit molecule screening from large VS libraries in minutes or hours. The exponential growth of chemical and biological data has evolved as "big-data" in the public domain demands modern and advanced machine intelligence-driven VS approaches to screen hit molecules from ultra-large VS libraries. VS has evolved from an individual approach (LB and SB) to integrated LB and SB techniques to explore various ligand and target protein aspects for the enhanced rate of appropriate hit molecule prediction. Current trends demand advanced and intelligent solutions to handle enormous data in drug discovery domain for screening and optimizing hits or lead with fewer or no false positive hits. Following the big-data drift and tremendous growth in computational architecture, we presented this review. Here, the article categorized and emphasized individual VS techniques, detailed literature presented for machine learning implementation, modern machine intelligence approaches, and limitations and deliberated the future prospects.
Collapse
Affiliation(s)
- Neeraj Kumar
- Artificial Intelligence for Computational Biology Lab (AICoB), Biotechnology Division, CSIR-Institute of Himalayan Bioresource Technology, Palampur, Himachal Pradesh, India
- Academy of Scientific and Innovative Research, Ghaziabad, India
| | - Vishal Acharya
- Artificial Intelligence for Computational Biology Lab (AICoB), Biotechnology Division, CSIR-Institute of Himalayan Bioresource Technology, Palampur, Himachal Pradesh, India
- Academy of Scientific and Innovative Research, Ghaziabad, India
| |
Collapse
|
6
|
Llompart P, Minoletti C, Baybekov S, Horvath D, Marcou G, Varnek A. Will we ever be able to accurately predict solubility? Sci Data 2024; 11:303. [PMID: 38499581 PMCID: PMC10948805 DOI: 10.1038/s41597-024-03105-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2023] [Accepted: 02/29/2024] [Indexed: 03/20/2024] Open
Abstract
Accurate prediction of thermodynamic solubility by machine learning remains a challenge. Recent models often display good performances, but their reliability may be deceiving when used prospectively. This study investigates the origins of these discrepancies, following three directions: a historical perspective, an analysis of the aqueous solubility dataverse and data quality. We investigated over 20 years of published solubility datasets and models, highlighting overlooked datasets and the overlaps between popular sets. We benchmarked recently published models on a novel curated solubility dataset and report poor performances. We also propose a workflow to cure aqueous solubility data aiming at producing useful models for bench chemist. Our results demonstrate that some state-of-the-art models are not ready for public usage because they lack a well-defined applicability domain and overlook historical data sources. We report the impact of factors influencing the utility of the models: interlaboratory standard deviation, ionic state of the solute and data sources. The herein obtained models, and quality-assessed datasets are publicly available.
Collapse
Affiliation(s)
- P Llompart
- Laboratory of Chemoinformatics, UMR7140, University of Strasbourg, Strasbourg, France
- IDD/CADD, Sanofi, Vitry-Sur-Seine, France
| | | | - S Baybekov
- Laboratory of Chemoinformatics, UMR7140, University of Strasbourg, Strasbourg, France
| | - D Horvath
- Laboratory of Chemoinformatics, UMR7140, University of Strasbourg, Strasbourg, France
| | - G Marcou
- Laboratory of Chemoinformatics, UMR7140, University of Strasbourg, Strasbourg, France.
| | - A Varnek
- Laboratory of Chemoinformatics, UMR7140, University of Strasbourg, Strasbourg, France
| |
Collapse
|
7
|
Mouhcine M, Kadil Y, Segmani I, Rahmoune I, Filali H. In silico Exploration of a Novel ICMT Inhibitor with More Solubility than Cysmethynil against Membrane Localization of KRAS Mutant in Colorectal Cancer. Curr Comput Aided Drug Des 2024; 20:1055-1069. [PMID: 38835128 DOI: 10.2174/0115734099264451231003172217] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2023] [Revised: 08/14/2023] [Accepted: 08/31/2023] [Indexed: 06/06/2024]
Abstract
BACKGROUND ICMT (isoprenylcysteine carboxyl methyltransferase) is an enzyme that plays a key role in the post-translational modification of the K-Ras protein. The carboxyl methylation of this protein by ICMT is important for its proper localization and function. Cysmethynil (2-[5-(3-methylphenyl)-l-octyl-lH-indolo-3-yl] acetamide) causes K-Ras mislocalization and interrupts pathways that control cancer cell growth and division through inhibition of ICMT, but its poor water solubility makes it difficult and impractical for clinical use. This indicates that relatively high amounts of cysmethynil would be required to achieve an effective dose, which could result in significant adverse effects in patients. OBJECTIVE The general objective of this work was to find virtually new compounds that present high solubility in water and are similar to the pharmacological activity of cysmethynil. MATERIALS AND METHODS Pharmacophore modeling, pharmacophore-based virtual screening, prediction of ADMET properties (absorption, distribution, metabolism, excretion, and toxicity), and water solubility were performed to recover a water-soluble molecule that shares the same chemical characteristics as cysmethynil using Discovery Studio v16.1.0 (DS16.1), SwissADME server, and pkCSM server. RESULTS In this study, ten pharmacophore model hypotheses were generated by exploiting the characteristics of cysmethynil. The pharmacophore model validated by the set test method was used to screen the "Elite Library®" and "Synergy Library" databases of Asinex. Only 1533 compounds corresponding to all the characteristics of the pharmacophore were retained. Then, the aqueous solubility in water at 25°C of these 1533 compounds was predicted by the Cheng and Merz model. Among these 1533 compounds, two had the optimal water solubility. Finally, the ADMET properties and Log S water solubility by three models (ESOL, Ali, and SILICOS-IT) of the two compounds and cysmethynil were compared, resulting in compound 2 as a potential inhibitor of ICMT. CONCLUSION According to the results obtained, the identified compound presented a high solubility in water and could be similar to the pharmacological activity of cysmethynil.
Collapse
Affiliation(s)
- Mohammed Mouhcine
- Laboratory of Pharmacology and Toxicology, Faculty of Medicine and Pharmacy of Casablanca, Hassan II University of Casablanca, Casablanca, Morocco
| | - Youness Kadil
- Laboratory of Pharmacology and Toxicology, Faculty of Medicine and Pharmacy of Casablanca, Hassan II University of Casablanca, Casablanca, Morocco
| | - Ibtihal Segmani
- Laboratory of Pharmacology and Toxicology, Faculty of Medicine and Pharmacy of Casablanca, Hassan II University of Casablanca, Casablanca, Morocco
| | - Imane Rahmoune
- Laboratory of Pharmacology and Toxicology, Faculty of Medicine and Pharmacy of Casablanca, Hassan II University of Casablanca, Casablanca, Morocco
| | - Houda Filali
- Laboratory of Pharmacology and Toxicology, Faculty of Medicine and Pharmacy of Casablanca, Hassan II University of Casablanca, Casablanca, Morocco
| |
Collapse
|
8
|
Tran TTV, Tayara H, Chong KT. Recent Studies of Artificial Intelligence on In Silico Drug Absorption. J Chem Inf Model 2023; 63:6198-6211. [PMID: 37819031 DOI: 10.1021/acs.jcim.3c00960] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/13/2023]
Abstract
Absorption is an important area of research in pharmacochemistry and drug development, because the drug has to be absorbed before any drug effects can occur. Furthermore, the ADMET (Absorption, Distribution, Metabolism, Excretion, and Toxicity) profile of drugs can be directly and considerably altered by modulating factors affecting absorption. Many drugs in development fail because of poor absorption. The research and continuous efforts of researchers in recent years have brought many successes and promises in drug absorption property prediction, especially in silico, which helps to reduce the time and cost significantly for screening undesirable drug candidates. In this report, we explicitly provide an overview of recent in silico studies on predicting absorption properties, especially from 2019 to the present, using artificial intelligence. Additionally, we have collected and investigated public databases that support absorption prediction research. On those grounds, we also proposed the challenges and development directions of absorption prediction in the future. We hope this review can provide researchers with valuable guidelines on absorption prediction to facilitate the development of newer approaches in drug discovery.
Collapse
Affiliation(s)
- Thi Tuyet Van Tran
- Department of Electronics and Information Engineering, Jeonbuk National University, Jeonju 54896, Republic of Korea
- Faculty of Information Technology, An Giang University, Long Xuyen 880000, Vietnam
- Vietnam National University, Ho Chi Minh City, Ho Chi Minh 700000, Vietnam
| | - Hilal Tayara
- School of International Engineering and Science, Jeonbuk National University, Jeonju 54896, Republic of Korea
| | - Kil To Chong
- Advances Electronics and Information Research Center, Jeonbuk National University, Jeonju 54896, Republic of Korea
| |
Collapse
|
9
|
Silva F, Veiga F, Paulo Jorge Rodrigues S, Cardoso C, Cláudia Paiva-Santos A. COSMO Models for the Pharmaceutical Development of Parenteral Drug Formulations. Eur J Pharm Biopharm 2023; 187:156-165. [PMID: 37120066 DOI: 10.1016/j.ejpb.2023.04.019] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2023] [Revised: 03/31/2023] [Accepted: 04/21/2023] [Indexed: 05/01/2023]
Abstract
The aqueous solubility of active pharmaceutical ingredients is one of the most important features to be considered during the development of parenteral formulations in the pharmaceutical industry. Computational modelling has become in the last years an integral part of pharmaceutical development. In this context, ab initio computational models, such as COnductor-like Screening MOdel (COSMO), have been proposed as promising tools for the prediction of results without the effective use of resources. Nevertheless, despite the clear evaluation of computational resources, some authors had not achieved satisfying results and new calculations and algorithms have been proposed over the years to improve the outcomes. In the development and production of aqueous parenteral formulations, the solubility of Active Pharmaceutical Ingredients (APIs) in an aqueous and biocompatible vehicle is a decisive step. This work aims to study the hypothesis that COSMO models could be useful in the development of new parenteral formulations, mainly aqueous ones.
Collapse
Affiliation(s)
- Fernando Silva
- Department of Pharmaceutical Technology, Faculty of Pharmacy of the University of Coimbra, University of Coimbra, Coimbra, Portugal; REQUIMTE/LAQV, Group of Pharmaceutical Technology, Faculty of Pharmacy of the University of Coimbra, University of Coimbra, Coimbra, Portugal.
| | - Francisco Veiga
- Department of Pharmaceutical Technology, Faculty of Pharmacy of the University of Coimbra, University of Coimbra, Coimbra, Portugal; REQUIMTE/LAQV, Group of Pharmaceutical Technology, Faculty of Pharmacy of the University of Coimbra, University of Coimbra, Coimbra, Portugal
| | - Sérgio Paulo Jorge Rodrigues
- Coimbra Chemistry Centre, Chemistry Department, Faculty of Sciences and Technology of the University of Coimbra of the University of Coimbra, Coimbra, Portugal
| | - Catarina Cardoso
- Laboratórios Basi, Parque Industrial Manuel Lourenço Ferreira, lote 15, 3450-232 Mortágua, Portugal
| | - Ana Cláudia Paiva-Santos
- Department of Pharmaceutical Technology, Faculty of Pharmacy of the University of Coimbra, University of Coimbra, Coimbra, Portugal; REQUIMTE/LAQV, Group of Pharmaceutical Technology, Faculty of Pharmacy of the University of Coimbra, University of Coimbra, Coimbra, Portugal
| |
Collapse
|
10
|
Maeshima T, Yoshida S, Watanabe M, Itagaki F. Prediction model for milk transfer of drugs by primarily evaluating the area under the curve using QSAR/QSPR. Pharm Res 2023; 40:711-719. [PMID: 36720832 PMCID: PMC10036427 DOI: 10.1007/s11095-023-03477-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2022] [Accepted: 01/25/2023] [Indexed: 02/02/2023]
Abstract
PURPOSE Information on milk transferability of drugs is important for patients who wish to breastfeed. The purpose of this study is to develop a prediction model for milk-to-plasma drug concentration ratio based on area under the curve (M/PAUC). The quantitative structure-activity/property relationship (QSAR/QSPR) approach was used to predict compounds involved in active transport during milk transfer. METHODS We collected M/P ratio data from literature, which were curated and divided into M/PAUC ≥ 1 and M/PAUC < 1. Using the ADMET Predictor® and ADMET Modeler™, we constructed two types of binary classification models: an artificial neural network (ANN) and a support vector machine (SVM). RESULTS M/P ratios of 403 compounds were collected, M/PAUC data were obtained for 173 compounds, while 230 compounds only had M/Pnon-AUC values reported. The models were constructed using 129 of the 173 compounds, excluding colostrum data. The sensitivity of the ANN model was 0.969 for the training set and 0.833 for the test set, while the sensitivity of the SVM model was 0.971 for the training set and 0.667 for the test set. The contribution of the charge-based descriptor was high in both models. CONCLUSIONS We built a M/PAUC prediction model using QSAR/QSPR. These predictive models can play an auxiliary role in evaluating the milk transferability of drugs.
Collapse
Affiliation(s)
- Tae Maeshima
- Department of Clinical & Pharmaceutical Sciences, Faculty of Pharma Science, Teikyo University, Itabashi-Ku, Tokyo, 173-8605, Japan
| | - Shin Yoshida
- Department of Clinical & Pharmaceutical Sciences, Faculty of Pharma Science, Teikyo University, Itabashi-Ku, Tokyo, 173-8605, Japan
| | - Machiko Watanabe
- Department of Clinical & Pharmaceutical Sciences, Faculty of Pharma Science, Teikyo University, Itabashi-Ku, Tokyo, 173-8605, Japan
| | - Fumio Itagaki
- Department of Clinical & Pharmaceutical Sciences, Faculty of Pharma Science, Teikyo University, Itabashi-Ku, Tokyo, 173-8605, Japan.
| |
Collapse
|
11
|
Thermodynamic Correlation between Liquid-Liquid Phase Separation and Crystalline Solubility of Drug-Like Molecules. Pharmaceutics 2022; 14:pharmaceutics14122560. [PMID: 36559054 PMCID: PMC9782016 DOI: 10.3390/pharmaceutics14122560] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2022] [Revised: 11/21/2022] [Accepted: 11/21/2022] [Indexed: 11/25/2022] Open
Abstract
The purpose of the present study was to experimentally confirm the thermodynamic correlation between the intrinsic liquid−liquid phase separation (LLPS) concentration (S0LLPS) and crystalline solubility (S0c) of drug-like molecules. Based on the thermodynamic principles, the crystalline solubility LLPS concentration melting point (Tm) equation (CLME) was derived (log10S0C=log10S0LLPS−0.0095Tm−310 for 310 K). The S0LLPS values of 31 drugs were newly measured by simple bulk phase pH-shift or solvent-shift precipitation tests coupled with laser-assisted visual turbidity detection. To ensure the precipitant was not made crystalline at <10 s, the precipitation tests were also performed under the polarized light microscope. The calculated and observed log10S0C values showed a good correlation (root mean squared error: 0.40 log unit, absolute average error: 0.32 log unit).
Collapse
|
12
|
Xiouras C, Cameli F, Quilló GL, Kavousanakis ME, Vlachos DG, Stefanidis GD. Applications of Artificial Intelligence and Machine Learning Algorithms to Crystallization. Chem Rev 2022; 122:13006-13042. [PMID: 35759465 DOI: 10.1021/acs.chemrev.2c00141] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
Artificial intelligence and specifically machine learning applications are nowadays used in a variety of scientific applications and cutting-edge technologies, where they have a transformative impact. Such an assembly of statistical and linear algebra methods making use of large data sets is becoming more and more integrated into chemistry and crystallization research workflows. This review aims to present, for the first time, a holistic overview of machine learning and cheminformatics applications as a novel, powerful means to accelerate the discovery of new crystal structures, predict key properties of organic crystalline materials, simulate, understand, and control the dynamics of complex crystallization process systems, as well as contribute to high throughput automation of chemical process development involving crystalline materials. We critically review the advances in these new, rapidly emerging research areas, raising awareness in issues such as the bridging of machine learning models with first-principles mechanistic models, data set size, structure, and quality, as well as the selection of appropriate descriptors. At the same time, we propose future research at the interface of applied mathematics, chemistry, and crystallography. Overall, this review aims to increase the adoption of such methods and tools by chemists and scientists across industry and academia.
Collapse
Affiliation(s)
- Christos Xiouras
- Chemical Process R&D, Crystallization Technology Unit, Janssen R&D, Turnhoutseweg 30, 2340 Beerse, Belgium
| | - Fabio Cameli
- Department of Chemical and Biomolecular Engineering, University of Delaware, 150 Academy Street, Newark, Delaware 19716, United States
| | - Gustavo Lunardon Quilló
- Chemical Process R&D, Crystallization Technology Unit, Janssen R&D, Turnhoutseweg 30, 2340 Beerse, Belgium.,Chemical and BioProcess Technology and Control, Department of Chemical Engineering, Faculty of Engineering Technology, KU Leuven, Gebroeders de Smetstraat 1, 9000 Ghent, Belgium
| | - Mihail E Kavousanakis
- School of Chemical Engineering, National Technical University of Athens, Heroon Polytechniou 9, 15780 Zografou, Greece
| | - Dionisios G Vlachos
- Department of Chemical and Biomolecular Engineering, University of Delaware, 150 Academy Street, Newark, Delaware 19716, United States
| | - Georgios D Stefanidis
- School of Chemical Engineering, National Technical University of Athens, Heroon Polytechniou 9, 15780 Zografou, Greece.,Laboratory for Chemical Technology, Ghent University; Tech Lane Ghent Science Park 125, B-9052 Ghent, Belgium
| |
Collapse
|
13
|
In Vitro and In Silico Study of Analogs of Plant Product Plastoquinone to Be Effective in Colorectal Cancer Treatment. MOLECULES (BASEL, SWITZERLAND) 2022; 27:molecules27030693. [PMID: 35163957 PMCID: PMC8839215 DOI: 10.3390/molecules27030693] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/13/2021] [Revised: 01/15/2022] [Accepted: 01/18/2022] [Indexed: 02/06/2023]
Abstract
Plants have paved the way for the attainment of molecules with a wide-range of biological activities. However, plant products occasionally show low biological activities and/or poor pharmacokinetic properties. In that case, development of their derivatives as drugs from the plant world has been actively performed. As plant products, plastoquinones (PQs) have been of high importance in anticancer drug design and discovery; we have previously evaluated and reported the potential cytotoxic effects of a series of PQ analogs. Among these analogs, PQ2, PQ3 and PQ10 were selected for National Cancer Institute (NCI) for in vitro screening of anticancer activity against a wide range of cancer cell lines. The apparent superior anticancer potency of PQ2 on the HCT-116 colorectal cancer cell line than that of PQ3 and PQ10 compared to other tested cell lines has encouraged us to perform further mechanistic studies to enlighten the mode of anti-colorectal cancer action of PQ2. For this purpose, its apoptotic effects on the HCT-116 cell line, DNA binding capacity and several crucial pharmacokinetic properties were investigated. Initially, MTT assay was conducted for PQ2 at different concentrations against HCT-116 cells. Results indicated that PQ2 exhibited significant cytotoxicity in HCT-116 cells with an IC50 value of 4.97 ± 1.93 μM compared to cisplatin (IC50 = 26.65 ± 7.85 μM). Moreover, apoptotic effects of PQ2 on HCT-116 cells were investigated by the annexin V/ethidium homodimer III staining method and PQ2 significantly induced apoptosis in HCT-116 cells compared to cisplatin. Based on the potent DNA cleavage capacity of PQ2, molecular docking studies were conducted in the minor groove of the double helix of DNA and PQ2 presented a key hydrogen bonding through its methoxy moiety. Overall, both in vitro and in silico studies indicated that effective, orally bioavailable drug-like PQ2 attracted attention for colorectal cancer treatment. The most important point to emerge from this study is that appropriate derivatization of a plant product leads to unique biologically active compounds.
Collapse
|
14
|
Tosca EM, Bartolucci R, Magni P. Application of Artificial Neural Networks to Predict the Intrinsic Solubility of Drug-Like Molecules. Pharmaceutics 2021; 13:1101. [PMID: 34371792 PMCID: PMC8309152 DOI: 10.3390/pharmaceutics13071101] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2021] [Revised: 07/15/2021] [Accepted: 07/16/2021] [Indexed: 11/25/2022] Open
Abstract
Machine learning (ML) approaches are receiving increasing attention from pharmaceutical companies and regulatory agencies, given their ability to mine knowledge from available data. In drug discovery, for example, they are employed in quantitative structure-property relationship (QSPR) models to predict biological properties from the chemical structure of a drug molecule. In this paper, following the Second Solubility Challenge (SC-2), a QSPR model based on artificial neural networks (ANNs) was built to predict the intrinsic solubility (logS0) of the 100-compound low-variance tight set and the 32-compound high-variance loose set provided by SC-2 as test datasets. First, a training dataset of 270 drug-like molecules with logS0 value experimentally determined was gathered from the literature. Then, a standard three-layer feed-forward neural network was defined by using 10 ChemGPS physico-chemical descriptors as input features. The developed ANN showed adequate predictive performances on both of the SC-2 test datasets. Benefits and limitations of ML approaches have been highlighted and discussed, starting from this case-study. The main findings confirmed that ML approaches are an attractive and promising tool to predict logS0; however, many aspects, such as data quality, molecular descriptor computation and selection, and assessment of applicability domain, are crucial but often neglected, and should be carefully considered to improve predictions based on ML.
Collapse
Affiliation(s)
| | | | - Paolo Magni
- Department of Electrical, Computer and Biomedical Engineering, University of Pavia, Via Ferrata 5, I-27100 Pavia, Italy; (E.M.T.); (R.B.)
| |
Collapse
|
15
|
Ciftci HI, Bayrak N, Yıldız M, Yıldırım H, Sever B, Tateishi H, Otsuka M, Fujita M, Tuyun AF. Design, synthesis and investigation of the mechanism of action underlying anti-leukemic effects of the quinolinequinones as LY83583 analogs. Bioorg Chem 2021; 114:105160. [PMID: 34328861 DOI: 10.1016/j.bioorg.2021.105160] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2021] [Revised: 06/29/2021] [Accepted: 07/06/2021] [Indexed: 12/14/2022]
Abstract
Literature conclusively shows that one of the quinolinequinone analogs (6-anilino-5,8-quinolinequinone), referred to as LY83583 hereafter, an inhibitor of guanylyl cyclase, was used as the inhibitor of the cell proliferation in cancer cells. In the present work, a series of analogs of the LY83583 containing alkoxy group(s) in aminophenyl ring (AQQ1-15) were designed and synthesized via a two-step route and evaluated for their in vitro cytotoxic activity against four different cancer cell lines (K562, Jurkat, MT-2, and HeLa) and human peripheral blood mononuclear cells (PBMCs) by MTT assay. The analog (AQQ13) was identified to possess the most potent cytotoxic activity against K562 human chronic myelogenous (CML) cell line (IC50 = 0.59 ± 0.07 μM) with significant selectivity (SI = 4.51) compared to imatinib (IC50 = 5.46 ± 0.85 μM; SI = 4.60). Based on its superior cytotoxic activity, the analog AQQ13 was selected for further mechanistic studies including determination of its apoptotic effects on K562 cell line via annexin V/ethidium homodimer III staining potency, ABL1 kinase inhibitory activity, and DNA cleaving capacity. Results ascertained that the analog AQQ13 induced apoptosis in K562 cell line with notable DNA-cleaving activity. However, AQQ13 demonstrated weak ABL1 inhibition indicating the correlation between anti-K562 and anti-ABL1 activities. In continuance, respectively conducted in silico molecular docking and Absorption, Distribution, Metabolism, and Excretion (ADME) studies drew attention to enhanced binding interactions of AQQ13 towards DNA and its high compatibility with the potential limits of specified pharmacokinetic parameters making it as a potential anti-leukemic drug candidate. Our findings may provide a new insight for further development of novel quinolinequinone-based anticancer analogs against CML.
Collapse
Affiliation(s)
- Halil I Ciftci
- Department of Drug Discovery, Science Farm Ltd., Kumamoto, Japan; Medicinal and Biological Chemistry Science Farm Joint Research Laboratory, School of Pharmacy, Kumamoto University, Kumamoto, Japan
| | - Nilüfer Bayrak
- Department of Chemistry, Faculty of Engineering, Istanbul University-Cerrahpasa, Avcilar, Istanbul, Turkey
| | - Mahmut Yıldız
- Chemistry Department, Gebze Technical University, Gebze, Kocaeli, Turkey
| | - Hatice Yıldırım
- Department of Chemistry, Faculty of Engineering, Istanbul University-Cerrahpasa, Avcilar, Istanbul, Turkey
| | - Belgin Sever
- Medicinal and Biological Chemistry Science Farm Joint Research Laboratory, School of Pharmacy, Kumamoto University, Kumamoto, Japan; Department of Pharmaceutical Chemistry, Faculty of Pharmacy, Anadolu University, Eskisehir, Turkey
| | - Hiroshi Tateishi
- Medicinal and Biological Chemistry Science Farm Joint Research Laboratory, School of Pharmacy, Kumamoto University, Kumamoto, Japan
| | - Masami Otsuka
- Department of Drug Discovery, Science Farm Ltd., Kumamoto, Japan; Medicinal and Biological Chemistry Science Farm Joint Research Laboratory, School of Pharmacy, Kumamoto University, Kumamoto, Japan
| | - Mikako Fujita
- Medicinal and Biological Chemistry Science Farm Joint Research Laboratory, School of Pharmacy, Kumamoto University, Kumamoto, Japan.
| | - Amaç Fatih Tuyun
- Department of Chemistry, Faculty of Science, Istanbul University, Fatih, Istanbul, Turkey.
| |
Collapse
|
16
|
Albayati S, Uba AI, Yelekçi K. Potential inhibitors of methionine aminopeptidase type II identified via structure-based pharmacophore modeling. Mol Divers 2021; 26:1005-1016. [PMID: 33846894 DOI: 10.1007/s11030-021-10221-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2021] [Accepted: 03/30/2021] [Indexed: 11/29/2022]
Abstract
Methionine aminopeptidase (MetAP2) is a metal-containing enzyme that removes initiator methionine from the N-terminus of a newly synthesized protein. Inhibition of the enzyme is crucial in diminishing cancer growth and metastasis. Fumagillin-a natural irreversible inhibitor of MetAP2-and its derivatives are used as potent MetAP2 inhibitors. However, because of their adverse effects, none of them has progressed to clinical studies. In search for potential reversible inhibitors, we built structure-based pharmacophore models using the crystal structure of MetAP2 complexed with fumagillin (PDB ID: 1BOA). The pharmacophore models were validated using Gunner-Henry scoring method. The best pharmacophore consisting of 1 H-bond donor, 1 H-bond acceptor, and 3 hydrophobic features was used to conduct pharmacophore-based virtual screening of ZINC15 database against MetAP2. The top 10 compounds with pharmacophore fit values > 3.00 were selected for further analysis. These compounds were subjected to absorption, distribution, metabolism, elimination, and toxicity (ADMET) prediction and found to have druglike properties. Furthermore, molecular docking calculations was performed on these hits using AutoDock4 to predict their binding mode and binding energy. Three diverse compounds: ZINC000014903160, ZINC000040174591, and ZINC000409110720 with respective binding energy/docking scores of - 9.22, - 9.21, and -817 kcal/mol, were submitted to 100 ns (MD) simulations using Nanoscale MD (NAMD) software. The compounds showed stable binding mode over time. Therefore, they may serve as a scaffold for further computational and experimental optimization toward the design of more potent and safer MetAP2 inhibitors.
Collapse
Affiliation(s)
- Safana Albayati
- Department of Bioinformatics and Genetics, Faculty of Engineering and Natural Science, Kadir Has University, 34083 Cibali Campus Fatih, Istanbul, Turkey
| | - Abdullahi Ibrahim Uba
- Department of Bioinformatics and Genetics, Faculty of Engineering and Natural Science, Kadir Has University, 34083 Cibali Campus Fatih, Istanbul, Turkey.,Complex Systems Division, Beijing Computational Science Research Center, Beijing, 100193, China
| | - Kemal Yelekçi
- Department of Bioinformatics and Genetics, Faculty of Engineering and Natural Science, Kadir Has University, 34083 Cibali Campus Fatih, Istanbul, Turkey.
| |
Collapse
|
17
|
Sorkun MC, Koelman JVA, Er S. Pushing the limits of solubility prediction via quality-oriented data selection. iScience 2021; 24:101961. [PMID: 33437941 PMCID: PMC7788089 DOI: 10.1016/j.isci.2020.101961] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2020] [Revised: 11/18/2020] [Accepted: 12/15/2020] [Indexed: 01/19/2023] Open
Abstract
Accurate prediction of the solubility of chemical substances in solvents remains a challenge. The sparsity of high-quality solubility data is recognized as the biggest hurdle in the development of robust data-driven methods for practical use. Nonetheless, the effects of the quality and quantity of data on aqueous solubility predictions have not yet been scrutinized. In this study, the roles of the size and the quality of data sets on the performances of the solubility prediction models are unraveled, and the concepts of actual and observed performances are introduced. In an effort to curtail the gap between actual and observed performances, a quality-oriented data selection method, which evaluates the quality of data and extracts the most accurate part of it through statistical validation, is designed. Applying this method on the largest publicly available solubility database and using a consensus machine learning approach, a top-performing solubility prediction model is achieved.
Collapse
Affiliation(s)
- Murat Cihan Sorkun
- DIFFER - Dutch Institute for Fundamental Energy Research, De Zaale 20, 5612 AJ Eindhoven, the Netherlands
- CCER - Center for Computational Energy Research, De Zaale 20, 5612 AJ Eindhoven, the Netherlands
- Department of Applied Physics, Eindhoven University of Technology, 5600 MB Eindhoven, the Netherlands
| | - J.M. Vianney A. Koelman
- DIFFER - Dutch Institute for Fundamental Energy Research, De Zaale 20, 5612 AJ Eindhoven, the Netherlands
- CCER - Center for Computational Energy Research, De Zaale 20, 5612 AJ Eindhoven, the Netherlands
- Department of Applied Physics, Eindhoven University of Technology, 5600 MB Eindhoven, the Netherlands
| | - Süleyman Er
- DIFFER - Dutch Institute for Fundamental Energy Research, De Zaale 20, 5612 AJ Eindhoven, the Netherlands
- CCER - Center for Computational Energy Research, De Zaale 20, 5612 AJ Eindhoven, the Netherlands
| |
Collapse
|
18
|
Abstract
Molecular dynamics (MD) simulations have become increasingly useful in the modern drug development process. In this review, we give a broad overview of the current application possibilities of MD in drug discovery and pharmaceutical development. Starting from the target validation step of the drug development process, we give several examples of how MD studies can give important insights into the dynamics and function of identified drug targets such as sirtuins, RAS proteins, or intrinsically disordered proteins. The role of MD in antibody design is also reviewed. In the lead discovery and lead optimization phases, MD facilitates the evaluation of the binding energetics and kinetics of the ligand-receptor interactions, therefore guiding the choice of the best candidate molecules for further development. The importance of considering the biological lipid bilayer environment in the MD simulations of membrane proteins is also discussed, using G-protein coupled receptors and ion channels as well as the drug-metabolizing cytochrome P450 enzymes as relevant examples. Lastly, we discuss the emerging role of MD simulations in facilitating the pharmaceutical formulation development of drugs and candidate drugs. Specifically, we look at how MD can be used in studying the crystalline and amorphous solids, the stability of amorphous drug or drug-polymer formulations, and drug solubility. Moreover, since nanoparticle drug formulations are of great interest in the field of drug delivery research, different applications of nano-particle simulations are also briefly summarized using multiple recent studies as examples. In the future, the role of MD simulations in facilitating the drug development process is likely to grow substantially with the increasing computer power and advancements in the development of force fields and enhanced MD methodologies.
Collapse
|
19
|
Falcón-Cano G, Molina C, Cabrera-Pérez MÁ. ADME prediction with KNIME: In silico aqueous solubility consensus model based on supervised recursive random forest approaches. ADMET AND DMPK 2020; 8:251-273. [PMID: 35300309 PMCID: PMC8915604 DOI: 10.5599/admet.852] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2020] [Revised: 08/01/2020] [Indexed: 12/12/2022] Open
Abstract
In-silico prediction of aqueous solubility plays an important role during the drug discovery and development processes. For many years, the limited performance of in-silico solubility models has been attributed to the lack of high-quality solubility data for pharmaceutical molecules. However, some studies suggest that the poor accuracy of solubility prediction is not related to the quality of the experimental data and that more precise methodologies (algorithms and/or set of descriptors) are required for predicting aqueous solubility for pharmaceutical molecules. In this study a large and diverse database was generated with aqueous solubility values collected from two public sources; two new recursive machine-learning approaches were developed for data cleaning and variable selection, and a consensus model based on regression and classification algorithms was created. The modeling protocol, which includes the curation of chemical and experimental data, was implemented in KNIME, with the aim of obtaining an automated workflow for the prediction of new databases. Finally, we compared several methods or models available in the literature with our consensus model, showing results comparable or even outperforming previous published models.
Collapse
Affiliation(s)
- Gabriela Falcón-Cano
- Unit of Modeling and Experimental Biopharmaceutics. Centro de Bioactivos Químicos. Universidad Central “Marta Abreu” de las Villas. Santa Clara 54830, Villa Clara, Cuba
| | | | - Miguel Ángel Cabrera-Pérez
- Unit of Modeling and Experimental Biopharmaceutics. Centro de Bioactivos Químicos. Universidad Central “Marta Abreu” de las Villas. Santa Clara 54830, Villa Clara, Cuba
- Department of Pharmacy and Pharmaceutical Technology, University of Valencia, Burjassot 46100, Valencia, Spain
- Department of Engineering, Area of Pharmacy and Pharmaceutical Technology, Miguel Hernández University, 03550 Sant Joan d'Alacant, Alicante, Spain
| |
Collapse
|
20
|
McDonagh JL, Swope WC, Anderson RL, Johnston MA, Bray DJ. What can digitisation do for formulated product innovation and development? POLYM INT 2020. [DOI: 10.1002/pi.6056] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Affiliation(s)
| | | | | | | | - David J Bray
- The Hartree Centre STFC Daresbury Laboratory Warrington WA4 4AD UK
| |
Collapse
|
21
|
Chen JH, Tseng YJ. Different molecular enumeration influences in deep learning: an example using aqueous solubility. Brief Bioinform 2020; 22:5851267. [PMID: 32501508 DOI: 10.1093/bib/bbaa092] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2020] [Revised: 04/27/2020] [Accepted: 04/27/2020] [Indexed: 12/24/2022] Open
Abstract
Aqueous solubility is the key property driving many chemical and biological phenomena and impacts experimental and computational attempts to assess those phenomena. Accurate prediction of solubility is essential and challenging, even with modern computational algorithms. Fingerprint-based, feature-based and molecular graph-based representations have all been used with different deep learning methods for aqueous solubility prediction. It has been clearly demonstrated that different molecular representations impact the model prediction and explainability. In this work, we reviewed different representations and also focused on using graph and line notations for modeling. In general, one canonical chemical structure is used to represent one molecule when computing its properties. We carefully examined the commonly used simplified molecular-input line-entry specification (SMILES) notation representing a single molecule and proposed to use the full enumerations in SMILES to achieve better accuracy. A convolutional neural network (CNN) was used. The full enumeration of SMILES can improve the presentation of a molecule and describe the molecule with all possible angles. This CNN model can be very robust when dealing with large datasets since no additional explicit chemistry knowledge is necessary to predict the solubility. Also, traditionally it is hard to use a neural network to explain the contribution of chemical substructures to a single property. We demonstrated the use of attention in the decoding network to detect the part of a molecule that is relevant to solubility, which can be used to explain the contribution from the CNN.
Collapse
|
22
|
Bunmahotama W, Lin TF, Yang X. Prediction of adsorption capacity for pharmaceuticals, personal care products and endocrine disrupting chemicals onto various adsorbent materials. CHEMOSPHERE 2020; 238:124658. [PMID: 31548174 DOI: 10.1016/j.chemosphere.2019.124658] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/16/2019] [Revised: 08/20/2019] [Accepted: 08/23/2019] [Indexed: 05/11/2023]
Abstract
Adsorption is a common process used to remove pharmaceuticals, personal care products and endocrine disrupting chemicals (PPCPs/EDCs) from water. However, as PPCPs/EDCs cover a wide range of molecules and chemical structures, prediction of the adsorption capacity is very challenging. In this study, a novel model was developed to predict adsorption isotherms of PPCPs/EDCs onto various types of adsorbents using a combination of Polanyi potential theory, molecular connectivity indices (MCIs) and molecular characteristics. Polanyi theory provided the basic mathematical form for the correlation. MCIs, hydrophobicity and H-bond count were used to normalize the Polanyi equation based on the molecular structure and interactions among the chemicals, the adsorbents, and the solution. In total, adsorption data were collected from 158 reports for 55 PPCPs/EDCs onto 306 different adsorbent materials. The correlation was first trained with 46 PPCPs/EDCs adsorbed onto 162 carbonaceous adsorbents (CAs), with 44.8% SDEV. Then the model was employed to predict 46 PPCPs/EDCs onto 118 other CAs and 9 new PPCPs/EDCs onto 23 CAs in ultrapure water, with error from 42 to 48% SDEV. When applying to non-carbonaceous adsorbents, the models can still predict the adsorption of PPCPs/EDCs with 90.09% SDEV. For the first time, a model, the PD - MCI - hydrophobic - H bond model, was developed to predict adsorption of a wide group of complicated PPCPs/EDCs onto a big variety of carbonaceous and non-carbonaceous adsorbents. The proposed model approach may provide a simple means for predicting adsorption capacities of PPCPs/EDCs onto various adsorbents.
Collapse
Affiliation(s)
- Warisa Bunmahotama
- School of Environmental Science and Engineering, Guangdong Provincial Key Laboratory of Environmental Pollution Control and Remediation Technology, Sun Yat - sen University, Guangzhou, 510275, China
| | - Tsair-Fuh Lin
- Department of Environmental Engineering, National Cheng Kung University, Tainan, 70101, Taiwan; Global Water Quality Research Center, National Cheng Kung University, Tainan, 70955, Taiwan
| | - Xin Yang
- School of Environmental Science and Engineering, Guangdong Provincial Key Laboratory of Environmental Pollution Control and Remediation Technology, Sun Yat - sen University, Guangzhou, 510275, China.
| |
Collapse
|
23
|
Birch H, Redman AD, Letinski DJ, Lyon DY, Mayer P. Determining the water solubility of difficult-to-test substances: A tutorial review. Anal Chim Acta 2019; 1086:16-28. [DOI: 10.1016/j.aca.2019.07.034] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2019] [Revised: 07/04/2019] [Accepted: 07/18/2019] [Indexed: 11/29/2022]
|
24
|
Tosstorff A, Svilenov H, Peters GH, Harris P, Winter G. Structure-based discovery of a new protein-aggregation breaking excipient. Eur J Pharm Biopharm 2019; 144:207-216. [DOI: 10.1016/j.ejpb.2019.09.010] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2019] [Revised: 07/28/2019] [Accepted: 09/11/2019] [Indexed: 01/06/2023]
|
25
|
Yang X, Wang Y, Byrne R, Schneider G, Yang S. Concepts of Artificial Intelligence for Computer-Assisted Drug Discovery. Chem Rev 2019; 119:10520-10594. [PMID: 31294972 DOI: 10.1021/acs.chemrev.8b00728] [Citation(s) in RCA: 351] [Impact Index Per Article: 70.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
Artificial intelligence (AI), and, in particular, deep learning as a subcategory of AI, provides opportunities for the discovery and development of innovative drugs. Various machine learning approaches have recently (re)emerged, some of which may be considered instances of domain-specific AI which have been successfully employed for drug discovery and design. This review provides a comprehensive portrayal of these machine learning techniques and of their applications in medicinal chemistry. After introducing the basic principles, alongside some application notes, of the various machine learning algorithms, the current state-of-the art of AI-assisted pharmaceutical discovery is discussed, including applications in structure- and ligand-based virtual screening, de novo drug design, physicochemical and pharmacokinetic property prediction, drug repurposing, and related aspects. Finally, several challenges and limitations of the current methods are summarized, with a view to potential future directions for AI-assisted drug discovery and design.
Collapse
Affiliation(s)
- Xin Yang
- State Key Laboratory of Biotherapy and Cancer Center, West China Hospital , Sichuan University , Chengdu , Sichuan 610041 , China
| | - Yifei Wang
- State Key Laboratory of Biotherapy and Cancer Center, West China Hospital , Sichuan University , Chengdu , Sichuan 610041 , China
| | - Ryan Byrne
- ETH Zurich , Department of Chemistry and Applied Biosciences , Vladimir-Prelog-Weg 4 , CH-8093 Zurich , Switzerland
| | - Gisbert Schneider
- ETH Zurich , Department of Chemistry and Applied Biosciences , Vladimir-Prelog-Weg 4 , CH-8093 Zurich , Switzerland
| | - Shengyong Yang
- State Key Laboratory of Biotherapy and Cancer Center, West China Hospital , Sichuan University , Chengdu , Sichuan 610041 , China
| |
Collapse
|
26
|
Njoku JO, Amaral Silva D, Mukherjee D, Webster GK, Löbenberg R. In silico Tools at Early Stage of Pharmaceutical Development: Data Needs and Software Capabilities. AAPS PharmSciTech 2019; 20:243. [PMID: 31264126 DOI: 10.1208/s12249-019-1461-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2019] [Accepted: 06/18/2019] [Indexed: 01/17/2023] Open
Abstract
In early drug development, the selection of a formulation platform and decisions on formulation strategies have to be made within a short timeframe and often with minimal use of the active pharmaceutical ingredient (API). The current work evaluated the various physicochemical parameters required to improve the prediction accuracy of simulation software for immediate release tablets in early drug development. DDDPlus™ was used in simulating dissolution test profiles of immediate release tablets of ritonavir and all simulations were compared with experimental results. The minimum data requirements to make useful predictions were assessed using the ADMET predictor (part of DDDPlus) and Chemicalize (an online resource). A surfactant model was developed to estimate the solubility enhancement in media containing surfactant and the software's transfer model based on the USP two-tiered dissolution test was assessed. One measured data point was shown to be sufficient to make predictive simulations in DDDPlus. At pH 2.0, the software overestimated drug release while at pH 1.0 and 6.8, simulations were close to the measured values. A surfactant solubility model established with measured data gave good dissolution predictions. The transfer model uses a single-vessel model and was unable to predict the two in vivo environments separately. For weak bases like ritonavir, a minimum of three solubility data points is recommended for in silico predictions in buffered media. A surfactant solubility model is useful when predicting dissolution behavior in surfactant media and in silico predictions need measured solubility data to be predictive.
Collapse
|
27
|
Hossain S, Kabedev A, Parrow A, Bergström CAS, Larsson P. Molecular simulation as a computational pharmaceutics tool to predict drug solubility, solubilization processes and partitioning. Eur J Pharm Biopharm 2019; 137:46-55. [PMID: 30771454 PMCID: PMC6434319 DOI: 10.1016/j.ejpb.2019.02.007] [Citation(s) in RCA: 57] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2018] [Revised: 02/05/2019] [Accepted: 02/13/2019] [Indexed: 01/12/2023]
Abstract
In this review we will discuss how computational methods, and in particular classical molecular dynamics simulations, can be used to calculate solubility of pharmaceutically relevant molecules and systems. To the extent possible, we focus on the non-technical details of these calculations, and try to show also the added value of a more thorough and detailed understanding of the solubilization process obtained by using computational simulations. Although the main focus is on classical molecular dynamics simulations, we also provide the reader with some insights into other computational techniques, such as the COSMO-method, and also discuss Flory-Huggins theory and solubility parameters. We hope that this review will serve as a valuable starting point for any pharmaceutical researcher, who has not yet fully explored the possibilities offered by computational approaches to solubility calculations.
Collapse
Affiliation(s)
- Shakhawath Hossain
- Department of Pharmacy, Uppsala Biomedical Center, Uppsala University, 751 23 Uppsala, Sweden; Swedish Drug Delivery Forum (SDDF), Uppsala University, Sweden
| | - Aleksei Kabedev
- Department of Pharmacy, Uppsala Biomedical Center, Uppsala University, 751 23 Uppsala, Sweden
| | - Albin Parrow
- Department of Pharmacy, Uppsala Biomedical Center, Uppsala University, 751 23 Uppsala, Sweden
| | - Christel A S Bergström
- Department of Pharmacy, Uppsala Biomedical Center, Uppsala University, 751 23 Uppsala, Sweden; Swedish Drug Delivery Forum (SDDF), Uppsala University, Sweden
| | - Per Larsson
- Department of Pharmacy, Uppsala Biomedical Center, Uppsala University, 751 23 Uppsala, Sweden; Swedish Drug Delivery Forum (SDDF), Uppsala University, Sweden.
| |
Collapse
|
28
|
Raevsky OA, Grigorev VY, Polianczyk DE, Raevskaja OE, Dearden JC. Aqueous Drug Solubility: What Do We Measure, Calculate and QSPR Predict? Mini Rev Med Chem 2019; 19:362-372. [PMID: 30058484 DOI: 10.2174/1389557518666180727164417] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2018] [Revised: 07/06/2018] [Accepted: 07/20/2018] [Indexed: 01/07/2023]
Abstract
Detailed critical analysis of publications devoted to QSPR of aqueous solubility is presented in the review with discussion of four types of aqueous solubility (three different thermodynamic solubilities with unknown solute structure, intrinsic solubility, solubility in physiological media at pH=7.4 and kinetic solubility), variety of molecular descriptors (from topological to quantum chemical), traditional statistical and machine learning methods as well as original QSPR models.
Collapse
Affiliation(s)
- Oleg A Raevsky
- Department of Computer-Aided Molecular Design, Institute of Physiologically Active Compounds, Russian Academy of Science, Chernogolovka, Russian Federation
| | - Veniamin Y Grigorev
- Department of Computer-Aided Molecular Design, Institute of Physiologically Active Compounds, Russian Academy of Science, Chernogolovka, Russian Federation
| | - Daniel E Polianczyk
- Department of Computer-Aided Molecular Design, Institute of Physiologically Active Compounds, Russian Academy of Science, Chernogolovka, Russian Federation
| | - Olga E Raevskaja
- Department of Computer-Aided Molecular Design, Institute of Physiologically Active Compounds, Russian Academy of Science, Chernogolovka, Russian Federation
| | - John C Dearden
- School of Pharmacy and Biomolecular Sciences, Liverpool John Moores University, Liverpool, United Kingdom
| |
Collapse
|
29
|
Palmblad M. Visual and Semantic Enrichment of Analytical Chemistry Literature Searches by Combining Text Mining and Computational Chemistry. Anal Chem 2019; 91:4312-4316. [PMID: 30835438 PMCID: PMC6448173 DOI: 10.1021/acs.analchem.8b05818] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
![]()
The
open-access scientific literature contains a wealth of information
for meaningful text mining. However, this information is not always
easy to retrieve. This technical note addresses the problem by a new
flexible method combining in a single workflow existing resources
for literature searches, text mining, and large-scale prediction of
physicochemical and biological properties. The results are visualized
as virtual mass spectra, chromatograms, or images in styles new to
text mining but familiar to analytical chemistry. The method is demonstrated
on comparisons of analytical-chemistry techniques and semantically
enriched searches for proteins and their activities, but it may also
be of general utility in experimental design, drug discovery, chemical
syntheses, business intelligence, and historical studies. The method
is realized in shareable scientific workflows using only freely available
data, services, and software that scale to millions of publications
and named chemical entities in the literature.
Collapse
Affiliation(s)
- Magnus Palmblad
- Center for Proteomics and Metabolomics , Leiden University Medical Center , Postzone S3-P, Postbus 9600, 2300 RC Leiden , The Netherlands
| |
Collapse
|
30
|
Pirashvili M, Steinberg L, Belchi Guillamon F, Niranjan M, Frey JG, Brodzki J. Improved understanding of aqueous solubility modeling through topological data analysis. J Cheminform 2018; 10:54. [PMID: 30460426 PMCID: PMC6755597 DOI: 10.1186/s13321-018-0308-5] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2018] [Accepted: 11/08/2018] [Indexed: 11/10/2022] Open
Abstract
Topological data analysis is a family of recent mathematical techniques seeking to understand the 'shape' of data, and has been used to understand the structure of the descriptor space produced from a standard chemical informatics software from the point of view of solubility. We have used the mapper algorithm, a TDA method that creates low-dimensional representations of data, to create a network visualization of the solubility space. While descriptors with clear chemical implications are prominent features in this space, reflecting their importance to the chemical properties, an unexpected and interesting correlation between chlorine content and rings and their implication for solubility prediction is revealed. A parallel representation of the chemical space was generated using persistent homology applied to molecular graphs. Links between this chemical space and the descriptor space were shown to be in agreement with chemical heuristics. The use of persistent homology on molecular graphs, extended by the use of norms on the associated persistence landscapes allow the conversion of discrete shape descriptors to continuous ones, and a perspective of the application of these descriptors to quantitative structure property relations is presented.
Collapse
Affiliation(s)
| | - Lee Steinberg
- Department of Chemistry, University of Southampton, Southampton, UK
| | - Francisco Belchi Guillamon
- Mathematical Sciences, University of Southampton, Southampton, UK.,Institut de Robòtica i Informàtica industrial, CSIC-UPC, Llorens i Artigas 4-6, 08028, Barcelona, Spain
| | | | - Jeremy G Frey
- Department of Chemistry, University of Southampton, Southampton, UK
| | - Jacek Brodzki
- Mathematical Sciences, University of Southampton, Southampton, UK
| |
Collapse
|
31
|
Tu M, Cheng S, Lu W, Du M. Advancement and prospects of bioinformatics analysis for studying bioactive peptides from food-derived protein: Sequence, structure, and functions. Trends Analyt Chem 2018. [DOI: 10.1016/j.trac.2018.04.005] [Citation(s) in RCA: 129] [Impact Index Per Article: 21.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
|
32
|
Könczöl Á, Dargó G. Brief overview of solubility methods: Recent trends in equilibrium solubility measurement and predictive models. DRUG DISCOVERY TODAY. TECHNOLOGIES 2018; 27:3-10. [PMID: 30103861 DOI: 10.1016/j.ddtec.2018.06.001] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/22/2018] [Revised: 05/31/2018] [Accepted: 06/03/2018] [Indexed: 06/08/2023]
Abstract
Solubility is a crucial physicochemical parameter affecting the whole process of drug discovery and development. Thus, understanding of the methods and concepts to measure and predict this propensity are of utmost importance for the pharmaceutical scientist. Despite their inherent limitations, kinetic solubility screening methods became routine assays by mimicking bioassay conditions and guiding the lead optimization process. In contrast, thermodynamic solubility methods show a clear evolution: miniaturized high throughput assays coupled to analytical techniques such as solid-state characterization, ultra performance liquid chromatography, or polychromatic turbidimetry, have been developed, thereby enabling a more complex physicochemical profiling at the early discovery stage. Solubility prediction still poses a significant challenge at the industrial level. Classification and critical evaluation of recent in silico models are provided. Discussion of experimental and computational methods: was based on relevant industrial references.
Collapse
Affiliation(s)
- Árpád Könczöl
- Compound Profiling Laboratory, Gedeon Richter Plc., H-1475 Budapest, Hungary.
| | - Gergő Dargó
- Compound Profiling Laboratory, Gedeon Richter Plc., H-1475 Budapest, Hungary; Department of Organic Chemistry and Technology, Budapest University of Technology and Economics, Műegyetem rakpart 3, H-1111 Budapest, Hungary
| |
Collapse
|
33
|
Sou T, Bergström CAS. Automated assays for thermodynamic (equilibrium) solubility determination. DRUG DISCOVERY TODAY. TECHNOLOGIES 2018; 27:11-19. [PMID: 30103859 DOI: 10.1016/j.ddtec.2018.04.004] [Citation(s) in RCA: 27] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/10/2018] [Revised: 04/26/2018] [Accepted: 04/27/2018] [Indexed: 06/08/2023]
Abstract
Solubility is a crucial physicochemical property for drug candidates and is important in both drug discovery and development. Poor solubility is detrimental to absorption after oral administration and can mask compound activity in bioassays in various ways. Hence, solubility liabilities should ideally be identified as early as possible in the drug development process. With the increasing number of compounds as potential drug candidates, automated thermodynamic solubility assays for high throughput screening enabling rapid evaluation of a large number of compounds are becoming increasingly important. This review discusses the current status of the most widely used automated assays for thermodynamic solubility, followed by recent high throughput measurements of properties related to solubility (e.g. dissolution rate and supersaturation) and a brief overview of predictive computational methods for thermodynamic solubility reported in the literature.
Collapse
Affiliation(s)
- Tomás Sou
- Department of Pharmacy, Uppsala University, BMC P.O. Box 580, SE-751 23 Uppsala, Sweden
| | - Christel A S Bergström
- Department of Pharmacy, Uppsala University, BMC P.O. Box 580, SE-751 23 Uppsala, Sweden.
| |
Collapse
|
34
|
Mansouri K, Grulke CM, Judson RS, Williams AJ. OPERA models for predicting physicochemical properties and environmental fate endpoints. J Cheminform 2018. [PMID: 29520515 PMCID: PMC5843579 DOI: 10.1186/s13321-018-0263-1] [Citation(s) in RCA: 281] [Impact Index Per Article: 46.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
The collection of chemical structure information and associated experimental data for quantitative structure–activity/property relationship (QSAR/QSPR) modeling is facilitated by an increasing number of public databases containing large amounts of useful data. However, the performance of QSAR models highly depends on the quality of the data and modeling methodology used. This study aims to develop robust QSAR/QSPR models for chemical properties of environmental interest that can be used for regulatory purposes. This study primarily uses data from the publicly available PHYSPROP database consisting of a set of 13 common physicochemical and environmental fate properties. These datasets have undergone extensive curation using an automated workflow to select only high-quality data, and the chemical structures were standardized prior to calculation of the molecular descriptors. The modeling procedure was developed based on the five Organization for Economic Cooperation and Development (OECD) principles for QSAR models. A weighted k-nearest neighbor approach was adopted using a minimum number of required descriptors calculated using PaDEL, an open-source software. The genetic algorithms selected only the most pertinent and mechanistically interpretable descriptors (2–15, with an average of 11 descriptors). The sizes of the modeled datasets varied from 150 chemicals for biodegradability half-life to 14,050 chemicals for logP, with an average of 3222 chemicals across all endpoints. The optimal models were built on randomly selected training sets (75%) and validated using fivefold cross-validation (CV) and test sets (25%). The CV Q2 of the models varied from 0.72 to 0.95, with an average of 0.86 and an R2 test value from 0.71 to 0.96, with an average of 0.82. Modeling and performance details are described in QSAR model reporting format and were validated by the European Commission’s Joint Research Center to be OECD compliant. All models are freely available as an open-source, command-line application called OPEn structure–activity/property Relationship App (OPERA). OPERA models were applied to more than 750,000 chemicals to produce freely available predicted data on the U.S. Environmental Protection Agency’s CompTox Chemistry Dashboard.![]()
Collapse
Affiliation(s)
- Kamel Mansouri
- National Center for Computational Toxicology, Office of Research and Development, U.S. Environmental Protection Agency, Research Triangle Park, NC, 27711, USA. .,Oak Ridge Institute for Science and Education, 1299 Bethel Valley Road, Oak Ridge, TN, 37830, USA. .,ScitoVation LLC, 6 Davis Drive, Research Triangle Park, NC, 27709, USA.
| | - Chris M Grulke
- National Center for Computational Toxicology, Office of Research and Development, U.S. Environmental Protection Agency, Research Triangle Park, NC, 27711, USA
| | - Richard S Judson
- National Center for Computational Toxicology, Office of Research and Development, U.S. Environmental Protection Agency, Research Triangle Park, NC, 27711, USA
| | - Antony J Williams
- National Center for Computational Toxicology, Office of Research and Development, U.S. Environmental Protection Agency, Research Triangle Park, NC, 27711, USA
| |
Collapse
|
35
|
Fearon AD, Stokes GY. Thermodynamics of Indomethacin Adsorption to Phospholipid Membranes. J Phys Chem B 2017; 121:10508-10518. [PMID: 29064244 DOI: 10.1021/acs.jpcb.7b08359] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
Using second-harmonic generation, we directly monitored adsorption of indomethacin, a nonsteroidal anti-inflammatory drug, to supported lipid bilayers composed of phospholipids of varying phase, cholesterol content, and head group charge without the use of extrinsic labels at therapeutically relevant aqueous concentrations. Indomethacin adsorbed to gel-phase lipids with a high binding affinity, suggesting that like other arylacetic acid-containing drugs, it preferentially interacts with ordered lipid domains. We discovered that adsorption of indomethacin to gel-phase phospholipids was endothermic and entropically driven, whereas adsorption to fluid-phase phospholipids was exothermic and enthalpically driven. As temperature increased from 19 to 34 °C, binding affinities to gel-phase lipids increased by 7-fold but relative surface concentration decreased to one-fifth of the original value. We also compared our results to the entropies reported for indomethacin adsorbed to surfactant micelles, which are used in drug delivery systems, and assert that adsorbed water molecules in the phospholipid bilayer may be buried deeper into the acyl chains and less accessible for disruption. The thermodynamic studies reported here provide mechanistic insight into indomethacin interactions with mammalian plasma membranes in the gastrointestinal tract and inform studies of drug delivery, where indomethacin is commonly used as a prototypical, hydrophobic small-molecule drug.
Collapse
Affiliation(s)
- Amanda D Fearon
- Department of Chemistry and Biochemistry, Santa Clara University , 500 El Camino Real, Santa Clara, California 95053, United States
| | - Grace Y Stokes
- Department of Chemistry and Biochemistry, Santa Clara University , 500 El Camino Real, Santa Clara, California 95053, United States
| |
Collapse
|
36
|
Zang Q, Mansouri K, Williams AJ, Judson RS, Allen DG, Casey WM, Kleinstreuer NC. In Silico Prediction of Physicochemical Properties of Environmental Chemicals Using Molecular Fingerprints and Machine Learning. J Chem Inf Model 2017; 57:36-49. [PMID: 28006899 PMCID: PMC6131700 DOI: 10.1021/acs.jcim.6b00625] [Citation(s) in RCA: 81] [Impact Index Per Article: 11.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
There are little available toxicity data on the vast majority of chemicals in commerce. High-throughput screening (HTS) studies, such as those being carried out by the U.S. Environmental Protection Agency (EPA) ToxCast program in partnership with the federal Tox21 research program, can generate biological data to inform models for predicting potential toxicity. However, physicochemical properties are also needed to model environmental fate and transport, as well as exposure potential. The purpose of the present study was to generate an open-source quantitative structure-property relationship (QSPR) workflow to predict a variety of physicochemical properties that would have cross-platform compatibility to integrate into existing cheminformatics workflows. In this effort, decades-old experimental property data sets available within the EPA EPI Suite were reanalyzed using modern cheminformatics workflows to develop updated QSPR models capable of supplying computationally efficient, open, and transparent HTS property predictions in support of environmental modeling efforts. Models were built using updated EPI Suite data sets for the prediction of six physicochemical properties: octanol-water partition coefficient (logP), water solubility (logS), boiling point (BP), melting point (MP), vapor pressure (logVP), and bioconcentration factor (logBCF). The coefficient of determination (R2) between the estimated values and experimental data for the six predicted properties ranged from 0.826 (MP) to 0.965 (BP), with model performance for five of the six properties exceeding those from the original EPI Suite models. The newly derived models can be employed for rapid estimation of physicochemical properties within an open-source HTS workflow to inform fate and toxicity prediction models of environmental chemicals.
Collapse
Affiliation(s)
- Qingda Zang
- Integrated Laboratory Systems, Inc., Research Triangle Park, NC 27709, USA
| | - Kamel Mansouri
- National Center for Computational Toxicology, Office of Research and Development, U.S. Environmental Protection Agency, Research Triangle Park, NC 27711, USA
| | - Antony J. Williams
- National Center for Computational Toxicology, Office of Research and Development, U.S. Environmental Protection Agency, Research Triangle Park, NC 27711, USA
| | - Richard S. Judson
- National Center for Computational Toxicology, Office of Research and Development, U.S. Environmental Protection Agency, Research Triangle Park, NC 27711, USA
| | - David G. Allen
- Integrated Laboratory Systems, Inc., Research Triangle Park, NC 27709, USA
| | - Warren M. Casey
- National Toxicology Program, National Institute of Environmental Health Sciences, Research Triangle Park, NC 27709, USA
| | - Nicole C. Kleinstreuer
- National Toxicology Program, National Institute of Environmental Health Sciences, Research Triangle Park, NC 27709, USA
| |
Collapse
|
37
|
Golovnev NN, Vasil’ev AD. Structure of two new compounds of fluoroquinolone antibiotics with mineral acids. RUSS J INORG CHEM+ 2016. [DOI: 10.1134/s0036023616110073] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
38
|
Liu S, Cao S, Hoang K, Young KL, Paluch AS, Mobley DL. Using MD Simulations To Calculate How Solvents Modulate Solubility. J Chem Theory Comput 2016; 12:1930-41. [PMID: 26878198 DOI: 10.1021/acs.jctc.5b00934] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Here, our interest is in predicting solubility in general, and we focus particularly on predicting how the solubility of particular solutes is modulated by the solvent environment. Solubility in general is extremely important, both for theoretical reasons - it provides an important probe of the balance between solute-solute and solute-solvent interactions - and for more practical reasons, such as how to control the solubility of a given solute via modulation of its environment, as in process chemistry and separations. Here, we study how the change of solvent affects the solubility of a given compound. That is, we calculate relative solubilities. We use MD simulations to calculate relative solubility and compare our calculated values with experiment as well as with results from several other methods, SMD and UNIFAC, the latter of which is commonly used in chemical engineering design. We find that straightforward solubility calculations based on molecular simulations using a general small-molecule force field outperform SMD and UNIFAC both in terms of accuracy and coverage of the relevant chemical space.
Collapse
Affiliation(s)
| | | | | | - Kayla L Young
- Department of Chemical, Paper and Biomedical Engineering, Miami University , Oxford, Ohio 45056, United States
| | - Andrew S Paluch
- Department of Chemical, Paper and Biomedical Engineering, Miami University , Oxford, Ohio 45056, United States
| | | |
Collapse
|
39
|
Nieto-Draghi C, Fayet G, Creton B, Rozanska X, Rotureau P, de Hemptinne JC, Ungerer P, Rousseau B, Adamo C. A General Guidebook for the Theoretical Prediction of Physicochemical Properties of Chemicals for Regulatory Purposes. Chem Rev 2015; 115:13093-164. [PMID: 26624238 DOI: 10.1021/acs.chemrev.5b00215] [Citation(s) in RCA: 72] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Affiliation(s)
- Carlos Nieto-Draghi
- IFP Energies nouvelles , 1 et 4 avenue de Bois-Préau, 92852 Rueil-Malmaison, France
| | - Guillaume Fayet
- INERIS, Parc Technologique Alata, BP2 , 60550 Verneuil-en-Halatte, France
| | - Benoit Creton
- IFP Energies nouvelles , 1 et 4 avenue de Bois-Préau, 92852 Rueil-Malmaison, France
| | - Xavier Rozanska
- Materials Design S.A.R.L. , 18, rue de Saisset, 92120 Montrouge, France
| | - Patricia Rotureau
- INERIS, Parc Technologique Alata, BP2 , 60550 Verneuil-en-Halatte, France
| | | | - Philippe Ungerer
- Materials Design S.A.R.L. , 18, rue de Saisset, 92120 Montrouge, France
| | - Bernard Rousseau
- Laboratoire de Chimie-Physique, Université Paris Sud , UMR 8000 CNRS, Bât. 349, 91405 Orsay Cedex, France
| | - Carlo Adamo
- Institut de Recherche Chimie Paris, PSL Research University, CNRS, Chimie Paristech , 11 rue P. et M. Curie, F-75005 Paris, France.,Institut Universitaire de France , 103 Boulevard Saint Michel, F-75005 Paris, France
| |
Collapse
|
40
|
Abstract
In recent decades, in silico absorption, distribution, metabolism, excretion (ADME), and toxicity (T) modelling as a tool for rational drug design has received considerable attention from pharmaceutical scientists, and various ADME/T-related prediction models have been reported. The high-throughput and low-cost nature of these models permits a more streamlined drug development process in which the identification of hits or their structural optimization can be guided based on a parallel investigation of bioavailability and safety, along with activity. However, the effectiveness of these tools is highly dependent on their capacity to cope with needs at different stages, e.g. their use in candidate selection has been limited due to their lack of the required predictability. For some events or endpoints involving more complex mechanisms, the current in silico approaches still need further improvement. In this review, we will briefly introduce the development of in silico models for some physicochemical parameters, ADME properties and toxicity evaluation, with an emphasis on the modelling approaches thereof, their application in drug discovery, and the potential merits or deficiencies of these models. Finally, the outlook for future ADME/T modelling based on big data analysis and systems sciences will be discussed.
Collapse
|
41
|
Caldwell GW. In silico tools used for compound selection during target-based drug discovery and development. Expert Opin Drug Discov 2015; 10:901-23. [DOI: 10.1517/17460441.2015.1043885] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Affiliation(s)
- Gary W Caldwell
- Janssen Research & Development LLC, Discovery Sciences, Spring House, PA, USA
| |
Collapse
|
42
|
Almukainzi M, Okumu A, Wei H, Löbenberg R. Simulation of in vitro dissolution behavior using DDDPlus™. AAPS PharmSciTech 2015; 16:217-21. [PMID: 25409918 DOI: 10.1208/s12249-014-0241-5] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2013] [Accepted: 10/23/2014] [Indexed: 11/30/2022] Open
Abstract
Dissolution testing is a performance test for many dosage forms including tablets and capsules. The objective of this study was to evaluate if computer simulations can predict the in vitro dissolution of two model drugs for which different dissolution data were available. Published montelukast sodium and glyburide dissolution data was used for the simulations. Different pharmacopeial and biorelevant buffers, volumes, and rotations speeds were evaluated. Additionally, a pH change protocol was evaluated using these buffers. DDDPlus™ 3, Beta version (Simulation Plus, Inc.), was used to simulate the in vitro dissolution data. The simulated data were compared with the in vitro data. A regression coefficient between predicted and observed data was used to assess the simulations. The statistical analysis of Montelukast sodium showed that there was a significant correlation between the in vitro release data and the predicted data for all cases except for one buffer. For glyburide, there was also a significant correlation between the experimental data and the predicted data using single pH conditions. Using the dynamic pH protocol, a correlation was significant for one biorelevant media. The simulations showed that both in vitro drug releases were sensitive to solubility effects which confirmed their BCS class II category. Computer simulations of the in vitro release using DDDPlus™ have the potential to estimate the in vivo dissolution at an early stage in the drug development process. This might be used to choose the most appropriate dissolution condition to establish IVIVC and to develop biorelevant in vitro performance tests to capture critical product attributes for quality control procedures in quality by design environments.
Collapse
|
43
|
Shen J, Kromidas L, Schultz T, Bhatia S. An in silico skin absorption model for fragrance materials. Food Chem Toxicol 2014; 74:164-76. [DOI: 10.1016/j.fct.2014.09.015] [Citation(s) in RCA: 831] [Impact Index Per Article: 83.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2014] [Revised: 09/19/2014] [Accepted: 09/23/2014] [Indexed: 11/15/2022]
|
44
|
Newby D, Freitas AA, Ghafourian T. Comparing multilabel classification methods for provisional biopharmaceutics class prediction. Mol Pharm 2014; 12:87-102. [PMID: 25397721 DOI: 10.1021/mp500457t] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Abstract
The biopharmaceutical classification system (BCS) is now well established and utilized for the development and biowaivers of immediate oral dosage forms. The prediction of BCS class can be carried out using multilabel classification. Unlike single label classification, multilabel classification methods predict more than one class label at the same time. This paper compares two multilabel methods, binary relevance and classifier chain, for provisional BCS class prediction. Large data sets of permeability and solubility of drug and drug-like compounds were obtained from the literature and were used to build models using decision trees. The separate permeability and solubility models were validated, and a BCS validation set of 127 compounds where both permeability and solubility were known was used to compare the two aforementioned multilabel classification methods for provisional BCS class prediction. Overall, the results indicate that the classifier chain method, which takes into account label interactions, performed better compared to the binary relevance method. This work offers a comparison of multilabel methods and shows the potential of the classifier chain multilabel method for improved biological property predictions for use in drug discovery and development.
Collapse
Affiliation(s)
- Danielle Newby
- Medway School of Pharmacy, Universities of Kent and Greenwich , Chatham, Kent, ME4 4TB, U.K
| | | | | |
Collapse
|
45
|
Lavecchia A. Machine-learning approaches in drug discovery: methods and applications. Drug Discov Today 2014; 20:318-31. [PMID: 25448759 DOI: 10.1016/j.drudis.2014.10.012] [Citation(s) in RCA: 359] [Impact Index Per Article: 35.9] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2014] [Revised: 09/27/2014] [Accepted: 10/24/2014] [Indexed: 12/19/2022]
Abstract
During the past decade, virtual screening (VS) has evolved from traditional similarity searching, which utilizes single reference compounds, into an advanced application domain for data mining and machine-learning approaches, which require large and representative training-set compounds to learn robust decision rules. The explosive growth in the amount of public domain-available chemical and biological data has generated huge effort to design, analyze, and apply novel learning methodologies. Here, I focus on machine-learning techniques within the context of ligand-based VS (LBVS). In addition, I analyze several relevant VS studies from recent publications, providing a detailed view of the current state-of-the-art in this field and highlighting not only the problematic issues, but also the successes and opportunities for further advances.
Collapse
Affiliation(s)
- Antonio Lavecchia
- Department of Pharmacy, Drug Discovery Laboratory, University of Napoli 'Federico II', via D. Montesano 49, I-80131 Napoli, Italy.
| |
Collapse
|
46
|
Ferreira LA, Chervenak A, Placko S, Kestranek A, Madeira PP, Zaslavsky BY. Responses of polar organic compounds to different ionic environments in aqueous media are interrelated. Phys Chem Chem Phys 2014; 16:23347-54. [DOI: 10.1039/c4cp02084g] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]
|
47
|
McDonagh JL, Nath N, De Ferrari L, van Mourik T, Mitchell JBO. Uniting cheminformatics and chemical theory to predict the intrinsic aqueous solubility of crystalline druglike molecules. J Chem Inf Model 2014; 54:844-56. [PMID: 24564264 PMCID: PMC3965570 DOI: 10.1021/ci4005805] [Citation(s) in RCA: 51] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
![]()
We
present four models of solution free-energy prediction for druglike
molecules utilizing cheminformatics descriptors and theoretically
calculated thermodynamic values. We make predictions of solution free
energy using physics-based theory alone and using machine learning/quantitative
structure–property relationship (QSPR) models. We also develop
machine learning models where the theoretical energies and cheminformatics
descriptors are used as combined input. These models are used to predict
solvation free energy. While direct theoretical calculation does not
give accurate results in this approach, machine learning is able to
give predictions with a root mean squared error (RMSE) of ∼1.1
log S units in a 10-fold cross-validation for our
Drug-Like-Solubility-100 (DLS-100) dataset of 100 druglike molecules.
We find that a model built using energy terms from our theoretical
methodology as descriptors is marginally less predictive than one
built on Chemistry Development Kit (CDK) descriptors. Combining both
sets of descriptors allows a further but very modest improvement in
the predictions. However, in some cases, this is a statistically significant
enhancement. These results suggest that there is little complementarity
between the chemical information provided by these two sets of descriptors,
despite their different sources and methods of calculation. Our machine
learning models are also able to predict the well-known Solubility
Challenge dataset with an RMSE value of 0.9–1.0 log S units.
Collapse
Affiliation(s)
- James L McDonagh
- Biomedical Sciences Research Complex and ‡EaStCHEM, School of Chemistry, Purdie Building, University of St. Andrews , North Haugh, St. Andrews, Scotland , KY16 9ST, United Kingdom
| | | | | | | | | |
Collapse
|
48
|
Mitchell JBO. Machine learning methods in chemoinformatics. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE 2014; 4:468-481. [PMID: 25285160 PMCID: PMC4180928 DOI: 10.1002/wcms.1183] [Citation(s) in RCA: 248] [Impact Index Per Article: 24.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
Machine learning algorithms are generally developed in computer science or adjacent disciplines and find their way into chemical modeling by a process of diffusion. Though particular machine learning methods are popular in chemoinformatics and quantitative structure-activity relationships (QSAR), many others exist in the technical literature. This discussion is methods-based and focused on some algorithms that chemoinformatics researchers frequently use. It makes no claim to be exhaustive. We concentrate on methods for supervised learning, predicting the unknown property values of a test set of instances, usually molecules, based on the known values for a training set. Particularly relevant approaches include Artificial Neural Networks, Random Forest, Support Vector Machine, k-Nearest Neighbors and naïve Bayes classifiers.
Collapse
|
49
|
Abou-Gharbia M, Childers WE. Discovery of Innovative Therapeutics: Today’s Realities and Tomorrow’s Vision. 2. Pharma’s Challenges and Their Commitment to Innovation. J Med Chem 2014; 57:5525-53. [DOI: 10.1021/jm401564r] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Affiliation(s)
- Magid Abou-Gharbia
- Moulder
Center for Drug Discovery
Research, Temple University School of Pharmacy, 3307 North Broad Street, Philadelphia, Pennsylvania 19140, United States
| | - Wayne E. Childers
- Moulder
Center for Drug Discovery
Research, Temple University School of Pharmacy, 3307 North Broad Street, Philadelphia, Pennsylvania 19140, United States
| |
Collapse
|
50
|
Pramanik S, Roy K. Modeling bioconcentration factor (BCF) using mechanistically interpretable descriptors computed from open source tool "PaDEL-Descriptor". ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2014; 21:2955-2965. [PMID: 24170502 DOI: 10.1007/s11356-013-2247-z] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/29/2013] [Accepted: 10/14/2013] [Indexed: 05/27/2023]
Abstract
Predictive regression-based models for bioconcentration factor (BCF) have been developed using mechanistically interpretable descriptors computed from open source tool PaDEL-Descriptor ( http://padel.nus.edu.sg/software/padeldescriptor/ ). A data set of 522 diverse chemicals has been used for this modeling study, and extended topochemical atom (ETA) indices developed by the present authors' group were chosen as the descriptors. Due to the importance of lipohilicity in modeling BCF, XLogP (computed partition coefficient) was also tried as an additional descriptor. Genetic function approximation followed by multiple linear regression algorithm was applied to select descriptors, and subsequent partial least squares analyses were performed to establish mathematical equations for BCF prediction. The model generated from only ETA indices shows importance of seven descriptors in model development, while the model generated from ETA descriptors along with XlogP shows importance of four descriptors in model development. In general, BCF depends on lipophilicity, presence of heteroatoms, presence of halogens, fused ring system, hydrogen bonding groups, etc. The developed models show excellent statistical qualities and predictive ability. The developed models were used also for prediction of an external data set available from the literature, and good quality of predictions (R (2) pred = 0.812 and 0.826) was demonstrated. Thus, BCF can be predicted using ETA and XlogP descriptors calculated from open source PaDEL-Descriptor software in the context of aquatic chemical toxicity management.
Collapse
Affiliation(s)
- Subrata Pramanik
- Drug Theoretics and Cheminformatics Laboratory, Division of Medicinal and Pharmaceutical Chemistry, Department of Pharmaceutical Technology, Jadavpur University, Kolkata, 700 032, India
| | | |
Collapse
|