1
|
Bao Z, Tom G, Cheng A, Watchorn J, Aspuru-Guzik A, Allen C. Towards the prediction of drug solubility in binary solvent mixtures at various temperatures using machine learning. J Cheminform 2024; 16:117. [PMID: 39468626 PMCID: PMC11520512 DOI: 10.1186/s13321-024-00911-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2024] [Accepted: 09/28/2024] [Indexed: 10/30/2024] Open
Abstract
Drug solubility is an important parameter in the drug development process, yet it is often tedious and challenging to measure, especially for expensive drugs or those available in small quantities. To alleviate these challenges, machine learning (ML) has been applied to predict drug solubility as an alternative approach. However, the majority of existing ML research has focused on the predictions of aqueous solubility and/or solubility at specific temperatures, which restricts the model applicability in pharmaceutical development. To bridge this gap, we compiled a dataset of 27,000 solubility datapoints, including solubility of small molecules measured in a range of binary solvent mixtures under various temperatures. Next, a panel of ML models were trained on this dataset with their hyperparameters tuned using Bayesian optimization. The resulting top-performing models, both gradient boosted decision trees (light gradient boosting machine and extreme gradient boosting), achieved mean absolute errors (MAE) of 0.33 for LogS (S in g/100 g) on the holdout set. These models were further validated through a prospective study, wherein the solubility of four drug molecules were predicted by the models and then validated with in-house solubility experiments. This prospective study demonstrated that the models accurately predicted the solubility of solutes in specific binary solvent mixtures under different temperatures, especially for drugs whose features closely align within the solutes in the dataset (MAE < 0.5 for LogS). To support future research and facilitate advancements in the field, we have made the dataset and code openly available. Scientific contribution Our research advances the state-of-the-art in predicting solubility for small molecules by leveraging ML and a uniquely comprehensive dataset. Unlike existing ML studies that predominantly focus on solubility in aqueous solvents at fixed temperatures, our work enables prediction of drug solubility in a variety of binary solvent mixtures over a broad temperature range, providing practical insights on the modeling of solubility for realistic pharmaceutical applications. These advancements along with the open access dataset and code support significant steps in the drug development process including new molecule discovery, drug analysis and formulation.
Collapse
Affiliation(s)
- Zeqing Bao
- Leslie Dan Faculty of Pharmacy, University of Toronto, Toronto, ON, M5S 3M2, Canada
| | - Gary Tom
- Department of Chemistry, University of Toronto, Toronto, ON, M5S 3H6, Canada
- Department of Computer Science, University of Toronto, Toronto, ON, M5S 2E4, Canada
- Vector Institute for Artificial Intelligence, Toronto, ON, M5S 1M1, Canada
| | - Austin Cheng
- Department of Chemistry, University of Toronto, Toronto, ON, M5S 3H6, Canada
- Department of Computer Science, University of Toronto, Toronto, ON, M5S 2E4, Canada
- Vector Institute for Artificial Intelligence, Toronto, ON, M5S 1M1, Canada
| | | | - Alán Aspuru-Guzik
- Department of Chemistry, University of Toronto, Toronto, ON, M5S 3H6, Canada
- Department of Computer Science, University of Toronto, Toronto, ON, M5S 2E4, Canada
- Vector Institute for Artificial Intelligence, Toronto, ON, M5S 1M1, Canada
- Acceleration Consortium, Toronto, ON, M5S 3H6, Canada
- Lebovic Fellow, Canadian Institute for Advanced Research (CIFAR), Toronto, ON, M5S 1M1, Canada
- Department of Chemical Engineering and Applied Chemistry, University of Toronto, Toronto, ON, M5S 3E5, Canada
- Department of Materials Science and Engineering, University of Toronto, Toronto, ON, M5S 3E4, Canada
- CIFAR Artificial Intelligence Research Chair, Vector Institute, Toronto, ON, M5S 1M1, Canada
| | - Christine Allen
- Leslie Dan Faculty of Pharmacy, University of Toronto, Toronto, ON, M5S 3M2, Canada.
- Acceleration Consortium, Toronto, ON, M5S 3H6, Canada.
- Department of Chemical Engineering and Applied Chemistry, University of Toronto, Toronto, ON, M5S 3E5, Canada.
| |
Collapse
|
2
|
Nieoczym D, Banono NS, Stępnik K, Kaczor AA, Szybkowski P, Esguerra CV, Kukula-Koch W, Gawel K. In Silico Analysis, Anticonvulsant Activity, and Toxicity Evaluation of Schisandrin B in Zebrafish Larvae and Mice. Int J Mol Sci 2023; 24:12949. [PMID: 37629132 PMCID: PMC10455331 DOI: 10.3390/ijms241612949] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2023] [Revised: 08/16/2023] [Accepted: 08/16/2023] [Indexed: 08/27/2023] Open
Abstract
The aim of this study is to evaluate the anticonvulsant potential of schisandrin B, a main ingredient of Schisandra chinensis extracts. Schisandrin B showed anticonvulsant activity in the zebrafish larva pentylenetetrazole acute seizure assay but did not alter seizure thresholds in the intravenous pentylenetetrazole test in mice. Schisandrin B crosses the blood-brain barrier, which we confirmed in our in silico and in vivo analyses; however, the low level of its unbound fraction in the mouse brain tissue may explain the observed lack of anticonvulsant activity. Molecular docking revealed that the anticonvulsant activity of the compound in larval zebrafish might have been due to its binding to a benzodiazepine site within the GABAA receptor and/or the inhibition of the glutamate NMDA receptor. Although schisandrin B showed a beneficial anticonvulsant effect, toxicological studies revealed that it caused serious developmental impairment in zebrafish larvae, underscoring its teratogenic properties. Further detailed studies are needed to precisely identify the properties, pharmacological effects, and safety of schisandrin B.
Collapse
Affiliation(s)
- Dorota Nieoczym
- Department of Animal Physiology and Pharmacology, Institute of Biological Sciences, Faculty of Biology and Biotechnology, Maria Curie-Skłodowska University, Akademicka 19, 20-033 Lublin, Poland
| | - Nancy Saana Banono
- Chemical Neuroscience Group, Centre for Molecular Medicine Norway, University of Oslo, Gaustadalleen 21, Forskningsparken, 0349 Oslo, Norway; (N.S.B.); (C.V.E.)
| | - Katarzyna Stępnik
- Department of Physical Chemistry, Institute of Chemical Sciences, Faculty of Chemistry, Maria Curie-Skłodowska University, Pl. M. Curie-Skłodowskiej 3/243, 20-031 Lublin, Poland;
| | - Agnieszka A. Kaczor
- Department of Synthesis and Chemical Technology of Pharmaceutical Substances with Computer Modeling Laboratory, Faculty of Pharmacy, Medical University of Lublin, 4A Chodzki St., 20-093 Lublin, Poland;
| | - Przemysław Szybkowski
- Department of Experimental and Clinical Pharmacology, Medical University of Lublin, Jaczewskiego St. 8b, 20-090 Lublin, Poland;
- Clinical Provincial Hospital No. 2 St. Jadwiga Krolowej in Rzeszow, Lwowska St. 60, 35-301 Rzeszow, Poland
| | - Camila Vicencio Esguerra
- Chemical Neuroscience Group, Centre for Molecular Medicine Norway, University of Oslo, Gaustadalleen 21, Forskningsparken, 0349 Oslo, Norway; (N.S.B.); (C.V.E.)
| | - Wirginia Kukula-Koch
- Department of Pharmacognosy with Medicinal Plants Garden, Medical University of Lublin, Chodźki St. 1, 20-093 Lublin, Poland;
| | - Kinga Gawel
- Department of Experimental and Clinical Pharmacology, Medical University of Lublin, Jaczewskiego St. 8b, 20-090 Lublin, Poland;
| |
Collapse
|
3
|
Ma CY, Moldovan AA, Maloney AGP, Roberts KJ. Exploring the CSD Drug Subset: An Analysis of Lattice Energies and Constituent Intermolecular Interactions for the Crystal Structures of Pharmaceuticals. J Pharm Sci 2023; 112:435-445. [PMID: 36462705 DOI: 10.1016/j.xphs.2022.11.027] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2022] [Revised: 11/25/2022] [Accepted: 11/25/2022] [Indexed: 12/02/2022]
Abstract
Intermolecular (synthonic) modelling is used for a statistical analysis of crystal lattice energies, together with their contributing intermolecular interactions for the crystallographic structures selected from the CCDC's Drug Subset (https://doi.org/10.1016/j.xphs.2018.12.011). Analysis of this selected subset reveal similarities in packing compared to other organic crystals in the CSD with linear relationships between molecular weight and unit cell volume, void space, and packing coefficient. Crystal lattice energy calculations converge within a 30 Å intermolecular radius characterised by a mean lattice energy of ca. -36 kcal mol-1 with ca. 85% and 15% due to dispersive and electrostatic interactions, respectively. The distribution of the strongest synthons within the individual structures reveals an average strength of -5.79 kcal mol-1. The diversity of chemical space within the drug molecules is in agreement with the analysis of atom types across the selected subset with phenyl groups being found to contribute the highest mean energy of -11.28 kcal mol-1, highlighting the importance of aromatic interactions within pharmaceutical compounds. Despite an initial focus on Z' = 1 structures, this automated approach enables rapid and consistent quantitative analysis of lattice energy, synthon strength and functional group contributions, providing solid-form informatics for pharmaceutical R&D and a helpful basis for further investigations.
Collapse
Affiliation(s)
- Cai Y Ma
- Centre for the Digital Design of Drug Products, School of Chemical and Process Engineering, University of Leeds, Leeds, LS2 9JT, UK.
| | - Alexandru A Moldovan
- The Cambridge Crystallographic Data Centre, 12 Union Road, Cambridge, CB2 1EZ, UK
| | - Andrew G P Maloney
- The Cambridge Crystallographic Data Centre, 12 Union Road, Cambridge, CB2 1EZ, UK
| | - Kevin J Roberts
- Centre for the Digital Design of Drug Products, School of Chemical and Process Engineering, University of Leeds, Leeds, LS2 9JT, UK
| |
Collapse
|
4
|
Lai TT, Kuntz D, Wilson AK. Molecular Screening and Toxicity Estimation of 260,000 Perfluoroalkyl and Polyfluoroalkyl Substances (PFASs) through Machine Learning. J Chem Inf Model 2022; 62:4569-4578. [PMID: 36154169 DOI: 10.1021/acs.jcim.2c00374] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Perfluoroalkyl and polyfluoroalkyl substances (PFASs) are a class of chemicals widely used in industrial applications due to their exceptional properties and stability. However, they do not readily degrade in the environment and are linked to contamination and adverse health effects in humans and wildlife. To find alternatives for the most commonly used PFAS molecules that maintain their desirable chemical properties but are not adverse to biological lifeforms, a novel approach based upon machine learning is utilized. The machine learning model is trained on an existing set of PFAS molecules to generate over 260,000 novel PFAS molecules, which we dub PFAS-AI-Gen. Using molecular descriptors with known relationships to toxicity and industrial suitability followed by molecular docking and molecular dynamics simulations, this set of molecules is screened. In this manner, increasingly complex calculations are performed only for candidate molecules that are most likely to yield the desired properties of low binding affinity toward two selected protein receptors, the human pregnane x receptor (hPXR) and peroxisome proliferator-activated receptor γ (PPAR-γ), and high industrial suitability, defined by critical micelle concentration (CMC). The selection criteria of low binding affinity and high industrial suitability are relative to the popular PFAS alternative GenX. hPXR and PPAR-γ are selected as they are PFAS targets and facilitate a variety of functions, such as drug metabolism and glucose regulation, respectively. Through this approach, 22 promising new PFAS substitutes that may warrant experimental investigation are identified. This integrated approach of molecular screening and toxicity estimation may be applicable to other chemical classes.
Collapse
Affiliation(s)
- Thanh T Lai
- Department of Chemistry, Michigan State University, East Lansing, Michigan 48823, United States
| | - David Kuntz
- Department of Chemistry, Michigan State University, East Lansing, Michigan 48823, United States
| | - Angela K Wilson
- Department of Chemistry, Michigan State University, East Lansing, Michigan 48823, United States
| |
Collapse
|
5
|
Wellawatte GP, Seshadri A, White AD. Model agnostic generation of counterfactual explanations for molecules. Chem Sci 2022; 13:3697-3705. [PMID: 35432902 PMCID: PMC8966631 DOI: 10.1039/d1sc05259d] [Citation(s) in RCA: 34] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2021] [Accepted: 02/06/2022] [Indexed: 11/25/2022] Open
Abstract
An outstanding challenge in deep learning in chemistry is its lack of interpretability. The inability of explaining why a neural network makes a prediction is a major barrier to deployment of AI models. This not only dissuades chemists from using deep learning predictions, but also has led to neural networks learning spurious correlations that are difficult to notice. Counterfactuals are a category of explanations that provide a rationale behind a model prediction with satisfying properties like providing chemical structure insights. Yet, counterfactuals have been previously limited to specific model architectures or required reinforcement learning as a separate process. In this work, we show a universal model-agnostic approach that can explain any black-box model prediction. We demonstrate this method on random forest models, sequence models, and graph neural networks in both classification and regression.
Collapse
Affiliation(s)
| | - Aditi Seshadri
- Department of Chemical Engineering, University of Rochester Rochester NY USA
| | - Andrew D White
- Department of Chemical Engineering, University of Rochester Rochester NY USA
| |
Collapse
|
6
|
Ma X, Sui H, Sun X, Ali MM, Debrah AA, Du Z. A risk classification strategy for migrants of food contact material combined with three (Q)SAR tools in silico. JOURNAL OF HAZARDOUS MATERIALS 2021; 419:126422. [PMID: 34182426 DOI: 10.1016/j.jhazmat.2021.126422] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/13/2021] [Revised: 06/15/2021] [Accepted: 06/15/2021] [Indexed: 06/13/2023]
Abstract
The chemical constituents in food contact materials (FCMs) may transfer into food during the contact, which may pose potential risk to humans. So, it is important to evaluate the safety of FCMs. Due to the advantages of cost-effectiveness and high throughput, (Q)SAR tools have been gradually used for risk assessment. In this work, a risk classification strategy for migrants of food contact materials combined with three (Q)SAR tools was developed based on a single endpoint (Mutagenicity) assessment and risk matrix approach, respectively. 419 migrants existing in a self-built toxicology database beneficial from Python crawler technology were evaluated. 5 toxic hazard ranks and 4 risk ranks were obtained for single endpoint assessment and risk matrix respectively, with 21 substances assigned as Toxic hazard Class I and 43 substances assigned as RISK Ⅰ which need the highest safety concern. Besides, for the Toxic hazard Class I substances assessed by the single endpoint, 19 of them were confirmed experimentally, and all of them were overlapped in the RISK Ⅰ substances, which suggests the effectiveness and reliability of this strategy.
Collapse
Affiliation(s)
- Xin Ma
- College of Chemistry, Beijing University of Chemical Technology, Beijing 100029, China
| | - Haixia Sui
- China National Center for Food Safety Risk Assessment, Beijing 100022, China
| | - Xuechun Sun
- College of Chemistry, Beijing University of Chemical Technology, Beijing 100029, China
| | - Muhammad Mujahid Ali
- College of Chemistry, Beijing University of Chemical Technology, Beijing 100029, China
| | - Augustine Atta Debrah
- College of Chemistry, Beijing University of Chemical Technology, Beijing 100029, China
| | - Zhenxia Du
- College of Chemistry, Beijing University of Chemical Technology, Beijing 100029, China; Beijing Key Laboratory of Environmentally Harmful Chemical Analysis, Beijing University of Chemical Technology, Beijing 100029, China.
| |
Collapse
|
7
|
Gao H, Jiang Y, Zhan J, Sun Y. Pharmacophore-based drug design of AChE and BChE dual inhibitors as potential anti-Alzheimer's disease agents. Bioorg Chem 2021; 114:105149. [PMID: 34252860 DOI: 10.1016/j.bioorg.2021.105149] [Citation(s) in RCA: 26] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2021] [Revised: 05/25/2021] [Accepted: 07/02/2021] [Indexed: 12/30/2022]
Abstract
For the Alzheimer's disease (AD) with complex pathogenesis, single target drugs represent one of the most effective therapeutic strategies in clinical. However, the traditional concept of "a disease, a target" is difficult to find very effective drugs, and multi-target drugs have already become new hot spot in drug development for this disease. In our present study, our efforts toward discovering new cholinesterase (ChE) inhibitors aided by computational methods will provide useful information as anti-AD agents in the future. The best 3D-QSAR acetylcholinesterase (AChE) and butyrylcholinesterase (BChE) inhibitors pharmacophore hypotheses Hypo1 A and Hypo1 B were generated and validated by HypoGen program in Discovery Studio 2016 based on the training set of flavonoids, and then they were used as 3D query for screening the ZINC database. Next, the hit molecules were then subjected to the ADMET and molecular docking study to prioritize the compounds. Finally, 6 compounds showed good estimated activities and promising ADMET properties. The result of best compound ZINC08751495 with AChE estimate activity (0.028), BChE estimate activity (1.55), AChE fit value (9.369), BChE fit value (8.415), AChE -CDOCKER ENERGY (30.22), BChE -CDOCKER ENERGY (33.13) has the potential for further development as a supplement to treat Alzheimer's disease.
Collapse
Affiliation(s)
- Hongwei Gao
- School of Life Science, Ludong University, Yantai, Shandong 264025, China.
| | - Yingying Jiang
- Key Laboratory of Plant Resources and Chemistry in Arid Regions, Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Sciences, Urumqi 830011, China
| | - Jiuyu Zhan
- School of Life Science, Ludong University, Yantai, Shandong 264025, China
| | - Yingni Sun
- School of Life Science, Ludong University, Yantai, Shandong 264025, China
| |
Collapse
|
8
|
Stępnik K. Biomimetic Chromatographic Studies Combined with the Computational Approach to Investigate the Ability of Triterpenoid Saponins of Plant Origin to Cross the Blood-Brain Barrier. Int J Mol Sci 2021; 22:3573. [PMID: 33808219 PMCID: PMC8037809 DOI: 10.3390/ijms22073573] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2021] [Revised: 03/23/2021] [Accepted: 03/26/2021] [Indexed: 01/03/2023] Open
Abstract
Biomimetic (non-cell based in vitro) and computational (in silico) studies are commonly used as screening tests in laboratory practice in the first stages of an experiment on biologically active compounds (potential drugs) and constitute an important step in the research on the drug design process. The main aim of this study was to evaluate the ability of triterpenoid saponins of plant origin to cross the blood-brain barrier (BBB) using both computational methods, including QSAR methodology, and biomimetic chromatographic methods, i.e., High Performance Liquid Chromatography (HPLC) with Immobilized Artificial Membrane (IAM) and cholesterol (CHOL) stationary phases, as well as Bio-partitioning Micellar Chromatography (BMC). The tested compounds were as follows: arjunic acid (Terminalia arjuna), akebia saponin D (Akebia quinata), bacoside A (Bacopa monnieri) and platycodin D (Platycodon grandiflorum). The pharmacokinetic BBB parameters calculated in silico show that three of the four substances, i.e., arjunic acid, akebia saponin D, and bacoside A exhibit similar values of brain/plasma equilibration rate expressed as logPSFubrain (the average logPSFubrain: -5.03), whereas the logPSFubrain value for platycodin D is -9.0. Platycodin D also shows the highest value of the unbound fraction in the brain obtained using the examined compounds (0.98). In these studies, it was found out for the first time that the logarithm of the analyte-micelle association constant (logKMA) calculated based on Foley's equation can describe the passage of substances through the BBB. The most similar logBB values were obtained for hydrophilic platycodin D, applying both biomimetic and computational methods. All of the obtained logBB values and physicochemical parameters of the molecule indicate that platycodin D does not cross the BBB (the average logBB: -1.681), even though the in silico estimated value of the fraction unbound in plasma is relatively high (0.52). As far as it is known, this is the first paper that shows the applicability of biomimetic chromatographic methods in predicting the penetration of triterpenoid saponins through the BBB.
Collapse
Affiliation(s)
- Katarzyna Stępnik
- Department of Physical Chemistry, Institute of Chemical Sciences, Faculty of Chemistry, Maria Curie-Sklodowska University in Lublin, 20-031 Lublin, Poland
| |
Collapse
|
9
|
Falcón-Cano G, Molina C, Cabrera-Pérez MÁ. ADME prediction with KNIME: In silico aqueous solubility consensus model based on supervised recursive random forest approaches. ADMET AND DMPK 2020; 8:251-273. [PMID: 35300309 PMCID: PMC8915604 DOI: 10.5599/admet.852] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2020] [Revised: 08/01/2020] [Indexed: 12/12/2022] Open
Abstract
In-silico prediction of aqueous solubility plays an important role during the drug discovery and development processes. For many years, the limited performance of in-silico solubility models has been attributed to the lack of high-quality solubility data for pharmaceutical molecules. However, some studies suggest that the poor accuracy of solubility prediction is not related to the quality of the experimental data and that more precise methodologies (algorithms and/or set of descriptors) are required for predicting aqueous solubility for pharmaceutical molecules. In this study a large and diverse database was generated with aqueous solubility values collected from two public sources; two new recursive machine-learning approaches were developed for data cleaning and variable selection, and a consensus model based on regression and classification algorithms was created. The modeling protocol, which includes the curation of chemical and experimental data, was implemented in KNIME, with the aim of obtaining an automated workflow for the prediction of new databases. Finally, we compared several methods or models available in the literature with our consensus model, showing results comparable or even outperforming previous published models.
Collapse
Affiliation(s)
- Gabriela Falcón-Cano
- Unit of Modeling and Experimental Biopharmaceutics. Centro de Bioactivos Químicos. Universidad Central “Marta Abreu” de las Villas. Santa Clara 54830, Villa Clara, Cuba
| | | | - Miguel Ángel Cabrera-Pérez
- Unit of Modeling and Experimental Biopharmaceutics. Centro de Bioactivos Químicos. Universidad Central “Marta Abreu” de las Villas. Santa Clara 54830, Villa Clara, Cuba
- Department of Pharmacy and Pharmaceutical Technology, University of Valencia, Burjassot 46100, Valencia, Spain
- Department of Engineering, Area of Pharmacy and Pharmaceutical Technology, Miguel Hernández University, 03550 Sant Joan d'Alacant, Alicante, Spain
| |
Collapse
|
10
|
Das T, Mehta CH, Nayak UY. Multiple approaches for achieving drug solubility: an in silico perspective. Drug Discov Today 2020; 25:1206-1212. [PMID: 32353425 DOI: 10.1016/j.drudis.2020.04.016] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2020] [Revised: 04/12/2020] [Accepted: 04/21/2020] [Indexed: 12/11/2022]
Abstract
Discovering new therapeutically active molecules is the ultimate destination in pharmaceutical research and development. Most drugs discovered are lipophilic and, hence, exhibit poor aqueous solubility, resulting in low bioavailability. Thus, there is a need to use various solubility enhancement techniques. Computational approaches enable the analysis of drug-carrier interactions or the numerous conformational changes in the carrier matrix that might establish an appropriate balance between cohesive and adhesive stability in a formulation. In this review, we discuss research approaches that provided molecular insight into drugs and their modifiers to unravel their solubility, stability, and bioavailability.
Collapse
Affiliation(s)
- Torsa Das
- Department of Pharmaceutics, Manipal College of Pharmaceutical Sciences, Manipal Academy of Higher Education, Manipal 576104, Karnataka, India
| | - Chetan H Mehta
- Department of Pharmaceutics, Manipal College of Pharmaceutical Sciences, Manipal Academy of Higher Education, Manipal 576104, Karnataka, India
| | - Usha Y Nayak
- Department of Pharmaceutics, Manipal College of Pharmaceutical Sciences, Manipal Academy of Higher Education, Manipal 576104, Karnataka, India.
| |
Collapse
|
11
|
Maharao N, Antontsev V, Wright M, Varshney J. Entering the era of computationally driven drug development. Drug Metab Rev 2020; 52:283-298. [PMID: 32083960 DOI: 10.1080/03602532.2020.1726944] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
Historically, failure rates in drug development are high; increased sophistication and investment throughout the process has shifted the reasons for attrition, but the overall success rates have remained stubbornly and consistently low. Only 8% of new entities entering clinical testing gain regulatory approval, indicating that significant obstacles still exist for efficient therapeutic development. The continued high failure rate can be partially attributed to the inability to link drug exposure with the magnitude of observed safety and efficacy-related pharmacodynamic (PD) responses; frequently, this is a result of nonclinical models exhibiting poor prediction of human outcomes across a wide range of disease conditions, resulting in faulty evaluation of drug toxicology and efficacy. However, the increasing quality and standardization of experimental methods in preclinical stages of testing has created valuable data sets within companies that can be leveraged to further improve the efficiency and accuracy of preclinical prediction for both pharmacokinetics (PK) and PD. Models of Quantitative structure-activity relationships (QSAR), physiologically based pharmacokinetics (PBPK), and PK/PD relationships have also improved efficiency. Founded on a core understanding of biochemistry and physiological interactions of xenobiotics, these in silico methods have the potential to increase the probability of compound success in clinical trials. Integration of traditional computational methods with machine-learning approaches and existing internal pharma databases stands to make a fundamental impact on the speed and accuracy of predictions during the process of drug development and approval.
Collapse
|
12
|
Cui Q, Lu S, Ni B, Zeng X, Tan Y, Chen YD, Zhao H. Improved Prediction of Aqueous Solubility of Novel Compounds by Going Deeper With Deep Learning. Front Oncol 2020; 10:121. [PMID: 32117768 PMCID: PMC7026387 DOI: 10.3389/fonc.2020.00121] [Citation(s) in RCA: 36] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2019] [Accepted: 01/23/2020] [Indexed: 12/22/2022] Open
Abstract
Aqueous solubility is an important physicochemical property of compounds in anti-cancer drug discovery. Artificial intelligence solubility prediction tools have scored impressive performances by employing regression, machine learning, and deep learning methods. The reported performances vary significantly partly because of the different datasets used. Solubility prediction on novel compounds needs to be improved, which may be achieved by going deeper with deep learning. We constructed deeper-net models of ~20-layer modified ResNet convolutional neural network architecture, which were trained and tested with 9,943 compounds encoded by molecular fingerprints. Retrospectively tested by 62 recently-published novel compounds, one deeper-net model outperformed four established tools, shallow-net models, and four human experts. Deeper-net models also outperformed others in predicting the solubility values of a series of novel compounds newly-synthesized for anti-cancer drug discovery. Solubility prediction may be improved by going deeper with deep learning. Our deeper-net models are accessible at http://www.npbdb.net/solubility/index.jsp.
Collapse
Affiliation(s)
- Qiuji Cui
- School of Science, China Pharmaceutical University, Nanjing, China
| | - Shuai Lu
- School of Science, China Pharmaceutical University, Nanjing, China
| | - Bingwei Ni
- School of Science, China Pharmaceutical University, Nanjing, China
| | - Xian Zeng
- Department of Biological Medicine, School of Pharmacy, Fudan University, Shanghai, China
| | - Ying Tan
- The State Key Laboratory of Chemical Oncogenomics, Key Laboratory of Chemical Biology, Shenzhen Technology and Engineering Laboratory for Personalized Cancer Diagnostics and Therapeutics, The Graduate School at Shenzhen, Shenzhen Kivita Innovative Drug Discovery Institute, Tsinghua University, Shenzhen, China
| | - Ya Dong Chen
- School of Science, China Pharmaceutical University, Nanjing, China
| | - Hongping Zhao
- School of Science, China Pharmaceutical University, Nanjing, China
| |
Collapse
|
13
|
Wang N, Xin H, Xu P, Yu Z, Shou D. Erxian Decoction Attenuates TNF-α Induced Osteoblast Apoptosis by Modulating the Akt/Nrf2/HO-1 Signaling Pathway. Front Pharmacol 2019; 10:988. [PMID: 31551787 PMCID: PMC6748068 DOI: 10.3389/fphar.2019.00988] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2018] [Accepted: 07/31/2019] [Indexed: 12/20/2022] Open
Abstract
Erxian decoction (EXD), a traditional Chinese medicine formula, has been used for treatment of osteoporosis for many years. The purpose of this study was to investigate the pharmacological effect of EXD in preventing osteoblast apoptosis and the underlying mechanism of prevention. Putative targets of EXD were predicted by network pharmacology, and functional and pathway enrichment analyses were also performed. Evaluations of bone mineral density, serum estradiol level, trabecular area fraction, serum calcium levels, and tumor necrosis factor (TNF)-α levels in ovariectomized rats, as well as cell proliferation assays, apoptosis assays, and western blotting in MC3T3-E1 osteoblasts were performed for further experimental validation. Ninety-three active ingredients in the EXD formula and 259 potential targets were identified. Functional and pathway enrichment analyses indicated that EXD significantly influenced the PI3K-Akt signaling pathway. In vivo experiments indicated that EXD treatment attenuated bone loss and decreased TNF-α levels in rats with osteoporosis. In vitro experiments showed that EXD treatment increased cell viability markedly and decreased levels of caspase-3 and the rate of apoptosis. It also promoted phosphorylation of Akt, nuclear translocation of transcription factor NF-erythroid 2-related factor (Nrf2), and hemeoxygenase-1 (HO-1) expression in TNF-α-induced MC3T3-E1 cells. Our results suggest that EXD exerted profound anti-osteoporosis effects, at least partially by reducing production of TNF-α and attenuating osteoblast apoptosis via Akt/Nrf2/HO-1 signaling pathway.
Collapse
Affiliation(s)
- Nani Wang
- Department of Medicine, Tongde Hospital of Zhejiang Province, Hangzhou, China
- School of Pharmacy, Zhejiang Chinese Medical University, China
| | - Hailiang Xin
- School of Pharmacy, Second Military Medical University, China
| | - Pingcui Xu
- Department of Medicine, Tongde Hospital of Zhejiang Province, Hangzhou, China
- School of Pharmacy, Zhejiang Chinese Medical University, China
| | - Zhongming Yu
- Department of Medicine, Tongde Hospital of Zhejiang Province, Hangzhou, China
| | - Dan Shou
- Department of Medicine, Tongde Hospital of Zhejiang Province, Hangzhou, China
| |
Collapse
|
14
|
Passeri GI, Trisciuzzi D, Alberga D, Siragusa L, Leonetti F, Mangiatordi GF, Nicolotti O. Strategies of Virtual Screening in Medicinal Chemistry. ACTA ACUST UNITED AC 2018. [DOI: 10.4018/ijqspr.2018010108] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Virtual screening represents an effective computational strategy to rise-up the chances of finding new bioactive compounds by accelerating the time needed to move from an initial intuition to market. Classically, the most pursued approaches rely on ligand- and structure-based studies, the former employed when structural data information about the target is missing while the latter employed when X-ray/NMR solved or homology models are instead available for the target. The authors will focus on the most advanced techniques applied in this area. In particular, they will survey the key concepts of virtual screening by discussing how to properly select chemical libraries, how to make database curation, how to applying and- and structure-based techniques, how to wisely use post-processing methods. Emphasis will be also given to the most meaningful databases used in VS protocols. For the ease of discussion several examples will be presented.
Collapse
Affiliation(s)
| | - Daniela Trisciuzzi
- Dipartimento di Farmacia-Scienze del Farmaco, Università degli Studi di Bari “Aldo Moro”, Bari, Italy
| | - Domenico Alberga
- Dipartimento di Farmacia-Scienze del Farmaco, Università degli Studi di Bari “Aldo Moro”, Bari, Italy
| | - Lydia Siragusa
- Molecular Discovery Ltd., Pinner, Middlesex, London, United Kingdom
| | - Francesco Leonetti
- Dipartimento di Farmacia-Scienze del Farmaco, Università degli Studi di Bari “Aldo Moro”, Bari, Italy
| | - Giuseppe F. Mangiatordi
- Dipartimento di Farmacia-Scienze del Farmaco, Università degli Studi di Bari “Aldo Moro”, Bari, Italy
| | | |
Collapse
|
15
|
Gozalbes R, Vicente de Julián-Ortiz J. Applications of Chemoinformatics in Predictive Toxicology for Regulatory Purposes, Especially in the Context of the EU REACH Legislation. ACTA ACUST UNITED AC 2018. [DOI: 10.4018/ijqspr.2018010101] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Chemoinformatics methodologies such as QSAR/QSPR have been used for decades in drug discovery projects, especially for the finding of new compounds with therapeutic properties and the optimization of ADME properties on chemical series. The application of computational techniques in predictive toxicology is much more recent, and they are experiencing an increasingly interest because of the new legal requirements imposed by national and international regulations. In the pharmaceutical field, the US Food and Drug Administration (FDA) support the use of predictive models for regulatory decision-making when assessing the genotoxic and carcinogenic potential of drug impurities. In Europe, the REACH legislation promotes the use of QSAR in order to reduce the huge amount of animal testing needed to demonstrate the safety of new chemical entities subjected to registration, provided they meet specific conditions to ensure their quality and predictive power. In this review, the authors summarize the state of art of in silico methods for regulatory purposes, with especial emphasis on QSAR models.
Collapse
|
16
|
Yin Y, Xu C, Gu S, Li W, Liu G, Tang Y. Quantitative Regression Models for the Prediction of Chemical Properties by an Efficient Workflow. Mol Inform 2016; 34:679-88. [PMID: 27490968 DOI: 10.1002/minf.201400119] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2014] [Accepted: 03/10/2015] [Indexed: 11/08/2022]
Abstract
Rapid safety assessment is more and more needed for the increasing chemicals both in chemical industries and regulators around the world. The traditional experimental methods couldn't meet the current demand any more. With the development of the information technology and the growth of experimental data, in silico modeling has become a practical and rapid alternative for the assessment of chemical properties, especially for the toxicity prediction of organic chemicals. In this study, a quantitative regression workflow was built by KNIME to predict chemical properties. With this regression workflow, quantitative values of chemical properties can be obtained, which is different from the binary-classification model or multi-classification models that can only give qualitative results. To illustrate the usage of the workflow, two predictive models were constructed based on datasets of Tetrahymena pyriformis toxicity and Aqueous solubility. The qcv (2) and qtest (2) of 5-fold cross validation and external validation for both types of models were greater than 0.7, which implies that our models are robust and reliable, and the workflow is very convenient and efficient in prediction of various chemical properties.
Collapse
Affiliation(s)
- Yongmin Yin
- Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai 200237, P.R. China tel: +86-21-64250811; fax: +86-21-64251033
| | - Congying Xu
- Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai 200237, P.R. China tel: +86-21-64250811; fax: +86-21-64251033
| | - Shikai Gu
- Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai 200237, P.R. China tel: +86-21-64250811; fax: +86-21-64251033
| | - Weihua Li
- Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai 200237, P.R. China tel: +86-21-64250811; fax: +86-21-64251033
| | - Guixia Liu
- Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai 200237, P.R. China tel: +86-21-64250811; fax: +86-21-64251033.
| | - Yun Tang
- Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai 200237, P.R. China tel: +86-21-64250811; fax: +86-21-64251033.
| |
Collapse
|
17
|
Phifer JR, Solomon KJ, Young KL, Paluch AS. Computing MOSCED parameters of nonelectrolyte solids with electronic structure methods in SMD and SM8 continuum solvents. AIChE J 2016. [DOI: 10.1002/aic.15413] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Affiliation(s)
- Jeremy R. Phifer
- Dept. of Chemical, Paper and Biomedical Engineering; Miami University; Oxford OH 45056
| | - Kimberly J. Solomon
- Dept. of Chemical, Paper and Biomedical Engineering; Miami University; Oxford OH 45056
| | - Kayla L. Young
- Dept. of Chemical, Paper and Biomedical Engineering; Miami University; Oxford OH 45056
| | - Andrew S. Paluch
- Dept. of Chemical, Paper and Biomedical Engineering; Miami University; Oxford OH 45056
| |
Collapse
|
18
|
Gilad Y, Nadassy K, Senderowitz H. A reliable computational workflow for the selection of optimal screening libraries. J Cheminform 2015; 7:61. [PMID: 26692904 PMCID: PMC4676138 DOI: 10.1186/s13321-015-0108-0] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2015] [Accepted: 11/24/2015] [Indexed: 11/10/2022] Open
Abstract
Background The experimental screening of compound collections is a common starting point in many drug discovery projects. Successes of such screening campaigns critically depend on the quality of the screened library. Many libraries are currently available from different vendors yet the selection of the optimal screening library for a specific project is challenging. We have devised a novel workflow for the rational selection of project-specific screening libraries. Results The workflow accepts as input a set of virtual candidate libraries and applies the following steps to each library: (1) data curation; (2) assessment of ADME/T profile; (3) assessment of the number of promiscuous binders/frequent HTS hitters; (4) assessment of internal diversity; (5) assessment of similarity to known active compound(s) (optional); (6) assessment of similarity to in-house or otherwise accessible compound collections (optional). For ADME/T profiling, Lipinski’s and Veber’s rule-based filters were implemented and a new blood brain barrier permeation model was developed and validated (85 and 74 % success rate for training set and test set, respectively). Diversity and similarity descriptors which demonstrated best performances in terms of their ability to select either diverse or focused sets of compounds from three databases (Drug Bank, CMC and CHEMBL) were identified and used for diversity and similarity assessments. The workflow was used to analyze nine common screening libraries available from six vendors. The results of this analysis are reported for each library providing an assessment of its quality. Furthermore, a consensus approach was developed to combine the results of these analyses into a single score for selecting the optimal library under different scenarios. Conclusions We have devised and tested a new workflow for the rational selection of screening libraries under different scenarios. The current workflow was implemented using the Pipeline Pilot software yet due to the usage of generic components, it can be easily adapted and reproduced by computational groups interested in rational selection of screening libraries. Furthermore, the workflow could be readily modified to include additional components. This workflow has been routinely used in our laboratory for the selection of libraries in multiple projects and consistently selects libraries which are well balanced across multiple parameters.. ![]() Electronic supplementary material The online version of this article (doi:10.1186/s13321-015-0108-0) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Yocheved Gilad
- Department of Chemistry, Bar-Ilan University, Ramat-Gan, 52900 Israel
| | - Katalin Nadassy
- Dassault Systèmes BIOVIA, 334 Cambridge Science Park, Cambridge, CB4 0WN UK
| | | |
Collapse
|
19
|
Raevsky OA, Polianczyk DE, Grigorev VY, Raevskaja OE, Dearden JC. In silico Prediction of Aqueous Solubility: a Comparative Study of Local and Global Predictive Models. Mol Inform 2015; 34:417-30. [PMID: 27490387 DOI: 10.1002/minf.201400144] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2014] [Accepted: 03/05/2015] [Indexed: 11/07/2022]
Abstract
32 Quantitative Structure-Property Relationship (QSPR) models were constructed for prediction of aqueous intrinsic solubility of liquid and crystalline chemicals. Data sets contained 1022 liquid and 2615 crystalline compounds. Multiple Linear Regression (MLR), Support Vector Machine (SVM) and Random Forest (RF) methods were used to construct global models, and k-nearest neighbour (kNN), Arithmetic Mean Property (AMP) and Local Regression Property (LoReP) were used to construct local models. A set of the best QSPR models was obtained: for liquid chemicals with RMSE (root mean square error) of prediction in the range 0.50-0.60 log unit; for crystalline chemicals 0.80-0.90 log unit. In the case of global models the large number of descriptors makes mechanistic interpretation difficult. The local models use only one or two descriptors, so that a medicinal chemist working with sets of structurally-related chemicals can readily estimate their solubility. However, construction of stable local models requires the presence of closely related neighbours for each chemical considered. It is probable that a consensus of global and local QSPR models will be the optimal approach for construction of stable predictive QSPR models with mechanistic interpretation.
Collapse
Affiliation(s)
- Oleg A Raevsky
- Department of Computer-Aided Molecular Design, Institute of Physiologically Active Compounds, Russian Academy of Science, 142432, Russia, Chernogolovka, Severniy proezd 1 phone: +7 496 52 21867.
| | - Daniel E Polianczyk
- Department of Computer-Aided Molecular Design, Institute of Physiologically Active Compounds, Russian Academy of Science, 142432, Russia, Chernogolovka, Severniy proezd 1 phone: +7 496 52 21867
| | - Veniamin Yu Grigorev
- Department of Computer-Aided Molecular Design, Institute of Physiologically Active Compounds, Russian Academy of Science, 142432, Russia, Chernogolovka, Severniy proezd 1 phone: +7 496 52 21867
| | - Olga E Raevskaja
- Department of Computer-Aided Molecular Design, Institute of Physiologically Active Compounds, Russian Academy of Science, 142432, Russia, Chernogolovka, Severniy proezd 1 phone: +7 496 52 21867
| | - John C Dearden
- School of Pharmacy and Biomolecular Sciences, Liverpool John Moores University, Liverpool L3 3AF, UK
| |
Collapse
|
20
|
Newby D, Freitas AA, Ghafourian T. Comparing multilabel classification methods for provisional biopharmaceutics class prediction. Mol Pharm 2014; 12:87-102. [PMID: 25397721 DOI: 10.1021/mp500457t] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Abstract
The biopharmaceutical classification system (BCS) is now well established and utilized for the development and biowaivers of immediate oral dosage forms. The prediction of BCS class can be carried out using multilabel classification. Unlike single label classification, multilabel classification methods predict more than one class label at the same time. This paper compares two multilabel methods, binary relevance and classifier chain, for provisional BCS class prediction. Large data sets of permeability and solubility of drug and drug-like compounds were obtained from the literature and were used to build models using decision trees. The separate permeability and solubility models were validated, and a BCS validation set of 127 compounds where both permeability and solubility were known was used to compare the two aforementioned multilabel classification methods for provisional BCS class prediction. Overall, the results indicate that the classifier chain method, which takes into account label interactions, performed better compared to the binary relevance method. This work offers a comparison of multilabel methods and shows the potential of the classifier chain multilabel method for improved biological property predictions for use in drug discovery and development.
Collapse
Affiliation(s)
- Danielle Newby
- Medway School of Pharmacy, Universities of Kent and Greenwich , Chatham, Kent, ME4 4TB, U.K
| | | | | |
Collapse
|
21
|
Gozalbes R, Mosulén S, Ortí L, Rodríguez-Díaz J, Carbajo RJ, Melnyk P, Pineda-Lucena A. Hit identification of novel heparanase inhibitors by structure- and ligand-based approaches. Bioorg Med Chem 2013; 21:1944-51. [PMID: 23415087 DOI: 10.1016/j.bmc.2013.01.033] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2012] [Revised: 01/09/2013] [Accepted: 01/14/2013] [Indexed: 10/27/2022]
Abstract
Heparanase is a key enzyme involved in the dissemination of metastatic cancer cells. In this study a combination of in silico techniques and experimental methods was used to identify new potential inhibitors against this target. A 3D model of heparanase was built from sequence homology and applied to the virtual screening of a library composed of 27 known heparanase inhibitors and a commercial collection of drugs and drug-like compounds. The docking results from this campaign were combined with those obtained from a pharmacophore model recently published based in the same set of chemicals. Compounds were then ranked according to their theoretical binding affinity, and the top-rated commercial drugs were selected for further experimental evaluation. Biophysical methods (NMR and SPR) were applied to assess experimentally the interaction of the selected compounds with heparanase. The binding site was evaluated via competition experiments, using a known inhibitor of heparanase. Three of the selected drugs were found to bind to the active site of the protein and their KD values were determined. Among them, the antimalarial drug amodiaquine presented affinity towards the protein in the low-micromolar range, and was singled out for a SAR study based on its chemical scaffold. A subset of fourteen 4-arylaminoquinolines from a global set of 249 analogues of amodiaquine was selected based on the application of in silico models, a QSAR solubility prediction model and a chemical diversity analysis. Some of these compounds displayed binding affinities in the micromolar range.
Collapse
Affiliation(s)
- Rafael Gozalbes
- Structural Biochemistry Laboratory, Centro de Investigación Príncipe Felipe, 46012 Valencia, Spain.
| | | | | | | | | | | | | |
Collapse
|
22
|
Zhou W, Dai Z, Chen Y, Wang H, Yuan Z. High-Dimensional descriptor selection and computational QSAR modeling for antitumor activity of ARC-111 analogues Based on Support Vector Regression (SVR). Int J Mol Sci 2012; 13:1161-1172. [PMID: 22312310 PMCID: PMC3269744 DOI: 10.3390/ijms13011161] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2011] [Revised: 01/09/2012] [Accepted: 01/17/2012] [Indexed: 12/02/2022] Open
Abstract
To design ARC-111 analogues with improved efficiency, we constructed the QSAR of 22 ARC-111 analogues with RPMI8402 tumor cells. First, the optimized support vector regression (SVR) model based on the literature descriptors and the worst descriptor elimination multi-roundly (WDEM) method had similar generalization as the artificial neural network (ANN) model for the test set. Secondly, seven and 11 more effective descriptors out of 2,923 features were selected by the high-dimensional descriptor selection nonlinearly (HDSN) and WDEM method, and the SVR models (SVR3 and SVR4) with these selected descriptors resulted in better evaluation measures and a more precise predictive power for the test set. The interpretability system of better SVR models was further established. Our analysis offers some useful parameters for designing ARC-111 analogues with enhanced antitumor activity.
Collapse
Affiliation(s)
- Wei Zhou
- Hunan Provincial Key Laboratory of Crop Germplasm Innovation and Utilization, Changsha 410128, China; E-Mails: (W.Z.); (Z.D.); (Y.C.)
- Hunan Provincial Key Laboratory for Biology and Control of Plant Diseases and Insect Pests, College of Bio-Safety Science & Technology, Hunan Agricultural University, Changsha 410128, China
| | - Zhijun Dai
- Hunan Provincial Key Laboratory of Crop Germplasm Innovation and Utilization, Changsha 410128, China; E-Mails: (W.Z.); (Z.D.); (Y.C.)
| | - Yuan Chen
- Hunan Provincial Key Laboratory of Crop Germplasm Innovation and Utilization, Changsha 410128, China; E-Mails: (W.Z.); (Z.D.); (Y.C.)
| | - Haiyan Wang
- Department of Statistics, Kansas State University, Manhattan, KS 66506, USA; E-Mail:
| | - Zheming Yuan
- Hunan Provincial Key Laboratory of Crop Germplasm Innovation and Utilization, Changsha 410128, China; E-Mails: (W.Z.); (Z.D.); (Y.C.)
- Hunan Provincial Key Laboratory for Biology and Control of Plant Diseases and Insect Pests, College of Bio-Safety Science & Technology, Hunan Agricultural University, Changsha 410128, China
| |
Collapse
|
23
|
Krasowski MD, Hopfinger AJ. The discovery of new anesthetics by targeting GABAAreceptors. Expert Opin Drug Discov 2011; 6:1187-201. [DOI: 10.1517/17460441.2011.627324] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
|
24
|
QSAR-based permeability model for drug-like compounds. Bioorg Med Chem 2011; 19:2615-24. [DOI: 10.1016/j.bmc.2011.03.011] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2010] [Revised: 03/04/2011] [Accepted: 03/05/2011] [Indexed: 11/23/2022]
|