1
|
Saravanan KM, Wan JF, Dai L, Zhang J, Zhang JZH, Zhang H. A deep learning based multi-model approach for predicting drug-like chemical compound's toxicity. Methods 2024; 226:164-175. [PMID: 38702021 DOI: 10.1016/j.ymeth.2024.04.020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2023] [Revised: 04/01/2024] [Accepted: 04/28/2024] [Indexed: 05/06/2024] Open
Abstract
Ensuring the safety and efficacy of chemical compounds is crucial in small-molecule drug development. In the later stages of drug development, toxic compounds pose a significant challenge, losing valuable resources and time. Early and accurate prediction of compound toxicity using deep learning models offers a promising solution to mitigate these risks during drug discovery. In this study, we present the development of several deep-learning models aimed at evaluating different types of compound toxicity, including acute toxicity, carcinogenicity, hERG_cardiotoxicity (the human ether-a-go-go related gene caused cardiotoxicity), hepatotoxicity, and mutagenicity. To address the inherent variations in data size, label type, and distribution across different types of toxicity, we employed diverse training strategies. Our first approach involved utilizing a graph convolutional network (GCN) regression model to predict acute toxicity, which achieved notable performance with Pearson R 0.76, 0.74, and 0.65 for intraperitoneal, intravenous, and oral administration routes, respectively. Furthermore, we trained multiple GCN binary classification models, each tailored to a specific type of toxicity. These models exhibited high area under the curve (AUC) scores, with an impressive AUC of 0.69, 0.77, 0.88, and 0.79 for predicting carcinogenicity, hERG_cardiotoxicity, mutagenicity, and hepatotoxicity, respectively. Additionally, we have used the approved drug dataset to determine the appropriate threshold value for the prediction score in model usage. We integrated these models into a virtual screening pipeline to assess their effectiveness in identifying potential low-toxicity drug candidates. Our findings indicate that this deep learning approach has the potential to significantly reduce the cost and risk associated with drug development by expediting the selection of compounds with low toxicity profiles. Therefore, the models developed in this study hold promise as critical tools for early drug candidate screening and selection.
Collapse
Affiliation(s)
- Konda Mani Saravanan
- Department of Biotechnology, Bharath Institute of Higher Education and Research, Chennai 600073, Tamil Nadu, India
| | - Jiang-Fan Wan
- Guangdong-Hong Kong-Macao Greater Bay Area Center for Drug Evaluation and Inspection of NMPA, Shenzhen 518000, China
| | - Liujiang Dai
- Guangdong Immune Cell Therapy Engineering and Technology Research Center, Center for Protein and Cell-Based Drugs, Institute of Biomedicine and Biotechnology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China
| | - Jiajun Zhang
- Faculty of Synthetic Biology and Institute of Synthetic Biology, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China; College of Science, Hunan University of Technology and Business, Changsha 410205, China
| | - John Z H Zhang
- Faculty of Synthetic Biology and Institute of Synthetic Biology, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China
| | - Haiping Zhang
- Faculty of Synthetic Biology and Institute of Synthetic Biology, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China.
| |
Collapse
|
2
|
Bo T, Lin Y, Han J, Hao Z, Liu J. Machine learning-assisted data filtering and QSAR models for prediction of chemical acute toxicity on rat and mouse. JOURNAL OF HAZARDOUS MATERIALS 2023; 452:131344. [PMID: 37027914 DOI: 10.1016/j.jhazmat.2023.131344] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/07/2022] [Revised: 03/20/2023] [Accepted: 03/31/2023] [Indexed: 05/03/2023]
Abstract
Machine learning (ML) methods provide a new opportunity to build quantitative structure-activity relationship (QSAR) models for predicting chemicals' toxicity based on large toxicity data sets, but they are limited in insufficient model robustness due to poor data set quality for chemicals with certain structures. To address this issue and improve model robustness, we built a large data set on rat oral acute toxicity for thousands of chemicals, then used ML to filter chemicals favorable for regression models (CFRM). In comparison to chemicals not favorable for regression models (CNRM), CFRM accounted for 67% of chemicals in the original data set, and had a higher structural similarity and a smaller toxicity distribution in 2-4 log10 (mg/kg). The performance of established regression models for CFRM was greatly improved, with root-mean-square deviations (RMSE) in the range of 0.45-0.48 log10 (mg/kg). Classification models were built for CNRM using all chemicals in the original data set, and the area under receiver operating characteristic (AUROC) reached 0.75-0.76. The proposed strategy was successfully applied to a mouse oral acute data set, yielding RMSE and AUROC in the range of 0.36-0.38 log10 (mg/kg) and 0.79, respectively.
Collapse
Affiliation(s)
- Tao Bo
- School of Environment, Hangzhou Institute for Advanced Study, UCAS, Hangzhou 310024, China; State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, P.O. Box 2871, Beijing 100085, China
| | - Yaohui Lin
- State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, P.O. Box 2871, Beijing 100085, China; Key Laboratory for Analytical Science of Food Safety and Biology of MOE, Fujian Provincial Key Lab of Analysis and Detection for Food Safety, College of Chemistry, Fuzhou University, Fuzhou, Fujian 350116, China
| | - Jinglong Han
- State Key Laboratory of Urban Water Resource and Environment, Harbin Institute of Technology Shenzhen, Shenzhen 518055, China
| | - Zhineng Hao
- State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, P.O. Box 2871, Beijing 100085, China.
| | - Jingfu Liu
- School of Environment, Hangzhou Institute for Advanced Study, UCAS, Hangzhou 310024, China; State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, P.O. Box 2871, Beijing 100085, China.
| |
Collapse
|
3
|
Tran TTV, Surya Wibowo A, Tayara H, Chong KT. Artificial Intelligence in Drug Toxicity Prediction: Recent Advances, Challenges, and Future Perspectives. J Chem Inf Model 2023; 63:2628-2643. [PMID: 37125780 DOI: 10.1021/acs.jcim.3c00200] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/02/2023]
Abstract
Toxicity prediction is a critical step in the drug discovery process that helps identify and prioritize compounds with the greatest potential for safe and effective use in humans, while also reducing the risk of costly late-stage failures. It is estimated that over 30% of drug candidates are discarded owing to toxicity. Recently, artificial intelligence (AI) has been used to improve drug toxicity prediction as it provides more accurate and efficient methods for identifying the potentially toxic effects of new compounds before they are tested in human clinical trials, thus saving time and money. In this review, we present an overview of recent advances in AI-based drug toxicity prediction, including the use of various machine learning algorithms and deep learning architectures, of six major toxicity properties and Tox21 assay end points. Additionally, we provide a list of public data sources and useful toxicity prediction tools for the research community and highlight the challenges that must be addressed to enhance model performance. Finally, we discuss future perspectives for AI-based drug toxicity prediction. This review can aid researchers in understanding toxicity prediction and pave the way for new methods of drug discovery.
Collapse
Affiliation(s)
- Thi Tuyet Van Tran
- Department of Electronics and Information Engineering, Jeonbuk National University, Jeonju 54896, Republic of Korea
- Faculty of Information Technology, An Giang University, Long Xuyen 880000, Vietnam
- Vietnam National University - Ho Chi Minh City, Ho Chi Minh 700000, Vietnam
| | - Agung Surya Wibowo
- Department of Electronics and Information Engineering, Jeonbuk National University, Jeonju 54896, Republic of Korea
- Department of Electrical Engineering, Telkom University, Bandung 40257, Indonesia
| | - Hilal Tayara
- School of International Engineering and Science, Jeonbuk National University, Jeonju 54896, Republic of Korea
| | - Kil To Chong
- Advances Electronics and Information Research Center, Jeonbuk National University, Jeonju 54896, Republic of Korea
| |
Collapse
|
4
|
Zaki MEA, Al-Hussain SA, Al-Mutairi AA, Samad A, Ghosh A, Chaudhari S, Khatale PN, Ajmire P, Jawarkar RD. In-silico studies to recognize repurposing therapeutics toward arginase-I inhibitors as a potential onco-immunomodulators. Front Pharmacol 2023; 14:1129997. [PMID: 37144217 PMCID: PMC10151555 DOI: 10.3389/fphar.2023.1129997] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2022] [Accepted: 02/27/2023] [Indexed: 05/06/2023] Open
Abstract
Rudolf Virchow was the first person to point out the important link between immune function and cancer. He did this by noticing that leukocytes were often found in tumors. Overexpression of arginase 1 (ARG1) and inducible nitric oxide synthase (iNOS) in myeloid-derived suppressor cells (MDSCs) and tumour-associated macrophages (TAMs) depletes both intracellular and extracellular arginine. TCR signalling is slowed as a result, and the same types of cells produce reactive oxygen and nitrogen species (ROS and RNS), which aggravates the situation. Human arginase I is a double-stranded manganese metalloenzyme that helps L-arginine break down into L-ornithine and urea. Thus, a quantitative structure-activity relationship (QSAR) analysis was performed to unearth the unrecognised structural aspects crucial for arginase-I inhibition. In this work, a balanced QSAR model with good prediction performance and clear mechanistic interpretation was developed using a dataset of 149 molecules encompassing a broad range of structural scaffolds and compositions. The model was made to meet OECD standards, and all of its validation parameters have values that are higher than the minimum requirements (R2 tr = 0.89, Q2 LMO = 0.86, and R2 ex = 0.85). The present QSAR study linked structural factors to arginase-I inhibitory action, including the proximity of lipophilic atoms to the molecule's centre of mass (within 3A), the position of the donor to the ring nitrogen (exactly 3 bonds away), and the surface area ratio. As OAT-1746 and two others are the only arginase-I inhibitors in development at the time, we have performed a QSAR-based virtual screening with 1650 FDA compounds taken from the zinc database. In this screening, 112 potential hit compounds were found to have a PIC50 value of less than 10 nm against the arginase-I receptor. The created QSAR model's application domain was evaluated in relation to the most active hit molecules identified using QSAR-based virtual screening, utilising a training set of 149 compounds and a prediction set of 112 hit molecules. As shown in the Williams plot, the top hit molecule, ZINC000252286875, has a low leverage value of HAT i/i h* = 0.140, placing it towards the boundary of the usable range. Furthermore, one of 112 hit molecules with a docking score of -10.891 kcal/mol (PIC50 = 10.023 M) was isolated from a study of arginase-I using molecular docking. Protonated ZINC000252286875-linked arginase-1 showed 2.9 RMSD, whereas non-protonated had 1.8. RMSD plots illustrate protein stability in protonated and non-protonated ZINC000252286875-bound states. Protonated-ZINC000252286875-bound proteins contain 25 Rg. The non-protonated protein-ligand combination exhibits a 25.2-Rg, indicating compactness. Protonated and non-protonated ZINC000252286875 stabilised protein targets in binding cavities posthumously. Significant root mean square fluctuations (RMSF) were seen in the arginase-1 protein at a small number of residues for a time function of 500 ns in both the protonated and unprotonated states. Protonated and non-protonated ligands interacted with proteins throughout the simulation. ZINC000252286875 bound Lys64, Asp124, Ala171, Arg222, Asp232, and Gly250. Aspartic acid residue 232 exhibited 200% ionic contact. 500-ns simulations-maintained ions. Salt bridges for ZINC000252286875 aided docking. ZINC000252286875 created six ionic bonds with Lys68, Asp117, His126, Ala171, Lys224, and Asp232 residues. Asp117, His126, and Lys224 showed 200% ionic interactions. In protonated and deprotonated states, GbindvdW, GbindLipo, and GbindCoulomb energies played crucial role. Moreover, ZINC000252286875 meets all of the ADMET standards to serve as a drug. As a result, the current analyses were successful in locating a novel and potent hit molecule that inhibits arginase-I effectively at nanomolar concentrations. The results of this investigation can be used to develop brand-new arginase I inhibitors as an alternative immune-modulating cancer therapy.
Collapse
Affiliation(s)
- Magdi E. A. Zaki
- Department of Chemistry, Faculty of Science, Imam Mohammad Ibn Saud Islamic University, Riyadh, Saudi Arabia
- *Correspondence: Magdi E. A. Zaki, ; Rahul D. Jawarkar,
| | - Sami A. Al-Hussain
- Department of Chemistry, Faculty of Science, Imam Mohammad Ibn Saud Islamic University, Riyadh, Saudi Arabia
| | - Aamal A. Al-Mutairi
- Department of Chemistry, Faculty of Science, Imam Mohammad Ibn Saud Islamic University, Riyadh, Saudi Arabia
| | - Abdul Samad
- Department of Pharmaceutical Chemistry, Faculty of Pharmacy, Tishk International University, Erbil, Kurdistan Region, Iraq
| | - Arabinda Ghosh
- Microbiology Division, Department of Botany, Gauhati University, Guwahati, India
| | - Somdatta Chaudhari
- Department of Pharmaceutical Chemistry, Progressive Education Society’s Modern College of Pharmacy, Pune, India
| | - Pravin N. Khatale
- Department of Medicinal Chemistry, Dr Rajendra Gode Institute of Pharmacy, Amravati, Maharashtra, India
| | - Prashant Ajmire
- Department of Medicinal Chemistry, Dr Rajendra Gode Institute of Pharmacy, Amravati, Maharashtra, India
| | - Rahul D. Jawarkar
- Department of Medicinal Chemistry, Dr Rajendra Gode Institute of Pharmacy, Amravati, Maharashtra, India
- *Correspondence: Magdi E. A. Zaki, ; Rahul D. Jawarkar,
| |
Collapse
|
5
|
Zwickl CM, Graham J, Jolly R, Bassan A, Ahlberg E, Amberg A, Anger LT, Barton-Maclaren T, Beilke L, Bellion P, Brigo A, Cronin MT, Custer L, Devlin A, Burleigh-Flayers H, Fish T, Glover K, Glowienke S, Gromek K, Jones D, Karmaus A, Kemper R, Piparo EL, Madia F, Martin M, Masuda-Herrera M, McAtee B, Mestre J, Milchak L, Moudgal C, Mumtaz M, Muster W, Neilson L, Patlewicz G, Paulino A, Roncaglioni A, Ruiz P, Suarez D, Szabo DT, Valentin JP, Vardakou I, Woolley D, Myatt G. Principles and Procedures for Assessment of Acute Toxicity Incorporating In Silico Methods. COMPUTATIONAL TOXICOLOGY (AMSTERDAM, NETHERLANDS) 2022; 24:100237. [PMID: 36818760 PMCID: PMC9934006 DOI: 10.1016/j.comtox.2022.100237] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Acute toxicity in silico models are being used to support an increasing number of application areas including (1) product research and development, (2) product approval and registration as well as (3) the transport, storage and handling of chemicals. The adoption of such models is being hindered, in part, because of a lack of guidance describing how to perform and document an in silico analysis. To address this issue, a framework for an acute toxicity hazard assessment is proposed. This framework combines results from different sources including in silico methods and in vitro or in vivo experiments. In silico methods that can assist the prediction of in vivo outcomes (i.e., LD50) are analyzed concluding that predictions obtained using in silico approaches are now well-suited for reliably supporting assessment of LD50-based acute toxicity for the purpose of GHS classification. A general overview is provided of the endpoints from in vitro studies commonly evaluated for predicting acute toxicity (e.g., cytotoxicity/cytolethality as well as assays targeting specific mechanisms). The increased understanding of pathways and key triggering mechanisms underlying toxicity and the increased availability of in vitro data allow for a shift away from assessments solely based on endpoints such as LD50, to mechanism-based endpoints that can be accurately assessed in vitro or by using in silico prediction models. This paper also highlights the importance of an expert review of all available information using weight-of-evidence considerations and illustrates, using a series of diverse practical use cases, how in silico approaches support the assessment of acute toxicity.
Collapse
Affiliation(s)
| | - Jessica Graham
- Genentech, Inc., 1 DNA Way, South San Francisco, CA 94080, USA
| | - Robert Jolly
- Eli Lilly and Company, Indianapolis, IN 46285, USA
| | - Arianna Bassan
- Innovatune srl, Via Giulio Zanon 130/D, 35129 Padova, Italy
| | - Ernst Ahlberg
- Universal Prediction AB, Gothenburg, Sweden
- Department of Pharmaceutical Biosciences, Uppsala University, Uppsala, Sweden
| | - Alexander Amberg
- Sanofi, R&D Preclinical Safety Frankfurt, Industriepark Hoechst, D-65926 Frankfurt am Main, Germany
| | | | - Tara Barton-Maclaren
- Healthy Environments and Consumer Safety Branch, Health Canada / Government of Canada
| | - Lisa Beilke
- Toxicology Solutions, Inc., 10531 4S Commons Dr. #594, San Diego, CA 92127, USA
| | - Phillip Bellion
- Boehringer Ingelheim Animal Health, Binger Str. 128, 55216 Ingelheim am Rhein, Germany
| | - Alessandro Brigo
- Roche Pharmaceutical Research & Early Development, Roche Innovation Center Basel, Grenzacherstrasse 124, 4070, Basel, Switzerland
| | | | | | - Amy Devlin
- FDA Center for Drug Evaluation and Research, Silver Spring, MD 20993, USA
| | | | - Trevor Fish
- Nelson Laboratories, Salt Lake City, Utah, USA
| | | | | | | | - David Jones
- MHRA, 10 South Colonnade, Canary Wharf, London E14 4PU
| | - Agnes Karmaus
- Integrated Laboratory Systems, LLC, Morrisville, NC, USA
| | | | - Elena Lo Piparo
- Chemical Food Safety Group, Nestlé Research, Lausanne, Switzerland
| | - Federica Madia
- European Commission, Joint Research Centre (JRC), Ispra, Italy
| | | | | | | | - Jordi Mestre
- IMIM Institut Hospital Del Mar d’Investigacions Mèdiques and Universitat Pompeu Fabra, Doctor Aiguader 88, Parc de Recerca Biomèdica, 08003 Barcelona, Spain
- Chemotargets SL, Baldiri Reixac 4, Parc Científic de Barcelona, 08028 Barcelona, Spain
| | | | | | - Moiz Mumtaz
- Office of the Associate Director for Science, Agency for Toxic Substances and Disease Registry, Centers for Disease Control and Prevention, Atlanta, GA 30333, USA
| | - Wolfgang Muster
- Roche Pharmaceutical Research & Early Development, Roche Innovation Center Basel, Grenzacherstrasse 124, 4070, Basel, Switzerland
| | | | - Grace Patlewicz
- Centre for Computational Toxicology and Exposure (CCTE), US Environmental Protection Agency, Research Triangle Park, NC, USA
| | | | - Alessandra Roncaglioni
- Laboratory of Environmental Chemistry and Toxicology, Department of Environmental Health Sciences, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Milan, Italy
| | - Patricia Ruiz
- Centers for Disease Control and Prevention (CDC), Atlanta, GA 30341, USA
| | - Diana Suarez
- FSTox Consulting LTD, 2 Brooks Road Raunds Wellingborough NN9 6NS
| | | | - Jean-Pierre Valentin
- UCB-Biopharma SRL, Development Science, Avenue de l’industrie, Braine l’Alleud, Wallonia, Belgium
| | - Ioanna Vardakou
- British American Tobacco (Investments) Ltd., R&D Centre, Southampton, Hampshire SO15 8TL, UK
| | | | - Glenn Myatt
- Instem, 1393 Dublin Rd, Columbus, OH 43215, USA
| |
Collapse
|
6
|
Rezaie H, Asadollahi-Baboli M, Hassaninejad-Darzi SK. Hybrid consensus and k-nearest neighbours (kNN) strategies to classify dual BRD4/PLK1 inhibitors. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2022; 33:779-792. [PMID: 36330747 DOI: 10.1080/1062936x.2022.2139292] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/12/2022] [Accepted: 10/17/2022] [Indexed: 06/16/2023]
Abstract
A novel decision-making procedure is proposed here for the first time to identify active/inactive and selective/non-selective dual inhibitors using consensus approaches and pools of k-nearest neighbours (kNN) classifications instead of individual models. Dual BRD4/PLK1 inhibition with adequate selectivity is a potential therapeutic strategy for targeting tumour cells in high-risk patients. We report the unique way to identify both active and selective dual BRD4/PLK1 inhibitors using consensus and kNN strategies together with two sources of receptor-based and ligand-based information which are the ranked binding energies of residues and important molecular features, respectively. The results of consensus approaches were compared with the results of individual kNN models. The chemical space similarity was measured using three different distance functions to increase the reliability. All activity and selectivity classification models were validated using cross-validation and y-randomization tests. The outcomes show that consensus approaches can increase the reliability and accuracy of active/inactive or selective/non-selective detections up to 90%. Consensus approaches also reached more balanced values of sensitivity and specificity compared to the individual kNN models because of the compensation in the integration of diverse sources of information.
Collapse
Affiliation(s)
- H Rezaie
- Department of Chemistry, Faculty of Science, Babol Noshirvani University of Technology, Babol, Iran
| | - M Asadollahi-Baboli
- Department of Chemistry, Faculty of Science, Babol Noshirvani University of Technology, Babol, Iran
| | - S K Hassaninejad-Darzi
- Department of Chemistry, Faculty of Science, Babol Noshirvani University of Technology, Babol, Iran
| |
Collapse
|
7
|
Kong Y, Zhao X, Liu R, Yang Z, Yin H, Zhao B, Wang J, Qin B, Yan A. Integrating concept of pharmacophore with graph neural networks for chemical property prediction and interpretation. J Cheminform 2022; 14:52. [PMID: 35927691 PMCID: PMC9351086 DOI: 10.1186/s13321-022-00634-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2022] [Accepted: 07/16/2022] [Indexed: 11/10/2022] Open
Abstract
Recently, graph neural networks (GNNs) have revolutionized the field of chemical property prediction and achieved state-of-the-art results on benchmark data sets. Compared with the traditional descriptor- and fingerprint-based QSAR models, GNNs can learn task related representations, which completely gets rid of the rules defined by experts. However, due to the lack of useful prior knowledge, the prediction performance and interpretability of the GNNs may be affected. In this study, we introduced a new GNN model called RG-MPNN for chemical property prediction that integrated pharmacophore information hierarchically into message-passing neural network (MPNN) architecture, specifically, in the way of pharmacophore-based reduced-graph (RG) pooling. RG-MPNN absorbed not only the information of atoms and bonds from the atom-level message-passing phase, but also the information of pharmacophores from the RG-level message-passing phase. Our experimental results on eleven benchmark and ten kinase data sets showed that our model consistently matched or outperformed other existing GNN models. Furthermore, we demonstrated that applying pharmacophore-based RG pooling to MPNN architecture can generally help GNN models improve the predictive power. The cluster analysis of RG-MPNN representations and the importance analysis of pharmacophore nodes will help chemists gain insights for hit discovery and lead optimization.
Collapse
Affiliation(s)
- Yue Kong
- State Key Laboratory of Chemical Resource Engineering, Department of Pharmaceutical Engineering, Beijing University of Chemical Technology, 15 BeiSanHuan East Road, P. O. Box 53, Beijing, 100029, People's Republic of China.,Hyper-Dimension Insight Pharmaceuticals Ltd. Room 511, Block A, No. 2C, DongSanHuan North Road, Beijing, People's Republic of China
| | - Xiaoman Zhao
- State Key Laboratory of Chemical Resource Engineering, Department of Pharmaceutical Engineering, Beijing University of Chemical Technology, 15 BeiSanHuan East Road, P. O. Box 53, Beijing, 100029, People's Republic of China
| | - Ruizi Liu
- State Key Laboratory of Chemical Resource Engineering, Department of Pharmaceutical Engineering, Beijing University of Chemical Technology, 15 BeiSanHuan East Road, P. O. Box 53, Beijing, 100029, People's Republic of China
| | - Zhenwu Yang
- State Key Laboratory of Chemical Resource Engineering, Department of Pharmaceutical Engineering, Beijing University of Chemical Technology, 15 BeiSanHuan East Road, P. O. Box 53, Beijing, 100029, People's Republic of China
| | - Hongyan Yin
- State Key Laboratory of Chemical Resource Engineering, Department of Pharmaceutical Engineering, Beijing University of Chemical Technology, 15 BeiSanHuan East Road, P. O. Box 53, Beijing, 100029, People's Republic of China.,Hyper-Dimension Insight Pharmaceuticals Ltd. Room 511, Block A, No. 2C, DongSanHuan North Road, Beijing, People's Republic of China
| | - Bowen Zhao
- Hyper-Dimension Insight Pharmaceuticals Ltd. Room 511, Block A, No. 2C, DongSanHuan North Road, Beijing, People's Republic of China
| | - Jinling Wang
- Hyper-Dimension Insight Pharmaceuticals Ltd. Room 511, Block A, No. 2C, DongSanHuan North Road, Beijing, People's Republic of China
| | - Bingjie Qin
- Hyper-Dimension Insight Pharmaceuticals Ltd. Room 511, Block A, No. 2C, DongSanHuan North Road, Beijing, People's Republic of China
| | - Aixia Yan
- State Key Laboratory of Chemical Resource Engineering, Department of Pharmaceutical Engineering, Beijing University of Chemical Technology, 15 BeiSanHuan East Road, P. O. Box 53, Beijing, 100029, People's Republic of China.
| |
Collapse
|
8
|
Edwards SW, Nelms M, Hench VK, Ponder J, Sullivan K. Mapping Mechanistic Pathways of Acute Oral Systemic Toxicity Using Chemical Structure and Bioactivity Measurements. FRONTIERS IN TOXICOLOGY 2022; 4:824094. [PMID: 35295211 PMCID: PMC8915918 DOI: 10.3389/ftox.2022.824094] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2021] [Accepted: 01/31/2022] [Indexed: 12/16/2022] Open
Abstract
Regulatory agencies around the world have committed to reducing or eliminating animal testing for establishing chemical safety. Adverse outcome pathways can facilitate replacement by providing a mechanistic framework for identifying the appropriate non-animal methods and connecting them to apical adverse outcomes. This study separated 11,992 chemicals with curated rat oral acute toxicity information into clusters of structurally similar compounds. Each cluster was then assigned one or more ToxCast/Tox21 assays by looking for the minimum number of assays required to record at least one positive hit call below cytotoxicity for all acutely toxic chemicals in the cluster. When structural information is used to select assays for testing, none of the chemicals required more than four assays and 98% required two assays or less. Both the structure-based clusters and activity from the associated assays were significantly associated with the GHS toxicity classification of the chemicals, which suggests that a combination of bioactivity and structural information could be as reproducible as traditional in vivo studies. Predictivity is improved when the in vitro assay directly corresponds to the mechanism of toxicity, but many indirect assays showed promise as well. Given the lower cost of in vitro testing, a small assay battery including both general cytotoxicity assays and two or more orthogonal assays targeting the toxicological mechanism could be used to improve performance further. This approach illustrates the promise of combining existing in silico approaches, such as the Collaborative Acute Toxicity Modeling Suite (CATMoS), with structure-based bioactivity information as part of an efficient tiered testing strategy that can reduce or eliminate animal testing for acute oral toxicity.
Collapse
Affiliation(s)
- Stephen W. Edwards
- GenOmics, Bioinformatics, and Translational Research Center, RTI International, Research Triangle Park, Durham, NC, United States
- *Correspondence: Stephen W. Edwards, ; Kristie Sullivan,
| | - Mark Nelms
- GenOmics, Bioinformatics, and Translational Research Center, RTI International, Research Triangle Park, Durham, NC, United States
| | - Virginia K. Hench
- GenOmics, Bioinformatics, and Translational Research Center, RTI International, Research Triangle Park, Durham, NC, United States
| | - Jessica Ponder
- Physicians Committee for Responsible Medicine, Washington, DC, United States
| | - Kristie Sullivan
- Physicians Committee for Responsible Medicine, Washington, DC, United States
- *Correspondence: Stephen W. Edwards, ; Kristie Sullivan,
| |
Collapse
|
9
|
Abstract
In this chapter, we give a brief overview of the regulatory requirements for acute systemic toxicity information in the European Union, and we review structure-based computational models that are available and potentially useful in the assessment of acute systemic toxicity. Emphasis is placed on quantitative structure-activity relationship (QSAR) models implemented by means of a range of software tools. The most recently published literature models for acute systemic toxicity are also discussed, and perspectives for future developments in this field are offered.
Collapse
Affiliation(s)
- Ivanka Tsakovska
- Institute of Biophysics and Biomedical Engineering, Bulgarian Academy of Sciences, Sofia, Bulgaria.
| | - Antonia Diukendjieva
- Institute of Biophysics and Biomedical Engineering, Bulgarian Academy of Sciences, Sofia, Bulgaria
| | - Andrew P Worth
- European Commission, Joint Research Centre (JRC), Ispra, Italy
| |
Collapse
|
10
|
|
11
|
Rácz A, Bajusz D, Miranda-Quintana RA, Héberger K. Machine learning models for classification tasks related to drug safety. Mol Divers 2021; 25:1409-1424. [PMID: 34110577 PMCID: PMC8342376 DOI: 10.1007/s11030-021-10239-x] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2021] [Accepted: 05/27/2021] [Indexed: 12/23/2022]
Abstract
In this review, we outline the current trends in the field of machine learning-driven classification studies related to ADME (absorption, distribution, metabolism and excretion) and toxicity endpoints from the past six years (2015-2021). The study focuses only on classification models with large datasets (i.e. more than a thousand compounds). A comprehensive literature search and meta-analysis was carried out for nine different targets: hERG-mediated cardiotoxicity, blood-brain barrier penetration, permeability glycoprotein (P-gp) substrate/inhibitor, cytochrome P450 enzyme family, acute oral toxicity, mutagenicity, carcinogenicity, respiratory toxicity and irritation/corrosion. The comparison of the best classification models was targeted to reveal the differences between machine learning algorithms and modeling types, endpoint-specific performances, dataset sizes and the different validation protocols. Based on the evaluation of the data, we can say that tree-based algorithms are (still) dominating the field, with consensus modeling being an increasing trend in drug safety predictions. Although one can already find classification models with great performances to hERG-mediated cardiotoxicity and the isoenzymes of the cytochrome P450 enzyme family, these targets are still central to ADMET-related research efforts.
Collapse
Affiliation(s)
- Anita Rácz
- Plasma Chemistry Research Group, Research Centre for Natural Sciences, Magyar tudósok krt. 2, Budapest, 1117, Hungary.
| | - Dávid Bajusz
- Medicinal Chemistry Research Group, Research Centre for Natural Sciences, Magyar tudósok krt. 2, Budapest, 1117, Hungary
| | | | - Károly Héberger
- Plasma Chemistry Research Group, Research Centre for Natural Sciences, Magyar tudósok krt. 2, Budapest, 1117, Hungary.
| |
Collapse
|
12
|
Kunde PD, Ramkumar S, Kamble SP, Ravikumar A, Kulkarni BD, Kumar VR. On the use of electronegativity and electron affinity based pseudo-molecular field descriptors in developing correlations for quantitative structure-activity relationship modeling of drug activities. Chem Biol Drug Des 2021; 98:258-269. [PMID: 34013630 DOI: 10.1111/cbdd.13895] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2020] [Revised: 04/21/2021] [Accepted: 05/15/2021] [Indexed: 12/01/2022]
Abstract
For quantitative structure-activity relationship (QSAR) modeling in ligand-based drug discovery programs, pseudo-molecular field (PMF) descriptors using intrinsic atomic properties, namely, electronegativity and electron affinity are studied. In combination with partial least squares analysis and Procrustes transformation, these PMF descriptors were employed successfully to develop correlations that predict the activities of target protein inhibitors involved in various diseases (cancer, neurodegenerative disorders, HIV, and malaria). The results show that the present QSAR approach is competitive to existing QSAR models. In order to demonstrate the use of this algorithm, we present results of screening naturally occurring molecules with unknown bioactivities. The pIC50 predictions can screen molecules that have desirable activity before assessment by docking studies.
Collapse
Affiliation(s)
- Pushkar D Kunde
- Chemical Engineering and Process Development Division, CSIR-National Chemical Laboratory (CSIR-NCL), Pune, India.,Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, India
| | - Sudha Ramkumar
- Organic Chemistry Division, CSIR-National Chemical Laboratory (CSIR-NCL), Pune, India
| | - Sanjay P Kamble
- Chemical Engineering and Process Development Division, CSIR-National Chemical Laboratory (CSIR-NCL), Pune, India.,Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, India
| | - Ameeta Ravikumar
- Institute of Bioinformatics and Biotechnology (IBB), Savitribai Phule Pune University, Pune, India
| | - Bhaskar D Kulkarni
- Chemical Engineering and Process Development Division, CSIR-National Chemical Laboratory (CSIR-NCL), Pune, India.,Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, India
| | - V Ravi Kumar
- Chemical Engineering and Process Development Division, CSIR-National Chemical Laboratory (CSIR-NCL), Pune, India.,Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, India
| |
Collapse
|
13
|
Mansouri K, Karmaus AL, Fitzpatrick J, Patlewicz G, Pradeep P, Alberga D, Alepee N, Allen TE, Allen D, Alves VM, Andrade CH, Auernhammer TR, Ballabio D, Bell S, Benfenati E, Bhattacharya S, Bastos JV, Boyd S, Brown J, Capuzzi SJ, Chushak Y, Ciallella H, Clark AM, Consonni V, Daga PR, Ekins S, Farag S, Fedorov M, Fourches D, Gadaleta D, Gao F, Gearhart JM, Goh G, Goodman JM, Grisoni F, Grulke CM, Hartung T, Hirn M, Karpov P, Korotcov A, Lavado GJ, Lawless M, Li X, Luechtefeld T, Lunghini F, Mangiatordi GF, Marcou G, Marsh D, Martin T, Mauri A, Muratov EN, Myatt GJ, Nguyen DT, Nicolotti O, Note R, Pande P, Parks AK, Peryea T, Polash AH, Rallo R, Roncaglioni A, Rowlands C, Ruiz P, Russo DP, Sayed A, Sayre R, Sheils T, Siegel C, Silva AC, Simeonov A, Sosnin S, Southall N, Strickland J, Tang Y, Teppen B, Tetko IV, Thomas D, Tkachenko V, Todeschini R, Toma C, Tripodi I, Trisciuzzi D, Tropsha A, Varnek A, Vukovic K, Wang Z, Wang L, Waters KM, Wedlake AJ, Wijeyesakere SJ, Wilson D, Xiao Z, Yang H, Zahoranszky-Kohalmi G, Zakharov AV, Zhang FF, Zhang Z, Zhao T, Zhu H, Zorn KM, Casey W, Kleinstreuer NC. CATMoS: Collaborative Acute Toxicity Modeling Suite. ENVIRONMENTAL HEALTH PERSPECTIVES 2021; 129:47013. [PMID: 33929906 PMCID: PMC8086800 DOI: 10.1289/ehp8495] [Citation(s) in RCA: 51] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/02/2023]
Abstract
BACKGROUND Humans are exposed to tens of thousands of chemical substances that need to be assessed for their potential toxicity. Acute systemic toxicity testing serves as the basis for regulatory hazard classification, labeling, and risk management. However, it is cost- and time-prohibitive to evaluate all new and existing chemicals using traditional rodent acute toxicity tests. In silico models built using existing data facilitate rapid acute toxicity predictions without using animals. OBJECTIVES The U.S. Interagency Coordinating Committee on the Validation of Alternative Methods (ICCVAM) Acute Toxicity Workgroup organized an international collaboration to develop in silico models for predicting acute oral toxicity based on five different end points: Lethal Dose 50 (LD50 value, U.S. Environmental Protection Agency hazard (four) categories, Globally Harmonized System for Classification and Labeling hazard (five) categories, very toxic chemicals [LD50 (LD50≤50mg/kg)], and nontoxic chemicals (LD50>2,000mg/kg). METHODS An acute oral toxicity data inventory for 11,992 chemicals was compiled, split into training and evaluation sets, and made available to 35 participating international research groups that submitted a total of 139 predictive models. Predictions that fell within the applicability domains of the submitted models were evaluated using external validation sets. These were then combined into consensus models to leverage strengths of individual approaches. RESULTS The resulting consensus predictions, which leverage the collective strengths of each individual model, form the Collaborative Acute Toxicity Modeling Suite (CATMoS). CATMoS demonstrated high performance in terms of accuracy and robustness when compared with in vivo results. DISCUSSION CATMoS is being evaluated by regulatory agencies for its utility and applicability as a potential replacement for in vivo rat acute oral toxicity studies. CATMoS predictions for more than 800,000 chemicals have been made available via the National Toxicology Program's Integrated Chemical Environment tools and data sets (ice.ntp.niehs.nih.gov). The models are also implemented in a free, standalone, open-source tool, OPERA, which allows predictions of new and untested chemicals to be made. https://doi.org/10.1289/EHP8495.
Collapse
Affiliation(s)
- Kamel Mansouri
- Integrated Laboratory Systems, LLC, Morrisville, North Carolina, USA
- National Toxicology Program Interagency Center for the Evaluation of Alternative Toxicological Methods, Research Triangle Park, North Carolina, USA
| | - Agnes L. Karmaus
- Integrated Laboratory Systems, LLC, Morrisville, North Carolina, USA
| | | | - Grace Patlewicz
- Center for Computational Toxicology and Exposure, U.S. Environmental Protection Agency, Research Triangle Park, North Carolina, USA
| | - Prachi Pradeep
- Center for Computational Toxicology and Exposure, U.S. Environmental Protection Agency, Research Triangle Park, North Carolina, USA
- Oak Ridge Institute for Science and Education (ORISE) Research Participation Program, U.S. Environmental Protection Agency, Research Triangle Park, North Carolina, USA
| | - Domenico Alberga
- Dipartimento di Farmacia-Scienze del Farmaco, Università degli Studi di Bari “Aldo Moro”, Bari, Italy
| | | | - Timothy E.H. Allen
- Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Cambridge, UK
| | - Dave Allen
- Integrated Laboratory Systems, LLC, Morrisville, North Carolina, USA
| | - Vinicius M. Alves
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, North Carolina, USA
- Laboratory for Molecular Modeling and Design, Faculty of Pharmacy, Federal University of Goiás, Goiania, Brazil
| | - Carolina H. Andrade
- Laboratory for Molecular Modeling and Design, Faculty of Pharmacy, Federal University of Goiás, Goiania, Brazil
| | | | - Davide Ballabio
- Milano Chemometrics & QSAR Research Group, Department of Earth and Environmental Sciences, University of Milano-Bicocca, Milan, Italy
| | - Shannon Bell
- Integrated Laboratory Systems, LLC, Morrisville, North Carolina, USA
| | - Emilio Benfenati
- Laboratory of Environmental Chemistry and Toxicology, Department of Environmental Health Sciences, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Milan, Italy
| | - Sudin Bhattacharya
- Institute for Quantitative Health Science and Engineering, Department of Biomedical Engineering, Michigan State University, East Lansing, Michigan, USA
| | - Joyce V. Bastos
- Laboratory for Molecular Modeling and Design, Faculty of Pharmacy, Federal University of Goiás, Goiania, Brazil
| | - Stephen Boyd
- Department of Plant, Soil, and Microbial Sciences, Michigan State University, East Lansing, Michigan, USA
| | - J.B. Brown
- Kyoto University Graduate School of Medicine, Kyoto, Japan
| | - Stephen J. Capuzzi
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, North Carolina, USA
| | - Yaroslav Chushak
- Aeromedical Research Department, Force Health Protection, USAFSAM, Dayton, Ohio, USA
- Henry M Jackson Foundation for the Advancement of Military Medicine, Dayton, Ohio, USA
| | - Heather Ciallella
- Center for Computational and Integrative Biology, Rutgers University, Camden, New Jersey, USA
| | - Alex M. Clark
- Collaborations Pharmaceuticals, Inc., Raleigh, North Carolina, USA
| | - Viviana Consonni
- Milano Chemometrics & QSAR Research Group, Department of Earth and Environmental Sciences, University of Milano-Bicocca, Milan, Italy
| | | | - Sean Ekins
- Collaborations Pharmaceuticals, Inc., Raleigh, North Carolina, USA
| | - Sherif Farag
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, North Carolina, USA
| | - Maxim Fedorov
- Skoltech, Skolkovo Institute of Science and Technology, Moscow, Russia
| | - Denis Fourches
- Department of Chemistry, North Carolina State University, Raleigh, North Carolina, USA
- Bioinformatics Research Center, North Carolina State University, Raleigh, North Carolina, USA
| | - Domenico Gadaleta
- Laboratory of Environmental Chemistry and Toxicology, Department of Environmental Health Sciences, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Milan, Italy
| | - Feng Gao
- Department of Plant, Soil, and Microbial Sciences, Michigan State University, East Lansing, Michigan, USA
| | - Jeffery M. Gearhart
- Aeromedical Research Department, Force Health Protection, USAFSAM, Dayton, Ohio, USA
- Henry M Jackson Foundation for the Advancement of Military Medicine, Dayton, Ohio, USA
| | - Garett Goh
- Pacific Northwest National Laboratory, Richland, Washington, USA
| | - Jonathan M. Goodman
- Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Cambridge, UK
| | - Francesca Grisoni
- Milano Chemometrics & QSAR Research Group, Department of Earth and Environmental Sciences, University of Milano-Bicocca, Milan, Italy
| | - Christopher M. Grulke
- Center for Computational Toxicology and Exposure, U.S. Environmental Protection Agency, Research Triangle Park, North Carolina, USA
| | | | - Matthew Hirn
- Department of Computational Mathematics, Science & Engineering, Department of Mathematics, Michigan State University, East Lansing, Michigan, USA
| | - Pavel Karpov
- Institute of Structural Biology, Helmholtz Zentrum München (GmbH), Neuherberg, Germany
| | | | - Giovanna J. Lavado
- Laboratory of Environmental Chemistry and Toxicology, Department of Environmental Health Sciences, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Milan, Italy
| | | | - Xinhao Li
- Department of Chemistry, North Carolina State University, Raleigh, North Carolina, USA
| | | | - Filippo Lunghini
- Laboratoire de Chemoinformatique, URM7140, Université de Strasbourg, Strasbourg, France
| | - Giuseppe F. Mangiatordi
- Dipartimento di Farmacia-Scienze del Farmaco, Università degli Studi di Bari “Aldo Moro”, Bari, Italy
| | - Gilles Marcou
- Laboratoire de Chemoinformatique, URM7140, Université de Strasbourg, Strasbourg, France
| | - Dan Marsh
- Underwriters Laboratories, Northbrook, Illinois, USA
| | - Todd Martin
- Center for Computational Toxicology and Exposure, U.S. Environmental Protection Agency, Cincinnati, Ohio, USA
| | | | - Eugene N. Muratov
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, North Carolina, USA
- Laboratory for Molecular Modeling and Design, Faculty of Pharmacy, Federal University of Goiás, Goiania, Brazil
| | | | - Dac-Trung Nguyen
- National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, Maryland, USA
| | - Orazio Nicolotti
- Dipartimento di Farmacia-Scienze del Farmaco, Università degli Studi di Bari “Aldo Moro”, Bari, Italy
| | - Reine Note
- L’Oréal Research & Innovation, Aulnay-sous-Bois, France
| | - Paritosh Pande
- Pacific Northwest National Laboratory, Richland, Washington, USA
| | | | - Tyler Peryea
- National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, Maryland, USA
| | | | - Robert Rallo
- Pacific Northwest National Laboratory, Richland, Washington, USA
| | - Alessandra Roncaglioni
- Laboratory of Environmental Chemistry and Toxicology, Department of Environmental Health Sciences, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Milan, Italy
| | | | - Patricia Ruiz
- Office of Innovation and Analytics, Agency for Toxic Substances and Disease Registry, Centers for Disease Control and Prevention, Atlanta, Georgia, USA
| | - Daniel P. Russo
- Center for Computational and Integrative Biology, Rutgers University, Camden, New Jersey, USA
| | - Ahmed Sayed
- Rosettastein Consulting UG, Freising, Germany
| | - Risa Sayre
- Center for Computational Toxicology and Exposure, U.S. Environmental Protection Agency, Research Triangle Park, North Carolina, USA
- Oak Ridge Institute for Science and Education (ORISE) Research Participation Program, U.S. Environmental Protection Agency, Research Triangle Park, North Carolina, USA
| | - Timothy Sheils
- National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, Maryland, USA
| | - Charles Siegel
- Pacific Northwest National Laboratory, Richland, Washington, USA
| | - Arthur C. Silva
- Laboratory for Molecular Modeling and Design, Faculty of Pharmacy, Federal University of Goiás, Goiania, Brazil
| | - Anton Simeonov
- National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, Maryland, USA
| | - Sergey Sosnin
- Skoltech, Skolkovo Institute of Science and Technology, Moscow, Russia
| | - Noel Southall
- National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, Maryland, USA
| | - Judy Strickland
- Integrated Laboratory Systems, LLC, Morrisville, North Carolina, USA
| | - Yun Tang
- Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai, China
| | - Brian Teppen
- Department of Plant, Soil, and Microbial Sciences, Michigan State University, East Lansing, Michigan, USA
| | - Igor V. Tetko
- Institute of Structural Biology, Helmholtz Zentrum München (GmbH), Neuherberg, Germany
- BIGCHEM GmbH, Unterschleissheim, Germany
| | - Dennis Thomas
- Pacific Northwest National Laboratory, Richland, Washington, USA
| | | | - Roberto Todeschini
- Milano Chemometrics & QSAR Research Group, Department of Earth and Environmental Sciences, University of Milano-Bicocca, Milan, Italy
| | - Cosimo Toma
- Laboratory of Environmental Chemistry and Toxicology, Department of Environmental Health Sciences, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Milan, Italy
| | - Ignacio Tripodi
- Computer Science/Interdisciplinary Quantitative Biology, University of Colorado, Boulder, Colorado, USA
| | - Daniela Trisciuzzi
- Dipartimento di Farmacia-Scienze del Farmaco, Università degli Studi di Bari “Aldo Moro”, Bari, Italy
| | - Alexander Tropsha
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, North Carolina, USA
| | - Alexandre Varnek
- Laboratoire de Chemoinformatique, URM7140, Université de Strasbourg, Strasbourg, France
| | - Kristijan Vukovic
- Laboratory of Environmental Chemistry and Toxicology, Department of Environmental Health Sciences, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Milan, Italy
| | - Zhongyu Wang
- School of Environmental Sciences and Technology, Dalian University of Technology; Dalian, Liaoning, China
| | - Liguo Wang
- School of Environmental Sciences and Technology, Dalian University of Technology; Dalian, Liaoning, China
| | | | - Andrew J. Wedlake
- Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Cambridge, UK
| | | | - Dan Wilson
- The Dow Chemical Company, Midland, Michigan, USA
| | - Zijun Xiao
- School of Environmental Sciences and Technology, Dalian University of Technology; Dalian, Liaoning, China
| | - Hongbin Yang
- Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai, China
| | - Gergely Zahoranszky-Kohalmi
- National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, Maryland, USA
| | - Alexey V. Zakharov
- National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, Maryland, USA
| | | | - Zhen Zhang
- Dow Agrosciences, Indianapolis, Indiana, USA
| | - Tongan Zhao
- National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, Maryland, USA
| | - Hao Zhu
- Center for Computational and Integrative Biology, Rutgers University, Camden, New Jersey, USA
| | | | - Warren Casey
- National Toxicology Program Interagency Center for the Evaluation of Alternative Toxicological Methods, Research Triangle Park, North Carolina, USA
| | - Nicole C. Kleinstreuer
- National Toxicology Program Interagency Center for the Evaluation of Alternative Toxicological Methods, Research Triangle Park, North Carolina, USA
| |
Collapse
|
14
|
Feng H, Zhang L, Li S, Liu L, Yang T, Yang P, Zhao J, Arkin IT, Liu H. Predicting the reproductive toxicity of chemicals using ensemble learning methods and molecular fingerprints. Toxicol Lett 2021; 340:4-14. [PMID: 33421549 DOI: 10.1016/j.toxlet.2021.01.002] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2020] [Revised: 10/29/2020] [Accepted: 01/03/2021] [Indexed: 12/20/2022]
Abstract
Reproductive toxicity endpoints are a significant safety concern in the assessment of the adverse effects of chemicals in drug discovery. Computational models that can accurately predict a chemical's toxic potential are increasingly pursued to replace traditional animal experiments. Thus, ensemble learning models were built to predict the reproductive toxicity of compounds. Our ensemble models were developed using support vector machine, random forest, and extreme gradient boosting methods and 9 molecular fingerprints calculated for a dataset containing 1823 chemicals. The best prediction performance was achieved by the Ensemble-Top12 model, with an accuracy (ACC) of 86.33 %, a sensitivity (SEN) of 82.02 %, a specificity (SPE) of 90.19 %, and an area under the receiver operating characteristic curve (AUC) of 0.937 in 5-fold cross-validation and ACC, SEN, SPE, and AUC values of 84.38 %, 86.90 %, 90.67 %, and 0.920, respectively, in external validation. We also defined the applicability domain (AD) of the ensemble model by calculating the Tanimoto distance of the training set. Compared with models in existing literature, our ensemble model achieves relatively high ACC, SPE and AUC values. We also identified several fingerprint features related to chemical reproductive toxicity. Considering the performance of model, we recommend using the Ensemble-Top12 model to predict reproductive toxicity in early drug development.
Collapse
Affiliation(s)
- Huawei Feng
- School of Life Science, Liaoning University, Shenyang, 110036, China
| | - Li Zhang
- School of Life Science, Liaoning University, Shenyang, 110036, China; Technology Innovation Center for Computer Simulating and Information Processing of Bio-macromolecules of Shenyang, Shenyang, 110036, China; Engineering Laboratory for Molecular Simulation and Designing of Drug Molecules of Liaoning, Liaoning University, Shenyang, 110036, China
| | - Shimeng Li
- School of Life Science, Liaoning University, Shenyang, 110036, China
| | - Lili Liu
- School of Life Science, Liaoning University, Shenyang, 110036, China
| | - Tianzhou Yang
- School of Life Science, Liaoning University, Shenyang, 110036, China
| | - Pengyu Yang
- School of Information, Liaoning University, Shenyang, 110036, China
| | - Jian Zhao
- School of Life Science, Liaoning University, Shenyang, 110036, China
| | - Isaiah Tuvia Arkin
- Department of Biological Chemistry, The Hebrew University of Jerusalem, Edmond J. Safra Campus, Givat-Ram, Jerusalem, 91904, Israel
| | - Hongsheng Liu
- Technology Innovation Center for Computer Simulating and Information Processing of Bio-macromolecules of Shenyang, Shenyang, 110036, China; Engineering Laboratory for Molecular Simulation and Designing of Drug Molecules of Liaoning, Liaoning University, Shenyang, 110036, China; School of Pharmaceutical Science, Liaoning University, Shenyang, 110036, China.
| |
Collapse
|
15
|
Active learning efficiently converges on rational limits of toxicity prediction and identifies patterns for molecule design. ACTA ACUST UNITED AC 2020. [DOI: 10.1016/j.comtox.2020.100129] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
|
16
|
Keyvanpour MR, Shirzad MB. An Analysis of QSAR Research Based on Machine Learning Concepts. Curr Drug Discov Technol 2020; 18:17-30. [PMID: 32178612 DOI: 10.2174/1570163817666200316104404] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2019] [Revised: 08/22/2019] [Accepted: 10/28/2019] [Indexed: 11/22/2022]
Abstract
Quantitative Structure-Activity Relationship (QSAR) is a popular approach developed to correlate chemical molecules with their biological activities based on their chemical structures. Machine learning techniques have proved to be promising solutions to QSAR modeling. Due to the significant role of machine learning strategies in QSAR modeling, this area of research has attracted much attention from researchers. A considerable amount of literature has been published on machine learning based QSAR modeling methodologies whilst this domain still suffers from lack of a recent and comprehensive analysis of these algorithms. This study systematically reviews the application of machine learning algorithms in QSAR, aiming to provide an analytical framework. For this purpose, we present a framework called 'ML-QSAR'. This framework has been designed for future research to: a) facilitate the selection of proper strategies among existing algorithms according to the application area requirements, b) help to develop and ameliorate current methods and c) providing a platform to study existing methodologies comparatively. In ML-QSAR, first a structured categorization is depicted which studied the QSAR modeling research based on machine models. Then several criteria are introduced in order to assess the models. Finally, inspired by aforementioned criteria the qualitative analysis is carried out.
Collapse
Affiliation(s)
| | - Mehrnoush Barani Shirzad
- Data Mining Research Laboratory, Department of Computer Engineering, Alzahra University, Tehran, Iran
| |
Collapse
|
17
|
Valsecchi C, Grisoni F, Consonni V, Ballabio D. Consensus versus Individual QSARs in Classification: Comparison on a Large-Scale Case Study. J Chem Inf Model 2020; 60:1215-1223. [PMID: 32073844 PMCID: PMC7997107 DOI: 10.1021/acs.jcim.9b01057] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
Abstract
![]()
Consensus strategies have been widely
applied in many different
scientific fields, based on the assumption that the fusion of several
sources of information increases the outcome reliability. Despite
the widespread application of consensus approaches, their advantages
in quantitative structure–activity relationship (QSAR) modeling
have not been thoroughly evaluated, mainly due to the lack of appropriate
large-scale data sets. In this study, we evaluated the advantages
and drawbacks of consensus approaches compared to single classification
QSAR models. To this end, we used a data set of three properties (androgen
receptor binding, agonism, and antagonism) for approximately 4000
molecules with predictions performed by more than 20 QSAR models,
made available in a large-scale collaborative project. The individual
QSAR models were compared with two consensus approaches, majority
voting and the Bayes consensus with discrete probability distributions,
in both protective and nonprotective forms. Consensus strategies proved
to be more accurate and to better cover the analyzed chemical space
than individual QSARs on average, thus motivating their widespread
application for property prediction. Scripts and data to reproduce
the results of this study are available for download.
Collapse
Affiliation(s)
- Cecile Valsecchi
- Milano Chemometrics and QSAR Research Group, University of Milano Bicocca, P.za della Scienza 1, 20126 Milano, Italy
| | - Francesca Grisoni
- Department of Chemistry and Applied Biosciences, ETH Zurich, Vladimir-Prelog-Weg 4, 8049 Zurich, Switzerland
| | - Viviana Consonni
- Milano Chemometrics and QSAR Research Group, University of Milano Bicocca, P.za della Scienza 1, 20126 Milano, Italy
| | - Davide Ballabio
- Milano Chemometrics and QSAR Research Group, University of Milano Bicocca, P.za della Scienza 1, 20126 Milano, Italy
| |
Collapse
|
18
|
Li X, Kleinstreuer NC, Fourches D. Hierarchical Quantitative Structure–Activity Relationship Modeling Approach for Integrating Binary, Multiclass, and Regression Models of Acute Oral Systemic Toxicity. Chem Res Toxicol 2020; 33:353-366. [DOI: 10.1021/acs.chemrestox.9b00259] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Affiliation(s)
- Xinhao Li
- Department of Chemistry, Bioinformatics Research Center, North Carolina State University, Raleigh, North Carolina 27695, United States
| | - Nicole C. Kleinstreuer
- Division of Intramural Research/Biostatistics and Computational Biology Branch, NIEHS, Research Triangle
Park, Durham, North Carolina 27709, United States
- National Toxicology Program Interagency Center for the Evaluation of Alternative Toxicological Methods, NIEHS, Research Triangle Park, Durham, North Carolina 27709, United States
| | - Denis Fourches
- Department of Chemistry, Bioinformatics Research Center, North Carolina State University, Raleigh, North Carolina 27695, United States
| |
Collapse
|
19
|
Gadaleta D, Vuković K, Toma C, Lavado GJ, Karmaus AL, Mansouri K, Kleinstreuer NC, Benfenati E, Roncaglioni A. SAR and QSAR modeling of a large collection of LD 50 rat acute oral toxicity data. J Cheminform 2019; 11:58. [PMID: 33430989 PMCID: PMC6717335 DOI: 10.1186/s13321-019-0383-2] [Citation(s) in RCA: 54] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2019] [Accepted: 08/13/2019] [Indexed: 11/10/2022] Open
Abstract
The median lethal dose for rodent oral acute toxicity (LD50) is a standard piece of information required to categorize chemicals in terms of the potential hazard posed to human health after acute exposure. The exclusive use of in vivo testing is limited by the time and costs required for performing experiments and by the need to sacrifice a number of animals. (Quantitative) structure-activity relationships [(Q)SAR] proved a valid alternative to reduce and assist in vivo assays for assessing acute toxicological hazard. In the framework of a new international collaborative project, the NTP Interagency Center for the Evaluation of Alternative Toxicological Methods and the U.S. Environmental Protection Agency's National Center for Computational Toxicology compiled a large database of rat acute oral LD50 data, with the aim of supporting the development of new computational models for predicting five regulatory relevant acute toxicity endpoints. In this article, a series of regression and classification computational models were developed by employing different statistical and knowledge-based methodologies. External validation was performed to demonstrate the real-life predictability of models. Integrated modeling was then applied to improve performance of single models. Statistical results confirmed the relevance of developed models in regulatory frameworks, and confirmed the effectiveness of integrated modeling. The best integrated strategies reached RMSEs lower than 0.50 and the best classification models reached balanced accuracies over 0.70 for multi-class and over 0.80 for binary endpoints. Computed predictions will be hosted on the EPA's Chemistry Dashboard and made freely available to the scientific community.
Collapse
Affiliation(s)
- Domenico Gadaleta
- Laboratory of Environmental Chemistry and Toxicology, Department of Environmental Health Sciences, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Via Mario Negri 2, 20156, Milan, Italy.
| | - Kristijan Vuković
- Laboratory of Environmental Chemistry and Toxicology, Department of Environmental Health Sciences, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Via Mario Negri 2, 20156, Milan, Italy
| | - Cosimo Toma
- Laboratory of Environmental Chemistry and Toxicology, Department of Environmental Health Sciences, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Via Mario Negri 2, 20156, Milan, Italy
- Institute for Risk Assessment Sciences, Utrecht University, PO Box 80177, 3508 TD, Utrecht, The Netherlands
| | - Giovanna J Lavado
- Laboratory of Environmental Chemistry and Toxicology, Department of Environmental Health Sciences, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Via Mario Negri 2, 20156, Milan, Italy
| | - Agnes L Karmaus
- Integrated Laboratory Systems, Research Triangle Park, NC, 27560, USA
| | - Kamel Mansouri
- Integrated Laboratory Systems, Research Triangle Park, NC, 27560, USA
| | - Nicole C Kleinstreuer
- NTP Interagency Center for the Evaluation of Alternative Toxicological Methods, National Institute of Environmental Health Sciences, Research Triangle Park, NC, 27560, USA
| | - Emilio Benfenati
- Laboratory of Environmental Chemistry and Toxicology, Department of Environmental Health Sciences, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Via Mario Negri 2, 20156, Milan, Italy
| | - Alessandra Roncaglioni
- Laboratory of Environmental Chemistry and Toxicology, Department of Environmental Health Sciences, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Via Mario Negri 2, 20156, Milan, Italy
| |
Collapse
|
20
|
Tinkov O, Grigorev V, Polishchuk P, Yarkov A, Raevsky O. QSAR investigation of acute toxicity of organic compounds during oral administration to mice. ACTA ACUST UNITED AC 2019; 65:123-132. [DOI: 10.18097/pbmc20196502123] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
The effect of the structure of organic compounds on the acute toxicity upon oral injection in mice was studied using 2D simplex representation of the molecular structure and Random forest (RF) methods. Satisfactory quantitative structure-activity relationship (QSAR) models were constructed (R2 test = 0,61–0,62). The interpretation of the obtained QSAR models was carried out. The contributions of known toxicophores with established mechanisms of action were calculated in order to confirm the ability of the interpretation approach to correctly rank them relative to other structural fragments. The influence of the molecular surroundings of some toxicophores was analyzed. We analyzed the contributions of other highly ranked fragments from the list of common functional groups and ring systems in order to find new potential toxicophores. The on-line version of the expert system “OCHEM” (https://ochem.eu) and Arithmetic Mean Toxicity (AMT) approach were used for a comparative QSAR study.
Collapse
Affiliation(s)
- O.V. Tinkov
- Military Institute of the Ministry of Defense, Tiraspol, Moldova
| | - V.Yu. Grigorev
- Institute of Physiologically Active Compounds, Russian Academy of Sciences, Chernogolovka, Russia
| | - P.G. Polishchuk
- Institute of Molecular and Translational Medicine, Faculty of Medicine and Dentistry, Palacký University, Olomouc, Czech Republic
| | - A.V. Yarkov
- Institute of Physiologically Active Compounds, Russian Academy of Sciences, Chernogolovka, Russia
| | - O.A. Raevsky
- Institute of Physiologically Active Compounds, Russian Academy of Sciences, Chernogolovka, Russia
| |
Collapse
|