Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Dragos H, Gilles M, Alexandre V. Predicting the predictability: a unified approach to the applicability domain problem of QSAR models. J Chem Inf Model 2009;49:1762-76. [PMID: 19530661 DOI: 10.1021/ci9000579] [Citation(s) in RCA: 128] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]

For:	Dragos H, Gilles M, Alexandre V. Predicting the predictability: a unified approach to the applicability domain problem of QSAR models. J Chem Inf Model 2009;49:1762-76. [PMID: 19530661 DOI: 10.1021/ci9000579] [Citation(s) in RCA: 128] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]

Collapse

Number

Cited by Other Article(s)

Engler Hart C, Preto AJ, Chanana S, Healey D, Kind T, Domingo-Fernández D. Evaluating the generalizability of graph neural networks for predicting collision cross section. J Cheminform 2024;16:105. [PMID: 39210378 PMCID: PMC11363525 DOI: 10.1186/s13321-024-00899-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2024] [Accepted: 08/19/2024] [Indexed: 09/04/2024] Open

Heyndrickx W, Mervin L, Morawietz T, Sturm N, Friedrich L, Zalewski A, Pentina A, Humbeck L, Oldenhof M, Niwayama R, Schmidtke P, Fechner N, Simm J, Arany A, Drizard N, Jabal R, Afanasyeva A, Loeb R, Verma S, Harnqvist S, Holmes M, Pejo B, Telenczuk M, Holway N, Dieckmann A, Rieke N, Zumsande F, Clevert DA, Krug M, Luscombe C, Green D, Ertl P, Antal P, Marcus D, Do Huu N, Fuji H, Pickett S, Acs G, Boniface E, Beck B, Sun Y, Gohier A, Rippmann F, Engkvist O, Göller AH, Moreau Y, Galtier MN, Schuffenhauer A, Ceulemans H. MELLODDY: Cross-pharma Federated Learning at Unprecedented Scale Unlocks Benefits in QSAR without Compromising Proprietary Information. J Chem Inf Model 2024;64:2331-2344. [PMID: 37642660 PMCID: PMC11005050 DOI: 10.1021/acs.jcim.3c00799] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2023] [Indexed: 08/31/2023]

Affiliation(s)

Wouter Heyndrickx Janssen Pharmaceutica NV, Turnhoutseweg 30, Beerse 2340, Belgium
Lewis Mervin AstraZeneca R&D, Biomedical Campus, 1 Francis Crick Ave, Cambridge CB2 0SL, U.K.
Tobias Morawietz Bayer Pharma AG, Global Drug Discovery, Chemical Research, Computational Chemistry, Aprather Weg 18 a, Wuppertal 42096, Germany
Noé Sturm Novartis Institutes for BioMedical Research, Novartis Campus, Basel 4002, Switzerland
Lukas Friedrich Merck KGaA, Global Research & Development, Frankfurter Strasse 250, Darmstadt 64293, Germany
Adam Zalewski Amgen Research (Munich) GmbH, Staffelseestraße 2, Munich 81477, Germany
Anastasia Pentina Bayer AG, Machine Learning Research, Research & Development, Pharmaceuticals, Berlin 10117, Germany
Lina Humbeck BI Medicinal Chemistry Department, Boehringer Ingelheim Pharma GmbH & Co. KG, Birkendorfer Str. 65, Biberach an der Riss 88397, Germany
Martijn Oldenhof KU Leuven, ESAT-STADIUS, Kasteelpark Arenberg 10, Heverlee 3001, Belgium
Ritsuya Niwayama Institut de recherches Servier, 125 chemin de ronde Croissy-sur-Seine, Île-de-France 78290, France
Peter Schmidtke Discngine, Avenue Ledru Rollin 79, Paris 75012, France
Nikolas Fechner Novartis Institutes for BioMedical Research, Novartis Campus, Basel 4002, Switzerland
Jaak Simm KU Leuven, ESAT-STADIUS, Kasteelpark Arenberg 10, Heverlee 3001, Belgium
Adam Arany KU Leuven, ESAT-STADIUS, Kasteelpark Arenberg 10, Heverlee 3001, Belgium
Nicolas Drizard Iktos, 65 rue de Prony, Paris 75017, France
Rama Jabal Iktos, 65 rue de Prony, Paris 75017, France
Arina Afanasyeva Modality Informatics Group, Digital Research Solutions, Advanced Informatics & Analytics, Astellas Pharma Inc., 21 Miyukigaoka, Tsukuba-shi, Ibaraki 305-8585, Japan
Regis Loeb KU Leuven, ESAT-STADIUS, Kasteelpark Arenberg 10, Heverlee 3001, Belgium
Shlok Verma GlaxoSmithKline, Computational Sciences, Gunnels Wood Road Stevenage, Herts SG1 2NY, U.K.
Simon Harnqvist GlaxoSmithKline, Computational Sciences, Gunnels Wood Road Stevenage, Herts SG1 2NY, U.K.
Matthew Holmes GlaxoSmithKline, Computational Sciences, Gunnels Wood Road Stevenage, Herts SG1 2NY, U.K.
Balazs Pejo Budapest University of Technology and Economics, Department of Networked Systems and Services, Műegyetem rkp. 3, Budapest 1111, Hungary
Maria Telenczuk Owkin, 12 Rue Martel, Paris 75010, France
Nicholas Holway Novartis Institutes for BioMedical Research, Novartis Campus, Basel 4002, Switzerland
Arne Dieckmann Bayer AG, API Production, Product Supply, Pharmaceuticals, Ernst-Schering-Straße 14, Bergkamen 59192, Germany
Nicola Rieke NVIDIA GmbH, Floessergasse 2, Munich 81369, Germany
Friederike Zumsande Amgen Research (Munich) GmbH, Staffelseestraße 2, Munich 81477, Germany
Djork-Arné Clevert Bayer AG, Machine Learning Research, Research & Development, Pharmaceuticals, Berlin 10117, Germany
Michael Krug Merck KGaA, Global Research & Development, Frankfurter Strasse 250, Darmstadt 64293, Germany
Christopher Luscombe GlaxoSmithKline, Computational Sciences, Gunnels Wood Road Stevenage, Herts SG1 2NY, U.K.
Darren Green GlaxoSmithKline, Computational Sciences, Gunnels Wood Road Stevenage, Herts SG1 2NY, U.K.
Peter Ertl Novartis Institutes for BioMedical Research, Novartis Campus, Basel 4002, Switzerland
Peter Antal Budapest University of Technology and Economics, Department of Measurement and Information Systems, Műegyetem rkp. 3, Budapest 1111, Hungary
David Marcus GlaxoSmithKline, Computational Sciences, Gunnels Wood Road Stevenage, Herts SG1 2NY, U.K.
Nicolas Do Huu Iktos, 65 rue de Prony, Paris 75017, France
Hideyoshi Fuji Modality Informatics Group, Digital Research Solutions, Advanced Informatics & Analytics, Astellas Pharma Inc., 21 Miyukigaoka, Tsukuba-shi, Ibaraki 305-8585, Japan
Stephen Pickett GlaxoSmithKline, Computational Sciences, Gunnels Wood Road Stevenage, Herts SG1 2NY, U.K.
Gergely Acs Budapest University of Technology and Economics, Department of Networked Systems and Services, Műegyetem rkp. 3, Budapest 1111, Hungary
Eric Boniface Substra Foundation - Labelia Labs, 4 rue Voltaire, Nantes 44000, France
Bernd Beck BI Medicinal Chemistry Department, Boehringer Ingelheim Pharma GmbH & Co. KG, Birkendorfer Str. 65, Biberach an der Riss 88397, Germany
Yax Sun Amgen Research, 1 Amgen Center Drive, Thousand Oaks, California 92130, United States
Arnaud Gohier Institut de recherches Servier, 125 chemin de ronde Croissy-sur-Seine, Île-de-France 78290, France
Friedrich Rippmann Merck KGaA, Global Research & Development, Frankfurter Strasse 250, Darmstadt 64293, Germany
Ola Engkvist AstraZeneca, Molecular AI, Discovery Sciences, R&D, Pepparedsleden 1, Mölndal 431 50, Sweden
Andreas H. Göller Bayer Pharma AG, Global Drug Discovery, Chemical Research, Computational Chemistry, Aprather Weg 18 a, Wuppertal 42096, Germany
Yves Moreau KU Leuven, ESAT-STADIUS, Kasteelpark Arenberg 10, Heverlee 3001, Belgium
Mathieu N. Galtier Owkin, 4 Rue Voltaire, Nantes 44000, France
Ansgar Schuffenhauer Novartis Institutes for BioMedical Research, Novartis Campus, Basel 4002, Switzerland
Hugo Ceulemans Janssen Pharmaceutica NV, Turnhoutseweg 30, Beerse 2340, Belgium

Collapse

Samizo S, Kaneko H. Predictive Modeling of HMG-CoA Reductase Inhibitory Activity and Design of New HMG-CoA Reductase Inhibitors. ACS OMEGA 2023;8:27247-27255. [PMID: 37546661 PMCID: PMC10399166 DOI: 10.1021/acsomega.3c02567] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/14/2023] [Accepted: 06/30/2023] [Indexed: 08/08/2023]

Gui C, Li Y, Peng T. Development of predictive QSAR models for the substrates/inhibitors of OATP1B1 by deep neural networks. Toxicol Lett 2023;376:20-25. [PMID: 36649904 DOI: 10.1016/j.toxlet.2023.01.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2022] [Revised: 01/10/2023] [Accepted: 01/12/2023] [Indexed: 01/15/2023]

In Silico Identification of Anti-SARS-CoV-2 Medicinal Plants Using Cheminformatics and Machine Learning. MOLECULES (BASEL, SWITZERLAND) 2022;28:molecules28010208. [PMID: 36615401 PMCID: PMC9821958 DOI: 10.3390/molecules28010208] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/29/2022] [Revised: 12/17/2022] [Accepted: 12/23/2022] [Indexed: 12/28/2022]

Abstract

Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the causative pathogen of COVID-19, is spreading rapidly and has caused hundreds of millions of infections and millions of deaths worldwide. Due to the lack of specific vaccines and effective treatments for COVID-19, there is an urgent need to identify effective drugs. Traditional Chinese medicine (TCM) is a valuable resource for identifying novel anti-SARS-CoV-2 drugs based on the important contribution of TCM and its potential benefits in COVID-19 treatment. Herein, we aimed to discover novel anti-SARS-CoV-2 compounds and medicinal plants from TCM by establishing a prediction method of anti-SARS-CoV-2 activity using machine learning methods. We first constructed a benchmark dataset from anti-SARS-CoV-2 bioactivity data collected from the ChEMBL database. Then, we established random forest (RF) and support vector machine (SVM) models that both achieved satisfactory predictive performance with AUC values of 0.90. By using this method, a total of 1011 active anti-SARS-CoV-2 compounds were predicted from the TCMSP database. Among these compounds, six compounds with highly potent activity were confirmed in the anti-SARS-CoV-2 experiments. The molecular fingerprint similarity analysis revealed that only 24 of the 1011 compounds have high similarity to the FDA-approved antiviral drugs, indicating that most of the compounds were structurally novel. Based on the predicted anti-SARS-CoV-2 compounds, we identified 74 anti-SARS-CoV-2 medicinal plants through enrichment analysis. The 74 plants are widely distributed in 68 genera and 43 families, 14 of which belong to antipyretic detoxicate plants. In summary, this study provided several medicinal plants with potential anti-SARS-CoV-2 activity, which offer an attractive starting point and a broader scope to mine for potentially novel anti-SARS-CoV-2 drugs.

Collapse

Zhao Q, Yu Y, Gao Y, Shen L, Cui S, Gou Y, Zhang C, Zhuang S, Jiang G. Machine Learning-Based Models with High Accuracy and Broad Applicability Domains for Screening PMT/vPvM Substances. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2022;56:17880-17889. [PMID: 36475377 DOI: 10.1021/acs.est.2c06155] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/17/2023]

Affiliation(s)

Qiming Zhao Key Laboratory of Environment Remediation and Ecological Health, Ministry of Education, College of Environmental and Resource Sciences, Zhejiang University, Hangzhou310058, China
Yang Yu Solid Waste and Chemicals Management Center, Ministry of Ecology and Environment of the People's Republic of China, Beijing100029, China
Yuchen Gao Key Laboratory of Environment Remediation and Ecological Health, Ministry of Education, College of Environmental and Resource Sciences, Zhejiang University, Hangzhou310058, China
Lilai Shen Key Laboratory of Environment Remediation and Ecological Health, Ministry of Education, College of Environmental and Resource Sciences, Zhejiang University, Hangzhou310058, China
Shixuan Cui Key Laboratory of Environment Remediation and Ecological Health, Ministry of Education, College of Environmental and Resource Sciences, Zhejiang University, Hangzhou310058, China Women's Reproductive Health Key Laboratory of Zhejiang Province, Women's Hospital, School of Medicine, Zhejiang University, Hangzhou310006, China
Yiyuan Gou Key Laboratory of Environment Remediation and Ecological Health, Ministry of Education, College of Environmental and Resource Sciences, Zhejiang University, Hangzhou310058, China
Chunlong Zhang Department of Environmental Sciences, University of Houston-Clear Lake, 2700 Bay Area Blvd., Houston, Texas77058, United States
Shulin Zhuang Key Laboratory of Environment Remediation and Ecological Health, Ministry of Education, College of Environmental and Resource Sciences, Zhejiang University, Hangzhou310058, China Women's Reproductive Health Key Laboratory of Zhejiang Province, Women's Hospital, School of Medicine, Zhejiang University, Hangzhou310006, China
Guibin Jiang State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing100085, China

Collapse

Hao Y, Fan T, Sun G, Li F, Zhang N, Zhao L, Zhong R. Environmental toxicity risk evaluation of nitroaromatic compounds: Machine learning driven binary/multiple classification and design of safe alternatives. Food Chem Toxicol 2022;170:113461. [PMID: 36243219 DOI: 10.1016/j.fct.2022.113461] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2022] [Revised: 09/11/2022] [Accepted: 10/04/2022] [Indexed: 11/06/2022]

Bort W, Mazitov D, Horvath D, Bonachera F, Lin A, Marcou G, Baskin I, Madzhidov T, Varnek A. Inverse QSAR: Reversing Descriptor-Driven Prediction Pipeline Using Attention-Based Conditional Variational Autoencoder. J Chem Inf Model 2022;62:5471-5484. [PMID: 36332178 DOI: 10.1021/acs.jcim.2c01086] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]

Rayevsky AV, Poturai AS, Kravets IO, Pashenko AE, Borisova TA, Tolstanova GM, Volochnyuk DM, Borysko PO, Vadzyuk OB, Alieksieieva DO, Zabolotna Y, Klimchuk O, Horvath D, Marcou G, Ryabukhin SV, Varnek A. In Vitro Evaluation of In Silico Screening Approaches in Search for Selective ACE2 Binding Chemical Probes. Molecules 2022;27:molecules27175400. [PMID: 36080168 PMCID: PMC9458095 DOI: 10.3390/molecules27175400] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2022] [Revised: 08/16/2022] [Accepted: 08/18/2022] [Indexed: 11/16/2022] Open

Affiliation(s)

Alexey V. Rayevsky Enamine Ltd., 78 Chervonotkatska Street, 02660 Kyiv, Ukraine Institute of Food Biotechnology and Genomics, National Academy of Sciences of Ukraine, 2a Osipovskogo Street, 04123 Kyiv, Ukraine
Andrii S. Poturai Enamine Ltd., 78 Chervonotkatska Street, 02660 Kyiv, Ukraine
Iryna O. Kravets Enamine Ltd., 78 Chervonotkatska Street, 02660 Kyiv, Ukraine Chemspace LLC, 85 Chervonotkatska Street, 02094 Kyiv, Ukraine
Alexander E. Pashenko Enamine Ltd., 78 Chervonotkatska Street, 02660 Kyiv, Ukraine Educational and Scientific Institute of High Technologies, Taras Shevchenko National University of Kyiv, 60 Volodymyrska Street, 01033 Kyiv, Ukraine Institute of Organic Chemistry, National Academy of Sciences of Ukraine, 5 Murmanska Street, 03028 Kyiv, Ukraine
Tatiana A. Borisova Palladin Institute of Biochemistry of the National Academy of Sciences of Ukraine, 9 Leontovitcha Street, 01054 Kyiv, Ukraine
Ganna M. Tolstanova Educational and Scientific Institute of High Technologies, Taras Shevchenko National University of Kyiv, 60 Volodymyrska Street, 01033 Kyiv, Ukraine
Dmitriy M. Volochnyuk Enamine Ltd., 78 Chervonotkatska Street, 02660 Kyiv, Ukraine Chemspace LLC, 85 Chervonotkatska Street, 02094 Kyiv, Ukraine Educational and Scientific Institute of High Technologies, Taras Shevchenko National University of Kyiv, 60 Volodymyrska Street, 01033 Kyiv, Ukraine
Petro O. Borysko Enamine Ltd., 78 Chervonotkatska Street, 02660 Kyiv, Ukraine
Olga B. Vadzyuk Enamine Ltd., 78 Chervonotkatska Street, 02660 Kyiv, Ukraine
Diana O. Alieksieieva Enamine Ltd., 78 Chervonotkatska Street, 02660 Kyiv, Ukraine
Yuliana Zabolotna Laboratory of Chemoinformatics, University of Strasbourg, 4, rue B. Pascal, 67081 Strasbourg, France
Olga Klimchuk Laboratory of Chemoinformatics, University of Strasbourg, 4, rue B. Pascal, 67081 Strasbourg, France
Dragos Horvath Laboratory of Chemoinformatics, University of Strasbourg, 4, rue B. Pascal, 67081 Strasbourg, France
Gilles Marcou Laboratory of Chemoinformatics, University of Strasbourg, 4, rue B. Pascal, 67081 Strasbourg, France
Sergey V. Ryabukhin Enamine Ltd., 78 Chervonotkatska Street, 02660 Kyiv, Ukraine Educational and Scientific Institute of High Technologies, Taras Shevchenko National University of Kyiv, 60 Volodymyrska Street, 01033 Kyiv, Ukraine Institute of Organic Chemistry, National Academy of Sciences of Ukraine, 5 Murmanska Street, 03028 Kyiv, Ukraine Correspondence: (S.V.R.); (A.V.)
Alexandre Varnek Laboratory of Chemoinformatics, University of Strasbourg, 4, rue B. Pascal, 67081 Strasbourg, France Correspondence: (S.V.R.); (A.V.)

Collapse

Grebner C, Matter H, Hessler G. Artificial Intelligence in Compound Design. Methods Mol Biol 2021;2390:349-382. [PMID: 34731477 DOI: 10.1007/978-1-0716-1787-8_15] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/10/2023]

Gantzer P, Creton B, Nieto-Draghi C. Comparisons of Molecular Structure Generation Methods Based on Fragment Assemblies and Genetic Graphs. J Chem Inf Model 2021;61:4245-4258. [PMID: 34405674 DOI: 10.1021/acs.jcim.1c00803] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

Zhang X, Zhao P, Wang Z, Xu X, Liu G, Tang Y, Li W. In Silico Prediction of CYP2C8 Inhibition with Machine-Learning Methods. Chem Res Toxicol 2021;34:1850-1859. [PMID: 34255486 DOI: 10.1021/acs.chemrestox.1c00078] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]

Baybekov S, Marcou G, Ramos P, Saurel O, Galzi JL, Varnek A. DMSO Solubility Assessment for Fragment-Based Screening. Molecules 2021;26:3950. [PMID: 34203441 PMCID: PMC8271413 DOI: 10.3390/molecules26133950] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2021] [Revised: 06/23/2021] [Accepted: 06/23/2021] [Indexed: 11/16/2022] Open

Yoshihama H, Kaneko H. Design of thermoelectric materials with high electrical conductivity, high Seebeck coefficient, and low thermal conductivity. ANALYTICAL SCIENCE ADVANCES 2021;2:289-294. [PMID: 38716157 PMCID: PMC10989581 DOI: 10.1002/ansa.202000114] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/16/2020] [Revised: 11/25/2020] [Accepted: 11/26/2020] [Indexed: 08/18/2024]

Matsumoto K, Miyao T, Funatsu K. Ranking-Oriented Quantitative Structure-Activity Relationship Modeling Combined with Assay-Wise Data Integration. ACS OMEGA 2021;6:11964-11973. [PMID: 34056351 PMCID: PMC8154010 DOI: 10.1021/acsomega.1c00463] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/26/2021] [Accepted: 04/21/2021] [Indexed: 05/15/2023]

Morger A, Svensson F, Arvidsson McShane S, Gauraha N, Norinder U, Spjuth O, Volkamer A. Assessing the calibration in toxicological in vitro models with conformal prediction. J Cheminform 2021;13:35. [PMID: 33926567 PMCID: PMC8082859 DOI: 10.1186/s13321-021-00511-5] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2021] [Accepted: 04/10/2021] [Indexed: 11/11/2022] Open

Liu X, Zhang H, Xue Q, Pan W, Zhang A. In silico health effect prioritization of environmental chemicals through transcriptomics data exploration from a chemo-centric view. THE SCIENCE OF THE TOTAL ENVIRONMENT 2021;762:143082. [PMID: 33143927 DOI: 10.1016/j.scitotenv.2020.143082] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/09/2020] [Revised: 10/11/2020] [Accepted: 10/11/2020] [Indexed: 06/11/2023]

Horvath D, Marcou G, Varnek A. Trustworthiness, the Key to Grid-Based Map-Driven Predictive Model Enhancement and Applicability Domain Control. J Chem Inf Model 2020;60:6020-6032. [DOI: 10.1021/acs.jcim.0c00998] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

Kurosaki K, Wu R, Uesawa Y. A Toxicity Prediction Tool for Potential Agonist/Antagonist Activities in Molecular Initiating Events Based on Chemical Structures. Int J Mol Sci 2020;21:ijms21217853. [PMID: 33113912 PMCID: PMC7660166 DOI: 10.3390/ijms21217853] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2020] [Revised: 10/07/2020] [Accepted: 10/21/2020] [Indexed: 12/15/2022] Open

Rakhimbekova A, Madzhidov TI, Nugmanov RI, Gimadiev TR, Baskin II, Varnek A. Comprehensive Analysis of Applicability Domains of QSPR Models for Chemical Reactions. Int J Mol Sci 2020;21:E5542. [PMID: 32756326 PMCID: PMC7432167 DOI: 10.3390/ijms21155542] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2020] [Revised: 07/27/2020] [Accepted: 07/30/2020] [Indexed: 01/28/2023] Open

Yan X, Sedykh A, Wang W, Yan B, Zhu H. Construction of a web-based nanomaterial database by big data curation and modeling friendly nanostructure annotations. Nat Commun 2020;11:2519. [PMID: 32433469 PMCID: PMC7239871 DOI: 10.1038/s41467-020-16413-3] [Citation(s) in RCA: 53] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2020] [Accepted: 04/22/2020] [Indexed: 12/27/2022] Open

Mora JR, Marrero-Ponce Y, García-Jacas CR, Suarez Causado A. Ensemble Models Based on QuBiLS-MAS Features and Shallow Learning for the Prediction of Drug-Induced Liver Toxicity: Improving Deep Learning and Traditional Approaches. Chem Res Toxicol 2020;33:1855-1873. [PMID: 32406679 DOI: 10.1021/acs.chemrestox.0c00030] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]

Abstract

Drug-induced liver injury (DILI) is a key safety issue in the drug discovery pipeline and a regulatory concern. Thus, many in silico tools have been proposed to improve the hepatotoxicity prediction of organic-type chemicals. Here, classifiers for the prediction of DILI were developed by using QuBiLS-MAS 0-2.5D molecular descriptors and shallow machine learning techniques, on a training set composed of 1075 molecules. The best ensemble model build, E13, was obtained with good statistical parameters for the learning series, namely, the following: accuracy = 0.840, sensibility = 0.890, specificity = 0.761, Matthew's correlation coefficient = 0.660, and area under the ROC curve = 0.904. The model was also satisfactorily evaluated with Y-scrambling test, and repeated k-fold cross-validation and repeated k-holdout validation. In addition, an exhaustive external validation was also carried out by using two test sets and five external test sets, with an average accuracy value equal to 0.854 (±0.062) and a coverage equal to 98.4% according to its applicability domain. A statistical comparison of the performance of the E13 model, with regard to results and tools (e.g., Padel DDPredictor Software, Deep Learning DILIserver, and Vslead) reported in the literature, was also performed. In general, E13 presented the best global performance in all experiments. The sum of the ranking differences procedure provided a very similar grouping pattern to that of the M-ANOVA statistical analysis, where E13 was identified as the best model for DILI predictions. A noncommercial and fully cross-platform software for the DILI prediction was also developed, which is freely available at http://tomocomd.com/apps/ptoxra. This software was used for the screening of seven data sets, containing natural products, leads, toxic materials, and FDA approved drugs, to assess the usefulness of the QSAR models in the DILI labeling of organic substances; it was found that 50-92% of the evaluated molecules are positive-DILI compounds. All in all, it can be stated that the E13 model is a relevant method for the prediction of DILI risk in humans, as it shows the best results among all of the methods analyzed.

Collapse

Toropov AA, Toropova AP, Marzo M, Carnesecchi E, Selvestrel G, Benfenati E. Pesticides, cosmetics, drugs: identical and opposite influences of various molecular features as measures of endpoints similarity and dissimilarity. Mol Divers 2020;25:1137-1144. [PMID: 32323128 DOI: 10.1007/s11030-020-10085-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2020] [Accepted: 04/06/2020] [Indexed: 11/26/2022]

Kato H. Computational prediction of cytochrome P450 inhibition and induction. Drug Metab Pharmacokinet 2019;35:30-44. [PMID: 31902468 DOI: 10.1016/j.dmpk.2019.11.006] [Citation(s) in RCA: 37] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2019] [Revised: 10/27/2019] [Accepted: 11/17/2019] [Indexed: 12/14/2022]

Shiri F, Bakhshayesh S, Ghasemi JB. Computer-aided molecular design of (E)-N-Aryl-2-ethene-sulfonamide analogues as microtubule targeted agents in prostate cancer. ARAB J CHEM 2019. [DOI: 10.1016/j.arabjc.2014.11.063] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023] Open

Toropova AP, Toropov AA. Whether the Validation of the Predictive Potential of Toxicity Models is a Solved Task? Curr Top Med Chem 2019;19:2643-2657. [PMID: 31702504 DOI: 10.2174/1568026619666191105111817] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2019] [Revised: 09/02/2019] [Accepted: 09/04/2019] [Indexed: 12/23/2022]

García-Jacas CR, Marrero-Ponce Y, Cortés-Guzmán F, Suárez-Lezcano J, Martinez-Rios FO, García-González LA, Pupo-Meriño M, Martinez-Mayorga K. Enhancing Acute Oral Toxicity Predictions by using Consensus Modeling and Algebraic Form-Based 0D-to-2D Molecular Encodes. Chem Res Toxicol 2019;32:1178-1192. [PMID: 31066547 DOI: 10.1021/acs.chemrestox.9b00011] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]

Abstract

Quantitative structure-activity relationships (QSAR) are introduced to predict acute oral toxicity (AOT), by using the QuBiLS-MAS (acronym for quadratic, bilinear and N-Linear maps based on graph-theoretic electronic-density matrices and atomic weightings) framework for the molecular encoding. Three training sets were employed to build the models: EPA training set (5931 compounds), EPA-full training set (7413 compounds), and Zhu training set (10 152 compounds). Additionally, the EPA test set (1482 compounds) was used for the validation of the QSAR models built on the EPA training set, while the ProTox (425 compounds) and T3DB (284 compounds) external sets were employed for the assessment of all the models. The k-nearest neighbor, multilayer perceptron, random forest, and support vector machine procedures were employed to build several base (individual) models. The base models with R_EPA-training ≥ 0.75 ( R = correlation coefficient) and MAE_EPA-training ≤ 0.5 (MAE = mean absolute error) were retained to build consensus models. As a result, two consensus models based on the minimum operator and denoted as M19 and M22, as well as a consensus model based on the weighted average operator and denoted as M24, were selected as the best ones for each training set considered. According to the applicability domain (AD) analysis performed, model M19 (built on the EPA training set) has MAE_test-AD = 0.4044, MAE_ProTox-AD = 0.4067 and MAE_T3DB-AD = 0.2586 on the EPA test set, ProTox external set, and T3DB external set, respectively; whereas model M22 (built on the EPA-full set) and model M24 (built on the Zhu set) present MAE_ProTox-AD = 0.3992 and MAE_T3DB-AD = 0.2286, and MAE_ProTox-AD = 0.3773 and MAE_T3DB-AD = 0.2471 on the two external sets accounted for, respectively. These outcomes were compared and statistically validated with respect to 14 QSAR methods (e.g., admetSAR, ProTox-II) from the literature. As a result, model M22 presents the best overall performance. In addition, a retrospective study on 261 withdrawn drugs due to their toxic/side effects was performed, to assess the usefulness of prospectively using the QSAR models proposed in the labeling of chemicals. A comparison with regard to the methods from the literature was also made. As a result, model M22 has the best ability of labeling a compound as toxic according to the globally harmonized system of classification and labeling of chemicals. Therefore, it can be concluded that the models proposed, especially model M22, constitute prominent tools for studying AOT, at providing the best results among all the methods examined. A freely available software was also developed to be used in virtual screening tasks ( http://tomocomd.com/apps/ptoxra ).

Collapse

Toropov AA, Raška I, Toropova AP, Raškova M, Veselinović AM, Veselinović JB. The study of the index of ideality of correlation as a new criterion of predictive potential of QSPR/QSAR-models. THE SCIENCE OF THE TOTAL ENVIRONMENT 2019;659:1387-1394. [PMID: 31096349 DOI: 10.1016/j.scitotenv.2018.12.439] [Citation(s) in RCA: 35] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/25/2018] [Revised: 12/14/2018] [Accepted: 12/28/2018] [Indexed: 06/09/2023]

Berenger F, Yamanishi Y. A Distance-Based Boolean Applicability Domain for Classification of High Throughput Screening Data. J Chem Inf Model 2019;59:463-476. [PMID: 30567434 DOI: 10.1021/acs.jcim.8b00499] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]

Hanser T, Barber C, Guesné S, Marchaland JF, Werner S. Applicability Domain: Towards a More Formal Framework to Express the Applicability of a Model and the Confidence in Individual Predictions. CHALLENGES AND ADVANCES IN COMPUTATIONAL CHEMISTRY AND PHYSICS 2019. [DOI: 10.1007/978-3-030-16443-0_11] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]

Chen Y, Yang H, Wu Z, Liu G, Tang Y, Li W. Prediction of Farnesoid X Receptor Disruptors with Machine Learning Methods. Chem Res Toxicol 2018;31:1128-1137. [DOI: 10.1021/acs.chemrestox.8b00162] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]

Ruiz IL, Gómez-Nieto MÁ. Study of the Applicability Domain of the QSAR Classification Models by Means of the Rivality and Modelability Indexes. Molecules 2018;23:molecules23112756. [PMID: 30356020 PMCID: PMC6278359 DOI: 10.3390/molecules23112756] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2018] [Revised: 10/14/2018] [Accepted: 10/22/2018] [Indexed: 11/30/2022] Open

Roy K, Ambure P, Kar S. How Precise Are Our Quantitative Structure-Activity Relationship Derived Predictions for New Query Chemicals? ACS OMEGA 2018;3:11392-11406. [PMID: 31459245 PMCID: PMC6645132 DOI: 10.1021/acsomega.8b01647] [Citation(s) in RCA: 71] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/13/2018] [Accepted: 09/06/2018] [Indexed: 05/03/2023]

Abstract

Quantitative structure-activity relationship (QSAR) models have long been used for making predictions and data gap filling in diverse fields including medicinal chemistry, predictive toxicology, environmental fate modeling, materials science, agricultural science, nanoscience, food science, and so forth. Usually a QSAR model is developed based on chemical information of a properly designed training set and corresponding experimental response data while the model is validated using one or more test set(s) for which the experimental response data are available. However, it is interesting to estimate the reliability of predictions when the model is applied to a completely new data set (true external set) even when the new data points are within applicability domain (AD) of the developed model. In the present study, we have categorized the quality of predictions for the test set or true external set into three groups (good, moderate, and bad) based on absolute prediction errors. Then, we have used three criteria [(a) mean absolute error of leave-one-out predictions for 10 most close training compounds for each query molecule; (b) AD in terms of similarity based on the standardization approach; and (c) proximity of the predicted value of the query compound to the mean training response] in different weighting schemes for making a composite score of predictions. It was found that using the most frequently appearing weighting scheme 0.5-0-0.5, the composite score-based categorization showed concordance with absolute prediction error-based categorization for more than 80% test data points while working with 5 different datasets with 15 models for each set derived in three different splitting techniques. These observations were also confirmed with true external sets for another four endpoints suggesting applicability of the scheme to judge the reliability of predictions for new datasets. The scheme has been implemented in a tool "Prediction Reliability Indicator" available at http://dtclab.webs.com/software-tools and http://teqip.jdvu.ac.in/QSAR_Tools/DTCLab/, and the tool is presently valid for multiple linear regression models only.

Collapse

Kaneko H. Data Visualization, Regression, Applicability Domains and Inverse Analysis Based on Generative Topographic Mapping. Mol Inform 2018;38:e1800088. [PMID: 30259699 DOI: 10.1002/minf.201800088] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2018] [Accepted: 08/30/2018] [Indexed: 01/11/2023]

McGuinness KN, Pan W, Sheridan RP, Murphy G, Crespo A. Role of simple descriptors and applicability domain in predicting change in protein thermostability. PLoS One 2018;13:e0203819. [PMID: 30192891 PMCID: PMC6128648 DOI: 10.1371/journal.pone.0203819] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2018] [Accepted: 08/28/2018] [Indexed: 01/07/2023] Open

Zhenin M, Bahia MS, Marcou G, Varnek A, Senderowitz H, Horvath D. Rescoring of docking poses under Occam’s Razor: are there simpler solutions? J Comput Aided Mol Des 2018;32:877-888. [DOI: 10.1007/s10822-018-0155-5] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2018] [Accepted: 08/26/2018] [Indexed: 01/04/2023]

Liu R, Glover KP, Feasel MG, Wallqvist A. General Approach to Estimate Error Bars for Quantitative Structure–Activity Relationship Predictions of Molecular Activity. J Chem Inf Model 2018;58:1561-1575. [DOI: 10.1021/acs.jcim.8b00114] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]

Liang Y, Torralba-Sanchez TL, Di Toro DM. Estimating system parameters for solvent-water and plant cuticle-water using quantum chemically estimated Abraham solute parameters. ENVIRONMENTAL SCIENCE. PROCESSES & IMPACTS 2018;20:813-821. [PMID: 29667991 DOI: 10.1039/c7em00601b] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]

Abstract

Polyparameter Linear Free Energy Relationships (pp-LFERs) using Abraham system parameters have many useful applications. However, developing the Abraham system parameters depends on the availability and quality of the Abraham solute parameters. Using Quantum Chemically estimated Abraham solute Parameters (QCAP) is shown to produce pp-LFERs that have lower root mean square errors (RMSEs) of predictions for solvent-water partition coefficients than parameters that are estimated using other presently available methods. pp-LFERs system parameters are estimated for solvent-water, plant cuticle-water systems, and for novel compounds using QCAP solute parameters and experimental partition coefficients. Refitting the system parameter improves the calculation accuracy and eliminates the bias. Refitted models for solvent-water partition coefficients using QCAP solute parameters give better results (RMSE = 0.278 to 0.506 log units for 24 systems) than those based on ABSOLV (0.326 to 0.618) and QSPR (0.294 to 0.700) solute parameters. For munition constituents and munition-like compounds not included in the calibration of the refitted model, QCAP solute parameters produce pp-LFER models with much lower RMSEs for solvent-water partition coefficients (RMSE = 0.734 and 0.664 for original and refitted model, respectively) than ABSOLV (4.46 and 5.98) and QSPR (2.838 and 2.723). Refitting plant cuticle-water pp-LFER including munition constituents using QCAP solute parameters also results in lower RMSE (RMSE = 0.386) than that using ABSOLV (0.778) and QSPR (0.512) solute parameters. Therefore, for fitting a model in situations for which experimental data exist and system parameters can be re-estimated, or for which system parameters do not exist and need to be developed, QCAP is the quantum chemical method of choice.

Collapse

Svensson F, Aniceto N, Norinder U, Cortes-Ciriano I, Spjuth O, Carlsson L, Bender A. Conformal Regression for Quantitative Structure–Activity Relationship Modeling—Quantifying Prediction Uncertainty. J Chem Inf Model 2018;58:1132-1140. [DOI: 10.1021/acs.jcim.8b00054] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]

Mansouri K, Grulke CM, Judson RS, Williams AJ. OPERA models for predicting physicochemical properties and environmental fate endpoints. J Cheminform 2018. [PMID: 29520515 PMCID: PMC5843579 DOI: 10.1186/s13321-018-0263-1] [Citation(s) in RCA: 271] [Impact Index Per Article: 45.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open

Abstract

The collection of chemical structure information and associated experimental data for quantitative structure–activity/property relationship (QSAR/QSPR) modeling is facilitated by an increasing number of public databases containing large amounts of useful data. However, the performance of QSAR models highly depends on the quality of the data and modeling methodology used. This study aims to develop robust QSAR/QSPR models for chemical properties of environmental interest that can be used for regulatory purposes. This study primarily uses data from the publicly available PHYSPROP database consisting of a set of 13 common physicochemical and environmental fate properties. These datasets have undergone extensive curation using an automated workflow to select only high-quality data, and the chemical structures were standardized prior to calculation of the molecular descriptors. The modeling procedure was developed based on the five Organization for Economic Cooperation and Development (OECD) principles for QSAR models. A weighted k-nearest neighbor approach was adopted using a minimum number of required descriptors calculated using PaDEL, an open-source software. The genetic algorithms selected only the most pertinent and mechanistically interpretable descriptors (2–15, with an average of 11 descriptors). The sizes of the modeled datasets varied from 150 chemicals for biodegradability half-life to 14,050 chemicals for logP, with an average of 3222 chemicals across all endpoints. The optimal models were built on randomly selected training sets (75%) and validated using fivefold cross-validation (CV) and test sets (25%). The CV Q² of the models varied from 0.72 to 0.95, with an average of 0.86 and an R² test value from 0.71 to 0.96, with an average of 0.82. Modeling and performance details are described in QSAR model reporting format and were validated by the European Commission’s Joint Research Center to be OECD compliant. All models are freely available as an open-source, command-line application called OPEn structure–activity/property Relationship App (OPERA). OPERA models were applied to more than 750,000 chemicals to produce freely available predicted data on the U.S. Environmental Protection Agency’s CompTox Chemistry Dashboard.

Collapse

Kaneko H. Discussion on Regression Methods Based on Ensemble Learning and Applicability Domains of Linear Submodels. J Chem Inf Model 2018;58:480-489. [PMID: 29425038 DOI: 10.1021/acs.jcim.7b00649] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]

Gramatica P, Papa E, Sangion A. QSAR modeling of cumulative environmental end-points for the prioritization of hazardous chemicals. ENVIRONMENTAL SCIENCE. PROCESSES & IMPACTS 2018;20:38-47. [PMID: 29226926 DOI: 10.1039/c7em00519a] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]

Grisoni F, Ballabio D, Todeschini R, Consonni V. Molecular Descriptors for Structure-Activity Applications: A Hands-On Approach. Methods Mol Biol 2018;1800:3-53. [PMID: 29934886 DOI: 10.1007/978-1-4939-7899-1_1] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]

Marcou G, Delouis G, Mokshyna O, Horvath D, Lachiche N, Varnek A. Transductive Ridge Regression in Structure-activity Modeling. Mol Inform 2017;37. [PMID: 29095574 DOI: 10.1002/minf.201700112] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2017] [Accepted: 10/08/2017] [Indexed: 11/06/2022]

Perspectives from the NanoSafety Modelling Cluster on the validation criteria for (Q)SAR models used in nanotechnology. Food Chem Toxicol 2017;112:478-494. [PMID: 28943385 DOI: 10.1016/j.fct.2017.09.037] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2016] [Revised: 08/31/2017] [Accepted: 09/19/2017] [Indexed: 11/20/2022]

Liang Y, Xiong R, Sandler SI, Di Toro DM. Quantum Chemically Estimated Abraham Solute Parameters Using Multiple Solvent-Water Partition Coefficients and Molecular Polarizability. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2017;51:9887-9898. [PMID: 28742336 DOI: 10.1021/acs.est.7b01737] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]

Verras A, Waller CL, Gedeck P, Green DVS, Kogej T, Raichurkar A, Panda M, Shelat AA, Clark J, Guy RK, Papadatos G, Burrows J. Shared Consensus Machine Learning Models for Predicting Blood Stage Malaria Inhibition. J Chem Inf Model 2017;57:445-453. [DOI: 10.1021/acs.jcim.6b00572] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

Horvath D, Marcou G, Varnek A. Generative Topographic Mapping Approach to Chemical Space Analysis. CHALLENGES AND ADVANCES IN COMPUTATIONAL CHEMISTRY AND PHYSICS 2017. [DOI: 10.1007/978-3-319-56850-8_6] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/05/2022]

Kleandrova VV, Luan F, Speck-Planche A, Cordeiro MNDS. QSAR-Based Studies of Nanomaterials in the Environment. PHARMACEUTICAL SCIENCES 2017. [DOI: 10.4018/978-1-5225-1762-7.ch051] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023] Open

Aniceto N, Freitas AA, Bender A, Ghafourian T. A novel applicability domain technique for mapping predictive reliability across the chemical space of a QSAR: reliability-density neighbourhood. J Cheminform 2016. [PMCID: PMC5395519 DOI: 10.1186/s13321-016-0182-y] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023] Open

Abstract

The ability to define the regions of chemical space where a predictive model can be safely used is a necessary condition to assure the reliability of new predictions. This implies that reliability must be determined across chemical space in the attempt to localize “safe” and “unsafe” regions for prediction. As a result we devised an applicability domain technique that addresses the data locally instead of handling it as a whole—the reliability-density neighbourhood (RDN). The main novelty aspect of this method is that it characterizes each single training instance according to the density of its neighbourhood in the training set, as well as its individual bias and precision. By scanning through the chemical space (by iteratively increasing the applicability domain area), it was observed that new test compounds are successively included into the applicability domain region in such a manner that strongly correlates to their predictive performance. This allows the mapping of local reliability across different locations in the training set space, and thus allows identifying regions where the model has low reliability. This method also showed matching profiles between two external sets, which is an indication that it performs robustly with new data. Another novel aspect in this technique is that it is paired with a specific feature selection algorithm. As a result, the impact of the feature set used was studied from which the top 20 features selected by ReliefF yielded the best results, as opposed to using the model’s features or the entire feature set as commonly done. As the third novel aspect, in this work we propose a new scoring function to help evaluate the quality of an applicability domain profile (i.e., the curve of accuracy vs the applicability domain measure in question). Overall, the RDN showed to be a promising method that can correctly sort new instances according to predictive performance. As a result, this technique can be received by an end-user as proof of concept for the performance of a QSAR model in new data, thus promoting the user’s trust on the QSAR output.Graphical abstract