1
|
Mitrofanov AA, Matveev PI, Yakubova KV, Korotcov A, Sattarov B, Tkachenko V, Kalmykov SN. Deep Learning Insights into Lanthanides Complexation Chemistry. Molecules 2021; 26:molecules26113237. [PMID: 34072262 PMCID: PMC8198800 DOI: 10.3390/molecules26113237] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2021] [Revised: 05/25/2021] [Accepted: 05/25/2021] [Indexed: 11/16/2022] Open
Abstract
Modern structure-property models are widely used in chemistry; however, in many cases, they are still a kind of a "black box" where there is no clear path from molecule structure to target property. Here we present an example of deep learning usage not only to build a model but also to determine key structural fragments of ligands influencing metal complexation. We have a series of chemically similar lanthanide ions, and we have collected data on complexes' stability, built models, predicting stability constants and decoded the models to obtain key fragments responsible for complexation efficiency. The results are in good correlation with the experimental ones, as well as modern theories of complexation. It was shown that the main influence on the constants had a mutual location of the binding centers.
Collapse
Affiliation(s)
- Artem A. Mitrofanov
- Department of Chemistry, Lomonosov Moscow State University, Leninskie Gory, 1 bld.3, 119991 Moscow, Russia; (P.I.M.); (S.N.K.)
- Science Data Software, 14909 Forest Landing Cir, Rockville, MD 20850, USA; (K.V.Y.); (A.K.); (B.S.); (V.T.)
- Correspondence:
| | - Petr I. Matveev
- Department of Chemistry, Lomonosov Moscow State University, Leninskie Gory, 1 bld.3, 119991 Moscow, Russia; (P.I.M.); (S.N.K.)
| | - Kristina V. Yakubova
- Science Data Software, 14909 Forest Landing Cir, Rockville, MD 20850, USA; (K.V.Y.); (A.K.); (B.S.); (V.T.)
| | - Alexandru Korotcov
- Science Data Software, 14909 Forest Landing Cir, Rockville, MD 20850, USA; (K.V.Y.); (A.K.); (B.S.); (V.T.)
| | - Boris Sattarov
- Science Data Software, 14909 Forest Landing Cir, Rockville, MD 20850, USA; (K.V.Y.); (A.K.); (B.S.); (V.T.)
| | - Valery Tkachenko
- Science Data Software, 14909 Forest Landing Cir, Rockville, MD 20850, USA; (K.V.Y.); (A.K.); (B.S.); (V.T.)
| | - Stepan N. Kalmykov
- Department of Chemistry, Lomonosov Moscow State University, Leninskie Gory, 1 bld.3, 119991 Moscow, Russia; (P.I.M.); (S.N.K.)
| |
Collapse
|
2
|
Solov'ev V, Tsivadze A, Marcou G, Varnek A. Classification of Metal Binders by Naïve Bayes Classifier on the Base of Molecular Fragment Descriptors and Ensemble Modeling. Mol Inform 2019; 38:e1900002. [DOI: 10.1002/minf.201900002] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2019] [Accepted: 03/15/2019] [Indexed: 12/31/2022]
Affiliation(s)
- Vitaly Solov'ev
- A.N. Frumkin Institute of Physical Chemistry and ElectrochemistryRussian Academy of Sciences, Leninskiy prosp., 31 119071 Moscow Russia
| | - Aslan Tsivadze
- A.N. Frumkin Institute of Physical Chemistry and ElectrochemistryRussian Academy of Sciences, Leninskiy prosp., 31 119071 Moscow Russia
| | - Gilles Marcou
- Laboratoire de Chémoinformatique, UMR 7140 CNRSUniversité de Strasbourg 1, rue Blaise Pascal 67000 Strasbourg France
| | - Alexandre Varnek
- Laboratoire de Chémoinformatique, UMR 7140 CNRSUniversité de Strasbourg 1, rue Blaise Pascal 67000 Strasbourg France
| |
Collapse
|
3
|
Predictive cartography of metal binders using generative topographic mapping. J Comput Aided Mol Des 2017; 31:701-714. [DOI: 10.1007/s10822-017-0033-6] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2017] [Accepted: 06/11/2017] [Indexed: 12/27/2022]
|
4
|
Mohamed D. Evaluation of unsymmetrical dithiodiglycolamide as novel extractant for application in selective separation of palladium(II) from aqueous solutions. RUSS J APPL CHEM+ 2016. [DOI: 10.1134/s1070427216080176] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
5
|
Alves VM, Muratov E, Fourches D, Strickland J, Kleinstreuer N, Andrade CH, Tropsha A. Predicting chemically-induced skin reactions. Part I: QSAR models of skin sensitization and their application to identify potentially hazardous compounds. Toxicol Appl Pharmacol 2015; 284:262-72. [PMID: 25560674 PMCID: PMC4546933 DOI: 10.1016/j.taap.2014.12.014] [Citation(s) in RCA: 51] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2014] [Revised: 12/14/2014] [Accepted: 12/21/2014] [Indexed: 12/20/2022]
Abstract
Repetitive exposure to a chemical agent can induce an immune reaction in inherently susceptible individuals that leads to skin sensitization. Although many chemicals have been reported as skin sensitizers, there have been very few rigorously validated QSAR models with defined applicability domains (AD) that were developed using a large group of chemically diverse compounds. In this study, we have aimed to compile, curate, and integrate the largest publicly available dataset related to chemically-induced skin sensitization, use this data to generate rigorously validated and QSAR models for skin sensitization, and employ these models as a virtual screening tool for identifying putative sensitizers among environmental chemicals. We followed best practices for model building and validation implemented with our predictive QSAR workflow using Random Forest modeling technique in combination with SiRMS and Dragon descriptors. The Correct Classification Rate (CCR) for QSAR models discriminating sensitizers from non-sensitizers was 71-88% when evaluated on several external validation sets, within a broad AD, with positive (for sensitizers) and negative (for non-sensitizers) predicted rates of 85% and 79% respectively. When compared to the skin sensitization module included in the OECD QSAR Toolbox as well as to the skin sensitization model in publicly available VEGA software, our models showed a significantly higher prediction accuracy for the same sets of external compounds as evaluated by Positive Predicted Rate, Negative Predicted Rate, and CCR. These models were applied to identify putative chemical hazards in the Scorecard database of possible skin or sense organ toxicants as primary candidates for experimental validation.
Collapse
Affiliation(s)
- Vinicius M Alves
- Laboratory of Molecular Modeling and Design, Faculty of Pharmacy, Federal University of Goiás, Goiânia, GO 74605-220, Brazil; Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, NC 27599, USA
| | - Eugene Muratov
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, NC 27599, USA; Laboratory of Theoretical Chemistry, A.V. Bogatsky Physical-Chemical Institute NAS of Ukraine, Odessa 65080, Ukraine
| | - Denis Fourches
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, NC 27599, USA
| | - Judy Strickland
- ILS/Contractor Supporting the NTP Interagency Center for the Evaluation of Alternative Toxicological Methods (NICEATM), P.O. Box 13501, Research Triangle Park, NC 27709, USA
| | - Nicole Kleinstreuer
- ILS/Contractor Supporting the NTP Interagency Center for the Evaluation of Alternative Toxicological Methods (NICEATM), P.O. Box 13501, Research Triangle Park, NC 27709, USA
| | - Carolina H Andrade
- Laboratory of Molecular Modeling and Design, Faculty of Pharmacy, Federal University of Goiás, Goiânia, GO 74605-220, Brazil
| | - Alexander Tropsha
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, NC 27599, USA.
| |
Collapse
|
6
|
Alves VM, Muratov E, Fourches D, Strickland J, Kleinstreuer N, Andrade CH, Tropsha A. Predicting chemically-induced skin reactions. Part II: QSAR models of skin permeability and the relationships between skin permeability and skin sensitization. Toxicol Appl Pharmacol 2015; 284:273-80. [PMID: 25560673 PMCID: PMC4408226 DOI: 10.1016/j.taap.2014.12.013] [Citation(s) in RCA: 40] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2014] [Revised: 12/14/2014] [Accepted: 12/21/2014] [Indexed: 12/02/2022]
Abstract
Skin permeability is widely considered to be mechanistically implicated in chemically-induced skin sensitization. Although many chemicals have been identified as skin sensitizers, there have been very few reports analyzing the relationships between molecular structure and skin permeability of sensitizers and non-sensitizers. The goals of this study were to: (i) compile, curate, and integrate the largest publicly available dataset of chemicals studied for their skin permeability; (ii) develop and rigorously validate QSAR models to predict skin permeability; and (iii) explore the complex relationships between skin sensitization and skin permeability. Based on the largest publicly available dataset compiled in this study, we found no overall correlation between skin permeability and skin sensitization. In addition, cross-species correlation coefficient between human and rodent permeability data was found to be as low as R2=0.44. Human skin permeability models based on the random forest method have been developed and validated using OECD-compliant QSAR modeling workflow. Their external accuracy was high (Q2ext = 0.73 for 63% of external compounds inside the applicability domain). The extended analysis using both experimentally-measured and QSAR-imputed data still confirmed the absence of any overall concordance between skin permeability and skin sensitization. This observation suggests that chemical modifications that affect skin permeability should not be presumed a priori to modulate the sensitization potential of chemicals. The models reported herein as well as those developed in the companion paper on skin sensitization suggest that it may be possible to rationally design compounds with the desired high skin permeability but low sensitization potential.
Collapse
Affiliation(s)
- Vinicius M Alves
- Laboratory of Molecular Modeling and Design, Faculty of Pharmacy, Federal University of Goiás, Goiânia, GO 74605-220, Brazil; Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, NC 27599, USA
| | - Eugene Muratov
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, NC 27599, USA; Laboratory of Theoretical Chemistry, A.V. Bogatsky Physical-Chemical Institute NAS of Ukraine, Odessa 65080, Ukraine
| | - Denis Fourches
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, NC 27599, USA
| | - Judy Strickland
- ILS/Contractor supporting the NTP Interagency Center for the Evaluation of Alternative Toxicological Methods (NICEATM), P.O. Box 13501, Research Triangle Park, NC 27709, USA
| | - Nicole Kleinstreuer
- ILS/Contractor supporting the NTP Interagency Center for the Evaluation of Alternative Toxicological Methods (NICEATM), P.O. Box 13501, Research Triangle Park, NC 27709, USA
| | - Carolina H Andrade
- Laboratory of Molecular Modeling and Design, Faculty of Pharmacy, Federal University of Goiás, Goiânia, GO 74605-220, Brazil
| | - Alexander Tropsha
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, NC 27599, USA.
| |
Collapse
|
7
|
Sitnikov GV, Zhokhova NI, Ustynyuk YA, Varnek A, Baskin II. Continuous indicator fields: a novel universal type of molecular fields. J Comput Aided Mol Des 2014; 29:233-47. [DOI: 10.1007/s10822-014-9818-z] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2014] [Accepted: 11/24/2014] [Indexed: 11/25/2022]
|
8
|
Fourches D. Cheminformatics: At the Crossroad of Eras. CHALLENGES AND ADVANCES IN COMPUTATIONAL CHEMISTRY AND PHYSICS 2014. [DOI: 10.1007/978-94-017-9257-8_16] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
|
9
|
Salahinejad M, Zolfonoun E. 3D-QSARStudies of Polyazaheterocyclic Ligands Used in Lanthanide and Actinide Extraction Processes. SOLVENT EXTRACTION AND ION EXCHANGE 2013. [DOI: 10.1080/07366299.2013.810967] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
|
10
|
Chagnes A, Moncomble A, Cote G. In-Silico Calculations as a Helpful Tool for Designing New Extractants in Liquid-Liquid Extraction. SOLVENT EXTRACTION AND ION EXCHANGE 2013. [DOI: 10.1080/07366299.2013.775884] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Affiliation(s)
- Alexandre Chagnes
- a Chimie ParisTech, Laboratoire d'Electrochimie, Chimie aux Interfaces et Modélisation pour l'Energie (LECIME) , Paris , France
- b CNRS , Paris , France
| | | | - Gérard Cote
- a Chimie ParisTech, Laboratoire d'Electrochimie, Chimie aux Interfaces et Modélisation pour l'Energie (LECIME) , Paris , France
- b CNRS , Paris , France
| |
Collapse
|
11
|
Lan JH, Shi WQ, Yuan LY, Li J, Zhao YL, Chai ZF. Recent advances in computational modeling and simulations on the An(III)/Ln(III) separation process. Coord Chem Rev 2012. [DOI: 10.1016/j.ccr.2012.04.002] [Citation(s) in RCA: 91] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
|
12
|
Fourches D, Barnes JC, Day NC, Bradley P, Reed JZ, Tropsha A. Cheminformatics analysis of assertions mined from literature that describe drug-induced liver injury in different species. Chem Res Toxicol 2010; 23:171-83. [PMID: 20014752 DOI: 10.1021/tx900326k] [Citation(s) in RCA: 97] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Drug-induced liver injury is one of the main causes of drug attrition. The ability to predict the liver effects of drug candidates from their chemical structures is critical to help guide experimental drug discovery projects toward safer medicines. In this study, we have compiled a data set of 951 compounds reported to produce a wide range of effects in the liver in different species, comprising humans, rodents, and nonrodents. The liver effects for this data set were obtained as assertional metadata, generated from MEDLINE abstracts using a unique combination of lexical and linguistic methods and ontological rules. We have analyzed this data set using conventional cheminformatics approaches and addressed several questions pertaining to cross-species concordance of liver effects, chemical determinants of liver effects in humans, and the prediction of whether a given compound is likely to cause a liver effect in humans. We found that the concordance of liver effects was relatively low (ca. 39-44%) between different species, raising the possibility that species specificity could depend on specific features of chemical structure. Compounds were clustered by their chemical similarity, and similar compounds were examined for the expected similarity of their species-dependent liver effect profiles. In most cases, similar profiles were observed for members of the same cluster, but some compounds appeared as outliers. The outliers were the subject of focused assertion regeneration from MEDLINE as well as other data sources. In some cases, additional biological assertions were identified, which were in line with expectations based on compounds' chemical similarities. The assertions were further converted to binary annotations of underlying chemicals (i.e., liver effect vs no liver effect), and binary quantitative structure-activity relationship (QSAR) models were generated to predict whether a compound would be expected to produce liver effects in humans. Despite the apparent heterogeneity of data, models have shown good predictive power assessed by external 5-fold cross-validation procedures. The external predictive power of binary QSAR models was further confirmed by their application to compounds that were retrieved or studied after the model was developed. To the best of our knowledge, this is the first study for chemical toxicity prediction that applied QSAR modeling and other cheminformatics techniques to observational data generated by the means of automated text mining with limited manual curation, opening up new opportunities for generating and modeling chemical toxicology data.
Collapse
Affiliation(s)
- Denis Fourches
- Laboratory for Molecular Modeling, Eshelman School of Pharmacy, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, USA
| | | | | | | | | | | |
Collapse
|
13
|
Kolarik Z. Complexation and Separation of Lanthanides(III) and Actinides(III) by Heterocyclic N-Donors in Solutions. Chem Rev 2008; 108:4208-52. [DOI: 10.1021/cr078003i] [Citation(s) in RCA: 372] [Impact Index Per Article: 23.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Zdenek Kolarik
- Consultant, Kolberger Strasse 9, D-76139 Karlsruhe, Germany
| |
Collapse
|
14
|
Varnek A, Fourches D, Solov'ev V, Klimchuk O, Ouadi A, Billard I. Successful “In Silico” Design of New Efficient Uranyl Binders. SOLVENT EXTRACTION AND ION EXCHANGE 2007. [DOI: 10.1080/07366290701415820] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
|