1
|
Fioressi SE, Bacelo DE, Aranda JF, Duchowicz PR. Prediction of the aqueous solubility of diverse compounds by 2D-QSPR. J Mol Liq 2020. [DOI: 10.1016/j.molliq.2020.112572] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
2
|
Saavedra LM, Romanelli GP, Duchowicz PR. A non-conformational QSAR study for plant-derived larvicides against Zika Aedes aegypti L. vector. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2020; 27:6205-6214. [PMID: 31865579 DOI: 10.1007/s11356-019-06630-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/23/2019] [Accepted: 09/25/2019] [Indexed: 06/10/2023]
Abstract
A set of 263 plant-derived compounds with larvicidal activity against Aedes aegypti L. (Diptera: Culicidae) vector is collected from the literature, and is studied by means of a non-conformational quantitative structure-activity relationships (QSAR) approach. The balanced subsets method (BSM) is employed to split the complete dataset into training, validation and test sets. From 26,775 freely available molecular descriptors, the most relevant structural features of compounds affecting the bioactivity are taken. The molecular descriptors are calculated through four different freewares, such as PaDEL, Mold2, EPI Suite and QuBiLs-MAS. The replacement method (RM) variable subset selection technique leads to the best linear regression models. A successful QSAR equation involves 7-conformation-independent molecular descriptors, fulfiling the evaluated internal (loo, l30%o, VIF and Y-randomization) and external (test set with Ntest = 65 compounds) validation criteria. The practical application of this QSAR model reveals promising predicted values for some natural compounds with unknown experimental larvicidal activity. Therefore, the present model constitutes the first one based on a large molecular set, being a useful computational tool for identifying and guiding the synthesis of new active molecules inspired by natural products.
Collapse
Affiliation(s)
- Laura M Saavedra
- Instituto de Investigaciones Fisicoquímicas Teóricas y Aplicadas (INIFTA), CONICET, UNLP, Diag. 113 y 64, C.C. 16, Sucursal 4, 1900, La Plata, Argentina.
| | - Gustavo P Romanelli
- Departamento de Química, Facultad de Ciencias Exactas, CONICET, UNLP, Centro de Investigación y Desarrollo en Ciencias Aplicadas "Dr. J.J. Ronco" (CINDECA), Calle 47 No. 257, B1900AJK, La Plata, Argentina
- Cátedra de Química Orgánica, Centro de Investigación en Sanidad Vegetal (CISaV), Facultad de Ciencias Agrarias y Forestales, Universidad Nacional de La Plata, Calles 60 y 119 s/n, B1904AAN, La Plata, Argentina
| | - Pablo R Duchowicz
- Instituto de Investigaciones Fisicoquímicas Teóricas y Aplicadas (INIFTA), CONICET, UNLP, Diag. 113 y 64, C.C. 16, Sucursal 4, 1900, La Plata, Argentina.
| |
Collapse
|
3
|
QSAR study of human epidermal growth factor receptor (EGFR) inhibitors: conformation-independent models. Med Chem Res 2019. [DOI: 10.1007/s00044-019-02437-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
|
4
|
Przybyłek M, Studziński W, Gackowska A, Gaca J. The use of fast molecular descriptors and artificial neural networks approach in organochlorine compounds electron ionization mass spectra classification. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2019; 26:28188-28201. [PMID: 31363975 PMCID: PMC6791912 DOI: 10.1007/s11356-019-05968-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/24/2018] [Accepted: 07/12/2019] [Indexed: 06/10/2023]
Abstract
Developing of theoretical tools can be very helpful for supporting new pollutant detection. Nowadays, a combination of mass spectrometry and chromatographic techniques are the most basic environmental monitoring methods. In this paper, two organochlorine compound mass spectra classification systems were proposed. The classification models were developed within the framework of artificial neural networks (ANNs) and fast 1D and 2D molecular descriptor calculations. Based on the intensities of two characteristic MS peaks, namely, [M] and [M-35], two classification criterions were proposed. According to criterion I, class 1 comprises [M] signals with the intensity higher than 800 NIST units, while class 2 consists of signals with the intensity lower or equal than 800. According to criterion II, class 1 consists of [M-35] signals with the intensity higher than 100, while signals with the intensity lower or equal than 100 belong to class 2. As a result of ANNs learning stage, five models for both classification criterions were generated. The external model validation showed that all ANNs are characterized by high predicting power; however, criterion I-based ANNs are much more accurate and therefore are more suitable for analytical purposes. In order to obtain another confirmation, selected ANNs were tested against additional dataset comprising popular sunscreen agents disinfection by-products reported in previous works.
Collapse
Affiliation(s)
- Maciej Przybyłek
- Chair and Department of Physical Chemistry, Pharmacy Faculty, Collegium Medicum of Bydgoszcz, Nicolaus Copernicus University in Toruń, Kurpińskiego 5, 85-950, Bydgoszcz, Poland.
| | - Waldemar Studziński
- Faculty of Chemical Technology and Engineering, University of Technology and Life Science, Seminaryjna 3, 85-326, Bydgoszcz, Poland
| | - Alicja Gackowska
- Faculty of Chemical Technology and Engineering, University of Technology and Life Science, Seminaryjna 3, 85-326, Bydgoszcz, Poland
| | - Jerzy Gaca
- Faculty of Chemical Technology and Engineering, University of Technology and Life Science, Seminaryjna 3, 85-326, Bydgoszcz, Poland
| |
Collapse
|
5
|
Fioressi SE, Bacelo DE, Rojas C, Aranda JF, Duchowicz PR. Conformation-independent quantitative structure-property relationships study on water solubility of pesticides. ECOTOXICOLOGY AND ENVIRONMENTAL SAFETY 2019; 171:47-53. [PMID: 30594756 DOI: 10.1016/j.ecoenv.2018.12.056] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/07/2018] [Revised: 12/14/2018] [Accepted: 12/19/2018] [Indexed: 06/09/2023]
Abstract
Water solubility is a key physicochemical parameter in pesticide control and regulation, although sometimes its experimental determination is not an easy task. In this study, we present Quantitative Structure-Property Relationships (QSPRs) for predicting the water solubility at 20 °C of 1211 approved heterogeneous pesticide compounds, collected from the online Pesticides Properties Data Base (PPDB). Validated and generally applicable Multivariable Linear Regression (MLR) models were established, including molecular descriptors carrying constitutional and topological aspects of the analyzed compounds. The most representative descriptors were selected from the exploration of a large number of about 18,000 structural variables. A hybrid approach that involves a molecular descriptor, a fingerprint, and a flexible descriptor showed the best predictive performance.
Collapse
Affiliation(s)
- Silvina E Fioressi
- Departamento de Química, Facultad de Ciencias Exactas y Naturales, Universidad de Belgrano, Villanueva 1324, CP 1426, Buenos Aires, Argentina.
| | - Daniel E Bacelo
- Departamento de Química, Facultad de Ciencias Exactas y Naturales, Universidad de Belgrano, Villanueva 1324, CP 1426, Buenos Aires, Argentina
| | - Cristian Rojas
- Facultad de Ciencia y Tecnología, Universidad del Azuay, Av. 24 de Mayo 7-77 y Hernán Malo, Cuenca, Ecuador
| | - José F Aranda
- Instituto de Investigaciones Fisicoquímicas Teóricas y Aplicadas (INIFTA), CONICET, UNLP, Diag. 113 y 64, C.C. 16, Sucursal 4, 1900 La Plata, Argentina
| | - Pablo R Duchowicz
- Instituto de Investigaciones Fisicoquímicas Teóricas y Aplicadas (INIFTA), CONICET, UNLP, Diag. 113 y 64, C.C. 16, Sucursal 4, 1900 La Plata, Argentina.
| |
Collapse
|
6
|
Saavedra LM, Romanelli GP, Duchowicz PR. Quantitative structure-activity relationship (QSAR) analysis of plant-derived compounds with larvicidal activity against Zika Aedes aegypti (Diptera: Culicidae) vector using freely available descriptors. PEST MANAGEMENT SCIENCE 2018; 74:1608-1615. [PMID: 29314584 DOI: 10.1002/ps.4850] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/24/2017] [Revised: 11/24/2017] [Accepted: 12/29/2017] [Indexed: 06/07/2023]
Abstract
BACKGROUND We have developed a quantitative structure-activity relationship (QSAR) model for predicting the larvicidal activity of 60 plant-derived molecules against Aedes aegypti L. (Diptera: Culicidae), a vector of several diseases such as dengue, yellow fever, chikungunya and Zika. The balanced subsets method (BSM) based on k-means cluster analysis (k-MCA) was employed to split the data set. The replacement method (RM) variable subset selection technique coupled with multivariable linear regression (MLR) proved to be successful for exploring 18 326 molecular descriptors and fingerprints calculated with PaDEL, Mold2 and EPI Suite open-source softwares. RESULTS A robust QSAR model (Rtrain2=0.84, Strain = 0.20 and Rtest2=0.92, Stest = 0.23) involving five non-conformational descriptors was established. The model was validated and tested through the use of an external test set of compounds, the leave-one-out (LOO) and leave-more-out (LMO) cross-validation methods, Y-randomization and applicability domain (AD) analysis. CONCLUSION The QSAR model surpasses previously published models based on geometrical descriptors, thereby representing a suitable tool for predicting larvicidal activity against the vector A. aegypti using a conformation-independent approach. © 2018 Society of Chemical Industry.
Collapse
Affiliation(s)
- Laura M Saavedra
- Instituto de Investigaciones Fisicoquímicas Teóricas y Aplicadas (INIFTA), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Universidad Nacional de La Plata (UNLP), La Plata, Argentina
| | - Gustavo P Romanelli
- Centro de Investigación y Desarrollo en Ciencias Aplicadas "Dr. J.J. Ronco" (CINDECA), Departamento de Química, Facultad de Ciencias Exactas, CONICET, UNLP, La Plata, Argentina
- Cátedra de Química Orgánica, Centro de Investigación en Sanidad Vegetal (CISaV), Facultad de Ciencias Agrarias y Forestales, Universidad Nacional de La Plata, La Plata, Argentina
| | - Pablo R Duchowicz
- Instituto de Investigaciones Fisicoquímicas Teóricas y Aplicadas (INIFTA), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Universidad Nacional de La Plata (UNLP), La Plata, Argentina
| |
Collapse
|
7
|
Duchowicz PR. Linear Regression QSAR Models for Polo-Like Kinase-1 Inhibitors. Cells 2018; 7:cells7020013. [PMID: 29443884 PMCID: PMC5850101 DOI: 10.3390/cells7020013] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2017] [Revised: 01/23/2018] [Accepted: 02/02/2018] [Indexed: 01/22/2023] Open
Abstract
A structurally diverse dataset of 530 polo-like kinase-1 (PLK1) inhibitors is compiled from the ChEMBL database and studied by means of a conformation-independent quantitative structure-activity relationship (QSAR) approach. A large number (26,761) of molecular descriptors are explored with the main intention of capturing the most relevant structural characteristics affecting the bioactivity. The structural descriptors are derived with different freeware, such as PaDEL, Mold², and QuBiLs-MAS; such descriptor software complements each other and improves the QSAR results. The best multivariable linear regression models are found with the replacement method variable subset selection technique. The balanced subsets method partitions the dataset into training, validation, and test sets. It is found that the proposed linear QSAR model improves previously reported models by leading to a simpler alternative structure-activity relationship.
Collapse
Affiliation(s)
- Pablo R Duchowicz
- Instituto de Investigaciones Fisicoquímicas Teóricas y Aplicadas (INIFTA), CONICET, UNLP, Diag. 113 y 64, C.C. 16, Sucursal 4, La Plata 1900, Argentina.
| |
Collapse
|