1
|
Stankovic B, Marinkovic F. A novel procedure for selection of molecular descriptors: QSAR model for mutagenicity of nitroaromatic compounds. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2024; 31:54603-54617. [PMID: 39207617 DOI: 10.1007/s11356-024-34800-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/19/2024] [Accepted: 08/22/2024] [Indexed: 09/04/2024]
Abstract
Nitroaromatic compounds (NACs) stand out as pervasive organic pollutants, prompting an imperative need to investigate their hazardous effects. Computational chemistry methods play a crucial role in this exploration, offering a safer and more time-efficient approach, mandated by various legislations. In this study, our focus lay on the development of transparent, interpretable, reproducible, and publicly available methodologies aimed at deriving quantitative structure-activity relationship models and testing them by modelling the mutagenicity of NACs against the Salmonella typhimurium TA100 strain. Descriptors were selected from Mordred and RDKit molecular descriptors, along with several quantum chemistry descriptors. For that purpose, the genetic algorithm (GA), as the most widely used method in the literature, and three alternative algorithms (Boruta, Featurewiz, and ForwardSelector) combined with the forward stepwise selection technique were used. The construction of models utilized the multiple linear regression method, with subsequent scrutiny of fitting and predictive performance, reliability, and robustness through various statistical validation criteria. The models were ranked by the Multi-Criteria Decision Making procedure. Findings have revealed that the proposed methodology for descriptor selection outperforms GA, with Featurewiz showing a slight advantage over Boruta and ForwardSelector. These constructed models can serve as valuable tools for the quick and reliable prediction of NACs mutagenicity.
Collapse
Affiliation(s)
- Branislav Stankovic
- Department for Nuclear and Plasma Physics, Vinča Institute of Nuclear Sciences -National Institute of the Republic of Serbia, University of Belgrade, Belgrade, Serbia.
| | | |
Collapse
|
2
|
Cao Y, Kamel M, Mohammadifard K, Heshmati J. M A, Poor Heravi MR, Ghaffar Ebadi A. Probing and comparison of graphene, boron nitride and boron carbide nanosheets for Flutamide adsorption: A DFT computational study. J Mol Liq 2021. [DOI: 10.1016/j.molliq.2021.117487] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023]
|
3
|
Maleki PA, Nemati‐Kande E, Saray AA. Using Quantum Density Functional Theory Methods to Study the Adsorption of Fluorouracil Drug on Pristine and Al, Ga, P and As Doped Boron Nitride Nanosheets. ChemistrySelect 2021. [DOI: 10.1002/slct.202101333] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Affiliation(s)
| | - Ebrahim Nemati‐Kande
- Department of Physical Chemistry Faculty of Chemistry Urmia University Urmia Iran
| | - Akbar Abdi Saray
- Department of Physics Basic Science Faculty Urmia University Urmia Iran
| |
Collapse
|
4
|
Chemometrics for Selection, Prediction, and Classification of Sustainable Solutions for Green Chemistry—A Review. Symmetry (Basel) 2020. [DOI: 10.3390/sym12122055] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
In this review, we present the applications of chemometric techniques for green and sustainable chemistry. The techniques, such as cluster analysis, principal component analysis, artificial neural networks, and multivariate ranking techniques, are applied for dealing with missing data, grouping or classification purposes, selection of green material, or processes. The areas of application are mainly finding sustainable solutions in terms of solvents, reagents, processes, or conditions of processes. Another important area is filling the data gaps in datasets to more fully characterize sustainable options. It is significant as many experiments are avoided, and the results are obtained with good approximation. Multivariate statistics are tools that support the application of quantitative structure–property relationships, a widely applied technique in green chemistry.
Collapse
|
5
|
QM Calculations in ADMET Prediction. Methods Mol Biol 2020; 2114:285-305. [PMID: 32016900 DOI: 10.1007/978-1-0716-0282-9_18] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/24/2023]
Abstract
In recent years, there has been an increase in the application of quantum mechanics (QM) methods to describe properties related to the ADMET profile of small molecules. The application of these methods allows calculating useful descriptors and physiochemical properties contributing to ADMET prediction. Considering that QM methods are the only one that describe the electronic state of a molecules, such methods are particularly useful for studying the metabolism of drugs; furthermore, the introduction of mixed QM and molecular mechanics (QM/MM) is also increasing the understanding of drug interaction with cytochromes from a mechanistic point of view. Finally, combining the increase number of experimental data with machine learning algorithms and QM-derived descriptors allowed the creation of an end-user software capable of affecting the drug discovery process.
Collapse
|
6
|
Keshavarz MH, Akbarzadeh AR. A simple approach for assessment of toxicity of nitroaromatic compounds without using complex descriptors and computer codes. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2019; 30:347-361. [PMID: 31020866 DOI: 10.1080/1062936x.2019.1595135] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/02/2019] [Accepted: 03/11/2019] [Indexed: 06/09/2023]
Abstract
A simple approach is introduced to assess the toxicity of nitroaromatic compounds in terms of an oral LD50 dose (50% lethal dose) for rats. Most of the presented Quantitative Structure-Activity Relationship (QSAR) models for prediction of in vivo toxicity of nitroaromatics are calculated by quantum computing descriptors which are more difficult to interpret and apply, while the new model requires only the molecular structure of a desirable nitroaromatic compound. The novel model is based on the constitutional descriptors, such as the number of oxygen, sulphur, phosphorous and molecular fragments. Experimental data of 90 nitroaromatics are used to derive and test the new model as the logarithm of LD50 values, i.e. -log (LD50). Although it is based on only simple structural parameters, the reliability of the new model is also higher than the complex QSAR model because the values of the root-mean-square deviation (RMSD) of -log (LD50) for the new and the outputs of the latest QSAR method are 0.342 and 0.377, respectively.
Collapse
Affiliation(s)
- M H Keshavarz
- a Department of Chemistry , Malek-ashtar University of Technology , Shahin-shahr , Islamic Republic of Iran
| | - A R Akbarzadeh
- b Department of Chemistry , Iran University of Science and Technology , Tehran , Islamic Republic of Iran
| |
Collapse
|
7
|
Ahmadi MA, Ahmadi MH, Fahim Alavi M, Nazemzadegan MR, Ghasempour R, Shamshirband S. Determination of thermal conductivity ratio of CuO/ethylene glycol nanofluid by connectionist approach. J Taiwan Inst Chem Eng 2018. [DOI: 10.1016/j.jtice.2018.06.003] [Citation(s) in RCA: 39] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023]
|
8
|
Ahmadi MA, Zendehboudi S, James LA. Hybrid connectionist model determines CO 2-oil swelling factor. PETROLEUM SCIENCE 2018; 15:591-604. [PMID: 30956651 PMCID: PMC6417373 DOI: 10.1007/s12182-018-0230-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/17/2017] [Indexed: 06/09/2023]
Abstract
In-depth understanding of interactions between crude oil and CO2 provides insight into the CO2-based enhanced oil recovery (EOR) process design and simulation. When CO2 contacts crude oil, the dissolution process takes place. This phenomenon results in the oil swelling, which depends on the temperature, pressure, and composition of the oil. The residual oil saturation in a CO2-based EOR process is inversely proportional to the oil swelling factor. Hence, it is important to estimate this influential parameter with high precision. The current study suggests the predictive model based on the least-squares support vector machine (LS-SVM) to calculate the CO2-oil swelling factor. A genetic algorithm is used to optimize hyperparameters (γ and σ 2) of the LS-SVM model. This model showed a high coefficient of determination (R 2 = 0.9953) and a low value for the mean-squared error (MSE = 0.0003) based on the available experimental data while estimating the CO2-oil swelling factor. It was found that LS-SVM is a straightforward and accurate method to determine the CO2-oil swelling factor with negligible uncertainty. This method can be incorporated in commercial reservoir simulators to include the effect of the CO2-oil swelling factor when adequate experimental data are not available.
Collapse
Affiliation(s)
- Mohammad Ali Ahmadi
- Faculty of Engineering and Applied Science, Memorial University of Newfoundland, St. John’s, NL A1C 5S7 Canada
| | - Sohrab Zendehboudi
- Faculty of Engineering and Applied Science, Memorial University of Newfoundland, St. John’s, NL A1C 5S7 Canada
| | - Lesley A. James
- Faculty of Engineering and Applied Science, Memorial University of Newfoundland, St. John’s, NL A1C 5S7 Canada
| |
Collapse
|
9
|
Onguéné PA, Simoben CV, Fotso GW, Andrae-Marobela K, Khalid SA, Ngadjui BT, Mbaze LM, Ntie-Kang F. In silico toxicity profiling of natural product compound libraries from African flora with anti-malarial and anti-HIV properties. Comput Biol Chem 2018; 72:136-149. [DOI: 10.1016/j.compbiolchem.2017.12.002] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2017] [Revised: 08/30/2017] [Accepted: 12/05/2017] [Indexed: 10/18/2022]
|
10
|
Wang B, Zhou L, Xu K, Wang Q. Prediction of Minimum Ignition Energy from Molecular Structure Using Quantitative Structure–Property Relationship (QSPR) Models. Ind Eng Chem Res 2016. [DOI: 10.1021/acs.iecr.6b04347] [Citation(s) in RCA: 45] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Beibei Wang
- School
of Resources and Civil Engineering, Northeastern University, Shenyang, Liaoning 110819, China
- Department of Fire Protection & Safety, Oklahoma State University, Stillwater, Oklahoma 74078, United States
| | - Lulu Zhou
- Department of Fire Protection & Safety, Oklahoma State University, Stillwater, Oklahoma 74078, United States
| | - Kaili Xu
- School
of Resources and Civil Engineering, Northeastern University, Shenyang, Liaoning 110819, China
| | - Qingsheng Wang
- Department of Fire Protection & Safety, Oklahoma State University, Stillwater, Oklahoma 74078, United States
- Department
of Chemical Engineering, Oklahoma State University, Stillwater, Oklahoma 74078, United States
| |
Collapse
|
11
|
Toropova AP, Schultz TW, Toropov AA. Building up a QSAR model for toxicity toward Tetrahymena pyriformis by the Monte Carlo method: A case of benzene derivatives. ENVIRONMENTAL TOXICOLOGY AND PHARMACOLOGY 2016; 42:135-145. [PMID: 26851376 DOI: 10.1016/j.etap.2016.01.010] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/23/2015] [Revised: 01/12/2016] [Accepted: 01/14/2016] [Indexed: 06/05/2023]
Abstract
Data on toxicity toward Tetrahymena pyriformis is indicator of applicability of a substance in ecologic and pharmaceutical aspects. Quantitative structure-activity relationships (QSARs) between the molecular structure of benzene derivatives and toxicity toward T. pyriformis (expressed as the negative logarithms of the population growth inhibition dose, mmol/L) are established. The available data were randomly distributed three times into the visible training and calibration sets, and invisible validation sets. The statistical characteristics for the validation set are the following: r(2)=0.8179 and s=0.338 (first distribution); r(2)=0.8682 and s=0.341 (second distribution); r(2)=0.8435 and s=0.323 (third distribution). These models are built up using only information on the molecular structure: no data on physicochemical parameters, 3D features of the molecular structure and quantum mechanics descriptors are involved in the modeling process.
Collapse
Affiliation(s)
- Alla P Toropova
- IRCCS-Istituto di Ricerche Farmacologiche Mario Negri, Via La Masa 19, Milano, Italy.
| | - Terry W Schultz
- College of Veterinary Medicine, The University of Tennessee, 2407 River Drive, Knoxville, TN 37996-4543, United States
| | - Andrey A Toropov
- IRCCS-Istituto di Ricerche Farmacologiche Mario Negri, Via La Masa 19, Milano, Italy
| |
Collapse
|
12
|
Salehi E, Abdi J, Aliei MH. Assessment of Cu(II) adsorption from water on modified membrane adsorbents using LS-SVM intelligent approach. JOURNAL OF SAUDI CHEMICAL SOCIETY 2016. [DOI: 10.1016/j.jscs.2014.02.007] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
|
13
|
Soltanpour S, Shahbazy M, Omidikia N, Kompany-Zareh M, Baharifard MT. A comprehensive QSPR model for dielectric constants of binary solvent mixtures. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2016; 27:165-181. [PMID: 26911475 DOI: 10.1080/1062936x.2015.1120779] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
The dielectric constant is a key physicochemical property in solubility, chemical equilibrium and the synthesis of compounds in pharmaceutical/chemical sciences. In this context, a quantitative structure-property relationship (QSPR) model was designed from 3207 binary solvent mixtures by using 23 calculated experimental-theoretical descriptors including solvent fractions (f1 and f2), individual dielectric constants of solvents (dc1 and dc2), temperature, and Abraham and Hansen solvation parameters. The QSPR model was developed using a genetic algorithm based multiple linear regression (GA-MLR) and robust regression. Jackknifing was implemented for internal-external validation of the selected descriptors by GA containing f1, f2, dc1 and dc2. Implementation of jackknifing on the selected descriptors revealed that p values were close to zero. Consequently, the significance of selected descriptors was confirmed through the sign change point of view and their validity was verified. The model was evaluated using the r2 and Q2(F3) parameters as criteria of model prediction ability. The r2 values were equal to 0.925 and 0.922, and Q2(F3) were reported as 0.873 and 0.862 for the cross-validation and prediction steps, respectively. Finally, model performance was clearly acceptable to anticipate the modelling of dielectric constants for a wide range of binary solvent mixtures.
Collapse
Affiliation(s)
- S Soltanpour
- a Faculty of Pharmacy , Zanjan University of Medical Sciences , Zanjan , Iran
| | - M Shahbazy
- b Department of Chemistry, Institute for Advanced Studies in Basic Sciences , Zanjan , Iran
| | - N Omidikia
- b Department of Chemistry, Institute for Advanced Studies in Basic Sciences , Zanjan , Iran
| | - M Kompany-Zareh
- b Department of Chemistry, Institute for Advanced Studies in Basic Sciences , Zanjan , Iran
| | - M Taghi Baharifard
- c Department of Chemistry , College of Science, Qom branch, Islamic Azad University , Qom , Iran
| |
Collapse
|
14
|
Marvuglia A, Kanevski M, Benetto E. Machine learning for toxicity characterization of organic chemical emissions using USEtox database: Learning the structure of the input space. ENVIRONMENT INTERNATIONAL 2015; 83:72-85. [PMID: 26101085 DOI: 10.1016/j.envint.2015.05.011] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/13/2015] [Revised: 05/25/2015] [Accepted: 05/28/2015] [Indexed: 06/04/2023]
Abstract
Toxicity characterization of chemical emissions in Life Cycle Assessment (LCA) is a complex task which usually proceeds via multimedia (fate, exposure and effect) models attached to models of dose-response relationships to assess the effects on target. Different models and approaches do exist, but all require a vast amount of data on the properties of the chemical compounds being assessed, which are hard to collect or hardly publicly available (especially for thousands of less common or newly developed chemicals), therefore hampering in practice the assessment in LCA. An example is USEtox, a consensual model for the characterization of human toxicity and freshwater ecotoxicity. This paper places itself in a line of research aiming at providing a methodology to reduce the number of input parameters necessary to run multimedia fate models, focusing in particular to the application of the USEtox toxicity model. By focusing on USEtox, in this paper two main goals are pursued: 1) performing an extensive exploratory analysis (using dimensionality reduction techniques) of the input space constituted by the substance-specific properties at the aim of detecting particular patterns in the data manifold and estimating the dimension of the subspace in which the data manifold actually lies; and 2) exploring the application of a set of linear models, based on partial least squares (PLS) regression, as well as a nonlinear model (general regression neural network--GRNN) in the seek for an automatic selection strategy of the most informative variables according to the modelled output (USEtox factor). After extensive analysis, the intrinsic dimension of the input manifold has been identified between three and four. The variables selected as most informative may vary according to the output modelled and the model used, but for the toxicity factors modelled in this paper the input variables selected as most informative are coherent with prior expectations based on scientific knowledge of toxicity factors modelling. Thus the outcomes of the analysis are promising for the future application of the approach to other portions of the model, affected by important data gaps, e.g., to the calculation of human health effect factors.
Collapse
Affiliation(s)
- Antonino Marvuglia
- Luxembourg Institute of Science and Technology (LIST), Environmental Research & Innovation Department, 41, rue du Brill, L-4422 Belvaux, Luxembourg.
| | - Mikhail Kanevski
- University of Lausanne (UNIL), Faculty of Geosciences and Environment, Geopolis Building CH-1015 Lausanne, Switzerland
| | - Enrico Benetto
- Luxembourg Institute of Science and Technology (LIST), Environmental Research & Innovation Department, 41, rue du Brill, L-4422 Belvaux, Luxembourg
| |
Collapse
|
15
|
Singh PK, Negi A, Gupta PK, Chauhan M, Kumar R. Toxicophore exploration as a screening technology for drug design and discovery: techniques, scope and limitations. Arch Toxicol 2015; 90:1785-802. [PMID: 26341667 DOI: 10.1007/s00204-015-1587-5] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2015] [Accepted: 08/13/2015] [Indexed: 01/11/2023]
Abstract
Toxicity is a common drawback of newly designed chemotherapeutic agents. With the exception of pharmacophore-induced toxicity (lack of selectivity at higher concentrations of a drug), the toxicity due to chemotherapeutic agents is based on the toxicophore moiety present in the drug. To date, methodologies implemented to determine toxicophores may be broadly classified into biological, bioanalytical and computational approaches. The biological approach involves analysis of bioactivated metabolites, whereas the computational approach involves a QSAR-based method, mapping techniques, an inverse docking technique and a few toxicophore identification/estimation tools. Being one of the major steps in drug discovery process, toxicophore identification has proven to be an essential screening step in drug design and development. The paper is first of its kind, attempting to cover and compare different methodologies employed in predicting and determining toxicophores with an emphasis on their scope and limitations. Such information may prove vital in the appropriate selection of methodology and can be used as screening technology by researchers to discover the toxicophoric potentials of their designed and synthesized moieties. Additionally, it can be utilized in the manipulation of molecules containing toxicophores in such a manner that their toxicities might be eliminated or removed.
Collapse
Affiliation(s)
- Pankaj Kumar Singh
- Laboratory for Drug Design and Synthesis, Centre for Pharmaceutical Sciences and Natural Products, School of Basic and Applied Sciences, Central University of Punjab, Bathinda, 151 001, India
| | - Arvind Negi
- Laboratory for Drug Design and Synthesis, Centre for Pharmaceutical Sciences and Natural Products, School of Basic and Applied Sciences, Central University of Punjab, Bathinda, 151 001, India
| | - Pawan Kumar Gupta
- Centre for Computational Sciences, School of Basic and Applied Sciences, Central University of Punjab, Bathinda, 151 001, India
| | - Monika Chauhan
- Laboratory for Drug Design and Synthesis, Centre for Pharmaceutical Sciences and Natural Products, School of Basic and Applied Sciences, Central University of Punjab, Bathinda, 151 001, India
| | - Raj Kumar
- Laboratory for Drug Design and Synthesis, Centre for Pharmaceutical Sciences and Natural Products, School of Basic and Applied Sciences, Central University of Punjab, Bathinda, 151 001, India.
| |
Collapse
|
16
|
Niazi A, Khorshidi N, Ghaemmaghami P. Microwave-assisted of dispersive liquid-liquid microextraction and spectrophotometric determination of uranium after optimization based on Box-Behnken design and chemometrics methods. SPECTROCHIMICA ACTA. PART A, MOLECULAR AND BIOMOLECULAR SPECTROSCOPY 2015; 135:69-75. [PMID: 25062051 DOI: 10.1016/j.saa.2014.06.148] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/07/2014] [Revised: 06/19/2014] [Accepted: 06/29/2014] [Indexed: 06/03/2023]
Abstract
In this study an analytical procedure based on microwave-assisted dispersive liquid-liquid microextraction (MA-DLLME) and spectrophotometric coupled with chemometrics methods is proposed to determine uranium. In the proposed method, 4-(2-pyridylazo) resorcinol (PAR) is used as a chelating agent, and chloroform and ethanol are selected as extraction and dispersive solvent. The optimization strategy is carried out by using two level full factorial designs. Results of the two level full factorial design (2(4)) based on an analysis of variance demonstrated that the pH, concentration of PAR, amount of dispersive and extraction solvents are statistically significant. Optimal condition for three variables: pH, concentration of PAR, amount of dispersive and extraction solvents are obtained by using Box-Behnken design. Under the optimum conditions, the calibration graphs are linear in the range of 20.0-350.0 ng mL(-1) with detection limit of 6.7 ng mL(-1) (3δB/slope) and the enrichment factor of this method for uranium reached at 135. The relative standard deviation (R.S.D.) is 1.64% (n=7, c=50 ng mL(-1)). The partial least squares (PLS) modeling was used for multivariate calibration of the spectrophotometric data. The orthogonal signal correction (OSC) was used for preprocessing of data matrices and the prediction results of model, with and without using OSC, were statistically compared. MA-DLLME-OSC-PLS method was presented for the first time in this study. The root mean squares error of prediction (RMSEP) for uranium determination using PLS and OSC-PLS models were 4.63 and 0.98, respectively. This procedure allows the determination of uranium synthesis and real samples such as waste water with good reliability of the determination.
Collapse
Affiliation(s)
- Ali Niazi
- Department of Chemistry, Faculty of Science, Arak Branch, Islamic Azad University, Arak, Iran.
| | - Neda Khorshidi
- Department of Chemistry, Faculty of Science, Arak Branch, Islamic Azad University, Arak, Iran
| | - Pegah Ghaemmaghami
- Department of Chemistry, Faculty of Science, Arak Branch, Islamic Azad University, Arak, Iran
| |
Collapse
|
17
|
Jain (Pancholi) N, Gupta S, Sapre N, Sapre NS. In silico de novo design of novel NNRTIs: a bio-molecular modelling approach. RSC Adv 2015. [DOI: 10.1039/c4ra15478a] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023] Open
Abstract
Six novel NNRTIs (DABO) with high efficacy are designed by assessing the interaction potential and structural requirements using chemometric analyses (SVM, BPNN and MLR) on structural descriptors.
Collapse
Affiliation(s)
| | - Swagata Gupta
- Department of Chemistry
- Govt. BLPPG College
- MHOW, India
| | - Neelima Sapre
- Department of Mathematics and Computational Sc
- SGSITS
- Indore, India
| | | |
Collapse
|
18
|
Support Vector Regression Based QSPR for the Prediction of Retention Time of Peptides in Reversed-Phase Liquid Chromatography. Chromatographia 2014. [DOI: 10.1007/s10337-014-2819-1] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
|
19
|
MEHDIKHANI ALI, LOTFIZADEH HAMIDREZA, ARMAN KAMYAR, NOORIZADEH HADI. AN IMPROVED QSPR STUDY OF REVERSE FACTOR OF NANOPARTICLES IN ROADSIDE ATMOSPHERE ON KERNEL PARTIAL LEAST SQUARES AND GENETIC ALGORITHM. JOURNAL OF THEORETICAL & COMPUTATIONAL CHEMISTRY 2013. [DOI: 10.1142/s0219633612501064] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Thermal desorption-comprehensive two-dimensional gas chromatography high-resolution time-of-flight mass spectrometry (TD–GC × GC–HRTOF-MS) is one of the most powerful tools in analytical nanoparticle compounds. Genetic algorithm and partial least square (GA-PLS) and kernel PLS (GA-KPLS) models were used to investigate the correlation between reverse factor (RF) and descriptors for 50 nanoparticles fraction with a diameter of 29–58 nm in roadside atmosphere which obtained by TD–GC×GC–HRTOF-MS. The correlation coefficient leave-group-out cross validation (LGO-CV (Q2)) of prediction for the GA-PLS and GA-KPLS models for training and test sets were (0.761 and 0.718) and (0.825 and 0.814), respectively, revealing the reliability of these models. This is the first research on the quantitative structure-property relationship (QSPR) of the nanoparticles in roadside atmosphere using the GA-PLS and GA-KPLS.
Collapse
Affiliation(s)
- ALI MEHDIKHANI
- General Inspection Organization, Ilam Office, Ilam City, Iran
| | | | - KAMYAR ARMAN
- Department of Water and Wastewater Engineering, School of Environment and Energy, Payame Noor University, Tehran, Iran
| | - HADI NOORIZADEH
- Department of Chemistry, Faculty of Science, Ilam Branch, Islamic Azad University, Ilam, Iran
| |
Collapse
|
20
|
Support vector regression based QSPR for the prediction of retention time of pesticide residues in gas chromatography–mass spectroscopy. Microchem J 2013. [DOI: 10.1016/j.microc.2012.05.003] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
|
21
|
Keshavarz MH, Gharagheizi F, Shokrolahi A, Zakinejad S. Accurate prediction of the toxicity of benzoic acid compounds in mice via oral without using any computer codes. JOURNAL OF HAZARDOUS MATERIALS 2012; 237-238:79-101. [PMID: 22959133 DOI: 10.1016/j.jhazmat.2012.07.048] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/22/2011] [Revised: 03/30/2012] [Accepted: 07/25/2012] [Indexed: 06/01/2023]
Abstract
Most of benzoic acid derivatives are toxic, which may cause serious public health and environmental problems. Two novel simple and reliable models are introduced for desk calculations of the toxicity of benzoic acid compounds in mice via oral LD(50) with more reliance on their answers as one could attach to the more complex outputs. They require only elemental composition and molecular fragments without using any computer codes. The first model is based on only the number of carbon and hydrogen atoms, which can be improved by several molecular fragments in the second model. For 57 benzoic compounds, where the computed results of quantitative structure-toxicity relationship (QSTR) were recently reported, the predicted results of two simple models of present method are more reliable than QSTR computations. The present simple method is also tested with further 324 benzoic acid compounds including complex molecular structures, which confirm good forecasting ability of the second model.
Collapse
Affiliation(s)
- Mohammad Hossein Keshavarz
- Department of Chemistry, Malek-ashtar University of Technology, Shahin-shahr P.O. Box 83145/115, Isfahan, Islamic Republic of Iran.
| | | | | | | |
Collapse
|
22
|
Golmohammadi H, Dashtbozorgi Z, Acree WE. Quantitative structure–activity relationship prediction of blood-to-brain partitioning behavior using support vector machine. Eur J Pharm Sci 2012; 47:421-9. [DOI: 10.1016/j.ejps.2012.06.021] [Citation(s) in RCA: 66] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2012] [Revised: 06/15/2012] [Accepted: 06/20/2012] [Indexed: 11/25/2022]
|
23
|
Khajeh A, Modarress H. Quantitative Structure–Property Relationship Prediction of Liquid Heat Capacity at 298.15 K for Organic Compounds. Ind Eng Chem Res 2012. [DOI: 10.1021/ie202153e] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Aboozar Khajeh
- Department of Chemical Engineering, Amirkabir University of Technology (Tehran Polytechnic), Hafez
Avenue, 15914 Tehran, Iran
| | - Hamid Modarress
- Department of Chemical Engineering, Amirkabir University of Technology (Tehran Polytechnic), Hafez
Avenue, 15914 Tehran, Iran
| |
Collapse
|
24
|
Abbasitabar F, Zare-Shahabadi V. Development predictive QSAR models for artemisinin analogues by various feature selection methods: a comparative study. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2012; 23:1-15. [PMID: 22040327 DOI: 10.1080/1062936x.2011.623316] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/27/2023]
Abstract
Quantitative structure-activity relationship (QSAR) models were derived for 179 analogues of artemisinin, a potent antimalarial agent. The activities of these compounds were investigated by means of multiple linear regression (MLR). To select relevant descriptors, several methods including stepwise selection, successive projection algorithm and an ant colony optimization algorithm (called memorized_ACS) were employed. A wide variety of molecular descriptors belonging to various structural properties were calculated for each molecule. Two matrixes (D1 and D2) of molecular properties were built. The D1 matrix included the calculated descriptors and the D2 matrix contained the first to third orders of the calculated descriptors and the logarithm of absolute values of the calculated descriptors. For both data matrixes, significant QSAR models were obtained by the memorized_ACS algorithm. The reactive and PEOE (partial equalization of orbital electronegativity) descriptors represented the highest impact on the antimalarial activity. The PEOE descriptors belong to partial charge descriptors and the reactive descriptor is an indicator of the presence of the reactive groups in the molecule. The best MLR model has a training error of 0.71 log RA units (r (2 )= 0.81) and a prediction error of 0.48 log RA units (r (2) = 0.88).
Collapse
Affiliation(s)
- F Abbasitabar
- Department of Chemistry, Marvdasht Branch, Islamic Azad University, Marvdasht, Iran.
| | | |
Collapse
|
25
|
Xie K, Qiao S, Fu C, Qi JS. Estimation of the physicochemical properties of PCDD/Fs using three-dimensional holographic vector of atomic interaction field. JOURNAL OF ENVIRONMENTAL SCIENCE AND HEALTH. PART A, TOXIC/HAZARDOUS SUBSTANCES & ENVIRONMENTAL ENGINEERING 2012; 47:704-710. [PMID: 22416864 DOI: 10.1080/10934529.2012.660062] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
Polychlorinated dibenzo-p-dioxins and dibenzofurans (PCDD/Fs) are a group of important persistent organic pollutants. In the present study, the three-dimensional holographic vector of atomic interaction field (3D-HoVAIF) method is used to describe the chemical structures of PCDD/Fs. After variable screening using a stepwise multiple regression (SMR) technique, the linear relationships among six physicochemical properties of PCDD/Fs and 3D-HoVAIF descriptors are built using a partial least-squares (PLS) regression model. The results show that the 3D-HoVAIF descriptors can be used to express the quantitative structure-property relationships of PCDD/Fs. The predictive capabilities of the models have also been confirmed by leave-one-out cross-validation. The optimum model has been used to estimate values for PCDD/Fs for which no experimental data on physicochemical properties are available. Supplemental materials are available for this article. Go to the publisher's online edition of Journal of Environmental Science and Health: Part A to view the free supplemental file.
Collapse
Affiliation(s)
- Kun Xie
- College of Chemistry and Environmental Engineering, Chongqing Three Gorges University, Wanzhou, Chongqing, China
| | | | | | | |
Collapse
|
26
|
|
27
|
Khajeh A, Modarress H. QSPR prediction of flash point of esters by means of GFA and ANFIS. JOURNAL OF HAZARDOUS MATERIALS 2010; 179:715-720. [PMID: 20381958 DOI: 10.1016/j.jhazmat.2010.03.060] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/18/2009] [Revised: 03/10/2010] [Accepted: 03/13/2010] [Indexed: 05/29/2023]
Abstract
A quantitative structure property relationship (QSPR) study was performed to develop a model for prediction of flash point of esters based on a diverse set of 95 components. The most five important descriptors were selected from a set of 1124 descriptors to build the QSPR model by means of a genetic function approximation (GFA). For considering the nonlinear behavior of these molecular descriptors, adaptive neuro-fuzzy inference system (ANFIS) method was used. The ANFIS and GFA squared correlation coefficient for testing set was 0.969 and 0.965, respectively. The results obtained showed the ability of developed GFA and ANFIS for prediction of flash point of esters.
Collapse
Affiliation(s)
- Aboozar Khajeh
- Islamic Azad University, Birjand Branch, Birjand, Southern Khorasan, Iran
| | | |
Collapse
|
28
|
Zhu X, Shan Y, Li G, Huang A, Zhang Z. Prediction of wood property in Chinese Fir based on visible/near-infrared spectroscopy and least square-support vector machine. SPECTROCHIMICA ACTA. PART A, MOLECULAR AND BIOMOLECULAR SPECTROSCOPY 2009; 74:344-8. [PMID: 19576843 DOI: 10.1016/j.saa.2009.06.008] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/06/2009] [Revised: 05/12/2009] [Accepted: 06/07/2009] [Indexed: 05/13/2023]
Abstract
A method for the quantification of density of Chinese Fir samples based on visible/near-infrared (vis-NIR) spectrometry and least squares-support vector machine (LS-SVM) was proposed. Sample set partitioning based on joint x-y distances (SPXY) algorithm was used for dividing calibration and prediction samples, it is of value for prediction of property involving complex matrices. A stepwise procedure is employed to select samples according to their differences in both x (instrumental responses) and y (predicted parameter) spaces. For comparison, the models were also constructed by Kennard-Stone method, as well as by using the duplex and random sampling methods for subset partitioning. The results revealed that the SPXY algorithm may be an advantageous alternative to the other three strategies. To validate the reliability of LS-SVM, comparisons were made among other modeling methods such as support vector machine (SVM) and partial least squares (PLS) regression. Satisfactory models were built using LS-SVM, with lower prediction errors and superior performance in relation to SVM and PLS. These results showed possibility of building robust models to quantify the density of Chinese Fir using near-infrared spectroscopy and LS-SVM combined SPXY algorithm as a nonlinear multivariate calibration procedure.
Collapse
Affiliation(s)
- Xiangrong Zhu
- Hunan Agricultural Product Processing Institute, Changsha 410125, PR China
| | | | | | | | | |
Collapse
|
29
|
Pan Y, Jiang J, Wang R, Cao H, Cui Y. A novel QSPR model for prediction of lower flammability limits of organic compounds based on support vector machine. JOURNAL OF HAZARDOUS MATERIALS 2009; 168:962-969. [PMID: 19329246 DOI: 10.1016/j.jhazmat.2009.02.122] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/27/2008] [Revised: 02/21/2009] [Accepted: 02/23/2009] [Indexed: 05/27/2023]
Abstract
A quantitative structure-property relationship (QSPR) study is suggested for the prediction of lower flammability limits (LFLs) of organic compounds. Various kinds of molecular descriptors were calculated to represent the molecular structures of compounds, such as topological, charge, and geometric descriptors. Genetic algorithm was employed to select optimal subset of descriptors that have significant contribution to the overall LFL property. The novel chemometrics method of support vector machine was employed to model the possible quantitative relationship between these selected descriptors and LFL. The resulted model showed high prediction ability that the obtained root mean square error and average absolute error for the whole dataset were 0.069 and 0.051vol.%, respectively. The results were also compared with those of previously published models. The comparison results indicate the superiority of the presented model and reveal that it can be effectively used to predict the LFL of organic compounds from the molecular structures alone.
Collapse
Affiliation(s)
- Yong Pan
- Jiangsu Key Laboratory of Urban and Industrial Safety, Institute of Safety Engineering, Nanjing University of Technology, Nanjing 210009, China.
| | | | | | | | | |
Collapse
|
30
|
Massarelli I, Imbriani M, Coi A, Saraceno M, Carli N, Bianucci A. Development of QSAR models for predicting hepatocarcinogenic toxicity of chemicals. Eur J Med Chem 2009; 44:3658-64. [DOI: 10.1016/j.ejmech.2009.02.014] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2008] [Revised: 01/29/2009] [Accepted: 02/12/2009] [Indexed: 11/16/2022]
|
31
|
Current mathematical methods used in QSAR/QSPR studies. Int J Mol Sci 2009; 10:1978-1998. [PMID: 19564933 PMCID: PMC2695261 DOI: 10.3390/ijms10051978] [Citation(s) in RCA: 126] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2009] [Accepted: 04/28/2009] [Indexed: 02/07/2023] Open
Abstract
This paper gives an overview of the mathematical methods currently used in quantitative structure-activity/property relationship (QASR/QSPR) studies. Recently, the mathematical methods applied to the regression of QASR/QSPR models are developing very fast, and new methods, such as Gene Expression Programming (GEP), Project Pursuit Regression (PPR) and Local Lazy Regression (LLR) have appeared on the QASR/QSPR stage. At the same time, the earlier methods, including Multiple Linear Regression (MLR), Partial Least Squares (PLS), Neural Networks (NN), Support Vector Machine (SVM) and so on, are being upgraded to improve their performance in QASR/QSPR studies. These new and upgraded methods and algorithms are described in detail, and their advantages and disadvantages are evaluated and discussed, to show their application potential in QASR/QSPR studies in the future.
Collapse
|
32
|
Dearden JC, Cronin MTD, Kaiser KLE. How not to develop a quantitative structure-activity or structure-property relationship (QSAR/QSPR). SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2009; 20:241-66. [PMID: 19544191 DOI: 10.1080/10629360902949567] [Citation(s) in RCA: 289] [Impact Index Per Article: 19.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/18/2023]
Abstract
Although thousands of quantitative structure-activity and structure-property relationships (QSARs/QSPRs) have been published, as well as numerous papers on the correct procedures for QSAR/QSPR analysis, many analyses are still carried out incorrectly, or in a less than satisfactory manner. We have identified 21 types of error that continue to be perpetrated in the QSAR/QSPR literature, and each of these is discussed, with examples (including some of our own). Where appropriate, we make recommendations for avoiding errors and for improving and enhancing QSAR/QSPR analyses.
Collapse
Affiliation(s)
- J C Dearden
- School of Pharmacy and Biomolecular Sciences, Liverpool John Moores University, Liverpool L3 3AF, UK.
| | | | | |
Collapse
|
33
|
Niazi A, Azizi A, Ramezani M. Simultaneous spectrophotometric determination of mercury and palladium with Thio-Michler's Ketone using partial least squares regression and orthogonal signal correction. SPECTROCHIMICA ACTA. PART A, MOLECULAR AND BIOMOLECULAR SPECTROSCOPY 2008; 71:1172-1177. [PMID: 18440861 DOI: 10.1016/j.saa.2008.03.017] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/08/2008] [Revised: 03/12/2008] [Accepted: 03/18/2008] [Indexed: 05/26/2023]
Abstract
A simple, novel and sensitive spectrophotometric method was described for simultaneous determination of mercury and palladium. The method is based on the complex formation of mercury and palladium with Thio-Michler's Ketone (TMK) at pH 3.5. All factors affecting on the sensitivity were optimized and the linear dynamic range for determination of mercury and palladium found. The simultaneous determination of mercury and palladium mixtures by using spectrophotometric method is a difficult problem, due to spectral interferences. By multivariate calibration methods such as partial least squares (PLS), it is possible to obtain a model adjusted to the concentration values of the mixtures used in the calibration range. Orthogonal signal correction (OSC) is a preprocessing technique used for removing the information unrelated to the target variables based on constrained principal component analysis. OSC is a suitable preprocessing method for PLS calibration of mixtures without loss of prediction capacity using spectrophotometric method. In this study, the calibration model is based on absorption spectra in the 360-660 nm range for 25 different mixtures of mercury and palladium. Calibration matrices were containing 0.025-1.60 and 0.05-0.50 microg mL(-1) of mercury and palladium, respectively. The RMSEP for mercury and palladium with OSC and without OSC were 0.013, 0.006 and 0.048, 0.030, respectively. This procedure allows the simultaneous determination of mercury and palladium in synthetic and real matrix samples good reliability of the determination.
Collapse
Affiliation(s)
- Ali Niazi
- Department of Chemistry, Faculty of Sciences, Azad University of Arak, Arak, Iran.
| | | | | |
Collapse
|
34
|
Niazi A, Sharifi S, Amjadi E. Least-squares support vector machines for simultaneous voltammetric determination of lead and tin: A comparison between LS-SVM and PLS in voltammetric data. J Electroanal Chem (Lausanne) 2008. [DOI: 10.1016/j.jelechem.2008.06.021] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
|