1
|
Tutar H, Celik S, Er H, Gönülal E. Impact of morphological traits and irrigation levels on fresh herbage yield of sorghum x sudangrass hybrid: Modelling data mining techniques. PLoS One 2025; 20:e0318230. [PMID: 39908253 DOI: 10.1371/journal.pone.0318230] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2024] [Accepted: 01/14/2025] [Indexed: 02/07/2025] Open
Abstract
In this study, the effect of morphological traits on fresh herbage yield of sorghum x sudangrass hybrid plant grown in Konya province, which is the largest cereal production area in Turkey, was analyzed with some data mining methods. For this purpose, Artificial Neural Networks (ANN), Automatic Linear Model (ALM), Random Forest (RF) Algorithm and Multivariate Adaptive Regression Spline (MARS) Algorithm were used, and the prediction performances of these methods were compared. Plant height of 251.22 cm, stem diameter of 7.03 mm, fresh herbage yield of 8010.69 kg da-1, crude protein ratio of 9.09%, acid detergent fiber 33.23%, neutral detergent fiber 57.44%, acid detergent lignin 7.43%, dry matter digestibility of 63.01%, dry matter intake 2.11%, and relative feed value of 103.02 were the descriptive statistical values that were computed. Model fit statistics, including coefficient of determination (R2), adjusted R2, root of mean square error (RMSE), mean absolute percentage error (MAPE), standard deviation ratio (SD ratio), Mean Absolution Error (MAE) and Relative Absolution Error (RAE), were used to evaluate the prediction abilities of the fitted models. The MARS method was shown to be the best model for describing fresh herbage yield, with the lowest values of RMSE, MAPE, SD ratio, MAE and RAE (137.7, 1.488, 0.072, 109.718 and 0.017, respectively), as well as the highest R2 value (0.995) and adjusted R2 value (0.991). The experimental results show that the MARS algorithm is the most suitable model for predicting fresh herbage yield in sorghum x sudangrass hybrid, providing a good alternative to other data mining algorithms.
Collapse
Affiliation(s)
- Halit Tutar
- Faculty of Agriculture, Department of Field Crops, Bingöl University, Bingöl, Türkiye
| | - Senol Celik
- Faculty of Agriculture, Department of Animal Science, Bingöl University, Bingöl, Türkiye
| | - Hasan Er
- Faculty of Agriculture, Department of Biosystems Engineering, Bingöl University, Bingöl, Türkiye
| | - Erdal Gönülal
- Bahri Dagdas International Agriculture Research Institute, Konya, Türkiye
| |
Collapse
|
2
|
Çelik Ş, Tutar H, Gönülal E, Er H. Prediction of fresh herbage yield using data mining techniques with limited plant quality parameters. Sci Rep 2024; 14:21396. [PMID: 39271726 PMCID: PMC11399138 DOI: 10.1038/s41598-024-72746-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2024] [Accepted: 09/10/2024] [Indexed: 09/15/2024] Open
Abstract
The purpose of this study was to ascertain the fresh herbage yield, fertilizer dosage, and plant characteristics of the Sorghum-Sudangrass hybrid grown in arid and semi-arid regions, as well as their interrelationships. For this reason, data from the Sorghum-Sudangrass hybrid were used to assess the predictive performance of several data mining techniques, including CHAID, CART, MARS, and Bagging MARS. Plant traits were measured in Konya and Sanliurfa during 2021 and 2022. The descriptive statistical values were calculated as follows: plant height 306.27 cm, stem diameter 9.47 mm, fresh herbage yield 10852.51 kg/da, crude protein ratio 9.66%, acid detergent fiber 33.39%, neutral detergent fiber 51.85%, acid detergent lignin 9.76%, dry matter digestibility 62.88%, dry matter intake 2.34%, and relative feed value 114.68 (average values). The predictive capacities of the fitted models were assessed using model fit statistics such as the coefficient of determination (R²), adjusted R², root mean square error (RMSE), mean absolute percentage error (MAPE), standard deviation ratio (SD ratio), and Akaike Information Criterion (AIC). With the lowest values for RMSE, MAPE, SD ratio, and AIC (246, 1.926, 0.085, and 845, respectively), and the highest R² value (0.993) and adjusted R² value (0.989), the MARS algorithm was determined to be the best model for characterizing fresh herbage yield. As a solid alternative to other data mining techniques, the MARS algorithm was shown to be the most appropriate model for forecasting fresh herbage production.
Collapse
Affiliation(s)
- Şenol Çelik
- Biometry and Genetic Unit, Department of Animal Science, Faculty of Agriculture, Bingol University, 12000, Bingöl, Turkey.
| | - Halit Tutar
- Department of Field Crops, Faculty of Agriculture, Bingol University, 12000, Bingöl, Turkey
| | - Erdal Gönülal
- Bahri Dagdas International Agriculture Research Institute, 42000, Konya, Turkey
| | - Hasan Er
- Department of Biosystems Engineering, Faculty of Agriculture, Bingol University, 12000, Bingöl, Turkey
| |
Collapse
|
3
|
Çelik Ş, Yılmaz O. Investigation of the Relationships between Coat Colour, Sex, and Morphological Characteristics in Donkeys Using Data Mining Algorithms. Animals (Basel) 2023; 13:2366. [PMID: 37508143 PMCID: PMC10376350 DOI: 10.3390/ani13142366] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2023] [Revised: 07/02/2023] [Accepted: 07/19/2023] [Indexed: 07/30/2023] Open
Abstract
This study was carried out in order to determine the morphological characteristics, body coat colour distribution, and body dimensions of donkeys raised in Turkey, as well as to determine the relationships between these factors. For this reason, the predictive performance of various machine learning algorithms (i.e., CHAID, Random Forest, ALM, MARS, and Bagging MARS) were compared, utilising the biometric data of donkeys. In particular, mean measurements were taken from a total of 371 donkeys (252 male and 119 female) with descriptive statistical values as follows: height at withers, 100.7 cm; rump height, 103.1 cm; body length, 103.8 cm; chest circumference, 112.8 cm; chest depth, 45.7 cm; chest width, 29.1 cm; front shin circumference, 13.5 cm; head length, 55 cm; and ear length, 22 cm. The body colour distribution of the donkeys considered in this study was calculated as 39.35% grey, 19.95% white, 21.83% black, and 18.87% brown. Model fit statistics, including the coefficient of determination (R2), mean square error, root-mean-square error (RMSE), mean absolute percentage error (MAPE), and standard deviation ratio (SD ratio), were calculated to measure the predictive ability of the fitted models. The MARS algorithm was found to be the best model for defining the body length of donkeys, with the highest R2 value (0.916) and the lowest RMSE, MAPE, and SD ratio values (2.173, 1.615, and 0.291, respectively). The experimental results indicate that the most suitable model is the MARS algorithm, which provides a good alternative to other data mining algorithms for predicting the body length of donkeys.
Collapse
Affiliation(s)
- Şenol Çelik
- Biometry and Genetic Unit, Department of Animal Science, Faculty of Agriculture, Bingol University, Bingol 12000, Turkey
| | - Orhan Yılmaz
- Plant and Animal Production Department, Posof Vocational School, Ardahan University, Ardahan 75000, Turkey
| |
Collapse
|
4
|
Genomic prediction through machine learning and neural networks for traits with epistasis. Comput Struct Biotechnol J 2022; 20:5490-5499. [PMID: 36249559 PMCID: PMC9547190 DOI: 10.1016/j.csbj.2022.09.029] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2022] [Revised: 09/20/2022] [Accepted: 09/20/2022] [Indexed: 11/22/2022] Open
Abstract
Performance of machine learning and neural netowrks in Genomic analysis. Heritability and QTL number impacts on performance machine learning methods. Machine learning models in genomic analyses. Neural networks can present better performance for complex quantitative traits.
Genomic wide selection (GWS) is one contributions of molecular genetics to breeding. Machine learning (ML) and artificial neural networks (ANN) methods are non-parameterized and can develop more accurate and parsimonious models for GWS analysis. Multivariate Adaptive Regression Splines (MARS) is considered one of the most flexible ML methods, automatically modeling nonlinearities and interactions of the predictor variables. This study aimed to evaluate and compare methods based on ANN, ML, including MARS, and G-BLUP through GWS. An F2 population formed by 1000 individuals and genotyped for 4010 SNP markers and twelve traits from a model considering epistatic effect, with QTL numbers ranging from eight to 480 and heritability (h2) of 0.3, 0.5 or 0.8 were simulated. Variation in heritability and number of QTL impacts the performance of methods. About quantitative traits (40, 80, 120, 240, and 480 QTLs) was observed highest R2 to Radial Base Network (RBF) and G-BLUP, followed by Random Forest (RF), Bagging (BA), and Boosting (BO). RF and BA also showed better results for traits to h2 of 0.3 with R2 values 16.51% and 16.30%, respectively, while MARS methods showed better results for oligogenic traits with R2 values ranging from 39,12 % to 43,20 % in h2 of 0.5 and from 59.92% to 78,56% in h2 of 0.8. Non-additive MARS methods also showed high R2 for traits with high heritability and 240 QTLs or more. ANN and ML methods are powerful tools to predict genetic values in traits with epistatic effect, for different degrees of heritability and QTL numbers.
Collapse
|
5
|
Gajewicz-Skretna A, Furuhama A, Yamamoto H, Suzuki N. Generating accurate in silico predictions of acute aquatic toxicity for a range of organic chemicals: Towards similarity-based machine learning methods. CHEMOSPHERE 2021; 280:130681. [PMID: 34162070 DOI: 10.1016/j.chemosphere.2021.130681] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/02/2021] [Revised: 04/21/2021] [Accepted: 04/22/2021] [Indexed: 06/13/2023]
Abstract
There has been an increase in the use of non-animal approaches, such as in silico and/or in vitro methods, for assessing the risks of hazardous chemicals. A number of machine learning algorithms link molecular descriptors that interpret chemical structural properties with their biological activity. These computer-aided methods encounter several challenges, the most significant being the heterogeneity of datasets; more efficient and inclusive computational methods that are able to process large and heterogeneous chemical datasets are needed. In this context, this study verifies the utility of similarity-based machine learning methods in predicting the acute aquatic toxicity of diverse organic chemicals on Daphnia magna and Oryzias latipes. Two similarity-based methods were tested that employ a limited training dataset, most similar to a given fitting point, instead of using the entire dataset that encompasses a wide range of chemicals. The kernel-weighted local polynomial approach had a number of advantages over the distance-weighted k-nearest neighbor (k-NN) algorithm. The results highlight the importance of lipophilicity, electrophilic reactivity, molecular polarizability, and size in determining acute toxicity. The rigorous model validation ensures that this approach is an important tool for estimating toxicity in new or untested chemicals.
Collapse
Affiliation(s)
- Agnieszka Gajewicz-Skretna
- Laboratory of Environmental Chemometrics, Faculty of Chemistry, University of Gdansk, Wita Stwosza 63, 80-308, Gdansk, Poland.
| | - Ayako Furuhama
- Center for Health and Environmental Risk Research, National Institute for Environmental Studies (NIES), 16-2 Onogawa, Tsukuba, 305-8506, Japan; Division of Genetics and Mutagenesis, National Institute of Health Sciences (NIHS), 3-25-26 Tonomachi, Kawasaki-ku, Kawasaki City, Kanagawa, 210-9501, Japan
| | - Hiroshi Yamamoto
- Center for Health and Environmental Risk Research, National Institute for Environmental Studies (NIES), 16-2 Onogawa, Tsukuba, 305-8506, Japan
| | - Noriyuki Suzuki
- Center for Health and Environmental Risk Research, National Institute for Environmental Studies (NIES), 16-2 Onogawa, Tsukuba, 305-8506, Japan
| |
Collapse
|
6
|
In Silico Prediction of Intestinal Permeability by Hierarchical Support Vector Regression. Int J Mol Sci 2020; 21:ijms21103582. [PMID: 32438630 PMCID: PMC7279352 DOI: 10.3390/ijms21103582] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2020] [Revised: 05/14/2020] [Accepted: 05/17/2020] [Indexed: 11/17/2022] Open
Abstract
The vast majority of marketed drugs are orally administrated. As such, drug absorption is one of the important drug metabolism and pharmacokinetics parameters that should be assessed in the process of drug discovery and development. A nonlinear quantitative structure-activity relationship (QSAR) model was constructed in this investigation using the novel machine learning-based hierarchical support vector regression (HSVR) scheme to render the extremely complicated relationships between descriptors and intestinal permeability that can take place through various passive diffusion and carrier-mediated active transport routes. The predictions by HSVR were found to be in good agreement with the observed values for the molecules in the training set (n = 53, r2 = 0.93, q CV 2 = 0.84, RMSE = 0.17, s = 0.08), test set (n = 13, q2 = 0.75-0.89, RMSE = 0.26, s = 0.14), and even outlier set (n = 8, q2 = 0.78-0.92, RMSE = 0.19, s = 0.09). The built HSVR model consistently met the most stringent criteria when subjected to various statistical assessments. A mock test also assured the predictivity of HSVR. Consequently, this HSVR model can be adopted to facilitate drug discovery and development.
Collapse
|
7
|
Shin M, Jang D, Nam H, Lee KH, Lee D. Predicting the Absorption Potential of Chemical Compounds Through a Deep Learning Approach. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2018; 15:432-440. [PMID: 26930688 DOI: 10.1109/tcbb.2016.2535233] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
The human colorectal carcinoma cell line (Caco-2) is a commonly used in-vitro test that predicts the absorption potential of orally administered drugs. In-silico prediction methods, based on the Caco-2 assay data, may increase the effectiveness of the high-throughput screening of new drug candidates. However, previously developed in-silico models that predict the Caco-2 cellular permeability of chemical compounds use handcrafted features that may be dataset-specific and induce over-fitting problems. Deep Neural Network (DNN) generates high-level features based on non-linear transformations for raw features, which provides high discriminant power and, therefore, creates a good generalized model. We present a DNN-based binary Caco-2 permeability classifier. Our model was constructed based on 663 chemical compounds with in-vitro Caco-2 apparent permeability data. Two hundred nine molecular descriptors are used for generating the high-level features during DNN model generation. Dropout regularization is applied to solve the over-fitting problem and the non-linear activation. The Rectified Linear Unit (ReLU) is adopted to reduce the vanishing gradient problem. The results demonstrate that the high-level features generated by the DNN are more robust than handcrafted features for predicting the cellular permeability of structurally diverse chemical compounds in Caco-2 cell lines.
Collapse
|
8
|
Rocher F, Roblin G, Chollet JF. Modifications of the chemical structure of phenolics differentially affect physiological activities in pulvinar cells of Mimosa pudica L. II. Influence of various molecular properties in relation to membrane transport. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2017; 24:6910-6922. [PMID: 26820642 DOI: 10.1007/s11356-016-6048-z] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/29/2015] [Accepted: 01/04/2016] [Indexed: 06/05/2023]
Abstract
Early prediction of compound absorption by cells is of considerable importance in the building of an integrated scheme describing the impact of a compound on intracellular biological processes. In this scope, we study the structure-activity relationships of several benzoic acid-related phenolics which are involved in many plant biological phenomena (growth, flowering, allelopathy, defense processes). Using the partial least squares (PLS) regression method, the impact of molecular descriptors that have been shown to play an important role concerning the uptake of pharmacologically active compounds by animal cells was analyzed in terms of the modification of membrane potential, variations in proton flux, and inhibition of the osmocontractile reaction of pulvinar cells of Mimosa pudica leaves. The hydrogen bond donors (HBD) and hydrogen bond acceptors (HBA), polar surface area (PSA), halogen ratio (Hal ratio), number of rotatable bonds (FRB), molar volume (MV), molecular weight (MW), and molar refractivity (MR) were considered in addition to two physicochemical properties (logD and the amount of non-dissociated form in relation to pKa). HBD + HBA and PSA predominantly impacted the three biological processes compared to the other descriptors. The coefficient of determination in the quantitative structure-activity relationship (QSAR) models indicated that a major part of the observed seismonasty inhibition and proton flux modification can be explained by the impact of these descriptors, whereas this was not the case for membrane potential variations. These results indicate that the transmembrane transport of the compounds is a predominant component. An increasing number of implicated descriptors as the biological processes become more complex may reflect their impacts on an increasing number of sites in the cell. The determination of the most efficient effectors may lead to a practical use to improve drugs in the control of microbial attacks on plants.
Collapse
Affiliation(s)
- Françoise Rocher
- IC2MP (Institut de Chimie des Milieux et des Matériaux de Poitiers), UMR CNRS 7285, Université de Poitiers, 4 rue Michel Brunet, TSA 51106, F-86073, Poitiers cedex 9, France
| | - Gabriel Roblin
- Laboratoire EBI (Écologie et Biologie des Interactions), UMR CNRS 7267, Équipe SEVE (Sucres, Échanges Végétaux, Environnement), Université de Poitiers, 3 rue Jacques Fort, TSA 51106, F-86073, Poitiers cedex 9, France
| | - Jean-François Chollet
- IC2MP (Institut de Chimie des Milieux et des Matériaux de Poitiers), UMR CNRS 7285, Université de Poitiers, 4 rue Michel Brunet, TSA 51106, F-86073, Poitiers cedex 9, France.
| |
Collapse
|
9
|
Structure-based predictions of 13C-NMR chemical shifts for a series of 2-functionalized 5-(methylsulfonyl)-1-phenyl-1H-indoles derivatives using GA-based MLR method. J Mol Struct 2012. [DOI: 10.1016/j.molstruc.2012.06.042] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
|
10
|
Jalali-Heravi M, Mani-Varnosfaderani A, Taherinia D, Mahmoodi MM. The use of Bayesian nonlinear regression techniques for the modelling of the retention behaviour of volatile components of Artemisia species. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2012; 23:461-483. [PMID: 22452344 DOI: 10.1080/1062936x.2012.665083] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
The main aim of this work was to assess the ability of Bayesian multivariate adaptive regression splines (BMARS) and Bayesian radial basis function (BRBF) techniques for modelling the gas chromatographic retention indices of volatile components of Artemisia species. A diverse set of molecular descriptors was calculated and used as descriptor pool for modelling the retention indices. The ability of BMARS and BRBF techniques was explored for the selection of the most relevant descriptors and proper basis functions for modelling. The results revealed that BRBF technique is more reproducible than BMARS for modelling the retention indices and can be used as a method for variable selection and modelling in quantitative structure-property relationship (QSPR) studies. It is also concluded that the Markov chain Monte Carlo (MCMC) search engine, implemented in BRBF algorithm, is a suitable method for selecting the most important features from a vast number of them. The values of correlation between the calculated retention indices and the experimental ones for the training and prediction sets (0.935 and 0.902, respectively) revealed the prediction power of the BRBF model in estimating the retention index of volatile components of Artemisia species.
Collapse
Affiliation(s)
- M Jalali-Heravi
- Department of Chemistry, Sharif University of Technology, Tehran, Iran
| | | | | | | |
Collapse
|
11
|
Luan F, Melo A, Borges F, Cordeiro MND. Affinity prediction on A3 adenosine receptor antagonists: The chemometric approach. Bioorg Med Chem 2011; 19:6853-9. [DOI: 10.1016/j.bmc.2011.09.032] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2011] [Revised: 09/17/2011] [Accepted: 09/19/2011] [Indexed: 10/17/2022]
|
12
|
Classification models for neocryptolepine derivatives as inhibitors of the β-haematin formation. Anal Chim Acta 2011; 705:98-110. [DOI: 10.1016/j.aca.2011.04.019] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2011] [Revised: 04/06/2011] [Accepted: 04/13/2011] [Indexed: 11/18/2022]
|
13
|
Ali AE. Synthesis, spectral, thermal and antimicrobial studies of some new tri metallic biologically active ceftriaxone complexes. SPECTROCHIMICA ACTA. PART A, MOLECULAR AND BIOMOLECULAR SPECTROSCOPY 2011; 78:224-230. [PMID: 21074487 DOI: 10.1016/j.saa.2010.09.025] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/28/2010] [Revised: 08/25/2010] [Accepted: 09/29/2010] [Indexed: 05/30/2023]
Abstract
Iron, cobalt, nickel and copper complexes of ceftriaxone were prepared in 1:3 ligand:metal ratio to examine the ligating properties of the different moieties of the drug. The complexes were found to have high percentages of coordinated water molecules. The modes of bonding were discussed depending on the infrared spectral absorption peaks of the different allowed vibrations. The Nujol mull electronic absorption spectra and the magnetic moment values indicated the Oh geometry of the metal ions in the complexes. The ESR spectra of the iron, cobalt, and copper complexes were determined and discussed. The thermal behaviors of the complexes were studied by TG and DTA techniques. The antimicrobial activities of the complexes were examined and compared to that of the ceftriaxone itself.
Collapse
Affiliation(s)
- Alaa E Ali
- Chemistry Department, Faculty of Science, Alexandria University, Damanhour, Egypt.
| |
Collapse
|
14
|
Wen Y, Liu H, Tian L, Han P, Luan F. Analysis of alkaloids in pharmaceutical preparations containing Kushen by capillary electrophoresis with application of experimental design and a quantitative structure-property relationship approach. ACTA CHROMATOGR 2010. [DOI: 10.1556/achrom.22.2010.3.8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
15
|
Liu KP, Xia BB, Zhang XY. Review of QSPR Modeling of Mobilities of Peptides in Capillary Zone Electrophoresis. J LIQ CHROMATOGR R T 2010. [DOI: 10.1080/10826070802129001] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
Affiliation(s)
- K. P. Liu
- a Department of Chemistry , Lanzhou University, Lanzhou , Gansu, P. R. China
| | - B. B. Xia
- a Department of Chemistry , Lanzhou University, Lanzhou , Gansu, P. R. China
| | - X. Y. Zhang
- a Department of Chemistry , Lanzhou University, Lanzhou , Gansu, P. R. China
| |
Collapse
|
16
|
Abstract
AbstractThe activity of fungicide agents containing a quinazolinone ring was described using the quantitative structure-activity relationship (QSAR) model by applying it to data taken from literature. The title compounds exhibit two important types of activity against certain fungal pathogens, i.e. activity against yeast and activity against filamentous fungi. A correlation between both antifungal activities (e.g. FA(yst) and FA(ff)) and physicochemical parameters such as the logarithm of the n-octanol/water partition coefficient (log P), the polarizability (P), the global minimum energy (TE), the energy difference between the frontier molecular orbital (DELH) and the molar refractivity (MR), was established using multiple linear regression. The molecular descriptors of the antifungal agents were obtained by quantum chemical calculations combined with molecular modeling calculations. Statistical analysis shows that the antifungal activity depends mainly on the calculated partition coefficients, log P, of the compounds. Bi-parametric models reveal that antifungal activity relates linearly to log P and P.
Collapse
|
17
|
Reynolds DP, Lanevskij K, Japertas P, Didziapetris R, Petrauskas A. Ionization-specific analysis of human intestinal absorption. J Pharm Sci 2010; 98:4039-54. [PMID: 19360843 DOI: 10.1002/jps.21730] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]
Abstract
This study presents a mechanistic QSAR analysis of human intestinal absorption of drugs and drug-like compounds using a data set of 567 %HIA values. Experimental data represent passive diffusion across intestinal membranes, and are considered to be reasonably free of carrier-mediated transport or other unwanted effects. A nonlinear model was developed relating %HIA to physicochemical properties of drugs (lipophilicity, ionization, hydrogen bonding, and molecular size). The model describes ion-specific intestinal permeability of drugs by both transcellular and paracellular routes, and also accounts for unstirred water layer effects. The obtained model was validated on two external data sets consisting of in vivo human jejunal permeability coefficients (P(eff)) and absorption rate constants (K(a)). Validation results demonstrate good predictive power of the model (RMSE = 0.35-0.45 log units for log K(a) and log P(eff)). High prediction accuracy together with clear physicochemical interpretation (log P, pK(a)) makes this model particularly suitable for use in property-based drug design.
Collapse
|
18
|
Prediction of hydrophile–lipophile balance values of anionic surfactants using a quantitative structure–property relationship. J Colloid Interface Sci 2009; 336:773-9. [DOI: 10.1016/j.jcis.2009.04.002] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2008] [Revised: 03/31/2009] [Accepted: 04/01/2009] [Indexed: 11/23/2022]
|
19
|
Jalali-Heravi M, Asadollahi-Baboli M, Mani-Varnosfaderani A. Shuffling multivariate adaptive regression splines and adaptive neuro-fuzzy inference system as tools for QSAR study of SARS inhibitors. J Pharm Biomed Anal 2009; 50:853-60. [PMID: 19665859 PMCID: PMC7126869 DOI: 10.1016/j.jpba.2009.07.009] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2009] [Revised: 07/04/2009] [Accepted: 07/06/2009] [Indexed: 11/04/2022]
Abstract
In this work, the inhibitory activity of pyridine N-oxide derivatives against human severe acute respiratory syndrome (SARS) is predicted in terms of quantitative structure–activity relationship (QSAR) models. These models were developed with the aid of multivariate adaptive regression spline (MARS) and adaptive neuro-fuzzy inference system (ANFIS) combined with shuffling cross-validation technique. A shuffling MARS algorithm is utilized to select the most important variables in QSAR modeling and then these variables were used as inputs of ANFIS to predict SARS inhibitory activities of pyridine N-oxide derivatives. A data set of 119 drug-like compounds was coded with over hundred calculated meaningful molecular descriptors. The best descriptors describing the inhibition mechanism were solvation connectivity index, length to breadth ratio, relative negative charge, harmonic oscillator of aromatic index, average molecular weight and total path count. These parameters are among topological, electronic, geometric, constitutional and aromaticity descriptors. The statistical parameters of R2 and root mean square error (RMSE) are 0.884 and 0.359, respectively. The accuracy and robustness of shuffling MARS–ANFIS model in predicting inhibition behavior of pyridine N-oxide derivatives (pIC50) was illustrated using leave-one-out and leave-multiple-out cross-validation techniques and also by Y-randomization. Comparison of the results of the proposed model with those of GA-PLS-ANFIS shows that the shuffling MARS–ANFIS model is superior and can be considered as a tool for predicting the inhibitory behavior of SARS drug-like molecules.
Collapse
Affiliation(s)
- M Jalali-Heravi
- Department of Chemistry, Sharif University of Technology, P.O. Box 11155-9516, Tehran, Iran.
| | | | | |
Collapse
|
20
|
Quantitative structure-property relationship study on the determination of binding constant by fluorescence quenching. OPEN CHEM 2009. [DOI: 10.2478/s11532-008-0095-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
AbstractModels to predict binding constant (logK) to bovine serum albumin (BSA) should be very useful in the pharmaceutical industry to help speed up the design of new compounds, especially as far as pharmacokinetics is concerned. We present here an extensive list of logK binding constants for thirty-five compounds to BSA determined by florescence quenching from the literature. These data have allowed us the derivation of a quantitative structure-property relationship (QSPR) model to predict binding constants to BSA of compounds on the basis of their structure. A stepwise multiple linear regression (MLR) was performed to build the model. The statistical parameter provided by the MLR model (R = 0.9200, RMS = 0.3305) indicated satisfactory stability and predictive ability for the model. Using florescence quenching spectroscopy, we also experimentally determined the binding constants to BSA for two bioactive components in traditional Chinese medicines. Using the proposed model it was possible to predict the binding constants for each, which were in good agreement with the experimental results. This QSPR approach can contribute to a better understanding of structural factors of the compounds responsible for drug-protein interactions, and be useful in predicting the binding constants of other compounds.
Collapse
|
21
|
Considerations and recent advances in QSAR models for cytochrome P450-mediated drug metabolism prediction. J Comput Aided Mol Des 2008; 22:843-55. [DOI: 10.1007/s10822-008-9225-4] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2007] [Accepted: 06/08/2008] [Indexed: 02/07/2023]
|
22
|
Luan F, Liu HT, Wen Y, Zhang X. Prediction of quantitative calibration factors of some organic compounds in gas chromatography. Analyst 2008; 133:881-7. [PMID: 18575640 DOI: 10.1039/b800148k] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
A quantitative structure-property relationship (QSPR) methodology that involves multilinear (Hansch-type) and nonlinear (radial basis function neural network (RBFNN)) approaches was performed to correlate the quantitative molar calibration factors (f(M)) of 140 organic compounds against structural factors. The statistical characteristics provided by the multiple linear model (R(2) = 0.963; RMS = 0.089; AARD = 3.86% for test set) indicated satisfactory stability and predictive ability, while the predictive ability of the RBFNN model is somewhat superior (R(2) = 0.983; RMS = 0.075; AARD = 3.19% for test set). The multilinear model provided some insight into the main structure factors that modulate the quantitative calibration factor of the investigated compounds.
Collapse
Affiliation(s)
- Feng Luan
- Department of Applied Chemistry, Yantai University, Yantai, 264005, PR China.
| | | | | | | |
Collapse
|
23
|
Luan F, Liu HT, Wen Y, Zhang X. Quantitative structure-property relationship study for estimation of quantitative calibration factors of some organic compounds in gas chromatography. Anal Chim Acta 2008; 612:126-35. [PMID: 18358857 DOI: 10.1016/j.aca.2008.02.037] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2007] [Revised: 02/20/2008] [Accepted: 02/21/2008] [Indexed: 11/18/2022]
Abstract
Quantitative structure-property relationship (QSPR) models have been used to predict and explain gas chromatographic data of quantitative calibration factors (f(M)). This method allows for the prediction of quantitative calibration factors in a variety of organic compounds based on their structures alone. Stepwise multiple linear regression (MLR) and non-linear radial basis function neural network (RBFNN) were performed to build the models. The statistical characteristics provided by multiple linear model (R2=0.927, RMS=0.073; AARD=6.34% for test set) indicated satisfactory stability and predictive ability, while the predictive ability of RBFNN model is somewhat superior (R2=0.959; RMS=0.0648; AARD=4.85% for test set). This QSPR approach can contribute to a better understanding of structural factors of the compounds responsible for quantitative analysis by gas chromatography, and can be useful in predicting the quantitative calibration factors of other compounds.
Collapse
Affiliation(s)
- Feng Luan
- Department of Applied Chemistry, Yantai University, Yantai 264005, PR China.
| | | | | | | |
Collapse
|
24
|
Deconinck E, Zhang M, Petitet F, Dubus E, Ijjaali I, Coomans D, Vander Heyden Y. Boosted regression trees, multivariate adaptive regression splines and their two-step combinations with multiple linear regression or partial least squares to predict blood–brain barrier passage: A case study. Anal Chim Acta 2008; 609:13-23. [DOI: 10.1016/j.aca.2007.12.033] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2007] [Revised: 12/04/2007] [Accepted: 12/19/2007] [Indexed: 11/16/2022]
|
25
|
Konovalov DA, Sim N, Deconinck E, Vander Heyden Y, Coomans D. Statistical Confidence for Variable Selection in QSAR Models via Monte Carlo Cross-Validation. J Chem Inf Model 2008; 48:370-83. [DOI: 10.1021/ci700283s] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Dmitry A. Konovalov
- School of Mathematics, Physics and Information Technology, James Cook University, Townsville, Queensland 4811, Australia, Laboratory for Pharmaceutical Technology and Biopharmacy, Katholieke Universiteit Leuven, B-3000 Leuven, Belgium, and Department of Analytical Chemistry and Pharmaceutical Technology, Pharmaceutical Institute, Vrije Universiteit Brussel, B-1050 Brussels, Belgium
| | - Nigel Sim
- School of Mathematics, Physics and Information Technology, James Cook University, Townsville, Queensland 4811, Australia, Laboratory for Pharmaceutical Technology and Biopharmacy, Katholieke Universiteit Leuven, B-3000 Leuven, Belgium, and Department of Analytical Chemistry and Pharmaceutical Technology, Pharmaceutical Institute, Vrije Universiteit Brussel, B-1050 Brussels, Belgium
| | - Eric Deconinck
- School of Mathematics, Physics and Information Technology, James Cook University, Townsville, Queensland 4811, Australia, Laboratory for Pharmaceutical Technology and Biopharmacy, Katholieke Universiteit Leuven, B-3000 Leuven, Belgium, and Department of Analytical Chemistry and Pharmaceutical Technology, Pharmaceutical Institute, Vrije Universiteit Brussel, B-1050 Brussels, Belgium
| | - Yvan Vander Heyden
- School of Mathematics, Physics and Information Technology, James Cook University, Townsville, Queensland 4811, Australia, Laboratory for Pharmaceutical Technology and Biopharmacy, Katholieke Universiteit Leuven, B-3000 Leuven, Belgium, and Department of Analytical Chemistry and Pharmaceutical Technology, Pharmaceutical Institute, Vrije Universiteit Brussel, B-1050 Brussels, Belgium
| | - Danny Coomans
- School of Mathematics, Physics and Information Technology, James Cook University, Townsville, Queensland 4811, Australia, Laboratory for Pharmaceutical Technology and Biopharmacy, Katholieke Universiteit Leuven, B-3000 Leuven, Belgium, and Department of Analytical Chemistry and Pharmaceutical Technology, Pharmaceutical Institute, Vrije Universiteit Brussel, B-1050 Brussels, Belgium
| |
Collapse
|