1
|
Alharthi AM, Kadir DH, Al-Fakih AM, Algamal ZY, Al-Thanoon NA, Qasim MK. Quantitative structure-property relationship modelling for predicting retention indices of essential oils based on an improved horse herd optimization algorithm. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2023; 34:831-846. [PMID: 37885432 DOI: 10.1080/1062936x.2023.2261855] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/12/2023] [Accepted: 09/17/2023] [Indexed: 10/28/2023]
Abstract
The horse herd optimization algorithm (HOA), one of the more contemporary metaheuristic algorithms, has demonstrated superior performance in a number of challenging optimization tasks. In the present work, the descriptor selection issue is resolved by classifying different essential oil retention indices using the binary form, BHOA. Based on internal and external prediction criteria, Z-shape transfer functions (ZTF) were tested to verify their efficiency in improving BHOA performance in QSPR modelling for predicting retention indices of essential oils. The evaluation criteria involved the mean-squared error of the training and testing datasets (MSE), and leave-one-out internal and external validation (Q2). The degree of convergence of the proposed Z-shaped transfer functions was compared. In addition, K-fold cross validation with k = 5 was applied. The results show that ZTF, especially ZTF1, greatly improves the performance of the original BHOA. Comparatively speaking, ZTF, especially ZTF1, exhibits the fastest convergence behaviour of the binary algorithms. It chooses the fewest descriptors and requires the fewest iterations to achieve excellent prediction performance.
Collapse
Affiliation(s)
- A M Alharthi
- Department of Mathematics, Turabah University College, Taif University, Taif, Saudi Arabia
| | - D H Kadir
- Department of Statistics, College of Administration and Economics, Salahaddin University-Erbil, Erbil, F.R. Iraq
- Department of Business Administration, Cihan University-Erbil, Erbil, Iraq
| | - A M Al-Fakih
- Department of Chemistry, Faculty of Science, Universiti Teknologi Malaysia, Johor, Malaysia
- Department of Chemistry, Faculty of Science, Sana'a University, Sana'a, Yemen
| | - Z Y Algamal
- Department of Statistics and Informatics, University of Mosul, Mosul, Iraq
| | - N A Al-Thanoon
- Department of Operations Research and Intelligent Techniques, University of Mosul, Mosul, Iraq
| | - M K Qasim
- Department of General Science, University of Mosul, Mosul, Iraq
| |
Collapse
|
2
|
Kumari P, Van Laethem T, Hubert P, Fillet M, Sacré PY, Hubert C. Quantitative Structure Retention-Relationship Modeling: Towards an Innovative General-Purpose Strategy. Molecules 2023; 28:1696. [PMID: 36838689 PMCID: PMC9964055 DOI: 10.3390/molecules28041696] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2023] [Revised: 02/05/2023] [Accepted: 02/08/2023] [Indexed: 02/12/2023] Open
Abstract
Reversed-Phase Liquid Chromatography (RPLC) is a common liquid chromatographic mode used for the control of pharmaceutical compounds during their drug life cycle. Nevertheless, determining the optimal chromatographic conditions that enable this separation is time consuming and requires a lot of lab work. Quantitative Structure Retention Relationship models (QSRR) are helpful for doing this job with minimal time and cost expenditures by predicting retention times of known compounds without performing experiments. In the current work, several QSRR models were built and compared for their adequacy in predicting the retention times. The regression models were based on a combination of linear and non-linear algorithms such as Multiple Linear Regression, Support Vector Regression, Least Absolute Shrinkage and Selection Operator, Random Forest, and Gradient Boosted Regression. Models were built for five pH conditions, i.e., at pH 2.7, 3.5, 6.5, and 8.0. In the end, the model predictions were combined using stacking and the performances of all models were compared. The k-nearest neighbor-based application domain filter was established to assess the reliability of the prediction for further compound prioritization. Altogether, this study can be insightful for analytical chemists working with RPLC to begin with the computational prediction modeling such as QSRR to predict the separation of small molecules.
Collapse
Affiliation(s)
- Priyanka Kumari
- Department of Pharmacy, Laboratory of Pharmaceutical Analytical Chemistry, University of Liège (ULiege), CIRM, Quartier Hopital (B36 Tower 4), Avenue Hippocrate, 4000 Liège, Belgium
- Laboratory for the Analysis of Medicines, University of Liège (ULiege), CIRM, Quartier Hopital (B36 Tower 4), Avenue Hippocrate, 4000 Liège, Belgium
| | - Thomas Van Laethem
- Department of Pharmacy, Laboratory of Pharmaceutical Analytical Chemistry, University of Liège (ULiege), CIRM, Quartier Hopital (B36 Tower 4), Avenue Hippocrate, 4000 Liège, Belgium
- Laboratory for the Analysis of Medicines, University of Liège (ULiege), CIRM, Quartier Hopital (B36 Tower 4), Avenue Hippocrate, 4000 Liège, Belgium
| | - Philippe Hubert
- Department of Pharmacy, Laboratory of Pharmaceutical Analytical Chemistry, University of Liège (ULiege), CIRM, Quartier Hopital (B36 Tower 4), Avenue Hippocrate, 4000 Liège, Belgium
| | - Marianne Fillet
- Laboratory for the Analysis of Medicines, University of Liège (ULiege), CIRM, Quartier Hopital (B36 Tower 4), Avenue Hippocrate, 4000 Liège, Belgium
| | - Pierre-Yves Sacré
- Department of Pharmacy, Laboratory of Pharmaceutical Analytical Chemistry, University of Liège (ULiege), CIRM, Quartier Hopital (B36 Tower 4), Avenue Hippocrate, 4000 Liège, Belgium
| | - Cédric Hubert
- Department of Pharmacy, Laboratory of Pharmaceutical Analytical Chemistry, University of Liège (ULiege), CIRM, Quartier Hopital (B36 Tower 4), Avenue Hippocrate, 4000 Liège, Belgium
| |
Collapse
|
3
|
Al-Fakih AM, Algamal ZY, Lee MH, Aziz M. A penalized quantitative structure-property relationship study on melting point of energetic carbocyclic nitroaromatic compounds using adaptive bridge penalty. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2018; 29:339-353. [PMID: 29493376 DOI: 10.1080/1062936x.2018.1439531] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/02/2017] [Accepted: 02/07/2018] [Indexed: 06/08/2023]
Abstract
A penalized quantitative structure-property relationship (QSPR) model with adaptive bridge penalty for predicting the melting points of 92 energetic carbocyclic nitroaromatic compounds is proposed. To ensure the consistency of the descriptor selection of the proposed penalized adaptive bridge (PBridge), we proposed a ridge estimator ([Formula: see text]) as an initial weight in the adaptive bridge penalty. The Bayesian information criterion was applied to ensure the accurate selection of the tuning parameter ([Formula: see text]). The PBridge based model was internally and externally validated based on [Formula: see text], [Formula: see text], [Formula: see text], [Formula: see text], [Formula: see text], [Formula: see text], the Y-randomization test, [Formula: see text], [Formula: see text], [Formula: see text], [Formula: see text] and the applicability domain. The validation results indicate that the model is robust and not due to chance correlation. The descriptor selection and prediction performance of PBridge for the training dataset outperforms the other methods used. PBridge shows the highest [Formula: see text] of 0.959, [Formula: see text] of 0.953, [Formula: see text] of 0.949 and [Formula: see text] of 0.959, and the lowest [Formula: see text] and [Formula: see text]. For the test dataset, PBridge shows a higher [Formula: see text] of 0.945 and [Formula: see text] of 0.948, and a lower [Formula: see text] and [Formula: see text], indicating its better prediction performance. The results clearly reveal that the proposed PBridge is useful for constructing reliable and robust QSPRs for predicting melting points prior to synthesizing new organic compounds.
Collapse
Affiliation(s)
- A M Al-Fakih
- a Faculty of Science, Department of Chemistry , Universiti Teknologi Malaysia , Johor , Malaysia
- b Faculty of Science, Department of Chemistry , Sana'a University , Sana'a , Yemen
| | - Z Y Algamal
- c Department of Statistics and Informatics , University of Mosul , Mosul , Iraq
| | - M H Lee
- d Faculty of Science, Department of Mathematical Sciences , Universiti Teknologi Malaysia , Johor , Malaysia
| | - M Aziz
- a Faculty of Science, Department of Chemistry , Universiti Teknologi Malaysia , Johor , Malaysia
- e Advanced Membrane Technology Centre , Universiti Teknologi Malaysia , Johor , Malaysia
| |
Collapse
|