1
|
Kajtazi A, Russo G, Wicht K, Eghbali H, Lynen F. Facilitating structural elucidation of small environmental solutes in RPLC-HRMS by retention index prediction. CHEMOSPHERE 2023; 337:139361. [PMID: 37392796 DOI: 10.1016/j.chemosphere.2023.139361] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/14/2023] [Revised: 06/06/2023] [Accepted: 06/26/2023] [Indexed: 07/03/2023]
Abstract
Implementing effective environmental management strategies requires a comprehensive understanding of the chemical composition of environmental pollutants, particularly in complex mixtures. Utilizing innovative analytical techniques, such as high-resolution mass spectrometry and predictive retention index models, can provide valuable insights into the molecular structures of environmental contaminants. Liquid Chromatography-High-Resolution Mass Spectrometry is a powerful tool for the identification of isomeric structures in complex samples. However, there are some limitations that can prevent accurate isomeric structure identification, particularly in cases where the isomers have similar mass and fragmentation patterns. Liquid chromatographic retention, determined by the size, shape, and polarity of the analyte and its interactions with the stationary phase, contains valuable 3D structural information that is vastly underutilized. Therefore, a predictive retention index model is developed which is transferrable to LC-HRMS systems and can assist in the structural elucidation of unknowns. The approach is currently restricted to carbon, hydrogen, and oxygen-based molecules <500 g mol-1. The methodology facilitates the acceptance of accurate structural formulas and the exclusion of erroneous hypothetical structural representations by leveraging retention time estimations, thereby providing a permissible tolerance range for a given elemental composition and experimental retention time. This approach serves as a proof of concept for the development of a Quantitative Structure-Retention Relationship model using a generic gradient LC approach. The use of a widely used reversed-phase (U)HPLC column and a relatively large set of training (101) and test compounds (14) demonstrates the feasibility and potential applicability of this approach for predicting the retention behaviour of compounds in complex mixtures. By providing a standard operating procedure, this approach can be easily replicated and applied to various analytical challenges, further supporting its potential for broader implementation.
Collapse
Affiliation(s)
- Ardiana Kajtazi
- Separation Science Group, Department of Organic and Macromolecular Chemistry, Ghent University, Krijgslaan 281 S4bis, B-9000 Ghent, Belgium
| | - Giacomo Russo
- School of Applied Sciences, Sighthill Campus, Edinburgh Napier University, 9 Sighthill Ct, EH11 4BN, Edinburgh, United Kingdom
| | - Kristina Wicht
- Separation Science Group, Department of Organic and Macromolecular Chemistry, Ghent University, Krijgslaan 281 S4bis, B-9000 Ghent, Belgium
| | - Hamed Eghbali
- Packaging and Specialty Plastics R&D, Dow Benelux B.V., Terneuzen, 4530 AA, the Netherlands
| | - Frédéric Lynen
- Separation Science Group, Department of Organic and Macromolecular Chemistry, Ghent University, Krijgslaan 281 S4bis, B-9000 Ghent, Belgium.
| |
Collapse
|
2
|
Oguike OE, Ugwuishiwu CH, Asogwa CN, Nnadi CO, Obonga WO, Attama AA. Systematic review on the application of machine learning to quantitative structure-activity relationship modeling against Plasmodium falciparum. Mol Divers 2022; 26:3447-3462. [PMID: 35064444 PMCID: PMC8782692 DOI: 10.1007/s11030-022-10380-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2021] [Accepted: 01/07/2022] [Indexed: 11/29/2022]
Abstract
Malaria accounts for over two million deaths globally. To flatten this curve, there is a need to develop new and high potent drugs against Plasmodium falciparum. Some major challenges include the dearth of suitable animal models for anti-P. falciparum assays, resistance to first-line drugs, lack of vaccines and the complex life cycle of Plasmodium. Gladly, newer approaches to antimalarial drug discovery have emerged due to the release of large datasets by pharmaceutical companies. This review provides insights into these new approaches to drug discovery covering different machine learning tools, which enhance the development of new compounds. It provides a systematic review on the use and prospects of machine learning in predicting, classifying and clustering IC50 values of bioactive compounds against P. falciparum. The authors identified many machine learning tools yet to be applied for this purpose. However, Random Forest and Support Vector Machines have been extensively applied though on a limited dataset of compounds.
Collapse
Affiliation(s)
- Osondu Everestus Oguike
- Machine Learning Research Group, University of Nigeria, Nsukka, 410001, Enugu State, Nigeria.,Department of Computer Science, Faculty of Physical Sciences, University of Nigeria, Nsukka, 410001, Enugu State, Nigeria
| | - Chikodili Helen Ugwuishiwu
- Machine Learning Research Group, University of Nigeria, Nsukka, 410001, Enugu State, Nigeria.,Department of Computer Science, Faculty of Physical Sciences, University of Nigeria, Nsukka, 410001, Enugu State, Nigeria
| | - Caroline Ngozi Asogwa
- Machine Learning Research Group, University of Nigeria, Nsukka, 410001, Enugu State, Nigeria.,Department of Computer Science, Faculty of Physical Sciences, University of Nigeria, Nsukka, 410001, Enugu State, Nigeria
| | - Charles Okeke Nnadi
- Machine Learning Research Group, University of Nigeria, Nsukka, 410001, Enugu State, Nigeria. .,Deprtment of Pharmaceutical and Medicinal Chemistry, Faculty of Pharmaceutical Sciences, University of Nigeria, Nsukka, 410001, Enugu State, Nigeria.
| | - Wilfred Ofem Obonga
- Machine Learning Research Group, University of Nigeria, Nsukka, 410001, Enugu State, Nigeria.,Deprtment of Pharmaceutical and Medicinal Chemistry, Faculty of Pharmaceutical Sciences, University of Nigeria, Nsukka, 410001, Enugu State, Nigeria
| | - Anthony Amaechi Attama
- Machine Learning Research Group, University of Nigeria, Nsukka, 410001, Enugu State, Nigeria.,Department of Pharmaceutics, Faculty of Pharmaceutical Sciences, University of Nigeria, Nsukka, 410001, Enugu State, Nigeria
| |
Collapse
|
3
|
Correlation between the structure and skin permeability of compounds. Sci Rep 2021; 11:10076. [PMID: 33980965 PMCID: PMC8115152 DOI: 10.1038/s41598-021-89587-5] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2021] [Accepted: 04/28/2021] [Indexed: 12/02/2022] Open
Abstract
A three-descriptor quantitative structure–activity/toxicity relationship (QSAR/QSTR) model was developed for the skin permeability of a sufficiently large data set consisting of 274 compounds, by applying support vector machine (SVM) together with genetic algorithm. The optimal SVM model possesses the coefficient of determination R2 of 0.946 and root mean square (rms) error of 0.253 for the training set of 139 compounds; and a R2 of 0.872 and rms of 0.302 for the test set of 135 compounds. Compared with other models reported in the literature, our SVM model shows better statistical performance in a model that deals with more samples in the test set. Therefore, applying a SVM algorithm to develop a nonlinear QSAR model for skin permeability was achieved.
Collapse
|
5
|
Liu Q, Deng J, Liu M. Classification models for predicting the antimalarial activity against Plasmodium falciparum. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2020; 31:313-324. [PMID: 32191533 DOI: 10.1080/1062936x.2020.1740890] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/23/2020] [Accepted: 03/07/2020] [Indexed: 06/10/2023]
Abstract
Support vector machine (SVM) and general regression neural network (GRNN) were used to develop classification models for predicting the antimalarial activity against Plasmodium falciparum. Only 15 molecular descriptors were used to build the classification models for the antimalarial activities of 4750 compounds, which were divided into a training set (3887 compounds) and a test set (863 compounds). For the SVM model, its prediction accuracies are 89.5% for the training set and 87.3% for the test set. For the GRNN model, the prediction accuracies for the two sets are 99.7% and 88.9%, respectively. Both SVC and GRNN models have better prediction ability than the classification model based on binary logistic regression (BLR) analysis. Compared with previously published classification models both SVC and GRNN models are satisfactory in predicting antimalarial activities of compounds with in addition of fewer descriptors.
Collapse
Affiliation(s)
- Q Liu
- Hunan Provincial Key Laboratory of Environmental Catalysis & Waste Regeneration, College of Materials and Chemical Engineering, Hunan Institute of Engineering, Xiangtan, China
| | - J Deng
- Hunan Provincial Key Laboratory of Environmental Catalysis & Waste Regeneration, College of Materials and Chemical Engineering, Hunan Institute of Engineering, Xiangtan, China
| | - M Liu
- School of Chemistry and Materials Engineering, Huizhou University, Huizhou, PR China
| |
Collapse
|
6
|
Ancuceanu R, Tamba B, Stoicescu CS, Dinu M. Use of QSAR Global Models and Molecular Docking for Developing New Inhibitors of c-src Tyrosine Kinase. Int J Mol Sci 2019; 21:ijms21010019. [PMID: 31861445 PMCID: PMC6981969 DOI: 10.3390/ijms21010019] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2019] [Revised: 12/15/2019] [Accepted: 12/16/2019] [Indexed: 12/11/2022] Open
Abstract
A prototype of a family of at least nine members, cellular Src tyrosine kinase is a therapeutically interesting target because its inhibition might be of interest not only in a number of malignancies, but also in a diverse array of conditions, from neurodegenerative pathologies to certain viral infections. Computational methods in drug discovery are considerably cheaper than conventional methods and offer opportunities of screening very large numbers of compounds in conditions that would be simply impossible within the wet lab experimental settings. We explored the use of global quantitative structure-activity relationship (QSAR) models and molecular ligand docking in the discovery of new c-src tyrosine kinase inhibitors. Using a dataset of 1038 compounds from ChEMBL database, we developed over 350 QSAR classification models. A total of 49 models with reasonably good performance were selected and the models were assembled by stacking with a simple majority vote and used for the virtual screening of over 100,000 compounds. A total of 744 compounds were predicted by at least 50% of the QSAR models as active, 147 compounds were within the applicability domain and predicted by at least 75% of the models to be active. The latter 147 compounds were submitted to molecular ligand docking using AutoDock Vina and LeDock, and 89 were predicted to be active based on the energy of binding.
Collapse
Affiliation(s)
- Robert Ancuceanu
- Faculty of Pharmacy, Carol Davila University of Medicine and Pharmacy, 020956 Bucharest, Romania; (R.A.); (M.D.)
| | - Bogdan Tamba
- Advanced Research and Development Center for Experimental Medicine (CEMEX), Grigore T. Popa, University of Medicine and Pharmacy of Iasi, 700115 Iasi, Romania
- Correspondence:
| | - Cristina Silvia Stoicescu
- Department of Chemical Thermodynamics, Institute of Physical Chemistry “Ilie Murgulescu”, 060021 Bucharest, Romania;
| | - Mihaela Dinu
- Faculty of Pharmacy, Carol Davila University of Medicine and Pharmacy, 020956 Bucharest, Romania; (R.A.); (M.D.)
| |
Collapse
|