1
|
Xiao F, Ding X, Shi Y, Wang D, Wang Y, Cui C, Zhu T, Chen K, Xiang P, Luo X. Application of ensemble learning for predicting GABA A receptor agonists. Comput Biol Med 2024; 169:107958. [PMID: 38194778 DOI: 10.1016/j.compbiomed.2024.107958] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2023] [Revised: 12/29/2023] [Accepted: 01/01/2024] [Indexed: 01/11/2024]
Abstract
BACKGROUND Over the past few decades, agonists binding to the benzodiazepine site of the GABAA receptor have been successfully developed as clinical drugs. Different modulators (agonist, antagonist, and reverse agonist) bound to benzodiazepine sites exhibit different or even opposite pharmacological effects, however, their structures are so similar that it is difficult to distinguish them based solely on molecular skeleton. This study aims to develop classification models for predicting the agonists. METHODS 306 agonists or non-agonists were collected from literature. Six machine learning algorithms including RF, XGBoost, AdaBoost, GBoost, SVM, and ANN algorithms were employed for model development. Using six descriptors including 1D/2D Descriptors, ECFP4, 2D-Pharmacophore, MACCS, PubChem, and Estate fingerprint to characterize chemical structures. The model interpretability was explored by SHAP method. RESULTS The best model demonstrated an AUC value of 0.905 and an MCC value of 0.808 for the test set. The PubMac-based model (PubMac-GB) achieved best AUC values of 0.935 for test set. The SHAP analysis results emphasized that MaccsFP62, ECFP_624, ECFP_724, and PubchemFP213 were the crucial molecular features. Applicability domain analysis was also performed to determine reliable prediction boundaries for the model. The PubMac-GB model was applied to virtual screening for potential GABAA agonists and the top 100 compounds were given. CONCLUSION Overall, our ensemble learning-based model (PubMac-GB) achieved comparable performance and would be helpful in effectively identifying agonists of GABAA receptors.
Collapse
Affiliation(s)
- Fu Xiao
- School of Chinese Materia Medica, Nanjing University of Chinese Medicine, Nanjing, 210023, China; Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, 555 Zuchongzhi Road, Shanghai, 201203, China
| | - Xiaoyu Ding
- Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, 555 Zuchongzhi Road, Shanghai, 201203, China; University of Chinese Academy of Sciences, No. 19A Yuquan Road, Beijing, 100049, China
| | - Yan Shi
- Academy of Forensic Science, Shanghai Key Laboratory of Forensic Medicine, Shanghai Forensic Service Platform, Key Laboratory of Forensic Science, Ministry of Justice, Shanghai, 200063, China
| | - Dingyan Wang
- Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, 555 Zuchongzhi Road, Shanghai, 201203, China; University of Chinese Academy of Sciences, No. 19A Yuquan Road, Beijing, 100049, China
| | - Yitian Wang
- Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, 555 Zuchongzhi Road, Shanghai, 201203, China; University of Chinese Academy of Sciences, No. 19A Yuquan Road, Beijing, 100049, China
| | - Chen Cui
- Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, 555 Zuchongzhi Road, Shanghai, 201203, China; University of Chinese Academy of Sciences, No. 19A Yuquan Road, Beijing, 100049, China
| | - Tingfei Zhu
- Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, 555 Zuchongzhi Road, Shanghai, 201203, China; University of Chinese Academy of Sciences, No. 19A Yuquan Road, Beijing, 100049, China
| | - Kaixian Chen
- School of Chinese Materia Medica, Nanjing University of Chinese Medicine, Nanjing, 210023, China; Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, 555 Zuchongzhi Road, Shanghai, 201203, China; University of Chinese Academy of Sciences, No. 19A Yuquan Road, Beijing, 100049, China
| | - Ping Xiang
- Academy of Forensic Science, Shanghai Key Laboratory of Forensic Medicine, Shanghai Forensic Service Platform, Key Laboratory of Forensic Science, Ministry of Justice, Shanghai, 200063, China.
| | - Xiaomin Luo
- School of Chinese Materia Medica, Nanjing University of Chinese Medicine, Nanjing, 210023, China; Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, 555 Zuchongzhi Road, Shanghai, 201203, China; University of Chinese Academy of Sciences, No. 19A Yuquan Road, Beijing, 100049, China.
| |
Collapse
|
2
|
Toropov AA, Barnes DA, Toropova AP, Roncaglioni A, Irvine AR, Masereeuw R, Benfenati E. CORAL Models for Drug-Induced Nephrotoxicity. TOXICS 2023; 11:293. [PMID: 37112520 PMCID: PMC10142465 DOI: 10.3390/toxics11040293] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/21/2023] [Revised: 03/20/2023] [Accepted: 03/21/2023] [Indexed: 06/19/2023]
Abstract
Drug-induced nephrotoxicity is a major cause of kidney dysfunction with potentially fatal consequences. The poor prediction of clinical responses based on preclinical research hampers the development of new pharmaceuticals. This emphasises the need for new methods for earlier and more accurate diagnosis to avoid drug-induced kidney injuries. Computational predictions of drug-induced nephrotoxicity are an attractive approach to facilitate such an assessment and such models could serve as robust and reliable replacements for animal testing. To provide the chemical information for computational prediction, we used the convenient and common SMILES format. We examined several versions of so-called optimal SMILES-based descriptors. We obtained the highest statistical values, considering the specificity, sensitivity and accuracy of the prediction, by applying recently suggested atoms pairs proportions vectors and the index of ideality of correlation, which is a special statistical measure of the predictive potential. Implementation of this tool in the drug development process might lead to safer drugs in the future.
Collapse
Affiliation(s)
- Andrey A. Toropov
- Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Via Mario Negri 2, 20156 Milano, Italy; (A.P.T.); (A.R.); (E.B.)
| | - Devon A. Barnes
- Utrecht Institute for Pharmaceutical Sciences, div. Pharmacology, Universiteitsweg 99, 3584 CG Utrecht, The Netherlands; (D.A.B.); (A.R.I.); (R.M.)
| | - Alla P. Toropova
- Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Via Mario Negri 2, 20156 Milano, Italy; (A.P.T.); (A.R.); (E.B.)
| | - Alessandra Roncaglioni
- Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Via Mario Negri 2, 20156 Milano, Italy; (A.P.T.); (A.R.); (E.B.)
| | - Alasdair R. Irvine
- Utrecht Institute for Pharmaceutical Sciences, div. Pharmacology, Universiteitsweg 99, 3584 CG Utrecht, The Netherlands; (D.A.B.); (A.R.I.); (R.M.)
| | - Rosalinde Masereeuw
- Utrecht Institute for Pharmaceutical Sciences, div. Pharmacology, Universiteitsweg 99, 3584 CG Utrecht, The Netherlands; (D.A.B.); (A.R.I.); (R.M.)
| | - Emilio Benfenati
- Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Via Mario Negri 2, 20156 Milano, Italy; (A.P.T.); (A.R.); (E.B.)
| |
Collapse
|
3
|
Manganelli S, Gamba A, Colombo E, Benfenati E. Using VEGAHUB Within a Weight-of-Evidence Strategy. Methods Mol Biol 2022; 2425:479-495. [PMID: 35188643 DOI: 10.1007/978-1-0716-1960-5_18] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Industrial needs and regulatory requirements have played a significant role in accelerating the use of nontesting methods including in silico tools as alternatives to animal testing. The main interest is not solely on the use of in silico tools, or in read-across, but on better toxicological safety assessment of substances, and for this purpose more advanced, integrated strategies have to be implemented. VEGAHUB wants to promote this broader view, not necessarily focused on a specific approach. Applying multiple tools and complementary approaches instead of one technique may provide more elements for a more robust evaluation, but at the same time it is important to have a conceptual scheme to integrate multiple, heterogeneous lines of evidence. We will show how the user can benefit from the diversity of tools available within the platform VEGAHUB for assessing the biological properties of chemical substances on an example of (non)mutagenicity.
Collapse
Affiliation(s)
| | - Alessio Gamba
- Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Milan, Italy
| | - Erika Colombo
- Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Milan, Italy
| | - Emilio Benfenati
- Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Milan, Italy
| |
Collapse
|
4
|
Toropova AP, Toropov AA, Benfenati E. Semi-correlations as a tool to model for skin sensitization. Food Chem Toxicol 2021; 157:112580. [PMID: 34560179 DOI: 10.1016/j.fct.2021.112580] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2021] [Accepted: 09/20/2021] [Indexed: 01/10/2023]
Abstract
Semi-correlation specifically assesses the correlation between a binary variable and a continuous variable. Semi-correlations were applied to develop binary models for various endpoints. We applied the semi-correlation to develop models of two kinds of skin sensitization one related to animals (local lymph node assay LLNA) and one to human beings (direct peptide reactivity assay DPRA and/or human cell line activation test h-CLAT). The models refer to binary classification for a two-level strategy: the first level (analysis of all compounds) is used in the format "sensitizer or non-sensitizer", and the second level (only sensitizers) is a further classification in the format "strong or weak sensitizer". The ranges of statistical characteristics of the models depend on the endpoint, LLNA or DPRA/h-CLAT: for the first level, sensitivity: 0.69-0.88, specificity: 0.75-0.89, accuracy: 0.77-0.87, Matthew's correlation coefficient (MCC): 0.54-0.57 and for the second level, sensitivity: 0.70-1.0, specificity: 0.78-0.83, accuracy: 0.77-0.87, MCC: 0.54-0.76. Thus, the described approach can be applied to building up models of the skin sensitization potency.
Collapse
Affiliation(s)
- Alla P Toropova
- Department of Environmental Health Science, Laboratory of Environmental Chemistry and Toxicology, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Via Mario Negri 2, 20156, Milano, Italy.
| | - Andrey A Toropov
- Department of Environmental Health Science, Laboratory of Environmental Chemistry and Toxicology, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Via Mario Negri 2, 20156, Milano, Italy
| | - Emilio Benfenati
- Department of Environmental Health Science, Laboratory of Environmental Chemistry and Toxicology, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Via Mario Negri 2, 20156, Milano, Italy
| |
Collapse
|
5
|
Toropov AA, Toropova AP, Roncaglioni A, Benfenati E. The system of self-consistent semi-correlations as one of the tools of cheminformatics for designing antiviral drugs. NEW J CHEM 2021. [DOI: 10.1039/d1nj03394h] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
The development of antiviral agents against SARS-CoV-2 is necessary.
Collapse
Affiliation(s)
- Andrey A. Toropov
- Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Via Mario Negri 2, 20156 Milano, Italy
| | - Alla P. Toropova
- Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Via Mario Negri 2, 20156 Milano, Italy
| | - Alessandra Roncaglioni
- Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Via Mario Negri 2, 20156 Milano, Italy
| | - Emilio Benfenati
- Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Via Mario Negri 2, 20156 Milano, Italy
| |
Collapse
|
6
|
Toropov AA, Toropova AP. The Monte Carlo Method as a Tool to Build up Predictive QSPR/QSAR. Curr Comput Aided Drug Des 2020; 16:197-206. [DOI: 10.2174/1573409915666190328123112] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2018] [Revised: 02/15/2019] [Accepted: 03/19/2019] [Indexed: 11/22/2022]
Abstract
Background:
The Monte Carlo method has a wide application in various scientific researches.
For the development of predictive models in a form of the quantitative structure-property / activity relationships
(QSPRs/QSARs), the Monte Carlo approach also can be useful. The CORAL software provides the
Monte Carlo calculations aimed to build up QSPR/QSAR models for different endpoints.
Methods:
Molecular descriptors are a mathematical function of so-called correlation weights of various
molecular features. The numerical values of the correlation weights give the maximal value of a target
function. The target function leads to a correlation between endpoint and optimal descriptor for the visible
training set. The predictive potential of the model is estimated with the validation set, i.e. compounds that
are not involved in the process of building up the model.
Results:
The approach gave quite good models for a large number of various physicochemical, biochemical,
ecological, and medicinal endpoints. Bibliography and basic statistical characteristics of several CORAL
models are collected in the present review. In addition, the extended version of the approach for more
complex systems (nanomaterials and peptides), where behaviour of systems is defined by a group of conditions
besides the molecular structure is demonstrated.
Conclusion:
The Monte Carlo technique available via the CORAL software can be a useful and convenient
tool for the QSPR/QSAR analysis.
Collapse
Affiliation(s)
- Andrey A. Toropov
- Laboratory of Environmental Chemistry and Toxicology, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Via La Masa 19, 20156 Milan, Italy
| | - Alla P. Toropova
- Laboratory of Environmental Chemistry and Toxicology, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Via La Masa 19, 20156 Milan, Italy
| |
Collapse
|
7
|
Carnesecchi E, Toropov AA, Toropova AP, Kramer N, Svendsen C, Dorne JL, Benfenati E. Predicting acute contact toxicity of organic binary mixtures in honey bees (A. mellifera) through innovative QSAR models. THE SCIENCE OF THE TOTAL ENVIRONMENT 2020; 704:135302. [PMID: 31810690 DOI: 10.1016/j.scitotenv.2019.135302] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/18/2019] [Revised: 10/29/2019] [Accepted: 10/29/2019] [Indexed: 06/10/2023]
Abstract
Pollinators such as honey bees are of considerable importance, because of the crucial pollination services they provide for food crops and wild plants. Since bees are exposed to a wide range of multiple chemicals "mixtures" both of anthropogenic (e.g. plant protection products) and natural origin (e.g. plant toxins), understanding their combined toxicity is critical. Although honey bees are employed worldwide as surrogate species for Apis and non-Apis bees in toxicity tests, it is practically unfeasible to perform in vivo tests for all mixtures of chemicals. Therefore, Quantitative Structure-Activity Relationships (QSAR) models can be developed using available data and can provide useful tools to predict such combined toxicity. Here, three different QSAR models within the CORAL software have been calibrated and validated for honey bees (A. mellifera) to predict the acute contact mixtures potency (LD50-mix), in two regression based-models, and the nature of combined toxicity (synergism / non-synergism) in a classification-based model. Experimental data on binary mixtures (n = 123) (LD50-mix) including dose response data (n = 97) and corresponding Toxic Unit values were retrieved from EFSA databases. The models were built using the principle of extraction of attributes from SMILES (or quasi-SMILES) while calculating so-called correlation weights for these attributes using Monte Carlo techniques. The two regression models were validated for their reliability and robustness (R2 = 0.89, CCC = 0.92, Q2 = 0.81; R2 = 0.87, CCC = 0.89, Q2 = 0.75). The classification model was validated using sensitivity (=0.86), specificity (=1), accuracy (=0.96), and Matthews correlation coefficient (MCC = 0.90) as qualitative statistical validation parameters. Results indicate that these QSAR models successfully predict acute contact toxicity of binary mixtures in honey bees and can support prioritisation of multiple chemicals of concerns. Data gaps and further development of QSAR models for honey bees are highlighted particularly for chronic and sub-lethal effects.
Collapse
Affiliation(s)
- Edoardo Carnesecchi
- Laboratory of Environmental Chemistry and Toxicology, Department of Environmental Health Sciences, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Via la Masa 19, 20156 Milan, Italy; Institute for Risk Assessment Sciences (IRAS), Utrecht University, PO Box 80177, 3508 TD Utrecht, The Netherlands.
| | - Andrey A Toropov
- Laboratory of Environmental Chemistry and Toxicology, Department of Environmental Health Sciences, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Via la Masa 19, 20156 Milan, Italy
| | - Alla P Toropova
- Laboratory of Environmental Chemistry and Toxicology, Department of Environmental Health Sciences, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Via la Masa 19, 20156 Milan, Italy
| | - Nynke Kramer
- Institute for Risk Assessment Sciences (IRAS), Utrecht University, PO Box 80177, 3508 TD Utrecht, The Netherlands
| | - Claus Svendsen
- Centre for Ecology and Hydrology, Maclean Building, Benson Lane, Wallingford, Oxfordshire OX10 8BB, UK
| | - Jean Lou Dorne
- European Food Safety Authority (EFSA), Scientific Committee and Emerging Risks Unit, Via Carlo Magno 1A, 43126 Parma, Italy
| | - Emilio Benfenati
- Laboratory of Environmental Chemistry and Toxicology, Department of Environmental Health Sciences, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Via la Masa 19, 20156 Milan, Italy
| |
Collapse
|