1
|
Jayakody LN, Jin YS. In-depth understanding of molecular mechanisms of aldehyde toxicity to engineer robust Saccharomyces cerevisiae. Appl Microbiol Biotechnol 2021; 105:2675-2692. [PMID: 33743026 DOI: 10.1007/s00253-021-11213-1] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2020] [Revised: 02/17/2021] [Accepted: 02/28/2021] [Indexed: 11/25/2022]
Abstract
Aldehydes are ubiquitous electrophilic compounds that ferment microorganisms including Saccharomyces cerevisiae encounter during the fermentation processes to produce food, fuels, chemicals, and pharmaceuticals. Aldehydes pose severe toxicity to the growth and metabolism of the S. cerevisiae through a variety of toxic molecular mechanisms, predominantly via damaging macromolecules and hampering the production of targeted compounds. Compounds with aldehyde functional groups are far more toxic to S. cerevisiae than all other functional classes, and toxic potency depends on physicochemical characteristics of aldehydes. The yeast synthetic biology community established a design-build-test-learn framework to develop S. cerevisiae cell factories to valorize the sustainable and renewable biomass, including the lignin-derived substrates. However, thermochemically pretreated biomass-derived substrate streams contain diverse aldehydes (e.g., glycolaldehyde and furfural), and biological conversions routes of lignocellulosic compounds consist of toxic aldehyde intermediates (e.g., formaldehyde and methylglyoxal), and some of the high-value targeted products have aldehyde functional group (e.g., vanillin and benzaldehyde). Numerous studies comprehensively characterized both single and additive effects of aldehyde toxicity via systems biology investigations, and novel molecular approaches have been discovered to overcome the aldehyde toxicity. Based on those novel approaches, researchers successfully developed synthetic yeast cell factories to convert lignocellulosic substrates to valuable products, including aldehyde compounds. In this mini-review, we highlight the salient relationship of physicochemical characteristics and molecular toxicity of aldehydes, the molecular detoxification and macromolecules protection mechanisms of aldehydes, and the advances of engineering robust S. cerevisiae against complex mixtures of aldehyde inhibitors. KEY POINTS: • We reviewed structure-activity relationships of aldehyde toxicity on S. cerevisiae. • Two-tier protection mechanisms to alleviate aldehyde toxicity are presented. • We highlighted the strategies to overcome the synergistic toxicity of aldehydes.
Collapse
Affiliation(s)
- Lahiru N Jayakody
- School of Biological Sciences, Southern Illinois University Carbondale, Carbondale, IL, USA.
- Fermentation Science Institute, Southern Illinois University Carbondale, Carbondale, IL, USA.
| | - Yong-Su Jin
- Department of Food Science and Human Nutrition, University of Illinois at Urbana-Champaign, Urbana, IL, USA
- Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| |
Collapse
|
2
|
Khan K, Baderna D, Cappelli C, Toma C, Lombardo A, Roy K, Benfenati E. Ecotoxicological QSAR modeling of organic compounds against fish: Application of fragment based descriptors in feature analysis. AQUATIC TOXICOLOGY (AMSTERDAM, NETHERLANDS) 2019; 212:162-174. [PMID: 31128417 DOI: 10.1016/j.aquatox.2019.05.011] [Citation(s) in RCA: 30] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/18/2019] [Revised: 05/16/2019] [Accepted: 05/16/2019] [Indexed: 06/09/2023]
Abstract
Organic compounds (OCs) constitute an enormously large class of highly persistent and toxic chemicals widely used for various purposes throughout the world. Their increased detection in water bodies, mainly sewage treatment plants via industrial discharge, has rendered them to become a cause for ecological concern. The limited availability of experimental toxicological data has necessitated development of models that can help us identify the most hazardous and potentially toxic compounds thus prioritizing the experiments on the selected chemicals. Computational tools such as quantitative structure-activity relationship (QSAR) can be used to predict the missing data and classify the chemicals based on their acute predicted responses for existing as well as not yet synthesized chemicals. In the current study, novel, externally validated, highly robust local QSAR models for different chemical classes and moderately robust global QSAR models were developed using partial least squares (PLS) regression technique using a large dataset of 1121 OCs for the fish mortality endpoint. For feature selection, genetic algorithm along with stepwise regression was used while model validation was performed using various stringent validation criteria following the strict rules of OECD guidelines of QSAR validation. The variables included in the models were obtained from simplex representation of molecular structures (SiRMS) (Version 4.1.2.270), Dragon (Version 7.0) and PaDEL-descriptor software (Version 2.20). The final developed models were robust, externally predictive and characterized by a large chemical as well as biological domain. The predictive efficiency of the developed models was then compared with the ECOSAR tool in order to justify the applicability of the developed models in ecotoxicological predictions for organic chemicals. Better predictive efficiency of the developed QSAR models compared to the ECOSAR derived predictions signifies their applicability in early risk assessment of known as well as untested chemicals in order to design safer alternatives to the environment.
Collapse
Affiliation(s)
- Kabiruddin Khan
- Drug Theoretics and Cheminformatics Laboratory, Department of Pharmaceutical Technology, Jadavpur University, 188 Raja S C Mullick Road, 700032, Kolkata, India
| | - Diego Baderna
- Laboratory of Environmental Chemistry and Toxicology, Istituto Di Ricerche Farmacologiche Mario Negri IRCCS, Via La Masa, 19, 20156, Milano, Italy
| | - Claudia Cappelli
- Laboratory of Environmental Chemistry and Toxicology, Istituto Di Ricerche Farmacologiche Mario Negri IRCCS, Via La Masa, 19, 20156, Milano, Italy
| | - Cosimo Toma
- Laboratory of Environmental Chemistry and Toxicology, Istituto Di Ricerche Farmacologiche Mario Negri IRCCS, Via La Masa, 19, 20156, Milano, Italy
| | - Anna Lombardo
- Laboratory of Environmental Chemistry and Toxicology, Istituto Di Ricerche Farmacologiche Mario Negri IRCCS, Via La Masa, 19, 20156, Milano, Italy
| | - Kunal Roy
- Drug Theoretics and Cheminformatics Laboratory, Department of Pharmaceutical Technology, Jadavpur University, 188 Raja S C Mullick Road, 700032, Kolkata, India; Laboratory of Environmental Chemistry and Toxicology, Istituto Di Ricerche Farmacologiche Mario Negri IRCCS, Via La Masa, 19, 20156, Milano, Italy.
| | - Emilio Benfenati
- Drug Theoretics and Cheminformatics Laboratory, Department of Pharmaceutical Technology, Jadavpur University, 188 Raja S C Mullick Road, 700032, Kolkata, India.
| |
Collapse
|
3
|
Li F, Fan D, Wang H, Yang H, Li W, Tang Y, Liu G. In silico prediction of pesticide aquatic toxicity with chemical category approaches. Toxicol Res (Camb) 2017; 6:831-842. [PMID: 30090546 PMCID: PMC6062408 DOI: 10.1039/c7tx00144d] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2017] [Accepted: 07/27/2017] [Indexed: 01/03/2023] Open
Abstract
Aquatic toxicity is an important issue in pesticide development. In this study, using nine molecular fingerprints to describe pesticides, binary and ternary classification models were constructed to predict aquatic toxicity of pesticides via six machine learning methods: Naïve Bayes (NB), Artificial Neural Network (ANN), k-Nearest Neighbor (kNN), Classification Tree (CT), Random Forest (RF) and Support Vector Machine (SVM). For the binary models, local models were obtained with 829 pesticides on rainbow trout (RT) and 151 pesticides on lepomis (LP), and global models were constructed on the basis of 1258 diverse pesticides on RT and LP and 278 on other fish species. After analyzing the local binary models, we found that fish species caused influence in terms of accuracy. Considering the data size and predictive range, the 1258 pesticides were also used to build global ternary models. The best local binary models were Maccs_ANN for RT and Maccs_SVM for LP, which exhibited accuracies of 0.90 and 0.90, respectively. For global binary models, the best model was Graph_SVM with an accuracy of 0.89. Accuracy of the best global ternary model Graph_SVM was 0.81, which was a little lower than that of the best global binary model. In addition, several substructural alerts were identified including nitrobenzene, chloroalkene and nitrile, which could significantly correlate with pesticide aquatic toxicity. This study provides a useful tool for an early evaluation of pesticide aquatic toxicity in environmental risk assessment.
Collapse
Affiliation(s)
- Fuxing Li
- Shanghai Key Laboratory of New Drug Design , School of Pharmacy , East China University of Science and Technology , Shanghai 200237 , China . ; ; ; Tel: +86-21-64250811
| | - Defang Fan
- Shanghai Key Laboratory of New Drug Design , School of Pharmacy , East China University of Science and Technology , Shanghai 200237 , China . ; ; ; Tel: +86-21-64250811
| | - Hao Wang
- Shanghai Key Laboratory of New Drug Design , School of Pharmacy , East China University of Science and Technology , Shanghai 200237 , China . ; ; ; Tel: +86-21-64250811
| | - Hongbin Yang
- Shanghai Key Laboratory of New Drug Design , School of Pharmacy , East China University of Science and Technology , Shanghai 200237 , China . ; ; ; Tel: +86-21-64250811
| | - Weihua Li
- Shanghai Key Laboratory of New Drug Design , School of Pharmacy , East China University of Science and Technology , Shanghai 200237 , China . ; ; ; Tel: +86-21-64250811
| | - Yun Tang
- Shanghai Key Laboratory of New Drug Design , School of Pharmacy , East China University of Science and Technology , Shanghai 200237 , China . ; ; ; Tel: +86-21-64250811
| | - Guixia Liu
- Shanghai Key Laboratory of New Drug Design , School of Pharmacy , East China University of Science and Technology , Shanghai 200237 , China . ; ; ; Tel: +86-21-64250811
| |
Collapse
|
4
|
Basant N, Gupta S, Singh KP. Modeling the toxicity of chemical pesticides in multiple test species using local and global QSTR approaches. Toxicol Res (Camb) 2016; 5:340-353. [PMID: 30090350 PMCID: PMC6060685 DOI: 10.1039/c5tx00321k] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2015] [Accepted: 11/18/2015] [Indexed: 01/10/2023] Open
Abstract
The safety assessment processes require the toxicity data of chemicals in multiple test species and thus, emphasize the need for computational methods capable of toxicity prediction in multiple test species. Pesticides are designed toxic substances and find extensive applications worldwide. In this study, we have established local and global QSTR (quantitative structure-toxicity relationship) and ISC QSAAR (interspecies correlation quantitative structure activity-activity relationship) models for predicting the toxicities of pesticides in multiple aquatic test species using the toxicity data in crustacean (Daphnia magna, Americamysis bahia, Gammarus fasciatus, and Penaeus duorarum) and fish (Oncorhynchus mykiss and Lepomis macrochirus) species in accordance with the OECD guidelines. The ensemble learning based QSTR models (decision tree forest, DTF and decision tree boost, DTB) were constructed and validated using several statistical coefficients derived on the test data. In all the QSTR and QSAAR models, Log P was an important predictor. The constructed local, global and interspecies QSAAR models yielded high correlations (R2) of >0.941; >0.943 and >0.826, respectively between the measured and model predicted endpoint toxicity values in the test data. The performances of the local and global QSTR models were comparable. Furthermore, the chemical applicability domains of these QSTR/QSAAR models were determined using the leverage and standardization approaches. The results suggest for the appropriateness of the developed QSTR/QSAAR models to reliably predict the aquatic toxicity of structurally diverse pesticides in multiple test species and can be used for the screening and prioritization of new pesticides.
Collapse
Affiliation(s)
| | - Shikha Gupta
- Environmental Chemistry Division , CSIR-Indian Institute of Toxicology Research , Post Box 80 , Mahatma Gandhi Marg , Lucknow-226 001 , India . ; ; ; Tel: +91-522-2476091
| | - Kunwar P Singh
- Environmental Chemistry Division , CSIR-Indian Institute of Toxicology Research , Post Box 80 , Mahatma Gandhi Marg , Lucknow-226 001 , India . ; ; ; Tel: +91-522-2476091
| |
Collapse
|
5
|
Sun L, Zhang C, Chen Y, Li X, Zhuang S, Li W, Liu G, Lee PW, Tang Y. In silico prediction of chemical aquatic toxicity with chemical category approaches and substructural alerts. Toxicol Res (Camb) 2015. [DOI: 10.1039/c4tx00174e] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
Aquatic toxicity is an important endpoint in the evaluation of chemically adverse effects on ecosystems.
Collapse
Affiliation(s)
- Lu Sun
- Shanghai Key Laboratory of New Drug Design
- School of Pharmacy
- East China University of Science and Technology
- Shanghai 200237
- China
| | - Chen Zhang
- Shanghai Key Laboratory of New Drug Design
- School of Pharmacy
- East China University of Science and Technology
- Shanghai 200237
- China
| | - Yingjie Chen
- Shanghai Key Laboratory of New Drug Design
- School of Pharmacy
- East China University of Science and Technology
- Shanghai 200237
- China
| | - Xiao Li
- Shanghai Key Laboratory of New Drug Design
- School of Pharmacy
- East China University of Science and Technology
- Shanghai 200237
- China
| | - Shulin Zhuang
- College of Environmental and Resource Sciences
- Zhejiang University
- Hangzhou 310058
- China
| | - Weihua Li
- Shanghai Key Laboratory of New Drug Design
- School of Pharmacy
- East China University of Science and Technology
- Shanghai 200237
- China
| | - Guixia Liu
- Shanghai Key Laboratory of New Drug Design
- School of Pharmacy
- East China University of Science and Technology
- Shanghai 200237
- China
| | - Philip W. Lee
- Shanghai Key Laboratory of New Drug Design
- School of Pharmacy
- East China University of Science and Technology
- Shanghai 200237
- China
| | - Yun Tang
- Shanghai Key Laboratory of New Drug Design
- School of Pharmacy
- East China University of Science and Technology
- Shanghai 200237
- China
| |
Collapse
|
6
|
Abstract
AbstractAbstract The CORAL software (http://www.insilico.eu/coral/) has been examined as a tool for modeling anti-HIV-1 activity by quantitative structure — activity relationships (QSAR) for three different sets: (i) TIBO derivatives (n=82) (ii) anti-HIV-1 activity of 2-amino-6-arylsulfonylbenzonitriles and their congeners (n=64), and (iii) the measured binding affinity for fullerene-based HIV-1 PR inhibitors (n=48). A new global invariant ATOMPAIR of the molecular structure which can be calculated with the simplified molecular input line entry system (SMILES) was studied. The ATOMPAIR is an indicator of the joint presence of pairs of chemical elements (F, Cl, Br, N, O, S, and P) and three types of bonds (double covalent bond, triple covalent bond, and stereo chemical bond). Six random splits into sub-training, calibration, and test set were examined for each set. For the three aforementioned sets, the use of ATOMPAIR in the modeling process improves the predictive potential of the models for six random splits. Graphical abstract
Collapse
|
7
|
Toropova AP, Toropov AA, Lombardo A, Roncaglioni A, Benfenati E, Gini G. Coral: QSAR models for acute toxicity in fathead minnow (Pimephales promelas). J Comput Chem 2012; 33:1218-23. [DOI: 10.1002/jcc.22953] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2011] [Revised: 10/17/2011] [Accepted: 01/13/2012] [Indexed: 11/09/2022]
|
8
|
Wang Y, Zheng M, Xiao J, Lu Y, Wang F, Lu J, Luo X, Zhu W, Jianga H, Chen K. Using support vector regression coupled with the genetic algorithm for predicting acute toxicity to the fathead minnow. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2010; 21:559-570. [PMID: 20818588 DOI: 10.1080/1062936x.2010.502300] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/29/2023]
Abstract
The potential toxicity of chemicals may present adverse effects to the environment and human health. The quantitative structure-activity relationship (QSAR) provides a useful method for hazard assessment. In this study, we constructed a QSAR model based on a highly heterogeneous data set of 571 compounds from the US Environmental Protection Agency, for predicting acute toxicity to the fathead minnow (Pimephales promelas). An approach coupling support vector regression (SVR) with the genetic algorithm (GA) was developed to build the model. The generated QSAR model showed excellent data fitting and prediction abilities: the squared correlation coefficients (r(2)) for the training set and the test set were 0.826 and 0.802, respectively. Only eight critical descriptors, most of which are closely related to the toxicity mechanism, were chosen by GA-SVR, making the derived model readily interpretable. In summary, the successful case reported here highlights that our GA-SVR approach can be used as a general machine learning method for toxicity prediction.
Collapse
Affiliation(s)
- Y Wang
- Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, China
| | | | | | | | | | | | | | | | | | | |
Collapse
|
9
|
Toropova AP, Toropov AA, Benfenati E, Gini G, Leszczynska D, Leszczynski J. CORAL: QSPR models for solubility of [C60] and [C70] fullerene derivatives. Mol Divers 2010; 15:249-56. [PMID: 20349134 DOI: 10.1007/s11030-010-9245-6] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2010] [Accepted: 03/02/2010] [Indexed: 01/29/2023]
Abstract
Quantitative structure-property relationships (QSPRs) between the molecular structure of [C(60)] and [C(70)] fullerene derivatives and their solubility in chlorobenzene (mg/mL) have been established by means of CORAL (CORrelations And Logic) freeware. The CORAL models are based on representation of the molecular structure by simplified molecular input line entry system (SMILES). Three random splits into the training and the external validation sets have been examined. The ranges of statistical characteristics of these models are as follows: n = 18, r (2) = 0.748-0.815, s = 15.1 -17.5 (mg/mL), F = 47-71 (training set); n = 9, r (2) = 0.806-0.936, s = 12.5-17.5 (mg/mL), F = 29-103 (validation set).
Collapse
Affiliation(s)
- Alla P Toropova
- Istituto di Ricerche Farmacologiche Mario Negri, Via La Masa 19, 20156, Milano, Italy
| | | | | | | | | | | |
Collapse
|
10
|
Toropov AA, Toropova AP, Benfenati E. QSAR-modeling of toxicity of organometallic compounds by means of the balance of correlations for InChI-based optimal descriptors. Mol Divers 2009; 14:183-92. [DOI: 10.1007/s11030-009-9156-6] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2009] [Accepted: 04/27/2009] [Indexed: 10/20/2022]
|
11
|
|
12
|
Toropov AA, Rasulev BF, Leszczynski J. QSAR modeling of acute toxicity by balance of correlations. Bioorg Med Chem 2008; 16:5999-6008. [DOI: 10.1016/j.bmc.2008.04.055] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2007] [Accepted: 04/23/2008] [Indexed: 10/22/2022]
|
13
|
Toropov A, Rasulev B, Leszczynski J. QSAR Modeling of Acute Toxicity for Nitrobenzene Derivatives Towards Rats: Comparative Analysis by MLRA and Optimal Descriptors. ACTA ACUST UNITED AC 2007. [DOI: 10.1002/qsar.200610135] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
14
|
Roy K, Toropov A, Raska I. QSAR Modeling of PeripheralVersus Central Benzodiazepine Receptor Binding Affinity of 2-Phenylimidazo[1,2-a]pyridineacetamides using Optimal Descriptors Calculated with SMILES. ACTA ACUST UNITED AC 2007. [DOI: 10.1002/qsar.200630072] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
|
15
|
Toropov AA, Benfenati E. SMILES as an alternative to the graph in QSAR modelling of bee toxicity. Comput Biol Chem 2007; 31:57-60. [PMID: 17275412 DOI: 10.1016/j.compbiolchem.2007.01.003] [Citation(s) in RCA: 42] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2006] [Accepted: 01/02/2007] [Indexed: 11/30/2022]
Abstract
Simplified Molecular Input Line Entry System (SMILES) nomenclature has been used as elucidating the molecular structure in construction of the quantitative structure-activity relationships (QSAR) for predicting bee toxicity. On the basis of the symbols used in the SMILES notation numerical parameters have been obtained, which are simple and fast to calculate. The method has been used to develop a QSAR model to predict toxicity of pesticides on bees. Results on a heterogeneous set of pesticides are good. Statistical characteristics of this model are: n=85, R2=0.68, s=0.82, F=180 (training set); n=20, R2=0.72, s=0.68, F=46 (test set).
Collapse
Affiliation(s)
- Andrey A Toropov
- Laboratory of Environmental Chemistry and Toxicology, Istituto di Ricerche Farmacologiche Mario Negri, Via Eritrea 62, 20157 Milan, Italy.
| | | |
Collapse
|
16
|
Toropov AA, Benfenati E. Optimisation of correlation weights of SMILES invariants for modelling oral quail toxicity. Eur J Med Chem 2006; 42:606-13. [PMID: 17218040 DOI: 10.1016/j.ejmech.2006.11.018] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2006] [Accepted: 11/30/2006] [Indexed: 11/29/2022]
Abstract
The SMILES (simplified molecular input line entry system) nomenclature was used to elucidate the molecular structure in constructing the quantitative structure-property/activity relationships (QSPR/QSAR) for predicting quail toxicity after oral exposure. The presence of chemical elements in different electronic states (e.g., C, c, O, o, Cl, Br, etc.) and of different covalent bonds (i.e., -,=, and #) was used as local invariants. Combinations of different presence/absence codes for local features of the SMILES were used as global invariants. The statistical characteristics of this model are n=97, r(2)=0.755, s=0.445, F=293 (training set); n=18, r(2)=0.731, s=0.587, F=43 (test set).
Collapse
Affiliation(s)
- Andrey A Toropov
- Laboratory of Environmental Chemistry and Toxicology, Istituto di Ricerche Farmacologiche Mario Negri, Via Eritrea 62, Milan, Italy.
| | | |
Collapse
|
17
|
Raska I, Toropov A. Comparison of QSPR models of octanol/water partition coefficient for vitamins and non vitamins. Eur J Med Chem 2006; 41:1271-8. [PMID: 16920228 DOI: 10.1016/j.ejmech.2006.06.006] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2005] [Accepted: 06/27/2006] [Indexed: 11/17/2022]
Abstract
Comparison of QSPR models for vitamins and for various substances, which are not vitamins indicates that vitamins have less number of molecular features (topologic and kinds of atomic orbitals), but, most probably, more complex mechanisms of interactions with molecules of water and octanol.
Collapse
Affiliation(s)
- I Raska
- 3rd Medical Department, 1st Faculty of Medicine, Charles University in Prague, U Nemocnice 1, 12808 Prague 2, Czech Republic
| | | |
Collapse
|
18
|
Cruz-Monteagudo M, González-Díaz H, Uriarte E. Simple Stochastic Fingerprints Towards Mathematical Modeling in Biology and Medicine 2. Unifying Markov Model for Drugs Side Effects. Bull Math Biol 2006; 68:1527-54. [PMID: 16847720 DOI: 10.1007/s11538-005-9013-4] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2005] [Accepted: 05/09/2005] [Indexed: 10/24/2022]
Abstract
Most of present mathematical models for biological activity consider just the molecular structure. In the present article we pretend extending the use of Markov chain models to define novel molecular descriptors, which consider in addition other parameters like target site or biological effect. Specifically, this mathematical model takes into consideration not only the molecular structure but the specific biological system the drug affects too. Herein, a general Markov model is developed that describes 19 different drugs side effects grouped in eight affected biological systems for 178 drugs, being 270 cases finally. The data was processed by linear discriminant analysis (LDA) classifying drugs according to their specific side effects, forward stepwise was fixed as strategy for variables selection. The average percentage of good classification and number of compounds used in the training/predicting sets were 100/95.8% for endocrine manifestations, (18 out of 18)/(13 out of 14); 90.5/92.3% for gastrointestinal manifestations, (38 out of 42)/(30 out of 32); 88.5/86.5% for systemic phenomena, (23 out of 26)/(17 out of 20); 81.8/77.3% for neurological manifestations, (27 out of 33)/(19 out of 25); 81.6/86.2% for dermal manifestations, (31 out of 38)/(25 out of 29); 78.4/85.1% for cardiovascular manifestation, (29 out of 37)/(24 out of 28); 77.1/75.7% for breathing manifestations, (27 out of 35)/(20 out of 26) and 75.6/75% for psychiatric manifestations, (31 out of 41)/(23 out of 31). Additionally a back-projection analysis (BPA) was carried out for two ulcerogenic drugs to prove in structural terms the physical interpretation of the models obtained. This article develops a mathematical model that encompasses a large number of drugs side effects grouped in specifics biological systems using stochastic absolute probabilities of interaction ((A)pi(k)(j)) by the first time.
Collapse
Affiliation(s)
- Maykel Cruz-Monteagudo
- Applied Chemistry Research Center and Chemical Bioactives Center, Central University of Las Villas, Santa Clara, 54830, Cuba
| | | | | |
Collapse
|
19
|
Toropov AA, Benfenati E. Correlation weighting of valence shells in QSAR analysis of toxicity. Bioorg Med Chem 2006; 14:3923-8. [PMID: 16460943 DOI: 10.1016/j.bmc.2006.01.038] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2005] [Revised: 01/18/2006] [Accepted: 01/20/2006] [Indexed: 11/15/2022]
Abstract
In the rainbow trout (Oncorhynchus mykiss), we studied the acute toxicity LC(50)-96 h of 274 organic pesticides with a wide variety of molecular structures. Optimization of correlation weights of local and global graph invariants (OCWLGI) gave quantitative structure-activity relationships (QSARs) for predicting toxicity. We used a labeled hydrogen-filled graph (LHFG) to elucidate the molecular structure. We also used the extended connectivity of zero ((0)EC(k)), first ((1)EC(k)), and second ((2)EC(k)) order, numbers of path lengths 2 (P2(k)) and 3 (P3(k)) starting from a given vertex in the LHFG, and valence shells of second order (S2(k)). S2(k) is the sum of the degree of vertices at distance 2 from a given vertex k. The presence of three-, five-, and six-member cycles and hydrogen bond indices suggested they might be used as global LHFG invariants. We applied this method to a broad set of pesticides, to predict toxicity for the trout. The best model used weighted S2(k) and global LHFG invariants. Statistical characteristics of this model are as follows: n=233, r(2)=0.7689, r(2)(pred)=0.7688, s=0.75, F=769 (training set); n=41, r(2)=0.6421, r(2)(pred)=0.4241, s=1.14, F=70 (test set).
Collapse
Affiliation(s)
- Andrey A Toropov
- Laboratory of Environmental Chemistry and Toxicology, Istituto di Ricerche Farmacologiche 'Mario Negri', Via Eritrea 62, 20157 Milan, Italy.
| | | |
Collapse
|
20
|
Vracko M, Bandelj V, Barbieri P, Benfenati E, Chaudhry Q, Cronin M, Devillers J, Gallegos A, Gini G, Gramatica P, Helma C, Mazzatorta P, Neagu D, Netzeva T, Pavan M, Patlewicz G, Randić M, Tsakovska I, Worth A. Validation of counter propagation neural network models for predictive toxicology according to the OECD principles: a case study. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2006; 17:265-84. [PMID: 16815767 DOI: 10.1080/10659360600787650] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/10/2023]
Abstract
The OECD has proposed five principles for validation of QSAR models used for regulatory purposes. Here we present a case study investigating how these principles can be applied to models based on Kohonen and counter propagation neural networks. The study is based on a counter propagation network model that has been built using toxicity data in fish fathead minnow for 541 compounds. The study demonstrates that most, if not all, of the OECD criteria may be met when modeling using this neural network approach.
Collapse
Affiliation(s)
- M Vracko
- European Chemical Beaureau, Institute for Health and Consumer Protection, European Commission Joint Research Centre, 21020 Ispra, Italy.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
21
|
Toropov AA, Benfenati E. QSAR models of quail dietary toxicity based on the graph of atomic orbitals. Bioorg Med Chem Lett 2006; 16:1941-3. [PMID: 16442289 DOI: 10.1016/j.bmcl.2005.12.085] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2005] [Revised: 12/21/2005] [Accepted: 12/21/2005] [Indexed: 11/22/2022]
Abstract
Graphs of atomic orbitals (GAOs) have been used to represent molecular structures. We describe rules to convert the labelled hydrogen-filled graphs (LHFGs) into GAOs. The GAO is one possible way of taking account of the structure of atoms (i.e., atomic orbitals, such as 1s(1), 2p(2) and 3d(10)) for QSPR/QSAR analyses. Optimization of correlation weights of local invariants (OCWLI) of the LHFGs and the GAOs was used to obtain a method of quail dietary toxicity modelling. Statistical characteristics of the models based on the OCWLI of GAO are better than those based on the OCWLI of the LHFGs.
Collapse
Affiliation(s)
- Andrey A Toropov
- Laboratory of Environmental Chemistry and Toxicology, Istituto di Ricerche Farmacologiche Mario Negri, Via Eritrea 62, 20157 Milan, Italy.
| | | |
Collapse
|
22
|
Toropov AA, Benfenati E. QSAR models for Daphnia toxicity of pesticides based on combinations of topological parameters of molecular structures. Bioorg Med Chem 2005; 14:2779-88. [PMID: 16377200 DOI: 10.1016/j.bmc.2005.11.060] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2005] [Revised: 11/29/2005] [Accepted: 11/29/2005] [Indexed: 11/17/2022]
Abstract
A topological parameter is defined as an integer value of a given local or global invariant of a molecular graph. We examined three types of local graph invariants, the vertex degrees (0EC), the extended connectivity of first order (1EC), and the numbers of paths of length two (P2), as elementary invariants for construction of quantitative structure-activity relationships (QSAR). We also examined combined invariants, obtained by multiplying one of these three elementary types with another (i.e., [0EC.1EC], [0EC.P2], and [1EC.P2]), as graph invariants. Finally, global invariants were used in the QSAR analyses, codifying the presence and nature of cycles in the molecular structures under consideration. We used the correlation weights of these invariants to obtain optimal descriptors. These descriptors have been used in one-variable models to predict toxicity toward Daphnia magna for a set of pesticides. Statistical characteristics of the best model, based on the correlation weight of local topological parameters (the [0EC.P2]) together with the global topological parameters, are the following: n=220, r2=0.7822, s=0.849, F=783 (training set); n=42, r2=0.7388, s=0.941, F=113 (test set). The role of these topological parameters is discussed.
Collapse
Affiliation(s)
- Andrey A Toropov
- Laboratory of Environmental Chemistry and Toxicology, Istituto di Ricerche Farmacologiche Mario Negri, Via Eritrea 62, 20157 Milan, Italy.
| | | |
Collapse
|
23
|
Cruz-Monteagudo M, González-Díaz H. Unified drug–target interaction thermodynamic Markov model using stochastic entropies to predict multiple drugs side effects. Eur J Med Chem 2005; 40:1030-41. [PMID: 15951063 DOI: 10.1016/j.ejmech.2005.04.012] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2005] [Revised: 04/12/2005] [Accepted: 04/27/2005] [Indexed: 11/26/2022]
Abstract
Most of present molecular descriptors consider just the molecular structure. In the present article we pretend extending the use of Markov chain (MC) models to define novel molecular descriptors, which consider in addition other parameters like target site or toxic effect. Specifically, this molecular descriptor takes into consideration not only the molecular structure but the specific system the drug affects too. Herein, it is developed a general Markov model that describes 21 different drugs side effects grouped in 10 affected biological systems for 193 drugs, being 311 cases finally. The data were processed by linear discriminant analysis (LDA) classifying drugs according to their specific side effects, forward stepwise was fixed as strategy for variables selection. The average percentage of good classification and number of compounds used in the training/predicting sets were 92.6/91.7% for cardiovascular manifestation (25 out of 27)/(18 out of 20); 89.3/83.9% for dermal manifestations (25 out of 18)/(18 out of 21); 88.9/88.9% for endocrine manifestations (16 out of 18)/(12 out of 14); 88.9/88.2% for psychiatric manifestations (32 out of 36)/(24 out of 27); 88.5/85.6% for systemic phenomena (23 out of 26)/(17 out of 20); 85.7/91.7% for gastrointestinal manifestations (36 out of 42)/(29 out of 32); 83.3/79.2% for metabolic manifestations (15 out of 18)/(11 out of 14); 81.8/78.0% for neurological manifestations (27 out of 33)/(20 out of 25); 75.0/74.0% for hematological manifestations (36 out of 48)/(27 out of 36) and 74.3/72.8% for breathing manifestations (26 out of 35)/(19 out of 26). Finally, application of back-projection analysis (BPA) provides physic interpretation in structural terms through molecular graphics of the toxic effects predicted with these QSTR models. This article develops a mathematical model that encompasses a large number of drugs side effects grouped in specifics systems using stochastic entropies of interaction (Thetak (j)) by the first time.
Collapse
Affiliation(s)
- Maykel Cruz-Monteagudo
- Applied Chemistry Research Center, Central University of Las Villas, Santa Clara 54830, Cuba.
| | | |
Collapse
|
24
|
González-Díaz H, Cruz-Monteagudo M, Molina R, Tenorio E, Uriarte E. Predicting multiple drugs side effects with a general drug-target interaction thermodynamic Markov model. Bioorg Med Chem 2005; 13:1119-29. [PMID: 15670920 DOI: 10.1016/j.bmc.2004.11.030] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2004] [Revised: 11/09/2004] [Accepted: 11/12/2004] [Indexed: 10/26/2022]
Abstract
Most of present molecular descriptors just consider the molecular structure. In the present article we pretend extending the use of Markov chain models to define novel molecular descriptors, which consider in addition to molecular structure other parameters like target site or toxic effect. Specifically, this molecular descriptor takes into consideration not only the molecular structure but the specific system the drug affects too. Herein, it is developed a general Markov model that describes 39 different drugs side effects grouped in 11 affected systems for 301 drugs, being 686 cases finally. The data was processed by linear discriminant analysis (LDA) classifying drugs according to their specific side effects, forward stepwise was fixed as strategy for variables selection. The average percentage of good classification and number of compounds used in the training/predicting sets were 100/100% for systemic phenomena (47 out of 47)/(12 out of 12) and metabolic (18 out of 18)/(5 out of 5), muscular-skeletal (23 out of 23)/(6 out of 6) and neurological manifestations (33 out of 33)/(8 out of 8); 97.6/96.7% for cardiovascular manifestation (122 out of 125)/(30 out of 31); 97.1/97.5% for breathing manifestations (34 out of 35)/(8 out of 9); 97/99.4% for gastrointestinal manifestations (159 out of 164)/(40 out of 41); 97/95% for endocrine manifestations (32 out of 33)/(7 out of 8); 96.4/94.6% for psychiatric manifestations (53 out of 55)/(13 out of 14); 95.1/99.1% for hematological manifestations (98 out of 103)/(25 out of 26) and 88/92.3% for dermal manifestations (44 out of 50)/(12 out of 13). In addition, we report preliminary experimental reversible decrease of lymphocytes differential count after administration of the antibacterial drug G-1 in mice, which coincide with a posterior probability (P%=74.91) predicted by the model. This article develops a model that encompasses a large number of side effects grouped in specific organ systems in a single stochastic framework for the first time.
Collapse
Affiliation(s)
- Humberto González-Díaz
- Department of Organic Chemistry, Faculty of Pharmacy, University of Santiago de Compostela 15782, Spain
| | | | | | | | | |
Collapse
|
25
|
Modelling Aquatic Toxicity with Advanced Computational Techniques: Procedures to Standardize Data and Compare Models. KNOWLEDGE EXPLORATION IN LIFE SCIENCE INFORMATICS 2004. [DOI: 10.1007/978-3-540-30478-4_20] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
|