1
|
Zhang Y, Yu W, Zhou S, Xiao J, Zhang X, Yang H, Zhang J. Finding key genes (UBE2T, KIF4A, CDCA3, and CDCA5) co-expressed in hepatitis, cirrhosis and hepatocellular carcinoma based on multiple bioinformatics techniques. BMC Gastroenterol 2024; 24:205. [PMID: 38890649 PMCID: PMC11184838 DOI: 10.1186/s12876-024-03288-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/21/2023] [Accepted: 06/07/2024] [Indexed: 06/20/2024] Open
Abstract
BACKGROUND Hepatocellular carcinoma (HCC) is one of the most common cancers worldwide. Hepatitis B virus (HBV) is one of the major causes of liver cirrhosis (LC) and HCC. Therefore, the discovery of common markers for hepatitis B or LC and HCC is crucial for the prevention of HCC. METHODS Expressed genes for to chronic active hepaititis B (CAH-B), LC and HCC were obtained from the GEO and TCGA databases, and co-expressed genes were screened using Protein-protein interaction (PPI) networks, least absolute shrinkage and selection operator (LASSO), random forest (RF) and support vector machine - recursive feature elimination (SVM-RFE). The prognostic value of genes was assessed using Kaplan-Meier (KM) survival curves. Columnar line plots, calibration curves and receiver operating characteristic (ROC) curves of individual genes were used for evaluation. Validation was performed using GEO datasets. The association of these key genes with HCC clinical features was explored using the UALCAN database ( https://ualcan.path.uab.edu/index.html ). RESULTS Based on WGCNA analysis and TCGA database, the co-expressed genes (565) were screened. Moreover, the five algorithms of MCODE (ClusteringCoefficient, MCC, Degree, MNC, and DMNC) was used to select one of the most important and most closely linked clusters (the top 50 genes ranked). Using, LASSO regression model, RF model and SVM-RFE model, four key genes (UBE2T, KIF4A, CDCA3, and CDCA5) were identified for subsequent research analysis. These 4 genes were highly expressed and associated with poor prognosis and clinical features in HCC patients. CONCLUSION These four key genes (UBE2T, KIF4A, CDCA3, and CDCA5) may be common biomarkers for CAH-B and HCC or LC and HCC, promising to advance our understanding of the molecular basis of CAH-B/LC/HCC progression.
Collapse
Affiliation(s)
- Yingai Zhang
- Central Laboratory, Affiliated Haikou Hospital of Xiangya Medical College, Central South University, No.43 Renmin Road, Haikou, Hainan, 570208, China
- School of Life Sciences, Hainan University, No.58 Renmin Road, Haikou, Hainan, 570228, China
| | - Weiling Yu
- Department of Chemotherapy, Affiliated Haikou Hospital of Xiangya Medical College, Central South University, No.43 Renmin Road, Haikou, Hainan, 570208, China
| | - Shuai Zhou
- Hepatobiliary surgery, Affiliated Haikou Hospital of Xiangya Medical College, Central South University, No.43 Renmin Road, Haikou, Hainan, 570208, China
| | - Jingchuan Xiao
- Central Laboratory, Affiliated Haikou Hospital of Xiangya Medical College, Central South University, No.43 Renmin Road, Haikou, Hainan, 570208, China
| | - Xiaoyu Zhang
- Hepatobiliary surgery, Affiliated Haikou Hospital of Xiangya Medical College, Central South University, No.43 Renmin Road, Haikou, Hainan, 570208, China
| | - Haoliang Yang
- Central Laboratory, Affiliated Haikou Hospital of Xiangya Medical College, Central South University, No.43 Renmin Road, Haikou, Hainan, 570208, China
| | - Jianquan Zhang
- Hepatobiliary surgery, Affiliated Haikou Hospital of Xiangya Medical College, Central South University, No.43 Renmin Road, Haikou, Hainan, 570208, China.
| |
Collapse
|
2
|
Chen C, Yang B, Li M, Huang S, Huang X. Quantitative structure-activity relationship predicting toxicity of pesticides towards Daphnia magna. ECOTOXICOLOGY (LONDON, ENGLAND) 2024:10.1007/s10646-024-02751-1. [PMID: 38592644 DOI: 10.1007/s10646-024-02751-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Accepted: 03/31/2024] [Indexed: 04/10/2024]
Abstract
Global pesticide usage reaching 2.7 million metric tons annually, brings a grave threat to non-target organisms, especially aquatic organisms, resulting in serious concerns. Predicting aquatic toxicity of pesticides towards Daphnia magna is significant. In this work, random forest (RF) algorithm, together with ten Dragon molecular descriptors, was successfully utilized to develop a quantitative structure-activity/toxicity relationship (QSAR/QSTR) model for the toxicity pEC50 of 745 pesticides towards Daphnia magna. The optimal QSTR model (RF Model I) based on the RF parameters of ntree = 50, mtry = 3 and nodesize = 5, yielded R2 = 0.877, MAE = 0.570, rms = 0.739 (training set of 596 pEC50), R2 = 0.807, MAE = 0.732, rms = 0.902 (test set of 149 pEC50), and R2 = 0.863, MAE = 0.602, rms = 0.774 (total set of 745 pEC50), which are accurate and satisfactory. The optimal RF model is comparable to other published QSTR models for Daphnia magna, although the optimal RF model possessed a small descriptor subset and dealt with a large dataset of pesticide toxicity pEC50. Thus, the investigation in this work provides a reliable, applicable QSTR model for predicting the toxicity pEC50 of pesticides towards Daphnia magna.
Collapse
Affiliation(s)
- Cong Chen
- Hunan Provincial Key Laboratory of Environmental Catalysis & Waste Regeneration, College of Materials and Chemical Engineering, Hunan Institute of Engineering, Xiangtan, Hunan, 411104, China
| | - Bowen Yang
- Hunan Provincial Key Laboratory of Environmental Catalysis & Waste Regeneration, College of Materials and Chemical Engineering, Hunan Institute of Engineering, Xiangtan, Hunan, 411104, China
| | - Mingwang Li
- Hunan Provincial Key Laboratory of Environmental Catalysis & Waste Regeneration, College of Materials and Chemical Engineering, Hunan Institute of Engineering, Xiangtan, Hunan, 411104, China
| | - Saijin Huang
- Hunan Provincial Key Laboratory of Environmental Catalysis & Waste Regeneration, College of Materials and Chemical Engineering, Hunan Institute of Engineering, Xiangtan, Hunan, 411104, China.
| | - Xianwei Huang
- Hunan Provincial Key Laboratory of Environmental Catalysis & Waste Regeneration, College of Materials and Chemical Engineering, Hunan Institute of Engineering, Xiangtan, Hunan, 411104, China.
| |
Collapse
|
3
|
Wei J, Tian L, Nie F, Shao Z, Wang Z, Xu Y, He M. Quantitative structure-activity relationship model development for estimating the predicted No-effect concentration of petroleum hydrocarbon and derivatives in the ecological risk assessment. Heliyon 2024; 10:e26808. [PMID: 38468969 PMCID: PMC10925994 DOI: 10.1016/j.heliyon.2024.e26808] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2024] [Revised: 02/20/2024] [Accepted: 02/20/2024] [Indexed: 03/13/2024] Open
Abstract
Quantitative structure-activity relationship (QSAR) is a cost-effective solution to directly and accurately estimating the environmental safety thresholds (ESTs) of pollutants in the ecological risk assessment due to the lack of toxicity data. In this study, QSAR models were developed for estimating the Predicted No-Effect Concentrations (PNECs) of petroleum hydrocarbons and their derivatives (PHDs) under dietary exposure, based on the quantified molecular descriptors and the obtained PNECs of 51 PHDs with given acute or chronic toxicity concentrations. Three high-reliable QSAR models were respectively developed for PHDs, aromatic hydrocarbons and their derivatives (AHDs), and alkanes, alkenes and their derivatives (ALKDs), with excellent fitting performance evidenced by high correlation coefficient (0.89-0.95) and low root mean square error (0.13-0.2 mg/kg), and high stability and predictive performance reflected by high internal and external verification coefficient (Q2LOO, 0.66-0.89; Q2F1, 0.62-0.78; Q2F2, 0.60-0.73). The investigated quantitative relationships between molecular structure and PNECs indicated that 18 autocorrelation descriptors, 3 information index descriptors, 4 barysz matrix descriptors, 6 burden modified eigenvalues descriptors, and 1 BCUT descriptor were important molecular descriptors affecting the PNECs of PHDs. The obtained results supported that PNECs of PHDs can be accurately estimated by the influencing molecular descriptors and the quantitative relationship from the developed QSAR models, that provided a new feasible solution for ESTs derivation in the ecological risk assessment.
Collapse
Affiliation(s)
- Jiajia Wei
- State Key Laboratory of Petroleum Pollution Control, CNPC Research Institute of Safety and Environmental Technology Co., Ltd, Beijing, 102206, China
- Hubei Key Laboratory of Petroleum Geochemistry and Environment (Yangtze University), Wuhan, 430100, China
- School of Resources and Environment, Yangtze University, Wuhan, 430100, China
| | - Lei Tian
- State Key Laboratory of Petroleum Pollution Control, CNPC Research Institute of Safety and Environmental Technology Co., Ltd, Beijing, 102206, China
- Hubei Key Laboratory of Petroleum Geochemistry and Environment (Yangtze University), Wuhan, 430100, China
- School of Petroleum Engineering, Yangtze University, Wuhan, 430100, China
| | - Fan Nie
- State Key Laboratory of Petroleum Pollution Control, CNPC Research Institute of Safety and Environmental Technology Co., Ltd, Beijing, 102206, China
| | - Zhiguo Shao
- State Key Laboratory of Petroleum Pollution Control, CNPC Research Institute of Safety and Environmental Technology Co., Ltd, Beijing, 102206, China
| | - Zhansheng Wang
- State Key Laboratory of Petroleum Pollution Control, CNPC Research Institute of Safety and Environmental Technology Co., Ltd, Beijing, 102206, China
| | - Yu Xu
- State Key Laboratory of Petroleum Pollution Control, CNPC Research Institute of Safety and Environmental Technology Co., Ltd, Beijing, 102206, China
| | - Mei He
- State Key Laboratory of Petroleum Pollution Control, CNPC Research Institute of Safety and Environmental Technology Co., Ltd, Beijing, 102206, China
- Hubei Key Laboratory of Petroleum Geochemistry and Environment (Yangtze University), Wuhan, 430100, China
- School of Resources and Environment, Yangtze University, Wuhan, 430100, China
| |
Collapse
|
4
|
Song W, Wu F, Yan Y, Li Y, Wang Q, Hu X, Li Y. Gut microbiota landscape and potential biomarker identification in female patients with systemic lupus erythematosus using machine learning. Front Cell Infect Microbiol 2023; 13:1289124. [PMID: 38169617 PMCID: PMC10758415 DOI: 10.3389/fcimb.2023.1289124] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2023] [Accepted: 11/28/2023] [Indexed: 01/05/2024] Open
Abstract
Objectives Systemic Lupus Erythematosus (SLE) is a complex autoimmune disease that disproportionately affects women. Early diagnosis and prevention are crucial for women's health, and the gut microbiota has been found to be strongly associated with SLE. This study aimed to identify potential biomarkers for SLE by characterizing the gut microbiota landscape using feature selection and exploring the use of machine learning (ML) algorithms with significantly dysregulated microbiotas (SDMs) for early identification of SLE patients. Additionally, we used the SHapley Additive exPlanations (SHAP) interpretability framework to visualize the impact of SDMs on the risk of developing SLE in females. Methods Stool samples were collected from 54 SLE patients and 55 Negative Controls (NC) for microbiota analysis using 16S rRNA sequencing. Feature selection was performed using Elastic Net and Boruta on species-level taxonomy. Subsequently, four ML algorithms, namely logistic regression (LR), Adaptive Boosting (AdaBoost), Random Forest (RF), and eXtreme gradient boosting (XGBoost), were used to achieve early identification of SLE with SDMs. Finally, the best-performing algorithm was combined with SHAP to explore how SDMs affect the risk of developing SLE in females. Results Both alpha and beta diversity were found to be different in SLE group. Following feature selection, 68 and 21 microbiota were retained in Elastic Net and Boruta, respectively, with 16 microbiota overlapping between the two, i.e., SDMs for SLE. The four ML algorithms with SDMs could effectively identify SLE patients, with XGBoost performing the best, achieving Accuracy, Sensitivity, Specificity, Positive Predictive Value, Negative Predictive Value, and AUC values of 0.844, 0.750, 0.938, 0.923, 0.790, and 0.930, respectively. The SHAP interpretability framework showed a complex non-linear relationship between the relative abundance of SDMs and the risk of SLE, with Escherichia_fergusonii having the largest SHAP value. Conclusions This study revealed dysbiosis in the gut microbiota of female SLE patients. ML classifiers combined with SDMs can facilitate early identification of female patients with SLE, particularly XGBoost. The SHAP interpretability framework provides insight into the impact of SDMs on the risk of SLE and may inform future scientific treatment for SLE.
Collapse
Affiliation(s)
- Wenzhu Song
- School of Public Health, Shanxi Medical University, Taiyuan, Shanxi, China
| | - Feng Wu
- Department of Nephrology, Shanxi Provincial People's Hospital (Fifth Hospital) of Shanxi Medical University, Taiyuan, China
| | - Yan Yan
- Department of Nephrology, Shanxi Provincial People's Hospital (Fifth Hospital) of Shanxi Medical University, Taiyuan, China
| | - Yaheng Li
- Shanxi Provincial Key Laboratory of Kidney Disease, Taiyuan, Shanxiuan, China
| | - Qian Wang
- Shanxi Provincial Key Laboratory of Kidney Disease, Taiyuan, Shanxiuan, China
| | - Xueli Hu
- Department of Nephrology, Hejin People’s Hospital, Yuncheng, Shanxi, China
| | - Yafeng Li
- Department of Nephrology, Shanxi Provincial People's Hospital (Fifth Hospital) of Shanxi Medical University, Taiyuan, China
- Shanxi Provincial Key Laboratory of Kidney Disease, Taiyuan, Shanxiuan, China
- Core Laboratory, Shanxi Provincial People's Hospital (Fifth Hospital) of Shanxi Medical University, Taiyuan, China
- Academy of Microbial Ecology, Shanxi Medical University, Taiyuan, China
| |
Collapse
|
5
|
Yu X. Global classification models for predicting acute toxicity of chemicals towards Daphnia magna. ENVIRONMENTAL RESEARCH 2023; 238:117239. [PMID: 37778597 DOI: 10.1016/j.envres.2023.117239] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/11/2023] [Revised: 09/10/2023] [Accepted: 09/18/2023] [Indexed: 10/03/2023]
Abstract
Molecular descriptors reflecting structural information on hydrophobicity, reactivity, polarizability, hydrogen bond and charged groups, were used to predict the toxicity (pLC50) of chemicals towards Daphnia magna with global quantitative structure-activity/toxicity relationship (QSAR/QSTR) models. A sufficiently large dataset including 1517 chemical toxicity to Daphnia magna was divided into a training set (758 pLC50) and a test set (759 pLC50). By applying random forest algorithm, two classification models, Class Model A and Class Model B were developed, having prediction accuracy, sensitivity and specificity above 85% for Class 1 (with pLC50 ≤ 4.48) and Class 2 (with pLC50 > 4.48). The Class Model A was based on nine molecular descriptors and RF parameters of nodesize = 1, ntree = 80 and mtry = 2, and yielded accuracy of 92.3% (training set), 85.6% (test set) and 88.9% (total data set). Class Model B was based on ten descriptors and parameters, nodesize = 1, ntree = 90 and mtry = 2, produced accuracy of 88.3% (training set), 86.8% (test set) and 87.5% (total data set). The two classification models were satisfactory compared with other classification model reported in the literature, although classification models in this work dealt with more samples. Thus, the two classification models with a larger applicability domain provided efficient tools for assessing chemical aquatic toxicity towards Daphnia magna.
Collapse
Affiliation(s)
- Xinliang Yu
- Hunan Provincial Key Laboratory of Environmental Catalysis & Waste Regeneration, College of Materials and Chemical Engineering, Hunan Institute of Engineering, Xiangtan, Hunan, 411104, China.
| |
Collapse
|
6
|
Xu X, Zhou M, Xie K, Zhang S, Ji X, Sun Y, Li Q, Dong Z. Mitigation of avermectin exposure-induced brain tissue damage in carp by quercetin. FISH PHYSIOLOGY AND BIOCHEMISTRY 2023; 49:1171-1185. [PMID: 37831371 DOI: 10.1007/s10695-023-01249-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/06/2023] [Accepted: 10/02/2023] [Indexed: 10/14/2023]
Abstract
Avermectin is widely used as an important insecticide in agricultural production, but it also shows strong toxicity to non-target organisms. Quercetin is a natural flavonoid that is widely used due to its good anti-inflammatory and antioxidant effects. We believe that quercetin may have a potential therapeutic effect on avermectin poisoning. This experiment was proposed to observe the effect of quercetin on the toxic response to avermectin by observing the toxic response caused by avermectin in the brain of carp. In this project, 60 carp were studied as control group (Control), quercetin administration group (QUE), avermectin exposure group (AVM) and quercetin treatment avermectin exposure group (QUE + AVM) with different interventions to study the effect of quercetin on avermectin. The carp brain tissues were stained and simultaneously analyzed for blood-brain barrier (BBB), oxidative stress indicators, inflammatory factors, and apoptosis using qPCR technique. The results of the study indicate that avermectin exhibits a neurotoxic mechanism of action in fish by decreasing the transcript levels of tight junction protein-related genes, which in turn leads to the rupture of the BBB in the carp brain tissue. Avermectin induced apoptosis in carp brain tissue by increasing oxidative stress response and promoting inflammatory cell infiltration. Quercetin could reduce the accumulation of reactive oxygen species (ROS) in the brain tissue of carp caused by avermectin exposure toxicity, maintain redox homeostasis, reduce inflammatory response, and protect brain tissue cells from apoptosis. The present study confirmed the therapeutic and protective effects of quercetin on neurotoxicity in carp caused by avermectin exposure.
Collapse
Affiliation(s)
- Xuhui Xu
- Jiangsu Key Laboratory of Marine Bioresources and Environment, Co-Innovation Center of Jiangsu Marine Bio-Industry Technology, Jiangsu Key Laboratory of Marine Pharmaceutical Compound Screening, College of Pharmacy, Jiangsu Ocean University, Lianyungang, 222005, China
| | - Mengyuan Zhou
- Jiangsu Key Laboratory of Marine Bioresources and Environment, Co-Innovation Center of Jiangsu Marine Bio-Industry Technology, Jiangsu Key Laboratory of Marine Pharmaceutical Compound Screening, College of Pharmacy, Jiangsu Ocean University, Lianyungang, 222005, China
| | - Kunmei Xie
- Jiangsu Key Laboratory of Marine Bioresources and Environment, Co-Innovation Center of Jiangsu Marine Bio-Industry Technology, Jiangsu Key Laboratory of Marine Pharmaceutical Compound Screening, College of Pharmacy, Jiangsu Ocean University, Lianyungang, 222005, China
| | - Shuai Zhang
- Jiangsu Key Laboratory of Marine Bioresources and Environment, Co-Innovation Center of Jiangsu Marine Bio-Industry Technology, Jiangsu Key Laboratory of Marine Pharmaceutical Compound Screening, College of Pharmacy, Jiangsu Ocean University, Lianyungang, 222005, China
| | - Xiaomeng Ji
- Jiangsu Key Laboratory of Marine Bioresources and Environment, Co-Innovation Center of Jiangsu Marine Bio-Industry Technology, Jiangsu Key Laboratory of Marine Pharmaceutical Compound Screening, College of Pharmacy, Jiangsu Ocean University, Lianyungang, 222005, China
| | - Ying Sun
- Jiangsu Key Laboratory of Marine Bioresources and Environment, Co-Innovation Center of Jiangsu Marine Bio-Industry Technology, Jiangsu Key Laboratory of Marine Pharmaceutical Compound Screening, College of Pharmacy, Jiangsu Ocean University, Lianyungang, 222005, China
| | - Qiulu Li
- Jiangsu Key Laboratory of Marine Bioresources and Environment, Co-Innovation Center of Jiangsu Marine Bio-Industry Technology, Jiangsu Key Laboratory of Marine Pharmaceutical Compound Screening, College of Pharmacy, Jiangsu Ocean University, Lianyungang, 222005, China
| | - Zibo Dong
- Jiangsu Key Laboratory of Marine Bioresources and Environment, Co-Innovation Center of Jiangsu Marine Bio-Industry Technology, Jiangsu Key Laboratory of Marine Pharmaceutical Compound Screening, College of Pharmacy, Jiangsu Ocean University, Lianyungang, 222005, China.
| |
Collapse
|
7
|
Ghosh S, Chatterjee M, Roy K. Quantitative Read-across structure-activity relationship (q-RASAR): A new approach methodology to model aquatic toxicity of organic pesticides against different fish species. AQUATIC TOXICOLOGY (AMSTERDAM, NETHERLANDS) 2023; 265:106776. [PMID: 38006764 DOI: 10.1016/j.aquatox.2023.106776] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/13/2023] [Revised: 11/17/2023] [Accepted: 11/19/2023] [Indexed: 11/27/2023]
Abstract
We have developed quantitative toxicity prediction models for organic pesticides of agricultural importance considering different fish species using a novel quantitative Read-across structure-activity relationship (q-RASAR) approach. The current study uses experimental (Log 1/LC50) data of organic pesticides to various fish species, including Rainbow trout (RT: Oncorhynchus mykiss: 715 data points), Lepomis (LP: Lepomis macrochirus: 136 data points), and Miscellaneous (Pimephales promelas, Brachydanio rerio: 226 data points). This study has also discussed the validation of the developed models and the analysis of structural features that are important for aquatic toxicity towards fishes. The read-across-derived similarity, error, and concordance measures (RASAR descriptors) have been extracted from the preliminary 0D-2D descriptors; the combined pool of RASAR and selected 0D-2D descriptors have been used to develop the final models by employing partial least squares algorithm. All the q-RASAR models are acceptable in terms of goodness of fit, robustness, and external predictivity, superseding the quality of the respective QSAR models, as seen from the computed validation metrics. The q-RASAR is an effective approach that has the potential to be used as a good alternative way to enhance external predictivity, interpretability, and transferability for aquatic toxicity prediction as well as ecotoxicity potential identification.
Collapse
Affiliation(s)
- Shilpayan Ghosh
- Drug Theoretics and Cheminformatics Laboratory, Department of Pharmaceutical Technology, Jadavpur University, Kolkata 700032, India
| | - Mainak Chatterjee
- Drug Theoretics and Cheminformatics Laboratory, Department of Pharmaceutical Technology, Jadavpur University, Kolkata 700032, India
| | - Kunal Roy
- Drug Theoretics and Cheminformatics Laboratory, Department of Pharmaceutical Technology, Jadavpur University, Kolkata 700032, India.
| |
Collapse
|
8
|
Viljanen M, Minnema J, Wassenaar PNH, Rorije E, Peijnenburg W. What is the ecotoxicity of a given chemical for a given aquatic species? Predicting interactions between species and chemicals using recommender system techniques. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2023; 34:765-788. [PMID: 37670728 DOI: 10.1080/1062936x.2023.2254225] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/21/2023] [Accepted: 08/27/2023] [Indexed: 09/07/2023]
Abstract
Ecotoxicological safety assessment of chemicals requires toxicity data on multiple species, despite the general desire of minimizing animal testing. Predictive models, specifically machine learning (ML) methods, are one of the tools capable of solving this apparent contradiction as they allow to generalize toxicity patterns across chemicals and species. However, despite the availability of large public toxicity datasets, the data is highly sparse, complicating model development. The aim of this study is to provide insights into how ML can predict toxicity using a large but sparse dataset. We developed models to predict LC50-values, based on experimental LC50-data covering 2431 organic chemicals and 1506 aquatic species from the ECOTOX-database. Several well-known ML techniques were evaluated and a new ML model was developed, inspired by recommender systems. This new model involves a simple linear model that learns low-rank interactions between species and chemicals using factorization machines. We evaluated the predictive performances of the developed models based on two validation settings: 1) predicting unseen chemical-species pairs, and 2) predicting unseen chemicals. The results of this study show that ML models can accurately predict LC50-values in both validation settings. Moreover, we show that the novel factorization machine approach can match well-tuned, complex, ML approaches.
Collapse
Affiliation(s)
- M Viljanen
- Department of Statistics, Data Science and Modelling, National Institute of Public Health and the Environment, Bilthoven, The Netherlands
| | - J Minnema
- Center for Safety of Substances and Products, National Institute of Public Health and the Environment, Bilthoven, The Netherlands
| | - P N H Wassenaar
- Center for Safety of Substances and Products, National Institute of Public Health and the Environment, Bilthoven, The Netherlands
| | - E Rorije
- Center for Safety of Substances and Products, National Institute of Public Health and the Environment, Bilthoven, The Netherlands
| | - W Peijnenburg
- Center for Safety of Substances and Products, National Institute of Public Health and the Environment, Bilthoven, The Netherlands
- Institute of Environmental Sciences (CML), Leiden University, Leiden, The Netherlands
| |
Collapse
|
9
|
Terrados-Cristos M, Ortega-Fernández F, Díaz-Piloñeta M, Rodríguez Montequín V, Álvarez Cabal JV. Hybrid system model for wind abrasion segmentation using semi-automatic classification of remote sensing multispectral areas. Heliyon 2023; 9:e19655. [PMID: 37809392 PMCID: PMC10558917 DOI: 10.1016/j.heliyon.2023.e19655] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2023] [Revised: 08/29/2023] [Accepted: 08/29/2023] [Indexed: 10/10/2023] Open
Abstract
Wind abrasion, caused by particles transported by strong winds impacting on structures, can lead to their degradation. Although this phenomenon has hardly been studied in this context, it is becoming increasingly important due to new trends in infrastructure location, especially in renewable energy terms. Metallic structures are particularly vulnerable to degradation by the action of windblown sand particles. However, characterising such secluded sites is complicated, and remote sensing systems and satellite information become crucial. The objective of this research is to identify and delineate the geographic areas that are vulnerable to this phenomenon by employing a hybrid model with historical data and the semi-automatic classification of multispectral satellite images. The model is based on critical variables identified by the scientific community and case studies documented in the literature. The methodology used for the study consists of four phases, including creating a scientifically robust database, downloading and managing satellite and historical long-term information, segmenting the regions of interest, and modelling using supervised classification techniques. The proposed algorithm shows very accurate results (R2 = 0.9922) and the overall system approach is presented as a useful and generalizable method to address this problem, increasing the existing knowledge on material wear by particle action, and contributing to optimizing the initial design of resilient structures.
Collapse
|
10
|
Jiang S, Dong Y, Wang J, Zhang X, Liu W, Wei Y, Zhou H, Shen L, Yang J, Zhu Q. Identification of immunogenic cell death-related signature on prognosis and immunotherapy in kidney renal clear cell carcinoma. Front Immunol 2023; 14:1207061. [PMID: 37662929 PMCID: PMC10472448 DOI: 10.3389/fimmu.2023.1207061] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2023] [Accepted: 07/04/2023] [Indexed: 09/05/2023] Open
Abstract
Background Immunogenic cell death (ICD) is considered a particular cell death modality of regulated cell death (RCD) and plays a significant role in various cancers. The connection between kidney renal clear cell carcinoma (KIRC) and ICD remains to be thoroughly explored. Methods We conducted a variety of bioinformatics analyses using R software, including cluster analysis, prognostic analysis, enrichment analysis and immune infiltration analysis. In addition, we performed Quantitative Real-time PCR to evaluate RNA levels of specific ICD genes. The proliferation was measured through Cell Counting Kit-8 (CCK-8) assay and colony-formation assay in RCC cell lines. Results We determined two ICD subtypes through consensus clustering analysis. The two subtypes showed significantly different clinical outcomes, genomic alterations and tumor immune microenvironment. Moreover, we constructed the ICD prognostic signature based on TF, FOXP3, LY96, SLC7A11, HSP90AA1, UCN, IFNB1 and TLR3 and calculated the risk score for each patient. Kaplan-Meier survival analysis and ROC curve demonstrated that patients in the high-risk group had significantly poorer prognosis compared with the low-risk group. We then validated the signature through external cohort and further evaluated the relation between the signature and clinical features, tumor immune microenvironment and immunotherapy response. Given its critical role in ICD, we conducted further analysis on LY96. Our results indicated that downregulation of LY96 inhibited the proliferation ability of RCC cells. Conclusions Our research revealed the underlying function of ICD in KIRC and screened out a potential biomarker, which provided a novel insight into individualized immunotherapy in KIRC.
Collapse
Affiliation(s)
- Silin Jiang
- Department of Urology, The Second Affiliated Hospital of Nanjing Medical University, Nanjing, China
| | - Yuxiang Dong
- Department of Urology, The Second Affiliated Hospital of Nanjing Medical University, Nanjing, China
| | - Jun Wang
- Department of Urology, Nanjing University of Chinese Medicine, Nanjing, China
| | - Xi Zhang
- The State Key Lab of Reproductive; Department of Urology, The First Affiliated Hospital of Nanjing Medical University, Nanjing, China
| | - Wei Liu
- Department of Urology, The Second Affiliated Hospital of Nanjing Medical University, Nanjing, China
| | - Yong Wei
- Department of Urology, The Second Affiliated Hospital of Nanjing Medical University, Nanjing, China
| | - Hai Zhou
- Department of Urology, The Second Affiliated Hospital of Nanjing Medical University, Nanjing, China
| | - Luming Shen
- Department of Urology, The Second Affiliated Hospital of Nanjing Medical University, Nanjing, China
| | - Jian Yang
- Department of Urology, The Second Affiliated Hospital of Nanjing Medical University, Nanjing, China
| | - Qingyi Zhu
- Department of Urology, The Second Affiliated Hospital of Nanjing Medical University, Nanjing, China
| |
Collapse
|
11
|
Yu X, He M, Su L. Large Dataset-Based Regression Model of Chemical Toxicity to Vibrio fischeri. ARCHIVES OF ENVIRONMENTAL CONTAMINATION AND TOXICOLOGY 2023:10.1007/s00244-023-01010-4. [PMID: 37407875 DOI: 10.1007/s00244-023-01010-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/27/2023] [Accepted: 06/20/2023] [Indexed: 07/07/2023]
Abstract
For the first time, a global regression quantitative structure-toxicity/activity relationship (QSTR/QSAR) model was developed for the toxicity of a large data set including 1236 chemicals towards Vibrio fischeri, by using random forest (RF) regression algorithm. The optimal RF model with RF parameters of mtry = 3, ntree = 150 and nodesize = 5 was based on 13 molecular descriptors. It can achieve accurate prediction for the toxicity of 99.1% of 1236 chemicals, and yield coefficients of determination R2 of 0.893 for 930 log(Mw/IBC50) in the training set, 0.723 for 306 log(Mw/IBC50) in the test se, and 0.865 for 1236 toxicity log(Mw/IBC50) in the total set. The optimal RF global model proposed in this work is comparable to other published local QSTR models on small datasets of the toxicity to Vibrio fischeri.
Collapse
Affiliation(s)
- Xinliang Yu
- Hunan Provincial Key Laboratory of Environmental Catalysis and Waste Regeneration, College of Materials and Chemical Engineering, Hunan Institute of Engineering, Xiangtan, 411104, Hunan, People's Republic of China.
| | - Minghui He
- School of Environment, and State Environmental Protection Key Laboratory of Wetland Ecology and Vegetation Restoration, Northeast Normal University, Changchun, 130117, Jilin, People's Republic of China
| | - Limin Su
- School of Environment, and State Environmental Protection Key Laboratory of Wetland Ecology and Vegetation Restoration, Northeast Normal University, Changchun, 130117, Jilin, People's Republic of China.
| |
Collapse
|
12
|
Liao M, Wu F, Yu X, Zhao L, Wu H, Zhou J. Random Forest Algorithm-Based Prediction of Solvation Gibbs Energies. J SOLUTION CHEM 2023. [DOI: 10.1007/s10953-023-01247-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/29/2023]
|
13
|
Support Vector Machine-Based Global Classification Model of the Toxicity of Organic Compounds to Vibrio fischeri. Molecules 2023; 28:molecules28062703. [PMID: 36985675 PMCID: PMC10057455 DOI: 10.3390/molecules28062703] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2023] [Revised: 03/12/2023] [Accepted: 03/15/2023] [Indexed: 03/19/2023] Open
Abstract
Vibrio fischeri is widely used as the model species in toxicity and risk assessment. For the first time, a global classification model was proposed in this paper for a two-class problem (Class − 1 with log1/IBC50 ≤ 4.2 and Class + 1 with log1/IBC50 > 4.2, the unit of IBC50: mol/L) by utilizing a large data set of 601 toxicity log1/IBC50 of organic compounds to Vibrio fischeri. Dragon software was used to calculate 4885 molecular descriptors for each compound. Stepwise multiple linear regression (MLR) analysis was used to select the descriptor subset for the models. The ten molecular descriptors used in the classification model reflect the structural information on the Michael-type addition of nucleophiles, molecular branching, molecular size, polarizability, hydrophobic, and so on. Furthermore, these descriptors were interpreted from the point of view of toxicity mechanisms. The optimal support vector machine (SVM) model (C = 253.8 and γ = 0.009) was obtained with the genetic algorithm. The SVM classification model produced a prediction accuracy of 89.1% for the training set (451 log1/IBC50), of 80.0% for the test set (150 log1/IBC50), and of 86.9% for the total data set (601 log1/IBC50), which are higher than that (80.5%, 76%, and 79.4%, respectively) from the binary logistic regression (BLR) model. The global SVM classification model is successful, although it deals with a large data set in relation to the toxicity of organics to Vibrio fischeri.
Collapse
|
14
|
Yu X, Acree Jr. WE. QSPR-based model extrapolation prediction of enthalpy of solvation. J Mol Liq 2023. [DOI: 10.1016/j.molliq.2023.121455] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/13/2023]
|
15
|
Quantitative relationships between national cultures and the increase in cases of novel coronavirus pneumonia. Sci Rep 2023; 13:1646. [PMID: 36717639 PMCID: PMC9885052 DOI: 10.1038/s41598-023-28980-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2022] [Accepted: 01/27/2023] [Indexed: 02/01/2023] Open
Abstract
Support vector machine (SVM) and genetic algorithm were successfully used to predict the changes in the prevalence rate (ΔPR) measured by the increase of reported cases per million population from the 16th to the 45th day during a nation's lockdown after the COVID-19 outbreak. The national cultural indices [individualism-collectivism (Ind), tightness-looseness (Tight)], and the number of people per square kilometer (Pop_density) were used to develop the SVM model of lnΔPR. The SVM model has R2 of 0.804 for the training set (44 samples) and 0.853 for the test set (11 samples), which were much higher than those (0.416 and 0.593) of the multiple linear regression model. The statistical results indicate that there are nonlinear relationships between lnΔPR and Tight, Ind, and Pop_density. It is feasible to build the model for lnΔPR with SVM algorithm. The results suggested that the risk of COVID-19 epidemic spread will be reduced if a nation implements severe measures to strengthen the tightness of national culture and individuals realize the importance of collectivism.
Collapse
|