51
|
Zhang T, Liu Z, Ma Q, Hu D, Dai Y, Zhang X, Zhou Z. Identification of Dendrobium Using Laser-Induced Breakdown Spectroscopy in Combination with a Multivariate Algorithm Model. Foods 2024; 13:1676. [PMID: 38890910 PMCID: PMC11172223 DOI: 10.3390/foods13111676] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2024] [Revised: 05/23/2024] [Accepted: 05/25/2024] [Indexed: 06/20/2024] Open
Abstract
Dendrobium, a highly effective traditional Chinese medicinal herb, exhibits significant variations in efficacy and price among different varieties. Therefore, achieving an efficient classification of Dendrobium is crucial. However, most of the existing identification methods for Dendrobium make it difficult to simultaneously achieve both non-destructiveness and high efficiency, making it challenging to truly meet the needs of industrial production. In this study, we combined Laser-Induced Breakdown Spectroscopy (LIBS) with multivariate models to classify 10 varieties of Dendrobium. LIBS spectral data for each Dendrobium variety were collected from three circular medicinal blocks. During the data analysis phase, multivariate models to classify different Dendrobium varieties first preprocess the LIBS spectral data using Gaussian filtering and stacked correlation coefficient feature selection. Subsequently, the constructed fusion model is utilized for classification. The results demonstrate that the classification accuracy of 10 Dendrobium varieties reached 100%. Compared to Support Vector Machine (SVM), Random Forest (RF), and K-Nearest Neighbors (KNN), our method improved classification accuracy by 14%, 20%, and 20%, respectively. Additionally, it outperforms three models (SVM, RF, and KNN) with added Principal Component Analysis (PCA) by 10%, 10%, and 17%. This fully validates the excellent performance of our classification method. Finally, visualization analysis of the entire research process based on t-distributed Stochastic Neighbor Embedding (t-SNE) technology further enhances the interpretability of the model. This study, by combining LIBS and machine learning technologies, achieves efficient classification of Dendrobium, providing a feasible solution for the identification of Dendrobium and even traditional Chinese medicinal herbs.
Collapse
Affiliation(s)
- Tingsong Zhang
- College of Opto-Electro-Mechanical Engineering, Zhejiang A&F University, Hangzhou 311300, China (Z.L.); (Y.D.)
| | - Ziyuan Liu
- College of Opto-Electro-Mechanical Engineering, Zhejiang A&F University, Hangzhou 311300, China (Z.L.); (Y.D.)
| | - Qing Ma
- College of Opto-Electro-Mechanical Engineering, Zhejiang A&F University, Hangzhou 311300, China (Z.L.); (Y.D.)
| | - Dong Hu
- College of Opto-Electro-Mechanical Engineering, Zhejiang A&F University, Hangzhou 311300, China (Z.L.); (Y.D.)
| | - Yujia Dai
- College of Opto-Electro-Mechanical Engineering, Zhejiang A&F University, Hangzhou 311300, China (Z.L.); (Y.D.)
| | - Xinfeng Zhang
- State Key Laboratory of Subtropical Silviculture, Zhejiang A&F University, Hangzhou 311300, China
| | - Zhu Zhou
- College of Opto-Electro-Mechanical Engineering, Zhejiang A&F University, Hangzhou 311300, China (Z.L.); (Y.D.)
| |
Collapse
|
52
|
Wang F, Wang A, Huang Y, Gao W, Xu Y, Zhang W, Guo G, Song W, Kong Y, Wang Q, Wang S, Shi F. Lipoproteins and metabolites in diagnosing and predicting Alzheimer's disease using machine learning. Lipids Health Dis 2024; 23:152. [PMID: 38773573 PMCID: PMC11107010 DOI: 10.1186/s12944-024-02141-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2024] [Accepted: 05/09/2024] [Indexed: 05/24/2024] Open
Abstract
BACKGROUND Alzheimer's disease (AD) is a chronic neurodegenerative disorder that poses a substantial economic burden. The Random forest algorithm is effective in predicting AD; however, the key factors influencing AD onset remain unclear. This study aimed to analyze the key lipoprotein and metabolite factors influencing AD onset using machine-learning methods. It provides new insights for researchers and medical personnel to understand AD and provides a reference for the early diagnosis, treatment, and early prevention of AD. METHODS A total of 603 participants, including controls and patients with AD with complete lipoprotein and metabolite data from the Alzheimer's disease Neuroimaging Initiative (ADNI) database between 2005 and 2016, were enrolled. Random forest, Lasso regression, and CatBoost algorithms were employed to rank and filter 213 lipoprotein and metabolite variables. Variables with consistently high importance rankings from any two methods were incorporated into the models. Finally, the variables selected from the three methods, with the participants' age, sex, and marital status, were used to construct a random forest predictive model. RESULTS Fourteen lipoprotein and metabolite variables were screened using the three methods, and 17 variables were included in the AD prediction model based on age, sex, and marital status of the participants. The optimal random forest modeling was constructed with "mtry" set to 3 and "ntree" set to 300. The model exhibited an accuracy of 71.01%, a sensitivity of 79.59%, a specificity of 65.28%, and an AUC (95%CI) of 0.724 (0.645-0.804). When Mean Decrease Accuracy and Gini were used to rank the proteins, age, phospholipids to total lipids ratio in intermediate-density lipoproteins (IDL_PL_PCT), and creatinine were among the top five variables. CONCLUSIONS Age, IDL_PL_PCT, and creatinine levels play crucial roles in AD onset. Regular monitoring of lipoproteins and their metabolites in older individuals is significant for early AD diagnosis and prevention.
Collapse
Affiliation(s)
- Fenglin Wang
- Department of Health Statistics, School of Public Health, Shandong Second Medical University, Weifang, Shandong, 261053, China
| | - Aimin Wang
- Department of Health Statistics, School of Public Health, Shandong Second Medical University, Weifang, Shandong, 261053, China
| | - Yiming Huang
- Department of Health Statistics, School of Public Health, Shandong Second Medical University, Weifang, Shandong, 261053, China
| | - Wenfeng Gao
- Department of Rheumatology and Immunology, Affiliated Hospital of Shandong Second Medical University, Weifang, Shandong, 261031, China
| | - Yaqi Xu
- Department of Health Statistics, School of Public Health, Shandong Second Medical University, Weifang, Shandong, 261053, China
| | - Wenjing Zhang
- Department of Health Statistics, School of Public Health, Shandong Second Medical University, Weifang, Shandong, 261053, China
| | - Guiya Guo
- Department of Health Statistics, School of Public Health, Shandong Second Medical University, Weifang, Shandong, 261053, China
| | - Wangchen Song
- Department of Health Statistics, School of Public Health, Shandong Second Medical University, Weifang, Shandong, 261053, China
| | - Yujia Kong
- Department of Health Statistics, School of Public Health, Shandong Second Medical University, Weifang, Shandong, 261053, China
| | - Qinghua Wang
- Department of Health Statistics, School of Public Health, Shandong Second Medical University, Weifang, Shandong, 261053, China
| | - Suzhen Wang
- Department of Health Statistics, School of Public Health, Shandong Second Medical University, Weifang, Shandong, 261053, China.
| | - Fuyan Shi
- Department of Health Statistics, School of Public Health, Shandong Second Medical University, Weifang, Shandong, 261053, China.
| |
Collapse
|
53
|
Orzeszko W, Piotrowski D. Prediction of robo-advisory acceptance in banking services using tree-based algorithms. PLoS One 2024; 19:e0302359. [PMID: 38709756 PMCID: PMC11073725 DOI: 10.1371/journal.pone.0302359] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2023] [Accepted: 04/02/2024] [Indexed: 05/08/2024] Open
Abstract
The banking sector is increasingly recognising the need to implement robo-advisory. The introduction of this service may lead to increased efficiency of banks, improved quality of customer service, and a strengthened image of banks as innovative institutions. Robo-advisory uses data relating to customers, their behaviors and preferences obtained by banks from various communication channels. In the research carried out in the work, an attempt was made to obtain an answer to the question whether the data collected by banks can also be used to determine the degree of consumer interest in this type of service. This is important because the identification of customers interested in the service will allow banks to direct a properly prepared message to a selected group of addressees, increasing the effectiveness of their promotional activities. The aim of the article is to construct and examine the effectiveness of predictive models of consumer acceptance of robo-advisory services provided by banks. Based on the authors' survey on the use of artificial intelligence technology in the banking sector in Poland, in this article we construct tree-based models to predict customers' attitudes towards using robo-advisory in banking services using, as predictors, their socio-demographic characteristics, behaviours and attitudes towards modern digital technologies, experience in using banking services, as well as trust towards banks. In our study, we use selected machine learning algorithms, including a decision tree and several tree-based ensemble models. We showed that constructed models allow to effectively predict consumer acceptance of robo-advisory services.
Collapse
Affiliation(s)
- Witold Orzeszko
- Department of Applied Informatics and Mathematics in Economics, Nicolaus Copernicus University, Toruń, Poland
| | - Dariusz Piotrowski
- Department of Financial Management, Nicolaus Copernicus University, Toruń, Poland
| |
Collapse
|
54
|
Mohebbi F, Forati AM, Torres L, deRoon-Cassini TA, Harris J, Tomas CW, Mantsch JR, Ghose R. Exploring the Association Between Structural Racism and Mental Health: Geospatial and Machine Learning Analysis. JMIR Public Health Surveill 2024; 10:e52691. [PMID: 38701436 PMCID: PMC11102033 DOI: 10.2196/52691] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2023] [Revised: 01/15/2024] [Accepted: 03/20/2024] [Indexed: 05/05/2024] Open
Abstract
BACKGROUND Structural racism produces mental health disparities. While studies have examined the impact of individual factors such as poverty and education, the collective contribution of these elements, as manifestations of structural racism, has been less explored. Milwaukee County, Wisconsin, with its racial and socioeconomic diversity, provides a unique context for this multifactorial investigation. OBJECTIVE This research aimed to delineate the association between structural racism and mental health disparities in Milwaukee County, using a combination of geospatial and deep learning techniques. We used secondary data sets where all data were aggregated and anonymized before being released by federal agencies. METHODS We compiled 217 georeferenced explanatory variables across domains, initially deliberately excluding race-based factors to focus on nonracial determinants. This approach was designed to reveal the underlying patterns of risk factors contributing to poor mental health, subsequently reintegrating race to assess the effects of racism quantitatively. The variable selection combined tree-based methods (random forest) and conventional techniques, supported by variance inflation factor and Pearson correlation analysis for multicollinearity mitigation. The geographically weighted random forest model was used to investigate spatial heterogeneity and dependence. Self-organizing maps, combined with K-means clustering, were used to analyze data from Milwaukee communities, focusing on quantifying the impact of structural racism on the prevalence of poor mental health. RESULTS While 12 influential factors collectively accounted for 95.11% of the variability in mental health across communities, the top 6 factors-smoking, poverty, insufficient sleep, lack of health insurance, employment, and age-were particularly impactful. Predominantly, African American neighborhoods were disproportionately affected, which is 2.23 times more likely to encounter high-risk clusters for poor mental health. CONCLUSIONS The findings demonstrate that structural racism shapes mental health disparities, with Black community members disproportionately impacted. The multifaceted methodological approach underscores the value of integrating geospatial analysis and deep learning to understand complex social determinants of mental health. These insights highlight the need for targeted interventions, addressing both individual and systemic factors to mitigate mental health disparities rooted in structural racism.
Collapse
Affiliation(s)
- Fahimeh Mohebbi
- College of Engineering and Applied Science, University of Wisconsin-Milwaukee, Milwaukee, WI, United States
| | - Amir Masoud Forati
- Department of Medicine, University of Wisconsin-Madison, Madison, WI, United States
| | - Lucas Torres
- Department of Psychology, Marquette University, Milwaukee, WI, United States
| | - Terri A deRoon-Cassini
- Division of Trauma & Acute Care Surgery, Department of Surgery, Medical College of Wisconsin, Milwaukee, WI, United States
| | - Jennifer Harris
- Community Relations-Social Development Commission, Milwaukee, WI, United States
| | - Carissa W Tomas
- Division of Epidemiology, Institute for Health and Equity, Medical College of Wisconsin, Milwaukee, WI, United States
| | - John R Mantsch
- Department of Pharmacology & Toxicology, Medical College of Wisconsin, Milwaukee, WI, United States
| | - Rina Ghose
- College of Engineering and Applied Science, University of Wisconsin-Milwaukee, Milwaukee, WI, United States
| |
Collapse
|
55
|
Zhou Q, Liu X, Yun H, Zhao Y, Shu K, Chen Y, Chen S. Leveraging artificial intelligence to identify high-risk patients for postoperative sore throat: An observational study. BIOMOLECULES & BIOMEDICINE 2024; 24:593-605. [PMID: 37870482 PMCID: PMC11088886 DOI: 10.17305/bb.2023.9519] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/15/2023] [Revised: 10/09/2023] [Accepted: 10/22/2023] [Indexed: 03/13/2024]
Abstract
Postoperative sore throat (POST) is a prevalent complication after general anesthesia and targeting high-risk patients helps in its prevention. This study developed and validated a machine learning model to predict POST. A total number of 834 patients who underwent general anesthesia with endotracheal intubation were included in this study. Data from a cohort of 685 patients was used for model development and validation, while a cohort of 149 patients served for external validation. The prediction performance of random forest (RF), neural network (NN), and extreme gradient boosting (XGBoost) models was compared using comprehensive performance metrics. The Local Interpretable Model-Agnostic Explanations (LIME) methods elucidated the best-performing model. POST incidences across training, validation, and testing cohorts were 41.7%, 38.4%, and 36.2%, respectively. Five predictors were age, sex, endotracheal tube cuff pressure, endotracheal tube insertion depth, and the time interval between extubation and the first drinking of water after extubation. After incorporating these variables, the NN model demonstrated superior generalization capabilities in predicting POST when compared to the XGBoost and RF models in external validation, achieving an area under the receiver operating characteristic curve (AUROC) of 0.81 (95% CI 0.74-0.89) and a precision-recall curve (AUPRC) of 0.77 (95% CI 0.66-0.86). The model also showed good calibration and clinical usage values. The NN model outperforms the XGBoost and RF models in predicting POST, with potential applications in the healthcare industry for reducing the incidence of this common postoperative complication.
Collapse
Affiliation(s)
- Qiangqiang Zhou
- Department of Anesthesiology, The Affiliated Changzhou Second People’s Hospital of Nanjing Medical University, Changzhou, Jiangsu, China
| | - Xiaoya Liu
- The Third Affiliated Hospital of Soochow University, Changzhou, Jiangsu, China
| | - Huifang Yun
- Department of Anesthesiology, The Affiliated Changzhou Second People’s Hospital of Nanjing Medical University, Changzhou, Jiangsu, China
| | - Yahong Zhao
- Department of Anesthesiology, The Affiliated Changzhou Second People’s Hospital of Nanjing Medical University, Changzhou, Jiangsu, China
| | - Kun Shu
- Department of Anesthesiology, The Affiliated Changzhou Second People’s Hospital of Nanjing Medical University, Changzhou, Jiangsu, China
| | - Yong Chen
- Department of Anesthesiology, The Affiliated Changzhou Second People’s Hospital of Nanjing Medical University, Changzhou, Jiangsu, China
| | - Song Chen
- Department of Orthopedics, The Quzhou Affiliated Hospital of Wenzhou Medical University, Quzhou People’s Hospital, Quzhou, Zhejiang Province, China
| |
Collapse
|
56
|
Campler MR, Cheng TY, Lee CW, Hofacre CL, Lossie G, Silva GS, El-Gazzar MM, Arruda AG. Investigating the uses of machine learning algorithms to inform risk factor analyses: The example of avian infectious bronchitis virus (IBV) in broiler chickens. Res Vet Sci 2024; 171:105201. [PMID: 38442531 DOI: 10.1016/j.rvsc.2024.105201] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2023] [Revised: 11/16/2023] [Accepted: 02/24/2024] [Indexed: 03/07/2024]
Abstract
Infectious bronchitis virus (IBV) is a contagious coronavirus causing respiratory and urogenital disease in chickens and is responsible for significant economic losses for both the broiler and table egg layer industries. Despite IBV being regularly monitored using standard epidemiologic surveillance practices, knowledge and evidence of risk factors associated with IBV transmission remain limited. The study objective was to compare risk factor modeling outcomes between a traditional stepwise variable selection approach and a machine learning-based random forest Boruta algorithm using routinely collected IBV antibody titer data from broiler flocks. IBV antibody sampling events (n = 1111) from 166 broiler sites between 2016 and 2021 were accessed. Ninety-two geospatial-related and poultry-density variables were obtained using a geographic information system and data sets from publicly available sources. Seventeen and 27 candidate variables were screened to potentially have an association with elevated IBV antibody titers according to the manual selection and machine learning algorithm, respectively. Selected variables from both methods were further investigated by construction of multivariable generalized mixed logistic regression models. Six variables were shortlisted by both screening methods, which included year, distance to urban areas, main roads, landcover, density of layer sites and year, however, final models for both approaches only shared year as an important predictor. Despite limited significance of clinical outcomes, this work showcases the potential of a novel explorative modeling approach in combination with often unutilized resources such as publicly available geospatial data, surveillance health data and machine learning as potential supplementary tools to investigate risk factors related to infectious diseases.
Collapse
Affiliation(s)
- Magnus R Campler
- Department of Veterinary Preventive Medicine, The Ohio State University, OH 43210, USA
| | - Ting-Yu Cheng
- Department of Veterinary Preventive Medicine, The Ohio State University, OH 43210, USA
| | - Chang-Won Lee
- Exotic and Emerging Avian Diseases, Southeast Poultry Research Laboratory, National Poultry Research Center, Agricultural Research Service, U.S. Department of Agriculture, Athens, GA 30605, USA
| | | | - Geoffrey Lossie
- Department of Comparative Pathobiology and Animal Disease Diagnostic Laboratory, College of Veterinary Medicine, Purdue University, IN 47907, USA
| | - Gustavo S Silva
- Department of Comparative Pathobiology and Animal Disease Diagnostic Laboratory, College of Veterinary Medicine, Purdue University, IN 47907, USA
| | - Mohamed M El-Gazzar
- Department of Veterinary Diagnostic and Production Animal Medicine, College of Veterinary Medicine, Iowa State University, IA 50011, USA
| | - Andréia G Arruda
- Department of Veterinary Preventive Medicine, The Ohio State University, OH 43210, USA.
| |
Collapse
|
57
|
Yu C, Xu G, Cai M, Li Y, Wang L, Zhang Y, Lin H. Predicting environmental impacts of smallholder wheat production by coupling life cycle assessment and machine learning. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024; 921:171097. [PMID: 38387559 DOI: 10.1016/j.scitotenv.2024.171097] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/04/2023] [Revised: 02/05/2024] [Accepted: 02/17/2024] [Indexed: 02/24/2024]
Abstract
Wheat grain production is a vital component of the food supply produced by smallholder farms but faces significant threats from climate change. This study evaluated eight environmental impacts of wheat production using life cycle assessment based on survey data from 274 households, then built random forest models with 21 input features to contrast the environmental responses of different farming practices across three shared socioeconomic pathways (SSPs), spanning from 2024 to 2100. The results indicate significant environmental repercussions. Compared to the baseline period of 2018-2020, a similar upward trend in environmental impacts is observed, showing an average annual growth rate of 5.88 % (ranging from 0.45 to 18.56 %) under the sustainable pathway (SSP119) scenario; 5.90 % (ranging from 1.00 to 18.15 %) for the intermediate development pathway (SSP245); and 6.22 % (ranging from 1.16 to 17.74 %) under the rapid economic development pathway (SSP585). Variation in rainfall is identified as the primary driving factor of the increased environmental impacts, whereas its relationship with rising temperatures is not significant. The results suggest adopting farming practices as a vital strategy for smallholder farms to mitigate climate change impacts. Emphasizing appropriate fertilizer application and straw recycling can significantly reduce the environmental footprint of wheat production. Standardized fertilization could reduce the environmental impact index by 11.10 to 47.83 %, while straw recycling might decrease respiratory inorganics and photochemical oxidant formation potential by over 40 %. Combined, these approaches could lower the impact index by 12.31 to 63.38 %. The findings highlight the importance of adopting enhanced farming practices within smallholder farming systems in the context of climate change. SPOTLIGHTS.
Collapse
Affiliation(s)
- Chunxiao Yu
- State Key Laboratory of Herbage Improvement and Grassland Agro-ecosystems, Key Laboratory of Grassland Livestock Industry Innovation, Ministry of Agriculture and Rural Affairs, Engineering Research Center of Grassland Industry, Ministry of Education, College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou 730020, China
| | - Gang Xu
- State Key Laboratory of Herbage Improvement and Grassland Agro-ecosystems, Key Laboratory of Grassland Livestock Industry Innovation, Ministry of Agriculture and Rural Affairs, Engineering Research Center of Grassland Industry, Ministry of Education, College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou 730020, China.
| | - Ming Cai
- Yunnan Academy of Grassland and Animal Science, Kunming 650212, China
| | - Yuan Li
- State Key Laboratory of Herbage Improvement and Grassland Agro-ecosystems, Key Laboratory of Grassland Livestock Industry Innovation, Ministry of Agriculture and Rural Affairs, Engineering Research Center of Grassland Industry, Ministry of Education, College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou 730020, China
| | - Lijia Wang
- State Key Laboratory of Herbage Improvement and Grassland Agro-ecosystems, Key Laboratory of Grassland Livestock Industry Innovation, Ministry of Agriculture and Rural Affairs, Engineering Research Center of Grassland Industry, Ministry of Education, College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou 730020, China
| | - Yan Zhang
- State Key Laboratory of Herbage Improvement and Grassland Agro-ecosystems, Key Laboratory of Grassland Livestock Industry Innovation, Ministry of Agriculture and Rural Affairs, Engineering Research Center of Grassland Industry, Ministry of Education, College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou 730020, China
| | - Huilong Lin
- State Key Laboratory of Herbage Improvement and Grassland Agro-ecosystems, Key Laboratory of Grassland Livestock Industry Innovation, Ministry of Agriculture and Rural Affairs, Engineering Research Center of Grassland Industry, Ministry of Education, College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou 730020, China
| |
Collapse
|
58
|
Zhu H, Yuan J, Wan Q, Cheng F, Dong X, Xia S, Zhou C. A UV-Vis spectroscopic detection method for cobalt ions in zinc sulfate solution based on discrete wavelet transform and extreme gradient boosting. SPECTROCHIMICA ACTA. PART A, MOLECULAR AND BIOMOLECULAR SPECTROSCOPY 2024; 311:123982. [PMID: 38320470 DOI: 10.1016/j.saa.2024.123982] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/20/2023] [Revised: 01/16/2024] [Accepted: 01/29/2024] [Indexed: 02/08/2024]
Abstract
Zinc is a crucial strategic metal resource. The concentration of cobalt ions in zinc refining solution significantly impacts the efficiency of zinc electrolysis production. The traditional method of detecting cobalt ions in zinc solution is time-consuming, labor-intensive and ineffective. However, optical detection offers the advantage of high efficiency and low cost, making it a potential replacement for the traditional method. In this study, the spectral curve of cobalt ions in zinc solution is detected by ultraviolet-visible (UV-Vis) spectrophotometry. Additionally, we propose a model for the concentration-absorbance relationship of cobalt ions in zinc solution based on discrete wavelet transform and extreme gradient boosting (DWT-XGBoost) algorithms. First, the spectral curve's information region is denoised by using Savitzky-Golay (S-G) smoothing. Then, the denoised spectra is utilized to extract features through discrete wavelet transform and principal component analysis. These features are used as inputs to the XGBoost model to establish prediction models for low and high cobalt ions in zinc solution. Bayesian optimization is implemented to adjust the model's hyperparameters, including learning rate, feature sampling ratio, to enhance the prediction performance. Finally, applying the model to zinc solution samples from a zinc smelter and compared with other state-of-the-art algorithms, the DWT-XGBoost algorithm exhibits the lowest RMSE, MAE and MAPE, with values of 0.034 mg/L, 0.025 mg/L, 6.983 % for low cobalt and with values of 0.231 mg/L, 0.067 mg/L and 0.472 % for high cobalt. The experimental results demonstrate that the DWT-XGBoost model exhibits significantly superior prediction performance.
Collapse
Affiliation(s)
- Hongqiu Zhu
- School of Automation, Central South University, Changsha 410083, China
| | - Jianqiang Yuan
- School of Automation, Central South University, Changsha 410083, China
| | - Qilong Wan
- School of Automation, Central South University, Changsha 410083, China.
| | - Fei Cheng
- School of Automation, Central South University, Changsha 410083, China
| | - Xinran Dong
- School of Automation, Central South University, Changsha 410083, China
| | - Sibo Xia
- School of Automation, Central South University, Changsha 410083, China
| | - Can Zhou
- School of Automation, Central South University, Changsha 410083, China.
| |
Collapse
|
59
|
Vizza P, Aracri F, Guzzi PH, Gaspari M, Veltri P, Tradigo G. Machine learning pipeline to analyze clinical and proteomics data: experiences on a prostate cancer case. BMC Med Inform Decis Mak 2024; 24:93. [PMID: 38584282 PMCID: PMC11000316 DOI: 10.1186/s12911-024-02491-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2024] [Accepted: 03/25/2024] [Indexed: 04/09/2024] Open
Abstract
Proteomic-based analysis is used to identify biomarkers in blood samples and tissues. Data produced by devices such as mass spectrometry requires platforms to identify and quantify proteins (or peptides). Clinical information can be related to mass spectrometry data to identify diseases at an early stage. Machine learning techniques can be used to support physicians and biologists in studying and classifying pathologies. We present the application of machine learning techniques to define a pipeline aimed at studying and classifying proteomics data enriched using clinical information. The pipeline allows users to relate established blood biomarkers with clinical parameters and proteomics data. The proposed pipeline entails three main phases: (i) feature selection, (ii) models training, and (iii) models ensembling. We report the experience of applying such a pipeline to prostate-related diseases. Models have been trained on several biological datasets. We report experimental results about two datasets that result from the integration of clinical and mass spectrometry-based data in the contexts of serum and urine analysis. The pipeline receives input data from blood analytes, tissue samples, proteomic analysis, and urine biomarkers. It then trains different models for feature selection, classification and voting. The presented pipeline has been applied on two datasets obtained in a 2 years research project which aimed to extract hidden information from mass spectrometry, serum, and urine samples from hundreds of patients. We report results on analyzing prostate datasets serum with 143 samples, including 79 PCa and 84 BPH patients, and an urine dataset with 121 samples, including 67 PCa and 54 BPH patients. As results pipeline allowed to identify interesting peptides in the two datasets, 6 for the first one and 2 for the second one. The best model for both serum (AUC=0.87, Accuracy=0.83, F1=0.81, Sensitivity=0.84, Specificity=0.81) and urine (AUC=0.88, Accuracy=0.83, F1=0.83, Sensitivity=0.85, Specificity=0.80) datasets showed good predictive performances. We made the pipeline code available on GitHub and we are confident that it will be successfully adopted in similar clinical setups.
Collapse
Affiliation(s)
- Patrizia Vizza
- Department of Surgical and Medical Sciences, Magna Græcia University, 88100, Catanzaro, Italy
| | - Federica Aracri
- Department of Surgical and Medical Sciences, Magna Græcia University, 88100, Catanzaro, Italy.
| | - Pietro Hiram Guzzi
- Department of Surgical and Medical Sciences, Magna Græcia University, 88100, Catanzaro, Italy
| | - Marco Gaspari
- Department of Experimental and Clinical Medicine, Magna Græcia University, 88100, Catanzaro, Italy
| | - Pierangelo Veltri
- Department of Computers, Modeling, Electronics and Systems Engineering, University of Calabria, 87036, Rende, Italy
| | - Giuseppe Tradigo
- Department of Theoretical and Applied Sciences, eCampus University, 22060, Novedrate, CO, Italy
| |
Collapse
|
60
|
Reveguk I, Simonson T. Classifying protein kinase conformations with machine learning. Protein Sci 2024; 33:e4918. [PMID: 38501429 PMCID: PMC10962494 DOI: 10.1002/pro.4918] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2023] [Revised: 01/02/2024] [Accepted: 01/22/2024] [Indexed: 03/20/2024]
Abstract
Protein kinases are key actors of signaling networks and important drug targets. They cycle between active and inactive conformations, distinguished by a few elements within the catalytic domain. One is the activation loop, whose conserved DFG motif can occupy DFG-in, DFG-out, and some rarer conformations. Annotation and classification of the structural kinome are important, as different conformations can be targeted by different inhibitors and activators. Valuable resources exist; however, large-scale applications will benefit from increased automation and interpretability of structural annotation. Interpretable machine learning models are described for this purpose, based on ensembles of decision trees. To train them, a set of catalytic domain sequences and structures was collected, somewhat larger and more diverse than existing resources. The structures were clustered based on the DFG conformation and manually annotated. They were then used as training input. Two main models were constructed, which distinguished active/inactive and in/out/other DFG conformations. They considered initially 1692 structural variables, spanning the whole catalytic domain, then identified ("learned") a small subset that sufficed for accurate classification. The first model correctly labeled all but 3 of 3289 structures as active or inactive, while the second assigned the correct DFG label to all but 17 of 8826 structures. The most potent classifying variables were all related to well-known structural elements in or near the activation loop and their ranking gives insights into the conformational preferences. The models were used to automatically annotate 3850 kinase structures predicted recently with the Alphafold2 tool, showing that Alphafold2 reproduced the active/inactive but not the DFG-in proportions seen in the Protein Data Bank. We expect the models will be useful for understanding and engineering kinases.
Collapse
Affiliation(s)
- Ivan Reveguk
- Laboratoire de Biologie Structurale de la Cellule (CNRS UMR7654)Ecole PolytechniquePalaiseauFrance
| | - Thomas Simonson
- Laboratoire de Biologie Structurale de la Cellule (CNRS UMR7654)Ecole PolytechniquePalaiseauFrance
| |
Collapse
|
61
|
Zhang S, Yuan Y, Wang Z, Li J. The application of laser‑induced fluorescence in oil spill detection. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2024; 31:23462-23481. [PMID: 38466385 DOI: 10.1007/s11356-024-32807-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/13/2023] [Accepted: 03/03/2024] [Indexed: 03/13/2024]
Abstract
Over the past two decades, oil spills have been one of the most serious ecological disasters, causing massive damage to the aquatic and terrestrial ecosystems as well as the socio-economy. In view of this situation, several methods have been developed and utilized to analyze oil samples. Among these methods, laser-induced fluorescence (LIF) technology has been widely used in oil spill detection due to its classification method, which is based on the fluorescence characteristics of chemical material in oil. This review systematically summarized the LIF technology from the perspective of excitation wavelength selection and the application of traditional and novel machine learning algorithms to fluorescence spectrum processing, both of which are critical for qualitative and quantitative analysis of oil spills. It can be seen that an appropriate excitation wavelength is indispensable for spectral discrimination due to different kinds of polycyclic aromatic hydrocarbons' (PAHs) compounds in petroleum products. By summarizing some articles related to LIF technology, we discuss the influence of the excitation wavelength on the accuracy of the oil spill detection model and proposed several suggestions on the selection of excitation wavelength. In addition, we introduced some traditional and novel machine learning (ML) algorithms and discussed the strengths and weaknesses of these algorithms and their applicable scenarios. With an appropriate excitation wavelength and data processing algorithm, it is believed that laser-induced fluorescence technology will become an efficient technique for real-time detection and analysis of oil spills.
Collapse
Affiliation(s)
- Shubo Zhang
- Department of Optical Science and Engineering, Fudan University, Shanghai, 200433, China
| | - Yafei Yuan
- Department of Sports Media and Information Technology, Shandong Sport University, Jinan, 250102, Shandong, China.
| | - Zhanhu Wang
- Shanghai Institute of Technical Physics, Chinese Academy of Sciences, Shanghai, 200083, China
| | - Jing Li
- Department of Optical Science and Engineering, Fudan University, Shanghai, 200433, China
| |
Collapse
|
62
|
Sun S, Sun Y, Geng J, Geng L, Meng F, Wang Q, Qi H. Machine learning reveals the selection pressure exerted by nonantibiotic pharmaceuticals at environmentally relevant concentrations on antibiotic resistance genotypes. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2024; 357:120829. [PMID: 38579474 DOI: 10.1016/j.jenvman.2024.120829] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/14/2023] [Revised: 02/07/2024] [Accepted: 04/01/2024] [Indexed: 04/07/2024]
Abstract
The emergence and increasing prevalence of antibiotic resistance pose a global public risk for human health, and nonantimicrobial pharmaceuticals play an important role in this process. Herein, five nonantimicrobial pharmaceuticals, including acetaminophen (ACT), clofibric acid (CA), carbamazepine (CBZ), caffeine (CF) and nicotine (NCT), tetracycline-resistant strains, five ARGs (sul1, sul2, tetG, tetM and tetW) and one integrase gene (intI1), were detected in 101 wastewater samples during two typical sewage treatment processes including anaerobic-oxic (A/O) and biological aerated filter (BAF) in Harbin, China. The impact of nonantibiotic pharmaceuticals at environmentally relevant concentrations on both the resistance genotypes and resistance phenotypes were explored. The results showed that a significant impact of nonantibiotic pharmaceuticals at environmentally relevant concentrations on tetracycline resistance genes encoding ribosomal protection proteins (RPPs) was found, while no changes in antibiotic phenotypes, such as minimal inhibitory concentrations (MICs), were observed. Machine learning was applied to further sort out the contribution of nonantibiotic pharmaceuticals at environmentally relevant concentrations to different ARG subtypes. The highest contribution and correlation were found at concentrations of 1400-1800 ng/L for NCT, 900-1500 ng/L for ACT and 7000-10,000 ng/L for CF for tetracycline resistance genes encoding RPPs, while no significant correlation was found between the target compounds and ARGs when their concentrations were lower than 500 ng/L for NCT, 100 ng/L for ACT and 1000 ng/L for CF, which were higher than the concentrations detected in effluent samples. Therefore, the removal of nonantibiotic pharmaceuticals in WWTPs can reduce their selection pressure for resistance genes in wastewater.
Collapse
Affiliation(s)
- Shaojing Sun
- College of Energy and Environmental Engineering, Hebei Key Laboratory of Air Pollution Cause and Impact, Hebei Engineering Research Center of Sewage Treatment and Resource Utilization, Hebei University of Engineering, Handan, 056038, China; State Key Laboratory of Urban Water Resource and Environment, School of Environment, Harbin Institute of Technology, Harbin, 150090, China
| | - Yan Sun
- College of Energy and Environmental Engineering, Hebei Key Laboratory of Air Pollution Cause and Impact, Hebei Engineering Research Center of Sewage Treatment and Resource Utilization, Hebei University of Engineering, Handan, 056038, China
| | - Jialu Geng
- Ecological Environmental Monitoring Centre of Hinggan League, Hinggan League, 137400, China
| | - Linlin Geng
- College of Energy and Environmental Engineering, Hebei Key Laboratory of Air Pollution Cause and Impact, Hebei Engineering Research Center of Sewage Treatment and Resource Utilization, Hebei University of Engineering, Handan, 056038, China
| | - Fan Meng
- State Key Laboratory of Urban Water Resource and Environment, School of Environment, Harbin Institute of Technology, Harbin, 150090, China
| | - Qing Wang
- College of Energy and Environmental Engineering, Hebei Key Laboratory of Air Pollution Cause and Impact, Hebei Engineering Research Center of Sewage Treatment and Resource Utilization, Hebei University of Engineering, Handan, 056038, China
| | - Hong Qi
- State Key Laboratory of Urban Water Resource and Environment, School of Environment, Harbin Institute of Technology, Harbin, 150090, China.
| |
Collapse
|
63
|
Schiavo G, Bertolini F, Bovo S, Galimberti G, Muñoz M, Bozzi R, Čandek-Potokar M, Óvilo C, Fontanesi L. Identification of population-informative markers from high-density genotyping data through combined feature selection and machine learning algorithms: Application to European autochthonous and cosmopolitan pig breeds. Anim Genet 2024; 55:193-205. [PMID: 38191264 DOI: 10.1111/age.13396] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2023] [Revised: 11/09/2023] [Accepted: 12/27/2023] [Indexed: 01/10/2024]
Abstract
Large genotyping datasets, obtained from high-density single nucleotide polymorphism (SNP) arrays, developed for different livestock species, can be used to describe and differentiate breeds or populations. To identify the most discriminating genetic markers among thousands of genotyped SNPs, a few statistical approaches have been proposed. In this study, we applied the Boruta algorithm, a wrapper of the machine learning random forest algorithm, on a database of 23 European pig breeds (20 autochthonous and three cosmopolitan breeds) genotyped with a 70k SNP chip, to pre-select informative SNPs. To identify different sets of SNPs, these pre-selected markers were then ranked with random forest based on their mean decrease accuracy and mean decrease gene indexes. We evaluated the efficiency of these subsets for breed classification and the usefulness of this approach to detect candidate genes affecting breed-specific phenotypes and relevant production traits that might differ among breeds. The lowest overall classification error (2.3%) was reached with a subpanel including only 398 SNPs (ranked based on their mean decrease accuracy), with no classification error in seven breeds using up to 49 SNPs. Several SNPs of these selected subpanels were in genomic regions in which previous studies had identified signatures of selection or genes associated with morphological or production traits that distinguish the analysed breeds. Therefore, even if these approaches have not been originally designed to identify signatures of selection, the obtained results showed that they could potentially be useful for this purpose.
Collapse
Affiliation(s)
- Giuseppina Schiavo
- Animal and Food Genomics Group, Division of Animal Sciences, Department of Agricultural and Food Sciences, University of Bologna, Bologna, Italy
| | - Francesca Bertolini
- Animal and Food Genomics Group, Division of Animal Sciences, Department of Agricultural and Food Sciences, University of Bologna, Bologna, Italy
| | - Samuele Bovo
- Animal and Food Genomics Group, Division of Animal Sciences, Department of Agricultural and Food Sciences, University of Bologna, Bologna, Italy
| | - Giuliano Galimberti
- Department of Statistical Sciences 'Paolo Fortunati', University of Bologna, Bologna, Italy
| | - María Muñoz
- Departamento Mejora Genética Animal, INIA-CSIC, Madrid, Spain
| | - Riccardo Bozzi
- Animal Science Division, Dipartimento di Scienze e Tecnologie Agrarie, Alimentari, Ambientali e Forestali, Università di Firenze, Firenze, Italy
| | | | - Cristina Óvilo
- Departamento Mejora Genética Animal, INIA-CSIC, Madrid, Spain
| | - Luca Fontanesi
- Animal and Food Genomics Group, Division of Animal Sciences, Department of Agricultural and Food Sciences, University of Bologna, Bologna, Italy
| |
Collapse
|
64
|
Li Y, Liu M, Wu X. Insights into biogeochemistry and hot spots distribution characteristics of redox-sensitive elements in the hyporheic zone: Transformation mechanisms and contributing factors. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024; 918:170587. [PMID: 38309342 DOI: 10.1016/j.scitotenv.2024.170587] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/13/2023] [Revised: 01/05/2024] [Accepted: 01/29/2024] [Indexed: 02/05/2024]
Abstract
Biogeochemical hot spots play a crucial role in the cycling and transport of redox-sensitive elements (RSEs) in the hyporheic zone (HZ). However, the transformation mechanisms of RSEs and patterns of RSEs hot spots in the HZ remain poorly understood. In this study, hydrochemistry and multi-isotope (N/C/S/O) datasets were collected to investigate the transformation mechanisms of RSEs, and explore the distribution characteristics of RSEs transformation hot spots. The results showed that spatial variability in key drivers was evident, while temporal change in RSEs concentration was not significant, except for dissolved organic carbon. Bacterial sulfate reduction (BSR) was the primary biogeochemical process for sulfate and occurred throughout the area. Ammonium enrichment was mainly caused by the mineralization of nitrogenous organic matter and anthropogenic inputs, with adsorption serving as the primary attenuation mechanism. Carbon dynamics were influenced by various biogeochemical processes, with dissolved organic carbon mainly derived from C3 plants and dissolved inorganic carbon from weathering of carbonate rocks and decomposition of organic matter. The peak contribution of dissolved organic carbon decomposition to the DIC pool was 46.44 %. The concentration thresholds for the ammonium enrichment and BSR hot spots were identified as 1.5 mg/L and 8.84 mg/L, respectively. The distribution pattern of RSEs hot spots was closely related to the hydrogeological conditions. Our findings reveal the complex evolution mechanisms and hot spots distribution characteristics of RSEs in the HZ, providing a basis for the safe utilization and protection of groundwater resources.
Collapse
Affiliation(s)
- Yu Li
- Beijing Key Laboratory of Water Resources & Environmental Engineering, China University of Geosciences (Beijing), Beijing 100083, China; School of Water Resources and Environment, China University of Geosciences (Beijing), Beijing 100083, China
| | - Mingzhu Liu
- Beijing Key Laboratory of Water Resources & Environmental Engineering, China University of Geosciences (Beijing), Beijing 100083, China; School of Water Resources and Environment, China University of Geosciences (Beijing), Beijing 100083, China.
| | - Xiong Wu
- School of Water Resources and Environment, China University of Geosciences (Beijing), Beijing 100083, China
| |
Collapse
|
65
|
Zhang T, Yan L, Wei M, Su R, Qi J, Sun S, Song Y, Li X, Zhang D. Bioaerosols in the coastal region of Qingdao: Community diversity, impact factors and synergistic effect. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024; 916:170246. [PMID: 38246385 DOI: 10.1016/j.scitotenv.2024.170246] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/19/2023] [Revised: 12/26/2023] [Accepted: 01/15/2024] [Indexed: 01/23/2024]
Abstract
Atmospheric bioaerosols are influenced by multiple factors, including physical, chemical, and biotic interactions, and pose a significant threat to the public health and the environment. The nonnegligible truth however is that the primary driver of the changes in bioaerosol community diversity remains unknown. In this study, putative biological association (PBA) was obtained by constructing an ecological network. The relationship between meteorological conditions, atmospheric pollutants, water-soluble inorganic ions, PBA and bioaerosol community diversity was analyzed using random forest regression (RFR)-An ensemble learning algorithm based on a decision tree that performs regression tasks by constructing multiple decision trees and integrating the predicted results, and the contribution of different rich species to PBA was predicted. The species richness, evenness and diversity varied significantly in different seasons, with the highest in summer, followed by autumn and spring, and was lowest in winter. The RFR suggested that the explanation rate of alpha diversity increased significantly from 73.74 % to 85.21 % after accounting for the response of the PBA to diversity. The PBA, temperature, air pollution, and marine source air masses were the most crucial factors driving community diversity. PBA, particularly putative positive association (PPA), had the highest significance in diversity. We found that under changing external conditions, abundant taxa tend to cooperate to resist external pressure, thereby promoting PPA. In contrast, rare taxa were more responsive to the putative negative association because of their sensitivity to environmental changes. The results of this research provided scientific advance in the understanding of the dynamic and temporal changes in bioaerosols, as well as support for the prevention and control of microbial contamination of the atmosphere.
Collapse
Affiliation(s)
- Ting Zhang
- Key Laboratory of Marine Chemistry Theory and Technology, Ministry of Education, Ocean University of China, Qingdao 266100, PR China
| | - Lingchong Yan
- Key Laboratory of Marine Chemistry Theory and Technology, Ministry of Education, Ocean University of China, Qingdao 266100, PR China
| | - Mingming Wei
- Laoshan District Meteorological Bureau, Qingdao 266107, PR China
| | - Rongguo Su
- Key Laboratory of Marine Chemistry Theory and Technology, Ministry of Education, Ocean University of China, Qingdao 266100, PR China
| | - Jianhua Qi
- Key Laboratory of Marine Environment and Ecology, Ministry of Education, Ocean University of China, Qingdao 266100, PR China
| | - Shaohua Sun
- Laoshan District Meteorological Bureau, Qingdao 266107, PR China
| | - Yongzhong Song
- Jufeng Peak Tourist Management Service Center of Laoshan Scenic Spot, Qingdao 266100, PR China
| | - Xianguo Li
- Key Laboratory of Marine Chemistry Theory and Technology, Ministry of Education, Ocean University of China, Qingdao 266100, PR China
| | - Dahai Zhang
- Key Laboratory of Marine Chemistry Theory and Technology, Ministry of Education, Ocean University of China, Qingdao 266100, PR China.
| |
Collapse
|
66
|
Young JA, Chang CW, Scales CW, Menon SV, Holy CE, Blackie CA. Machine Learning Methods Using Artificial Intelligence Deployed on Electronic Health Record Data for Identification and Referral of At-Risk Patients From Primary Care Physicians to Eye Care Specialists: Retrospective, Case-Controlled Study. JMIR AI 2024; 3:e48295. [PMID: 38875582 PMCID: PMC11041486 DOI: 10.2196/48295] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/18/2023] [Revised: 07/11/2023] [Accepted: 02/10/2024] [Indexed: 06/16/2024]
Abstract
BACKGROUND Identification and referral of at-risk patients from primary care practitioners (PCPs) to eye care professionals remain a challenge. Approximately 1.9 million Americans suffer from vision loss as a result of undiagnosed or untreated ophthalmic conditions. In ophthalmology, artificial intelligence (AI) is used to predict glaucoma progression, recognize diabetic retinopathy (DR), and classify ocular tumors; however, AI has not yet been used to triage primary care patients for ophthalmology referral. OBJECTIVE This study aimed to build and compare machine learning (ML) methods, applicable to electronic health records (EHRs) of PCPs, capable of triaging patients for referral to eye care specialists. METHODS Accessing the Optum deidentified EHR data set, 743,039 patients with 5 leading vision conditions (age-related macular degeneration [AMD], visually significant cataract, DR, glaucoma, or ocular surface disease [OSD]) were exact-matched on age and gender to 743,039 controls without eye conditions. Between 142 and 182 non-ophthalmic parameters per patient were input into 5 ML methods: generalized linear model, L1-regularized logistic regression, random forest, Extreme Gradient Boosting (XGBoost), and J48 decision tree. Model performance was compared for each pathology to select the most predictive algorithm. The area under the curve (AUC) was assessed for all algorithms for each outcome. RESULTS XGBoost demonstrated the best performance, showing, respectively, a prediction accuracy and an AUC of 78.6% (95% CI 78.3%-78.9%) and 0.878 for visually significant cataract, 77.4% (95% CI 76.7%-78.1%) and 0.858 for exudative AMD, 79.2% (95% CI 78.8%-79.6%) and 0.879 for nonexudative AMD, 72.2% (95% CI 69.9%-74.5%) and 0.803 for OSD requiring medication, 70.8% (95% CI 70.5%-71.1%) and 0.785 for glaucoma, 85.0% (95% CI 84.2%-85.8%) and 0.924 for type 1 nonproliferative diabetic retinopathy (NPDR), 82.2% (95% CI 80.4%-84.0%) and 0.911 for type 1 proliferative diabetic retinopathy (PDR), 81.3% (95% CI 81.0%-81.6%) and 0.891 for type 2 NPDR, and 82.1% (95% CI 81.3%-82.9%) and 0.900 for type 2 PDR. CONCLUSIONS The 5 ML methods deployed were able to successfully identify patients with elevated odds ratios (ORs), thus capable of patient triage, for ocular pathology ranging from 2.4 (95% CI 2.4-2.5) for glaucoma to 5.7 (95% CI 5.0-6.4) for type 1 NPDR, with an average OR of 3.9. The application of these models could enable PCPs to better identify and triage patients at risk for treatable ophthalmic pathology. Early identification of patients with unrecognized sight-threatening conditions may lead to earlier treatment and a reduced economic burden. More importantly, such triage may improve patients' lives.
Collapse
Affiliation(s)
- Joshua A Young
- Department of Ophthalmology, New York University School of Medicine, New York, NY, United States
| | - Chin-Wen Chang
- Data Science, Johnson & Johnson MedTech, Raritan, NJ, United States
| | - Charles W Scales
- Medical and Scientific Operations, Johnson & Johnson Medtech, Vision, Jacksonville, FL, United States
| | - Saurabh V Menon
- Mu Sigma Business Solutions Private Limited, Bangalore, India
| | - Chantal E Holy
- Epidemiology and Real-World Data Sciences, Johnson & Johnson MedTech, New Brunswick, NJ, United States
| | - Caroline Adrienne Blackie
- Medical and Scientific Operations, Johnson & Johnson MedTech, Vision, Jacksonville, FL, United States
| |
Collapse
|
67
|
Hadish JA, Hargarten HL, Zhang H, Mattheis JP, Honaas LA, Ficklin SP. Towards identification of postharvest fruit quality transcriptomic markers in Malus domestica. PLoS One 2024; 19:e0297015. [PMID: 38446822 PMCID: PMC10917293 DOI: 10.1371/journal.pone.0297015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2023] [Accepted: 12/27/2023] [Indexed: 03/08/2024] Open
Abstract
Gene expression is highly impacted by the environment and can be reflective of past events that affected developmental processes. It is therefore expected that gene expression can serve as a signal of a current or future phenotypic traits. In this paper we identify sets of genes, which we call Prognostic Transcriptomic Biomarkers (PTBs), that can predict firmness in Malus domestica (apple) fruits. In apples, all individuals of a cultivar are clones, and differences in fruit quality are due to the environment. The apples transcriptome responds to these differences in environment, which makes PTBs an attractive predictor of future fruit quality. PTBs have the potential to enhance supply chain efficiency, reduce crop loss, and provide higher and more consistent quality for consumers. However, several questions must be addressed. In this paper we answer the question of which of two common modeling approaches, Random Forest or ElasticNet, outperforms the other. We answer if PTBs with few genes are efficient at predicting traits. This is important because we need few genes to perform qPCR, and we answer the question if qPCR is a cost-effective assay as input for PTBs modeled using high-throughput RNA-seq. To do this, we conducted a pilot study using fruit texture in the 'Gala' variety of apples across several postharvest storage regiments. Fruit texture in 'Gala' apples is highly controllable by post-harvest treatments and is therefore a good candidate to explore the use of PTBs. We find that the RandomForest model is more consistent than an ElasticNet model and is predictive of firmness (r2 = 0.78) with as few as 15 genes. We also show that qPCR is reasonably consistent with RNA-seq in a follow up experiment. Results are promising for PTBs, yet more work is needed to ensure that PTBs are robust across various environmental conditions and storage treatments.
Collapse
Affiliation(s)
- John A. Hadish
- Molecular Plant Science Department, Washington State University, Pullman, Washington, United States of America
- Department of Horticulture, Washington State University, Pullman, Washington, United States of America
| | - Heidi L. Hargarten
- USDA Agricultural Research Service Physiology and Pathology of Tree Fruits Research, Wenatchee, Washington, United States of America
| | - Huiting Zhang
- Department of Horticulture, Washington State University, Pullman, Washington, United States of America
| | - James P. Mattheis
- USDA Agricultural Research Service Physiology and Pathology of Tree Fruits Research, Wenatchee, Washington, United States of America
| | - Loren A. Honaas
- USDA Agricultural Research Service Physiology and Pathology of Tree Fruits Research, Wenatchee, Washington, United States of America
| | - Stephen P. Ficklin
- Molecular Plant Science Department, Washington State University, Pullman, Washington, United States of America
- Department of Horticulture, Washington State University, Pullman, Washington, United States of America
| |
Collapse
|
68
|
Yu L, Ruan X, Huang W, Huang N, Zeng J, He J, He R, Yang K. Machine learning-based prediction of in-hospital mortality in patients with pneumonic chronic obstructive pulmonary disease exacerbations. J Asthma 2024; 61:212-221. [PMID: 37738216 DOI: 10.1080/02770903.2023.2263071] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2023] [Revised: 09/15/2023] [Accepted: 09/21/2023] [Indexed: 09/24/2023]
Abstract
OBJECTIVE While linear regression and LASSO models have been established for predicting in-hospital mortality, there is currently no validated clinical prediction algorithm to predict in-hospital mortality for patients with chronic obstructive pulmonary disease (COPD) exacerbations using machine learning. Thus, we will evaluate the BAP-65 and CURB-65, and construct a novel prediction model using the random forest (RF) technique. METHODS A dataset of 1,418 patients with COPD exacerbations was collected. Age, gender, mental status, vital signs, and laboratory results were all taken into account for predictors. The categorical outcome variable was hospital-based mortality of people over 65 years. The dataset was divided randomly into a training dataset (70%) and a testing dataset (30%). We trained three prediction models, BAP-65, CURB-65, and the RF model, estimated the area under the receiver operating characteristic curve (AUROC) for the entire dataset. We also conducted a comparison of the AUROC values using the Delong test. RESULTS A total of 658 individuals with COPD acute exacerbations were enrolled. Our analysis using the receiver operating characteristic curve demonstrated that the RF model exhibited excellent performance, with an AUROC of 0.80 (95% confidence interval: 0.75-0.84). In comparison, the BAP-65 prediction model yielded an AUROC of 0.72 (0.68-0.75), while the CURB-65 prediction model achieved an AUROC of 0.69 (0.67-0.73). CONCLUSIONS The RF model demonstrated superior predictive capabilities than the BAP-65 and CURB-65 models in predicting in-hospital mortality. The results further highlighted significant factors for predicting in-hospital mortality, including blood eosinophil count, systolic blood pressure, and prior history of asthma.
Collapse
Affiliation(s)
- Lin Yu
- Department of Respiratory and Critical Care Medicine, First Affiliated Hospital of Chengdu Medical College, Chengdu, Sichuan, China
- School of Clinical Medicine, Chengdu Medical College, Chengdu, Sichuan, China
| | - Xia Ruan
- Department of Respiratory and Critical Care Medicine, First Affiliated Hospital of Chengdu Medical College, Chengdu, Sichuan, China
- School of Clinical Medicine, Chengdu Medical College, Chengdu, Sichuan, China
| | - Wenbo Huang
- School of Clinical Medicine, Chengdu Medical College, Chengdu, Sichuan, China
| | - Na Huang
- Department of Respiratory and Critical Care Medicine, First Affiliated Hospital of Chengdu Medical College, Chengdu, Sichuan, China
- School of Clinical Medicine, Chengdu Medical College, Chengdu, Sichuan, China
| | - Jun Zeng
- Department of Respiratory and Critical Care Medicine, First Affiliated Hospital of Chengdu Medical College, Chengdu, Sichuan, China
- School of Clinical Medicine, Chengdu Medical College, Chengdu, Sichuan, China
| | - Jie He
- Department of Respiratory and Critical Care Medicine, First Affiliated Hospital of Chengdu Medical College, Chengdu, Sichuan, China
- School of Clinical Medicine, Chengdu Medical College, Chengdu, Sichuan, China
| | - Rong He
- Department of Respiratory and Critical Care Medicine, First Affiliated Hospital of Chengdu Medical College, Chengdu, Sichuan, China
- School of Clinical Medicine, Chengdu Medical College, Chengdu, Sichuan, China
| | - Kai Yang
- Department of Respiratory and Critical Care Medicine, First Affiliated Hospital of Chengdu Medical College, Chengdu, Sichuan, China
- School of Clinical Medicine, Chengdu Medical College, Chengdu, Sichuan, China
| |
Collapse
|
69
|
You Y, Wang Y, Yu X, Gao F, Li M, Li Y, Wang X, Jia L, Shi G, Yang L. Prediction of lymph node metastasis in advanced gastric adenocarcinoma based on dual-energy CT radiomics: focus on the features of lymph nodes with a short axis diameter ≥6 mm. Front Oncol 2024; 14:1369051. [PMID: 38496754 PMCID: PMC10940341 DOI: 10.3389/fonc.2024.1369051] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2024] [Accepted: 02/15/2024] [Indexed: 03/19/2024] Open
Abstract
Objective To explore the value of the features of lymph nodes (LNs) with a short-axis diameter ≥6 mm in predicting lymph node metastasis (LNM) in advanced gastric adenocarcinoma (GAC) based on dual-energy CT (DECT) radiomics. Materials and methods Data of patients with GAC who underwent radical gastrectomy and LN dissection were retrospectively analyzed. To ensure the correspondence between imaging and pathology, metastatic LNs were only selected from patients with pN3, nonmetastatic LNs were selected from patients with pN0, and the short-axis diameters of the enrolled LNs were all ≥6 mm. The traditional features of LNs were recorded, including short-axis diameter, long-axis diameter, long-to-short-axis ratio, position, shape, density, edge, and the degree of enhancement; univariate and multivariate logistic regression analyses were used to establish a clinical model. Radiomics features at the maximum level of LNs were extracted in venous phase equivalent 120 kV linear fusion images and iodine maps. Intraclass correlation coefficients and the Boruta algorithm were used to screen significant features, and random forest was used to build a radiomics model. To construct a combined model, we included the traditional features with statistical significance in univariate analysis and radiomics scores (Rad-score) in multivariate logistic regression analysis. Receiver operating curve (ROC) curves and the DeLong test were used to evaluate and compare the diagnostic performance of the models. Decision curve analysis (DCA) was used to evaluate the clinical benefits of the models. Results This study included 114 metastatic LNs from 36 pN3 cases and 65 nonmetastatic LNs from 28 pN0 cases. The samples were divided into a training set (n=125) and a validation set (n=54) at a ratio of 7:3. Long-axis diameter and LN shape were independent predictors of LNM and were used to establish the clinical model; 27 screened radiomics features were used to build the radiomics model. LN shape and Rad-score were independent predictors of LNM and were used to construct the combined model. Both the radiomics model (area under the curve [AUC] of 0.986 and 0.984) and the combined model (AUC of 0.970 and 0.977) outperformed the clinical model (AUC of 0.772 and 0.820) in predicting LNM in both the training and validation sets. DCA showed superior clinical benefits from radiomics and combined models. Conclusion The models based on DECT LN radiomics features or combined traditional features have high diagnostic performance in determining the nature of each LN with a short-axis diameter of ≥6 mm in advanced GAC.
Collapse
Affiliation(s)
- Yang You
- Department of Computed Tomography and Magnetic Resonance, The Fourth Hospital of Hebei Medical University, Shijiazhuang, China
| | - Yan Wang
- Department of Computed Tomography and Magnetic Resonance, The Fourth Hospital of Hebei Medical University, Shijiazhuang, China
| | - Xianbo Yu
- CT Collaboration, Siemens Healthineers Ltd., Beijing, China
| | - Fengxiao Gao
- Department of Computed Tomography and Magnetic Resonance, Xing Tai People’s Hospital, Xingtai, China
| | - Min Li
- Department of Computed Tomography and Magnetic Resonance, The Fourth Hospital of Hebei Medical University, Shijiazhuang, China
| | - Yang Li
- Department of Computed Tomography and Magnetic Resonance, The Fourth Hospital of Hebei Medical University, Shijiazhuang, China
| | - Xiangming Wang
- Department of Computed Tomography and Magnetic Resonance, The Fourth Hospital of Hebei Medical University, Shijiazhuang, China
| | - Litao Jia
- Department of Computed Tomography and Magnetic Resonance, The Fourth Hospital of Hebei Medical University, Shijiazhuang, China
| | - Gaofeng Shi
- Department of Computed Tomography and Magnetic Resonance, The Fourth Hospital of Hebei Medical University, Shijiazhuang, China
| | - Li Yang
- Department of Computed Tomography and Magnetic Resonance, The Fourth Hospital of Hebei Medical University, Shijiazhuang, China
| |
Collapse
|
70
|
Shi X, Chen S, Wang Q, Lu Y, Ren S, Huang J. Mechanical Framework for Geopolymer Gels Construction: An Optimized LSTM Technique to Predict Compressive Strength of Fly Ash-Based Geopolymer Gels Concrete. Gels 2024; 10:148. [PMID: 38391478 PMCID: PMC10887719 DOI: 10.3390/gels10020148] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2024] [Revised: 02/08/2024] [Accepted: 02/11/2024] [Indexed: 02/24/2024] Open
Abstract
As an environmentally responsible alternative to conventional concrete, geopolymer concrete recycles previously used resources to prepare the cementitious component of the product. The challenging issue with employing geopolymer concrete in the building business is the absence of a standard mix design. According to the chemical composition of its components, this work proposes a thorough system or framework for estimating the compressive strength of fly ash-based geopolymer concrete (FAGC). It could be possible to construct a system for predicting the compressive strength of FAGC using soft computing methods, thereby avoiding the requirement for time-consuming and expensive experimental tests. A complete database of 162 compressive strength datasets was gathered from the research papers that were published between the years 2000 and 2020 and prepared to develop proposed models. To address the relationships between inputs and output variables, long short-term memory networks were deployed. Notably, the proposed model was examined using several soft computing methods. The modeling process incorporated 17 variables that affect the CSFAG, such as percentage of SiO2 (SiO2), percentage of Na2O (Na2O), percentage of CaO (CaO), percentage of Al2O3 (Al2O3), percentage of Fe2O3 (Fe2O3), fly ash (FA), coarse aggregate (CAgg), fine aggregate (FAgg), Sodium Hydroxide solution (SH), Sodium Silicate solution (SS), extra water (EW), superplasticizer (SP), SH concentration, percentage of SiO2 in SS, percentage of Na2O in SS, curing time, curing temperature that the proposed model was examined to several soft computing methods such as multi-layer perception neural network (MLPNN), Bayesian regularized neural network (BRNN), generalized feed-forward neural networks (GFNN), support vector regression (SVR), decision tree (DT), random forest (RF), and LSTM. Three main innovations of this study are using the LSTM model for predicting FAGC, optimizing the LSTM model by a new evolutionary algorithm called the marine predators algorithm (MPA), and considering the six new inputs in the modeling process, such as aggregate to total mass ratio, fine aggregate to total aggregate mass ratio, FASiO2:Al2O3 molar ratio, FA SiO2:Fe2O3 molar ratio, AA Na2O:SiO2 molar ratio, and the sum of SiO2, Al2O3, and Fe2O3 percent in FA. The performance capacity of LSTM-MPA was evaluated with other artificial intelligence models. The results indicate that the R2 and RMSE values for the proposed LSTM-MPA model were as follows: MLPNN (R2 = 0.896, RMSE = 3.745), BRNN (R2 = 0.931, RMSE = 2.785), GFFNN (R2 = 0.926, RMSE = 2.926), SVR-L (R2 = 0.921, RMSE = 3.017), SVR-P (R2 = 0.920, RMSE = 3.291), SVR-S (R2 = 0.934, RMSE = 2.823), SVR-RBF (R2 = 0.916, RMSE = 3.114), DT (R2 = 0.934, RMSE = 2.711), RF (R2 = 0.938, RMSE = 2.892), LSTM (R2 = 0.9725, RMSE = 1.7816), LSTM-MPA (R2 = 0.9940, RMSE = 0.8332), and LSTM-PSO (R2 = 0.9804, RMSE = 1.5221). Therefore, the proposed LSTM-MPA model can be employed as a reliable and accurate model for predicting CSFAG. Noteworthy, the results demonstrated the significance and influence of fly ash and sodium silicate solution chemical compositions on the compressive strength of FAGC. These variables could adequately present variations in the best mix designs discovered in earlier investigations. The suggested approach may also save time and money by accurately estimating the compressive strength of FAGC with low calcium content.
Collapse
Affiliation(s)
- Xuyang Shi
- School of Mines, China University of Mining and Technology, Xuzhou 221116, China
| | - Shuzhao Chen
- School of Mines, China University of Mining and Technology, Xuzhou 221116, China
| | - Qiang Wang
- School of Mines, China University of Mining and Technology, Xuzhou 221116, China
| | - Yijun Lu
- School of Civil Engineering, Guangzhou University, Guangzhou 510006, China
| | - Shisong Ren
- Faculty of Civil Engineering and Geosciences, Delft University of Technology, 2628 Delft, The Netherlands
| | - Jiandong Huang
- School of Civil Engineering, Guangzhou University, Guangzhou 510006, China
| |
Collapse
|
71
|
Zhang YH, Xie LH, Li J, Qi YW, Shi JJ. Classification and clinical significance of immunogenic cell death-related genes in Plasmodium falciparum infection determined by integrated bioinformatics analysis and machine learning. Malar J 2024; 23:48. [PMID: 38360586 PMCID: PMC10868002 DOI: 10.1186/s12936-024-04877-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2023] [Accepted: 02/10/2024] [Indexed: 02/17/2024] Open
Abstract
BACKGROUND Immunogenic cell death (ICD) is a type of regulated cell death that plays a crucial role in activating the immune system in response to various stressors, including cancer cells and pathogens. However, the involvement of ICD in the human immune response against malaria remains to be defined. METHODS In this study, data from Plasmodium falciparum infection cohorts, derived from cross-sectional studies, were analysed to identify ICD subtypes and their correlation with parasitaemia and immune responses. Using consensus clustering, ICD subtypes were identified, and their association with the immune landscape was assessed by employing ssGSEA. Differentially expressed genes (DEGs) analysis, functional enrichment, protein-protein interaction networks, and machine learning (least absolute shrinkage and selection operator (LASSO) regression and random forest) were used to identify ICD-associated hub genes linked with high parasitaemia. A nomogram visualizing these genes' correlation with parasitaemia levels was developed, and its performance was evaluated using receiver operating characteristic (ROC) curves. RESULTS In the P. falciparum infection cohort, two ICD-associated subtypes were identified, with subtype 1 showing better adaptive immune responses and lower parasitaemia compared to subtype 2. DEGs analysis revealed upregulation of proliferative signalling pathways, T-cell receptor signalling pathways and T-cell activation and differentiation in subtype 1, while subtype 2 exhibited elevated cytokine signalling and inflammatory responses. PPI network construction and machine learning identified CD3E and FCGR1A as candidate hub genes. A constructed nomogram integrating these genes demonstrated significant classification performance of high parasitaemia, which was evidenced by AUC values ranging from 0.695 to 0.737 in the training set and 0.911 to 0.933 and 0.759 to 0.849 in two validation sets, respectively. Additionally, significant correlations between the expressions of these genes and the clinical manifestation of P. falciparum infection were observed. CONCLUSION This study reveals the existence of two ICD subtypes in the human immune response against P. falciparum infection. Two ICD-associated candidate hub genes were identified, and a nomogram was constructed for the classification of high parasitaemia. This study can deepen the understanding of the human immune response to P. falciparum infection and provide new targets for the prevention and control of malaria.
Collapse
Affiliation(s)
- Yan-Hui Zhang
- Key Laboratory of Gastrointestinal Cancer (Fujian Medical University), Ministry of Education, Fuzhou, China.
| | - Li-Hua Xie
- Key Laboratory of Gastrointestinal Cancer (Fujian Medical University), Ministry of Education, Fuzhou, China
| | - Jian Li
- State Key Laboratory of Cellular Stress Biology, Innovation Center for Cell Signaling Network, School of Life Sciences, Xiamen University, Xiamen, Fujian, China
| | - Yan-Wei Qi
- Department of Pathogenic Biology and Immunology, School of Basic Medical Sciences, Guangzhou Medical University, Guangzhou, China
| | - Jia-Jian Shi
- Key Laboratory of Gastrointestinal Cancer (Fujian Medical University), Ministry of Education, Fuzhou, China
| |
Collapse
|
72
|
Zou F, Xiao J, Jin Y, Jian R, Hu Y, Liang X, Ma W, Zhu S. Multilayer factors associated with excess all-cause mortality during the omicron and non-omicron waves of the COVID-19 pandemic: time series analysis in 29 countries. BMC Public Health 2024; 24:350. [PMID: 38308279 PMCID: PMC10835930 DOI: 10.1186/s12889-024-17803-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2023] [Accepted: 01/18/2024] [Indexed: 02/04/2024] Open
Abstract
BACKGROUND The COVID-19 pandemic has resulted in significant excess mortality globally. However, the differences in excess mortality between the Omicron and non-Omicron waves, as well as the contribution of local epidemiological characteristics, population immunity, and social factors to excess mortality, remain poorly understood. This study aims to solve the above problems. METHODS Weekly all-cause death data and covariates from 29 countries for the period 2015-2022 were collected and used. The Bayesian Structured Time Series Model predicted expected weekly deaths, stratified by gender and age groups for the period 2020-2022. The quantile-based g-computation approach accounted for the effects of factors on the excess all-cause mortality rate. Sensitivity analyses were conducted using alternative Omicron proportion thresholds. RESULTS From the first week of 2021 to the 30th week of 2022, the estimated cumulative number of excess deaths due to COVID-19 globally was nearly 1.39 million. The estimated weekly excess all-cause mortality rate in the 29 countries was approximately 2.17 per 100,000 (95% CI: 1.47 to 2.86). Weekly all-cause excess mortality rates were significantly higher in both male and female groups and all age groups during the non-Omicron wave, except for those younger than 15 years (P < 0.001). Sensitivity analysis confirmed the stability of the results. Positive associations with all-cause excess mortality were found for the constituent ratio of non-Omicron in all variants, new cases per million, positive rate, cardiovascular death rate, people fully vaccinated per hundred, extreme poverty, hospital patients per million humans, people vaccinated per hundred, and stringency index. Conversely, other factors demonstrated negative associations with all-cause excess mortality from the first week of 2021 to the 30th week of 2022. CONCLUSION Our findings indicate that the COVID-19 Omicron wave was associated with lower excess mortality compared to the non-Omicron wave. This study's analysis of the factors influencing excess deaths suggests that effective strategies to mitigate all-cause mortality include improving economic conditions, promoting widespread vaccination, and enhancing overall population health. Implementing these measures could significantly reduce the burden of COVID-19, facilitate coexistence with the virus, and potentially contribute to its elimination.
Collapse
Affiliation(s)
- Fengjuan Zou
- Department of Epidemiology, School of Medicine, Jinan University, No.601 Huangpu Road West, Guangzhou, Guangdong, 510632, China
| | - Jianpeng Xiao
- Guangdong Provincial Institute of Public Health, Guangdong Provincial Center for Disease Control and Prevention, Guangzhou, Guangdong, 511430, China
| | - Yingying Jin
- Department of Epidemiology, School of Medicine, Jinan University, No.601 Huangpu Road West, Guangzhou, Guangdong, 510632, China
| | - Ronghua Jian
- Department of Epidemiology, School of Medicine, Jinan University, No.601 Huangpu Road West, Guangzhou, Guangdong, 510632, China
| | - Yijun Hu
- Department of Epidemiology, School of Medicine, Jinan University, No.601 Huangpu Road West, Guangzhou, Guangdong, 510632, China
| | - Xiaofeng Liang
- Disease Control and Prevention Institute, Jinan University, No.601 Huangpu Road West, Guangzhou, Guangdong, 510632, China
- Chinese Preventive Medicine Association, Beijing, 100062, China
| | - Wenjun Ma
- Department of Epidemiology, School of Medicine, Jinan University, No.601 Huangpu Road West, Guangzhou, Guangdong, 510632, China.
| | - Sui Zhu
- Department of Epidemiology, School of Medicine, Jinan University, No.601 Huangpu Road West, Guangzhou, Guangdong, 510632, China.
| |
Collapse
|
73
|
Uddin MG, Nash S, Rahman A, Dabrowski T, Olbert AI. Data-driven modelling for assessing trophic status in marine ecosystems using machine learning approaches. ENVIRONMENTAL RESEARCH 2024; 242:117755. [PMID: 38008200 DOI: 10.1016/j.envres.2023.117755] [Citation(s) in RCA: 11] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/17/2023] [Revised: 10/05/2023] [Accepted: 11/20/2023] [Indexed: 11/28/2023]
Abstract
Assessing eutrophication in coastal and transitional waters is of utmost importance, yet existing Trophic Status Index (TSI) models face challenges like multicollinearity, data redundancy, inappropriate aggregation methods, and complex classification schemes. To tackle these issues, we developed a novel tool that harnesses machine learning (ML) and artificial intelligence (AI), enhancing the reliability and accuracy of trophic status assessments. Our research introduces an improved data-driven methodology specifically tailored for transitional and coastal (TrC) waters, with a focus on Cork Harbour, Ireland, as a case study. Our innovative approach, named the Assessment Trophic Status Index (ATSI) model, comprises three main components: the selection of pertinent water quality indicators, the computation of ATSI scores, and the implementation of a new classification scheme. To optimize input data and minimize redundancy, we employed ML techniques, including advanced deep learning methods. Specifically, we developed a CHL prediction model utilizing ten algorithms, among which XGBoost demonstrated exceptional performance, showcasing minimal errors during both training (RMSE = 0.0, MSE = 0.0, MAE = 0.01) and testing (RMSE = 0.0, MSE = 0.0, MAE = 0.01) phases. Utilizing a novel linear rescaling interpolation function, we calculated ATSI scores and evaluated the model's sensitivity and efficiency across diverse application domains, employing metrics such as R2, the Nash-Sutcliffe efficiency (NSE), and the model efficiency factor (MEF). The results consistently revealed heightened sensitivity and efficiency across all application domains. Additionally, we introduced a brand new classification scheme for ranking the trophic status of transitional and coastal waters. To assess spatial sensitivity, we applied the ATSI model to four distinct waterbodies in Ireland, comparing trophic assessment outcomes with the Assessment of Trophic Status of Estuaries and Bays in Ireland (ATSEBI) System. Remarkably, significant disparities between the ATSI and ATSEBI System were evident in all domains, except for Mulroy Bay. Overall, our research significantly enhances the accuracy of trophic status assessments in marine ecosystems. The ATSI model, combined with cutting-edge ML techniques and our new classification scheme, represents a promising avenue for evaluating and monitoring trophic conditions in TrC waters. The study also demonstrated the effectiveness of ATSI in assessing trophic status across various waterbodies, including lakes, rivers, and more. These findings make substantial contributions to the field of marine ecosystem management and conservation.
Collapse
Affiliation(s)
- Md Galal Uddin
- School of Engineering, University of Galway, Ireland; Ryan Institute, University of Galway, Ireland; MaREI Research Centre, University of Galway, Ireland; Eco-HydroInformatics Research Group (EHIRG), Civil Engineering, University of Galway, Ireland.
| | - Stephen Nash
- School of Engineering, University of Galway, Ireland; Ryan Institute, University of Galway, Ireland; MaREI Research Centre, University of Galway, Ireland
| | - Azizur Rahman
- School of Computing, Mathematics and Engineering, Charles Sturt University, Wagga Wagga, Australia; The Gulbali Institute of Agriculture, Water and Environment, Charles Sturt University, Wagga Wagga, Australia
| | | | - Agnieszka I Olbert
- School of Engineering, University of Galway, Ireland; Ryan Institute, University of Galway, Ireland; MaREI Research Centre, University of Galway, Ireland; Eco-HydroInformatics Research Group (EHIRG), Civil Engineering, University of Galway, Ireland
| |
Collapse
|
74
|
Ji IH, Lee JH, Kang MJ, Park WJ, Jeon SH, Seo JT. Artificial Intelligence-Based Anomaly Detection Technology over Encrypted Traffic: A Systematic Literature Review. SENSORS (BASEL, SWITZERLAND) 2024; 24:898. [PMID: 38339615 PMCID: PMC10857182 DOI: 10.3390/s24030898] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/31/2023] [Revised: 12/31/2023] [Accepted: 01/26/2024] [Indexed: 02/12/2024]
Abstract
As cyber-attacks increase in unencrypted communication environments such as the traditional Internet, protected communication channels based on cryptographic protocols, such as transport layer security (TLS), have been introduced to the Internet. Accordingly, attackers have been carrying out cyber-attacks by hiding themselves in protected communication channels. However, the nature of channels protected by cryptographic protocols makes it difficult to distinguish between normal and malicious network traffic behaviors. This means that traditional anomaly detection models with features from packets extracted a deep packet inspection (DPI) have been neutralized. Recently, studies on anomaly detection using artificial intelligence (AI) and statistical characteristics of traffic have been proposed as an alternative. In this review, we provide a systematic review for AI-based anomaly detection techniques over encrypted traffic. We set several research questions on the review topic and collected research according to eligibility criteria. Through the screening process and quality assessment, 30 research articles were selected with high suitability to be included in the review from the collected literature. We reviewed the selected research in terms of dataset, feature extraction, feature selection, preprocessing, anomaly detection algorithm, and performance indicators. As a result of the literature review, it was confirmed that various techniques used for AI-based anomaly detection over encrypted traffic were used. Some techniques are similar to those used for AI-based anomaly detection over unencrypted traffic, but some technologies are different from those used for unencrypted traffic.
Collapse
Affiliation(s)
- Il Hwan Ji
- Department of Information Security, Gachon University, Seongnam-si 1342, Republic of Korea; (I.H.J.); (J.H.L.)
| | - Ju Hyeon Lee
- Department of Information Security, Gachon University, Seongnam-si 1342, Republic of Korea; (I.H.J.); (J.H.L.)
| | - Min Ji Kang
- Department of Computer Engineering (Smart Security), Gachon University, Seongnam-si 1342, Republic of Korea; (M.J.K.); (S.H.J.)
| | - Woo Jin Park
- Department of Software, Gachon University, Seongnam-si 1342, Republic of Korea;
| | - Seung Ho Jeon
- Department of Computer Engineering (Smart Security), Gachon University, Seongnam-si 1342, Republic of Korea; (M.J.K.); (S.H.J.)
| | - Jung Taek Seo
- Department of Computer Engineering, Gachon University, Seongnam-si 1342, Republic of Korea
| |
Collapse
|
75
|
Behnoush AH, Shariatnia MM, Khalaji A, Asadi M, Yaghoobi A, Rezaee M, Soleimani H, Sheikhy A, Aein A, Yadangi S, Jenab Y, Masoudkabir F, Mehrani M, Iskander M, Hosseini K. Predictive modeling for acute kidney injury after percutaneous coronary intervention in patients with acute coronary syndrome: a machine learning approach. Eur J Med Res 2024; 29:76. [PMID: 38268045 PMCID: PMC10807059 DOI: 10.1186/s40001-024-01675-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2023] [Accepted: 01/15/2024] [Indexed: 01/26/2024] Open
Abstract
BACKGROUND Acute kidney injury (AKI) is one of the preventable complications of percutaneous coronary intervention (PCI). This study aimed to develop machine learning (ML) models to predict AKI after PCI in patients with acute coronary syndrome (ACS). METHODS This study was conducted at Tehran Heart Center from 2015 to 2020. Several variables were used to design five ML models: Naïve Bayes (NB), Logistic Regression (LR), CatBoost (CB), Multi-layer Perception (MLP), and Random Forest (RF). Feature importance was evaluated with the RF model, CB model, and LR coefficients while SHAP beeswarm plots based on the CB model were also used for deriving the importance of variables in the population using pre-procedural variables and all variables. Sensitivity, specificity, and the area under the receiver operating characteristics curve (ROC-AUC) were used as the evaluation measures. RESULTS A total of 4592 patients were included, and 646 (14.1%) experienced AKI. The train data consisted of 3672 and the test data included 920 cases. The patient population had a mean age of 65.6 ± 11.2 years and 73.1% male predominance. Notably, left ventricular ejection fraction (LVEF) and fasting plasma glucose (FPG) had the highest feature importance when training the RF model on only pre-procedural features. SHAP plots for all features demonstrated LVEF and age as the top features. With pre-procedural variables only, CB had the highest AUC for the prediction of AKI (AUC 0.755, 95% CI 0.713 to 0.797), while RF had the highest sensitivity (75.9%) and MLP had the highest specificity (64.35%). However, when considering pre-procedural, procedural, and post-procedural features, RF outperformed other models (AUC: 0.775). In this analysis, CB achieved the highest sensitivity (82.95%) and NB had the highest specificity (82.93%). CONCLUSION Our analyses showed that ML models can predict AKI with acceptable performance. This has potential clinical utility for assessing the individualized risk of AKI in ACS patients undergoing PCI. Additionally, the identified features in the models may aid in mitigating these risk factors.
Collapse
Affiliation(s)
- Amir Hossein Behnoush
- Cardiac Primary Prevention Research Center, Cardiovascular Diseases Research Institute, Tehran University of Medical Sciences, Tehran, Iran
- Tehran Heart Center, Cardiovascular Diseases Research Institute, Tehran University of Medical Sciences, Tehran, Iran
- School of Medicine, Tehran University of Medical Sciences, Tehran, Iran
- Non-Communicable Diseases Research Center, Endocrinology and Metabolism Population Sciences Institute, Tehran University of Medical Sciences, Tehran, Iran
| | - M Moein Shariatnia
- Cardiac Primary Prevention Research Center, Cardiovascular Diseases Research Institute, Tehran University of Medical Sciences, Tehran, Iran
- Tehran Heart Center, Cardiovascular Diseases Research Institute, Tehran University of Medical Sciences, Tehran, Iran
- School of Medicine, Tehran University of Medical Sciences, Tehran, Iran
| | - Amirmohammad Khalaji
- Cardiac Primary Prevention Research Center, Cardiovascular Diseases Research Institute, Tehran University of Medical Sciences, Tehran, Iran
- Tehran Heart Center, Cardiovascular Diseases Research Institute, Tehran University of Medical Sciences, Tehran, Iran
- School of Medicine, Tehran University of Medical Sciences, Tehran, Iran
- Non-Communicable Diseases Research Center, Endocrinology and Metabolism Population Sciences Institute, Tehran University of Medical Sciences, Tehran, Iran
| | - Mahsa Asadi
- Cardiac Primary Prevention Research Center, Cardiovascular Diseases Research Institute, Tehran University of Medical Sciences, Tehran, Iran
- Tehran Heart Center, Cardiovascular Diseases Research Institute, Tehran University of Medical Sciences, Tehran, Iran
| | - Alireza Yaghoobi
- Cardiac Primary Prevention Research Center, Cardiovascular Diseases Research Institute, Tehran University of Medical Sciences, Tehran, Iran
- Tehran Heart Center, Cardiovascular Diseases Research Institute, Tehran University of Medical Sciences, Tehran, Iran
- Department of Pharmacology, School of Medicine, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Malihe Rezaee
- Cardiac Primary Prevention Research Center, Cardiovascular Diseases Research Institute, Tehran University of Medical Sciences, Tehran, Iran
- Tehran Heart Center, Cardiovascular Diseases Research Institute, Tehran University of Medical Sciences, Tehran, Iran
- Non-Communicable Diseases Research Center, Endocrinology and Metabolism Population Sciences Institute, Tehran University of Medical Sciences, Tehran, Iran
- Department of Pharmacology, School of Medicine, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Hamidreza Soleimani
- Cardiac Primary Prevention Research Center, Cardiovascular Diseases Research Institute, Tehran University of Medical Sciences, Tehran, Iran
- Tehran Heart Center, Cardiovascular Diseases Research Institute, Tehran University of Medical Sciences, Tehran, Iran
- Non-Communicable Diseases Research Center, Endocrinology and Metabolism Population Sciences Institute, Tehran University of Medical Sciences, Tehran, Iran
| | - Ali Sheikhy
- Cardiac Primary Prevention Research Center, Cardiovascular Diseases Research Institute, Tehran University of Medical Sciences, Tehran, Iran
- Tehran Heart Center, Cardiovascular Diseases Research Institute, Tehran University of Medical Sciences, Tehran, Iran
- School of Medicine, Tehran University of Medical Sciences, Tehran, Iran
- Non-Communicable Diseases Research Center, Endocrinology and Metabolism Population Sciences Institute, Tehran University of Medical Sciences, Tehran, Iran
| | - Afsaneh Aein
- Cardiac Primary Prevention Research Center, Cardiovascular Diseases Research Institute, Tehran University of Medical Sciences, Tehran, Iran
- Tehran Heart Center, Cardiovascular Diseases Research Institute, Tehran University of Medical Sciences, Tehran, Iran
| | - Somayeh Yadangi
- Cardiac Primary Prevention Research Center, Cardiovascular Diseases Research Institute, Tehran University of Medical Sciences, Tehran, Iran
- Tehran Heart Center, Cardiovascular Diseases Research Institute, Tehran University of Medical Sciences, Tehran, Iran
| | - Yaser Jenab
- Cardiac Primary Prevention Research Center, Cardiovascular Diseases Research Institute, Tehran University of Medical Sciences, Tehran, Iran
- Tehran Heart Center, Cardiovascular Diseases Research Institute, Tehran University of Medical Sciences, Tehran, Iran
| | - Farzad Masoudkabir
- Cardiac Primary Prevention Research Center, Cardiovascular Diseases Research Institute, Tehran University of Medical Sciences, Tehran, Iran
- Tehran Heart Center, Cardiovascular Diseases Research Institute, Tehran University of Medical Sciences, Tehran, Iran
| | - Mehdi Mehrani
- Cardiac Primary Prevention Research Center, Cardiovascular Diseases Research Institute, Tehran University of Medical Sciences, Tehran, Iran
- Tehran Heart Center, Cardiovascular Diseases Research Institute, Tehran University of Medical Sciences, Tehran, Iran
| | - Mina Iskander
- Department of Cardiovascular Medicine, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Kaveh Hosseini
- Cardiac Primary Prevention Research Center, Cardiovascular Diseases Research Institute, Tehran University of Medical Sciences, Tehran, Iran.
- Tehran Heart Center, Cardiovascular Diseases Research Institute, Tehran University of Medical Sciences, Tehran, Iran.
| |
Collapse
|
76
|
Zhai X, Sun P, Yu X, Wang S, Li X, Sun W, Liu X, Tian T, Zhang B. CT-based radiomics signature for differentiating pyelocaliceal upper urinary tract urothelial carcinoma from infiltrative renal cell carcinoma. Front Oncol 2024; 13:1244585. [PMID: 38304033 PMCID: PMC10830825 DOI: 10.3389/fonc.2023.1244585] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2023] [Accepted: 12/22/2023] [Indexed: 02/03/2024] Open
Abstract
Objectives To develop a CT-based radiomics model and a combined model for preoperatively discriminating infiltrative renal cell carcinoma (RCC) and pyelocaliceal upper urinary tract urothelial carcinoma (UTUC), which invades the renal parenchyma. Materials and methods Eighty patients (37 pathologically proven infiltrative RCCs and 43 pathologically proven pyelocaliceal UTUCs) were retrospectively enrolled and randomly divided into a training set (n = 56) and a testing set (n = 24) at a ratio of 7:3. Traditional CT imaging characteristics in the portal venous phase were collected by two radiologists (SPH and ZXL, who have 4 and 30 years of experience in abdominal radiology, respectively). Patient demographics and traditional CT imaging characteristics were used to construct the clinical model. The radiomics score was calculated based on the radiomics features extracted from the portal venous CT images and the random forest (RF) algorithm to construct the radiomics model. The combined model was constructed using the radiomics score and significant clinical factors according to the multivariate logistic regression. The diagnostic efficacy of the models was evaluated using receiver operating characteristic (ROC) curve analysis and the area under the curve (AUC). Results The RF score based on the eight validated features extracted from the portal venous CT images was used to build the radiomics model. Painless hematuria as an independent risk factor was used to build the clinical model. The combined model was constructed using the RF score and the selected clinical factor. Both the radiomics model and combined model showed higher efficacy in differentiating infiltrative RCC and pyelocaliceal UTUC in the training and testing cohorts with AUC values of 0.95 and 0.90, respectively, for the radiomics model and 0.99 and 0.90, respectively, for the combined model. The decision curves of the combined model as well as the radiomics model indicated an overall net benefit over the clinical model. Both the radiomics model and the combined model achieved a notable reduction in false-positive and false-negativerates, resulting in significantly higher accuracy compared to the visual assessments in both the training and testing cohorts. Conclusion The radiomics model and combined model had the potential to accurately differentiate infiltrative RCC and pyelocaliceal UTUC, which invades the renal parenchyma, and provide a new potentially non-invasive method to guide surgery strategies.
Collapse
Affiliation(s)
- Xiaoli Zhai
- Department of Radiology, Beijing Chaoyang Hospital, Capital Medical University, Beijing, China
| | - Penghui Sun
- Department of Radiology, Beijing Chaoyang Hospital, Capital Medical University, Beijing, China
| | - Xianbo Yu
- CT Collaboration, Siemens Healthineers Ltd., Beijing, China
| | - Shuangkun Wang
- Department of Radiology, Beijing Chaoyang Hospital, Capital Medical University, Beijing, China
| | - Xue Li
- Department of Pathology, Beijing Chaoyang Hospital, Capital Medical University, Beijing, China
| | - Weiqian Sun
- Huiying Medical Technology (Beijing) Co., Ltd., Beijing, China
| | - Xin Liu
- Department of Radiology, Beijing Chaoyang Hospital, Capital Medical University, Beijing, China
| | - Tian Tian
- Department of Radiology, Beijing Chaoyang Hospital, Capital Medical University, Beijing, China
| | - Bowen Zhang
- Department of Radiology, Beijing Chaoyang Hospital, Capital Medical University, Beijing, China
| |
Collapse
|
77
|
Rasaizadi A, Hafizi F, Seyedabrishami S. Dimensions management of traffic big data for short-term traffic prediction on suburban roadways. Sci Rep 2024; 14:1484. [PMID: 38233666 PMCID: PMC10794253 DOI: 10.1038/s41598-024-51988-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2023] [Accepted: 01/11/2024] [Indexed: 01/19/2024] Open
Abstract
Since intelligent systems were developed to collect traffic data, this data can be collected at high volume, velocity, and variety, resulting in big traffic data. In previous studies, dealing with the large volume of big traffic data has always been discussed. In this study, big traffic data were used to predict traffic state on a section of suburban road from Karaj to Chalous located in the north of Iran. Due to the many and various extracted features, data dimensions management is necessary. This management was accomplished using principal component analysis to reduce the number of features, genetic algorithms to select features influencing traffic states, and cyclic features to change the nature of features. The data set obtained from each method is used as input to the models. The models used include long short-term memory, support vector machine, and random forest. The results show that using cyclic features can increase traffic state prediction's accuracy than the model, including all the initial features (base model). Long short-term memory model with 71 cyclic features offers the highest accuracy, equivalent to 88.09%. Additionally, this model's reduced number of features led to a shorter modelling execution time, from 456 s (base model) to 201 s.
Collapse
Affiliation(s)
- Arash Rasaizadi
- Faculty of Civil and Environmental Engineering, Tarbiat Modares University, Tehran, Iran
| | - Fateme Hafizi
- Faculty of Civil and Environmental Engineering, Tarbiat Modares University, Tehran, Iran
| | | |
Collapse
|
78
|
Li R, Wei D, Wang Z. Synergizing Machine Learning Algorithm with Triboelectric Nanogenerators for Advanced Self-Powered Sensing Systems. NANOMATERIALS (BASEL, SWITZERLAND) 2024; 14:165. [PMID: 38251130 PMCID: PMC10819602 DOI: 10.3390/nano14020165] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/01/2023] [Revised: 12/25/2023] [Accepted: 01/07/2024] [Indexed: 01/23/2024]
Abstract
The advancement of the Internet of Things (IoT) has increased the demand for large-scale intelligent sensing systems. The periodic replacement of power sources for ubiquitous sensing systems leads to significant resource waste and environmental pollution. Human staffing costs associated with replacement also increase the economic burden. The triboelectric nanogenerators (TENGs) provide both an energy harvesting scheme and the possibility of self-powered sensing. Based on contact electrification from different materials, TENGs provide a rich material selection to collect complex and diverse data. As the data collected by TENGs become increasingly numerous and complex, different approaches to machine learning (ML) and deep learning (DL) algorithms have been proposed to efficiently process output signals. In this paper, the latest advances in ML algorithms assisting solid-solid TENG and liquid-solid TENG sensors are reviewed based on the sample size and complexity of the data. The pros and cons of various algorithms are analyzed and application scenarios of various TENG sensing systems are presented. The prospects of synergizing hardware (TENG sensors) with software (ML algorithms) in a complex environment and their main challenges for future developments are discussed.
Collapse
Affiliation(s)
- Roujuan Li
- Beijing Institute of Nanoenergy and Nanosystems, Chinese Academy of Sciences, Beijing 101400, China;
- School of Nanoscience and Engineering, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Di Wei
- Beijing Institute of Nanoenergy and Nanosystems, Chinese Academy of Sciences, Beijing 101400, China;
| | - Zhonglin Wang
- Beijing Institute of Nanoenergy and Nanosystems, Chinese Academy of Sciences, Beijing 101400, China;
- School of Materials Science and Engineering, Georgia Institute of Technology, Atlanta, GA 30332-0245, USA
| |
Collapse
|
79
|
Naderi K, Kalami Yazdi MS, Jafarabadi H, Bahmanzadegan F, Ghaemi A, Mosavi MR. Modeling based on machine learning to investigate flue gas desulfurization performance by calcium silicate absorbent in a sand bed reactor. Sci Rep 2024; 14:954. [PMID: 38200150 PMCID: PMC10781758 DOI: 10.1038/s41598-024-51586-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2023] [Accepted: 01/07/2024] [Indexed: 01/12/2024] Open
Abstract
Flue gas desulfurization (FGD) is a critical process for reducing sulfur dioxide (SO2) emissions from industrial sources, particularly power plants. This research uses calcium silicate absorbent in combination with machine learning (ML) to predict SO2 concentration within an FGD process. The collected dataset encompasses four input parameters, specifically relative humidity, absorbent weight, temperature, and time, and incorporates one output parameter, which pertains to the concentration of SO2. Six ML models were developed to estimate the output parameters. Statistical metrics such as the coefficient of determination (R2) and mean squared error (MSE) were employed to identify the most suitable model and assess its fitting effectiveness. The random forest (RF) model emerged as the top-performing model, boasting an R2 of 0.9902 and an MSE of 0.0008. The model's predictions aligned closely with experimental results, confirming its high accuracy. The most suitable hyperparameter values for RF model were found to be 74 for n_estimators, 41 for max_depth, false for bootstrap, sqrt for max_features, 1 for min_samples_leaf, absolute_error for criterion, and 3 for min_samples_split. Three-dimensional surface plots were generated to explore the impact of input variables on SO2 concentration. Global sensitivity analysis (GSA) revealed absorbent weight and time significantly influence SO2 concentration. The integration of ML into FGD modeling offers a novel approach to optimizing the efficiency and effectiveness of this environmentally crucial process.
Collapse
Affiliation(s)
- Kamyar Naderi
- School of Chemical, Petroleum and Gas Engineering, Iran University of Science and Technology, Narmak, Tehran, 16846, Iran
| | - Mohammad Sadegh Kalami Yazdi
- Department of Electrical Engineering, Iran University of Science and Technology, Narmak, Tehran, 16846-13114, Iran
| | - Hanieh Jafarabadi
- School of Chemical, Petroleum and Gas Engineering, Iran University of Science and Technology, Narmak, Tehran, 16846, Iran
| | - Fatemeh Bahmanzadegan
- School of Chemical, Petroleum and Gas Engineering, Iran University of Science and Technology, Narmak, Tehran, 16846, Iran
| | - Ahad Ghaemi
- School of Chemical, Petroleum and Gas Engineering, Iran University of Science and Technology, Narmak, Tehran, 16846, Iran.
| | - Mohammad Reza Mosavi
- Department of Electrical Engineering, Iran University of Science and Technology, Narmak, Tehran, 16846-13114, Iran
| |
Collapse
|
80
|
Farahat IS, Sharafeldeen A, Ghazal M, Alghamdi NS, Mahmoud A, Connelly J, van Bogaert E, Zia H, Tahtouh T, Aladrousy W, Tolba AE, Elmougy S, El-Baz A. An AI-based novel system for predicting respiratory support in COVID-19 patients through CT imaging analysis. Sci Rep 2024; 14:851. [PMID: 38191606 PMCID: PMC10774502 DOI: 10.1038/s41598-023-51053-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2023] [Accepted: 12/29/2023] [Indexed: 01/10/2024] Open
Abstract
The proposed AI-based diagnostic system aims to predict the respiratory support required for COVID-19 patients by analyzing the correlation between COVID-19 lesions and the level of respiratory support provided to the patients. Computed tomography (CT) imaging will be used to analyze the three levels of respiratory support received by the patient: Level 0 (minimum support), Level 1 (non-invasive support such as soft oxygen), and Level 2 (invasive support such as mechanical ventilation). The system will begin by segmenting the COVID-19 lesions from the CT images and creating an appearance model for each lesion using a 2D, rotation-invariant, Markov-Gibbs random field (MGRF) model. Three MGRF-based models will be created, one for each level of respiratory support. This suggests that the system will be able to differentiate between different levels of severity in COVID-19 patients. The system will decide for each patient using a neural network-based fusion system, which combines the estimates of the Gibbs energy from the three MGRF-based models. The proposed system were assessed using 307 COVID-19-infected patients, achieving an accuracy of [Formula: see text], a sensitivity of [Formula: see text], and a specificity of [Formula: see text], indicating a high level of prediction accuracy.
Collapse
Affiliation(s)
- Ibrahim Shawky Farahat
- Department of Computer Science, Faculty of Computers and Information, Mansoura University, Mansoura, Egypt
| | | | - Mohammed Ghazal
- Electrical, Computer and Biomedical Engineering Department, Abu Dhabi University, Abu Dhabi, UAE
| | - Norah Saleh Alghamdi
- Department of Computer Sciences, College of Computer and Information Sciences, Princess Nourah Bint Abdulrahman University, Riyadh, Saudi Arabia
| | - Ali Mahmoud
- Department of Bioengineering, University of Louisville, Louisville, USA
| | - James Connelly
- Department of Radiology, University of Louisville, Louisville, USA
| | - Eric van Bogaert
- Department of Radiology, University of Louisville, Louisville, USA
| | - Huma Zia
- Electrical, Computer and Biomedical Engineering Department, Abu Dhabi University, Abu Dhabi, UAE
| | - Tania Tahtouh
- College of Health Sciences, Abu Dhabi University, Abu Dhabi, UAE
| | - Waleed Aladrousy
- Department of Computer Science, Faculty of Computers and Information, Mansoura University, Mansoura, Egypt
| | - Ahmed Elsaid Tolba
- Department of Computer Science, Faculty of Computers and Information, Mansoura University, Mansoura, Egypt
- The Higher Institute of Engineering and Automotive Technology and Energy, Kafr El Sheikh, Egypt
| | - Samir Elmougy
- Department of Computer Science, Faculty of Computers and Information, Mansoura University, Mansoura, Egypt
| | - Ayman El-Baz
- Department of Bioengineering, University of Louisville, Louisville, USA.
| |
Collapse
|
81
|
Zheng X, Pan F, Naumovski N, Wei Y, Wu L, Peng W, Wang K. Precise prediction of metabolites patterns using machine learning approaches in distinguishing honey and sugar diets fed to mice. Food Chem 2024; 430:136915. [PMID: 37515908 DOI: 10.1016/j.foodchem.2023.136915] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2023] [Revised: 07/13/2023] [Accepted: 07/15/2023] [Indexed: 07/31/2023]
Abstract
As a natural sweetener produced by honey bees, honey was recognized as being healthier for consumption than table sugar. Our previous study also indicated thatmetaboliteprofiles in mice fed honey and mixedsugardiets aredifferent. However, it is still noteworthy about the batch-to-batch consistency of the metabolic differences between two diet types. Here, the machine learning (ML) algorithms were applied to complement and calibrate HPLC-QTOF/MS-based untargeted metabolomics data. Data were generated from three batches of mice that had the same treatment, which can further mine the metabolite biomarkers. Random Forest and Extra-Trees models could better discriminate between honey and mixed sugar dietary patterns under five-fold cross-validation. Finally, SHapley Additive exPlanations tool identified phosphatidylethanolamine and phosphatidylcholine as reliable metabolic biomarkers to discriminate the honey diet from the mixed sugar diet. This study provides us new ideas for metabolomic analysis of larger data sets.
Collapse
Affiliation(s)
- Xing Zheng
- State Key Laboratory of Resource Insects, Institute of Apiculture Research, Chinese Academy of Agricultural Sciences, Beijing 100093, China
| | - Fei Pan
- State Key Laboratory of Resource Insects, Institute of Apiculture Research, Chinese Academy of Agricultural Sciences, Beijing 100093, China
| | - Nenad Naumovski
- University of Canberra Health Research Institute (UCHRI), University of Canberra, Locked Bag 1, Bruce, Canberra, ACT 2601, Australia
| | - Yue Wei
- College of Science & Technology, Hebei Agricultural University, Huanghua, Hebei 061100, China
| | - Liming Wu
- State Key Laboratory of Resource Insects, Institute of Apiculture Research, Chinese Academy of Agricultural Sciences, Beijing 100093, China
| | - Wenjun Peng
- State Key Laboratory of Resource Insects, Institute of Apiculture Research, Chinese Academy of Agricultural Sciences, Beijing 100093, China.
| | - Kai Wang
- State Key Laboratory of Resource Insects, Institute of Apiculture Research, Chinese Academy of Agricultural Sciences, Beijing 100093, China.
| |
Collapse
|
82
|
Kazemian S, Issaiy M, Hosseini K. Challenges in developing and validating machine learning models for transcatheter aortic valve implantation mortality risk prediction. EUROPEAN HEART JOURNAL. DIGITAL HEALTH 2024; 5:1-2. [PMID: 38264706 PMCID: PMC10802814 DOI: 10.1093/ehjdh/ztad059] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 01/25/2024]
Affiliation(s)
- Sina Kazemian
- Cardiac Primary Prevention Research Center, Cardiovascular Diseases Research Institute, Tehran University of Medical Sciences, Tehran Heart Center, Kargar St. Jalal al-Ahmad Cross, 1411713138, Tehran, Iran
- Tehran Heart Center, Cardiovascular Diseases Research Institute, Tehran University of Medical Sciences, Imam Khomeini Hospital Complex, Tohid Square, 1419733141, Tehran, Iran
| | - Mahbod Issaiy
- Advanced Diagnostic and Interventional Radiology Research Center (ADHR), Tehran University of Medical Sciences, Tehran, Iran
| | - Kaveh Hosseini
- Cardiac Primary Prevention Research Center, Cardiovascular Diseases Research Institute, Tehran University of Medical Sciences, Tehran Heart Center, Kargar St. Jalal al-Ahmad Cross, 1411713138, Tehran, Iran
- Tehran Heart Center, Cardiovascular Diseases Research Institute, Tehran University of Medical Sciences, Imam Khomeini Hospital Complex, Tohid Square, 1419733141, Tehran, Iran
| |
Collapse
|
83
|
Yu Y, Zhang Z, Xia F, Sun B, Liu S, Wang X, Zhou X, Zhao J. Exploration of the pathophysiology of high myopia via proteomic profiling of human corneal stromal lenticules. Exp Eye Res 2024; 238:109726. [PMID: 37979904 DOI: 10.1016/j.exer.2023.109726] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2023] [Revised: 11/08/2023] [Accepted: 11/10/2023] [Indexed: 11/20/2023]
Abstract
This study aimed to investigate the underlying pathophysiology of high myopia by analyzing the proteome of human corneal stromal lenticule samples obtained through small incision lenticule extraction (SMILE). A total of thirty-two patients who underwent SMILE were included in the study. Label-free quantitative proteomic analysis was performed on corneal stromal lenticule samples, equally representing high myopia (n = 10) and low myopia (n = 10) groups. The identified and profiled lenticule proteomes were analyzed using in silico tools to explore biological characteristics of differentially expressed proteins (DEPs). Additionally, LASSO regression and random forest model were employed to identify key proteins associated with the pathophysiology of high myopia. The DEPs were found to be closely linked to immune activation, extracellular matrix, and cell adhesion-related pathways according to gene ontology analysis. Specifically, decreased expression of COL1A1 and increased expression of CDH11 were associated with the pathogenesis of high myopia and validated by western blotting (n = 6) and quantitative real time polymerase chain reaction (n = 6). Overall, this study provides evidence that COL1A1 and CDH11 may contribute to the pathophysiology of high myopia based on comparative proteomic profiling of human corneal stromal lenticules obtained through SMILE.
Collapse
Affiliation(s)
- Yanze Yu
- Department of Ophthalmology and Vision Science, Eye and ENT Hospital, Fudan University, Shanghai, China; NHC Key Laboratory of Myopia (Fudan University), Shanghai, China; Laboratory of Myopia, Chinese Academy of Medical Sciences, Shanghai, China; Fudan University Shanghai Medical College, Shanghai 200032, China
| | - Zhe Zhang
- Department of Ophthalmology and Vision Science, Eye and ENT Hospital, Fudan University, Shanghai, China; NHC Key Laboratory of Myopia (Fudan University), Shanghai, China; Laboratory of Myopia, Chinese Academy of Medical Sciences, Shanghai, China
| | - Fei Xia
- Department of Ophthalmology and Vision Science, Eye and ENT Hospital, Fudan University, Shanghai, China; NHC Key Laboratory of Myopia (Fudan University), Shanghai, China; Laboratory of Myopia, Chinese Academy of Medical Sciences, Shanghai, China
| | - Bingqing Sun
- Department of Ophthalmology and Vision Science, Eye and ENT Hospital, Fudan University, Shanghai, China; NHC Key Laboratory of Myopia (Fudan University), Shanghai, China; Laboratory of Myopia, Chinese Academy of Medical Sciences, Shanghai, China
| | - Shengtao Liu
- Department of Ophthalmology and Vision Science, Eye and ENT Hospital, Fudan University, Shanghai, China; NHC Key Laboratory of Myopia (Fudan University), Shanghai, China; Laboratory of Myopia, Chinese Academy of Medical Sciences, Shanghai, China
| | - Xiaoying Wang
- Department of Ophthalmology and Vision Science, Eye and ENT Hospital, Fudan University, Shanghai, China; NHC Key Laboratory of Myopia (Fudan University), Shanghai, China; Laboratory of Myopia, Chinese Academy of Medical Sciences, Shanghai, China
| | - Xingtao Zhou
- Department of Ophthalmology and Vision Science, Eye and ENT Hospital, Fudan University, Shanghai, China; NHC Key Laboratory of Myopia (Fudan University), Shanghai, China; Laboratory of Myopia, Chinese Academy of Medical Sciences, Shanghai, China.
| | - Jing Zhao
- Department of Ophthalmology and Vision Science, Eye and ENT Hospital, Fudan University, Shanghai, China; NHC Key Laboratory of Myopia (Fudan University), Shanghai, China; Laboratory of Myopia, Chinese Academy of Medical Sciences, Shanghai, China.
| |
Collapse
|
84
|
Mądziel M. Instantaneous CO 2 emission modelling for a Euro 6 start-stop vehicle based on portable emission measurement system data and artificial intelligence methods. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2024; 31:6944-6959. [PMID: 38155311 PMCID: PMC11294266 DOI: 10.1007/s11356-023-31022-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/10/2023] [Accepted: 11/07/2023] [Indexed: 12/30/2023]
Abstract
One of the increasingly common methods to counteract the increased fuel consumption of vehicles is start-stop technology. This paper introduces a methodology which presents the process of measuring and creating a computational model of CO2 emissions using artificial intelligence techniques for a vehicle equipped with start-stop technology. The method requires only measurement data of velocity, acceleration of vehicle, and gradient of road to predict the emission of CO2. In this paper, three methods of machine learning techniques were analyzed, while the best prediction results are shown by the gradient boosting method. For the developed models, the results were validated using the coefficient of determination, the mean squared error, and based on visual evaluation of residual and instantaneous emission plots and CO2 emission maps. The developed models present a novel methodology and can be used for microscale environmental analysis.
Collapse
Affiliation(s)
- Maksymilian Mądziel
- Faculty of Mechanical Engineering and Aeronautics, Rzeszow University of Technology, 35-959, Rzeszow, Poland.
| |
Collapse
|
85
|
Wan C, Yu S, Dang P, Gao L, Ge J, Li Y, Yang H, Yang P, Feng B, Gao J. Nitrogen regulates the synthesis of hydrophobic amino acids to improve protein structural and gel properties in common buckwheat. Int J Biol Macromol 2023; 253:126871. [PMID: 37716662 DOI: 10.1016/j.ijbiomac.2023.126871] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2023] [Revised: 08/31/2023] [Accepted: 09/10/2023] [Indexed: 09/18/2023]
Abstract
Nitrogen (N) fertilizer impacts the grain quality of common buckwheat, but the effects and regulatory mechanisms of N on various protein parameters of buckwheat are not fully understood. The purpose of this study was to investigate the particle morphology, structural and gel properties, and regulation mechanism of buckwheat protein under four N levels. The bulk density, surface hydrophobicity, particle size, and thermal properties of the buckwheat protein were maximized through the optimal N application (180 kg N/ha), further enhancing the thermal stability of the protein. N application increased the β-sheet content and reduced the random coil content. Appropriate N fertilizer input enhanced the tertiary structure stability and gel elasticity of buckwheat protein by promoting hydrophobic interactions, disulfide bonds, ionic bonds, storage modulus and loss modulus. The differentially expressed proteins induced by N are primarily enriched in small ribosomal subunit and ribosome, improving protein quality mainly by promoting the synthesis of hydrophobic amino acids. Future agriculture should pay attention to the hydrophobic amino acid content of buckwheat to effectively improve protein quality. This study further advances the application of buckwheat protein in the field of food processing and provides a theoretical basis for the extensive development and utilization of buckwheat protein.
Collapse
Affiliation(s)
- Chenxi Wan
- State Key Laboratory of Crop Stress Biology for Arid Areas, College of Agronomy, Northwest A & F University, Yangling, Shaanxi Province 712100, China.
| | - Shaopeng Yu
- State Key Laboratory of Crop Stress Biology for Arid Areas, College of Agronomy, Northwest A & F University, Yangling, Shaanxi Province 712100, China
| | - Pengfei Dang
- State Key Laboratory of Crop Stress Biology for Arid Areas, College of Agronomy, Northwest A & F University, Yangling, Shaanxi Province 712100, China
| | - Licheng Gao
- State Key Laboratory of Crop Stress Biology for Arid Areas, College of Agronomy, Northwest A & F University, Yangling, Shaanxi Province 712100, China; Faculty of Bioscience Engineering, Ghent University, Valentin Vaerwyckweg 1, 9000 Gent, Belgium
| | - Jiahao Ge
- State Key Laboratory of Crop Stress Biology for Arid Areas, College of Agronomy, Northwest A & F University, Yangling, Shaanxi Province 712100, China
| | - Yaxin Li
- State Key Laboratory of Crop Stress Biology for Arid Areas, College of Agronomy, Northwest A & F University, Yangling, Shaanxi Province 712100, China
| | - Hao Yang
- State Key Laboratory of Crop Stress Biology for Arid Areas, College of Agronomy, Northwest A & F University, Yangling, Shaanxi Province 712100, China
| | - Pu Yang
- State Key Laboratory of Crop Stress Biology for Arid Areas, College of Agronomy, Northwest A & F University, Yangling, Shaanxi Province 712100, China
| | - Baili Feng
- State Key Laboratory of Crop Stress Biology for Arid Areas, College of Agronomy, Northwest A & F University, Yangling, Shaanxi Province 712100, China
| | - Jinfeng Gao
- State Key Laboratory of Crop Stress Biology for Arid Areas, College of Agronomy, Northwest A & F University, Yangling, Shaanxi Province 712100, China.
| |
Collapse
|
86
|
Mahlknecht J, Torres-Martínez JA, Kumar M, Mora A, Kaown D, Loge FJ. Nitrate prediction in groundwater of data scarce regions: The futuristic fresh-water management outlook. THE SCIENCE OF THE TOTAL ENVIRONMENT 2023; 905:166863. [PMID: 37690767 DOI: 10.1016/j.scitotenv.2023.166863] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/28/2023] [Revised: 08/28/2023] [Accepted: 09/03/2023] [Indexed: 09/12/2023]
Abstract
Nitrate contamination in groundwater poses a significant threat to water quality and public health, especially in regions with limited data availability. This study addresses this challenge by employing machine learning (ML) techniques to predict nitrate (NO3--N) concentrations in Mexico's groundwater. Four ML algorithms-Extreme Gradient Boosting (XGB), Boosted Regression Trees (BRT), Random Forest (RF), and Support Vector Machines (SVM)-were executed to model NO3--N concentrations across the country. Despite data limitations, the ML models achieved robust predictive performances. XGB and BRT algorithms demonstrated superior accuracy (0.80 and 0.78, respectively). Notably, this was achieved using ∼10 times less information than previous large-scale assessments. The novelty lies in the first-ever implementation of the 'Support Points-based Split Approach' during data pre-processing. The models considered initially 68 covariates and identified 13-19 significant predictors of NO3--N concentration spanning from climate, geomorphology, soil, hydrogeology, and human factors. Rainfall, elevation, and slope emerged as key predictors. A validation incorporated nationwide waste disposal sites, yielding an encouraging correlation. Spatial risk mapping unveiled significant pollution hotspots across Mexico. Regions with elevated NO3--N concentrations (>10 mg/L) were identified, particularly in the north-central and northeast parts of the country, associated with agricultural and industrial activities. Approximately 21 million people, accounting for 10 % of Mexico's population, are potentially exposed to elevated NO3--N levels in groundwater. Moreover, the NO3--N hotspots align with reported NO3--N health implications such as gastric and colorectal cancer. This study not only demonstrates the potential of ML in data-scarce regions but also offers actionable insights for policy and management strategies. Our research underscores the urgency of implementing sustainable agricultural practices and comprehensive domestic waste management measures to mitigate NO3--N contamination. Moreover, it advocates for the establishment of effective policies based on real-time monitoring and collaboration among stakeholders.
Collapse
Affiliation(s)
- Jürgen Mahlknecht
- Escuela de Ingeniería y Ciencias, Tecnologico de Monterrey, Campus Monterrey, Eugenio Garza Sada 2501, Monterrey, NL 64849, Mexico
| | - Juan Antonio Torres-Martínez
- Escuela de Ingeniería y Ciencias, Tecnologico de Monterrey, Campus Monterrey, Eugenio Garza Sada 2501, Monterrey, NL 64849, Mexico.
| | - Manish Kumar
- Escuela de Ingeniería y Ciencias, Tecnologico de Monterrey, Campus Monterrey, Eugenio Garza Sada 2501, Monterrey, NL 64849, Mexico; Sustainability Cluster, School of Advanced Engineering, UPES, Dehradun, Uttarakhand 248007, India
| | - Abrahan Mora
- Escuela de Ingeniería y Ciencias, Tecnologico de Monterrey, Campus Puebla, Atlixcáyotl 5718, Puebla de Zaragoza, Puebla 72453, Mexico
| | - Dugin Kaown
- School of Earth and Environmental Sciences, Seoul National University, 1 Gwanak-ro, Gwanak-gu, Seoul 08826, South Korea
| | - Frank J Loge
- Escuela de Ingeniería y Ciencias, Tecnologico de Monterrey, Campus Monterrey, Eugenio Garza Sada 2501, Monterrey, NL 64849, Mexico; Department of Civil and Environmental Engineering, University of California Davis, One Shields Avenue, Davis, CA 95616, USA
| |
Collapse
|
87
|
Gale MG, Cary GJ, van Dijk AIJM, Yebra M. Untangling fuel, weather and management effects on fire severity: Insights from large-sample LiDAR remote sensing analysis of conditions preceding the 2019-20 Australian wildfires. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2023; 348:119474. [PMID: 37925987 DOI: 10.1016/j.jenvman.2023.119474] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/31/2023] [Revised: 09/28/2023] [Accepted: 10/18/2023] [Indexed: 11/07/2023]
Abstract
Evaluation of fire severity reduction strategies requires the quantification of intervention outcomes and, more broadly, the extent to which fuel characteristics affect fire severity. However, investigations are currently limited by the availability of accurate data on fire severity predictors, particularly relating to fuel. Here, we used airborne LiDAR data collected before the 2019-20 Australian Black Summer fires to investigate the contribution of fuel structure to fire severity under a range of weather conditions. Fire severity was estimated using the Relative Burn Ratio calculated from Sentinel-2 optical remote sensing imagery. We modelled the effects of various fuel structure estimates and other environmental predictors using Random Forest models. In addition to variables estimated at each observation point, we investigated the influence of surrounding landscape characteristics using an innovative method to estimate fireline progression direction. Our models explained 63-76% of fire severity variance using parsimonious predictor sets. Fuel cover in the understorey and canopy, and vertical vegetation heterogeneity, were positively associated with fire severity. Up-fire burnt area and recent planned and unplanned fire reduced fire severity, whereby unplanned fire provided a longer-lasting reduction of fire severity (up to 15 years) than planned fire (up to 10 years). Although fuel structure and land management effects were important predictors, weather and canopy height effects were dominant. By mapping continuous interactions between weather and fuel-related variables, we found strong evidence of diminishing fuel effects below 20-40% relative air humidity. While our findings suggest that land management interventions can provide meaningful fire severity reduction, they also highlight the risk of warmer and drier future climates constraining these advantages.
Collapse
Affiliation(s)
- Matthew G Gale
- Fenner School of Environment & Society, The Australian National University, Canberra, ACT, 2601, Australia.
| | - Geoffrey J Cary
- Fenner School of Environment & Society, The Australian National University, Canberra, ACT, 2601, Australia
| | - Albert I J M van Dijk
- Fenner School of Environment & Society, The Australian National University, Canberra, ACT, 2601, Australia
| | - Marta Yebra
- Fenner School of Environment & Society, The Australian National University, Canberra, ACT, 2601, Australia; School of Engineering, The Australian National University, Canberra, ACT, 2601, Australia
| |
Collapse
|
88
|
Su Y, Li Y, Chen W, Yang W, Qin J, Liu L. Automated machine learning-based model for predicting benign anastomotic strictures in patients with rectal cancer who have received anterior resection. EUROPEAN JOURNAL OF SURGICAL ONCOLOGY 2023; 49:107113. [PMID: 37857102 DOI: 10.1016/j.ejso.2023.107113] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2023] [Revised: 09/26/2023] [Accepted: 10/08/2023] [Indexed: 10/21/2023]
Abstract
BACKGROUND Benign anastomotic strictures (BAS) significantly impact patients' quality of life and long-term prognosis. However, the current clinical practice lacks accurate tools for predicting BAS. This study aimed to develop a machine-learning model to predict BAS in patients with rectal cancer who have undergone anterior resection. METHODS Data from 1973 patients who underwent anterior resection for rectal cancer were collected. Multiple machine learning classification models were integrated to analyze the data and identify the optimal model. Model performance was evaluated using receiver operator characteristic (ROC) curves, decision curve analysis (DCA), and calibration curves. The Shapley Additive exPlanation (SHAP) algorithm was utilized to assess the impact of various clinical characteristics on the optimal model to enhance the interpretability of the model results. RESULTS A total of 10 clinical features were considered in constructing the machine learning model. The model evaluation results indicated that the random forest (RF)model was optimal, with the area under the test set curve (AUC: 0.888, 95% CI: 0.810-0.965), accuracy: 0.792, sensitivity: 0.846, specificity: 0.791. The SHAP algorithm analysis identified prophylactic ileostomy, operative time, and anastomotic leakage as significant contributing factors influencing the predictions of the RF model. CONCLUSION We developed a robust machine-learning model and user-friendly online prediction tool for predicting BAS following anterior resection of rectal cancer. This tool offers a potential foundation for BAS prevention and aids clinical practice by enabling more efficient disease management and precise medical interventions.
Collapse
Affiliation(s)
- Yang Su
- Department of Gastrointestinal Surgery Center, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430030, China; Molecular Medicine Center, Tongi Hospita, Tongi Medical College, Huazhong University of Science and Technology, 430030, Wuhan, China.
| | - Yanqi Li
- Department of Gastrointestinal Surgery Center, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430030, China; Molecular Medicine Center, Tongi Hospita, Tongi Medical College, Huazhong University of Science and Technology, 430030, Wuhan, China.
| | - Wenshu Chen
- School of Computer Science(National Pilot Software Engineering School), Beijing University of Posts and Telecommunication, 100876, Beijing, China.
| | - Wangshuo Yang
- Department of Gastrointestinal Surgery Center, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430030, China; Molecular Medicine Center, Tongi Hospita, Tongi Medical College, Huazhong University of Science and Technology, 430030, Wuhan, China.
| | - Jichao Qin
- Department of Gastrointestinal Surgery Center, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430030, China; Molecular Medicine Center, Tongi Hospita, Tongi Medical College, Huazhong University of Science and Technology, 430030, Wuhan, China.
| | - Lu Liu
- Department of Gastrointestinal Surgery Center, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430030, China; Molecular Medicine Center, Tongi Hospita, Tongi Medical College, Huazhong University of Science and Technology, 430030, Wuhan, China.
| |
Collapse
|
89
|
Lu W, Yu G, Li Y, Yin C, Long J, Chen X, Chen Y, Zheng Z, Lai Y, Zhou X, Xu D. Identifying prognostic biomarker related to immune infiltration in acute myeloid leukemia. Clin Exp Med 2023; 23:4553-4562. [PMID: 37561221 DOI: 10.1007/s10238-023-01164-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2023] [Accepted: 08/02/2023] [Indexed: 08/11/2023]
Abstract
The immune cells of tumor microenvironment (TME) constitute a vital element of the tumor tissue. There is increasing evidence for their clinical significance in predicting prognosis and therapeutic outcomes. However, the TME immune cell infiltrating pattern of the bone marrow in acute myeloid leukemia (AML) patients remains unclear. Here, RNA-sequencing results of AML patients from TCGA database were used to quantify the abundance of 28 types of immune cells in the TME using the single-sample gene set enrichment analysis algorithm. We comprehensively evaluated the immune infiltration status in the TCGA-LAML cohort and defined two immunophenotypes: the immune hot and immune cold subtypes. Additionally, we constructed a TME score reflecting the immune infiltrating pattern of the patients using Cox regression algorithm. Subtypes with high TME score were characterized by over-activation of immune inflammation-related pathways, release of inflammatory factors, T-cell dysfunction, and poor prognosis. Subtypes with a low TME score were characterized by relatively low immune infiltration and immune exclusion. Our analysis indicated that patients in the low TME score group were more sensitive to chemotherapeutic drugs, and those in high TME score were more likely to respond to immunotherapy. Our study provides a new direction to evaluate anti-tumor therapy from immune infiltration of the TME, and the individualized scoring system in this study has important clinical significance in identifying patients who respond to immunotherapy.
Collapse
Affiliation(s)
- Weixiang Lu
- Department of Hematology, Nanfang Hospital, Southern Medical University, No.1838 Guangzhou Avenue North, Guangzhou, 510515, China
| | - Guopan Yu
- Department of Hematology, Nanfang Hospital, Southern Medical University, No.1838 Guangzhou Avenue North, Guangzhou, 510515, China
| | - Yanlin Li
- Department of Hematology, Nanfang Hospital, Southern Medical University, No.1838 Guangzhou Avenue North, Guangzhou, 510515, China
| | - Changxin Yin
- Department of Hematology, Nanfang Hospital, Southern Medical University, No.1838 Guangzhou Avenue North, Guangzhou, 510515, China
| | - Jiaxin Long
- Department of Hematology, Nanfang Hospital, Southern Medical University, No.1838 Guangzhou Avenue North, Guangzhou, 510515, China
| | - Xiaofan Chen
- Department of Hematology, Nanfang Hospital, Southern Medical University, No.1838 Guangzhou Avenue North, Guangzhou, 510515, China
| | - Yanxiao Chen
- Department of Hematology, Nanfang Hospital, Southern Medical University, No.1838 Guangzhou Avenue North, Guangzhou, 510515, China
| | - Zhongxin Zheng
- Department of Hematology, Nanfang Hospital, Southern Medical University, No.1838 Guangzhou Avenue North, Guangzhou, 510515, China
| | - Yujie Lai
- The First Clinical Medical College, Guangdong Medical University, Zhanjiang, 524000, China
| | - Xinyu Zhou
- The First Clinical Medical College, Guangdong Medical University, Zhanjiang, 524000, China
| | - Dan Xu
- Department of Hematology, Nanfang Hospital, Southern Medical University, No.1838 Guangzhou Avenue North, Guangzhou, 510515, China.
| |
Collapse
|
90
|
Li L, Cui X, Yang J, Wu X, Zhao G. Using feature optimization and LightGBM algorithm to predict the clinical pregnancy outcomes after in vitro fertilization. Front Endocrinol (Lausanne) 2023; 14:1305473. [PMID: 38093967 PMCID: PMC10716466 DOI: 10.3389/fendo.2023.1305473] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/01/2023] [Accepted: 11/09/2023] [Indexed: 12/18/2023] Open
Abstract
Background According to a recent report by the WHO, approximately 17.5\% (about one-sixth) of the global adult population is affected by infertility. Consequently, researchers worldwide have proposed various machine learning models to improve the prediction of clinical pregnancy outcomes during IVF cycles. The objective of this study is to develop a machine learning(ML) model that predicts the outcomes of pregnancies following in vitro fertilization (IVF) and assists in clinical treatment. Methods This study conducted a retrospective analysis on provincial reproductive centers in China from March 2020 to March 2021, utilizing 13 selected features. The algorithms used included XGBoost, LightGBM, KNN, Naïve Bayes, Random Forest, and Decision Tree. The results were evaluated using performance metrics such as precision, recall, F1-score, accuracy and AUC, employing five-fold cross-validation repeated five times. Results Among the models, LightGBM achieved the best performance, with an accuracy of 92.31%, recall of 87.80%, F1-score of 90.00\%, and an AUC of 90.41%. The model identified the estrogen concentration at the HCG injection(etwo), endometrium thickness (mm) on HCG day(EM TNK), years of infertility(Years), and body mass index(BMI) as the most important features. Conclusion This study successfully demonstrates the LightGBM model has the best predictive effect on pregnancy outcomes during IVF cycles. Additionally, etwo was found to be the most significant predictor for successful IVF compared to other variables. This machine learning approach has the potential to assist fertility specialists in providing counseling and adjusting treatment strategies for patients.
Collapse
Affiliation(s)
- Lu Li
- School of Basic Medicine, Anhui Medical University, Hefei, China
- Center of Reproductive Medicine, Children’s Hospital of Shanxi and Women Health Center of Shanxi, Taiyuan, China
| | - Xiangrong Cui
- Center of Reproductive Medicine, Children’s Hospital of Shanxi and Women Health Center of Shanxi, Taiyuan, China
| | - Jian Yang
- School of Information, Shanxi University of Finance and Economics, Taiyuan, China
| | - Xueqing Wu
- Center of Reproductive Medicine, Children’s Hospital of Shanxi and Women Health Center of Shanxi, Taiyuan, China
| | - Gang Zhao
- School of Basic Medicine, Anhui Medical University, Hefei, China
- Department of Electronic Science and Technology, University of Science and Technology of China, Hefei, China
| |
Collapse
|
91
|
Jin Y, Li C, Huang Z, Jiang L. Simultaneous Quantitative Determination of Low-Concentration Preservatives and Heavy Metals in Tricholoma Matsutakes Based on SERS and FLU Spectral Data Fusion. Foods 2023; 12:4267. [PMID: 38231731 DOI: 10.3390/foods12234267] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2023] [Revised: 11/19/2023] [Accepted: 11/23/2023] [Indexed: 01/19/2024] Open
Abstract
As an ingredient of great economic value, Tricholoma matsutake has received widespread attention. However, heavy metal residues and preservatives in it will affect the quality of Tricholoma matsutake and endanger the health of consumers. Here, we present a method for the simultaneous detection of low concentrations of potassium sorbate and lead in Tricholoma matsutakes based on surface-enhanced Raman spectroscopy (SERS) and fluorescence (FLU) spectroscopy to test the safety of consumption. Data fusion strategies combined with multiple machine learning methods, including partial least-squares regression (PLSR), deep forest (DF) and convolutional neural networks (CNN) are used for model training. The results show that combined with reasonable band selection, the CNN prediction model based on decision-level fusion achieves the best performance, the correlation coefficients (R2) were increased to 0.9963 and 0.9934, and the root mean square errors (RMSE) were reduced to 0.0712 g·kg-1 and 0.0795 mg·kg-1, respectively. The method proposed in this paper accurately predicts preservatives and heavy metals remaining in Tricholoma matsutake and provides a reference for other food safety testing.
Collapse
Affiliation(s)
- Yuanyin Jin
- College of Information Science and Technology, Nanjing Forestry University, 159 Longpan Road, Nanjing 210037, China
| | - Chun Li
- College of Information Science and Technology, Nanjing Forestry University, 159 Longpan Road, Nanjing 210037, China
| | - Zhengwei Huang
- College of Information Science and Technology, Nanjing Forestry University, 159 Longpan Road, Nanjing 210037, China
| | - Ling Jiang
- College of Information Science and Technology, Nanjing Forestry University, 159 Longpan Road, Nanjing 210037, China
| |
Collapse
|
92
|
Fu M, He R, Zhang Z, Ma F, Shen L, Zhang Y, Duan M, Zhang Y, Wang Y, Zhu L, He J. Multinomial machine learning identifies independent biomarkers by integrated metabolic analysis of acute coronary syndrome. Sci Rep 2023; 13:20535. [PMID: 37996510 PMCID: PMC10667512 DOI: 10.1038/s41598-023-47783-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2023] [Accepted: 11/18/2023] [Indexed: 11/25/2023] Open
Abstract
A multi-class classification model for acute coronary syndrome (ACS) remains to be constructed based on multi-fluid metabolomics. Major confounders may exert spurious effects on the relationship between metabolism and ACS. The study aims to identify an independent biomarker panel for the multiclassification of HC, UA, and AMI by integrating serum and urinary metabolomics. We performed a liquid chromatography-tandem mass spectrometry (LC-MS/MS)-based metabolomics study on 300 serum and urine samples from 44 patients with unstable angina (UA), 77 with acute myocardial infarction (AMI), and 29 healthy controls (HC). Multinomial machine learning approaches, including multinomial adaptive least absolute shrinkage and selection operator (LASSO) regression and random forest (RF), and assessment of the confounders were applied to integrate a multi-class classification biomarker panel for HC, UA and AMI. Different metabolic landscapes were portrayed during the transition from HC to UA and then to AMI. Glycerophospholipid metabolism and arginine biosynthesis were predominant during the progression from HC to UA and then to AMI. The multiclass metabolic diagnostic model (MDM) dependent on ACS, including 2-ketobutyric acid, LysoPC(18:2(9Z,12Z)), argininosuccinic acid, and cyclic GMP, demarcated HC, UA, and AMI, providing a C-index of 0.84 (HC vs. UA), 0.98 (HC vs. AMI), and 0.89 (UA vs. AMI). The diagnostic value of MDM largely derives from the contribution of 2-ketobutyric acid, and LysoPC(18:2(9Z,12Z)) in serum. Higher 2-ketobutyric acid and cyclic GMP levels were positively correlated with ACS risk and atherosclerosis plaque burden, while LysoPC(18:2(9Z,12Z)) and argininosuccinic acid showed the reverse relationship. An independent multiclass biomarker panel for HC, UA, and AMI was constructed using the multinomial machine learning methods based on serum and urinary metabolite signatures.
Collapse
Affiliation(s)
- Meijiao Fu
- Ningxia Medical University, Yinchuan, 750004, Ningxia, China
| | - Ruhua He
- Department of Cardiology, General Hospital of Ningxia Medical University, Yinchuan, 750004, Ningxia, China
| | - Zhihan Zhang
- Department of Cardiology, Hanzhong Central Hospital, Hanzhong, 723200, Shanxi, China
| | - Fuqing Ma
- Department of Cardiology, The Fifth People's Hospital of Ningxia, Shizuishan, 753000, Ningxia, China
| | - Libo Shen
- Center for Cardiovascular Diseases, People's Hospital of Ningxia Hui Autonomous Region, Yinchuan, 750002, Ningxia, China
| | - Yu Zhang
- Ningxia Medical University, Yinchuan, 750004, Ningxia, China
| | - Mingyu Duan
- Ningxia Medical University, Yinchuan, 750004, Ningxia, China
| | - Yameng Zhang
- Department of Cardiology, The Second Affiliated Hospital of Henan University of Science and Technology, Luoyang, 471000, Henan, China
| | - Yifan Wang
- Department of Radiology, General Hospital of Ningxia Medical University, Yinchuan, 750004, Ningxia, China
| | - Li Zhu
- Department of Radiology, General Hospital of Ningxia Medical University, Yinchuan, 750004, Ningxia, China.
| | - Jun He
- Department of Cardiology, General Hospital of Ningxia Medical University, Yinchuan, 750004, Ningxia, China.
| |
Collapse
|
93
|
Saleem M, Aslam W, Lali MIU, Rauf HT, Nasr EA. Predicting Thalassemia Using Feature Selection Techniques: A Comparative Analysis. Diagnostics (Basel) 2023; 13:3441. [PMID: 37998577 PMCID: PMC10670018 DOI: 10.3390/diagnostics13223441] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2023] [Revised: 10/25/2023] [Accepted: 11/06/2023] [Indexed: 11/25/2023] Open
Abstract
Thalassemia represents one of the most common genetic disorders worldwide, characterized by defects in hemoglobin synthesis. The affected individuals suffer from malfunctioning of one or more of the four globin genes, leading to chronic hemolytic anemia, an imbalance in the hemoglobin chain ratio, iron overload, and ineffective erythropoiesis. Despite the challenges posed by this condition, recent years have witnessed significant advancements in diagnosis, therapy, and transfusion support, significantly improving the prognosis for thalassemia patients. This research empirically evaluates the efficacy of models constructed using classification methods and explores the effectiveness of relevant features that are derived using various machine-learning techniques. Five feature selection approaches, namely Chi-Square (χ2), Exploratory Factor Score (EFS), tree-based Recursive Feature Elimination (RFE), gradient-based RFE, and Linear Regression Coefficient, were employed to determine the optimal feature set. Nine classifiers, namely K-Nearest Neighbors (KNN), Decision Trees (DT), Gradient Boosting Classifier (GBC), Linear Regression (LR), AdaBoost, Extreme Gradient Boosting (XGB), Random Forest (RF), Light Gradient Boosting Machine (LGBM), and Support Vector Machine (SVM), were utilized to evaluate the performance. The χ2 method achieved accuracy, registering 91.56% precision, 91.04% recall, and 92.65% f-score when aligned with the LR classifier. Moreover, the results underscore that amalgamating over-sampling with Synthetic Minority Over-sampling Technique (SMOTE), RFE, and 10-fold cross-validation markedly elevates the detection accuracy for αT patients. Notably, the Gradient Boosting Classifier (GBC) achieves 93.46% accuracy, 93.89% recall, and 92.72% F1 score.
Collapse
Affiliation(s)
- Muniba Saleem
- Department of Computer Science & Information Technology, The Government Sadiq College Women University Bahawalpur, Bahawalpur 63100, Pakistan;
| | - Waqar Aslam
- Department of Information Security, The Islamia University of Bahawalpur, Bahawalpur 63100, Pakistan
| | | | - Hafiz Tayyab Rauf
- Centre for Smart Systems, AI and Cybersecurity, Staffordshire University, Stoke-on-Trent ST4 2DE, UK;
| | - Emad Abouel Nasr
- Industrial Engineering Department, College of Engineering, King Saud University, Riyadh 11421, Saudi Arabia;
| |
Collapse
|
94
|
Li Y, Li X, Yang Y, Qiao X, Tao Q, Peng C, Han M, Dong K, Xu M, Wang D, Han G. Association of genes in hereditary metabolic diseases with diagnosis, prognosis, and treatment outcomes in gastric cancer. Front Immunol 2023; 14:1289700. [PMID: 38022516 PMCID: PMC10665511 DOI: 10.3389/fimmu.2023.1289700] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2023] [Accepted: 10/19/2023] [Indexed: 12/01/2023] Open
Abstract
Background Aberrant metabolism is a major hallmark of cancers and hereditary diseases. Genes associated with inborn metabolic errors may also play roles in cancer development. This study evaluated the overall impact of these genes on gastric cancer (GC). Methods In total, 162 genes involved in 203 hereditary metabolic diseases were identified in the Human Phenotype Ontology database. Clinical and multi-omic data were acquired from the GC cohort of the Affiliated Hospital of Jiangsu University and other published cohorts. A 4-gene and 32-gene signature was established for diagnosis and prognosis or therapeutic prediction, respectively, and corresponding abnormal metabolism scores (AMscores) were calculated. Results The diagnostic AMscore showed high sensitivity (0.88-1.00) and specificity (0.89-1.00) to distinguish between GC and paired normal tissues, with area under the receiver operating characteristic curve (AUC) ranging from 0.911 to 1.000 in four GC cohorts. The prognostic or predictive AMscore was an independent predictor of overall survival (OS) in five GC cohorts and a predictor of the OS and disease-free survival benefit of postoperative chemotherapy or chemoradiotherapy in one GC cohort with such data. The AMscore adversely impacts immune biomarkers, including tumor mutation burden, tumor neoantigen burden, microsatellite instability, programmed death-ligand 1 protein expression, tumor microenvironment score, T cell receptor clonality, and immune cell infiltration detected by multiplex immunofluorescence staining. The AUC of the AMscore for predicting immunotherapy response ranging from 0.780 to 0.964 in four cohorts involving GC, urothelial cancer, melanoma, and lung cancer. The objective response rates in the low and high AMscore subgroups were 78.6% and 3.2%, 40.4% and 7%, 52.6% and 0%, and 72.7% and 0%, respectively (all p<0.001). In cohorts with survival data, a high AMscore was hazardous for OS or progression-free survival, with hazard ratios ranged from 5.79 to 108.59 (all p<0.001). Importantly, the AMscore significantly improved the prediction of current immune biomarkers for both response and survival, thus redefining the advantaged and disadvantaged immunotherapy populations. Conclusions Signatures based on genes associated with hereditary metabolic diseases and their corresponding scores could be used to guide the diagnosis and treatment of GC. Therefore, further validation is required.
Collapse
Affiliation(s)
- Yiping Li
- Department of Oncology, The Affiliated Taizhou People’s Hospital of Nanjing Medical University, Taizhou, China
| | - Xiaoqin Li
- Department of Oncology, Digestive Disease Institute & Cancer Institute of Jiangsu University, Affiliated Hospital of Jiangsu University, Zhenjiang, China
| | - Yufei Yang
- Department of Oncology, Digestive Disease Institute & Cancer Institute of Jiangsu University, Affiliated Hospital of Jiangsu University, Zhenjiang, China
| | - Xuehan Qiao
- Department of Oncology, Digestive Disease Institute & Cancer Institute of Jiangsu University, Affiliated Hospital of Jiangsu University, Zhenjiang, China
| | - Qing Tao
- Department of Oncology, Digestive Disease Institute & Cancer Institute of Jiangsu University, Affiliated Hospital of Jiangsu University, Zhenjiang, China
| | - Chen Peng
- Department of Oncology, Digestive Disease Institute & Cancer Institute of Jiangsu University, Affiliated Hospital of Jiangsu University, Zhenjiang, China
| | - Miao Han
- Department of Oncology, Digestive Disease Institute & Cancer Institute of Jiangsu University, Affiliated Hospital of Jiangsu University, Zhenjiang, China
| | - Kebin Dong
- Department of Oncology, Digestive Disease Institute & Cancer Institute of Jiangsu University, Affiliated Hospital of Jiangsu University, Zhenjiang, China
| | - Min Xu
- Department of Gastroenterology, Digestive Disease Institute of Jiangsu University, Affiliated Hospital of Jiangsu University, Zhenjiang, China
| | - Deqiang Wang
- Department of Oncology, Digestive Disease Institute & Cancer Institute of Jiangsu University, Affiliated Hospital of Jiangsu University, Zhenjiang, China
| | - Gaohua Han
- Department of Oncology, The Affiliated Taizhou People’s Hospital of Nanjing Medical University, Taizhou, China
| |
Collapse
|
95
|
Li Q, Yang H, Hao N, Du M, Zhao Y, Li Y, Li X. Biodegradability analysis of Dioxins through in silico methods: Model construction and mechanism analysis. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2023; 345:118898. [PMID: 37657295 DOI: 10.1016/j.jenvman.2023.118898] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/04/2023] [Revised: 08/18/2023] [Accepted: 08/27/2023] [Indexed: 09/03/2023]
Abstract
The biodegradation treatment of dioxins has long been of interest due to its good ecological and economic effects. In this study, the biodegradability of polychlorinated dibenzo-p-dioxins (PCDDs) were investigated by constructing machine learning and multiple linear regression models. The maximum chlorine atomic charge (qHirshfeldCl+), which characterizes the biodegradation ability of PCDDs, was used as the response value. The random forest model was used to rank the importance on the 1471 descriptors of PCDDs, and the BCUTp-1 h, QXZ, JGI4, ATSC8c, VE3_Dt, topoShape, and maxwHBa were screened as the important descriptors by Pearson's correlation coefficient method. A quantitative structure-activity relationship (QSAR) model was constructed to predict the biodegradability of PCDDs. In addition, the extreme gradient boosting (XGBoost) and random forest model were also constructed and proved the good predictability of QSAR model. The biodegradability of polychlorinated dibenzofurans (PCDFs) can also be predicted by the constructed three models from a certain level after adjusting some model parameters, which further proved the versatility of the models. Besides, the sensitivity analysis of the QSAR model and a 3D-QSAR model was developed to investigate the biodegradability mechanisms of PCDDs. Results showed that the descriptors BCUTp-1 h, JGI4, and maxwHBa were the key descriptors in the biodegradability effect by the sensitivity analysis of the QSAR model. Coupled with the results of PCDDs biodegradability 3D-QSAR model, BCUTp-1 h, JGI4, and maxwHBa were confirmed as the main descriptors that affect the biodegradability of dioxins. This study provides a novel theoretical perspective for the research of the biodegradation of both PCDDs and PCDFs dioxins.
Collapse
Affiliation(s)
- Qing Li
- College of Environmental Science and Engineering, North China Electric Power University, Beijing, 102206, China.
| | - Hao Yang
- College of Environmental Science and Engineering, North China Electric Power University, Beijing, 102206, China.
| | - Ning Hao
- College of New Energy and Environment, Jilin University, Changchun, 130012, China.
| | - Meijn Du
- College of Environmental Science and Engineering, North China Electric Power University, Beijing, 102206, China.
| | - Yuanyuan Zhao
- College of Environmental Science and Engineering, North China Electric Power University, Beijing, 102206, China.
| | - Yu Li
- College of Environmental Science and Engineering, North China Electric Power University, Beijing, 102206, China.
| | - Xixi Li
- Center for Environmental Health Risk Assessment and Research, Chinese Research Academy of Environmental Sciences, Beijing, 100012, China; Northern Region Persistent Organic Pollution Control (NRPOP) Laboratory, Faculty of Engineering and Applied Science, Memorial University, St. John's, NL A1B 3X5, Canada.
| |
Collapse
|
96
|
Adamu M, Rehman KU, Ibrahim YE, Shatanawi W. Predicting the strengths of date fiber reinforced concrete subjected to elevated temperature using artificial neural network, and Weibull distribution. Sci Rep 2023; 13:18649. [PMID: 37903786 PMCID: PMC10616177 DOI: 10.1038/s41598-023-45462-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2023] [Accepted: 10/19/2023] [Indexed: 11/01/2023] Open
Abstract
Date palm fiber (DPF) is normally used as fiber material in concrete. Though its addition to concrete leads to decline in durability and mechanical strengths performance. Additionally, due to its high ligno-cellulose content and organic nature, when used in concrete for high temperature application, the DPF can easily degrade causing reduction in strength and increase in weight loss. To reduce these effects, the DPF is treated using alkaline solutions. Furthermore, pozzolanic materials are normally added to the DPF composites to reduce the effects of the ligno-cellulose content. Therefore, in this study silica fume was used as supplementary cementitious material in DPF reinforced concrete (DPFRC) to reduce the negative effects of elevated temperature. Hence this study aimed at predicting the residual strengths of DPFRC enhanced/improved with silica fume subjected to elevated temperature using different models such as artificial neural network (ANN), multi-variable regression analysis (MRA) and Weibull distribution. The DPFRC is produced by adding DPF in proportions of 0%, 1%, 2% and 3% by mass. Silica fume was used as partial substitute to cement in dosages of 0%, 5%, 10% and 15% by volume. The DPFRC was then subjected to elevated temperatures between 200 and 800 °C. The weight loss, residual compressive strength and relative strengths were measured. The residual compressive strength and relative strength of the DPFRC declined with addition of DPF at any temperature. Silica fume enhanced the residual and relative strengths of the DPFRC when heated to a temperature up to 400 °C. To forecast residual compressive strength (RCS) and relative strength (RS), we provide two distinct ANN models. The first layer's inputs include DPF (%), silica fume (%), temperature (°C), and weight loss (%). The hidden layer is thought to have ten neurons. M-I is the scenario in which we use RCS as an output, whereas M-II is the scenario in which we use RS as an output. The ANN models were trained using the Levenberg-Marquardt backpropagation algorithm (LMBA). Both neural networking models exhibit a significant correlation between the predicted and actual values, as seen by their respective R = 0.99462 and R = 0.98917. The constructed neural models M-I and M-II are highly accurate at predicting RCS and RS values. MRA and Weibull distribution were used for prediction of the strengths of the DPFRC under high temperature. The developed MRA was found to have a good prediction accuracy. The residual compressive strength and relative strength followed the two-parameter Weibull distribution.
Collapse
Affiliation(s)
- Musa Adamu
- Engineering Management Department, College of Engineering, Prince Sultan University, 11586, Riyadh, Saudi Arabia.
| | - Khalil Ur Rehman
- Department of Mathematics and Sciences, College of Humanities and Sciences, Prince Sultan University, 11586, Riyadh, Saudi Arabia
- Department of Mathematics, Air University, PAF Complex E-9, Islamabad, 44000, Pakistan
| | - Yasser E Ibrahim
- Engineering Management Department, College of Engineering, Prince Sultan University, 11586, Riyadh, Saudi Arabia
| | - Wasfi Shatanawi
- Department of Mathematics and Sciences, College of Humanities and Sciences, Prince Sultan University, 11586, Riyadh, Saudi Arabia.
- Department of Medical Research, China Medical University Hospital, China Medical University, Taichung, 40402, Taiwan.
- Department of Mathematics, Faculty of Science, The Hashemite University, P.O. Box 330127, Zarqa, 13133, Jordan.
| |
Collapse
|
97
|
Tai J, Wang L, Yan Z, Liu J. Single-cell sequencing and transcriptome analyses in the construction of a liquid-liquid phase separation-associated gene model for rheumatoid arthritis. Front Genet 2023; 14:1210722. [PMID: 37953920 PMCID: PMC10634374 DOI: 10.3389/fgene.2023.1210722] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2023] [Accepted: 10/09/2023] [Indexed: 11/14/2023] Open
Abstract
Background: Rheumatoid arthritis (RA) is a disabling autoimmune disease that affects multiple joints. Accumulating evidence suggests that imbalances in liquid-liquid phase separation (LLPS) can lead to altered spatiotemporal coordination of biomolecular condensates, which play important roles in carcinogenesis and inflammatory diseases. However, the role of LLPS in the development and progression of RA remains unclear. Methods: We screened RA and normal samples from GSE12021, GSE55235, and GSE55457 transcriptome datasets and GSE129087 and GSE109449 single-cell sequencing datasets from Gene Expression Omnibus database to investigate the pathogenesis of LLPS-related hub genes at the transcriptome and single cell sequencing levels. Machine learning algorithms and weighted gene co-expression network analysis were applied to screen hub genes, and hub genes were validated using correlation studies. Results: Differential analysis showed that 36 LLPS-related genes were significantly differentially expressed in RA, further random forest and support vector machine identified four and six LLPS-related genes, respectively, and weighted gene co-expression network analysis identified 396 modular genes. Hybridization of the three sets revealed two hub genes, MYC and MAP1LC3B, with AUCs of 0.907 and 0.911, respectively. Further ROC analysis of the hub genes in the GSE55457 dataset showed that the AUCs of MYC and MAP1LC3B were 0.815 and 0.785, respectively. qRT-PCR showed that the expression of MYC and MAP1LC3B in RA synovial tissues was significantly lower than that in the normal control synovial tissues. Correlation analysis between hub genes and the immune microenvironment and single-cell sequencing analysis revealed that both MYC and MAP1LC3B were significantly correlated with the degree of infiltration of various innate and acquired immune cells. Conclusion: Our study reveals a possible mechanism for LLPS in RA pathogenesis and suggests that MYC and MAP1LC3B may be potential novel molecular markers for RA with immunological significance.
Collapse
Affiliation(s)
- Jiaojiao Tai
- Department of Orthopedics, Honghui Hospital, Xi’an Jiaotong University, Xi’an, Shaanxi, China
| | - Linbang Wang
- Department of Orthopedics, Peking University Third Hospital, Beijing, China
| | - Ziqiang Yan
- Department of Orthopedics, Honghui Hospital, Xi’an Jiaotong University, Xi’an, Shaanxi, China
| | - Jingkun Liu
- Department of Orthopedics, Honghui Hospital, Xi’an Jiaotong University, Xi’an, Shaanxi, China
| |
Collapse
|
98
|
Li S, Zhu P, Cai G, Li J, Huang T, Tang W. Application of machine learning models in predicting insomnia severity: an integrative approach with constitution of traditional Chinese medicine. Front Med (Lausanne) 2023; 10:1292761. [PMID: 37928471 PMCID: PMC10625410 DOI: 10.3389/fmed.2023.1292761] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2023] [Accepted: 10/06/2023] [Indexed: 11/07/2023] Open
Abstract
Objective This study sought to explore the utility of machine learning models in predicting insomnia severity based on Traditional Chinese Medicine (TCM) constitution classifications, with an aim to discuss the potential applications of such models in the treatment and prevention of insomnia. Methods We analyzed a dataset of 165 insomnia patients from the Shanghai Minhang District Integrated Traditional Chinese and Western Medicine Hospital. TCM constitution was assessed using a standardized Constitution in Chinese Medicine (CCM) scale. Sleep quality, or insomnia severity, was evaluated using the Spiegel Sleep Questionnaire (SSQ). Machine learning models, including Random Forest Classifier (RFC), Support Vector Classifier (SVC), and K-Nearest Neighbors (KNN), were utilized. These models were optimized using Grid Search algorithm and were trained and tested on stratified patient data, with the TCM constitution classifications serving as primary predictors. Results The RFC outperformed others, achieving a weighted average accuracy, precision, recall, and F1-score of 0.91, 0.94, 0.92, and 0.92 respectively, it also effectively classified the severity of insomnia with high area under receiver operating characteristic curve (AUC-ROC) values. Feature importance analysis demonstrated the Damp-heat constitution as the most influential predictor, followed by Yang-deficiency, Qi-depression, Qi-deficiency, and Blood-stasis constitutions. Conclusion The results demonstrate the potent utility of machine learning, specifically RFC, coupled with TCM constitution classifications in predicting insomnia severity. Notably, the constitution classifications such as Damp-heat and Yang-deficiency emerged as crucial determinants, emphasizing its potential in guiding targeted insomnia treatments. This approach enables the development of more personalized and efficient interventions, thereby enhancing patient outcomes.
Collapse
Affiliation(s)
- Shenguang Li
- Shanghai Minhang Hospital of Integrated Traditional Chinese and Western Medicine, Shanghai, China
| | - Po Zhu
- Shanghai Minhang Hospital of Integrated Traditional Chinese and Western Medicine, Shanghai, China
| | - Guoying Cai
- Shanghai Minhang Hospital of Integrated Traditional Chinese and Western Medicine, Shanghai, China
| | - Jing Li
- Shanghai Minhang Hospital of Integrated Traditional Chinese and Western Medicine, Shanghai, China
| | - Tao Huang
- Yueyang Hospital of Integrated Traditional Chinese and Western Medicine Affiliated to Shanghai University of Traditional Chinese Medicine, Shanghai, China
| | - Wenchao Tang
- School of Acupuncture-Moxibustion and Tuina, Shanhgai University of Traditional Chinese Medicine, Shanghai, China
| |
Collapse
|
99
|
Ge F, Chen G, Qian M, Xu C, Liu J, Cao J, Li X, Hu D, Xu Y, Xin Y, Wang D, Zhou J, Shi H, Tan Z. Artificial Intelligence Aided Lipase Production and Engineering for Enzymatic Performance Improvement. JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY 2023; 71:14911-14930. [PMID: 37800676 DOI: 10.1021/acs.jafc.3c05029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/07/2023]
Abstract
With the development of artificial intelligence (AI), tailoring methods for enzyme engineering have been widely expanded. Additional protocols based on optimized network models have been used to predict and optimize lipase production as well as properties, namely, catalytic activity, stability, and substrate specificity. Here, different network models and algorithms for the prediction and reforming of lipase, focusing on its modification methods and cases based on AI, are reviewed in terms of both their advantages and disadvantages. Different neural networks coupled with various algorithms are usually applied to predict the maximum yield of lipase by optimizing the external cultivations for lipase production, while one part is used to predict the molecule variations affecting the properties of lipase. However, few studies have directly utilized AI to engineer lipase by affecting the structure of the enzyme, and a set of research gaps needs to be explored. Additionally, future perspectives of AI application in enzymes, including lipase engineering, are deduced to help the redesign of enzymes and the reform of new functional biocatalysts. This review provides a new horizon for developing effective and innovative AI tools for lipase production and engineering and facilitating lipase applications in the food industry and biomass conversion.
Collapse
Affiliation(s)
- Feiyin Ge
- School of Life Science and Food Engineering, Huaiyin Institute of Technology, Huai'an 223003, People's Republic of China
| | - Gang Chen
- School of Life Science and Food Engineering, Huaiyin Institute of Technology, Huai'an 223003, People's Republic of China
| | - Minjing Qian
- School of Life Science and Food Engineering, Huaiyin Institute of Technology, Huai'an 223003, People's Republic of China
| | - Cheng Xu
- School of Life Science and Food Engineering, Huaiyin Institute of Technology, Huai'an 223003, People's Republic of China
| | - Jiao Liu
- School of Life Science and Food Engineering, Huaiyin Institute of Technology, Huai'an 223003, People's Republic of China
| | - Jiaqi Cao
- School of Life Science and Food Engineering, Huaiyin Institute of Technology, Huai'an 223003, People's Republic of China
| | - Xinchao Li
- School of Life Science and Food Engineering, Huaiyin Institute of Technology, Huai'an 223003, People's Republic of China
| | - Die Hu
- School of Pharmacy & School of Biological and Food Engineering, Changzhou University, Changzhou 213164, People's Republic of China
| | - Yangsen Xu
- Dongtai Hanfangyuan Biotechnology Co. Ltd., Yancheng 224241, People's Republic of China
| | - Ya Xin
- School of Life Science and Food Engineering, Huaiyin Institute of Technology, Huai'an 223003, People's Republic of China
| | - Dianlong Wang
- School of Life Science and Food Engineering, Huaiyin Institute of Technology, Huai'an 223003, People's Republic of China
| | - Jia Zhou
- School of Life Science and Food Engineering, Huaiyin Institute of Technology, Huai'an 223003, People's Republic of China
| | - Hao Shi
- School of Life Science and Food Engineering, Huaiyin Institute of Technology, Huai'an 223003, People's Republic of China
| | - Zhongbiao Tan
- School of Life Science and Food Engineering, Huaiyin Institute of Technology, Huai'an 223003, People's Republic of China
| |
Collapse
|
100
|
Shearah Z, Ullah Z, Fakieh B. Intelligent Framework for Early Detection of Severe Pediatric Diseases from Mild Symptoms. Diagnostics (Basel) 2023; 13:3204. [PMID: 37892025 PMCID: PMC10606417 DOI: 10.3390/diagnostics13203204] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2023] [Revised: 10/05/2023] [Accepted: 10/11/2023] [Indexed: 10/29/2023] Open
Abstract
Children's health is one of the most significant fields in medicine. Most diseases that result in children's death or long-term morbidity are caused by preventable and treatable etiologies, and they appear in the child at the early stages as mild symptoms. This research aims to develop a machine learning (ML) framework to detect the severity of disease in children. The proposed framework helps in discriminating children's urgent/severe conditions and notifying parents whether a child needs to visit the emergency room immediately or not. The model considers several variables to detect the severity of cases, which are the symptoms, risk factors (e.g., age), and the child's medical history. The framework is implemented by using nine ML methods. The results achieved show the high performance of the proposed framework in identifying serious pediatric diseases, where decision tree and random forest outperformed the other methods with an accuracy rate of 94%. This shows the reliability of the proposed framework to be used as a pediatric decision-making system for detecting serious pediatric illnesses. The results are promising when compared to recent state-of-the-art studies. The main contribution of this research is to propose a framework that is viable for use by parents when their child suffers from any commonly developed symptoms.
Collapse
Affiliation(s)
- Zelal Shearah
- Department of Information Systems, Faculty of Computing and Information Technology, King Abdulaziz University, Jeddah 21589, Saudi Arabia; (Z.U.); (B.F.)
| | | | | |
Collapse
|