Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

Download

Total Articles

465
(from Reference Citation Analysis)

Article PDFs (131)

Cited by > 0 (244)

Searched Name

XGBoost

Ranked By

Results Analysis

Year Published Analysis
Article Type Analysis
Publication Title Analysis
Category Analysis

Results Analysis

Indexed Articles

Year Published

Show more Refine

Article Type

Show more Refine

Article Statistics

Refine

MESH Headings

Show more Refine

First Author

Show more Refine

First Author Affiliations

Show more Refine

Authors

Show more Refine

Publication Titles

Show more Refine

Grant Agencies

Show more Refine

Countries/Regions

Show more Refine

Affiliations

Show more Refine

Corresponding Author Affiliations

Show more Refine

Category

Show more Refine

Number

Citation Analysis

Balian J, Sakowitz S, Verma A, Vadlakonda A, Cruz E, Ali K, Benharash P. Machine learning based predictive modeling of readmissions following extracorporeal membrane oxygenation hospitalizations. Surg Open Sci 2024;19:125-130. [PMID: 38655069 PMCID: PMC11035075 DOI: 10.1016/j.sopen.2024.04.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2024] [Accepted: 04/05/2024] [Indexed: 04/26/2024] Open

Abstract

Background

Despite increasing utilization and survival benefit over the last decade, extracorporeal membrane oxygenation (ECMO) remains resource-intensive with significant complications and rehospitalization risk. We thus utilized machine learning (ML) to develop prediction models for 90-day nonelective readmission following ECMO.

Methods

All adult patients receiving ECMO who survived index hospitalization were tabulated from the 2016-2020 Nationwide Readmissions Database. Extreme Gradient Boosting (XGBoost) models were developed to identify features associated with readmission following ECMO. Area under the receiver operating characteristic (AUROC), mean Average Precision (mAP), and the Brier score were calculated to estimate model performance relative to logistic regression (LR). Shapley Additive Explanation summary (SHAP) plots evaluated the relative impact of each factor on the model. An additional sensitivity analysis solely included patient comorbidities and indication for ECMO as potential model covariates.

Results

Of ∼22,947 patients, 4495 (19.6 %) were readmitted nonelectively within 90 days. The XGBoost model exhibited superior discrimination (AUROC 0.64 vs 0.49), classification accuracy (mAP 0.30 vs 0.20) and calibration (Brier score 0.154 vs 0.165, all P < 0.001) in predicting readmission compared to LR. SHAP plots identified duration of index hospitalization, undergoing heart/lung transplantation, and Medicare insurance to be associated with increased odds of readmission. Upon sub-analysis, XGBoost demonstrated superior disclination compared to LR (AUROC 0.61 vs 0.60, P < 0.05). Chronic liver disease and frailty were linked with increased odds of nonelective readmission.

Conclusions

ML outperformed LR in predicting readmission following ECMO. Future work is needed to identify other factors linked with readmission and further optimize post-ECMO care among this cohort.

Collapse

Chauhan R, Goel A, Alankar B, Kaur H. Predictive modeling and web-based tool for cervical cancer risk assessment: A comparative study of machine learning models. MethodsX 2024;12:102653. [PMID: 38524310 PMCID: PMC10957413 DOI: 10.1016/j.mex.2024.102653] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2023] [Accepted: 03/08/2024] [Indexed: 03/26/2024] Open

Jiang Y, Zhao Q, Guan J, Wang Y, Chen J, Li Y. Analyzing prehospital delays in recurrent acute ischemic stroke: Insights from interpretable machine learning. Patient Educ Couns 2024;123:108228. [PMID: 38458092 DOI: 10.1016/j.pec.2024.108228] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/14/2023] [Revised: 02/18/2024] [Accepted: 02/24/2024] [Indexed: 03/10/2024]

Guo L, Xu X, Niu C, Wang Q, Park J, Zhou L, Lei H, Wang X, Yuan X. Machine learning-based prediction and experimental validation of heavy metal adsorption capacity of bentonite. Sci Total Environ 2024;926:171986. [PMID: 38552979 DOI: 10.1016/j.scitotenv.2024.171986] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/22/2024] [Revised: 03/23/2024] [Accepted: 03/24/2024] [Indexed: 04/01/2024]

Ebrahimian A, Mohammadi H, Maftoon N. Material characterization of human middle ear using machine-learning-based surrogate models. J Mech Behav Biomed Mater 2024;153:106478. [PMID: 38493562 DOI: 10.1016/j.jmbbm.2024.106478] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2023] [Revised: 02/09/2024] [Accepted: 02/24/2024] [Indexed: 03/19/2024]

Abstract

This study aims to introduce a novel non-invasive method for rapid material characterization of middle-ear structures, taking into consideration the invaluable insights provided by the mechanical properties of ear tissues. Valuable insights into various ear pathologies can be gleaned from the mechanical properties of ear tissues, yet conventional techniques for assessing these properties often entail invasive procedures that preclude their use on living patients. In this study, in the first step, we developed machine-learning models of the middle ear to predict its responses with a significantly lower computational cost in comparison to finite-element models. Leveraging findings from prior research, we focused on the most influential model parameters: the Young's modulus and thickness of the tympanic membrane and the Young's modulus of the stapedial annular ligament. The eXtreme Gradient Boosting (XGBoost) method was implemented for creating the machine-learning models. Subsequently, we combined the created machine-learning models with Bayesian optimization (BoTorch) for fast and efficient estimation of the Young's moduli of the tympanic membrane and the stapedial annular ligament. We demonstrate that the resultant surrogate models can fairly represent the vibrational responses of the umbo, stapes footplate, and vibration patterns of the tympanic membrane at most frequencies. Also, our proposed material characterization approach successfully estimated the Young's moduli of the tympanic membrane and stapedial annular ligament (separately and simultaneously) with values of mean absolute percentage error of less than 7%. The remarkable accuracy achieved through the proposed material characterization method underscores its potential for eventual clinical applications of estimating mechanical properties of the middle-ear structures for diagnostic purposes.

Collapse

Schoonemann J, Nagelkerke J, Seuntjens TG, Osinga N, van Liere D. Applying XGBoost and SHAP to Open Source Data to Identify Key Drivers and Predict Likelihood of Wolf Pair Presence. Environ Manage 2024;73:1072-1087. [PMID: 38372749 DOI: 10.1007/s00267-024-01941-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/05/2023] [Accepted: 01/20/2024] [Indexed: 02/20/2024]

Sawant PA, Hiralkar SS, Hulsurkar YP, Phutane MS, Mahajan US, Kudale AM. Predicting over-the-counter antibiotic use in rural Pune, India, using machine learning methods. Epidemiol Health 2024:e2024044. [PMID: 38637971 DOI: 10.4178/epih.e2024044] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2023] [Accepted: 03/25/2024] [Indexed: 04/20/2024] Open

Nikpour P, Shafiei M, Khatibi V. Gelato: a new hybrid deep learning-based Informer model for multivariate air pollution prediction. Environ Sci Pollut Res Int 2024:10.1007/s11356-024-33190-4. [PMID: 38592633 DOI: 10.1007/s11356-024-33190-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/07/2023] [Accepted: 03/29/2024] [Indexed: 04/10/2024]

Rasouli S, Dakkali MS, Azarbad R, Ghazvini A, Asani M, Mirzaasgari Z, Arish M. Predicting the conversion from clinically isolated syndrome to multiple sclerosis: An explainable machine learning approach. Mult Scler Relat Disord 2024;86:105614. [PMID: 38642495 DOI: 10.1016/j.msard.2024.105614] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2023] [Revised: 04/04/2024] [Accepted: 04/07/2024] [Indexed: 04/22/2024]

Huckvale ED, Moseley HN. Predicting The Pathway Involvement Of Metabolites Based on Combined Metabolite and Pathway Features. bioRxiv 2024:2024.04.01.587582. [PMID: 38617261 PMCID: PMC11014601 DOI: 10.1101/2024.04.01.587582] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/16/2024]

Reveguk I, Simonson T. Classifying protein kinase conformations with machine learning. Protein Sci 2024;33:e4918. [PMID: 38501429 PMCID: PMC10962494 DOI: 10.1002/pro.4918] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2023] [Revised: 01/02/2024] [Accepted: 01/22/2024] [Indexed: 03/20/2024]

Abstract

Protein kinases are key actors of signaling networks and important drug targets. They cycle between active and inactive conformations, distinguished by a few elements within the catalytic domain. One is the activation loop, whose conserved DFG motif can occupy DFG-in, DFG-out, and some rarer conformations. Annotation and classification of the structural kinome are important, as different conformations can be targeted by different inhibitors and activators. Valuable resources exist; however, large-scale applications will benefit from increased automation and interpretability of structural annotation. Interpretable machine learning models are described for this purpose, based on ensembles of decision trees. To train them, a set of catalytic domain sequences and structures was collected, somewhat larger and more diverse than existing resources. The structures were clustered based on the DFG conformation and manually annotated. They were then used as training input. Two main models were constructed, which distinguished active/inactive and in/out/other DFG conformations. They considered initially 1692 structural variables, spanning the whole catalytic domain, then identified ("learned") a small subset that sufficed for accurate classification. The first model correctly labeled all but 3 of 3289 structures as active or inactive, while the second assigned the correct DFG label to all but 17 of 8826 structures. The most potent classifying variables were all related to well-known structural elements in or near the activation loop and their ranking gives insights into the conformational preferences. The models were used to automatically annotate 3850 kinase structures predicted recently with the Alphafold2 tool, showing that Alphafold2 reproduced the active/inactive but not the DFG-in proportions seen in the Protein Data Bank. We expect the models will be useful for understanding and engineering kinases.

Collapse

Mohit A, Remya N. Exploring effects of carbon, nitrogen, and phosphorus on greywater treatment by polyculture microalgae using response surface methodology and machine learning. J Environ Manage 2024;356:120728. [PMID: 38531138 DOI: 10.1016/j.jenvman.2024.120728] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/14/2023] [Revised: 02/20/2024] [Accepted: 03/19/2024] [Indexed: 03/28/2024]

Abstract

The microalgae-based wastewater treatment is a promising technique that contribute to achieving sustainable development goals (SDGs), such as SDG-6, "Clean Water and Sanitation". However, it is strongly influenced by the initial composition of wastewater. In this study, the impact of initial organics and nutrient concentration on the removal of total organic carbon (TOC), total carbon (TC), ammonium (NH4+), total nitrogen (TN), and phosphate (PO43-) from greywater using native polyculture microalgae was explored. Response surface methodology was employed along with two machine learning approaches, AdaBoost and XGBoost, to evaluate the interactions among three main factors: TOC, NH4+, and PO43-, and their effects on treatment efficiency. The C/N ratios for achieving maximum TOC and TC removal efficiency of 99.2% and 97.7% were determined to be 10.3, and 65.4-73.6, respectively. Notably, the N/P ratio did not significantly affect their removal. The highest NH4+ removal efficiency, reaching 96.2%, was attained at C/N ratios of 4.3, 24.0, 38.2, and 212.9, coupled with N/P ratios of 0.3, 2.6, and 23.4. Highest TN removal efficiency of 77.2% was achieved at C/N and N/P ratios of 12.2 and 2.0, respectively. Highest PO43- removal of 78.8% was obtained at N/P ratio 12.8. However, C/N ratio did not affect the removal efficiency. Maintaining these specified C/N and N/P ratios in the influent greywater would ensure that the treated greywater meets the required standards for various reuse applications, including flushing, groundwater recharge, and surface water discharge. The integration of RSM with AdaBoost and XGBoost provided accurate predictions of removal efficiencies. For all the models, XGBoost had the highest R2, and lowest MAE and MSE values. The cross validation of RSM models with AdaBoost and XGBoost further reinforced the reliability of these models in predicting treatment outcomes.

Collapse

Schönnagel L, Tani S, Vu-Han TL, Zhu J, Camino-Willhuber G, Dodo Y, Caffard T, Chiapparelli E, Oezel L, Shue J, Zelenty WD, Lebl DR, Cammisa FP, Girardi FP, Sokunbi G, Hughes AP, Sama AA. Predicting conversion of ambulatory ACDF patients to inpatient: a machine learning approach. Spine J 2024;24:563-571. [PMID: 37980960 DOI: 10.1016/j.spinee.2023.11.010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/27/2023] [Revised: 10/29/2023] [Accepted: 11/12/2023] [Indexed: 11/21/2023]

Abstract

BACKGROUND CONTEXT

Machine learning is a powerful tool that has become increasingly important in the orthopedic field. Recently, several studies have reported that predictive models could provide new insights into patient risk factors and outcomes. Anterior cervical discectomy and fusion (ACDF) is a common operation that is performed as an outpatient procedure. However, some patients are required to convert to inpatient status and prolonged hospitalization due to their condition. Appropriate patient selection and identification of risk factors for conversion could provide benefits to patients and the use of medical resources.

PURPOSE

This study aimed to develop a machine-learning algorithm to identify risk factors associated with unplanned conversion from outpatient to inpatient status for ACDF patients.

STUDY DESIGN/SETTING

This is a machine-learning-based analysis using retrospectively collected data.

PATIENT SAMPLE

Patients who underwent one- or two-level ACDF in an ambulatory setting at a single specialized orthopedic hospital between February 2016 to December 2021.

OUTCOME MEASURES

Length of stay, conversion rates from ambulatory setting to inpatient.

METHODS

Patients were divided into two groups based on length of stay: (1) Ambulatory (discharge within 24 hours) or Extended Stay (greater than 24 hours but fewer than 48 hours), and (2) Inpatient (greater than 48 hours). Factors included in the model were based on literature review and clinical expertise. Patient demographics, comorbidities, and intraoperative factors, such as surgery duration and time, were included. We compared the performance of different machine learning algorithms: Logistic Regression, Random Forest (RF), Support Vector Machine (SVM), and Extreme Gradient Boosting (XGBoost). We split the patient data into a training and validation dataset using a 70/30 split. The different models were trained in the training dataset using cross-validation. The performance was then tested in the unseen validation set. This step is important to detect overfitting. The performance was evaluated using the area under the curve (AUC) of the receiver operating characteristics analysis (ROC) as the primary outcome. An AUC of 0.7 was considered fair, 0.8 good, and 0.9 excellent, according to established cut-offs.

RESULTS

A total of 581 patients (59% female) were available for analysis. Of those, 140 (24.1%) were converted to inpatient status. The median age was 51 (IQR 44-59), and the median BMI was 28 kg/m2 (IQR 24-32). The XGBoost model showed the best performance with an AUC of 0.79. The most important features were the length of the operation, followed by sex (based on biological attributes), age, and operation start time. The logistic regression model and the SVM showed worse results, with an AUC of 0.71 each.

CONCLUSIONS

This study demonstrated a novel approach to predicting conversion to inpatient status in eligible patients for ambulatory surgery. The XGBoost model showed good predictive capabilities, superior to the older machine learning approaches. This model also revealed the importance of surgical duration time, BMI, and age as risk factors for patient conversion. A developing field of study is using machine learning in clinical decision-making. Our findings contribute to this field by demonstrating the feasibility and accuracy of such methods in predicting outcomes and identifying risk factors, although external and multi-center validation studies are needed.

Collapse

Affiliation(s)

Lukas Schönnagel Spine Care Institute, Hospital for Special Surgery, 535 East 70th Street, New York, NY 10021, USA; Center for Musculoskeletal Surgery, Charité - Universitätsmedizin Berlin, Freie Universität Berlin, Charitéplatz 1, 10117 Berlin, Germany
Soji Tani Spine Care Institute, Hospital for Special Surgery, 535 East 70th Street, New York, NY 10021, USA; Department of Orthopaedic Surgery, Showa University School of Medicine, 1-5-8 Hatanodai, Shinagawa-ku, Tokyo 142-8666, Japan
Tu-Lan Vu-Han Center for Musculoskeletal Surgery, Charité - Universitätsmedizin Berlin, Freie Universität Berlin, Charitéplatz 1, 10117 Berlin, Germany
Jiaqi Zhu Biostatistics Core, Hospital for Special Surgery, 541 E. 71st Street, New York, NY 10021, USA
Gaston Camino-Willhuber Spine Care Institute, Hospital for Special Surgery, 535 East 70th Street, New York, NY 10021, USA
Yusuke Dodo Department of Orthopaedic Surgery, Showa University School of Medicine, 1-5-8 Hatanodai, Shinagawa-ku, Tokyo 142-8666, Japan
Thomas Caffard Spine Care Institute, Hospital for Special Surgery, 535 East 70th Street, New York, NY 10021, USA; Department of Orthopedic Surgery, University of Ulm, Oberer Eselsberg 45, 89081 Ulm, Germany
Erika Chiapparelli Spine Care Institute, Hospital for Special Surgery, 535 East 70th Street, New York, NY 10021, USA
Lisa Oezel Department of Orthopedic Surgery and Traumatology, University Hospital Duesseldorf, Moorenstraße 5, 40225 Duesseldorf, Germany
Jennifer Shue Spine Care Institute, Hospital for Special Surgery, 535 East 70th Street, New York, NY 10021, USA
William D Zelenty Spine Care Institute, Hospital for Special Surgery, 535 East 70th Street, New York, NY 10021, USA
Darren R Lebl Spine Care Institute, Hospital for Special Surgery, 535 East 70th Street, New York, NY 10021, USA
Frank P Cammisa Spine Care Institute, Hospital for Special Surgery, 535 East 70th Street, New York, NY 10021, USA
Federico P Girardi Spine Care Institute, Hospital for Special Surgery, 535 East 70th Street, New York, NY 10021, USA
Gbolabo Sokunbi Spine Care Institute, Hospital for Special Surgery, 535 East 70th Street, New York, NY 10021, USA
Alexander P Hughes Spine Care Institute, Hospital for Special Surgery, 535 East 70th Street, New York, NY 10021, USA
Andrew A Sama Spine Care Institute, Hospital for Special Surgery, 535 East 70th Street, New York, NY 10021, USA.

Collapse

Alan N, Zenkin S, Lavadi RS, Legarreta AD, Hudson JS, Fields DP, Agarwal N, Mamindla P, Ak M, Peddagangireddy V, Puccio L, Buell TJ, Hamilton DK, Kanter AS, Okonkwo DO, Zinn PO, Colen RR. Associating T1-Weighted and T2-Weighted Magnetic Resonance Imaging Radiomic Signatures With Preoperative Symptom Severity in Patients With Cervical Spondylotic Myelopathy. World Neurosurg 2024;184:e137-e143. [PMID: 38253177 DOI: 10.1016/j.wneu.2024.01.072] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2023] [Accepted: 01/14/2024] [Indexed: 01/24/2024]

Affiliation(s)

Nima Alan Department of Neurological Surgery, University of California, San Francisco, San Francisco, California.
Serafettin Zenkin Department of Radiology, University of Pittsburgh Medical Center, Pittsburgh, Pennsylvania
Raj Swaroop Lavadi Department of Neurological Surgery, University of Pittsburgh School of Medicine, Pittsburgh, Pennsylvania
Andrew D Legarreta Department of Neurological Surgery, University of Pittsburgh School of Medicine, Pittsburgh, Pennsylvania
Joseph S Hudson Department of Neurological Surgery, University of Pittsburgh School of Medicine, Pittsburgh, Pennsylvania
Daryl P Fields Department of Neurological Surgery, University of Pittsburgh School of Medicine, Pittsburgh, Pennsylvania
Nitin Agarwal Department of Neurological Surgery, University of Pittsburgh School of Medicine, Pittsburgh, Pennsylvania; Department of Neurological Surgery, University of Pittsburgh Medical Center, Pittsburgh, Pennsylvania
Priyadarshini Mamindla Department of Radiology, University of Pittsburgh Medical Center, Pittsburgh, Pennsylvania
Murat Ak Department of Radiology, University of Pittsburgh Medical Center, Pittsburgh, Pennsylvania
Vishal Peddagangireddy Department of Radiology, University of Pittsburgh Medical Center, Pittsburgh, Pennsylvania
Lauren Puccio Department of Neurological Surgery, University of Pittsburgh School of Medicine, Pittsburgh, Pennsylvania
Thomas J Buell Department of Neurological Surgery, University of Pittsburgh School of Medicine, Pittsburgh, Pennsylvania
D Kojo Hamilton Department of Neurological Surgery, University of Pittsburgh School of Medicine, Pittsburgh, Pennsylvania
Adam S Kanter Department of Neurosurgery, Hoag Neurosciences Institute, Newport Beach, California
David O Okonkwo Department of Neurological Surgery, University of Pittsburgh School of Medicine, Pittsburgh, Pennsylvania
Pascal O Zinn Department of Neurological Surgery, University of Pittsburgh School of Medicine, Pittsburgh, Pennsylvania
Rivka R Colen Department of Radiology, University of Pittsburgh Medical Center, Pittsburgh, Pennsylvania; Hillman Cancer Center, University of Pittsburgh Medical Center, Pittsburgh, Pennsylvania

Collapse

Mishra M, Chen PH, Lin GY, Nguyen TTN, Le TC, Dejchanchaiwong R, Tekasakul P, Shih SH, Jhang CW, Tsai CJ. Photochemical oxidation of VOCs and their source impact assessment on ozone under de-weather conditions in Western Taiwan. Environ Pollut 2024;346:123662. [PMID: 38417604 DOI: 10.1016/j.envpol.2024.123662] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/15/2024] [Revised: 02/17/2024] [Accepted: 02/25/2024] [Indexed: 03/01/2024]

Abstract

The application of statistical models has excellent potential to provide crucial information for mitigating the challenging issue of ozone (O3) pollution by capturing its associations with explanatory variables, including reactive precursors (VOCs and NOX) and meteorology. Considering the large contribution of O3 in degrading the air quality of western Taiwan, three-year (2019-2021) hourly concentration data of VOC, NOX and O3 from 4 monitoring stations of western Taiwan: Tucheng (TC), Zhongming (ZM), Taixi (TX) and Xiaogang (XG), was evaluated to identify the effect of anthropogenic emissions on O3 formation. Owing to the high-ambient reactivity of VOCs on the underestimation of sources, photochemical oxidation was assessed to calculate the consumed VOC (VOCcons) which was followed by the source identification of their initial concentrations. VOCcons was observed to be highest in the summer season (16.7 and 22.7 ppbC) at north (TC and ZM) and in the autumn season (17.8 and 11.4 ppbC) in southward-located stations (TX and XG, respectively). Results showed that VOCs from solvents (25-27%) were the major source at northward stations whereas VOCs-industrial emissions (30%) dominated in south. Furthermore, machine learning (ML): eXtreme Gradient Boost (XGBoost) model based de-weather analysis identified that meteorological factors favor to reduce ambient O3 levels at TC, ZM and XG stations (-67%, -47% and -21%, respectively) but they have a major role in accumulating the O3 (+38%) at the TX station which is primarily transported from the upwind region of south-central Taiwan. Crucial insights using ML outputs showed that the finding of the study can be utilized for region-specific data-driven control of emission from VOCs-sources and prioritized to limit the O3-pollution at the study location-ns as well as their accumulation in distant regions.

Collapse

Yanagawa R, Iwadoh K, Akabane M, Imaoka Y, Bozhilov KK, Melcher ML, Sasaki K. LightGBM outperforms other machine learning techniques in predicting graft failure after liver transplantation: Creation of a predictive model through large-scale analysis. Clin Transplant 2024;38:e15316. [PMID: 38607291 DOI: 10.1111/ctr.15316] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2023] [Revised: 03/18/2024] [Accepted: 03/24/2024] [Indexed: 04/13/2024]

Wu J, Chen X, Li R, Wang A, Huang S, Li Q, Qi H, Liu M, Cheng H, Wang Z. A novel framework for high resolution air quality index prediction with interpretable artificial intelligence and uncertainties estimation. J Environ Manage 2024;357:120785. [PMID: 38583378 DOI: 10.1016/j.jenvman.2024.120785] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/08/2023] [Revised: 02/02/2024] [Accepted: 03/27/2024] [Indexed: 04/09/2024]

Abstract

Accurate air quality index (AQI) prediction is essential in environmental monitoring and management. Given that previous studies neglect the importance of uncertainty estimation and the necessity of constraining the output during prediction, we proposed a new hybrid model, namely TMSSICX, to forecast the AQI of multiple cities. Firstly, time-varying filtered based empirical mode decomposition (TVFEMD) was adopted to decompose the AQI sequence into multiple internal mode functions (IMF) components. Secondly, multi-scale fuzzy entropy (MFE) was applied to evaluate the complexity of each IMF component and clustered them into high and low-frequency portions. In addition, the high-frequency portion was secondarily decomposed by successive variational mode decomposition (SVMD) to reduce volatility. Then, six air pollutant concentrations, namely CO, SO2, PM2.5, PM10, O3, and NO2, were used as inputs. The secondary decomposition and preliminary portion were employed as the outputs for the bidirectional long short-term memory network optimized by the snake optimization algorithm (SOABiLSTM) and improved Catboost (ICatboost), respectively. Furthermore, extreme gradient boosting (XGBoost) was applied to ensemble each predicted sub-model to acquire the consequence. Ultimately, we introduced adaptive kernel density estimation (AKDE) for interval estimation. The empirical outcome indicated the TMSSICX model achieved the best performance among the other 23 models across all datasets. Moreover, implementing the XGBoost to ensemble each predicted sub-model led to an 8.73%, 8.94%, and 0.19% reduction in RMSE, compared to SVM. Additionally, by utilizing SHapley Additive exPlanations (SHAP) to assess the impact of the six pollutant concentrations on AQI, the results reveal that PM2.5 and PM10 had the most notable positive effects on the long-term trend of AQI. We hope this model can provide guidance for air quality management.

Collapse

Yilmaz R, Yagin FH, Colak C, Toprak K, Abdel Samee N, Mahmoud NF, Alshahrani AA. Analysis of hematological indicators via explainable artificial intelligence in the diagnosis of acute heart failure: a retrospective study. Front Med (Lausanne) 2024;11:1285067. [PMID: 38633310 PMCID: PMC11023638 DOI: 10.3389/fmed.2024.1285067] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2023] [Accepted: 03/14/2024] [Indexed: 04/19/2024] Open

Abstract

Introduction

Acute heart failure (AHF) is a serious medical problem that necessitates hospitalization and often results in death. Patients hospitalized in the emergency department (ED) should therefore receive an immediate diagnosis and treatment. Unfortunately, there is not yet a fast and accurate laboratory test for identifying AHF. The purpose of this research is to apply the principles of explainable artificial intelligence (XAI) to the analysis of hematological indicators for the diagnosis of AHF.

Methods

In this retrospective analysis, 425 patients with AHF and 430 healthy individuals served as assessments. Patients' demographic and hematological information was analyzed to diagnose AHF. Important risk variables for AHF diagnosis were identified using the Least Absolute Shrinkage and Selection Operator (LASSO) feature selection. To test the efficacy of the suggested prediction model, Extreme Gradient Boosting (XGBoost), a 10-fold cross-validation procedure was implemented. The area under the receiver operating characteristic curve (AUC), F1 score, Brier score, Positive Predictive Value (PPV), and Negative Predictive Value (NPV) were all computed to evaluate the model's efficacy. Permutation-based analysis and SHAP were used to assess the importance and influence of the model's incorporated risk factors.

Results

White blood cell (WBC), monocytes, neutrophils, neutrophil-lymphocyte ratio (NLR), red cell distribution width-standard deviation (RDW-SD), RDW-coefficient of variation (RDW-CV), and platelet distribution width (PDW) values were significantly higher than the healthy group (p < 0.05). On the other hand, erythrocyte, hemoglobin, basophil, lymphocyte, mean platelet volume (MPV), platelet, hematocrit, mean erythrocyte hemoglobin (MCH), and procalcitonin (PCT) values were found to be significantly lower in AHF patients compared to healthy controls (p < 0.05). When XGBoost was used in conjunction with LASSO to diagnose AHF, the resulting model had an AUC of 87.9%, an F1 score of 87.4%, a Brier score of 0.036, and an F1 score of 87.4%. PDW, age, RDW-SD, and PLT were identified as the most crucial risk factors in differentiating AHF.

Conclusion

The results of this study showed that XAI combined with ML could successfully diagnose AHF. SHAP descriptions show that advanced age, low platelet count, high RDW-SD, and PDW are the primary hematological parameters for the diagnosis of AHF.

Collapse

Nath SJ, Girach IA, Harithasree S, Bhuyan K, Ojha N, Kumar M. Urban ozone variability using automated machine learning: inference from different feature importance schemes. Environ Monit Assess 2024;196:393. [PMID: 38520559 DOI: 10.1007/s10661-024-12549-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/16/2023] [Accepted: 03/16/2024] [Indexed: 03/25/2024]

Abstract

Tropospheric ozone is an air pollutant at the ground level and a greenhouse gas which significantly contributes to the global warming. Strong anthropogenic emissions in and around urban environments enhance surface ozone pollution impacting the human health and vegetation adversely. However, observations are often scarce and the factors driving ozone variability remain uncertain in the developing regions of the world. In this regard, here, we conducted machine learning (ML) simulations of ozone variability and comprehensively examined the governing factors over a major urban environment (Ahmedabad) in western India. Ozone precursors (NO2, NO, CO, C5H8 and CH2O) from the CAMS (Copernicus Atmosphere Monitoring Service) reanalysis and meteorological parameters from the ERA5 (European Centre for Medium-Range Weather Forecast's (ECMWF) fifth-generation reanalysis) were included as features in the ML models. Automated ML (AutoML) fitted the deep learning model optimally and simulated the daily ozone with root mean square error (RMSE) of ~2 ppbv reproducing 84-88% of variability. The model performance achieved here is comparable to widely used ML models (RF-Random Forest and XGBoost-eXtreme Gradient Boosting). Explainability of the models is discussed through different schemes of feature importance, including SAGE (Shapley Additive Global importancE) and permutation importance. The leading features are found to be different from different feature importance schemes. We show that urban ozone could be simulated well (RMSE = 2.5 ppbv and R2 = 0.78) by considering first four leading features, from different schemes, which are consistent with ozone photochemistry. Our study underscores the need to conduct science-informed analysis of feature importance from multiple schemes to infer the roles of input variables in ozone variability. AutoML-based studies, exploiting potentials of long-term observations, can strongly complement the conventional chemistry-transport modelling and can also help in accurate simulation and forecast of urban ozone.

Collapse

Myśliwiec P, Kubit A, Szawara P. Optimization of 2024-T3 Aluminum Alloy Friction Stir Welding Using Random Forest, XGBoost, and MLP Machine Learning Techniques. Materials (Basel) 2024;17:1452. [PMID: 38611968 PMCID: PMC11012866 DOI: 10.3390/ma17071452] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/04/2024] [Revised: 03/18/2024] [Accepted: 03/20/2024] [Indexed: 04/14/2024]

Miao R, Dong Q, Liu X, Chen Y, Wang J, Chen J. A cost-effective, machine learning-driven approach for screening arterial functional aging in a large-scale Chinese population. Front Public Health 2024;12:1365479. [PMID: 38572001 PMCID: PMC10987946 DOI: 10.3389/fpubh.2024.1365479] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2024] [Accepted: 02/23/2024] [Indexed: 04/05/2024] Open

Abstract

Introduction

An easily accessible and cost-free machine learning model based on prior probabilities of vascular aging enables an application to pinpoint high-risk populations before physical checks and optimize healthcare investment.

Methods

A dataset containing questionnaire responses and physical measurement parameters from 77,134 adults was extracted from the electronic records of the Health Management Center at the Third Xiangya Hospital. The least absolute shrinkage and selection operator and recursive feature elimination-Lightweight Gradient Elevator were employed to select features from a pool of potential covariates. The participants were randomly divided into training (70%) and test cohorts (30%). Four machine learning algorithms were applied to build the screening models for elevated arterial stiffness (EAS), and the performance of models was evaluated by calculating the area under the receiver operating characteristic curve (AUC), sensitivity, specificity, and accuracy.

Results

Fourteen easily accessible features were selected to construct the model, including "systolic blood pressure" (SBP), "age," "waist circumference," "history of hypertension," "sex," "exercise," "awareness of normal blood pressure," "eat fruit," "work intensity," "drink milk," "eat bean products," "smoking," "alcohol consumption," and "Irritableness." The extreme gradient boosting (XGBoost) model outperformed the other three models, achieving AUC values of 0.8722 and 0.8710 in the training and test sets, respectively. The most important five features are SBP, age, waist, history of hypertension, and sex.

Conclusion

The XGBoost model ideally assesses the prior probability of the current EAS in the general population. The integration of the model into primary care facilities has the potential to lower medical expenses and enhance the management of arterial aging.

Collapse

Codde C, Rivals F, Destere A, Fromage Y, Labriffe M, Marquet P, Benoist C, Ponthier L, Faucher JF, Woillard JB. A machine learning approach to predict daptomycin exposure from two concentrations based on Monte Carlo simulations. Antimicrob Agents Chemother 2024:e0141523. [PMID: 38501807 DOI: 10.1128/aac.01415-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2023] [Accepted: 02/23/2024] [Indexed: 03/20/2024] Open

Zhou X, Chen X, Tang L, Wang Y, Zheng J, Zhang W. Event-related driver stress detection with smartphones in an urban environment: a naturalistic driving study. Ergonomics 2024:1-19. [PMID: 38501496 DOI: 10.1080/00140139.2024.2323997] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/23/2023] [Accepted: 02/23/2024] [Indexed: 03/20/2024]

Liu L, Zhang P, Liu Z, Sun T, Qiao H. Joint global and local interpretation method for CIN status classification in breast cancer. Heliyon 2024;10:e27054. [PMID: 38562500 PMCID: PMC10982965 DOI: 10.1016/j.heliyon.2024.e27054] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2023] [Revised: 12/10/2023] [Accepted: 02/22/2024] [Indexed: 04/04/2024] Open

Zhang Y, Xiao L, LYu L, Zhang L. Construction of a predictive model for bone metastasis from first primary lung adenocarcinoma within 3 cm based on machine learning algorithm: a retrospective study. PeerJ 2024;12:e17098. [PMID: 38495760 PMCID: PMC10944632 DOI: 10.7717/peerj.17098] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2023] [Accepted: 02/21/2024] [Indexed: 03/19/2024] Open

Zhang G, Shao F, Yuan W, Wu J, Qi X, Gao J, Shao R, Tang Z, Wang T. Predicting sepsis in-hospital mortality with machine learning: a multi-center study using clinical and inflammatory biomarkers. Eur J Med Res 2024;29:156. [PMID: 38448999 PMCID: PMC10918942 DOI: 10.1186/s40001-024-01756-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2023] [Accepted: 02/28/2024] [Indexed: 03/08/2024] Open

Abstract

BACKGROUND

This study aimed to develop and validate an interpretable machine-learning model that utilizes clinical features and inflammatory biomarkers to predict the risk of in-hospital mortality in critically ill patients suffering from sepsis.

METHODS

We enrolled all patients diagnosed with sepsis in the Medical Information Mart for Intensive Care IV (MIMIC-IV, v.2.0), eICU Collaborative Research Care (eICU-CRD 2.0), and the Amsterdam University Medical Centers databases (AmsterdamUMCdb 1.0.2). LASSO regression was employed for feature selection. Seven machine-learning methods were applied to develop prognostic models. The optimal model was chosen based on its accuracy, F1 score and area under curve (AUC) in the validation cohort. Moreover, we utilized the SHapley Additive exPlanations (SHAP) method to elucidate the effects of the features attributed to the model and analyze how individual features affect the model's output. Finally, Spearman correlation analysis examined the associations among continuous predictor variables. Restricted cubic splines (RCS) explored potential non-linear relationships between continuous risk factors and in-hospital mortality.

RESULTS

3535 patients with sepsis were eligible for participation in this study. The median age of the participants was 66 years (IQR, 55-77 years), and 56% were male. After selection, 12 of the 45 clinical parameters collected on the first day after ICU admission remained associated with prognosis and were used to develop machine-learning models. Among seven constructed models, the eXtreme Gradient Boosting (XGBoost) model achieved the best performance, with an AUC of 0.94 and an F1 score of 0.937 in the validation cohort. Feature importance analysis revealed that Age, AST, invasive ventilation treatment, and serum urea nitrogen (BUN) were the top four features of the XGBoost model with the most significant impact. Inflammatory biomarkers may have prognostic value. Furthermore, SHAP force analysis illustrated how the constructed model visualized the prediction of the model.

CONCLUSIONS

This study demonstrated the potential of machine-learning approaches for early prediction of outcomes in patients with sepsis. The SHAP method could improve the interoperability of machine-learning models and help clinicians better understand the reasoning behind the outcome.

Collapse

Banat R, Daoud S, Taha MO. Ligand-based pharmacophore modeling and machine learning for the discovery of potent aurora A kinase inhibitory leads of novel chemotypes. Mol Divers 2024:10.1007/s11030-024-10814-y. [PMID: 38446372 DOI: 10.1007/s11030-024-10814-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2023] [Accepted: 01/19/2024] [Indexed: 03/07/2024]

Fu R, Hao X, Yu J, Wang D, Zhang J, Yu Z, Gao F, Zhou C. Machine learning-based prediction of sertraline concentration in patients with depression through therapeutic drug monitoring. Front Pharmacol 2024;15:1289673. [PMID: 38510645 PMCID: PMC10953499 DOI: 10.3389/fphar.2024.1289673] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2023] [Accepted: 02/21/2024] [Indexed: 03/22/2024] Open

Guo Y, Yang Y, Li R, Liao X, Li Y. Cadmium accumulation in tropical island paddy soils: From environment and health risk assessment to model prediction. J Hazard Mater 2024;465:133212. [PMID: 38101012 DOI: 10.1016/j.jhazmat.2023.133212] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/26/2023] [Revised: 11/22/2023] [Accepted: 12/07/2023] [Indexed: 12/17/2023]

Patel RH, Fan L, Kelly NR, Gelsey F, Hertzberg JK, Brnabic AJM. A machine learning-based algorithm to identify U-500R insulin candidates among adults with type 2 diabetes mellitus in US retrospective databases. Curr Med Res Opin 2024;40:367-375. [PMID: 38259227 DOI: 10.1080/03007995.2023.2293116] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/16/2023] [Accepted: 12/06/2023] [Indexed: 01/24/2024]

Abstract

OBJECTIVE

To develop a machine learning-based predictive algorithm to identify patients with type 2 diabetes mellitus (T2DM) who are candidates for initiation of U-500R insulin (U-500R).

METHODS

A retrospective cohort of patients with T2DM was used from a large US administrative claims and electronic health records (EHR) database affiliated with Optum. Predictor variables derived from the data were used to identify appropriate supervised machine learning models including least absolute shrinkage and selection operator (LASSO) and extreme gradient boosted (XGBoost) methods. Predictive performance was assessed using precision-recall (PR) and receiver operating characteristic (ROC) area under the curve (AUC). The clinical interpretation of the final model was supported by fitting the final set of variables from the LASSO and XGBoost models to a traditional logistic regression model. Model choice was determined by comparing Akaike Information Criterion (AIC), residual deviances, and scaled Brier scores.

RESULTS

Among 81,242 patients who met the study eligibility criteria, 577 initiated U-500R and were assigned to the positive class. Predictors of U-500R initiation included overweight/obesity, neuropathy, HbA1c ≥9% and 8%-9%, BUN 23.8 to <112 mg/dl, ALT 35.9-2056.2 U/L, no radiological chest exams, no GFR labs, and gait/mobility abnormalities. The best performing model was the LASSO model with an ROC AUC of 0.776 on the hold-out test set.

CONCLUSION

This study successfully developed and validated a machine learning-based algorithm to identify U-500R candidates among patients with T2DM. This may help health care providers and decision-makers to understand important characteristics of patients who could use U-500R therapies which in turn could support policies and guidelines for optimal patient management.

Collapse

Chen C, He Y, Ni Y, Tang Z, Zhang W. Identification of crosstalk genes relating to ECM-receptor interaction genes in MASH and DN using bioinformatics and machine learning. J Cell Mol Med 2024;28:e18156. [PMID: 38429902 PMCID: PMC10907849 DOI: 10.1111/jcmm.18156] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2023] [Revised: 01/01/2024] [Accepted: 01/12/2024] [Indexed: 03/03/2024] Open

Ayinde BO, Musa MR, Ayinde AAO. Application of machine learning models and landsat 8 data for estimating seasonal pm 2.5 concentrations. Environ Anal Health Toxicol 2024;39:e2024011-0. [PMID: 38631403 DOI: 10.5620/eaht.2024011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2023] [Accepted: 03/12/2024] [Indexed: 04/19/2024] Open

Barry KA, Manzali Y, Flouchi R, Balouki Y, Chelhi K, Elfar M. Exploring the use of association rules in random forest for predicting heart disease. Comput Methods Biomech Biomed Engin 2024;27:338-346. [PMID: 36877167 DOI: 10.1080/10255842.2023.2185477] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2023] [Revised: 02/07/2023] [Accepted: 02/16/2023] [Indexed: 03/07/2023]

Ren Y, Cui M, Zhou Y, Sun S, Guo F, Ma J, Han Z, Park J, Son Y, Khim J. Utilizing machine learning for reactive material selection and width design in permeable reactive barrier (PRB). Water Res 2024;251:121097. [PMID: 38218071 DOI: 10.1016/j.watres.2023.121097] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/01/2023] [Revised: 12/19/2023] [Accepted: 12/30/2023] [Indexed: 01/15/2024]

Liu F, Wu R, Liu S, Liu C, Su M. Assessing the determinants of corporate environmental investment: a machine learning approach. Environ Sci Pollut Res Int 2024;31:17401-17416. [PMID: 38337115 DOI: 10.1007/s11356-024-32158-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/07/2023] [Accepted: 01/19/2024] [Indexed: 02/12/2024]

Shopsowitz K, Lofroth J, Chan G, Kim J, Rana M, Brinkman R, Weng A, Medvedev N, Wang X. MAGIC-DR: An interpretable machine-learning guided approach for acute myeloid leukemia measurable residual disease analysis. Cytometry B Clin Cytom 2024. [PMID: 38415807 DOI: 10.1002/cyto.b.22168] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/10/2023] [Revised: 02/08/2024] [Accepted: 02/13/2024] [Indexed: 02/29/2024]

Saylam B, İncel ÖD. Multitask Learning for Mental Health: Depression, Anxiety, Stress (DAS) Using Wearables. Diagnostics (Basel) 2024;14:501. [PMID: 38472973 DOI: 10.3390/diagnostics14050501] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2024] [Revised: 02/23/2024] [Accepted: 02/24/2024] [Indexed: 03/14/2024] Open

Sharif S, Wunder C, Amendt J, Qamar A. Deciphering the impact of microenvironmental factors on cuticular hydrocarbon degradation in Lucilia sericata empty Puparia: Bridging ecological and forensic entomological perspectives using machine learning models. Sci Total Environ 2024;913:169719. [PMID: 38171456 DOI: 10.1016/j.scitotenv.2023.169719] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/11/2023] [Revised: 12/23/2023] [Accepted: 12/25/2023] [Indexed: 01/05/2024]

Abstract

Blow flies (Calliphoridae) play essential ecological roles in nutrient recycling by consuming decaying organic matter. They serve as valuable bioindicators in ecosystem management and forensic entomology, with their unique feeding behavior leading to the accumulation of environmental pollutants in their cuticular hydrocarbons (CHCs), making them potential indicators of exposure history. This study focuses on CHC degradation dynamics in empty puparia of Lucilia sericata under different environmental conditions for up to 90 days. The three distinct conditions were considered: outdoor-buried, outdoor-above-ground, and indoor environments. Five predominant CHCs, n-Pentacosane (n-C25), n-Hexacosane (n-C26), n-Heptacosane (n-C27), n-Octacosane (n-C28), and n-Nonacosane (n-C29), were analyzed using Gas Chromatography-Mass Spectrometry (GC-MS). The findings revealed variations in CHC concentrations over time, influenced by environmental factors, with significant differences at different time points. Correlation heatmap analysis indicated negative correlations between weathering time and certain CHCs, suggesting decreasing concentrations over time. Machine learning techniques Support Vector Machine (SVM), Multilayer Perceptron (MLP), and eXtreme Gradient Boosting (XGBoost) models explored the potential of CHCs as age indicators. SVM achieved an R-squared value of 0.991, demonstrating high accuracy in age estimation based on CHC concentrations. MLP also exhibited satisfactory performance in outdoor conditions, while SVM and MLP yielded unsatisfactory results indoors due to the lack of significant CHC variations. After comprehensive model selection and performance evaluations, it was found that the XGBoost model excelled in capturing the patterns in all three datasets. This study bridges the gap between baseline and ecological/forensic use of empty puparia, offering valuable insights into the potential of CHCs in environmental monitoring and investigations. Understanding CHCs' stability and degradation enhances blow flies' utility as bioindicators for pollutants and exposure history, benefiting environmental monitoring and forensic entomology.

Collapse

Xu A, Gao J, Sui X, Wang C, Shi Z. LiDAR Dynamic Target Detection Based on Multidimensional Features. Sensors (Basel) 2024;24:1369. [PMID: 38474905 DOI: 10.3390/s24051369] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/14/2024] [Revised: 02/17/2024] [Accepted: 02/18/2024] [Indexed: 03/14/2024]

Abstract

To address the limitations of LiDAR dynamic target detection methods, which often require heuristic thresholding, indirect computational assistance, supplementary sensor data, or postdetection, we propose an innovative method based on multidimensional features. Using the differences between the positions and geometric structures of point cloud clusters scanned by the same target in adjacent frame point clouds, the motion states of the point cloud clusters are comprehensively evaluated. To enable the automatic precision pairing of point cloud clusters from adjacent frames of the same target, a double registration algorithm is proposed for point cloud cluster centroids. The iterative closest point (ICP) algorithm is employed for approximate interframe pose estimation during coarse registration. The random sample consensus (RANSAC) and four-parameter transformation algorithms are employed to obtain precise interframe pose relations during fine registration. These processes standardize the coordinate systems of adjacent point clouds and facilitate the association of point cloud clusters from the same target. Based on the paired point cloud cluster, a classification feature system is used to construct the XGBoost decision tree. To enhance the XGBoost training efficiency, a Spearman's rank correlation coefficient-bidirectional search for a dimensionality reduction algorithm is proposed to expedite the optimal classification feature subset construction. After preliminary outcomes are generated by XGBoost, a double Boyer-Moore voting-sliding window algorithm is proposed to refine the final LiDAR dynamic target detection accuracy. To validate the efficacy and efficiency of our method in LiDAR dynamic target detection, an experimental platform is established. Real-world data are collected and pertinent experiments are designed. The experimental results illustrate the soundness of our method. The LiDAR dynamic target correct detection rate is 92.41%, the static target error detection rate is 1.43%, and the detection efficiency is 0.0299 s. Our method exhibits notable advantages over open-source comparative methods, achieving highly efficient and precise LiDAR dynamic target detection.

Collapse

Wang P, Wu S, Tian M, Liu K, Cong J, Zhang W, Wei B. A conformal regressor for predicting negative conversion time of Omicron patients. Med Biol Eng Comput 2024:10.1007/s11517-024-03029-8. [PMID: 38363486 DOI: 10.1007/s11517-024-03029-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2023] [Accepted: 01/09/2024] [Indexed: 02/17/2024]

Mehdary A, Chehri A, Jakimi A, Saadane R. Hyperparameter Optimization with Genetic Algorithms and XGBoost: A Step Forward in Smart Grid Fraud Detection. Sensors (Basel) 2024;24:1230. [PMID: 38400385 PMCID: PMC10892895 DOI: 10.3390/s24041230] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/07/2024] [Revised: 02/07/2024] [Accepted: 02/13/2024] [Indexed: 02/25/2024]

Cao C, Zhang T, Xin T. The effect of reading engagement on scientific literacy - an analysis based on the XGBoost method. Front Psychol 2024;15:1329724. [PMID: 38420178 PMCID: PMC10899671 DOI: 10.3389/fpsyg.2024.1329724] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2023] [Accepted: 01/22/2024] [Indexed: 03/02/2024] Open

Abstract

Scientific literacy is a key factor of personal competitiveness, and reading is the most common activity in daily learning life, and playing the influence of reading on individuals day by day is the most convenient way to improve the level of scientific literacy of all people. Reading engagement is one of the important student characteristics related to reading literacy, which is highly malleable and is jointly reflected by behavioral, cognitive, and affective engagement, and it is of theoretical and practical significance to explore the relationship between reading engagement and scientific literacy using reading engagement as an entry point. In this study, we used PISA2018 data from China to explore the relationship between reading engagement and scientific literacy with a sample of 15-year-old students in mainland China. 36 variables related to reading engagement and background variables (gender, grade, and socioeconomic and cultural status of the family) were selected from the questionnaire as the independent variables, and the score of the Scientific Literacy Assessment (SLA) was taken as the outcome variable, and supervised machine learning method, the XGBoost algorithm, to construct the model. The dataset is randomly divided into training set and test set to optimize the model, which can verify that the obtained model has good fitting degree and generalization ability. Meanwhile, global and local personalized interpretation is done by introducing the SHAP value, a cutting-edge machine model interpretation method. It is found that among the three major components of reading engagement, cognitive engagement is the more influential factor, and students with high reading cognitive engagement level are more likely to get high scores in scientific literacy assessment, which is relatively dominant in the model of this study. On the other hand, this study verifies the feasibility of the current popular machine learning model, i.e., XGBoost, in a large-scale international education assessment program, with a better model adaptability and conditions for global and local interpretation.

Collapse

Radhakrishnan BL, Ezra K, Jebadurai IJ, Selvakumar I, Karthikeyan P. An Autonomous Sleep-Stage Detection Technique in Disruptive Technology Environment. Sensors (Basel) 2024;24:1197. [PMID: 38400354 PMCID: PMC10892786 DOI: 10.3390/s24041197] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/21/2023] [Revised: 02/07/2024] [Accepted: 02/08/2024] [Indexed: 02/25/2024]

Navratil G, Giannopoulos I. Classifying Motorcyclist Behaviour with XGBoost Based on IMU Data. Sensors (Basel) 2024;24:1042. [PMID: 38339759 PMCID: PMC10857319 DOI: 10.3390/s24031042] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/18/2023] [Revised: 01/31/2024] [Accepted: 02/01/2024] [Indexed: 02/12/2024]

Zheng Z, Liang L, Luo X, Chen J, Lin M, Wang G, Xue C. Diagnosing and tracking depression based on eye movement in response to virtual reality. Front Psychiatry 2024;15:1280935. [PMID: 38374979 PMCID: PMC10875075 DOI: 10.3389/fpsyt.2024.1280935] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/21/2023] [Accepted: 01/16/2024] [Indexed: 02/21/2024] Open

Abstract

Introduction

Depression is a prevalent mental illness that is primarily diagnosed using psychological and behavioral assessments. However, these assessments lack objective and quantitative indices, making rapid and objective detection challenging. In this study, we propose a novel method for depression detection based on eye movement data captured in response to virtual reality (VR).

Methods

Eye movement data was collected and used to establish high-performance classification and prediction models. Four machine learning algorithms, namely eXtreme Gradient Boosting (XGBoost), multilayer perceptron (MLP), Support Vector Machine (SVM), and Random Forest, were employed. The models were evaluated using five-fold cross-validation, and performance metrics including accuracy, precision, recall, area under the curve (AUC), and F1-score were assessed. The predicted error for the Patient Health Questionnaire-9 (PHQ-9) score was also determined.

Results

The XGBoost model achieved a mean accuracy of 76%, precision of 94%, recall of 73%, and AUC of 82%, with an F1-score of 78%. The MLP model achieved a classification accuracy of 86%, precision of 96%, recall of 91%, and AUC of 86%, with an F1-score of 92%. The predicted error for the PHQ-9 score ranged from -0.6 to 0.6.To investigate the role of computerized cognitive behavioral therapy (CCBT) in treating depression, participants were divided into intervention and control groups. The intervention group received CCBT, while the control group received no treatment. After five CCBT sessions, significant changes were observed in the eye movement indices of fixation and saccade, as well as in the PHQ-9 scores. These two indices played significant roles in the predictive model, indicating their potential as biomarkers for detecting depression symptoms.

Discussion

The results suggest that eye movement indices obtained using a VR eye tracker can serve as useful biomarkers for detecting depression symptoms. Specifically, the fixation and saccade indices showed promise in predicting depression. Furthermore, CCBT demonstrated effectiveness in treating depression, as evidenced by the observed changes in eye movement indices and PHQ-9 scores. In conclusion, this study presents a novel approach for depression detection using eye movement data captured in VR. The findings highlight the potential of eye movement indices as biomarkers and underscore the effectiveness of CCBT in treating depression.

Collapse

Joe H, Kim HG. Multi-label classification with XGBoost for metabolic pathway prediction. BMC Bioinformatics 2024;25:52. [PMID: 38297220 PMCID: PMC10832249 DOI: 10.1186/s12859-024-05666-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2023] [Accepted: 01/22/2024] [Indexed: 02/02/2024] Open

Lei L, Zhang L, Han Z, Chen Q, Liao P, Wu D, Tai J, Xie B, Su Y. Advancing chronic toxicity risk assessment in freshwater ecology by molecular characterization-based machine learning. Environ Pollut 2024;342:123093. [PMID: 38072027 DOI: 10.1016/j.envpol.2023.123093] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/04/2023] [Revised: 11/30/2023] [Accepted: 12/02/2023] [Indexed: 01/26/2024]

Abstract

The continuously increased production of various chemicals and their release into environments have raised potential negative effects on ecological health. However, traditional labor-intensive assessment methods cannot effectively and rapidly evaluate these hazards, especially for chronic risk. In this study, machine learning (ML) was employed to construct quantitative structure-activity relationship (QSAR) models, enabling the prediction of chronic toxicity to aquatic organisms by leveraging the molecular characteristics of pollutants, namely, the molecular descriptors, fingerprints, and graphs. The limited dataset size hindered the notable advantages of the graph attention network (GAT) model for the molecular graphs. Considering computational efficiency and performance (R2 = 0.78; RMSE = 0.77), XGBoost (XGB) was used for reliable QSAR-ML models predicting chronic toxicity using small- or medium-sized tabular data and the molecular descriptors. Further kernel density estimation analysis confirmed the high accuracy of the model for pollutant concentrations ranging from 10-3 to 102 mg/L, effectively aligning with most environmental scenarios. Model interpretation showed SlogP and exposure duration as the primary influential factors. SlogP, representing the distribution coefficient of a molecule between lipophilic and hydrophilic environments, had a negative effect on the toxicity outcomes. Additionally, the exposure duration played a crucial role in determining the chronic toxicity. Finally, the chronic toxicity data of bisphenol A validated the robustness and reliability of the model established in this research. Our study provided a robust and feasible methodology for chronic ecological risk evaluation of various types of pollutants and could facilitate and increase the use of ML applications in environmental fields.

Collapse

Affiliation(s)

Lang Lei Shanghai Engineering Research Center of Biotransformation of Organic Solid Waste, School of Ecological and Environmental Sciences, East China Normal University, Shanghai, 200241, China
Liangmao Zhang Shanghai Engineering Research Center of Biotransformation of Organic Solid Waste, School of Ecological and Environmental Sciences, East China Normal University, Shanghai, 200241, China
Zhibang Han Shanghai Engineering Research Center of Biotransformation of Organic Solid Waste, School of Ecological and Environmental Sciences, East China Normal University, Shanghai, 200241, China
Qirui Chen Shanghai Engineering Research Center of Biotransformation of Organic Solid Waste, School of Ecological and Environmental Sciences, East China Normal University, Shanghai, 200241, China
Pengcheng Liao Shanghai Engineering Research Center of Biotransformation of Organic Solid Waste, School of Ecological and Environmental Sciences, East China Normal University, Shanghai, 200241, China
Dong Wu Shanghai Engineering Research Center of Biotransformation of Organic Solid Waste, School of Ecological and Environmental Sciences, East China Normal University, Shanghai, 200241, China; Chongqing Key Laboratory of Precision Optics, Chongqing Institute of East China Normal University, Chongqing, 401120, China; Shanghai Institute of Pollution Control and Ecological Security, Shanghai, 200092, China
Jun Tai Shanghai Environmental Sanitation Engineering Design Institute Co., Ltd., Shanghai, 200232, China
Bing Xie Shanghai Engineering Research Center of Biotransformation of Organic Solid Waste, School of Ecological and Environmental Sciences, East China Normal University, Shanghai, 200241, China; Shanghai Institute of Pollution Control and Ecological Security, Shanghai, 200092, China
Yinglong Su Shanghai Engineering Research Center of Biotransformation of Organic Solid Waste, School of Ecological and Environmental Sciences, East China Normal University, Shanghai, 200241, China; Chongqing Key Laboratory of Precision Optics, Chongqing Institute of East China Normal University, Chongqing, 401120, China; Shanghai Institute of Pollution Control and Ecological Security, Shanghai, 200092, China.

Collapse

Tao Q, Wu L, An J, Liu Z, Zhang K, Zhou L, Zhang X. Proteomic analysis of human aqueous humor from fuchs uveitis syndrome. Exp Eye Res 2024;239:109752. [PMID: 38123010 DOI: 10.1016/j.exer.2023.109752] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2023] [Revised: 11/25/2023] [Accepted: 12/11/2023] [Indexed: 12/23/2023]

Abstract

Fuchs uveitis syndrome (FUS) is a commonly misdiagnosed uveitis syndrome often presenting as an asymptomatic mild inflammatory condition until complications arise. The diagnosis of this disease remains clinical because of the lack of specific laboratory tests. The aqueous humor (AH) is a complex fluid containing nutrients and metabolic wastes from the eye. Changes in the AH protein provide important information for diagnosing intraocular diseases. This study aimed to analyze the proteomic profile of AH in individuals diagnosed with FUS and to identify potential biomarkers of the disease. We used liquid chromatography-tandem mass spectrometry-based proteomic methods to evaluate the AH protein profiles of all 37 samples, comprising 15 patients with FUS, six patients with Posner-Schlossman syndrome (PSS), and 16 patients with age-related cataract. A total of 538 proteins were identified from a comprehensive spectral library of 634 proteins. Subsequent differential expression analysis, enrichment analysis, and construction of key sub-networks revealed that the inflammatory response, complement activation and hypoxia might be crucial in mediating the process of FUS. The hypoxia inducible factor-1 may serve as a key regulator and therapeutic target. Additionally, the innate and adaptive immune responses are considered dominant in the patients with FUS. A diagnostic model was constructed using machine-learning algorithm to classify FUS, PSS, and normal controls. Two proteins, complement C1q subcomponent subunit B and secretogranin-1, were found to have the highest scores by the Extreme Gradient Boosting, suggesting their potential utility as a biomarker panel. Furthermore, these two proteins as biomarkers were validated in a cohort of 18 patients using high resolution multiple reaction monitoring assays. Therefore, this study contributes to advancing of the current knowledge of FUS pathogenesis and promotes the development of effective diagnostic strategies.

Collapse

Alabi RO, Almangush A, Elmusrati M, Leivo I, Mäkitie AA. Interpretable machine learning model for prediction of overall survival in laryngeal cancer. Acta Otolaryngol 2024:1-7. [PMID: 38279817 DOI: 10.1080/00016489.2023.2301648] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2023] [Accepted: 12/21/2023] [Indexed: 01/29/2024]

Liu SH, Ting CE, Wang JJ, Chang CJ, Chen W, Sharma AK. Estimation of Gait Parameters for Adults with Surface Electromyogram Based on Machine Learning Models. Sensors (Basel) 2024;24:734. [PMID: 38339451 PMCID: PMC10857519 DOI: 10.3390/s24030734] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/16/2023] [Revised: 01/18/2024] [Accepted: 01/22/2024] [Indexed: 02/12/2024]