Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Wen X, Xie Y, Wu L, Jiang L. Quantifying and comparing the effects of key risk factors on various types of roadway segment crashes with LightGBM and SHAP. Accid Anal Prev 2021;159:106261. [PMID: 34182322 DOI: 10.1016/j.aap.2021.106261] [Citation(s) in RCA: 28] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/12/2021] [Revised: 05/25/2021] [Accepted: 06/14/2021] [Indexed: 06/13/2023]

For:	Wen X, Xie Y, Wu L, Jiang L. Quantifying and comparing the effects of key risk factors on various types of roadway segment crashes with LightGBM and SHAP. Accid Anal Prev 2021;159:106261. [PMID: 34182322 DOI: 10.1016/j.aap.2021.106261] [Citation(s) in RCA: 28] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/12/2021] [Revised: 05/25/2021] [Accepted: 06/14/2021] [Indexed: 06/13/2023]

Number

Cited by Other Article(s)

Long Y, Xu X, Chen J, Liu S, Li J, Dong Y. An explainable predictive model of direct pulp capping in carious mature permanent teeth. J Dent 2024;149:105269. [PMID: 39094974 DOI: 10.1016/j.jdent.2024.105269] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2024] [Revised: 07/24/2024] [Accepted: 07/26/2024] [Indexed: 08/04/2024] Open

Affiliation(s)

Yunzi Long Department of Cariology and Endodontology, Peking University School and Hospital of Stomatology & National Center for Stomatology & National Clinical Research Center for Oral Diseases & National Engineering Research Center of Oral Biomaterials and Digital Medical Devices & Beijing Key Laboratory of Digital Stomatology & NHC Key Laboratory of Digital Stomatology & NMPA Key Laboratory for Dental Materials, Beijing 100081, PR China; Department of General Dentistry II, Peking University School and Hospital of Stomatology, Beijing 100081, PR China
Xiaowei Xu Institute of Medical Information/ Library, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing 100020, China; College of Biomedical Engineering & Instrument Science, Zhejiang University, Hangzhou 310058, China
Jiaqi Chen Department of Cariology and Endodontology, Peking University School and Hospital of Stomatology & National Center for Stomatology & National Clinical Research Center for Oral Diseases & National Engineering Research Center of Oral Biomaterials and Digital Medical Devices & Beijing Key Laboratory of Digital Stomatology & NHC Key Laboratory of Digital Stomatology & NMPA Key Laboratory for Dental Materials, Beijing 100081, PR China
Siyi Liu Department of Cariology and Endodontology, Peking University School and Hospital of Stomatology & National Center for Stomatology & National Clinical Research Center for Oral Diseases & National Engineering Research Center of Oral Biomaterials and Digital Medical Devices & Beijing Key Laboratory of Digital Stomatology & NHC Key Laboratory of Digital Stomatology & NMPA Key Laboratory for Dental Materials, Beijing 100081, PR China.
Jiao Li Institute of Medical Information/ Library, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing 100020, China.
Yanmei Dong Department of Cariology and Endodontology, Peking University School and Hospital of Stomatology & National Center for Stomatology & National Clinical Research Center for Oral Diseases & National Engineering Research Center of Oral Biomaterials and Digital Medical Devices & Beijing Key Laboratory of Digital Stomatology & NHC Key Laboratory of Digital Stomatology & NMPA Key Laboratory for Dental Materials, Beijing 100081, PR China

Collapse

Samerei SA, Aghabayk K. Interpretable machine learning for evaluating risk factors of freeway crash severity. Int J Inj Contr Saf Promot 2024;31:534-550. [PMID: 38768184 DOI: 10.1080/17457300.2024.2351972] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2023] [Revised: 04/27/2024] [Accepted: 05/02/2024] [Indexed: 05/22/2024]

Yue H. Investigating the influence of streetscape environmental characteristics on pedestrian crashes at intersections using street view images and explainable machine learning. ACCIDENT; ANALYSIS AND PREVENTION 2024;205:107693. [PMID: 38955107 DOI: 10.1016/j.aap.2024.107693] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/17/2024] [Revised: 06/05/2024] [Accepted: 06/24/2024] [Indexed: 07/04/2024]

Abstract

Examining the relationship between streetscape features and road traffic accidents is pivotal for enhancing roadway safety. While previous studies have primarily focused on the influence of street design characteristics, sociodemographic features, and land use features on crash occurrence, the impact of streetscape features on pedestrian crashes has not been thoroughly investigated. Furthermore, while machine learning models demonstrate high accuracy in prediction and are increasingly utilized in traffic safety research, understanding the prediction results poses challenges. To address these gaps, this study extracts streetscape environment characteristics from street view images (SVIs) using a combination of semantic segmentation and object detection deep learning networks. These characteristics are then incorporated into the eXtreme Gradient Boosting (XGBoost) algorithm, along with a set of control variables, to model the occurrence of pedestrian crashes at intersections. Subsequently, the SHapley Additive exPlanations (SHAP) method is integrated with XGBoost to establish an interpretable framework for exploring the association between pedestrian crash occurrence and the surrounding streetscape built environment. The results are interpreted from global, local, and regional perspectives. The findings indicate that, from a global perspective, traffic volume and commercial land use are significant contributors to pedestrian-vehicle collisions at intersections, while road, person, and vehicle elements extracted from SVIs are associated with higher risks of pedestrian crash onset. At a local level, the XGBoost-SHAP framework enables quantification of features' local contributions for individual intersections, revealing spatial heterogeneity in factors influencing pedestrian crashes. From a regional perspective, similar intersections can be grouped to define geographical regions, facilitating the formulation of spatially responsive strategies for distinct regions to reduce traffic accidents. This approach can potentially enhance the quality and accuracy of local policy making. These findings underscore the underlying relationship between streetscape-level environmental characteristics and vehicle-pedestrian crashes. The integration of SVIs and deep learning techniques offers a visually descriptive portrayal of the streetscape environment at locations where traffic crashes occur at eye level. The proposed framework not only achieves excellent prediction performance but also enhances understanding of traffic crash occurrences, offering guidance for optimizing traffic accident prevention and treatment programs.

Collapse

Wang S, Liu Y, Wang W, Zhao G, Liang H. Interpretable machine learning guided by physical mechanisms reveals drivers of runoff under dynamic land use changes. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2024;367:121978. [PMID: 39067339 DOI: 10.1016/j.jenvman.2024.121978] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/01/2024] [Revised: 06/14/2024] [Accepted: 07/17/2024] [Indexed: 07/30/2024]

Abstract

Human activities continuously impact water balances and cycling in watersheds, making it essential to accurately identify the responses of runoff to dynamic changes in land use types. Although machine learning models demonstrate promise in capturing the intricate interplay between hydrological factors, their "black box" nature makes it challenging to identify the dynamic drivers of runoff. To overcome this challenge, we employed an interpretable machine learning method to inversely deduce the dynamic determinants within hydrological processes. In this study, we analyzed land use changes in the Ningxia section of the middle Yellow River across four periods, laying the foundation for revealing how these changes affect runoff. The sub-watershed attributes and meteorological characteristics generated by the Soil and Water Assessment Tool (SWAT) model were used as input variables of the Extreme Gradient Boosting (XGBoost) model to simulate substantial sub-watershed rainfall runoff in the region. The XGBoost was interpreted using the SHapley Additive exPlanations (SHAP) to identify the dynamic responses of runoff to the land use changes over different periods. The results revealed increasingly frequent interchanges between the land use types in the study area. The XGBoost effectively captured the characteristics of the hydrological processes in the SWAT-derived sub-watersheds. The SHAP analysis results demonstrated that the promoting effect of agricultural land (AGRL) on runoff gradually weakens, while forests (FRST) continuously strengthen their restraining effect on runoff. Relevant land use policies provide empirical support for these findings. Furthermore, the interaction between meteorological variables and land use impacts the runoff generation mechanism and exhibits a threshold effect, with the thresholds for relative humidity (RH), maximum temperature (MaxT), and minimum temperature (MinT) determined to be 0.8, 25 °C, and 15 °C, respectively. This reverse deduction method can reveal hydrological patterns and the mechanisms of interaction between variables, helping to effectively addressing constantly changing human activities and meteorological conditions.

Collapse

Li Y, Huang T, Lee HF, Heo Y, Ho KF, Yim SHL. Integrating Doppler LiDAR and machine learning into land-use regression model for assessing contribution of vertical atmospheric processes to urban PM_2.5 pollution. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024;952:175632. [PMID: 39168320 DOI: 10.1016/j.scitotenv.2024.175632] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/16/2024] [Revised: 08/06/2024] [Accepted: 08/17/2024] [Indexed: 08/23/2024]

Zan J, Dong X, Yang H, Yan J, He Z, Tian J, Zhang Y. Application of the Unbalanced Ensemble Algorithm for Prognostic Prediction Outcomes of All-Cause Mortality in Coronary Heart Disease Patients Comorbid with Hypertension. Risk Manag Healthc Policy 2024;17:1921-1936. [PMID: 39135612 PMCID: PMC11317517 DOI: 10.2147/rmhp.s472398] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2024] [Accepted: 07/24/2024] [Indexed: 08/15/2024] Open

Abstract

Purpose

This study sought to develop an unbalanced-ensemble model that could accurately predict death outcomes of patients with comorbid coronary heart disease (CHD) and hypertension and evaluate the factors contributing to death.

Patients and Methods

Medical records of 1058 patients with coronary heart disease combined with hypertension and excluding those acute coronary syndrome were collected. Patients were followed-up at the first, third, sixth, and twelfth months after discharge to record death events. Follow-up ended two years after discharge. Patients were divided into survival and nonsurvival groups. According to medical records, gender, smoking, drinking, COPD, cerebral stroke, diabetes, hyperhomocysteinemia, heart failure and renal insufficiency of the two groups were sorted and compared and other influencing factors of the two groups, feature selection was carried out to construct models. Owing to data unbalance, we developed four unbalanced-ensemble prediction models based on Balanced Random Forest (BRF), EasyEnsemble, RUSBoost, SMOTEBoost and the two base classification algorithms based on AdaBoost and Logistic. Each model was optimised using hyperparameters based on GridSearchCV and evaluated using area under the curve (AUC), sensitivity, recall, Brier score, and geometric mean (G-mean). Additionally, to understand the influence of variables on model performance, we constructed a SHapley Additive explanation (SHAP) model based on the optimal model.

Results

There were significant differences in age, heart rate, COPD, cerebral stroke, heart failure and renal insufficiency in the nonsurvival group compared with the survival group. Among all models, BRF yielded the highest AUC (0.810; 95% CI, 0.778-0.839), sensitivity (0.990; 95% CI, 0.981-1.000), recall (0.990; 95% CI, 0.981-1.000), and G-mean (0.806; 95% CI, 0.778-0.827), and the lowest Brier score (0.181; 95% CI, 0.178-0.185). Therefore, we identified BRF as the optimal model. Furthermore, red blood cell count (RBC), body mass index (BMI), and lactate dehydrogenase were found to be important mortality-associated risk factors.

Conclusion

BRF combined with advanced machine learning methods and SHAP is highly effective and accurately predicts mortality in patients with CHD comorbid with hypertension. This model has the potential to assist clinicians in modifying treatment strategies to improve patient outcomes.

Collapse

Yasin P, Yimit Y, Cai X, Aimaiti A, Sheng W, Mamat M, Nijiati M. Machine learning-enabled prediction of prolonged length of stay in hospital after surgery for tuberculosis spondylitis patients with unbalanced data: a novel approach using explainable artificial intelligence (XAI). Eur J Med Res 2024;29:383. [PMID: 39054495 PMCID: PMC11270948 DOI: 10.1186/s40001-024-01988-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2023] [Accepted: 07/18/2024] [Indexed: 07/27/2024] Open

Abstract

BACKGROUND

Tuberculosis spondylitis (TS), commonly known as Pott's disease, is a severe type of skeletal tuberculosis that typically requires surgical treatment. However, this treatment option has led to an increase in healthcare costs due to prolonged hospital stays (PLOS). Therefore, identifying risk factors associated with extended PLOS is necessary. In this research, we intended to develop an interpretable machine learning model that could predict extended PLOS, which can provide valuable insights for treatments and a web-based application was implemented.

METHODS

We obtained patient data from the spine surgery department at our hospital. Extended postoperative length of stay (PLOS) refers to a hospitalization duration equal to or exceeding the 75th percentile following spine surgery. To identify relevant variables, we employed several approaches, such as the least absolute shrinkage and selection operator (LASSO), recursive feature elimination (RFE) based on support vector machine classification (SVC), correlation analysis, and permutation importance value. Several models using implemented and some of them are ensembled using soft voting techniques. Models were constructed using grid search with nested cross-validation. The performance of each algorithm was assessed through various metrics, including the AUC value (area under the curve of receiver operating characteristics) and the Brier Score. Model interpretation involved utilizing methods such as Shapley additive explanations (SHAP), the Gini Impurity Index, permutation importance, and local interpretable model-agnostic explanations (LIME). Furthermore, to facilitate the practical application of the model, a web-based interface was developed and deployed.

RESULTS

The study included a cohort of 580 patients and 11 features include (CRP, transfusions, infusion volume, blood loss, X-ray bone bridge, X-ray osteophyte, CT-vertebral destruction, CT-paravertebral abscess, MRI-paravertebral abscess, MRI-epidural abscess, postoperative drainage) were selected. Most of the classifiers showed better performance, where the XGBoost model has a higher AUC value (0.86) and lower Brier Score (0.126). The XGBoost model was chosen as the optimal model. The results obtained from the calibration and decision curve analysis (DCA) plots demonstrate that XGBoost has achieved promising performance. After conducting tenfold cross-validation, the XGBoost model demonstrated a mean AUC of 0.85 ± 0.09. SHAP and LIME were used to display the variables' contributions to the predicted value. The stacked bar plots indicated that infusion volume was the primary contributor, as determined by Gini, permutation importance (PFI), and the LIME algorithm.

CONCLUSIONS

Our methods not only effectively predicted extended PLOS but also identified risk factors that can be utilized for future treatments. The XGBoost model developed in this study is easily accessible through the deployed web application and can aid in clinical research.

Collapse

Samerei SA, Aghabayk K. Analyzing the transition from two-vehicle collisions to chain reaction crashes: A hybrid approach using random parameters logit model, interpretable machine learning, and clustering. ACCIDENT; ANALYSIS AND PREVENTION 2024;202:107603. [PMID: 38701559 DOI: 10.1016/j.aap.2024.107603] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/31/2024] [Revised: 04/02/2024] [Accepted: 04/27/2024] [Indexed: 05/05/2024]

Abstract

Chain reaction crashes (CRC) begin with a two-vehicle collision and rapidly intensify as more vehicles get directly involved. CRCs result in more extensive damage compared to two-vehicle crashes and understanding the progression of a two-vehicle collision into a CRC can unveil preventive strategies that have received less attention. In this study, to align with recent research direction and overcome the limitations of econometric and machine learning (ML) modelling, a hybrid approach is adopted. Moreover, to tackle the existing challenges in crash analysis, addressing unobserved heterogeneity in ML, and exploring random parameter effects and interactions more precisely, a new approach is proposed. To achieve this, a hybrid random parameter logit model and interpretable ML, joint with prior latent class clustering is implemented. Notably, this is the first attempt at using a clustering with hybrid modeling. The significant risk factors, their critical values, distinct effects, and interactions are interpreted using both marginal effects and the SHAP (SHapley Additive exPlanations) method across clusters. This study utilizes crash, traffic, and geometric data from eleven suburban freeways in Iran collected over a 5-year period. The overall results indicate an increased risk of CRC in congested traffic, higher traffic variation, and on horizontal curves combined with longitudinal slopes. Some parameters exhibit distinct or fluctuating effects, which are discussed across different conditions or considering interactions. For instance, during nighttime, heightened congestion on 2-lane freeways, increased traffic variation in less congested conditions, and adverse weather combined with horizontal curves and slopes pose risks. During daytime, increased traffic variation within highly congested sections, higher proportion of heavy vehicle traffic in moderately congested sections, and two lanes in each direction coupled with curves, elevate the levels of risk. The results of this study provide a better understanding of risk factors impact across different conditions, which are usable for policy makers.

Collapse

Zhu Y, Qian Y, Xu J, Hu W. Young novice drivers' road crash injuries and contributing factors: A crash data investigation. TRAFFIC INJURY PREVENTION 2024:1-8. [PMID: 38917367 DOI: 10.1080/15389588.2024.2367504] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/16/2023] [Accepted: 06/10/2024] [Indexed: 06/27/2024]

Chen H, Yang F, Duan Y, Yang L, Li J. A novel higher performance nomogram based on explainable machine learning for predicting mortality risk in stroke patients within 30 days based on clinical features on the first day ICU admission. BMC Med Inform Decis Mak 2024;24:161. [PMID: 38849903 PMCID: PMC11161998 DOI: 10.1186/s12911-024-02547-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2023] [Accepted: 05/21/2024] [Indexed: 06/09/2024] Open

Abstract

BACKGROUND

This study aimed to develop a higher performance nomogram based on explainable machine learning methods, and to predict the risk of death of stroke patients within 30 days based on clinical characteristics on the first day of intensive care units (ICU) admission.

METHODS

Data relating to stroke patients were extracted from the Medical Information Marketplace of the Intensive Care (MIMIC) IV and III database. The LightGBM machine learning approach together with Shapely additive explanations (termed as explain machine learning, EML) was used to select clinical features and define cut-off points for the selected features. These selected features and cut-off points were then evaluated using the Cox proportional hazards regression model and Kaplan-Meier survival curves. Finally, logistic regression-based nomograms for predicting 30-day mortality of stroke patients were constructed using original variables and variables dichotomized by cut-off points, respectively. The performance of two nomograms were evaluated in overall and individual dimension.

RESULTS

A total of 2982 stroke patients and 64 clinical features were included, and the 30-day mortality rate was 23.6% in the MIMIC-IV datasets. 10 variables ("sofa (sepsis-related organ failure assessment)", "minimum glucose", "maximum sodium", "age", "mean spo2 (blood oxygen saturation)", "maximum temperature", "maximum heart rate", "minimum bun (blood urea nitrogen)", "minimum wbc (white blood cells)" and "charlson comorbidity index") and respective cut-off points were defined from the EML. In the Cox proportional hazards regression model (Cox regression) and Kaplan-Meier survival curves, after grouping stroke patients according to the cut-off point of each variable, patients belonging to the high-risk subgroup were associated with higher 30-day mortality than those in the low-risk subgroup. The evaluation of nomograms found that the EML-based nomogram not only outperformed the conventional nomogram in NIR (net reclassification index), brier score and clinical net benefits in overall dimension, but also significant improved in individual dimension especially for low "maximum temperature" patients.

CONCLUSIONS

The 10 selected first-day ICU admission clinical features require greater attention for stroke patients. And the nomogram based on explainable machine learning will have greater clinical application.

Collapse

Xue H, Guo P, Li Y, Ma J. Integrating visual factors in crash rate analysis at Intersections: An AutoML and SHAP approach towards cycling safety. ACCIDENT; ANALYSIS AND PREVENTION 2024;200:107544. [PMID: 38493612 DOI: 10.1016/j.aap.2024.107544] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/11/2023] [Revised: 02/18/2024] [Accepted: 03/09/2024] [Indexed: 03/19/2024]

Yoo JW, Park J, Park H. Enhancing safety of construction workers in Korea: an integrated text mining and machine learning framework for predicting accident types. Int J Inj Contr Saf Promot 2024;31:203-215. [PMID: 38164519 DOI: 10.1080/17457300.2023.2300424] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2023] [Accepted: 12/24/2023] [Indexed: 01/03/2024]

Yu Q, Shi C, Bai Y, Zhang J, Lu Z, Xu Y, Li W, Liu C, Soomro SEH, Tian L, Hu C. Interpretable baseflow segmentation and prediction based on numerical experiments and deep learning. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2024;360:121089. [PMID: 38733842 DOI: 10.1016/j.jenvman.2024.121089] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/03/2024] [Revised: 04/11/2024] [Accepted: 05/03/2024] [Indexed: 05/13/2024]

Abstract

Baseflow is a crucial water source in the inland river basins of high-cold mountainous region, playing a significant role in maintaining runoff stability. It is challenging to select the most suitable baseflow separation method in data-scarce high-cold mountainous region and to evaluate effects of climate factors and underlying surface changes on baseflow variability and seasonal distribution characteristics. Here we attempt to address how meteorological factors and underlying surface changes affect baseflow using the Grey Wolf Optimizer Digital Filter Method (GWO-DFM) for rapid baseflow separation and the Long Short-Term Memory (LSTM) neural network model for baseflow prediction, clarifying interpretability of the LSTM model in baseflow forecasting. The proposed method was successfully implemented using a 63-year time series (1958-2020) of flow data from the Tai Lan River (TLR) basin in the high-cold mountainous region, along with 21 years of ERA5-land meteorological data and MODIS data (2000-2020). The results indicate that: (1) GWO-DFM can rapidly identify the optimal filtering parameters. It employs the arithmetic average of three methods, namely Chapman, Chapman-Maxwell and Eckhardt filter, as the best baseflow separation approach for the TLR basin. Additionally, the baseflow significantly increases after the second mutation of the baseflow rate. (2) Baseflow sources are mainly influenced by precipitation infiltration, glacier frozen soil layers, and seasonal ponding. (3) Solar radiation, temperature, precipitation, and NDVI are the primary factors influencing baseflow changes, with Nash-Sutcliffe efficiency coefficients exceeding 0.78 in both the LSTM model training and prediction periods. (4) Changes in baseflow are most influenced by solar radiation, temperature, and NDVI. This study systematically analyzes the changes in baseflow and response mechanisms in high-cold mountainous region, contributing to the management of water resources in mountainous basins under changing environmental conditions.

Collapse

Khattak A, Chan PW, Chen F, Peng H. Interpretable ensemble imbalance learning strategies for the risk assessment of severe-low-level wind shear based on LiDAR and PIREPs. RISK ANALYSIS : AN OFFICIAL PUBLICATION OF THE SOCIETY FOR RISK ANALYSIS 2024;44:1084-1102. [PMID: 37700727 DOI: 10.1111/risa.14215] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/02/2022] [Revised: 06/05/2023] [Accepted: 08/22/2023] [Indexed: 09/14/2023]

Matin M, Dehghanian A, Dastranj M, Darijani H. Explainable artificial intelligence modeling of internal arc in a medium voltage switchgear based on different CFD simulations. Heliyon 2024;10:e29594. [PMID: 38665570 PMCID: PMC11044042 DOI: 10.1016/j.heliyon.2024.e29594] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2024] [Revised: 03/17/2024] [Accepted: 04/10/2024] [Indexed: 04/28/2024] Open

Yu X, Ma J, Tang Y, Yang T, Jiang F. Can we trust our eyes? Interpreting the misperception of road safety from street view images and deep learning. ACCIDENT; ANALYSIS AND PREVENTION 2024;197:107455. [PMID: 38218132 DOI: 10.1016/j.aap.2023.107455] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/27/2023] [Revised: 12/20/2023] [Accepted: 12/31/2023] [Indexed: 01/15/2024]

Yao S, Wu Q, Kang Q, Chen YW, Lu Y. An interpretable XGBoost-based approach for Arctic navigation risk assessment. RISK ANALYSIS : AN OFFICIAL PUBLICATION OF THE SOCIETY FOR RISK ANALYSIS 2024;44:459-476. [PMID: 37330273 DOI: 10.1111/risa.14175] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/21/2022] [Revised: 03/14/2023] [Accepted: 05/07/2023] [Indexed: 06/19/2023]

Wang X, Zhang X, Pei Y. A systematic approach to macro-level safety assessment and contributing factors analysis considering traffic crashes and violations. ACCIDENT; ANALYSIS AND PREVENTION 2024;194:107323. [PMID: 37864889 DOI: 10.1016/j.aap.2023.107323] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/18/2023] [Revised: 09/03/2023] [Accepted: 09/17/2023] [Indexed: 10/23/2023]

Abstract

During rapid urbanization and increase in motorization, it becomes particularly important to understand the relationships between traffic safety and risk factors in order to provide targeted improvements and policy recommendations. Violations and police enforcement are key variables, but the endogenous relationship between crashes and violations has made these variables unreliable and has limited their use. To manage this problem, this study developed a systematic approach for the joint modeling of crashes and violations to identify crash and violation hotspots and examine the mechanisms underlying macro-level contributing factors. Socio-economic, road network, public facility, traffic enforcement, and land use intensity data from 115 towns in Suzhou, China, were collected as independent variables. A bivariate negative binomial spatial conditional autoregressive model (BNB-CAR) and the potential for safety improvement (PSI) method were adopted to identify crash-prone and violation-prone areas, and an interpretable machine learning framework was applied to explore the factors' effects by area. Results showed that the proposed framework was able to accurately identify problem areas and quantify the impact of key factors, which, in Suzhou, were the number of traffic police and their daily patrol time. Considering such enforcement-related information provided important insights into reducing crash and violation frequency; for example, keeping the number of traffic police and daily patrol time under certain thresholds (number of police lower than 11 and patrol time lower than 2.3 h in this sample) was as effective as increasing these numbers for reducing the probability of high-crash and high-violation areas. The proposed approach can help traffic administrators identify the key contributing factors, especially enforcement factors, in crash-prone and violation-prone areas and provide guidelines for improvement.

Collapse

Chen H, Wang M, Li J. Exploring the association between two groups of metals with potentially opposing renal effects and renal function in middle-aged and older adults: Evidence from an explainable machine learning method. ECOTOXICOLOGY AND ENVIRONMENTAL SAFETY 2024;269:115812. [PMID: 38091680 DOI: 10.1016/j.ecoenv.2023.115812] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/27/2023] [Revised: 11/12/2023] [Accepted: 12/08/2023] [Indexed: 01/12/2024]

Abstract

BACKGROUND

Machine learning models have promising applications in capturing the complex relationship between mixtures of exposures and outcomes.

OBJECTIVE

Our study aimed at introducing an explainable machine learning (EML) model to assess the association between metal mixtures with potentially opposing renal effects and renal function in middle-aged and older adults.

METHODS

This study extracted data from two cycle years of the National Health and Nutrition Examination Survey (NHANES). Participants aged 45 years or older with complete data on six metals (lead, cadmium, manganese, mercury, and selenium) and related covariates were enrolled. The EML model was developed by the optimized machine learning model together with Shapley Additive exPlanations (SHAP) to assess the chronic kidney disease (CKD) risk with metal mixtures. The results from EML were further compared in detail with multiple logistic regression (MLR) and Bayesian kernel machine regression (BKMR).

RESULTS

After adjusting for included covariates, MLR pointed out the lead and arsenic were generally positively associated with CKD, but manganese had a negative association. In the BKMR analysis, each metal was found to have a non-linear association with the risk of CKD, and interactions can exist between metals, especially for arsenic and lead. The EML ranked the feature importance: lead, manganese, arsenic and selenium were close behind in importance after gender, age or BMI for participants with CKD. Strong interactions between mercury and lead, manganese and cadmium and arsenic and manganese were identified by partial dependence plot (PDP) of SHAP and bivariate exposure-response effect plots of BKMR. The EML model determined the "trigger point" at which the risk of CKD abruptly changed.

CONCLUSION

Co-exposure to metals with different nephrotoxicity could have different joint association with renal function, and EML can be a powerful method for studying complex exposure mixtures.

Collapse

Sun Z, Wang D, Gu X, Abdel-Aty M, Xing Y, Wang J, Lu H, Chen Y. A hybrid approach of random forest and random parameters logit model of injury severity modeling of vulnerable road users involved crashes. ACCIDENT; ANALYSIS AND PREVENTION 2023;192:107235. [PMID: 37557001 DOI: 10.1016/j.aap.2023.107235] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/09/2023] [Revised: 07/12/2023] [Accepted: 07/23/2023] [Indexed: 08/11/2023]

Zhang R, Zhu R, Jia M, Pang Y, Zhang B, Bao X, Wang Y. Improvement of a Rapid Method of Detecting Gasoline Detergency Based on the Image Recognition. ACS OMEGA 2023;8:34134-34145. [PMID: 37744810 PMCID: PMC10515347 DOI: 10.1021/acsomega.3c05350] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/24/2023] [Accepted: 08/29/2023] [Indexed: 09/26/2023]

Almannaa M, Zawad MN, Moshawah M, Alabduljabbar H. Investigating the effect of road condition and vacation on crash severity using machine learning algorithms. Int J Inj Contr Saf Promot 2023;30:392-402. [PMID: 37079354 DOI: 10.1080/17457300.2023.2202660] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2022] [Revised: 03/14/2023] [Accepted: 04/10/2023] [Indexed: 04/21/2023]

Sun Z, Wang D, Gu X, Xing Y, Wang J, Lu H, Chen Y. A hybrid clustering and random forest model to analyse vulnerable road user to motor vehicle (VRU-MV) crashes. Int J Inj Contr Saf Promot 2023;30:338-351. [PMID: 37643462 DOI: 10.1080/17457300.2023.2180804] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2022] [Revised: 12/28/2022] [Accepted: 02/11/2023] [Indexed: 02/24/2023]

Farzipour A, Elmi R, Nasiri H. Detection of Monkeypox Cases Based on Symptoms Using XGBoost and Shapley Additive Explanations Methods. Diagnostics (Basel) 2023;13:2391. [PMID: 37510135 PMCID: PMC10378557 DOI: 10.3390/diagnostics13142391] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2023] [Revised: 07/03/2023] [Accepted: 07/11/2023] [Indexed: 07/30/2023] Open

Masello L, Castignani G, Sheehan B, Guillen M, Murphy F. Using contextual data to predict risky driving events: A novel methodology from explainable artificial intelligence. ACCIDENT; ANALYSIS AND PREVENTION 2023;184:106997. [PMID: 36854225 DOI: 10.1016/j.aap.2023.106997] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/26/2022] [Revised: 01/07/2023] [Accepted: 02/01/2023] [Indexed: 06/18/2023]

Abstract

Usage-based insurance has allowed insurers to dynamically tailor insurance premiums by understanding when and how safe policyholders drive. However, telematics information can also be used to understand the driving contexts experienced by the driver within each trip (e.g., road types, weather, traffic). Since different combinations of these conditions affect exposure to accidents, this understanding introduces predictive opportunities in driving risk assessment. This paper investigates the relationships between driving context combinations and risk using a naturalistic driving dataset of 77,859 km. In particular, XGBoost and Random Forests are used to determine the predictive significance of driving contexts for near-misses, speeding and distraction events. Moreover, the most important contextual factors in predicting these risky events are identified and ranked through Shapley Additive Explanations. The results show that the driving context has significant power in predicting driving risk. Speed limit, weather temperature, wind speed, traffic conditions and road slope appear in the top ten most relevant features for most risky events. Analysing contextual feature variations and their influence on risky events showed that low-speed limits increase the predicted frequency of speeding and phone unlocking events, whereas high-speed limits decrease harsh accelerations. Low temperatures decrease the expected frequency of harsh manoeuvres, and precipitations increase harsh acceleration, harsh braking, and distraction events. Furthermore, road slope, intersections and pavement quality are the most critical factors among road layout attributes. The methodology presented in this study aims to support road safety stakeholders and insurers by providing insights to study the contextual risk factors that influence road accident frequency and driving risk.

Collapse

Modeling industrial hydrocyclone operational variables by SHAP-CatBoost - A “conscious lab” approach. POWDER TECHNOL 2023. [DOI: 10.1016/j.powtec.2023.118416] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/09/2023]

Yi Z, Wu L. Identification of factors influencing net primary productivity of terrestrial ecosystems based on interpretable machine learning --evidence from the county-level administrative districts in China. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2023;326:116798. [PMID: 36435139 DOI: 10.1016/j.jenvman.2022.116798] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/05/2022] [Revised: 11/10/2022] [Accepted: 11/13/2022] [Indexed: 06/16/2023]

Zou Z, Wu Q, Wang J, Xu L, Zhou M, Lu Z, He Y, Wang Y, Liu B, Zhao Y. Research on non-destructive testing of hotpot oil quality by fluorescence hyperspectral technology combined with machine learning. SPECTROCHIMICA ACTA. PART A, MOLECULAR AND BIOMOLECULAR SPECTROSCOPY 2023;284:121785. [PMID: 36058172 DOI: 10.1016/j.saa.2022.121785] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/04/2022] [Revised: 08/21/2022] [Accepted: 08/23/2022] [Indexed: 06/15/2023]

Wang ZZ, Lu YN, Zou ZH, Ma YH, Wang T. Applying OHSA to Detect Road Accident Blackspots. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2022;19:16970. [PMID: 36554851 PMCID: PMC9779212 DOI: 10.3390/ijerph192416970] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/27/2022] [Revised: 12/12/2022] [Accepted: 12/14/2022] [Indexed: 06/17/2023]

Iranmanesh M, Seyedabrishami S, Moridpour S. Identifying high crash risk segments in rural roads using ensemble decision tree-based models. Sci Rep 2022;12:20024. [PMID: 36414672 PMCID: PMC9681741 DOI: 10.1038/s41598-022-24476-z] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2022] [Accepted: 11/16/2022] [Indexed: 11/24/2022] Open

Abstract

Traffic safety forecast models are mainly used to rank road segments. While existing studies have primarily focused on identifying segments in urban networks, rural networks have received less attention. However, rural networks seem to have a higher risk of severe crashes. This paper aims to analyse traffic crashes on rural roads to identify the influencing factors on the crash frequency and present a framework to develop a spatial-temporal crash risk map to prioritise high-risk segments on different days. The crash data of Khorasan Razavi province is used in this study. Crash frequency data with the temporal resolution of one day and spatial resolution of 1500 m from loop detectors are analysed. Four groups of influential factors, including traffic parameters (e.g. traffic flow, speed, time headway), road characteristics (e.g. road type, number of lanes), weather data (e.g. daily rainfall, snow depth, temperature), and calendar variables (e.g. day of the week, public holidays, month, year) are used for model calibration. Three different decision tree algorithms, including, Decision Tree (DT), Random Forest (RF) and eXtreme Gradient Boosting (XGBoost) have been employed to predict crash frequency. Results show that based on the traditional evaluation measures, the XGBosst is better for the explanation and interpretation of the factors affecting crash frequency, while the RF model is better for detecting trends and forecasting crash frequency. According to the results, the traffic flow rate, road type, year of the crash, and wind speed are the most influencing variables in predicting crash frequency on rural roads. Forecasting the high and medium risk segment-day in the rural network can be essential to the safety management plan. This risk will be sensitive to real traffic data, weather forecasts and road geometric characteristics. Seventy percent of high and medium risk segment-day are predicted for the case study.

Collapse

Wu Q, Xu L, Zou Z, Wang J, Zeng Q, Wang Q, Zhen J, Wang Y, Zhao Y, Zhou M. Rapid nondestructive detection of peanut varieties and peanut mildew based on hyperspectral imaging and stacked machine learning models. FRONTIERS IN PLANT SCIENCE 2022;13:1047479. [PMID: 36438117 PMCID: PMC9685660 DOI: 10.3389/fpls.2022.1047479] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/18/2022] [Accepted: 10/12/2022] [Indexed: 06/16/2023]

Hou L, Liu Y, Xie W, Dai Z, Yang W, Zhao Y. Statistical neural network (SNN) for predicting signal-to-noise ratio (SNR) from static parameters and its validation in 16-bit, 125-MSPS analog-to-digital converters (ADCs). THE REVIEW OF SCIENTIFIC INSTRUMENTS 2022;93:084701. [PMID: 36050066 DOI: 10.1063/5.0093709] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/30/2022] [Accepted: 07/11/2022] [Indexed: 06/15/2023]

An Explainable Machine Learning Framework for Forecasting Crude Oil Price during the COVID-19 Pandemic. AXIOMS 2022. [DOI: 10.3390/axioms11080374] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/10/2022]

Fatahi R, Nasiri H, Dadfar E, Chehreh Chelgani S. Modeling of energy consumption factors for an industrial cement vertical roller mill by SHAP-XGBoost: a "conscious lab" approach. Sci Rep 2022;12:7543. [PMID: 35534588 PMCID: PMC9085744 DOI: 10.1038/s41598-022-11429-9] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2022] [Accepted: 04/25/2022] [Indexed: 11/30/2022] Open

Wen X, Xie Y, Jiang L, Li Y, Ge T. On the interpretability of machine learning methods in crash frequency modeling and crash modification factor development. ACCIDENT; ANALYSIS AND PREVENTION 2022;168:106617. [PMID: 35202941 DOI: 10.1016/j.aap.2022.106617] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/13/2021] [Revised: 01/29/2022] [Accepted: 02/15/2022] [Indexed: 06/14/2023]

Dong S, Khattak A, Ullah I, Zhou J, Hussain A. Predicting and Analyzing Road Traffic Injury Severity Using Boosting-Based Ensemble Learning Models with SHAPley Additive exPlanations. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2022;19:ijerph19052925. [PMID: 35270617 PMCID: PMC8910532 DOI: 10.3390/ijerph19052925] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/03/2022] [Revised: 02/20/2022] [Accepted: 02/28/2022] [Indexed: 12/10/2022]

Abstract

Road traffic accidents are one of the world’s most serious problems, as they result in numerous fatalities and injuries, as well as economic losses each year. Assessing the factors that contribute to the severity of road traffic injuries has proven to be insightful. The findings may contribute to a better understanding of and potential mitigation of the risk of serious injuries associated with crashes. While ensemble learning approaches are capable of establishing complex and non-linear relationships between input risk variables and outcomes for the purpose of injury severity prediction and classification, most of them share a critical limitation: their “black-box” nature. To develop interpretable predictive models for road traffic injury severity, this paper proposes four boosting-based ensemble learning models, namely a novel Natural Gradient Boosting, Adaptive Gradient Boosting, Categorical Gradient Boosting, and Light Gradient Boosting Machine, and uses a recently developed SHapley Additive exPlanations analysis to rank the risk variables and explain the optimal model. Among four models, LightGBM achieved the highest classification accuracy (73.63%), precision (72.61%), and recall (70.09%), F1-scores (70.81%), and AUC (0.71) when tested on 2015–2019 Pakistan’s National Highway N-5 (Peshawar to Rahim Yar Khan Section) accident data. By incorporating the SHapley Additive exPlanations approach, we were able to interpret the model’s estimation results from both global and local perspectives. Following interpretation, it was determined that the Month_of_Year, Cause_of_Accident, Driver_Age and Collision_Type all played a significant role in the estimation process. According to the analysis, young drivers and pedestrians struck by a trailer have a higher risk of suffering fatal injuries. The combination of trailers and passenger vehicles, as well as driver at-fault, hitting pedestrians and rear-end collisions, significantly increases the risk of fatal injuries. This study suggests that combining LightGBM and SHAP has the potential to develop an interpretable model for predicting road traffic injury severity.

Collapse

Chang I, Park H, Hong E, Lee J, Kwon N. Predicting effects of built environment on fatal pedestrian accidents at location-specific level: Application of XGBoost and SHAP. ACCIDENT; ANALYSIS AND PREVENTION 2022;166:106545. [PMID: 34995959 DOI: 10.1016/j.aap.2021.106545] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/26/2021] [Revised: 12/05/2021] [Accepted: 12/13/2021] [Indexed: 06/14/2023]

Abstract

Understanding locally heterogeneous physical contexts in built environment is of great importance in developing preemptive countermeasures to mitigate pedestrian fatality risks. In this study, we aim to investigate the non-linear relationship between physical factors and pedestrian fatality at a location-specific level using a machine learning approach. The state-of-art machine learning algorithm, eXtreme Gradient Boosting (XGBoost), is employed for a binary classification problem, in which nationwide locations where fatal pedestrian accidents occurred for the years from 2012 to 2019 in Korea serve as positive samples (n_p = 13,366). For negative samples, locations with no pedestrian accidents are selected randomly to the size that is 10 times larger (n_n = 133,660) than positive samples. Fifteen features under the categories of road conditions, road facilities, road networks, and land uses are assigned to both the positive and negative sample locations using Geographic Information System (GIS). A method is proposed to avoid the class imbalance problem, and a final unbiased model is utilized to predict fatal pedestrian risks at the negative sample locations. In addition, Shapley Additive Explanations (SHAP) is introduced to provide a robust interpretation of the XGBoos prediction results. It is shown that 21.6% of the negative sample locations have a probability of fatal pedestrian accidents greater than 0.5 (or 78.4% accuracy). Generally, a road segment that lies in many of the shortest routes in a dense residential area with many lively activities from aligned buildings is a potential spot for fatal pedestrian accidents. However, based on the SHAP interpretation, the relationships between the features and pedestrian fatality are found nonlinear and locally heterogeneous. We discuss the implications of this result has for drafting policy recommendations to reduce pedestrian fatalities.

Collapse

Mukhopadhyay A, Pettet G, Vazirizade SM, Lu D, Jaimes A, Said SE, Baroud H, Vorobeychik Y, Kochenderfer M, Dubey A. A Review of Incident Prediction, Resource Allocation, and Dispatch Models for Emergency Management. ACCIDENT; ANALYSIS AND PREVENTION 2022;165:106501. [PMID: 34929574 DOI: 10.1016/j.aap.2021.106501] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/23/2021] [Revised: 11/14/2021] [Accepted: 11/15/2021] [Indexed: 06/14/2023]

Wei N, Zhang Q, Zhang Y, Jin J, Chang J, Yang Z, Ma C, Jia Z, Ren C, Wu L, Peng J, Mao H. Super-learner model realizes the transient prediction of CO₂ and NOx of diesel trucks: Model development, evaluation and interpretation. ENVIRONMENT INTERNATIONAL 2022;158:106977. [PMID: 34775187 DOI: 10.1016/j.envint.2021.106977] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/27/2021] [Revised: 10/20/2021] [Accepted: 11/08/2021] [Indexed: 06/13/2023]

Abstract

The transient simulation of CO₂ and NO_X from motor vehicles has essential applications in evaluating vehicular greenhouse gas emissions and pollutant emissions. However, accurately estimating vehicular transient emissions is challenging due to the heterogeneity between different vehicles and the continuous upgrading of vehicle exhaust purification technology. To accurately characterize the transient emissions of motor vehicles, a Super-learner model is used to build CO₂ and NOx transient emission models. The actual onboard test data of 9 China VI N₂ vehicles were used to train the model, and the test data of another China VI N₂ vehicle were selected for further robustness verification. There were significant differences in the emissions between the vehicles, but the constructed transient model could capture the common law of transient emissions from China VI N₂ vehicles. The R² values of CO₂ and NOx emission in the test data of the validation vehicle were 0.71 and 0.82, respectively. In addition, to further prove the model's robustness, the training data were synchronously modelled based on the Moves-method. The Super-learner model has a smaller RMSE on the validation set than the model based on the Moves-method, indicating that the Super-learner model has more transient simulation advantages. The marginal contributions of the model characteristics to the model results were analysed by SHapley Additive exPlanation (SHAP) value interpretation, and the marginal contributions of different pollutant characteristic parameters varied. Therefore, when establishing transient models of different pollutants, the selection of the model parameters demands considering the generation and purification process of different pollutants. The present work provides novel insights into the parameter selection, construction, and interpretation of the transient vehicle emission model.

Collapse

Affiliation(s)

Ning Wei Tianjin Key Laboratory of Urban Transport Emission Research & State Environmental Protection Key Laboratory of Urban Ambient Air Particulate Matter Pollution Prevention and Control, College of Environmental Science and Engineering, Nankai University, Tianjin 300071, China
Qijun Zhang Tianjin Key Laboratory of Urban Transport Emission Research & State Environmental Protection Key Laboratory of Urban Ambient Air Particulate Matter Pollution Prevention and Control, College of Environmental Science and Engineering, Nankai University, Tianjin 300071, China.
Yanjie Zhang Tianjin Key Laboratory of Urban Transport Emission Research & State Environmental Protection Key Laboratory of Urban Ambient Air Particulate Matter Pollution Prevention and Control, College of Environmental Science and Engineering, Nankai University, Tianjin 300071, China
Jiaxin Jin Tianjin Key Laboratory of Urban Transport Emission Research & State Environmental Protection Key Laboratory of Urban Ambient Air Particulate Matter Pollution Prevention and Control, College of Environmental Science and Engineering, Nankai University, Tianjin 300071, China
Junyu Chang Tianjin Key Laboratory of Urban Transport Emission Research & State Environmental Protection Key Laboratory of Urban Ambient Air Particulate Matter Pollution Prevention and Control, College of Environmental Science and Engineering, Nankai University, Tianjin 300071, China
Zhiwen Yang Tianjin Key Laboratory of Urban Transport Emission Research & State Environmental Protection Key Laboratory of Urban Ambient Air Particulate Matter Pollution Prevention and Control, College of Environmental Science and Engineering, Nankai University, Tianjin 300071, China
Chao Ma Tianjin Key Laboratory of Urban Transport Emission Research & State Environmental Protection Key Laboratory of Urban Ambient Air Particulate Matter Pollution Prevention and Control, College of Environmental Science and Engineering, Nankai University, Tianjin 300071, China
Zhenyu Jia Tianjin Key Laboratory of Urban Transport Emission Research & State Environmental Protection Key Laboratory of Urban Ambient Air Particulate Matter Pollution Prevention and Control, College of Environmental Science and Engineering, Nankai University, Tianjin 300071, China
Chunzhe Ren Tianjin Key Laboratory of Urban Transport Emission Research & State Environmental Protection Key Laboratory of Urban Ambient Air Particulate Matter Pollution Prevention and Control, College of Environmental Science and Engineering, Nankai University, Tianjin 300071, China
Lin Wu Tianjin Key Laboratory of Urban Transport Emission Research & State Environmental Protection Key Laboratory of Urban Ambient Air Particulate Matter Pollution Prevention and Control, College of Environmental Science and Engineering, Nankai University, Tianjin 300071, China
Jianfei Peng Tianjin Key Laboratory of Urban Transport Emission Research & State Environmental Protection Key Laboratory of Urban Ambient Air Particulate Matter Pollution Prevention and Control, College of Environmental Science and Engineering, Nankai University, Tianjin 300071, China
Hongjun Mao Tianjin Key Laboratory of Urban Transport Emission Research & State Environmental Protection Key Laboratory of Urban Ambient Air Particulate Matter Pollution Prevention and Control, College of Environmental Science and Engineering, Nankai University, Tianjin 300071, China.

Collapse

Chen S, Shao H, Ji X. Insights into Factors Affecting Traffic Accident Severity of Novice and Experienced Drivers: A Machine Learning Approach. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2021;18:ijerph182312725. [PMID: 34886451 PMCID: PMC8656871 DOI: 10.3390/ijerph182312725] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/26/2021] [Revised: 11/24/2021] [Accepted: 11/30/2021] [Indexed: 11/16/2022]

A Cost-Sensitive Diagnosis Method Based on the Operation and Maintenance Data of UAV. APPLIED SCIENCES-BASEL 2021. [DOI: 10.3390/app112311116] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]