Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Cha Y, Shin J, Go B, Lee DS, Kim Y, Kim T, Park YS. An interpretable machine learning method for supporting ecosystem management: Application to species distribution models of freshwater macroinvertebrates. J Environ Manage 2021;291:112719. [PMID: 33946026 DOI: 10.1016/j.jenvman.2021.112719] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Received: 01/22/2021] [Revised: 03/30/2021] [Accepted: 04/24/2021] [Indexed: 06/12/2023]

Cited by Other Article(s)

Zhang W, Zhao Y, Zhang F, Shi X, Zeng C, Maerker M. Understanding the mechanism of gully erosion in the alpine region through an interpretable machine learning approach. The Science of the Total Environment 2024;949:174949. [PMID: 39067585 DOI: 10.1016/j.scitotenv.2024.174949] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/15/2024] [Revised: 07/19/2024] [Accepted: 07/20/2024] [Indexed: 07/30/2024]

Abstract

In the alpine region, climate warming has led to the retreat of glaciers, snow cover, and permafrost. This has intensified water cycling and soil erosion and increased the occurrence of natural disasters in the alpine region. This study investigated the Lhasa River Basin in the southern Tibetan Plateau, serving as a representative case study of a typical alpine basin, with a specific focus on gully erosion. Based on field investigations and interpretation using high-resolution satellite remote sensing images, the Random Forest (RF) algorithm was applied to evaluate gully erosion susceptibility at the watershed level. The Shapley Additive Interpretation method was then used to interpret the RF model and gain deeper insights into the influencing variables of gully erosion. The results showed that the RF model achieved an area under the receiver operating characteristic curve (AUC) of 0.99 and 0.98 for the training and testing datasets, respectively, indicating an outstanding performance of the model. The resulting susceptibility map based on the RF model shows that areas with moderate and higher levels of gully erosion susceptibility cover 50% of the basin. The model interpretation results indicated that elevation, slope, permafrost, rainstorm, silt loam topsoil, human activity, stream power, and vegetation were the explanatory variables with the highest importance for gully erosion occurrence. Different variables are characterized by specific thresholds promoting gully erosion, such as: i) elevations higher than 4950 m, ii) slopes steeper than 13.5°, iii) extreme rainstorms longer than 11 days per year, iv) silt loam topsoil, v) presence of permafrost, vi) stream power index higher than 1.2, and vii) normalized difference vegetation index (NDVI) lower than 0.25. Our findings provide the scientific basis to improve soil erosion control in such highly vulnerable alpine areas.

Nong X, Lai C, Chen L, Wei J. A novel coupling interpretable machine learning framework for water quality prediction and environmental effect understanding in different flow discharge regulations of hydro-projects. The Science of the Total Environment 2024;950:175281. [PMID: 39117235 DOI: 10.1016/j.scitotenv.2024.175281] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/21/2024] [Revised: 08/01/2024] [Accepted: 08/02/2024] [Indexed: 08/10/2024]

Abstract

Machine learning models (MLMs) have been increasingly used to forecast water pollution. However, the "black box" characteristic for understanding mechanism processes still limits the applicability of MLMs for water quality management in hydro-projects under complex and frequently artificial regulation. This study proposes an interpretable machine learning framework for water quality prediction coupled with a hydrodynamic (flow discharge) scenario-based Random Forest (RF) model with multiple model-agnostic techniques and quantifies global, local, and joint interpretations (i.e., partial dependence, individual conditional expectation, and accumulated local effects) of environmental factor implications. The framework was applied and verified to predict the permanganate index (CODMn) under different flow discharge regulation scenarios in the Middle Route of the South-to-North Water Diversion Project of China (MRSNWDPC). A total of 4664 sampling cases data matrices, including water quality, meteorological, and hydrological indicators from eight national stations along the main canal of the MRSNWDPC, were collected from May 2019 to December 2020. The results showed that the RF models were effective in forecasting CODMn in all flow discharge scenarios, with a mean square error, coefficient of determination, and mean absolute error of 0.006-0.026, 0.481-0.792, and 0.069-0.104, respectively, in the testing dataset. A global interpretation indicated that dissolved oxygen, flow discharge, and surface pressure are the three most important variables of CODMn. Local and joint interpretations indicated that the RF-based prediction model provides a basic understanding of the physical mechanisms of environmental systems. 
The proposed framework can effectively learn the fundamental environmental implications of water quality variations and provide reliable prediction performance, highlighting the importance of model interpretability for trustworthy machine learning applications in water management projects. This study provides scientific references for applying advanced data-driven MLMs to water quality forecasting and a reliable methodological framework for water quality management and similar hydro-projects.

Guo Y, Zhang S, Ren L, Tian X, Tang S, Xian Y, Wu X, Zhang Z. Prediction of Chinese suitable habitats of Panax notoginseng under climate change based on MaxEnt and chemometric methods. Sci Rep 2024;14:16434. [PMID: 39014061 PMCID: PMC11252130 DOI: 10.1038/s41598-024-67178-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/11/2023] [Accepted: 07/09/2024] [Indexed: 07/18/2024] Open

Delaney JT, Larson DM. Using explainable machine learning methods to evaluate vulnerability and restoration potential of ecosystem state transitions. Conservation Biology: The Journal of the Society for Conservation Biology 2024;38:e14203. [PMID: 37817744 DOI: 10.1111/cobi.14203] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/15/2022] [Revised: 09/27/2023] [Accepted: 10/05/2023] [Indexed: 10/12/2023]

Tseng KY, Hsieh YT, Lin HC. Machine learning prediction on wetland succession and the impact of artificial structures from a decade of field data. The Science of the Total Environment 2024;937:173426. [PMID: 38796015 DOI: 10.1016/j.scitotenv.2024.173426] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/03/2024] [Revised: 05/19/2024] [Accepted: 05/19/2024] [Indexed: 05/28/2024]

Abstract

The artificial structures can influence wetland topology and sediment properties, thereby shaping plant distribution and composition. Macrobenthos composition was correlated with plant cover. Previous studies on the impact of artificial structures on plant distribution are scarce in incorporating time-series data or extended field surveys. In this study, a machine-learning-based species distribution model with decade-long observation was analyzed to investigate the correlation between the shift in the distribution of B. planiculmis, artificial structure-induced elevation changes and the expansion of other plants, as well as their connection to soil properties and crab composition dynamics under plants in Gaomei Wetland. Long short-term memory model (LSTM) with Shapley additive explanations (SHAP) was employed for predicting the distribution of B. planiculmis and explaining feature importance. The results indicated that wetland topology was influenced by both artificial structures and plants. Areas initially colonized by B. planiculmis were replaced by other species. Soil properties showed significant differences among plant patches; however, principal component analysis (PCA) of sediment properties and niche similarity analysis showed that the niche of plants was overlapped. Crab composition was different under different plants. The presence probability of B. planiculmis near woody paths decreased according to LSTM and field survey data. SHAP analysis suggested that the distribution of other plants, historical distribution of B. planiculmis and sediment properties significantly contributed to the presence probability of B. planiculmis. A sharp decrease in SHAP values with increasing NDVI at suitable elevations, overlap in PCA of sediment properties and niche similarity indicated potential competition among plants. This decade-long time-series field survey revealed the joint effects of artificial structure and vegetation on the topology and soil properties dynamics. 
These changes influenced the plant distribution through potential plant competition. LSTM with SHAP provided valuable insights into the mechanisms underlying the effects of artificial structures on the plant zonation process.

Talukdar S, Shahfahad, Bera S, Naikoo MW, Ramana GV, Mallik S, Kumar PA, Rahman A. Optimisation and interpretation of machine and deep learning models for improved water quality management in Lake Loktak. Journal of Environmental Management 2024;351:119866. [PMID: 38147770 DOI: 10.1016/j.jenvman.2023.119866] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/31/2023] [Revised: 11/28/2023] [Accepted: 12/13/2023] [Indexed: 12/28/2023]

Abstract

Loktak Lake, one of the largest freshwater lakes in Manipur, India, is critical for the eco-hydrology and economy of the region, but faces deteriorating water quality due to urbanisation, anthropogenic activities, and domestic sewage. Addressing the urgent need for effective pollution management, this study aims to assess the lake's water quality status using the water quality index (WQI) and develop advanced machine learning (ML) tools for WQI assessment and ML model interpretation to improve pollution management decision making. The WQI was assessed using entropy-based weighting arithmetic and three ML models - Gradient Boosting Machine (GBM), Random Forest (RF) and Deep Neural Network (DNN) - were optimised using a grid search algorithm in the H2O Application Programming Interface (API). These models were validated by various metrics and interpreted globally and locally via Partial Dependency Plot (PDP), Accumulated Local Effect (ALE) and SHapley Additive exPlanations (SHAP). The results show a WQI range of 72.38-100, with 52.7% of samples categorised as very poor. The RF model outperformed GBM and DNN and showed the highest accuracy and generalisation ability, which is reflected in the superior R2 values (0.97 in training, 0.9 in test) and the lower root mean square error (RMSE). RF's minimal margin of error and reliable feature interpretation contrasted with DNN's larger margin of error and inconsistency, which affected its usefulness for decision making. Turbidity was found to be a critical predictive feature in all models, significantly influencing WQI, with other variables such as pH and temperature also playing an important role. SHAP dependency plots illustrated the direct relationship between key water quality parameters such as turbidity and WQI predictions. 
The novelty of this study lies in its comprehensive approach to the evaluation and interpretation of ML models for WQI estimation, which provides a nuanced understanding of water quality dynamics in Loktak Lake. By identifying the most effective ML models and key predictive functions, this study provides invaluable insights for water quality management and paves the way for targeted strategies to monitor and improve water quality in this vital freshwater ecosystem.

Zhang H, Guo W, Wang W. The dimensionality reductions of environmental variables have a significant effect on the performance of species distribution models. Ecol Evol 2023;13:e10747. [PMID: 38020673 PMCID: PMC10659948 DOI: 10.1002/ece3.10747] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/03/2023] [Revised: 10/29/2023] [Accepted: 11/06/2023] [Indexed: 12/01/2023] Open

Abstract

How to effectively obtain species-related low-dimensional data from massive environmental variables has become an urgent problem for species distribution models (SDMs). In this study, we will explore whether dimensionality reduction on environmental variables can improve the predictive performance of SDMs. We first used two linear (i.e., principal component analysis (PCA) and independent components analysis) and two nonlinear (i.e., kernel principal component analysis (KPCA) and uniform manifold approximation and projection) dimensionality reduction techniques (DRTs) to reduce the dimensionality of high-dimensional environmental data. Then, we established five SDMs based on the environmental variables of dimensionality reduction for 23 real plant species and nine virtual species, and compared the predictive performance of those with the SDMs based on the selected environmental variables through Pearson's correlation coefficient (PCC). In addition, we studied the effects of DRTs, model complexity, and sample size on the predictive performance of SDMs. The predictive performance of SDMs under DRTs other than KPCA is better than using PCC. And the predictive performance of SDMs using linear DRTs is better than using nonlinear DRTs. In addition, using DRTs to deal with environmental variables has no less impact on the predictive performance of SDMs than model complexity and sample size. When the model complexity is at the complex level, PCA can improve the predictive performance of SDMs the most by 2.55% compared with PCC. At the middle level of sample size, the PCA improved the predictive performance of SDMs by 2.68% compared with the PCC. Our study demonstrates that DRTs have a significant effect on the predictive performance of SDMs. Specifically, linear DRTs, especially PCA, are more effective at improving model predictive performance under relatively complex model complexity or large sample sizes.

Aryal K, Maraseni T, Apan A. Preference, perceived change, and professed relationship among ecosystem services in the Himalayas. Journal of Environmental Management 2023;344:118522. [PMID: 37390580 DOI: 10.1016/j.jenvman.2023.118522] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/15/2022] [Revised: 06/20/2023] [Accepted: 06/24/2023] [Indexed: 07/02/2023]

Abstract

The demand side of ecosystem service (ES), especially preference and perception of supply and interactions among ES, is an important yet underexplored research area for landscape planning and management in human-dominated landscapes. Taking a case of multifunctional landscape in the Hindu-Kush Himalayan region, we carried out a social survey of ES, focusing on preference, perceived change, and observed relationship among six major ES from the local people's perspective. Using a semi-structured questionnaire, data collection was done from 300 households from 10 categories of human settlements, based on watershed and land cover types. Garrett mean score (GMS), ordinal logistic regression estimates, and Chi-square test were performed for quantitative data, while an inductive approach was adopted for qualitative data analysis. The results show that at the landscape level, local people preferred water yield (GMS = 70) and crop production (GMS = 66) as the most preferred ES, whereas habitat quality (GMS = 37) and carbon sequestration (GMS = 35) were among the least preferred ES. More than 70% of the respondents believed that the supply of crop production has decreased over the last two decades; however, the supply of other provisioning and non-provisioning ES has increased as observed by majority of the respondents. Among the 15 pairs of ES, local people believe that co-occurrence of ES is possible. Majority of the respondents said that there exist synergistic relationship among 13 pairs of ES, except crop production which is negatively related with timber production and carbon sequestration. Among the identified trade-offs in ES, majority of local people believed that direct trade-offs (i.e., linear inverse relationship) is dominant as observed in 8 pairs of ES, followed by concave and convex trade-offs. 
Based on our analysis, we argue that the preference and perceived change of ES are more dependent on spatial heterogeneity of communities (i.e., watershed type, municipal category, and land cover type of residence) than on socio-economic determinants. Further, we have discussed and suggested a few policy and management measures, including place-based spatial assessment of the social demand and preference, embracing agroforestry practices in ecosystem management programs, mainstreaming non-local ES in local decision making by incentives, and optimizing the supply of desired ES through integrated biophysical and socio-economic assessment of the landscape.

Sotomayor G, Romero J, Ballari D, Vázquez RF, Ramírez-Morales I, Hampel H, Galarza X, Montesinos B, Forio MAE, Goethals PLM. Occurrence Prediction of Riffle Beetles (Coleoptera: Elmidae) in a Tropical Andean Basin of Ecuador Using Species Distribution Models. Biology 2023;12:biology12030473. [PMID: 36979164 PMCID: PMC10045380 DOI: 10.3390/biology12030473] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/07/2023] [Revised: 03/16/2023] [Accepted: 03/16/2023] [Indexed: 03/30/2023]

Lim SJ, Son M, Ki SJ, Suh SI, Chung J. Opportunities and challenges of machine learning in bioprocesses: Categorization from different perspectives and future direction. Bioresource Technology 2023;370:128518. [PMID: 36565818 DOI: 10.1016/j.biortech.2022.128518] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Received: 10/31/2022] [Revised: 12/15/2022] [Accepted: 12/17/2022] [Indexed: 06/17/2023]

Wikle CK, Datta A, Hari BV, Boone EL, Sahoo I, Kavila I, Castruccio S, Simmons SJ, Burr WS, Chang W. An illustration of model agnostic explainability methods applied to environmental data. Environmetrics 2023;34:e2772. [PMID: 37200542 PMCID: PMC10187774 DOI: 10.1002/env.2772] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/15/2022] [Accepted: 09/20/2022] [Indexed: 05/20/2023]

Lee DS, Lee DY, Park YS. Interpretable machine learning approach to analyze the effects of landscape and meteorological factors on mosquito occurrences in Seoul, South Korea. Environmental Science and Pollution Research International 2023;30:532-546. [PMID: 35900627 PMCID: PMC9813121 DOI: 10.1007/s11356-022-22099-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/25/2022] [Accepted: 07/14/2022] [Indexed: 06/15/2023]

Bifarin OO. Interpretable machine learning with tree-based shapley additive explanations: Application to metabolomics datasets for binary classification. PLoS One 2023;18:e0284315. [PMID: 37141218 PMCID: PMC10159207 DOI: 10.1371/journal.pone.0284315] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 09/29/2022] [Accepted: 03/28/2023] [Indexed: 05/05/2023] Open

Maloney KO, Buchanan C, Jepsen RD, Krause KP, Cashman MJ, Gressler BP, Young JA, Schmid M. Explainable machine learning improves interpretability in the predictive modeling of biological stream conditions in the Chesapeake Bay Watershed, USA. Journal of Environmental Management 2022;322:116068. [PMID: 36058075 DOI: 10.1016/j.jenvman.2022.116068] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 05/17/2022] [Revised: 08/03/2022] [Accepted: 08/19/2022] [Indexed: 06/15/2023]

Abstract

Anthropogenic alterations have resulted in widespread degradation of stream conditions. To aid in stream restoration and management, baseline estimates of conditions and improved explanation of factors driving their degradation are needed. We used random forests to model biological conditions using a benthic macroinvertebrate index of biotic integrity for small, non-tidal streams (upstream area ≤200 km²) in the Chesapeake Bay watershed (CBW) of the mid-Atlantic coast of North America. We utilized several global and local model interpretation tools to improve average and site-specific model inferences, respectively. The model was used to predict condition for 95,867 individual catchments for eight periods (2001, 2004, 2006, 2008, 2011, 2013, 2016, 2019). Predicted conditions were classified as Poor, FairGood, or Uncertain to align with management needs and individual reach lengths and catchment areas were summed by condition class for the CBW for each period. Global permutation and local Shapley importance values indicated percent of forest, development, and agriculture in upstream catchments had strong impacts on predictions. Development and agriculture negatively influenced stream condition for model average (partial dependence [PD] and accumulated local effect [ALE] plots) and local (individual condition expectation and Shapley value plots) levels. Friedman's H-statistic indicated large overall interactions for these three land covers, and bivariate global plots (PD and ALE) supported interactions among agriculture and development. Total stream length and catchment area predicted in FairGood conditions decreased then increased over the 19-years (length/area: 66.6/65.4% in 2001, 66.3/65.2% in 2011, and 66.6/65.4% in 2019). 
Examination of individual catchment predictions between 2001 and 2019 showed those predicted to have the largest decreases in condition had large increases in development; whereas catchments predicted to exhibit the largest increases in condition showed moderate increases in forest cover. Use of global and local interpretative methods together with watershed-wide and individual catchment predictions support conservation practitioners that need to identify widespread and localized patterns, especially acknowledging that management actions typically take place at individual-reach scales.

An C, Yang H, Yu X, Han ZY, Cheng Z, Liu F, Dou J, Li B, Li Y, Li Y, Yu J, Liang P. A Machine Learning Model Based on Health Records for Predicting Recurrence After Microwave Ablation of Hepatocellular Carcinoma. J Hepatocell Carcinoma 2022;9:671-684. [PMID: 35923613 PMCID: PMC9342890 DOI: 10.2147/jhc.s358197] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Received: 01/19/2022] [Accepted: 07/08/2022] [Indexed: 12/24/2022] Open

Abstract

Background and Aim

Early recurrence (ER) presents a challenge for the survival prognosis of patients with hepatocellular carcinoma (HCC). The aim of this study was to investigate machine learning (ML) models using clinical data for predicting ER after microwave ablation (MWA).

Methods

Between August 2005 and December 2019, 1574 patients with early-stage HCC who underwent MWA at four hospitals were reviewed. Then, 36 clinical data points per patient were collected, and the patients were assigned to the training, internal validation, and external validation sets. Apart from traditional logistic regression (LR), three ML models—random forest, support vector machine, and eXtreme Gradient Boosting (XGBoost)—were built and validated for their predictive ability with the area under the ROC curve (AUC). Algorithms such as SHapley Additive exPlanations (SHAP) and local interpretable model-agnostic explanations (LIME) were used to realize their interpretability.

Results

The three ML models all outperformed LR (P < 0.001 for all) in predictive ability. When nine variables (tumor number, platelet, α-fetoprotein, comorbidity score, white blood cell, cholinesterase, prothrombin time, neutrophils, and etiology) were extracted simultaneously using recursive feature elimination with cross-validation, the XGBoost model achieved the best discrimination among all models, with an AUC of 0.75 (95% CI [confidence interval]: 0.72–0.78) in the training set, 0.74 (95% CI: 0.69–0.80) in the internal validation set, and 0.76 (95% CI: 0.70–0.82) in the external validation set, and it was interpreted through the visualization of risk factors by the SHAP and LIME algorithms. The predictive system for post-ablation recurrence risk stratification was provided online (http://114.251.235.51:8001/) based on XGBoost analysis.

Conclusion

The XGBoost model based on clinical data can effectively predict ER risk after MWA, which can contribute to surveillance, prevention, and treatment strategies for HCC.

Affiliation(s)

Chao An Department of Ultrasound, PLA Medical College & 5th Medical Center of Chinese PLA General Hospital, Beijing, 100853, People’s Republic of China
Hongcai Yang Department of Ultrasound, PLA Medical College & 5th Medical Center of Chinese PLA General Hospital, Beijing, 100853, People’s Republic of China; School of Medicine, Nankai University, Tianjin, People’s Republic of China
Xiaoling Yu Department of Ultrasound, PLA Medical College & 5th Medical Center of Chinese PLA General Hospital, Beijing, 100853, People’s Republic of China
Zhi-Yu Han Department of Ultrasound, PLA Medical College & 5th Medical Center of Chinese PLA General Hospital, Beijing, 100853, People’s Republic of China
Zhigang Cheng Department of Ultrasound, PLA Medical College & 5th Medical Center of Chinese PLA General Hospital, Beijing, 100853, People’s Republic of China
Fangyi Liu Department of Ultrasound, PLA Medical College & 5th Medical Center of Chinese PLA General Hospital, Beijing, 100853, People’s Republic of China
Jianping Dou Department of Ultrasound, PLA Medical College & 5th Medical Center of Chinese PLA General Hospital, Beijing, 100853, People’s Republic of China
Bing Li National Laboratory of Pattern Recognition (NLPR), Institute of Automation, Chinese Academy of Sciences, Beijing, People’s Republic of China
Yansheng Li DHC Mediway Technology CO, Ltd, Beijing, People’s Republic of China
Yichao Li DHC Mediway Technology CO, Ltd, Beijing, People’s Republic of China
Jie Yu Department of Ultrasound, PLA Medical College & 5th Medical Center of Chinese PLA General Hospital, Beijing, 100853, People’s Republic of China
Ping Liang Department of Ultrasound, PLA Medical College & 5th Medical Center of Chinese PLA General Hospital, Beijing, 100853, People’s Republic of China
Correspondence: Ping Liang; Jie Yu, Department of Ultrasound, PLA Medical College & 5th Medical Center of Chinese PLA General Hospital, Beijing, 100853, People’s Republic of China, Tel +86-10-66939530, Fax +86-10-68161218

Bellin N, Tesi G, Marchesani N, Rossi V. Species distribution modeling and machine learning in assessing the potential distribution of freshwater zooplankton in Northern Italy. Ecol Inform 2022. [DOI: 10.1016/j.ecoinf.2022.101682] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Indexed: 11/16/2022]

Kim T, Lee D, Shin J, Kim Y, Cha Y. Learning hierarchical Bayesian networks to assess the interaction effects of controlling factors on spatiotemporal patterns of fecal pollution in streams. The Science of the Total Environment 2022;812:152520. [PMID: 34953848 DOI: 10.1016/j.scitotenv.2021.152520] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 09/30/2021] [Revised: 11/28/2021] [Accepted: 12/14/2021] [Indexed: 06/14/2023]

Abstract

The dynamics of fecal indicator bacteria, such as fecal coliforms (FC) in streams, are influenced by the interactions of a myriad of factors. To predict complex spatiotemporal patterns of FC in streams and assess the relative importance of numerous controlling factors, the adoption of a hierarchical Bayesian network (HBN) was proposed in this study. By introducing latent variables correlated to the observed variables into a Bayesian network, the HBN can represent causal relationships among a large set of variables with a multilevel hierarchy. The study area encompasses 215 sites across the watersheds of the four major rivers in South Korea. The monitoring data collected during the 2012-2019 period included 32 input variables pertaining to meteorology, geography, soil characteristics, land cover, urbanization index, livestock density, and point sources. As model endpoints, the exceedance probability of the FC standard concentration as well as two pollution characteristics (i.e., pollution degree and type), derived from FC load duration curves were used. The probability of exceeding an FC threshold value (200 CFU/100 mL) showed spatiotemporal variations, whereas pollution degree and type showed spatial variations that represent long-term severity and relative dominance of nonpoint and point source fecal pollution, respectively. The conceptual model was validated using structural equation modeling to develop the HBN. The results demonstrate that the HBN effectively simplified the model structure, while showing strong model performance (AUC = 0.81, accuracy = 0.74). The results of the sensitivity analysis indicate that land cover is the most important factor in predicting the probability of exceedance and pollution degree, whereas the urbanization index explains most of the variability in pollution type. 
Furthermore, the results of the scenario analysis suggest that the HBN provides an interpretable framework in which the interaction of controlling factors has causal relationships at different levels that can be identified and visualized.

An Interpretable Machine Learning Model for Daily Global Solar Radiation Prediction. Energies 2021. [DOI: 10.3390/en14217367] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Indexed: 01/18/2023]