Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Wang L, Zhu Z, Sassoubre L, Yu G, Liao C, Hu Q, Wang Y. Improving the robustness of beach water quality modeling using an ensemble machine learning approach. Sci Total Environ 2021;765:142760. [PMID: 33131841 DOI: 10.1016/j.scitotenv.2020.142760] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Received: 07/15/2020] [Revised: 09/28/2020] [Accepted: 09/28/2020] [Indexed: 05/12/2023]

Cited by Other Article(s)

Moeinzadeh H, Yong KT, Withana A. A critical analysis of parameter choices in water quality assessment. WATER RESEARCH 2024;258:121777. [PMID: 38781620 DOI: 10.1016/j.watres.2024.121777] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/11/2024] [Revised: 04/25/2024] [Accepted: 05/12/2024] [Indexed: 05/25/2024]

Abstract

The determination of water quality heavily depends on the selection of parameters recorded from water samples for the water quality index (WQI). Data-driven methods, including machine learning models and statistical approaches, are frequently used to refine the parameter set for four main reasons: reducing cost, reducing uncertainty, addressing the eclipsing problem, and enhancing the performance of models predicting the WQI. Despite their widespread use, there is a noticeable gap in comprehensive reviews that systematically examine previous studies in this area. Such reviews are essential to assess the validity of these objectives and to demonstrate the effectiveness of data-driven methods in achieving these goals. This paper sets out with two primary aims: first, to provide a review of the existing literature on methods for selecting parameters; second, to delineate and evaluate the four principal motivations for parameter selection identified in the literature. This manuscript categorizes existing studies into two methodological groups for refining parameters: one focuses on preserving information within the dataset, and another ensures consistent prediction using the full set of parameters. It characterizes each group and evaluates how effectively each approach meets the four predefined objectives. The study finds that the minimal WQI approach, common to both categories, is the only approach that has successfully reduced recording costs. Nonetheless, it notes that simply reducing the number of parameters does not guarantee cost savings. Furthermore, the group of studies classified as preserving information within the dataset has demonstrated potential to decrease the eclipsing problem, whereas studies in the consistent prediction group have not been able to mitigate this issue. Additionally, since data-driven approaches still rely on the initial parameters chosen by experts, they do not eliminate the need for expert judgment.
The study further points out that the WQI formula is a straightforward and expedient tool for assessing water quality. Consequently, the paper argues that employing machine learning solely to reduce the number of parameters to enhance WQI prediction is not a standalone solution. Rather, this objective should be integrated with a more comprehensive set of research goals. The critical analysis of research objectives and the characterization of previous studies lay the groundwork for future research. This groundwork will enable subsequent studies to evaluate how their proposed methods can effectively achieve these objectives.

Lloyd SD, Carvajal G, Campey M, Taylor N, Osmond P, Roser DJ, Khan SJ. Predicting recreational water quality and public health safety in urban estuaries using Bayesian Networks. WATER RESEARCH 2024;254:121319. [PMID: 38422692 DOI: 10.1016/j.watres.2024.121319] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/08/2023] [Revised: 02/05/2024] [Accepted: 02/14/2024] [Indexed: 03/02/2024]

Peng T, Xiong J, Sun K, Qian S, Tao Z, Nazir MS, Zhang C. Research and application of a novel selective stacking ensemble model based on error compensation and parameter optimization for AQI prediction. ENVIRONMENTAL RESEARCH 2024;247:118176. [PMID: 38215922 DOI: 10.1016/j.envres.2024.118176] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/01/2023] [Revised: 12/11/2023] [Accepted: 01/09/2024] [Indexed: 01/14/2024]

Essamlali I, Nhaila H, El Khaili M. Advances in machine learning and IoT for water quality monitoring: A comprehensive review. Heliyon 2024;10:e27920. [PMID: 38533055 PMCID: PMC10963334 DOI: 10.1016/j.heliyon.2024.e27920] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/07/2023] [Revised: 02/22/2024] [Accepted: 03/08/2024] [Indexed: 03/28/2024] Open

Sakizadeh M, Zhang C, Milewski A. Spatial distribution pattern and health risk of groundwater contamination by cadmium, manganese, lead and nitrate in groundwater of an arid area. ENVIRONMENTAL GEOCHEMISTRY AND HEALTH 2024;46:80. [PMID: 38367130 DOI: 10.1007/s10653-023-01845-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/23/2023] [Accepted: 12/21/2023] [Indexed: 02/19/2024]

Tselemponis A, Stefanis C, Giorgi E, Kalmpourtzi A, Olmpasalis I, Tselemponis A, Adam M, Kontogiorgis C, Dokas IM, Bezirtzoglou E, Constantinidis TC. Coastal Water Quality Modelling Using E. coli, Meteorological Parameters and Machine Learning Algorithms. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2023;20:6216. [PMID: 37444064 PMCID: PMC10341787 DOI: 10.3390/ijerph20136216] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/12/2023] [Revised: 06/19/2023] [Accepted: 06/21/2023] [Indexed: 07/15/2023]

Affiliation(s)

Athanasios Tselemponis Laboratory of Hygiene and Environmental Protection, Medical School, Democritus University of Thrace, 68100 Alexandroupoli, Greece
Christos Stefanis Laboratory of Hygiene and Environmental Protection, Medical School, Democritus University of Thrace, 68100 Alexandroupoli, Greece
Elpida Giorgi Laboratory of Hygiene and Environmental Protection, Medical School, Democritus University of Thrace, 68100 Alexandroupoli, Greece
Aikaterini Kalmpourtzi Laboratory of Hygiene and Environmental Protection, Medical School, Democritus University of Thrace, 68100 Alexandroupoli, Greece
Ioannis Olmpasalis Laboratory of Hygiene and Environmental Protection, Medical School, Democritus University of Thrace, 68100 Alexandroupoli, Greece
Antonios Tselemponis Laboratory of Hygiene and Environmental Protection, Medical School, Democritus University of Thrace, 68100 Alexandroupoli, Greece
Maria Adam Laboratory of Hygiene and Environmental Protection, Medical School, Democritus University of Thrace, 68100 Alexandroupoli, Greece
Christos Kontogiorgis Laboratory of Hygiene and Environmental Protection, Medical School, Democritus University of Thrace, 68100 Alexandroupoli, Greece
Ioannis M. Dokas Department of Civil Engineering, Democritus University of Thrace, 69100 Komotini, Greece
Eugenia Bezirtzoglou Laboratory of Hygiene and Environmental Protection, Medical School, Democritus University of Thrace, 68100 Alexandroupoli, Greece
Theodoros C. Constantinidis Laboratory of Hygiene and Environmental Protection, Medical School, Democritus University of Thrace, 68100 Alexandroupoli, Greece

Yang R, Liu H, Li Y. Quantifying uncertainty of marine water quality forecasts for environmental management using a dynamic multi-factor analysis and multi-resolution ensemble approach. CHEMOSPHERE 2023;331:138831. [PMID: 37137396 DOI: 10.1016/j.chemosphere.2023.138831] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/23/2023] [Revised: 04/25/2023] [Accepted: 04/30/2023] [Indexed: 05/05/2023]

Zheng HL, An SY, Qiao BJ, Guan P, Huang DS, Wu W. A data-driven interpretable ensemble framework based on tree models for forecasting the occurrence of COVID-19 in the USA. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2023;30:13648-13659. [PMID: 36131178 PMCID: PMC9492466 DOI: 10.1007/s11356-022-23132-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/12/2022] [Accepted: 09/16/2022] [Indexed: 06/15/2023]

Lučin I, Družeta S, Mauša G, Alvir M, Grbčić L, Lušić DV, Sikirica A, Kranjčević L. Predictive modeling of microbiological seawater quality in karst region using cascade model. THE SCIENCE OF THE TOTAL ENVIRONMENT 2022;851:158009. [PMID: 35987218 DOI: 10.1016/j.scitotenv.2022.158009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/05/2022] [Revised: 08/06/2022] [Accepted: 08/09/2022] [Indexed: 06/15/2023]

Abstract

This paper presents an in-depth analysis of seawater quality measurements during the bathing seasons from 2009 to 2020 in the city of Rijeka, Croatia. Because measurements with less than excellent water quality are rare, the considered dataset is deeply imbalanced. Additionally, it incorporates measurements under the influence of submerged groundwater discharges (SGD), which were observed in some bathing locations. These discharges were previously thought to dry up during the summer season and are now suspected to be one of the causes of increased Escherichia coli values. Consequently, and in view of the fact that the accuracy of prediction models can be significantly influenced by temporal and spatial variation of the input data, a novel cascade prediction modeling strategy was proposed. It consists of a sequence of prediction models that aim to identify general environmental conditions which confidently lead to excellent bathing water quality. The proposed model uses environmental features which can be estimated fairly easily or obtained from the weather forecast. The model was trained on a highly biased dataset, consisting of data from locations with and without SGD influence, and for a time period spanning extremely dry and warm seasons, extremely wet seasons, as well as normal seasons. To simulate realistic application, the model was tested using temporal and spatial stratification of data. The cascade strategy was shown to be a good approach for reliably detecting environmental parameters which produce excellent water quality. The proposed model is designed as a filter method, where instances classified as less-than-excellent water quality require further analysis. The cascade model provides great flexibility as it can be customized to the particular needs of the investigated area and dataset specifics.

Affiliation(s)

Ivana Lučin Department of Fluid Mechanics and Computational Engineering, Faculty of Engineering, University of Rijeka, Vukovarska 58, Rijeka 51000, Croatia; Center for Advanced Computing and Modelling, University of Rijeka, Radmile Matejčić 2, Rijeka 51000, Croatia
Siniša Družeta Department of Fluid Mechanics and Computational Engineering, Faculty of Engineering, University of Rijeka, Vukovarska 58, Rijeka 51000, Croatia; Center for Advanced Computing and Modelling, University of Rijeka, Radmile Matejčić 2, Rijeka 51000, Croatia
Goran Mauša Department of Computer Engineering, Faculty of Engineering, University of Rijeka, Vukovarska 58, Rijeka 51000, Croatia; Center for Advanced Computing and Modelling, University of Rijeka, Radmile Matejčić 2, Rijeka 51000, Croatia
Marta Alvir Department of Fluid Mechanics and Computational Engineering, Faculty of Engineering, University of Rijeka, Vukovarska 58, Rijeka 51000, Croatia
Luka Grbčić Department of Fluid Mechanics and Computational Engineering, Faculty of Engineering, University of Rijeka, Vukovarska 58, Rijeka 51000, Croatia; Center for Advanced Computing and Modelling, University of Rijeka, Radmile Matejčić 2, Rijeka 51000, Croatia
Darija Vukić Lušić Center for Advanced Computing and Modelling, University of Rijeka, Radmile Matejčić 2, Rijeka 51000, Croatia; Department of Environmental Health, Faculty of Medicine, University of Rijeka, Braće Branchetta 20/1, Rijeka 51000, Croatia; Department of Environmental Health, Teaching Institute of Public Health of Primorje-Gorski Kotar County, Krešimirova 52a, Rijeka 51000, Croatia
Ante Sikirica Department of Fluid Mechanics and Computational Engineering, Faculty of Engineering, University of Rijeka, Vukovarska 58, Rijeka 51000, Croatia; Center for Advanced Computing and Modelling, University of Rijeka, Radmile Matejčić 2, Rijeka 51000, Croatia
Lado Kranjčević Department of Fluid Mechanics and Computational Engineering, Faculty of Engineering, University of Rijeka, Vukovarska 58, Rijeka 51000, Croatia; Center for Advanced Computing and Modelling, University of Rijeka, Radmile Matejčić 2, Rijeka 51000, Croatia.

Prediction and Interpretation of Water Quality Recovery after a Disturbance in a Water Treatment System Using Artificial Intelligence. WATER 2022. [DOI: 10.3390/w14152423] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 02/01/2023]

Fei S, Hassan MA, Xiao Y, Su X, Chen Z, Cheng Q, Duan F, Chen R, Ma Y. UAV-based multi-sensor data fusion and machine learning algorithm for yield prediction in wheat. PRECISION AGRICULTURE 2022;24:187-212. [PMID: 35967193 PMCID: PMC9362526 DOI: 10.1007/s11119-022-09938-8] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Accepted: 06/30/2022] [Indexed: 05/31/2023]

Abstract

Early prediction of grain yield helps scientists to make better breeding decisions for wheat. Use of machine learning (ML) methods for fusion of unmanned aerial vehicle (UAV)-based multi-sensor data can improve the prediction accuracy of crop yield. For this, five ML algorithms including Cubist, support vector machine (SVM), deep neural network (DNN), ridge regression (RR) and random forest (RF) were used for multi-sensor data fusion and ensemble learning for grain yield prediction in wheat. A set of thirty wheat cultivars and breeding lines were grown under three irrigation treatments, i.e., light, moderate and high irrigation, to evaluate the yield prediction capabilities of a low-cost multi-sensor (RGB, multi-spectral and thermal infrared) UAV platform. Multi-sensor data fusion-based yield prediction showed higher accuracy compared to individual-sensor data in each ML model. The coefficient of determination (R²) values for the Cubist, SVM, DNN and RR models regarding grain yield prediction ranged from 0.527 to 0.670. Moreover, ensemble learning that integrated the above models further increased accuracy. The ensemble predictions reached R² values of up to 0.692, higher than those of the individual ML models across the multi-sensor data. Root mean square error (RMSE), residual prediction deviation (RPD) and ratio of prediction performance to inter-quartile range (RPIQ) were calculated to be 0.916 t ha⁻¹, 1.771 and 2.602, respectively. The results showed that low-altitude UAV-based multi-sensor data can be used for early grain yield prediction using data fusion and an ensemble learning framework with high accuracy. This high-throughput phenotyping approach is valuable for improving the efficiency of selection in large breeding activities.
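As a rough illustration of the four accuracy measures named in this abstract, the sketch below computes R², RMSE, RPD and RPIQ for paired observed and predicted yields. It assumes the common definitions (RPD as the standard deviation of the observations divided by RMSE, RPIQ as the inter-quartile range of the observations divided by RMSE, and R² as one minus the ratio of residual to total sum of squares); the abstract does not state which variants the authors used, and the function name `yield_metrics` and the sample numbers are hypothetical.

```python
import math
import statistics


def yield_metrics(observed, predicted):
    """Return R2, RMSE, RPD and RPIQ for paired observed/predicted values.

    Assumed definitions (not confirmed by the abstract):
      RMSE = sqrt(mean squared residual)
      R2   = 1 - SS_res / SS_tot
      RPD  = sample standard deviation of observed / RMSE
      RPIQ = (Q3 - Q1) of observed / RMSE
    """
    n = len(observed)
    residuals = [o - p for o, p in zip(observed, predicted)]
    ss_res = sum(r * r for r in residuals)
    mean_obs = sum(observed) / n
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)

    rmse = math.sqrt(ss_res / n)
    r2 = 1.0 - ss_res / ss_tot
    sd = statistics.stdev(observed)
    # Quartiles of the observed values; "inclusive" treats the data as the full population.
    q1, _, q3 = statistics.quantiles(observed, n=4, method="inclusive")
    return {"r2": r2, "rmse": rmse, "rpd": sd / rmse, "rpiq": (q3 - q1) / rmse}


# Hypothetical yields in t/ha, for illustration only.
metrics = yield_metrics([1.0, 2.0, 3.0, 4.0, 5.0], [1.5, 2.5, 2.5, 4.5, 4.5])
print(metrics)
```

Because RPD and RPIQ both divide a measure of spread in the observations by RMSE, larger values indicate that prediction error is small relative to the natural variability of the trait.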

SUPPLEMENTARY INFORMATION

The online version contains supplementary material available at 10.1007/s11119-022-09938-8.

Li Z, Zhang C, Liu H, Zhang C, Zhao M, Gong Q, Fu G. Developing stacking ensemble models for multivariate contamination detection in water distribution systems. THE SCIENCE OF THE TOTAL ENVIRONMENT 2022;828:154284. [PMID: 35247409 DOI: 10.1016/j.scitotenv.2022.154284] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Received: 11/16/2021] [Revised: 02/25/2022] [Accepted: 02/28/2022] [Indexed: 06/14/2023]

Zhu M, Wang J, Yang X, Zhang Y, Zhang L, Ren H, Wu B, Ye L. A review of the application of machine learning in water quality evaluation. ECO-ENVIRONMENT & HEALTH (ONLINE) 2022;1:107-116. [PMID: 38075524 PMCID: PMC10702893 DOI: 10.1016/j.eehl.2022.06.001] [Citation(s) in RCA: 41] [Impact Index Per Article: 20.5] [Received: 02/01/2022] [Revised: 05/19/2022] [Accepted: 06/01/2022] [Indexed: 12/31/2023]

Li L, Qiao J, Yu G, Wang L, Li HY, Liao C, Zhu Z. Interpretable tree-based ensemble model for predicting beach water quality. WATER RESEARCH 2022;211:118078. [PMID: 35066260 DOI: 10.1016/j.watres.2022.118078] [Citation(s) in RCA: 24] [Impact Index Per Article: 12.0] [Received: 08/14/2021] [Revised: 11/29/2021] [Accepted: 01/12/2022] [Indexed: 06/14/2023]

A Stacking Ensemble Learning Model for Monthly Rainfall Prediction in the Taihu Basin, China. WATER 2022. [DOI: 10.3390/w14030492] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Indexed: 12/27/2022]

Sokolova E, Ivarsson O, Lillieström A, Speicher NK, Rydberg H, Bondelind M. Data-driven models for predicting microbial water quality in the drinking water source using E. coli monitoring and hydrometeorological data. THE SCIENCE OF THE TOTAL ENVIRONMENT 2022;802:149798. [PMID: 34454142 DOI: 10.1016/j.scitotenv.2021.149798] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Received: 04/08/2021] [Revised: 07/08/2021] [Accepted: 08/16/2021] [Indexed: 06/13/2023]

Abstract

Rapid changes in microbial water quality in surface waters pose challenges for production of safe drinking water. If not treated to an acceptable level, microbial pathogens present in the drinking water can result in severe consequences for public health. The aim of this paper was to evaluate the suitability of data-driven models of different complexity for predicting the concentrations of E. coli in the river Göta älv at the water intake of the drinking water treatment plant in Gothenburg, Sweden. The objectives were to (i) assess how the complexity of the model affects the model performance; and (ii) identify relevant factors and assess their effect as predictors of E. coli levels. To forecast E. coli levels one day ahead, the data on laboratory measurements of E. coli and total coliforms, Colifast measurements of E. coli, water temperature, turbidity, precipitation, and water flow were used. The baseline approaches included Exponential Smoothing and ARIMA (Autoregressive Integrated Moving Average), which are commonly used univariate methods, and a naive baseline that used the previous observed value as its next prediction. Models common in the machine learning domain were also included: LASSO (Least Absolute Shrinkage and Selection Operator) Regression and Random Forest, together with TPOT (Tree-based Pipeline Optimization Tool), a tool for optimising machine learning pipelines. In addition, a multivariate autoregressive model, VAR (Vector Autoregression), was included. The models that included multiple predictors performed better than univariate models. Random Forest and TPOT achieved higher performance but showed a tendency to overfit. Water temperature, microbial concentrations upstream and at the water intake, and precipitation upstream were shown to be important predictors.
Data-driven modelling enables water producers to interpret the measurements in the context of what concentrations can be expected based on the recent historic data, and thus identify unexplained deviations warranting further investigation of their origin.
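The naive baseline mentioned in this abstract (the previous observed value used as the next prediction) can be sketched in a few lines. This is a minimal illustration, not the authors' code: the function names and the sample E. coli counts are hypothetical, and mean absolute error is used here only as one plausible way to score the baseline.

```python
def naive_one_step_forecast(series):
    """Persistence baseline: the forecast for time t is the observation at t-1.

    Returns forecasts aligned with series[1:], so the first observation has
    no forecast.
    """
    return list(series[:-1])


def mean_absolute_error(actual, predicted):
    """Average absolute difference between paired actual and predicted values."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


# Hypothetical daily E. coli counts, for illustration only.
counts = [10, 12, 9, 15]
forecasts = naive_one_step_forecast(counts)          # [10, 12, 9]
baseline_mae = mean_absolute_error(counts[1:], forecasts)
print(forecasts, baseline_mae)
```

A baseline like this sets the bar that more complex models need to clear: if a multivariate model cannot beat the persistence forecast on held-out days, its added complexity is hard to justify.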

Bourel M, Segura AM, Crisci C, López G, Sampognaro L, Vidal V, Kruk C, Piccini C, Perera G. Machine learning methods for imbalanced data set for prediction of faecal contamination in beach waters. WATER RESEARCH 2021;202:117450. [PMID: 34352535 DOI: 10.1016/j.watres.2021.117450] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Received: 12/11/2020] [Revised: 07/09/2021] [Accepted: 07/15/2021] [Indexed: 06/13/2023]

Affiliation(s)

Mathias Bourel IMERL, Facultad de Ingeniería, Universidad de la República, Montevideo, Uruguay; Departamento de Modelización Estadística de Datos e Inteligencia Artificial (MEDIA), Centro Universitario Regional Este, Universidad de la República, Rocha, Uruguay.
Angel M Segura Departamento de Modelización Estadística de Datos e Inteligencia Artificial (MEDIA), Centro Universitario Regional Este, Universidad de la República, Rocha, Uruguay
Carolina Crisci Departamento de Modelización Estadística de Datos e Inteligencia Artificial (MEDIA), Centro Universitario Regional Este, Universidad de la República, Rocha, Uruguay
Guzmán López Departamento de Modelización Estadística de Datos e Inteligencia Artificial (MEDIA), Centro Universitario Regional Este, Universidad de la República, Rocha, Uruguay
Lia Sampognaro Departamento de Modelización Estadística de Datos e Inteligencia Artificial (MEDIA), Centro Universitario Regional Este, Universidad de la República, Rocha, Uruguay
Victoria Vidal Departamento de Modelización Estadística de Datos e Inteligencia Artificial (MEDIA), Centro Universitario Regional Este, Universidad de la República, Rocha, Uruguay
Carla Kruk Departamento de Modelización Estadística de Datos e Inteligencia Artificial (MEDIA), Centro Universitario Regional Este, Universidad de la República, Rocha, Uruguay; Departamento de Microbiología, Instituto de Investigaciones Biológicas Clemente Estable, Ministerio de Educación y Cultura, Montevideo, Uruguay; Instituto de Ecología y Ciencias Ambientales, Facultad de Ciencias, Universidad de la República, Montevideo, Uruguay
Claudia Piccini Departamento de Modelización Estadística de Datos e Inteligencia Artificial (MEDIA), Centro Universitario Regional Este, Universidad de la República, Rocha, Uruguay; Departamento de Microbiología, Instituto de Investigaciones Biológicas Clemente Estable, Ministerio de Educación y Cultura, Montevideo, Uruguay
Gonzalo Perera Departamento de Modelización Estadística de Datos e Inteligencia Artificial (MEDIA), Centro Universitario Regional Este, Universidad de la República, Rocha, Uruguay

Heasley C, Sanchez JJ, Tustin J, Young I. Systematic review of predictive models of microbial water quality at freshwater recreational beaches. PLoS One 2021;16:e0256785. [PMID: 34437625 PMCID: PMC8389397 DOI: 10.1371/journal.pone.0256785] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Received: 05/04/2021] [Accepted: 08/14/2021] [Indexed: 11/19/2022] Open

Abstract

Monitoring of fecal indicator bacteria at recreational waters is an important public health measure to minimize water-borne disease; however, traditional culture methods for quantifying bacteria can take 18-24 hours to obtain a result. To support real-time notifications of water quality, models using environmental variables have been created to predict indicator bacteria levels on the day of sampling. We conducted a systematic review of predictive models of fecal indicator bacteria at freshwater recreational sites in temperate climates to identify and describe the existing approaches, trends, and their performance to inform beach water management policies. We conducted a comprehensive search strategy, including five databases and grey literature, screened abstracts for relevance, and extracted data using structured forms. Data were descriptively summarized. A total of 53 relevant studies were identified. Most studies (n = 44, 83%) were conducted in the United States and evaluated water quality using E. coli as fecal indicator bacteria (n = 46, 87%). Studies were primarily conducted in lakes (n = 40, 75%) compared to rivers (n = 13, 25%). The most commonly reported predictive model-building method was multiple linear regression (n = 37, 70%). Frequently used predictors in best-fitting models included rainfall (n = 39, 74%), turbidity (n = 31, 58%), wave height (n = 24, 45%), and wind speed and direction (n = 25, 47%, and n = 23, 43%, respectively). Of the 19 (36%) studies that measured accuracy, predictive models averaged an 81.0% accuracy, and all but one were more accurate than traditional methods. Limitations identified by risk-of-bias assessment included not validating models (n = 21, 40%), limited reporting of whether modelling assumptions were met (n = 40, 75%), and lack of reporting on handling of missing data (n = 37, 70%).
Additional research is warranted on the utility and accuracy of more advanced predictive modelling methods, such as Bayesian networks and artificial neural networks, which were investigated in comparatively fewer studies, and on creating risk-of-bias tools for non-medical predictive modelling.

Ye GH, Alim M, Guan P, Huang DS, Zhou BS, Wu W. Improving the precision of modeling the incidence of hemorrhagic fever with renal syndrome in mainland China with an ensemble machine learning approach. PLoS One 2021;16:e0248597. [PMID: 33725011 PMCID: PMC7963064 DOI: 10.1371/journal.pone.0248597] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 11/25/2020] [Accepted: 03/02/2021] [Indexed: 11/19/2022] Open

Abstract

OBJECTIVE

Hemorrhagic fever with renal syndrome (HFRS), one of the main public health concerns in mainland China, is a group of clinically similar diseases caused by hantaviruses. Statistical approaches have always been leveraged to forecast the future incidence rates of certain infectious diseases to effectively control their prevalence and outbreak potential. Compared to the use of one base model, model stacking can often produce better forecasting results. In this study, we fitted the monthly reported cases of HFRS in mainland China with a model stacking approach and compared its forecasting performance with those of five base models.

METHOD

We fitted the monthly reported cases of HFRS ranging from January 2004 to June 2019 in mainland China with five base models: an autoregressive integrated moving average (ARIMA) model; the Holt-Winters (HW) method; seasonal decomposition of the time series by LOESS (STL); a neural network autoregressive (NNAR) model; and an exponential smoothing state space model with a Box-Cox transformation, ARMA errors, and trend and seasonal components (TBATS). We then combined the forecasting results with the inverse rank approach. The forecasting performance was estimated based on several accuracy criteria for model prediction, including the mean absolute percentage error (MAPE), root-mean-squared error (RMSE) and mean absolute error (MAE).
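The combination step and the three accuracy criteria described above can be sketched as follows. This is a minimal illustration under stated assumptions: it uses the usual form of inverse rank weighting, in which each base model receives a weight proportional to the reciprocal of its rank by some error measure (rank 1 for the smallest error). The abstract does not say which error measure was used for ranking, and all function names and numbers here are hypothetical.

```python
import math


def mape(actual, predicted):
    """Mean absolute percentage error, in percent (actual values must be non-zero)."""
    return 100.0 * sum(abs((a - p) / a) for a, p in zip(actual, predicted)) / len(actual)


def rmse(actual, predicted):
    """Root-mean-squared error."""
    return math.sqrt(sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual))


def mae(actual, predicted):
    """Mean absolute error."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


def inverse_rank_weights(errors):
    """Weight each model by 1/rank, normalised to sum to 1 (rank 1 = smallest error).

    Ties are broken by position in the list, which is an arbitrary choice.
    """
    order = sorted(range(len(errors)), key=lambda i: errors[i])
    ranks = [0] * len(errors)
    for rank, model_index in enumerate(order, start=1):
        ranks[model_index] = rank
    inverse = [1.0 / r for r in ranks]
    total = sum(inverse)
    return [w / total for w in inverse]


def combine_forecasts(forecasts, weights):
    """Weighted average of several models' forecasts, one value per time step."""
    horizon = len(forecasts[0])
    return [sum(w * f[t] for w, f in zip(weights, forecasts)) for t in range(horizon)]


# Hypothetical in-sample errors for three base models, and their one-step forecasts.
weights = inverse_rank_weights([3.0, 1.0, 2.0])      # ranks 3, 1, 2
combined = combine_forecasts([[11.0], [22.0], [33.0]], weights)
print(weights, combined)
```

Inverse rank weighting depends only on the ordering of the base models, not on the size of the error gaps between them, which makes the weights less sensitive to a single unusually good or bad in-sample fit.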

RESULT

There was a slight downward trend and obvious seasonal periodicity inherent in the time series data for HFRS in mainland China. The model stacking method performed best in terms of both fitting (RMSE 128.19, MAE 85.63, MAPE 8.18) and prediction (RMSE 151.86, MAE 118.28, MAPE 13.16).

CONCLUSION

The results showed that model stacking by using the optimal mean forecasting weight of the five abovementioned models achieved the best performance in terms of predicting HFRS one year into the future. This study has corroborated the conclusion that model stacking is an easy way to enhance prediction accuracy when modeling HFRS.
