Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Yi HS, Park S, An KG, Kwak KC. Algal Bloom Prediction Using Extreme Learning Machine Models at Artificial Weirs in the Nakdong River, Korea. Int J Environ Res Public Health 2018;15:E2078. [PMID: 30248912 DOI: 10.3390/ijerph15102078] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/09/2018] [Revised: 09/18/2018] [Accepted: 09/19/2018] [Indexed: 11/17/2022]

For:	Yi HS, Park S, An KG, Kwak KC. Algal Bloom Prediction Using Extreme Learning Machine Models at Artificial Weirs in the Nakdong River, Korea. Int J Environ Res Public Health 2018;15:E2078. [PMID: 30248912 DOI: 10.3390/ijerph15102078] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/09/2018] [Revised: 09/18/2018] [Accepted: 09/19/2018] [Indexed: 11/17/2022]

Number

Cited by Other Article(s)

Miller T, Michoński G, Durlik I, Kozlovska P, Biczak P. Artificial Intelligence in Aquatic Biodiversity Research: A PRISMA-Based Systematic Review. BIOLOGY 2025;14:520. [PMID: 40427709 PMCID: PMC12109572 DOI: 10.3390/biology14050520] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/10/2025] [Revised: 04/30/2025] [Accepted: 05/06/2025] [Indexed: 05/29/2025]

Jeong B, Shin H, Shin J, Cha Y. The analysis of spatiotemporal effects of environmental factors on harmful algal blooms in a bloom-prone river using partial least squares structural equation modeling. WATER SCIENCE AND TECHNOLOGY : A JOURNAL OF THE INTERNATIONAL ASSOCIATION ON WATER POLLUTION RESEARCH 2025;91:1128-1140. [PMID: 40448456 DOI: 10.2166/wst.2025.066] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/26/2024] [Accepted: 03/02/2025] [Indexed: 06/02/2025]

Sheik AG, Sireesha M, Kumar A, Dasari PR, Patnaik R, Bagchi SK, Ansari FA, Bux F. The role of industry 4.0 enabling technologies for predicting, and managing of algal blooms: Bridging gaps and unlocking potential. MARINE POLLUTION BULLETIN 2025;212:117493. [PMID: 39740519 DOI: 10.1016/j.marpolbul.2024.117493] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/23/2024] [Revised: 12/19/2024] [Accepted: 12/19/2024] [Indexed: 01/02/2025]

Kim JH, Byeon S, Lee H, Lee DH, Lee MY, Shin JK, Chon K, Jeong DS, Park Y. Deep-learning and data-resampling: A novel approach to predict cyanobacterial alert levels in a reservoir. ENVIRONMENTAL RESEARCH 2024;263:120135. [PMID: 39393456 DOI: 10.1016/j.envres.2024.120135] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/18/2024] [Revised: 10/05/2024] [Accepted: 10/08/2024] [Indexed: 10/13/2024]

Abstract

The proliferation of harmful algal blooms results in adverse impacts on aquatic ecosystems and public health. Early warning system monitors algal bloom occurrences and provides management strategies for promptly addressing high-concentration algal blooms following their occurrence. In this study, we aimed to develop a proactive prediction model for cyanobacterial alert levels to enable efficient decision-making in management practices. We utilized 11 years of water quality, hydrodynamic, and meteorological data from a reservoir that experiences frequent harmful cyanobacterial blooms in summer. We used these data to construct a deep-learning model, specifically a 1D convolution neural network (1D-CNN) model, to predict cyanobacterial alert levels one week in advance. However, the collected distribution of algal alert levels was imbalanced, leading to the biased training of data-driven models and performance degradation in model predictions. Therefore, an adaptive synthetic sampling method was applied to address the imbalance in the minority class data and improve the predictive performance of the 1D-CNN. The adaptive synthetic sampling method resolved the imbalance in the data during the training phase by incorporating an additional 156 and 196 data points for the caution and warning levels, respectively. The selected optimal 1D-CNN model with a filter size of 5 and comprising 16 filters achieved training and testing prediction accuracies of 97.3% and 85.0%, respectively. During the test phase, the prediction accuracies for each algal alert level (L-0, L-1, and L-2) were 89.9%, 79.2%, and 71.4%, respectively, indicating reasonably consistent predictive results for all three alert levels. Therefore, the use of synthetic data addressed data imbalances and enhanced the predictive performance of the data-driven model. The reliable forecasts produced by the improved model can support the development of management strategies to mitigate harmful algal blooms in reservoirs and can aid in building an early warning system to facilitate effective responses.

Collapse

Park J, Seong B, Park Y, Lee WH, Heo TY. Explainable artificial intelligence for the interpretation of ensemble learning performance in algal bloom estimation. WATER ENVIRONMENT RESEARCH : A RESEARCH PUBLICATION OF THE WATER ENVIRONMENT FEDERATION 2024;96:e11140. [PMID: 39382139 DOI: 10.1002/wer.11140] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/09/2024] [Revised: 08/26/2024] [Accepted: 09/18/2024] [Indexed: 10/10/2024]

Abstract

Chlorophyll-a (Chl-a) concentrations, a key indicator of algal blooms, were estimated using the XGBoost machine learning model with 23 variables, including water quality and meteorological factors. The model performance was evaluated using three indices: root mean square error (RMSE), RMSE-observation standard deviation ratio (RSR), and Nash-Sutcliffe efficiency. Nine datasets were created by averaging 1 hour data to cover time frequencies ranging from 1 hour to 1 month. The dataset with relatively high observation frequencies (1-24 h) maintained stability, with an RSR ranging between 0.61 and 0.65. However, the model's performance declined significantly for datasets with weekly and monthly intervals. The Shapley value (SHAP) analysis, an explainable artificial intelligence method, was further applied to provide a quantitative understanding of how environmental factors in the watershed impact the model's performance and is also utilized to enhance the practical applicability of the model in the field. The number of input variables for model construction increased sequentially from 1 to 23, starting from the variable with the highest SHAP value to that with the lowest. The model's performance plateaued after considering five or more variables, demonstrating that stable performance could be achieved using only a small number of variables, including relatively easily measured data collected by real-time sensors, such as pH, dissolved oxygen, and turbidity. This result highlights the practicality of employing machine learning models and real-time sensor-based measurements for effective on-site water quality management. PRACTITIONER POINTS: XAI quantifies the effects of environmental factors on algal bloom prediction models The effects of input variable frequency and seasonality were analyzed using XAI XAI analysis on key variables ensures cost-effective model development.

Collapse

Lee B, Im JK, Han JW, Kang T, Kim W, Kim M, Lee S. Multiple remotely sensed datasets and machine learning models to predict chlorophyll-a concentration in the Nakdong River, South Korea. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2024;31:58505-58526. [PMID: 39316212 DOI: 10.1007/s11356-024-35005-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/04/2024] [Accepted: 09/13/2024] [Indexed: 09/25/2024]

Abstract

The Nakdong River is a crucial water resource in South Korea, supplying water for various purposes such as potable water, irrigation, and recreation. However, the river is vulnerable to algal blooms due to the inflow of pollutants from multiple points and non-point sources. Monitoring chlorophyll-a (Chl-a) concentrations, a proxy for algal biomass is essential for assessing the trophic status of the river and managing its ecological health. This study aimed to improve the accuracy and reliability of Chl-a estimation in the Nakdong River using machine learning models (MLMs) and simultaneous use of multiple remotely sensed datasets. This study compared the performances of four MLMs: multi-layer perceptron (MLP), support vector machine (SVM), random forest (RF), and eXetreme Gradient Boosting (XGB) using three different input datasets: (1) two remotely sensed datasets (Sentinel-2 and Landsat-8), (2) standalone Sentinel-2, and (3) standalone Landsat-8. The results showed that the MLP model with multiple remotely sensed datasets outperformed other MLMs with 0.43 - 0.86 greater in R2 and 0.36 - 5.88 lower in RMSE. The MLP model demonstrated the highest performance across the range of Chl-a concentrations and predicted peaks above 20 mg/m3 relatively well compared to other models. This was likely due to the capacity of MLP to handle imbalanced datasets. The predictive map of the spatial distribution of Chl-a generated by MLP well captured the areas with high and low Chl-a concentrations. This study pointed out the impacts of imbalanced Chl-a concentration observations (dominated by low Chl-a concentrations) on the performance of MLMs. The data imbalance likely led to MLMs poorly trained for high Chl-a values, producing low prediction accuracy. In conclusion, this study demonstrated the value of multiple remotely sensed datasets in enhancing the accuracy and reliability of Chl-a estimation, mainly when using the MLP model. These findings would provide valuable insights into utilizing MLMs effectively for Chl-a monitoring.

Collapse

Ugulen HS, Koestner D, Sandven H, Hamre B, Kristoffersen AS, Saetre C. Neural network approach for correction of multiple scattering errors in the LISST-VSF instrument. OPTICS EXPRESS 2023;31:32737-32751. [PMID: 37859069 DOI: 10.1364/oe.495523] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/16/2023] [Accepted: 08/31/2023] [Indexed: 10/21/2023]

Kim J, Jung W, An J, Oh HJ, Park J. Self-optimization of training dataset improves forecasting of cyanobacterial bloom by machine learning. THE SCIENCE OF THE TOTAL ENVIRONMENT 2023;866:161398. [PMID: 36621510 DOI: 10.1016/j.scitotenv.2023.161398] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/30/2022] [Revised: 11/30/2022] [Accepted: 01/01/2023] [Indexed: 06/17/2023]

Abstract

Data-driven model (DDM) prediction of aquatic ecological responses, such as cyanobacterial harmful algal blooms (CyanoHABs), is critically influenced by the choice of training dataset. However, a systematic method to choose the optimal training dataset considering data history has not yet been developed. Providing a comprehensive procedure with self-based optimal training dataset-selecting algorithm would self-improve the DDM performance. In this study, a novel algorithm was developed to self-generate possible training dataset candidates from the available input and output variable data and self-choose the optimal training dataset that maximizes CyanoHAB forecasting performance. Nine years of meteorological and water quality data (input) and CyanoHAB data (output) from a site on the Nakdong River, South Korea, were acquired and pretreated via an automated process. An artificial neural network (ANN) was chosen from among the DDM candidates by first-cut training and validation using the entire collected dataset. Optimal training datasets for the ANN were self-selected from among the possible self-generated training datasets by systematically simulating the performance in response to 46 periods and 40 sizes (number of data elements) of the generated training datasets. The best-performing models were screened to identify the candidate models. The best performance corresponded to 6-7 years of training data (∼18 % lower error) for forecasting 1-28 d ahead (1-28 d of forecasting lead time (FLT)). After the hyperparameters of the screened model candidates were fine-tuned, the best-performing model (7 years of data with 14 d FLT) was self-determined by comparing the forecasts with unseen CyanoHAB events. The self-determined model could reasonably predict CyanoHABs occurring in Korean waters (cyanobacteria cells/mL ≥ 1000). Thus, our proposed method of self-optimizing the training dataset effectively improved the predictive accuracy and operational efficiency of the DDM prediction of CyanoHAB.

Collapse

Wen J, Yang J, Li Y, Gao L. Harmful algal bloom warning based on machine learning in maritime site monitoring. Knowl Based Syst 2022. [DOI: 10.1016/j.knosys.2022.108569] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]

A Classification-Based Machine Learning Approach to the Prediction of Cyanobacterial Blooms in Chilgok Weir, South Korea. WATER 2022. [DOI: 10.3390/w14040542] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]

Xia R, Zou L, Zhang Y, Zhang Y, Chen Y, Liu C, Yang Z, Ma S. Algal bloom prediction influenced by the Water Transfer Project in the Middle-lower Hanjiang River. Ecol Modell 2022. [DOI: 10.1016/j.ecolmodel.2021.109814] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]

Liu L, Wang M, Li G, Wang Q. Construction of Predictive Model for Type 2 Diabetic Retinopathy Based on Extreme Learning Machine. Diabetes Metab Syndr Obes 2022;15:2607-2617. [PMID: 36046759 PMCID: PMC9420743 DOI: 10.2147/dmso.s374767] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/20/2022] [Accepted: 08/18/2022] [Indexed: 12/02/2022] Open

Abstract

PURPOSE

The common cause of blindness in people with type 2 diabetes (T2D) is diabetic retinopathy (DR). Early fundus examinations have been shown to prevent vision loss, but routine ophthalmic screenings for patients with diabetes present significant financial and material challenges to existing health-care systems. The purpose of this study is to build a DR prediction model based on the extreme learning machine (ELM) and to compare the performance with the DR prediction models based on support machine vector (SVM), K proximity (KNN), random forest (RF) and artificial neural network (ANN).

METHODS

From January 1, 2020 to November 31, 2021, data were collected from electronic inpatient medical records at Lu'an Hospital of Anhui Medical University in China. An extreme learning machine (ELM) algorithm was used to develop a prediction model based on demographic data and blood testing and urine test results. Several metrics were used to evaluate the model's performance: (1) classification accuracy (ACC), (2) sensitivity, (3) specificity, (4) Precision,(5) Negative predictive value (NPV), (6) Training time and (7) area under the receiver operating characteristic (ROC) curve (AUC).

RESULTS

In terms of ACC, Sensitivity, Specificity, Precision, NPV and AUC, DR prediction model based on SVM and ELM is better than DR prediction model based on ANN, KNN and RF. The prediction model for diabetic retinopathy based on elm is the best among them in terms of ACC, Precision, Specificity, Training time and AUC, with 84.45%, 83.93%, 93.16%,1.24s, and 88.34%, respectively. The DR prediction model based on SVM is the best in terms of sensitivity and NPV, which are, respectively, 70.82% and 85.60%.

CONCLUSION

According to the findings of this study, the model based on the extreme learning machine presents an outstanding performance in predicting diabetic retinopathy thus providing technological assistance for screening of diabetic retinopathy.

Collapse

Kim JH, Shin JK, Lee H, Lee DH, Kang JH, Cho KH, Lee YG, Chon K, Baek SS, Park Y. Improving the performance of machine learning models for early warning of harmful algal blooms using an adaptive synthetic sampling method. WATER RESEARCH 2021;207:117821. [PMID: 34781184 DOI: 10.1016/j.watres.2021.117821] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/17/2021] [Revised: 10/23/2021] [Accepted: 10/26/2021] [Indexed: 06/13/2023]

Abstract

Many countries have attempted to monitor and predict harmful algal blooms to mitigate related problems and establish management practices. The current alert system-based sampling of cell density is used to intimate the bloom status and to inform rapid and adequate response from water-associated organizations. The objective of this study was to develop an early warning system for cyanobacterial blooms to allow for efficient decision making prior to the occurrence of algal blooms and to guide preemptive actions regarding management practices. In this study, two machine learning models: artificial neural network (ANN) and support vector machine (SVM), were constructed for the timely prediction of alert levels of algal bloom using eight years' worth of meteorological, hydrodynamic, and water quality data in a reservoir where harmful cyanobacterial blooms frequently occur during summer. However, the proportion imbalance on all alert level data as the output variable leads to biased training of the data-driven model and degradation of model prediction performance. Therefore, the synthetic data generated by an adaptive synthetic (ADASYN) sampling method were used to resolve the imbalance of minority class data in the original data and to improve the prediction performance of the models. The results showed that the overall prediction performance yielded by the caution level (L1) and warning level (L2) in the models constructed using a combination of original and synthetic data was higher than the models constructed using original data only. In particular, the optimal ANN and SVM constructed using a combination of original and synthetic data during both training (including validation) and test generated distinctively improved recall and precision values of L1, which is a very critical alert level as it indicates a transition status from normalcy to bloom formation. In addition, both optimal models constructed using synthetic-added data exhibited improvement in recall and precision by more than 33.7% while predicting L-1 and L-2 during the test. Therefore, the application of synthetic data can improve detection performance of machine learning models by solving the imbalance of observed data. Reliable prediction by the improved models can be used to aid the design of management practices to mitigate algal blooms within a reservoir.

Collapse

Ly QV, Nguyen XC, Lê NC, Truong TD, Hoang THT, Park TJ, Maqbool T, Pyo J, Cho KH, Lee KS, Hur J. Application of Machine Learning for eutrophication analysis and algal bloom prediction in an urban river: A 10-year study of the Han River, South Korea. THE SCIENCE OF THE TOTAL ENVIRONMENT 2021;797:149040. [PMID: 34311376 DOI: 10.1016/j.scitotenv.2021.149040] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/20/2021] [Revised: 06/29/2021] [Accepted: 07/10/2021] [Indexed: 06/13/2023]

Abstract

The increasing release of nutrients to aquatic environments has led to great concern regarding eutrophication and the risk of unwanted algal blooms. Based on observational data of 20 water quality parameters measured on a monthly basis at 40 stations from 2011 to 2020, this study applied different Machine Learning (ML) algorithms to suggest the best option for algal bloom prediction in the Han River, a large river in South Korea. Eight different ML algorithms were categorized into several groups of statistical learning, regression family, and deep learning, and were then compared for their suitability to predict the chlorophyll-derived trophic index (TSI-Chla). ML algorithms helped identify the most important water quality parameters contributing to algal bloom prediction. The ML results confirmed that eutrophication and algal proliferation were governed by the complex interplay between nutrients (nitrogen and phosphorus), organic contaminants, and environmental factors. Of the models tested, the adaptive neuro-fuzzy inference system (ANFIS) exhibited the best performance owing to its consistent and outperforming prediction both quantitatively (i.e., via regression) and qualitatively (i.e., via classification), which was evidenced by the lowest value of mean absolute error (MAE) of 0.09, and the highest F1-score, Recall and Precision of 0.97, 0.98 and 0.96, respectively. In a further step, a representative web application was constructed to assist common users to predict the trophic status of the Han River. This study demonstrated that ML techniques are not only promising for highly accurate water quality modeling of urban rivers, but also reduce time and labor intensity for experiments, which decreases the number of monitored water quality parameters, providing further insights into the driving factors of water quality deterioration. They ultimately help devise proactive strategies for sustainable water management.

Collapse

Sadeghi H, Mohandes SR, Hosseini MR, Banihashemi S, Mahdiyar A, Abdullah A. Developing an Ensemble Predictive Safety Risk Assessment Model: Case of Malaysian Construction Projects. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2020;17:ijerph17228395. [PMID: 33202768 PMCID: PMC7696253 DOI: 10.3390/ijerph17228395] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/15/2020] [Revised: 10/02/2020] [Accepted: 10/15/2020] [Indexed: 11/16/2022]

Alsayed A, Sadir H, Kamil R, Sari H. Prediction of Epidemic Peak and Infected Cases for COVID-19 Disease in Malaysia, 2020. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2020;17:E4076. [PMID: 32521641 PMCID: PMC7312594 DOI: 10.3390/ijerph17114076] [Citation(s) in RCA: 36] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/08/2020] [Revised: 05/08/2020] [Accepted: 05/11/2020] [Indexed: 12/12/2022]

Using Machine-Learning Algorithms for Eutrophication Modeling: Case Study of Mar Menor Lagoon (Spain). INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2020;17:ijerph17041189. [PMID: 32069834 PMCID: PMC7068380 DOI: 10.3390/ijerph17041189] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/16/2020] [Revised: 02/07/2020] [Accepted: 02/09/2020] [Indexed: 11/16/2022]

Hussein AM, Abd Elaziz M, Abdel Wahed MS, Sillanpää M. A new approach to predict the missing values of algae during water quality monitoring programs based on a hybrid moth search algorithm and the random vector functional link network. JOURNAL OF HYDROLOGY 2019;575:852-863. [DOI: 10.1016/j.jhydrol.2019.05.073] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/02/2023]