1
|
Elsayed A, Rixon S, Levison J, Binns A, Goel P. Application of classification machine learning algorithms for characterizing nutrient transport in a clay plain agricultural watershed. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2023; 345:118924. [PMID: 37678017 DOI: 10.1016/j.jenvman.2023.118924] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/06/2023] [Revised: 08/28/2023] [Accepted: 08/30/2023] [Indexed: 09/09/2023]
Abstract
Excess nutrients in surface water and groundwater can lead to water quality deterioration in available water resources. Thus, the classification of nutrient concentrations in water resources has gained significant attention during recent decades. Machine learning (ML) algorithms are considered an efficient tool to describe nutrient loss from agricultural land to surface water and groundwater. Previous studies have applied regression and classification ML algorithms to predict nutrient concentrations in surface water and/or groundwater, or to categorize an output variable using a limited number of input variables. However, there have been no studies that examined the application of different ML classification algorithms in agricultural settings to classify various output variables using a wide range of input variables. In this study, twenty-four ML classification algorithms were implemented on a dataset from three locations within the Upper Parkhill watershed, an agricultural watershed in southern Ontario, Canada. Nutrient concentrations in surface water were classified using geochemical and physical water parameters of surface water and groundwater (e.g., pH), climate and field conditions as the input variables. The performance of these algorithms was evaluated using four evaluation metrics (e.g., classification accuracy) to identify the optimal algorithm for classifying the output variables. Ensemble bagged trees was found to be the optimal ML algorithm for classifying nitrate concentration in surface water (accuracy of 90.9%), while the weighted KNN was the most appropriate algorithm for categorizing the total phosphorus concentration (accuracy of 87%). The ensemble subspace discriminant algorithm gave the highest overall classification accuracy for the concentration of soluble reactive phosphorus and total dissolved phosphorus in surface water with an accuracy of 79.2% and 77.9%, respectively. This study exemplifies that ML algorithms can be used to signify exceedance of recommended concentrations of nutrients in surface waters in agricultural watersheds. Results are useful for decision makers to develop nutrient management strategies.
Collapse
Affiliation(s)
- Ahmed Elsayed
- School of Engineering, Morwick G360 Groundwater Research Institute, University of Guelph, 50 Stone Road East, Guelph, Ontario, N1G 2W1, Canada; Irrigation and Hydraulics Department, Faculty of Engineering, Cairo University, 1 Gamaa Street, Giza, 12613, Egypt.
| | - Sarah Rixon
- School of Engineering, Morwick G360 Groundwater Research Institute, University of Guelph, 50 Stone Road East, Guelph, Ontario, N1G 2W1, Canada
| | - Jana Levison
- School of Engineering, Morwick G360 Groundwater Research Institute, University of Guelph, 50 Stone Road East, Guelph, Ontario, N1G 2W1, Canada
| | - Andrew Binns
- School of Engineering, Morwick G360 Groundwater Research Institute, University of Guelph, 50 Stone Road East, Guelph, Ontario, N1G 2W1, Canada
| | - Pradeep Goel
- Ministry of the Environment, Conservation and Parks (MECP), 125 Resources Road, Etobicoke, Ontario, M9P 3V6, Canada
| |
Collapse
|
2
|
Han H, Kim B, Kim K, Kim D, Kim HS. Machine learning approach for the estimation of missing precipitation data: a case study of South Korea. WATER SCIENCE AND TECHNOLOGY : A JOURNAL OF THE INTERNATIONAL ASSOCIATION ON WATER POLLUTION RESEARCH 2023; 88:556-571. [PMID: 37578874 PMCID: wst_2023_237 DOI: 10.2166/wst.2023.237] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/16/2023]
Abstract
Precipitation is one of the driving forces in water cycles, and it is vital for understanding the water cycle, such as surface runoff, soil moisture, and evapotranspiration. However, missing precipitation data at the observatory becomes an obstacle to improving the accuracy and efficiency of hydrological analysis. To address this issue, we developed a machine learning algorithm-based precipitation data recovery tool to detect and predict missing precipitation data at observatories. This study investigated 30 weather stations in South Korea, evaluating the applicability of machine learning algorithms (artificial neural network and random forest) for precipitation data recovery using environmental variables, such as air pressure, temperature, humidity, and wind speed. The proposed model showed a high performance in detecting the missing precipitation occurrence with an accuracy of 80%. In addition, the prediction results from the models showed predictive ability with a correlation coefficient ranging from 0.5 to 0.7 and R2 values of 0.53. Although both algorithms performed similarly in estimating precipitation, ANN performed slightly better. Based on the results of this study, we expect that the machine learning algorithms can contribute to improving hydrological modeling performance by recovering missing precipitation data at observation stations.
Collapse
Affiliation(s)
- Heechan Han
- Department of Civil Engineering, Chosun University, Gwangju, South Korea E-mail:
| | - Boran Kim
- Department of Civil and Environmental Engineering, Colorado State University, Fort Collins, CO, USA
| | - Kyunghun Kim
- Department of Civil Engineering, Inha University, Incheon, South Korea
| | - Donghyun Kim
- Institute of Water Resource System, Inha University, Incheon, South Korea
| | - Hung Soo Kim
- Department of Civil Engineering, Inha University, Incheon, South Korea
| |
Collapse
|