Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Tyralis H, Papacharalampous G, Langousis A. A Brief Review of Random Forests for Water Scientists and Practitioners and Their Recent History in Water Resources. Water 2019;11:910. [DOI: 10.3390/w11050910] [Citation(s) in RCA: 93] [Impact Index Per Article: 18.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]

For:	Tyralis H, Papacharalampous G, Langousis A. A Brief Review of Random Forests for Water Scientists and Practitioners and Their Recent History in Water Resources. Water 2019;11:910. [DOI: 10.3390/w11050910] [Citation(s) in RCA: 93] [Impact Index Per Article: 18.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]

Number

Cited by Other Article(s)

Classifying Crop Types Using Two Generations of Hyperspectral Sensors (Hyperion and DESIS) with Machine Learning on the Cloud. REMOTE SENSING 2021. [DOI: 10.3390/rs13224704] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]

Abstract Advances in spaceborne hyperspectral (HS) remote sensing, cloud-computing, and machine learning can help measure, model, map and monitor agricultural crops to address global food and water security issues, such as by providing accurate estimates of crop area and yield to model agricultural productivity. Leveraging these advances, we used the Earth Observing-1 (EO-1) Hyperion historical archive and the new generation DLR Earth Sensing Imaging Spectrometer (DESIS) data to evaluate the performance of hyperspectral narrowbands in classifying major agricultural crops of the U.S. with machine learning (ML) on Google Earth Engine (GEE). EO-1 Hyperion images from the 2010–2013 growing seasons and DESIS images from the 2019 growing season were used to classify three world crops (corn, soybean, and winter wheat) along with other crops and non-crops near Ponca City, Oklahoma, USA. The supervised classification algorithms: Random Forest (RF), Support Vector Machine (SVM), and Naive Bayes (NB), and the unsupervised clustering algorithm WekaXMeans (WXM) were run using selected optimal Hyperion and DESIS HS narrowbands (HNBs). RF and SVM returned the highest overall producer’s, and user’s accuracies, with the performances of NB and WXM being substantially lower. The best accuracies were achieved with two or three images throughout the growing season, especially a combination of an earlier month (June or July) and a later month (August or September). The narrow 2.55 nm bandwidth of DESIS provided numerous spectral features along the 400–1000 nm spectral range relative to smoother Hyperion spectral signatures with 10 nm bandwidth in the 400–2500 nm spectral range. Out of 235 DESIS HNBs, 29 were deemed optimal for agricultural study. Advances in ML and cloud-computing can greatly facilitate HS data analysis, especially as more HS datasets, tools, and algorithms become available on the Cloud. Collapse

Parsimonious Models of Precipitation Phase Derived from Random Forest Knowledge: Intercomparing Logistic Models, Neural Networks, and Random Forest Models. WATER 2021. [DOI: 10.3390/w13213022] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]

Paepae T, Bokoro PN, Kyamakya K. From Fully Physical to Virtual Sensing for Water Quality Assessment: A Comprehensive Review of the Relevant State-of-the-Art. SENSORS (BASEL, SWITZERLAND) 2021;21:6971. [PMID: 34770278 PMCID: PMC8587795 DOI: 10.3390/s21216971] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/30/2021] [Revised: 10/17/2021] [Accepted: 10/17/2021] [Indexed: 12/17/2022]

Yang H, Huang K, Zhang K, Weng Q, Zhang H, Wang F. Predicting Heavy Metal Adsorption on Soil with Machine Learning and Mapping Global Distribution of Soil Adsorption Capacities. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2021;55:14316-14328. [PMID: 34617744 DOI: 10.1021/acs.est.1c02479] [Citation(s) in RCA: 66] [Impact Index Per Article: 22.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]

Upscaling Evapotranspiration from a Single-Site to Satellite Pixel Scale. REMOTE SENSING 2021. [DOI: 10.3390/rs13204072] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Abstract It is of great significance for the validation of remotely sensed evapotranspiration (ET) products to solve the spatial-scale mismatch between site observations and remote sensing estimations. To overcome this challenge, this paper proposes a comprehensive framework for obtaining the ground truth ET at the satellite pixel scale (1 × 1 km resolution in MODIS satellite imagery). The main idea of this framework is to first quantitatively evaluate the spatial heterogeneity of the land surface, then combine the eddy covariance (EC)-observed ET (ET_EC) to be able to compare and optimize the upscaling methods (among five data-driven and three mechanism-driven methods) through direct validation and cross-validation, and finally use the optimal method to obtain the ground truth ET at the satellite pixel scale. The results showed that the ET_EC was superior over homogeneous underlying surfaces with a root mean square error (RMSE) of 0.34 mm/d. Over moderately and highly heterogeneous underlying surfaces, the Gaussian process regression (GPR) method performed better (the RMSEs were 0.51 mm/d and 0.60 mm/d, respectively). Finally, an integrated method (namely, using the ET_EC for homogeneous surfaces and the GPR method for moderately and highly heterogeneous underlying surfaces) was proposed to obtain the ground truth ET over fifteen typical underlying surfaces in the Heihe River Basin. Furthermore, the uncertainty of ground truth ET was quantitatively evaluated. The results showed that the ground truth ET at the satellite pixel scale is relatively reliable with an uncertainty of 0.02–0.41 mm/d. The upscaling framework proposed in this paper can be used to obtain the ground truth ET at the satellite pixel scale and its uncertainty, and it has great potential to be applied in more regions around the globe for remotely sensed ET products’ validation. Collapse

Machine Learning Reveals a Significant Shift in Water Regime Types Due to Projected Climate Change. ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION 2021. [DOI: 10.3390/ijgi10100660] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Singha S, Pasupuleti S, Singha SS, Singh R, Kumar S. Prediction of groundwater quality using efficient machine learning technique. CHEMOSPHERE 2021;276:130265. [PMID: 34088106 DOI: 10.1016/j.chemosphere.2021.130265] [Citation(s) in RCA: 51] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/14/2020] [Revised: 03/07/2021] [Accepted: 03/11/2021] [Indexed: 06/12/2023]

Cheung YY, Cheung S, Mak J, Liu K, Xia X, Zhang X, Yung Y, Liu H. Distinct interaction effects of warming and anthropogenic input on diatoms and dinoflagellates in an urbanized estuarine ecosystem. GLOBAL CHANGE BIOLOGY 2021;27:3463-3473. [PMID: 33934458 DOI: 10.1111/gcb.15667] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/28/2021] [Accepted: 04/20/2021] [Indexed: 06/12/2023]

de Oliveira RCG, Cunha CL, Tôrres AR, Corrêa SM. Forecasts of tropospheric ozone in the Metropolitan Area of Rio de Janeiro based on missing data imputation and multivariate calibration techniques. ENVIRONMENTAL MONITORING AND ASSESSMENT 2021;193:531. [PMID: 34322768 DOI: 10.1007/s10661-021-09333-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/29/2021] [Accepted: 07/22/2021] [Indexed: 06/13/2023]

Ward BJ, Andriessen N, Tembo JM, Kabika J, Grau M, Scheidegger A, Morgenroth E, Strande L. Predictive models using "cheap and easy" field measurements: Can they fill a gap in planning, monitoring, and implementing fecal sludge management solutions? WATER RESEARCH 2021;196:116997. [PMID: 33744658 DOI: 10.1016/j.watres.2021.116997] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/10/2020] [Revised: 02/19/2021] [Accepted: 03/01/2021] [Indexed: 06/12/2023]

Papacharalampous G, Tyralis H, Papalexiou SM, Langousis A, Khatami S, Volpi E, Grimaldi S. Global-scale massive feature extraction from monthly hydroclimatic time series: Statistical characterizations, spatial patterns and hydrological similarity. THE SCIENCE OF THE TOTAL ENVIRONMENT 2021;767:144612. [PMID: 33454612 DOI: 10.1016/j.scitotenv.2020.144612] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/10/2020] [Revised: 11/27/2020] [Accepted: 12/17/2020] [Indexed: 06/12/2023]

Abstract

Hydroclimatic time series analysis focuses on a few feature types (e.g., autocorrelations, trends, extremes), which describe a small portion of the entire information content of the observations. Aiming to exploit a larger part of the available information and, thus, to deliver more reliable results (e.g., in hydroclimatic time series clustering contexts), here we approach hydroclimatic time series analysis differently, i.e., by performing massive feature extraction. In this respect, we develop a big data framework for hydroclimatic variable behaviour characterization. This framework relies on approximately 60 diverse features and is completely automatic (in the sense that it does not depend on the hydroclimatic process at hand). We apply the new framework to characterize mean monthly temperature, total monthly precipitation and mean monthly river flow. The applications are conducted at the global scale by exploiting 40-year-long time series originating from over 13 000 stations. We extract interpretable knowledge on seasonality, trends, autocorrelation, long-range dependence and entropy, and on feature types that are met less frequently. We further compare the examined hydroclimatic variable types in terms of this knowledge and, identify patterns related to the spatial variability of the features. For this latter purpose, we also propose and exploit a hydroclimatic time series clustering methodology. This new methodology is based on Breiman's random forests. The descriptive and exploratory insights gained by the global-scale applications prove the usefulness of the adopted feature compilation in hydroclimatic contexts. Moreover, the spatially coherent patterns characterizing the clusters delivered by the new methodology build confidence in its future exploitation. Given this spatial coherence and the scale-independent nature of the delivered feature values (which makes them particularly useful in forecasting and simulation contexts), we believe that this methodology could also be beneficial within regionalization frameworks, in which knowledge on hydrological similarity is exploited in technical and operative terms.

Collapse

Tyralis H, Papacharalampous G. Boosting algorithms in energy research: a systematic review. Neural Comput Appl 2021. [DOI: 10.1007/s00521-021-05995-8] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]

Ho L, Jerves-Cobo R, Morales O, Larriva J, Arevalo-Durazno M, Barthel M, Six J, Bode S, Boeckx P, Goethals P. Spatial and temporal variations of greenhouse gas emissions from a waste stabilization pond: Effects of sludge distribution and accumulation. WATER RESEARCH 2021;193:116858. [PMID: 33540345 DOI: 10.1016/j.watres.2021.116858] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/02/2020] [Revised: 01/12/2021] [Accepted: 01/18/2021] [Indexed: 06/12/2023]

Assessment of Annual Composite Images Obtained by Google Earth Engine for Urban Areas Mapping Using Random Forest. REMOTE SENSING 2021. [DOI: 10.3390/rs13040748] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Ferreira RG, Silva DDD, Elesbon AAA, Fernandes-Filho EI, Veloso GV, Fraga MDS, Ferreira LB. Machine learning models for streamflow regionalization in a tropical watershed. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2021;280:111713. [PMID: 33257181 DOI: 10.1016/j.jenvman.2020.111713] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/19/2020] [Revised: 11/17/2020] [Accepted: 11/21/2020] [Indexed: 06/12/2023]

Influence of Random Forest Hyperparameterization on Short-Term Runoff Forecasting in an Andean Mountain Catchment. ATMOSPHERE 2021. [DOI: 10.3390/atmos12020238] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Abstract The Random Forest (RF) algorithm, a decision-tree-based technique, has become a promising approach for applications addressing runoff forecasting in remote areas. This machine learning approach can overcome the limitations of scarce spatio-temporal data and physical parameters needed for process-based hydrological models. However, the influence of RF hyperparameters is still uncertain and needs to be explored. Therefore, the aim of this study is to analyze the sensitivity of RF runoff forecasting models of varying lead time to the hyperparameters of the algorithm. For this, models were trained by using (a) default and (b) extensive hyperparameter combinations through a grid-search approach that allow reaching the optimal set. Model performances were assessed based on the R2, %Bias, and RMSE metrics. We found that: (i) The most influencing hyperparameter is the number of trees in the forest, however the combination of the depth of the tree and the number of features hyperparameters produced the highest variability-instability on the models. (ii) Hyperparameter optimization significantly improved model performance for higher lead times (12- and 24-h). For instance, the performance of the 12-h forecasting model under default RF hyperparameters improved to R2 = 0.41 after optimization (gain of 0.17). However, for short lead times (4-h) there was no significant model improvement (0.69 < R2 < 0.70). (iii) There is a range of values for each hyperparameter in which the performance of the model is not significantly affected but remains close to the optimal. Thus, a compromise between hyperparameter interactions (i.e., their values) can produce similar high model performances. Model improvements after optimization can be explained from a hydrological point of view, the generalization ability for lead times larger than the concentration time of the catchment tend to rely more on hyperparameterization than in what they can learn from the input data. This insight can help in the development of operational early warning systems. Collapse

Machine Learning and Simulation-Optimization Coupling for Water Distribution Network Contamination Source Detection. SENSORS 2021;21:s21041157. [PMID: 33562175 PMCID: PMC7916058 DOI: 10.3390/s21041157] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/07/2021] [Revised: 02/02/2021] [Accepted: 02/04/2021] [Indexed: 11/29/2022]

Identification Framework of Contaminant Spill in Rivers Using Machine Learning with Breakthrough Curve Analysis. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2021;18:ijerph18031023. [PMID: 33498931 PMCID: PMC7908193 DOI: 10.3390/ijerph18031023] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/04/2021] [Revised: 01/21/2021] [Accepted: 01/21/2021] [Indexed: 11/23/2022]

Explanation and Probabilistic Prediction of Hydrological Signatures with Statistical Boosting Algorithms. REMOTE SENSING 2021. [DOI: 10.3390/rs13030333] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Wherry SA, Tesoriero AJ, Terziotti S. Factors Affecting Nitrate Concentrations in Stream Base Flow. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2021;55:902-911. [PMID: 33356185 DOI: 10.1021/acs.est.0c02495] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]

Development of a Regional Gridded Runoff Dataset Using Long Short-Term Memory (LSTM) Networks. HYDROLOGY 2021. [DOI: 10.3390/hydrology8010006] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]

Wagenaar D, Hermawan T, van den Homberg MJC, Aerts JCJH, Kreibich H, de Moel H, Bouwer LM. Improved Transferability of Data-Driven Damage Models Through Sample Selection Bias Correction. RISK ANALYSIS : AN OFFICIAL PUBLICATION OF THE SOCIETY FOR RISK ANALYSIS 2021;41:37-55. [PMID: 32830337 PMCID: PMC7891600 DOI: 10.1111/risa.13575] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/13/2019] [Revised: 03/30/2020] [Accepted: 07/07/2020] [Indexed: 06/11/2023]

Liu H, Hitchcock DB, Samadi SZ. Spatio-temporal analysis of flood data from South Carolina. JOURNAL OF STATISTICAL DISTRIBUTIONS AND APPLICATIONS 2020. [DOI: 10.1186/s40488-020-00112-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]

Ha NT, Nguyen HQ, Truong NCQ, Le TL, Thai VN, Pham TL. Estimation of nitrogen and phosphorus concentrations from water quality surrogates using machine learning in the Tri An Reservoir, Vietnam. ENVIRONMENTAL MONITORING AND ASSESSMENT 2020;192:789. [PMID: 33241485 DOI: 10.1007/s10661-020-08731-2] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/03/2020] [Accepted: 11/03/2020] [Indexed: 06/11/2023]

Modeling the Impacts of Climate Change on Crop Yield and Irrigation in the Monocacy River Watershed, USA. CLIMATE 2020. [DOI: 10.3390/cli8120139] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]

Combined Approach Using Clustering-Random Forest to Evaluate Partial Discharge Patterns in Hydro Generators. ENERGIES 2020. [DOI: 10.3390/en13225992] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]

An Improved Approach for Downscaling Coarse-Resolution Thermal Data by Minimizing the Spatial Averaging Biases in Random Forest. REMOTE SENSING 2020. [DOI: 10.3390/rs12213507] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Abstract Land surface temperature (LST) plays a fundamental role in various geophysical processes at varying spatial and temporal scales. Satellite-based observations of LST provide a viable option for monitoring the spatial-temporal evolution of these processes. Downscaling is a widely adopted approach for solving the spatial-temporal trade-off associated with satellite-based observations of LST. However, despite the advances made in the field of LST downscaling, issues related to spatial averaging in the downscaling methodologies greatly hamper the utility of coarse-resolution thermal data for downscaling applications in complex environments. In this study, an improved LST downscaling approach based on random forest (RF) regression is presented. The proposed approach addresses issues related to spatial averaging biases associated with the downscaling model developed at the coarse resolution. The approach was applied to downscale the coarse-resolution Satellite Application Facility on Land Surface Analysis (LSA-SAF) LST product derived from the Spinning Enhanced Visible and Infrared Imager (SEVIRI) sensor aboard the Meteosat Second Generation (MSG) weather satellite. The LSA-SAF product was downscaled to a spatial resolution of ~30 m, based on predictor variables derived from Sentinel 2, and the Advanced Land Observing Satellite (ALOS) digital elevation model (DEM). Quantitatively and qualitatively, better downscaling results were obtained using the proposed approach in comparison to the conventional approach of downscaling LST using RF widely adopted in LST downscaling studies. The enhanced performance indicates that the proposed approach has the ability to reduce the spatial averaging biases inherent in the LST downscaling methodology and thus is more suitable for downscaling applications in complex environments. Collapse

An ensemble genetic programming model for seasonal precipitation forecasting. SN APPLIED SCIENCES 2020. [DOI: 10.1007/s42452-020-03625-x] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022] Open

Combining Multi-Sensor Satellite Imagery to Improve Long-Term Monitoring of Temporary Surface Water Bodies in the Senegal River Floodplain. REMOTE SENSING 2020. [DOI: 10.3390/rs12193157] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]

Abstract Accurate monitoring of surface water bodies is essential in numerous hydrological and agricultural applications. Combining imagery from multiple sensors can improve long-term monitoring; however, the benefits derived from each sensor and the methods to automate long-term water mapping must be better understood across varying periods and in heterogeneous water environments. All available observations from Landsat 7, Landsat 8, Sentinel-2 and MODIS over 1999–2019 are processed in Google Earth Engines to evaluate and compare the benefits of single and multi-sensor approaches in long-term water monitoring of temporary water bodies, against extensive ground truth data from the Senegal River floodplain. Otsu automatic thresholding is compared with default thresholds and site-specific calibrated thresholds to improve Modified Normalized Difference Water Index (MNDWI) classification accuracy. Otsu thresholding leads to the lowest Root Mean Squared Error (RMSE) and high overall accuracies on selected Sentinel-2 and Landsat 8 images, but performance declines when applied to long-term monitoring compared to default or site-specific thresholds. On MODIS imagery, calibrated thresholds are crucial to improve classification in heterogeneous water environments, and results highlight excellent accuracies even in small (19 km2) water bodies despite the 500 m spatial resolution. Over 1999–2019, MODIS observations reduce average daily RMSE by 48% compared to the full Landsat 7 and 8 archive and by 51% compared to the published Global Surface Water datasets. Results reveal the need to integrate coarser MODIS observations in regional and global long-term surface water datasets, to accurately capture flood dynamics, overlooked by the full Landsat time series before 2013. From 2013, the Landsat 7 and Landsat 8 constellation becomes sufficient, and integrating MODIS observations degrades performance marginally. Combining Landsat and Sentinel-2 yields modest improvements after 2015. These results have important implications to guide the development of multi-sensor products and for applications across large wetlands and floodplains. Collapse

Uncertainty Analysis of Monthly Precipitation in GCMs Using Multiple Bias Correction Methods under Different RCPs. SUSTAINABILITY 2020. [DOI: 10.3390/su12187508] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Long-Term Groundwater Level Prediction Model Based on Hybrid KNN-RF Technique. HYDROLOGY 2020. [DOI: 10.3390/hydrology7030059] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Artificial intelligence for sustainability: Challenges, opportunities, and a research agenda. INTERNATIONAL JOURNAL OF INFORMATION MANAGEMENT 2020. [DOI: 10.1016/j.ijinfomgt.2020.102104] [Citation(s) in RCA: 113] [Impact Index Per Article: 28.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]

Tyralis H, Papacharalampous G, Langousis A. Super ensemble learning for daily streamflow forecasting: large-scale demonstration and comparison with multiple machine learning algorithms. Neural Comput Appl 2020. [DOI: 10.1007/s00521-020-05172-3] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]

Assessment of Native Radar Reflectivity and Radar Rainfall Estimates for Discharge Forecasting in Mountain Catchments with a Random Forest Model. REMOTE SENSING 2020. [DOI: 10.3390/rs12121986] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]

Use of Machine Learning in Evaluation of Drought Perception in Irrigated Agriculture: The Case of an Irrigated Perimeter in Brazil. WATER 2020. [DOI: 10.3390/w12061546] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Machine Learning Methods for Improved Understanding of a Pumping Test in Heterogeneous Aquifers. WATER 2020. [DOI: 10.3390/w12051342] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]

Establishing an Empirical Model for Surface Soil Moisture Retrieval at the U.S. Climate Reference Network Using Sentinel-1 Backscatter and Ancillary Data. REMOTE SENSING 2020. [DOI: 10.3390/rs12081242] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Castrillo M, García ÁL. Estimation of high frequency nutrient concentrations from water quality surrogates using machine learning methods. WATER RESEARCH 2020;172:115490. [PMID: 31972414 DOI: 10.1016/j.watres.2020.115490] [Citation(s) in RCA: 49] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/26/2019] [Revised: 12/24/2019] [Accepted: 01/07/2020] [Indexed: 06/10/2023]

Klåvus A, Kokla M, Noerman S, Koistinen VM, Tuomainen M, Zarei I, Meuronen T, Häkkinen MR, Rummukainen S, Farizah Babu A, Sallinen T, Kärkkäinen O, Paananen J, Broadhurst D, Brunius C, Hanhineva K. "notame": Workflow for Non-Targeted LC-MS Metabolic Profiling. Metabolites 2020;10:E135. [PMID: 32244411 PMCID: PMC7240970 DOI: 10.3390/metabo10040135] [Citation(s) in RCA: 58] [Impact Index Per Article: 14.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2020] [Revised: 03/25/2020] [Accepted: 03/28/2020] [Indexed: 02/06/2023] Open

Affiliation(s)

Anton Klåvus Department of Clinical Nutrition and Public Health, University of Eastern Finland, 70210 Kuopio, Finland; (S.N.); (V.M.K.); (M.T.); (I.Z.); (T.M.); (A.F.B.); (T.S.)
Marietta Kokla Department of Clinical Nutrition and Public Health, University of Eastern Finland, 70210 Kuopio, Finland; (S.N.); (V.M.K.); (M.T.); (I.Z.); (T.M.); (A.F.B.); (T.S.)
Stefania Noerman Department of Clinical Nutrition and Public Health, University of Eastern Finland, 70210 Kuopio, Finland; (S.N.); (V.M.K.); (M.T.); (I.Z.); (T.M.); (A.F.B.); (T.S.)
Ville M. Koistinen Department of Clinical Nutrition and Public Health, University of Eastern Finland, 70210 Kuopio, Finland; (S.N.); (V.M.K.); (M.T.); (I.Z.); (T.M.); (A.F.B.); (T.S.)
Marjo Tuomainen Department of Clinical Nutrition and Public Health, University of Eastern Finland, 70210 Kuopio, Finland; (S.N.); (V.M.K.); (M.T.); (I.Z.); (T.M.); (A.F.B.); (T.S.)
Iman Zarei Department of Clinical Nutrition and Public Health, University of Eastern Finland, 70210 Kuopio, Finland; (S.N.); (V.M.K.); (M.T.); (I.Z.); (T.M.); (A.F.B.); (T.S.)
Topi Meuronen Department of Clinical Nutrition and Public Health, University of Eastern Finland, 70210 Kuopio, Finland; (S.N.); (V.M.K.); (M.T.); (I.Z.); (T.M.); (A.F.B.); (T.S.)
Merja R. Häkkinen School of Pharmacy, University of Eastern Finland, 70210 Kuopio, Finland; (M.R.H.); (S.R.); (O.K.)
Soile Rummukainen School of Pharmacy, University of Eastern Finland, 70210 Kuopio, Finland; (M.R.H.); (S.R.); (O.K.)
Ambrin Farizah Babu Department of Clinical Nutrition and Public Health, University of Eastern Finland, 70210 Kuopio, Finland; (S.N.); (V.M.K.); (M.T.); (I.Z.); (T.M.); (A.F.B.); (T.S.)
Taisa Sallinen Department of Clinical Nutrition and Public Health, University of Eastern Finland, 70210 Kuopio, Finland; (S.N.); (V.M.K.); (M.T.); (I.Z.); (T.M.); (A.F.B.); (T.S.) School of Pharmacy, University of Eastern Finland, 70210 Kuopio, Finland; (M.R.H.); (S.R.); (O.K.)
Olli Kärkkäinen School of Pharmacy, University of Eastern Finland, 70210 Kuopio, Finland; (M.R.H.); (S.R.); (O.K.)
Jussi Paananen Institute of Biomedicine, University of Eastern Finland, 70210 Kuopio, Finland;
David Broadhurst Centre for Integrative Metabolomics & Computational Biology, School of Science, Edith Cowan University, Joondalup, WA 6027, Australia;
Carl Brunius Department of Biology and Biological Engineering, Chalmers University of Technology, 41296 Gothenburg, Sweden; Chalmers Mass Spectrometry Infrastructure, Chalmers University of Technology, 41296 Gothenburg, Sweden
Kati Hanhineva Department of Clinical Nutrition and Public Health, University of Eastern Finland, 70210 Kuopio, Finland; (S.N.); (V.M.K.); (M.T.); (I.Z.); (T.M.); (A.F.B.); (T.S.) Department of Biology and Biological Engineering, Chalmers University of Technology, 41296 Gothenburg, Sweden; Department of Biochemistry, Food Chemistry and Food Development unit, University of Turku, 20014 Turun yliopisto, Finland

Collapse

Soil Temperature Dynamics at Hillslope Scale—Field Observation and Machine Learning-Based Approach. WATER 2020. [DOI: 10.3390/w12030713] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]

Abstract Soil temperature plays an important role in understanding hydrological, ecological, meteorological, and land surface processes. However, studies related to soil temperature variability are very scarce in various parts of the world, especially in the Indian Himalayan Region (IHR). Thus, this study aims to analyze the spatio-temporal variability of soil temperature in two nested hillslopes of the lesser Himalaya and to check the efficiency of different machine learning algorithms to estimate soil temperature in the data-scarce region. To accomplish this goal, grassed (GA) and agro-forested (AgF) hillslopes were instrumented with Odyssey water level and decagon soil moisture and temperature sensors. The average soil temperature of the south aspect hillslope (i.e., GA hillslope) was higher than the north aspect hillslope (i.e., AgF hillslope). After analyzing 40 rainfall events from both hillslopes, it was observed that a rainfall duration of greater than 7.5 h or an event with an average rainfall intensity greater than 7.5 mm/h results in more than 2 °C soil temperature drop. Further, a drop in soil temperature less than 1 °C was also observed during very high-intensity rainfall which has a very short event duration. During the rainy season, the soil temperature drop of the GA hillslope is higher than the AgF hillslope as the former one infiltrates more water. This observation indicates the significant correlation between soil moisture rise and soil temperature drop. The potential of four machine learning algorithms was also explored in predicting soil temperature under data-scarce conditions. Among the four machine learning algorithms, an extreme gradient boosting system (XGBoost) performed better for both the hillslopes followed by random forests (RF), multilayer perceptron (MLP), and support vector machine (SVMs). The addition of rainfall to meteorological and meteorological + soil moisture datasets did not improve the models considerably. However, the addition of soil moisture to meteorological parameters improved the model significantly. Collapse

Kim Y, Johnson MS, Knox SH, Black TA, Dalmagro HJ, Kang M, Kim J, Baldocchi D. Gap-filling approaches for eddy covariance methane fluxes: A comparison of three machine learning algorithms and a traditional method with principal component analysis. GLOBAL CHANGE BIOLOGY 2020;26:1499-1518. [PMID: 31553826 DOI: 10.1111/gcb.14845] [Citation(s) in RCA: 31] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/07/2019] [Accepted: 09/12/2019] [Indexed: 06/10/2023]

Abstract

Methane flux (FCH₄ ) measurements using the eddy covariance technique have increased over the past decade. FCH₄ measurements commonly include data gaps, as is the case with CO₂ and energy fluxes. However, gap-filling FCH₄ data are more challenging than other fluxes due to its unique characteristics including multidriver dependency, variabilities across multiple timescales, nonstationarity, spatial heterogeneity of flux footprints, and lagged influence of biophysical drivers. Some researchers have applied a marginal distribution sampling (MDS) algorithm, a standard gap-filling method for other fluxes, to FCH₄ datasets, and others have applied artificial neural networks (ANN) to resolve the challenging characteristics of FCH₄ . However, there is still no consensus regarding FCH₄ gap-filling methods due to limited comparative research. We are not aware of the applications of machine learning (ML) algorithms beyond ANN to FCH₄ datasets. Here, we compare the performance of MDS and three ML algorithms (ANN, random forest [RF], and support vector machine [SVM]) using multiple combinations of ancillary variables. In addition, we applied principal component analysis (PCA) as an input to the algorithms to address multidriver dependency of FCH₄ and reduce the internal complexity of the algorithmic structures. We applied this approach to five benchmark FCH₄ datasets from both natural and managed systems located in temperate and tropical wetlands and rice paddies. Results indicate that PCA improved the performance of MDS compared to traditional inputs. ML algorithms performed better when using all available biophysical variables compared to using PCA-derived inputs. Overall, RF was found to outperform other techniques for all sites. We found gap-filling uncertainty is much larger than measurement uncertainty in accumulated CH₄ budget. Therefore, the approach used for FCH₄ gap filling can have important implications for characterizing annual ecosystem-scale methane budgets, the accuracy of which is important for evaluating natural and managed systems and their interactions with global change processes.

Collapse

Mapping Forest Composition with Landsat Time Series: An Evaluation of Seasonal Composites and Harmonic Regression. REMOTE SENSING 2020. [DOI: 10.3390/rs12040610] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Abstract The Landsat program has long supported pioneering research on the recovery of forest information by remote sensing technologies for several decades, and efforts to improve the thematic resolution and accuracy of forest compositional products remains an area of continued innovation. Recent development and application of Landsat time series analysis offers unique opportunities for quantifying seasonality and trend components among different forest types for developing alternative feature sets for forest vegetation mapping. Within a large forested landscape in Southeastern Ohio, USA, we examined the use of harmonic metrics developed from time series of all available Landsat-8 observations (2013–2019) relative to seasonal image composites, including accompanying spectral components and vegetation indices. A reference dataset among three sources was integrated and used to categorize forest inventory data into seven forest type classes and gradient compositional response. Results showed that the combination of harmonic metrics and topographic variables achieved an accuracy agreement with the reference data of 74.9% relative to seasonal composites (71.6%) and spectral indices (70.3%). Differences in agreement were attributed to improved discrimination of three heterogeneous upland hardwood classes and an early-successional, young forest class, all forest types of primary interest among managers across the region. Variable importance metrics often identified the cosine and sine terms that quantify the seasonality in spectral values in the harmonic feature space, suggesting these aspects best support the characterization of forest types at greater thematic detail than seasonal compositing procedures. This study demonstrates how advanced time series metrics can improve forest type modeling and forest gradient quantifications, thus showcasing a need for continued exploration of such approaches across different forest types. Collapse

Probabilistic Hydrological Post-Processing at Scale: Why and How to Apply Machine-Learning Quantile Regression Algorithms. WATER 2019. [DOI: 10.3390/w11102126] [Citation(s) in RCA: 36] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Abstract We conduct a large-scale benchmark experiment aiming to advance the use of machine-learning quantile regression algorithms for probabilistic hydrological post-processing “at scale” within operational contexts. The experiment is set up using 34-year-long daily time series of precipitation, temperature, evapotranspiration and streamflow for 511 catchments over the contiguous United States. Point hydrological predictions are obtained using the Génie Rural à 4 paramètres Journalier (GR4J) hydrological model and exploited as predictor variables within quantile regression settings. Six machine-learning quantile regression algorithms and their equal-weight combiner are applied to predict conditional quantiles of the hydrological model errors. The individual algorithms are quantile regression, generalized random forests for quantile regression, generalized random forests for quantile regression emulating quantile regression forests, gradient boosting machine, model-based boosting with linear models as base learners and quantile regression neural networks. The conditional quantiles of the hydrological model errors are transformed to conditional quantiles of daily streamflow, which are finally assessed using proper performance scores and benchmarking. The assessment concerns various levels of predictive quantiles and central prediction intervals, while it is made both independently of the flow magnitude and conditional upon this magnitude. Key aspects of the developed methodological framework are highlighted, and practical recommendations are formulated. In technical hydro-meteorological applications, the algorithms should be applied preferably in a way that maximizes the benefits and reduces the risks from their use. This can be achieved by (i) combining algorithms (e.g., by averaging their predictions) and (ii) integrating algorithms within systematic frameworks (i.e., by using the algorithms according to their identified skills), as our large-scale results point out. Collapse

Random Forest Ability in Regionalizing Hourly Hydrological Model Parameters. WATER 2019. [DOI: 10.3390/w11081540] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Duan ZY, Wang LM, Mammadov M, Lou H, Sun MH. Discriminatory Target Learning: Mining Significant Dependence Relationships from Labeled and Unlabeled Data. ENTROPY 2019;21:e21050537. [PMID: 33267251 PMCID: PMC7515026 DOI: 10.3390/e21050537] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/27/2019] [Revised: 05/20/2019] [Accepted: 05/24/2019] [Indexed: 11/16/2022]