1
|
Ayinde BO, Musa MR, Ayinde AAO. Application of machine learning models and landsat 8 data for estimating seasonal pm 2.5 concentrations. Environ Anal Health Toxicol 2024; 39:e2024011-0. [PMID: 38631403 PMCID: PMC11079408 DOI: 10.5620/eaht.2024011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2023] [Accepted: 03/12/2024] [Indexed: 04/19/2024] Open
Abstract
Air pollution is a significant global challenge that affects many cities. In Europe, Bosnia and Herzegovina (BiH) are among the most highly polluted and are mainly affected by air pollution. In this study, we integrate open-source landsat 8 remote sensing products, topographical data, and the limited ground truth PM2.5 data to spatially predict the air quality level across different seasons in Tuzla Canton, BiH by adopting three pre-existing machine learning models, namely XGBoost, K-Nearest Neighbour (KNN) and Naive Bayes (NB). These classification models were implemented based on landsat 8 bands, environmental-derived indices, and topographical variables generated for the study area. Based on the predicted results, the XGBoost model exhibited the highest overall accuracy across all seasons. The predicted model results were used to generate spatial air quality maps. Based on the classification maps, the PM2.5 air quality level predicted for Tuzla Canton in the Winter Season is very unhealthy. The findings conclude that the PM2.5 air quality concentration in Tuzla Canton is relatively unsatisfactory and requires urgent intervention by the government to prevent further deterioration of air quality in Tuzla and other affected cantons in BiH.
Collapse
|
2
|
El Mghouchi Y, Udristioiu MT, Yildizhan H. Multivariable Air-Quality Prediction and Modelling via Hybrid Machine Learning: A Case Study for Craiova, Romania. SENSORS (BASEL, SWITZERLAND) 2024; 24:1532. [PMID: 38475068 DOI: 10.3390/s24051532] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/26/2024] [Revised: 02/22/2024] [Accepted: 02/26/2024] [Indexed: 03/14/2024]
Abstract
Inadequate air quality has adverse impacts on human well-being and contributes to the progression of climate change, leading to fluctuations in temperature. Therefore, gaining a localized comprehension of the interplay between climate variations and air pollution holds great significance in alleviating the health repercussions of air pollution. This study uses a holistic approach to make air quality predictions and multivariate modelling. It investigates the associations between meteorological factors, encompassing temperature, relative humidity, air pressure, and three particulate matter concentrations (PM10, PM2.5, and PM1), and the correlation between PM concentrations and noise levels, volatile organic compounds, and carbon dioxide emissions. Five hybrid machine learning models were employed to predict PM concentrations and then the Air Quality Index (AQI). Twelve PM sensors evenly distributed in Craiova City, Romania, provided the dataset for five months (22 September 2021-17 February 2022). The sensors transmitted data each minute. The prediction accuracy of the models was evaluated and the results revealed that, in general, the coefficient of determination (R2) values exceeded 0.96 (interval of confidence is 0.95) and, in most instances, approached 0.99. Relative humidity emerged as the least influential variable on PM concentrations, while the most accurate predictions were achieved by combining pressure with temperature. PM10 (less than 10 µm in diameter) concentrations exhibited a notable correlation with PM2.5 (less than 2.5 µm in diameter) concentrations and a moderate correlation with PM1 (less than 1 µm in diameter). Nevertheless, other findings indicated that PM concentrations were not strongly related to NOISE, CO2, and VOC, and these last variables should be combined with another meteorological variable to enhance the prediction accuracy. Ultimately, this study established novel relationships for predicting PM concentrations and AQI based on the most effective combinations of predictor variables identified.
Collapse
Affiliation(s)
- Youness El Mghouchi
- Department of Energetics, ENSAM, Moulay Ismail University, Meknes 50050, Morocco
| | - Mihaela Tinca Udristioiu
- Department of Physics, Faculty of Science, University of Craiova, 13 A.I. Cuza Street, 200585 Craiova, Romania
| | - Hasan Yildizhan
- Engineering Faculty, Energy Systems Engineering, Adana Alparslan Türkeş Science and Technology University, Adana 46278, Turkey
| |
Collapse
|
3
|
Yu X, Chen S, Zhang X, Wu H, Guo Y, Guan J. Research progress of the artificial intelligence application in wastewater treatment during 2012-2022: a bibliometric analysis. WATER SCIENCE AND TECHNOLOGY : A JOURNAL OF THE INTERNATIONAL ASSOCIATION ON WATER POLLUTION RESEARCH 2023; 88:1750-1766. [PMID: 37830995 PMCID: wst_2023_296 DOI: 10.2166/wst.2023.296] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/14/2023]
Abstract
This study identified literatures from the Web of Science Core Collection on the application of artificial intelligence in wastewater treatment from 2011 to 2022, through bibliometrics, to summarize achievements and capture the scientific and technological progress. The number of papers published is on the rise, and especially, the number of papers issued after 2018 has increased sharply, with China contributing the most in this regard, followed by the US, Iran and India. The University of Tehran has the largest number of papers, WATER is the most published journal, and Nasr M has the largest number of articles. Collaborative network has been developed mainly through cooperation between European countries, China and the US. Remote sensing in developing countries needs to be further integrated with water quality monitoring programs. It is worth noting that artificial neural network is a research hotspot in recent years. Through keyword clustering analysis, 'machine learning' and 'deep learning' are hot keywords that have emerged since 2019. The use of neural networks for predicting the effectiveness of treatment of difficult to degrade wastewater is a future research trend. The rapid advancement of deep learning provides the opportunity to build automated pipeline defect detection systems through image recognition.
Collapse
Affiliation(s)
- Xiaoman Yu
- School of Resources and Environmental Engineering, Shanghai Polytechnic University, Shanghai 201209, China E-mail:
| | - Shuai Chen
- School of Resources and Environmental Engineering, Shanghai Polytechnic University, Shanghai 201209, China; Anhui International Joint Research Center for Nano Carbon-based Materials and Environmental Health, Huainan 232001, China
| | - Xiaojiao Zhang
- School of Resources and Environmental Engineering, Shanghai Polytechnic University, Shanghai 201209, China
| | - Hongcheng Wu
- Shanghai Wobai Environmental Development Co. Ltd, Shanghai 201209, China
| | - Yaoguang Guo
- School of Resources and Environmental Engineering, Shanghai Polytechnic University, Shanghai 201209, China
| | - Jie Guan
- School of Resources and Environmental Engineering, Shanghai Polytechnic University, Shanghai 201209, China
| |
Collapse
|
4
|
Tan S, Xie D, Ni C, Zhao G, Shao J, Chen F, Ni J. Spatiotemporal characteristics of air pollution in Chengdu-Chongqing urban agglomeration (CCUA) in Southwest, China: 2015-2021. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2023; 325:116503. [PMID: 36274306 DOI: 10.1016/j.jenvman.2022.116503] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/16/2022] [Revised: 10/04/2022] [Accepted: 10/09/2022] [Indexed: 06/16/2023]
Abstract
Studying the spatiotemporal characteristics of air pollutants in urban agglomerations and their response factors will help to improve the quality of urban living. In combining air quality monitoring data and wavelet analysis from the Chengdu-Chongqing urban agglomeration (CCUA), this study assessed the spatiotemporal distribution characteristics and influential factors of air pollutants on daily, monthly and annual scales. The results showed that the concentration of air pollutants in the CCUA has decreased year by year, and air quality has improved. Except for O3, pollutants in autumn and winter were higher than those in summer. The spatial distribution of air pollutants was obvious distributed in Chengdu, Chongqing, Zigong and Dazhou. Pollution incidents were mainly concentrated in winter. The 6 air pollutants and air quality index (AQI) have dominant periods on multiple time scales. AQI showed positive coherence with PM2.5 and PM10 on multiple time scales, and obvious positive coherence with SO2, CO, NO2 and O3 in the short term scale. AQI was not strongly correlated with the fire point, but exhibited obvious negative coherence in the long term scale. In addition, AQI showed an obvious positive correlation with temperature and sunshine hours in short term, and a clear negative correlation with humidity and rainfall. The research results of this paper will provide a reference for pollution prevention and control in the CCUA.
Collapse
Affiliation(s)
- Shaojun Tan
- College of Resources and Environment, Southwest University, Chongqing, 400715, China.
| | - Deti Xie
- College of Resources and Environment, Southwest University, Chongqing, 400715, China.
| | - Chengsheng Ni
- College of Resources and Environment, Southwest University, Chongqing, 400715, China.
| | - Guangyao Zhao
- College of Resources and Environment, Southwest University, Chongqing, 400715, China.
| | - Jingan Shao
- College of Geography and Tourism, Chongqing Normal University, Chongqing, 401331, China.
| | - Fangxin Chen
- College of Resources and Environment, Southwest University, Chongqing, 400715, China.
| | - Jiupai Ni
- College of Resources and Environment, Southwest University, Chongqing, 400715, China.
| |
Collapse
|
5
|
Feng H, Zhang X. A novel encoder-decoder model based on Autoformer for air quality index prediction. PLoS One 2023; 18:e0284293. [PMID: 37053153 PMCID: PMC10101400 DOI: 10.1371/journal.pone.0284293] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2023] [Accepted: 03/28/2023] [Indexed: 04/14/2023] Open
Abstract
Rapid economic development has led to increasingly serious air quality problems. Accurate air quality prediction can provide technical support for air pollution prevention and treatment. In this paper, we proposed a novel encoder-decoder model named as Enhanced Autoformer (EnAutoformer) to improve the air quality index (AQI) prediction. In this model, (a) The enhanced cross-correlation (ECC) is proposed for extracting the temporal dependencies in AQI time series; (b) Combining the ECC with the cross-stage feature fusion mechanism of CSPDenseNet, the core module CSP_ECC is proposed for improving the computational efficiency of the EnAutoformer. (c) The time series decomposition and dilated causal convolution added in the decoder module are exploited to extract the finer-grained features from the original AQI data and improve the performance of the proposed model for long-term prediction. The real-world air quality datasets collected from Lanzhou are used to validate the performance of our prediction model. The experimental results show that our EnAutoformer model can greatly improve the prediction accuracy compared to the baselines and can be used as a promising alternative for complex air quality prediction.
Collapse
Affiliation(s)
- Huifang Feng
- College of Mathematics and Statistics, Northwest Normal University, Lanzhou, China
| | - Xianghong Zhang
- College of Mathematics and Statistics, Northwest Normal University, Lanzhou, China
| |
Collapse
|
6
|
Peng S, Zhu J, Liu Z, Hu B, Wang M, Pu S. Prediction of Ammonia Concentration in a Pig House Based on Machine Learning Models and Environmental Parameters. Animals (Basel) 2022; 13:ani13010165. [PMID: 36611774 PMCID: PMC9817777 DOI: 10.3390/ani13010165] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2022] [Revised: 12/17/2022] [Accepted: 12/29/2022] [Indexed: 01/04/2023] Open
Abstract
Accurately predicting the air quality in a piggery and taking control measures in advance are important issues for pig farm production and local environmental management. In this experiment, the NH3 concentration in a semi-automatic piggery was studied. First, the random forest algorithm (RF) and Pearson correlation analysis were combined to analyze the environmental parameters, and nine input schemes for the model feature parameters were identified. Three kinds of deep learning and three kinds of conventional machine learning algorithms were applied to the prediction of NH3 in the piggery. Through comparative experiments, appropriate environmental parameters (CO2, H2O, P, and outdoor temperature) and superior algorithms (LSTM and RNN) were selected. On this basis, the PSO algorithm was used to optimize the hyperparameters of the algorithms, and their prediction performance was also evaluated. The results showed that the R2 values of PSO-LSTM and PSO-RNN were 0.9487 and 0.9458, respectively. These models had good accuracy when predicting NH3 concentration in the piggery 0.5 h, 1 h, 1.5 h, and 2 h in advance. This study can provide a reference for the prediction of air concentrations in pig house environments.
Collapse
Affiliation(s)
- Siyi Peng
- Chongqing Academy of Animal Sciences, Changlong Avenue, Chongqing 402460, China
- College of Animal Science and Technology, Southwest University, Chongqing 402460, China
| | - Jiaming Zhu
- Chongqing Academy of Animal Sciences, Changlong Avenue, Chongqing 402460, China
- National Center of Technology Innovation for Pigs, Chongqing 402460, China
- Scientific Observation and Experiment Station of Livestock Equipment Engineering in Southwest, Ministry of Agriculture and Rural Affairs, Chongqing 402460, China
- Innovation and Entrepreneurship Team for Livestock Environment Control and Equipment R&D, Chongqing 402460, China
| | - Zuohua Liu
- Chongqing Academy of Animal Sciences, Changlong Avenue, Chongqing 402460, China
- College of Animal Science and Technology, Southwest University, Chongqing 402460, China
- National Center of Technology Innovation for Pigs, Chongqing 402460, China
| | - Bin Hu
- Chongqing Academy of Animal Sciences, Changlong Avenue, Chongqing 402460, China
- National Center of Technology Innovation for Pigs, Chongqing 402460, China
- Scientific Observation and Experiment Station of Livestock Equipment Engineering in Southwest, Ministry of Agriculture and Rural Affairs, Chongqing 402460, China
- Innovation and Entrepreneurship Team for Livestock Environment Control and Equipment R&D, Chongqing 402460, China
| | - Miao Wang
- Chongqing Academy of Animal Sciences, Changlong Avenue, Chongqing 402460, China
- College of Animal Science and Technology, Southwest University, Chongqing 402460, China
| | - Shihua Pu
- Chongqing Academy of Animal Sciences, Changlong Avenue, Chongqing 402460, China
- National Center of Technology Innovation for Pigs, Chongqing 402460, China
- Scientific Observation and Experiment Station of Livestock Equipment Engineering in Southwest, Ministry of Agriculture and Rural Affairs, Chongqing 402460, China
- Innovation and Entrepreneurship Team for Livestock Environment Control and Equipment R&D, Chongqing 402460, China
- Correspondence:
| |
Collapse
|
7
|
Peng J, Han H, Yi Y, Huang H, Xie L. Machine learning and deep learning modeling and simulation for predicting PM2.5 concentrations. CHEMOSPHERE 2022; 308:136353. [PMID: 36084831 DOI: 10.1016/j.chemosphere.2022.136353] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/04/2022] [Revised: 08/14/2022] [Accepted: 09/02/2022] [Indexed: 06/15/2023]
Abstract
Particulate matter (PM) pollution greatly endanger human physical and mental health, and it is of great practical significance to predict PM concentrations accurately. This study measured one-year monitoring data of six main meteorological parameters and PM2.5 concentrations independently at two monitoring sites in central China's Hunan Province. These datasets were then employed to train, validate, and evaluate the proposed extreme gradient boosting (XGBoost) machine learning model and the fully connected neural network deep learning model, respectively. The performances of the two models were compared, analyzed, and optimized through model parameter tuning. The XGBoost model had better prediction ability with R2 higher than 0.761 in the complete test dataset. When the complete dataset was divided into stratified sub-sets by daytime-nighttime periods, the value of R2 increased to 0.856 in the nighttime test dataset. The feature importance and influential mechanism of meteorological variables on PM2.5 concentrations were analyzed and discussed.
Collapse
Affiliation(s)
- Jian Peng
- School of Minerals Processing and Bioengineering, Central South University, Changsha, 410083, China
| | - Haisheng Han
- School of Minerals Processing and Bioengineering, Central South University, Changsha, 410083, China
| | - Yong Yi
- Atmospheric Environment Monitoring Department, Changsha Environmental Monitoring Centre of Hunan Province, Changsha, 410001, China
| | - Huimin Huang
- Atmospheric Environment Monitoring Department, Changsha Environmental Monitoring Centre of Hunan Province, Changsha, 410001, China
| | - Le Xie
- College of Chemistry and Chemical Engineering, Central South University, Changsha, 410083, China.
| |
Collapse
|
8
|
Tella A, Balogun AL. GIS-based air quality modelling: spatial prediction of PM10 for Selangor State, Malaysia using machine learning algorithms. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2022; 29:86109-86125. [PMID: 34533750 DOI: 10.1007/s11356-021-16150-0] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/11/2021] [Accepted: 08/20/2021] [Indexed: 06/13/2023]
Abstract
Rapid urbanization has caused severe deterioration of air quality globally, leading to increased hospitalization and premature deaths. Therefore, accurate prediction of air quality is crucial for mitigation planning to support urban sustainability and resilience. Although some studies have predicted air pollutants such as particulate matter (PM) using machine learning algorithms (MLAs), there is a paucity of studies on spatial hazard assessment with respect to the air quality index (AQI). Incorporating PM in AQI studies is crucial because of its easily inhalable micro-size which has adverse impacts on ecology, environment, and human health. Accurate and timely prediction of the air quality index can ensure adequate intervention to aid air quality management. Therefore, this study undertakes a spatial hazard assessment of the air quality index using particulate matter with a diameter of 10 μm or lesser (PM10) in Selangor, Malaysia, by developing four machine learning models: eXtreme Gradient Boosting (XGBoost), random forest (RF), K-nearest neighbour (KNN), and Naive Bayes (NB). Spatially processed data such as NDVI, SAVI, BU, LST, Ws, slope, elevation, and road density was used for the modelling. The model was trained with 70% of the dataset, while 30% was used for cross-validation. Results showed that XGBoost has the highest overall accuracy and precision of 0.989 and 0.995, followed by random forest (0.989, 0.993), K-nearest neighbour (0.987, 0.984), and Naive Bayes (0.917, 0.922), respectively. The spatial air quality maps were generated by integrating the geographical information system (GIS) with the four MLAs, which correlated with Malaysia's air pollution index. The maps indicate that air quality in Selangor is satisfactory and posed no threats to health. Nevertheless, the two algorithms with the best performance (XGBoost and RF) indicate that a high percentage of the air quality is moderate. The study concludes that successful air pollution management policies such as green infrastructure practice, improvement of energy efficiency, and restrictions on heavy-duty vehicles can be adopted in Selangor and other Southeast Asian cities to prevent deterioration of air quality in the future.
Collapse
Affiliation(s)
- Abdulwaheed Tella
- Geospatial Analysis and Modelling (GAM) Research Laboratory, Department of Civil and Environmental Engineering, Universiti Teknologi PETRONAS (UTP), 32610, Seri Iskandar, Perak, Malaysia.
| | - Abdul-Lateef Balogun
- Geospatial Analysis and Modelling (GAM) Research Laboratory, Department of Civil and Environmental Engineering, Universiti Teknologi PETRONAS (UTP), 32610, Seri Iskandar, Perak, Malaysia
| |
Collapse
|
9
|
Rawson A, Brito M, Sabeur Z. Spatial Modeling of Maritime Risk Using Machine Learning. RISK ANALYSIS : AN OFFICIAL PUBLICATION OF THE SOCIETY FOR RISK ANALYSIS 2022; 42:2291-2311. [PMID: 34854116 DOI: 10.1111/risa.13866] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/15/2020] [Revised: 09/14/2021] [Accepted: 11/09/2021] [Indexed: 06/13/2023]
Abstract
Managing navigational safety is a key responsibility of coastal states. Predicting and measuring these risks has a high complexity due to their infrequent occurrence, multitude of causes, and large study areas. As a result, maritime risk models are generally limited in scale to small regions, generalized across diverse environments, or rely on the use of expert judgement. Therefore, such an approach has limited scalability and may incorrectly characterize the risk. Within this article a novel method for undertaking spatial modeling of maritime risk is proposed through machine learning. This enables navigational safety to be characterized while leveraging the significant volumes of relevant data available. The method comprises two key components: aggregation of historical accident data, vessel traffic, and other exploratory features into a spatial grid; and the implementation of several classification algorithms that predicts annual accident occurrence for various vessel types. This approach is applied to characterize the risk of collisions and groundings in the United Kingdom. The results vary between hazard types and vessel types but show remarkable capability at characterizing maritime risk, with accuracies and area under curve scores in excess of 90% in most implementations. Furthermore, the ensemble tree-based algorithms of XGBoost and Random Forest consistently outperformed other machine learning algorithms that were tested. The resultant potential risk maps provide decisionmakers with actionable intelligence in order to target risk mitigation measures in regions with the greatest requirement.
Collapse
Affiliation(s)
- Andrew Rawson
- Electronics and Computer Science, University of Southampton, Highfield, Southampton, UK
| | - Mario Brito
- Centre for Risk Research, Southampton Business School, University of Southampton, Highfield, Southampton, UK
| | - Zoheir Sabeur
- Department of Computing and Informatics, Talbot Campus, University of Bournemouth, Bournemouth, UK
| |
Collapse
|
10
|
Youssef AM, Pourghasemi HR, El-Haddad BA. Advanced machine learning algorithms for flood susceptibility modeling - performance comparison: Red Sea, Egypt. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2022; 29:66768-66792. [PMID: 35508847 DOI: 10.1007/s11356-022-20213-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/16/2021] [Accepted: 04/08/2022] [Indexed: 06/14/2023]
Abstract
Floods are among the most devastating environmental hazards that directly and indirectly affect people's lives and activities. In many countries, sustainable environmental management requires the assessment of floods and the likely flood-prone areas to avoid potential hazards. In this study, the performance and capabilities of seven machine learning algorithms (MLAs) for flood susceptibility mapping were tested, evaluated, and compared. These MLAs, including support vector machine (SVM), random forest (RF), multivariate adaptive regression spline (MARS), boosted regression tree (BRT), functional data analysis (FDA), general linear model (GLM), and multivariate discriminant analysis (MDA), were tested for the area between Safaga and Ras Gharib cities, Red Sea, Egypt. A geospatial database was developed with eleven flood-related factors, namely altitude, slope aspect, lithology, land use/land cover (LULC), slope length (LS), topographic wetness index (TWI), slope angle, profile curvature, plan curvature, stream power index (SPI), and hydrolithology units. In addition, 420 actual flooded areas were recorded from the study area to create a flood inventory map. The inventory data were randomly divided into training group with 70% and validation group with 30%. The flood-related factors were tested with a multicollinearity test, the variance inflation factor (VIF) was less than 2.135, the tolerance (TOL) was more than 0.468, and their importance was evaluated with a partial least squares (PLS) method. The results show that RF performed the best with the highest AUC (area under curve) value of 0.813, followed by GLM with 0.802, MARS with 0.801, BRT with 0.777, MDA with 0.768%, FDA with 0.763, and SVM with 0.733. The results of this study and the flood susceptibility maps could be useful for environmental mitigation, future development activities in the area, and flood control areas.
Collapse
Affiliation(s)
- Ahmed M Youssef
- Geology Department, Faculty of Science, Sohag University, Sohag, Egypt
- Geological Hazards Department, Applied Geology Sector, Saudi Geological Survey, P.O. Box 54141, Jeddah, 21514, Kingdom of Saudi Arabia
| | - Hamid Reza Pourghasemi
- Department of Natural Resources and Environmental Engineering, College of Agriculture, Shiraz University, Shiraz, Iran.
| | - Bosy A El-Haddad
- Geology Department, Faculty of Science, Sohag University, Sohag, Egypt
| |
Collapse
|
11
|
Ma X, Chen T, Ge R, Cui C, Xu F, Lv Q. Time series-based PM 2.5 concentration prediction in Jing-Jin-Ji area using machine learning algorithm models. Heliyon 2022; 8:e10691. [PMID: 36185154 PMCID: PMC9519508 DOI: 10.1016/j.heliyon.2022.e10691] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2022] [Revised: 07/24/2022] [Accepted: 09/14/2022] [Indexed: 12/03/2022] Open
Abstract
Globally all countries encounter air pollution problems along their development path. As a significant indicator of air quality, PM2.5 concentration has long been proven to be affecting the population’s death rate. Machine learning algorithms proven to outperform traditional statistical approaches are widely used in air pollution prediction. However research on the model selection discussion and environmental interpretation of model prediction results is still scarce and urgently needed to lead the policy making on air pollution control. Our research compared four types of machine learning algorisms LinearSVR, K-Nearest Neighbor, Lasso regression, Gradient boosting by looking into their performance in predicting PM2.5 concentrations among different cities and seasons. The results show that the machine learning model is able to forecast the next day PM2.5 concentration based on the previous five days' data with better accuracy. The comparative experiments show that based on city level the Gradient Boosting prediction model has better prediction performance with mean absolute error (MAE) of 9 ug/m3 and root mean square error (RMSE) of 10.25–16.76 ug/m3, lower compared with the other three models, and based on season level four models have the best prediction performances in winter time and the worst in summer time. And more importantly the demonstration of models' different performances in each city and each season is of great significance in environmental policy implications.
Collapse
Affiliation(s)
- Xin Ma
- School of Management and Economics, North China University of Water Resources and Electric Power, Zhengzhou 450046, China
| | - Tengfei Chen
- School of Management and Economics, North China University of Water Resources and Electric Power, Zhengzhou 450046, China
| | - Rubing Ge
- Environmental Protection Investment Performance Center, Chinese Academy of Environmental Planning, Beijing 100012, China
| | - Caocao Cui
- School of Management and Economics, North China University of Water Resources and Electric Power, Zhengzhou 450046, China
| | - Fan Xu
- School of Management and Economics, North China University of Water Resources and Electric Power, Zhengzhou 450046, China
| | - Qi Lv
- School of Management and Economics, North China University of Water Resources and Electric Power, Zhengzhou 450046, China
| |
Collapse
|
12
|
Moradi E, Darabi H, Heydari E, Karimi M, Kløve B. Vegetation vulnerability to hydrometeorological stresses in water-scarce areas using machine learning and remote sensing techniques. ECOL INFORM 2022. [DOI: 10.1016/j.ecoinf.2022.101838] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
13
|
Random Forest Estimation and Trend Analysis of PM2.5 Concentration over the Huaihai Economic Zone, China (2000–2020). SUSTAINABILITY 2022. [DOI: 10.3390/su14148520] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
Consisting of ten cities in four Chinese provinces, the Huaihai Economic Zone has suffered serious air pollution over the last two decades, particularly of fine particulate matter (PM2.5). In this study, we used multi-source data, namely MAIAC AOD (at a 1 km spatial resolution), meteorological, topographic, date, and location (latitude and longitude) data, to construct a regression model using random forest to estimate the daily PM2.5 concentration over the Huaihai Economic Zone from 2000 to 2020. It was found that the variable expressing time (date) had the greatest characteristic importance when estimating PM2.5. By averaging the modeled daily PM2.5 concentration, we produced a yearly PM2.5 concentration dataset, at a 1 km resolution, for the study area from 2000 to 2020. On comparing modeled daily PM2.5 with observational data, the coefficient of determination (R2) of the modeling was 0.85, the root means square error (RMSE) was 14.63 μg/m3, and the mean absolute error (MAE) was 10.03 μg/m3. The quality assessment of the synthesized yearly PM2.5 concentration dataset shows that R2 = 0.77, RMSE = 6.92 μg/m3, and MAE = 5.42 μg/m3. Despite different trends from 2000–2010 and from 2010–2020, the trend of PM2.5 concentration over the Huaihai Economic Zone during the 21 years was, overall, decreasing. The area of the significantly decreasing trend was small and mainly concentrated in the lake areas of the Zone. It is concluded that PM2.5 can be well-estimated from the MAIAC AOD dataset, when incorporating spatiotemporal variability using random forest, and that the resultant PM2.5 concentration data provide a basis for environmental monitoring over large geographic areas.
Collapse
|
14
|
Study of Driving Factors Using Machine Learning to Determine the Effect of Topography, Climate, and Fuel on Wildfire in Pakistan. REMOTE SENSING 2022. [DOI: 10.3390/rs14081918] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
As the climate changes with the population expansion in Pakistan, wildfires are becoming more threatening. The goal of this study was to understand fire trends which might help to improve wildland management and reduction in wildfire risk in Pakistan. Using descriptive analyses, we investigated the spatiotemporal trends and causes of wildfire in the 2001–2020 period. Optimized machine learning (ML) models were incorporated using variables representing potential fire drivers, such as weather, topography, and fuel, which includes vegetation, soil, and socioeconomic data. The majority of fires occurred in the last 5 years, with winter being the most prevalent season in coastal regions. ML models such as RF outperformed others and correctly predicted fire occurrence (AUC values of 0.84–0.93). Elevation, population, specific humidity, vapor pressure, and NDVI were all key factors; however, their contributions varied depending on locational clusters and seasons. The percentage shares of climatic conditions, fuel, and topographical variables at the country level were 55.2%, 31.8%, and 12.8%, respectively. This study identified the probable driving factors of Pakistan wildfires, as well as the probability of fire occurrences across the country. The analytical approach, as well as the findings and conclusions reached, can be very useful to policymakers, environmentalists, and climate change researchers, among others, and may help Pakistan improve its wildfire management and mitigation.
Collapse
|
15
|
Wang F, Zhang Z, Wang G, Wang Z, Li M, Liang W, Gao J, Wang W, Chen D, Feng Y, Shi G. Machine learning and theoretical analysis release the non-linear relationship among ozone, secondary organic aerosol and volatile organic compounds. J Environ Sci (China) 2022; 114:75-84. [PMID: 35459516 DOI: 10.1016/j.jes.2021.07.026] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2021] [Revised: 07/22/2021] [Accepted: 07/22/2021] [Indexed: 06/14/2023]
Abstract
Fine particulate matter (PM2.5) and ozone (O3) pollutions are prevalent air quality issues in China. Volatile organic compounds (VOCs) have significant impact on the formation of O3 and secondary organic aerosols (SOA) contributing PM2.5. Herein, we investigated 54 VOCs, O3 and SOA in Tianjin from June 2017 to May 2019 to explore the non-linear relationship among O3, SOA and VOCs. The monthly patterns of VOCs and SOA concentrations were characterized by peak values during October to March and reached a minimum from April to September, but the observed O3 was exactly the opposite. Machine learning methods resolved the importance of individual VOCs on O3 and SOA that alkenes (mainly ethylene, propylene, and isoprene) have the highest importance to O3 formation; alkanes (Cn, n ≥ 6) and aromatics were the main source of SOA formation. Machine learning methods revealed and emphasized the importance of photochemical consumptions of VOCs to O3 and SOA formation. Ozone formation potential (OFP) and secondary organic aerosol formation potential (SOAFP) calculated by consumed VOCs quantitatively indicated that more than 80% of the consumed VOCs were alkenes which dominated the O3 formation, and the importance of consumed aromatics and alkenes to SOAFP were 40.84% and 56.65%, respectively. Therein, isoprene contributed the most to OFP at 41.45% regardless of the season, while aromatics (58.27%) contributed the most to SOAFP in winter. Collectively, our findings can provide scientific evidence on policymaking for VOCs controls on seasonal scales to achieve effective reduction in both SOA and O3.
Collapse
Affiliation(s)
- Feng Wang
- State Environmental Protection Key Laboratory of Urban Ambient Air Particulate Matter Pollution Prevention and Control, Tianjin Key Laboratory of Urban Transport Emission Research, College of Environmental Science and Engineering, Nankai University, Tianjin 300350, China; CMA-NKU Cooperative Laboratory for Atmospheric Environment-Health Research (CLAER), College of Environmental Science and Engineering, Nankai University, Tianjin 300350, China
| | - Zhongcheng Zhang
- State Environmental Protection Key Laboratory of Urban Ambient Air Particulate Matter Pollution Prevention and Control, Tianjin Key Laboratory of Urban Transport Emission Research, College of Environmental Science and Engineering, Nankai University, Tianjin 300350, China; CMA-NKU Cooperative Laboratory for Atmospheric Environment-Health Research (CLAER), College of Environmental Science and Engineering, Nankai University, Tianjin 300350, China
| | - Gen Wang
- State Key Laboratory on Odor Pollution Control, Tianjin Academy of Environmental Sciences, Tianjin 300191, China
| | - Zhenyu Wang
- State Environmental Protection Key Laboratory of Urban Ambient Air Particulate Matter Pollution Prevention and Control, Tianjin Key Laboratory of Urban Transport Emission Research, College of Environmental Science and Engineering, Nankai University, Tianjin 300350, China; CMA-NKU Cooperative Laboratory for Atmospheric Environment-Health Research (CLAER), College of Environmental Science and Engineering, Nankai University, Tianjin 300350, China
| | - Mei Li
- Guangdong Provincial Engineering Research Center for On-line Source Apportionment System of Air Pollution Jinan University, Institute of Mass Spectrometry and Atmospheric Environment, Guangzhou 510632, China; Guangdong-Hongkong-Macau Joint Laboratory of Collaborative Innovation for Environmental Quality, Guangzhou 510632, China
| | - Weiqing Liang
- State Environmental Protection Key Laboratory of Urban Ambient Air Particulate Matter Pollution Prevention and Control, Tianjin Key Laboratory of Urban Transport Emission Research, College of Environmental Science and Engineering, Nankai University, Tianjin 300350, China; CMA-NKU Cooperative Laboratory for Atmospheric Environment-Health Research (CLAER), College of Environmental Science and Engineering, Nankai University, Tianjin 300350, China
| | - Jie Gao
- State Environmental Protection Key Laboratory of Urban Ambient Air Particulate Matter Pollution Prevention and Control, Tianjin Key Laboratory of Urban Transport Emission Research, College of Environmental Science and Engineering, Nankai University, Tianjin 300350, China; CMA-NKU Cooperative Laboratory for Atmospheric Environment-Health Research (CLAER), College of Environmental Science and Engineering, Nankai University, Tianjin 300350, China
| | - Wei Wang
- Trusted AI System Laboratory, College of Computer Science, Nankai University, Tianjin 300350, China.
| | - Da Chen
- Key Laboratory of Civil Aviation Thermal Hazards Prevention and Emergency Response, Civil Aviation University of China, Tianjin 300300, China
| | - Yinchang Feng
- State Environmental Protection Key Laboratory of Urban Ambient Air Particulate Matter Pollution Prevention and Control, Tianjin Key Laboratory of Urban Transport Emission Research, College of Environmental Science and Engineering, Nankai University, Tianjin 300350, China; CMA-NKU Cooperative Laboratory for Atmospheric Environment-Health Research (CLAER), College of Environmental Science and Engineering, Nankai University, Tianjin 300350, China
| | - Guoliang Shi
- State Environmental Protection Key Laboratory of Urban Ambient Air Particulate Matter Pollution Prevention and Control, Tianjin Key Laboratory of Urban Transport Emission Research, College of Environmental Science and Engineering, Nankai University, Tianjin 300350, China; CMA-NKU Cooperative Laboratory for Atmospheric Environment-Health Research (CLAER), College of Environmental Science and Engineering, Nankai University, Tianjin 300350, China.
| |
Collapse
|
16
|
Modeling solubility of CO2–N2 gas mixtures in aqueous electrolyte systems using artificial intelligence techniques and equations of state. Sci Rep 2022; 12:3625. [PMID: 35256623 PMCID: PMC8901744 DOI: 10.1038/s41598-022-07393-z] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2021] [Accepted: 02/09/2022] [Indexed: 12/03/2022] Open
Abstract
Determining the solubility of non-hydrocarbon gases such as carbon dioxide (CO2) and nitrogen (N2) in water and brine is one of the most controversial challenges in the oil and chemical industries. Although many researches have been conducted on solubility of gases in brine and water, very few researches investigated the solubility of power plant flue gases (CO2–N2 mixtures) in aqueous solutions. In this study, using six intelligent models, including Random Forest, Decision Tree (DT), Gradient Boosting-Decision Tree (GB-DT), Adaptive Boosting-Decision Tree (AdaBoost-DT), Adaptive Boosting-Support Vector Regression (AdaBoost-SVR), and Gradient Boosting-Support Vector Regression (GB-SVR), the solubility of CO2–N2 mixtures in water and brine solutions was predicted, and the results were compared with four equations of state (EOSs), including Peng–Robinson (PR), Soave–Redlich–Kwong (SRK), Valderrama–Patel–Teja (VPT), and Perturbed-Chain Statistical Associating Fluid Theory (PC-SAFT). The results indicate that the Random Forest model with an average absolute percent relative error (AAPRE) value of 2.8% has the best predictions. The GB-SVR and DT models also have good precision with AAPRE values of 6.43% and 7.41%, respectively. For solubility of CO2 present in gaseous mixtures in aqueous systems, the PC-SAFT model, and for solubility of N2, the VPT EOS had the best results among the EOSs. Also, the sensitivity analysis of input parameters showed that increasing the mole percent of CO2 in gaseous phase, temperature, pressure, and decreasing the ionic strength increase the solubility of CO2–N2 mixture in water and brine solutions. Another significant issue is that increasing the salinity of brine also has a subtractive effect on the solubility of CO2–N2 mixture. Finally, the Leverage method proved that the actual data are of excellent quality and the Random Forest approach is quite reliable for determining the solubility of the CO2–N2 gas mixtures in aqueous systems.
Collapse
|
17
|
Prediction of Daily Mean PM10 Concentrations Using Random Forest, CART Ensemble and Bagging Stacked by MARS. SUSTAINABILITY 2022. [DOI: 10.3390/su14020798] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
A novel framework for stacked regression based on machine learning was developed to predict the daily average concentrations of particulate matter (PM10), one of Bulgaria’s primary health concerns. The measurements of nine meteorological parameters were introduced as independent variables. The goal was to carefully study a limited number of initial predictors and extract stochastic information from them to build an extended set of data that allowed the creation of highly efficient predictive models. Four base models using random forest, CART ensemble and bagging, and their rotation variants, were built and evaluated. The heterogeneity of these base models was achieved by introducing five types of diversities, including a new simplified selective ensemble algorithm. The predictions from the four base models were then used as predictors in multivariate adaptive regression splines (MARS) models. All models were statistically tested using out-of-bag or with 5-fold and 10-fold cross-validation. In addition, a variable importance analysis was conducted. The proposed framework was used for short-term forecasting of out-of-sample data for seven days. It was shown that the stacked models outperformed all single base models. An index of agreement IA = 0.986 and a coefficient of determination of about 95% were achieved.
Collapse
|
18
|
Zhao Y, Li Y, Yang F. A state-of-the-art review on modeling the biochar effect: Guidelines for beginners. THE SCIENCE OF THE TOTAL ENVIRONMENT 2022; 802:149861. [PMID: 34461475 DOI: 10.1016/j.scitotenv.2021.149861] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/31/2021] [Revised: 08/11/2021] [Accepted: 08/19/2021] [Indexed: 06/13/2023]
Abstract
Biochar has been widely advocated due to its special properties and sustainability for agriculture soil amendment. The influencing mechanism of biochar on soil properties is a key aspect of quantifying and predicting its benefits and trade-offs. The contribution of biochar to both environmental and agricultural benefits has been deeply discussed and extensively reviewed, but few reviews have focused on modeling biochar effects. An overview of recent advances in biochar modeling is illustrated and approaches classified in this paper. Applications of a machine learning model, a deterministic model, and a numerical model to biochar are categorized and summarized. A discussion of the advantages and disadvantages of each model and a comparison among them are also provided. Finally, this paper gives many suggestions on narrowing the knowledge gap to advance biochar modeling. Further study of biochar modeling in management planning and design and application of the model results in agricultural systems will help accelerate the expansion of biochar's application scale and encourage the efficient utilization of waste in agricultural systems.
Collapse
Affiliation(s)
- Ying Zhao
- School of Water Conservancy & Civil Engineering, Northeast Agricultural University, Harbin 150030, China; Joint Laboratory of Northeast Agricultural University and Max Planck Institute of Colloids and Interfaces (NEAU-MPICI), Harbin 150030, China
| | - YueLei Li
- School of Water Conservancy & Civil Engineering, Northeast Agricultural University, Harbin 150030, China; Joint Laboratory of Northeast Agricultural University and Max Planck Institute of Colloids and Interfaces (NEAU-MPICI), Harbin 150030, China
| | - Fan Yang
- School of Water Conservancy & Civil Engineering, Northeast Agricultural University, Harbin 150030, China; Joint Laboratory of Northeast Agricultural University and Max Planck Institute of Colloids and Interfaces (NEAU-MPICI), Harbin 150030, China.
| |
Collapse
|
19
|
Samadianfard S, Kargar K, Shadkani S, Hashemi S, Abbaspour A, Safari MJS. Hybrid models for suspended sediment prediction: optimized random forest and multi-layer perceptron through genetic algorithm and stochastic gradient descent methods. Neural Comput Appl 2021. [DOI: 10.1007/s00521-021-06550-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
20
|
Yuk H, Yang S, Wi S, Kang Y, Kim S. Verification of particle matter generation due to deterioration of building materials as the cause of indoor fine dust. JOURNAL OF HAZARDOUS MATERIALS 2021; 416:125920. [PMID: 34492852 DOI: 10.1016/j.jhazmat.2021.125920] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/22/2021] [Revised: 04/13/2021] [Accepted: 04/15/2021] [Indexed: 06/13/2023]
Abstract
Particles of fine dust are pollutants that adversely affect indoor air quality and exacerbate human respiratory diseases. The aging of the building was pointed out as a source of fine dust indoors. The aging of buildings has various causes of deterioration. During various deterioration, friction adversely affects the building floor finish. In this study, an accelerated friction deterioration device was used to confirm the generation of fine dust particles through the frictional deterioration of floor finishes in buildings. The study found that the concentration of fine dust particles attributed to deteriorating flooring was 327 mg/m3 in PM2.5 and 4828 mg/m3 in PM10 and confirmed that particle distribution differs depending on the surface of the flooring. Particles of 10 µm or less were observed through particle analysis. The study confirmed that fine dust particles did not diffuse in a specific direction and that the detected fine dust particles could be attributed to deterioration. Further research is needed on the detection of fine dust in degraded building finishing materials.
Collapse
Affiliation(s)
- Hyeonseong Yuk
- Department of Architecture and Architectural Engineering, Yonsei University, Seoul 03722, Republic of Korea
| | - Sungwoong Yang
- Department of Architecture and Architectural Engineering, Yonsei University, Seoul 03722, Republic of Korea
| | - Seunghwan Wi
- Department of Architecture and Architectural Engineering, Yonsei University, Seoul 03722, Republic of Korea
| | - Yujin Kang
- Department of Architecture and Architectural Engineering, Yonsei University, Seoul 03722, Republic of Korea
| | - Sumin Kim
- Department of Architecture and Architectural Engineering, Yonsei University, Seoul 03722, Republic of Korea.
| |
Collapse
|
21
|
Park J, Chang S. A Particulate Matter Concentration Prediction Model Based on Long Short-Term Memory and an Artificial Neural Network. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2021; 18:ijerph18136801. [PMID: 34202834 PMCID: PMC8297184 DOI: 10.3390/ijerph18136801] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/03/2021] [Revised: 06/15/2021] [Accepted: 06/16/2021] [Indexed: 01/12/2023]
Abstract
Many countries are concerned about high particulate matter (PM) concentrations caused by rapid industrial development, which can harm both human health and the environment. To manage PM, the prediction of PM concentrations based on historical data is actively being conducted. Existing technologies for predicting PM mostly assess the model performance for the prediction of existing PM concentrations; however, PM must be forecast in advance, before it becomes highly concentrated and causes damage to the citizens living in the affected regions. Thus, it is necessary to conduct research on an index that can illustrate whether the PM concentration will increase or decrease. We developed a model that can predict whether the PM concentration might increase or decrease after a certain time, specifically for PM2.5 (fine PM) generated by anthropogenic volatile organic compounds. An algorithm that can select a model on an hourly basis, based on the long short-term memory (LSTM) and artificial neural network (ANN) models, was developed. The proposed algorithm exhibited a higher F1-score than the LSTM, ANN, or random forest models alone. The model developed in this study could be used to predict future regional PM concentration levels more effectively.
Collapse
|
22
|
Li L, Zhang R, Sun J, He Q, Kong L, Liu X. Monitoring and prediction of dust concentration in an open-pit mine using a deep-learning algorithm. JOURNAL OF ENVIRONMENTAL HEALTH SCIENCE & ENGINEERING 2021; 19:401-414. [PMID: 34150244 PMCID: PMC8172817 DOI: 10.1007/s40201-021-00613-0] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/02/2020] [Accepted: 01/04/2021] [Indexed: 05/12/2023]
Abstract
PURPOSE Dust pollution is currently one of the most serious environmental problems faced by open-pit mines. Compared with underground mining, open-pit mining has many dust sources, and a wide area of influence and complicated changes in meteorological conditions can result in great variations in dust concentration. Therefore, the prediction of dust concentrations in open-pit mines requires research and is of great significance for reducing environmental pollution and personal health hazards. METHODS This study is based on monitoring of the concentration of total suspended particulate (TSP) in the Anjialing open-pit coal mine in Pingshuo. This paper proposes a hybrid model based on a long short-term memory (LSTM) network and the attention mechanism (LSTM-Attention) and applies it to the prediction of TSP concentration. The LSTM model reflects the historical process of an input time series, and the attention mechanism extracts the inherent characteristics of the input parameters to assign weights based on the importance of the influencing factors. The autoregressive integrated moving average (ARIMA) and LSTM models are also used to predict the TSP concentration. Finally, several statistical measures of error are used to evaluate the accuracy of the model and perform a sensitivity analysis. RESULTS It was found that, in general, the TSP concentration was highest in the period 08:00-09:00 and lowest in the period 15:00-16:00. In addition to the influence of meteorological parameters and normal operations, the reason for this trend is the presence of an inversion layer above the open-pit mine. The results show that, compared with the ARIMA and LSTM models, the LSTM-Attention model is more stable and has a prediction accuracy that is 5.6% and 3.0% greater, respectively. CONCLUSION This model can be applied to the prediction of dust concentrations in open-pit mines and provide guidance on when to carry out dust-suppression work. It has expansibility and is potentially valuable for application in a wide range of areas.
Collapse
Affiliation(s)
- Lin Li
- School of Energy and Mining Engineering, China University of Mining & Technology (Beijing), Beijing, 100083 China
| | - Ruixin Zhang
- School of Energy and Mining Engineering, China University of Mining & Technology (Beijing), Beijing, 100083 China
- School of Safety Engineering, North China Institute of Science and Technology, Sanhe, 065201 Hebei China
| | - Jiandong Sun
- School of Safety Engineering, North China Institute of Science and Technology, Sanhe, 065201 Hebei China
| | - Qian He
- School of Energy and Mining Engineering, China University of Mining & Technology (Beijing), Beijing, 100083 China
| | - Lingzhen Kong
- School of Energy and Mining Engineering, China University of Mining & Technology (Beijing), Beijing, 100083 China
| | - Xin Liu
- School of Safety Engineering, North China Institute of Science and Technology, Sanhe, 065201 Hebei China
| |
Collapse
|
23
|
Moradi E, Abdolshahnejad M, Borji Hassangavyar M, Ghoohestani G, da Silva AM, Khosravi H, Cerdà A. Machine learning approach to predict susceptible growth regions of Moringa peregrina (Forssk). ECOL INFORM 2021. [DOI: 10.1016/j.ecoinf.2021.101267] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
24
|
Mosavi A, Sajedi Hosseini F, Choubin B, Taromideh F, Ghodsi M, Nazari B, Dineva AA. Susceptibility mapping of groundwater salinity using machine learning models. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2021; 28:10804-10817. [PMID: 33099737 DOI: 10.1007/s11356-020-11319-5] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/10/2020] [Accepted: 10/18/2020] [Indexed: 06/11/2023]
Abstract
Increasing groundwater salinity has recently raised severe environmental and health concerns around the world. Advancement of the novel methods for spatial salinity modeling and prediction would be essential for effective management of the resources and planning mitigation policies. The current research presents the application of machine learning (ML) models in groundwater salinity mapping based on the dichotomous predictions. The groundwater salinity is predicted using the essential factors (i.e., identified by the simulated annealing feature selection methodology) through k-fold cross-validation methodology. Six ML models, namely, flexible discriminant analysis (FDA), mixture discriminant analysis (MAD), boosted regression tree (BRT), multivariate adaptive regression spline (MARS), random forest (RF), support vector machine (SVM), were employed to groundwater salinity mapping. The results of the modeling indicated that the SVM model had superior performance than other models. Variables of soil order, groundwater withdrawal, precipitation, land use, and elevation had the most contribute to groundwater salinity mapping. Results highlighted that the southern parts of the region and some parts in the north, northeast, and west have a high groundwater salinity, in which these areas are mostly matched with soil order of Entisols, bareland areas, and low elevations.
Collapse
Affiliation(s)
- Amirhosein Mosavi
- Environmental Quality, Atmospheric Science and Climate Change Research Group, Ton Duc Thang University, Ho Chi Minh City, Vietnam
- Faculty of Environment and Labour Safety, Ton Duc Thang University, Ho Chi Minh City, Vietnam
| | - Farzaneh Sajedi Hosseini
- Reclamation of Arid and Mountainous Regions Department, Faculty of Natural Resources, University of Tehran, Karaj, Iran
| | - Bahram Choubin
- Soil Conservation and Watershed Management Research Department, West Azarbaijan Agricultural and Natural Resources Research and Education Center, AREEO, Urmia, Iran.
| | - Fereshteh Taromideh
- Department of Irrigation, Sari Agricultural Sciences and Natural Resources University, Sari, Iran
| | - Marzieh Ghodsi
- Faculty of Geography, University of Tehran, Tehran, Iran
| | - Bijan Nazari
- Department of Water Sciences and Engineering, Imam Khomeini International University, Qazvin, Iran
| | - Adrienn A Dineva
- Institute of Research and Development, Duy Tan University, Da Nang 550000, Vietnam.
| |
Collapse
|
25
|
Prediction of Neutralization Depth of R.C. Bridges Using Machine Learning Methods. CRYSTALS 2021. [DOI: 10.3390/cryst11020210] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Machine learning techniques have become a popular solution to prediction problems. These approaches show excellent performance without being explicitly programmed. In this paper, 448 sets of data were collected to predict the neutralization depth of concrete bridges in China. Random forest was used for parameter selection. Besides this, four machine learning methods, such as support vector machine (SVM), k-nearest neighbor (KNN) and XGBoost, were adopted to develop models. The results show that machine learning models obtain a high accuracy (>80%) and an acceptable macro recall rate (>80%) even with only four parameters. For SVM models, the radial basis function has a better performance than other kernel functions. The radial basis kernel SVM method has the highest verification accuracy (91%) and the highest macro recall rate (86%). Besides this, the preference of different methods is revealed in this study.
Collapse
|
26
|
A First Approach to Aerosol Classification Using Space-Borne Measurement Data: Machine Learning-Based Algorithm and Evaluation. REMOTE SENSING 2021. [DOI: 10.3390/rs13040609] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
A new method was developed for classifying aerosol types involving a machine-learning approach to the use of satellite data. An Aerosol Robotic NETwork (AERONET)-based aerosol-type dataset was used as a target variable in a random forest (RF) model. The contributions of satellite input variables to the RF-based model were quantified to determine an optimal set of input variables. The new method, based on inputs of satellite variables, allows the classification of seven aerosol types: pure dust, dust-dominant mixed, pollution-dominant mixed aerosols, and pollution aerosols (strongly, moderately, weakly, and non-absorbing). The performance of the model was statistically evaluated using AERONET data excluded from the model training dataset. Model accuracy for classifying the seven aerosol types was 59%, improving to 72% for four types (pure dust, dust-dominant mixed, strongly absorbing, and non-absorbing). The performance of the model was evaluated against an earlier aerosol classification method based on the wavelength dependence of single-scattering albedo (SSA) and fine-mode-fraction values from AERONET. Typical wavelength dependences of SSA for individual aerosol types are consistent with those obtained for aerosol types by the new method. This study demonstrates that an RF-based model is capable of satellite aerosol classification with sensitivity to the contribution of non-spherical particles.
Collapse
|
27
|
Petermann E, Meyer H, Nussbaum M, Bossew P. Mapping the geogenic radon potential for Germany by machine learning. THE SCIENCE OF THE TOTAL ENVIRONMENT 2021; 754:142291. [PMID: 33254926 DOI: 10.1016/j.scitotenv.2020.142291] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/25/2020] [Revised: 08/12/2020] [Accepted: 09/07/2020] [Indexed: 06/12/2023]
Abstract
The radioactive gas radon (Rn) is considered as an indoor air pollutant due to its detrimental effects on human health. In fact, exposure to Rn belongs to the most important causes for lung cancer after tobacco smoking. The dominant source of indoor Rn is the ground beneath the house. The geogenic Rn potential (GRP) - a function of soil gas Rn concentration and soil gas permeability - quantifies what "earth delivers in terms of Rn" and represents a hazard indicator for elevated indoor Rn concentration. In this study, we aim at developing an improved spatial continuous GRP map based on 4448 field measurements of GRP distributed across Germany. We fitted three different machine learning algorithms, multivariate adaptive regression splines, random forest and support vector machines utilizing 36 candidate predictors. Predictor selection, hyperparameter tuning and performance assessment were conducted using a spatial cross-validation where the data was iteratively left out by spatial blocks of 40 km*40 km. This procedure counteracts the effect of spatial auto-correlation in predictor and response data and minimizes dependence of training and test data. The spatial cross-validated performance statistics revealed that random forest provided the most accurate predictions. The predictors selected as informative reflect geology, climate (temperature, precipitation and soil moisture), soil hydraulic, soil physical (field capacity, coarse fraction) and soil chemical properties (potassium and nitrogen concentration). Model interpretation techniques such as predictor importance as well as partial and spatial dependence plots confirmed the hypothesized dominant effect of geology on GRP, but also revealed significant contributions of the other predictors. Partial and spatial dependence plots gave further valuable insight into the quantitative predictor-response relationship and its spatial distribution. A comparison with a previous version of the German GRP map using 1359 independent test data indicates a significantly better performance of the random forest based map.
Collapse
Affiliation(s)
- Eric Petermann
- Federal Office for Radiation Protection (BfS), Section Radon and NORM, Berlin, Germany.
| | - Hanna Meyer
- Westfälische Wilhelms-Universität Münster, Institute of Landscape Ecology, Münster, Germany
| | - Madlene Nussbaum
- Bern University of Applied Sciences (BFH), School of Agricultural, Forest and Food Sciences, (HAFL), Zollikofen, Switzerland
| | - Peter Bossew
- Federal Office for Radiation Protection (BfS), Section Radon and NORM, Berlin, Germany
| |
Collapse
|
28
|
Asthma-prone areas modeling using a machine learning model. Sci Rep 2021; 11:1912. [PMID: 33479275 PMCID: PMC7820586 DOI: 10.1038/s41598-021-81147-1] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2020] [Accepted: 12/28/2020] [Indexed: 12/17/2022] Open
Abstract
Nowadays, owing to population growth, increasing environmental pollution, and lifestyle changes, the number of asthmatics has significantly increased. Therefore, the purpose of our study was to determine the asthma-prone areas in Tehran, Iran considering environmental, spatial factors. Initially, we built a spatial database using 872 locations of children with asthma and 13 environmental factors affecting the disease—distance to parks and streets, rainfall, temperature, humidity, pressure, wind speed, particulate matter (PM 10 and PM 2.5), ozone (O3), sulfur dioxide (SO2), carbon monoxide (CO), and nitrogen dioxide (NO2). Subsequently, utilizing this spatial database, a random forest (RF) machine learning model, and a geographic information system, we prepared a map of asthma-prone areas. For modeling and validation, we deployed 70% and 30%, respectively, of the locations of children with asthma. The results of spatial autocorrelation and RF model showed that the criteria of distance to parks and streets as well as PM 2.5 and PM 10 had the greatest impact on asthma occurrence in the study area. Spatial autocorrelation analyses indicated that the distribution of asthma cases was not random. According to receiver operating characteristic results, the RF model had good accuracy (the area under the curve was 0.987 and 0.921, respectively, for training and testing data).
Collapse
|
29
|
Himawari-8 Aerosol Optical Depth (AOD) Retrieval Using a Deep Neural Network Trained Using AERONET Observations. REMOTE SENSING 2020. [DOI: 10.3390/rs12244125] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
Spectral aerosol optical depth (AOD) estimation from satellite-measured top of atmosphere (TOA) reflectances is challenging because of the complicated TOA-AOD relationship and a nexus of land surface and atmospheric state variations. This task is usually undertaken using a physical model to provide a first estimate of the TOA reflectances which are then optimized by comparison with the satellite data. Recently developed deep neural network (DNN) models provide a powerful tool to represent the complicated relationship statistically. This study presents a methodology based on DNN to estimate AOD using Himawari-8 Advanced Himawari Imager (AHI) TOA observations. A year (2017) of AHI TOA observations over the Himawari-8 full disk collocated in space and time with Aerosol Robotic Network (AERONET) AOD data were used to derive a total of 14,154 training and validation samples. The TOA reflectance in all six AHI solar bands, three TOA reflectance ratios derived based on the dark-target assumptions, sun-sensor geometry, and auxiliary data are used as predictors to estimate AOD at 500 nm. The DNN AOD is validated by separating training and validation samples using random k-fold cross-validation and using AERONET site-specific leave-one-station-out validation, and is compared with a random forest regression estimator and Japan Meteorological Agency (JMA) AOD. The DNN AOD shows high accuracy: (1) RMSE = 0.094, R2 = 0.915 for k-fold cross-validation, and (2) RMSE = 0.172, R2 = 0.730 for leave-one-station-out validation. The k-fold cross-validation overestimates the DNN accuracy as the training and validation samples may come from the same AHI pixel location. The leave-one-station-out validation reflects the accuracy for large-area applications where there are no training samples for the pixel location to be estimated. The DNN AOD has better accuracy than the random forest AOD and JMA AOD. In addition, the contribution of the dark-target derived TOA ratio predictors is examined and confirmed, and the sensitivity to the DNN structure is discussed.
Collapse
|
30
|
Choubin B, Borji M, Hosseini FS, Mosavi A, Dineva AA. Mass wasting susceptibility assessment of snow avalanches using machine learning models. Sci Rep 2020; 10:18363. [PMID: 33110178 PMCID: PMC7591884 DOI: 10.1038/s41598-020-75476-w] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2020] [Accepted: 10/15/2020] [Indexed: 12/13/2022] Open
Abstract
Snow avalanche is among the most harmful natural hazards with major socioeconomic and environmental destruction in the cold and mountainous regions. The devastating propagation and accumulation of the snow avalanche debris and mass wasting of surface rocks and vegetation particles threaten human life, transportation networks, built environments, ecosystems, and water resources. Susceptibility assessment of snow avalanche hazardous areas is of utmost importance for mitigation and development of land-use policies. This research evaluates the performance of the well-known machine learning methods, i.e., generalized additive model (GAM), multivariate adaptive regression spline (MARS), boosted regression trees (BRT), and support vector machine (SVM), in modeling the mass wasting hazard induced by snow avalanches. The key features are identified by the recursive feature elimination (RFE) method and used for the model calibration. The results indicated a good performance of the modeling process (Accuracy > 0.88, Kappa > 0.76, Precision > 0.84, Recall > 0.86, and AUC > 0.89), which the SVM model highlighted superior performance than others. Sensitivity analysis demonstrated that the topographic position index (TPI) and distance to stream (DTS) were the most important variables which had more contribution in producing the susceptibility maps.
Collapse
Affiliation(s)
- Bahram Choubin
- Soil Conservation and Watershed Management Research Department, West Azarbaijan Agricultural and Natural Resources Research and Education Center, AREEO, Urmia, Iran
| | - Moslem Borji
- Reclamation of Arid and Mountainous Regions Department, Faculty of Natural Resources, University of Tehran, Karaj, Iran
| | - Farzaneh Sajedi Hosseini
- Reclamation of Arid and Mountainous Regions Department, Faculty of Natural Resources, University of Tehran, Karaj, Iran
| | - Amirhosein Mosavi
- Environmental Quality, Atmospheric Science and Climate Change Research Group, Ton Duc Thang University, Ho Chi Minh City, Vietnam.
- Faculty of Environment and Labour Safety, Ton Duc Thang University, Ho Chi Minh City, Vietnam.
| | - Adrienn A Dineva
- Institute of Research and Development, Duy Tan University, Da Nang, 550000, Vietnam.
- Kalman Kando Faculty of Electrical Engineering, Obuda University, Budapest, Hungary.
| |
Collapse
|
31
|
Susceptibility Prediction of Groundwater Hardness Using Ensemble Machine Learning Models. WATER 2020. [DOI: 10.3390/w12102770] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Groundwater resources, unlike surface water, are more vulnerable to disturbances and contaminations, as they take a very long time and significant cost to recover. So, predictive modeling and prevention strategies can empower policymakers for efficient groundwater governance through informed decisions and recommendations. Due to the importance of groundwater quality modeling, the hardness susceptibility mapping using machine learning (ML) models has not been explored. For the first time, the current research aimed to predict groundwater hardness susceptibility using the ML models. The performance of two ensemble models of boosted regression trees (BRT) and random forest (RF) is investigated through the arrangement of a comparative study with multivariate discriminant analysis (MDA). According to the hardness values in 135 groundwater quality monitoring wells, the hard and soft water are determined; then, 11 predictor variables including distance from the sea (DFS), land use, elevation, distance from the river (DFR), depth to groundwater (DTGW), pH, precipitation (PCP), evaporation (E), groundwater level (GWL), curvature, and lithology are used for predicting the groundwater hardness susceptibility map. Results indicated that the variables of DFR, DTGW, elevation, and DFS had a higher contribution to the modeling process. So, the high harness areas are mostly related to low elevations, low DTGW, and proximity to river and sea, which facilitate the percolation conditions for minerals containing calcium or magnesium into groundwater.
Collapse
|
32
|
Abstract
Several outbreak prediction models for COVID-19 are being used by officials around the world to make informed decisions and enforce relevant control measures. Among the standard models for COVID-19 global pandemic prediction, simple epidemiological and statistical models have received more attention by authorities, and these models are popular in the media. Due to a high level of uncertainty and lack of essential data, standard models have shown low accuracy for long-term prediction. Although the literature includes several attempts to address this issue, the essential generalization and robustness abilities of existing models need to be improved. This paper presents a comparative analysis of machine learning and soft computing models to predict the COVID-19 outbreak as an alternative to susceptible–infected–recovered (SIR) and susceptible-exposed-infectious-removed (SEIR) models. Among a wide range of machine learning models investigated, two models showed promising results (i.e., multi-layered perceptron, MLP; and adaptive network-based fuzzy inference system, ANFIS). Based on the results reported here, and due to the highly complex nature of the COVID-19 outbreak and variation in its behavior across nations, this study suggests machine learning as an effective tool to model the outbreak. This paper provides an initial benchmarking to demonstrate the potential of machine learning for future research. This paper further suggests that a genuine novelty in outbreak prediction can be realized by integrating machine learning and SEIR models.
Collapse
|
33
|
Dézerald O, Mondy CP, Dembski S, Kreutzenberger K, Reyjol Y, Chandesris A, Valette L, Brosse S, Toussaint A, Belliard J, Merg ML, Usseglio-Polatera P. A diagnosis-based approach to assess specific risks of river degradation in a multiple pressure context: Insights from fish communities. THE SCIENCE OF THE TOTAL ENVIRONMENT 2020; 734:139467. [PMID: 32470662 DOI: 10.1016/j.scitotenv.2020.139467] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/14/2020] [Revised: 04/27/2020] [Accepted: 05/13/2020] [Indexed: 06/11/2023]
Abstract
In the context of increasing pressure on water bodies, many fish-based indices have been developed to evaluate the ecological status of rivers. However, most of these indices suffer from several limitations, which hamper the capacity of water managers to select the most appropriate measures of restoration. Those limitations include: (i) being dependent on reference conditions, (ii) not satisfactorily handling complex and non-linear biological responses to pressure gradients, and (iii) being unable to identify specific risks of stream degradation in a multi-pressure context. To tackle those issues, we developed a diagnosis-based approach using Random Forest models to predict the impairment probabilities of river fish communities by 28 pressure categories (chemical, hydromorphological and biological). In addition, the database includes the abundances of 72 fish species collected from 1527 sites in France, sampled between 2005 and 2015; and fish taxonomic and biological information. Twenty random forest models provided at least good performances when evaluating impairment probabilities of fish communities by those pressures. The best performing models indicated that fish communities were impacted, on average, by 7.34 ± 0.03 abiotic pressure categories (mean ± SE), and that hydromorphological alterations (5.27 ± 0.02) were more often detected than chemical ones (2.06 ± 0.02). These models showed that alterations in longitudinal continuity, and contaminations by Polycyclic Aromatic Hydrocarbons were respectively the most frequent hydromorphological and chemical pressure categories in French rivers. This approach has also efficiently detected the functional impact of invasive alien species. Identifying and ranking the impacts of multiple anthropogenic pressures that trigger functional shifts in river biological communities is essential for managers to prioritize actions and to implement appropriate restoration programmes. Actually implemented in an R package, this approach has the capacity to detect a variety of impairments, resulting in an efficient assessment of ecological risks across various spatial and temporal scales.
Collapse
Affiliation(s)
- Olivier Dézerald
- ESE, Ecology and Ecosystems Health, INRAE, Agrocampus Ouest, 35042 Rennes, France; Université de Lorraine, CNRS, LIEC, F-57000 Metz, France.
| | - Cédric P Mondy
- Office Français de la Biodiversité, Direction Régionale Ile-de-France, 12 cours Lumière, F-94300 Vincennes, France
| | - Samuel Dembski
- Office Français de la Biodiversité, Direction Régionale Ile-de-France, 12 cours Lumière, F-94300 Vincennes, France
| | - Karl Kreutzenberger
- Office Français de la Biodiversité, Direction Générale, 35042 Rennes, France
| | - Yorick Reyjol
- UMS Patrinat (OFB-CNRS-MNHN), Muséum national d'Histoire naturelle CP41, 36 rue Geoffroy Saint-Hilaire, 75005 Paris, France
| | - André Chandesris
- INRAE, UR Riverly, 5 rue de la Doua - CS 20244, 69625 Villeurbanne Cedex, France
| | - Laurent Valette
- INRAE, UR Riverly, 5 rue de la Doua - CS 20244, 69625 Villeurbanne Cedex, France
| | - Sébastien Brosse
- Laboratoire Evolution et Diversité Biologique, UMR 5174 UPS-CNRS-IRD, Université Paul Sabatier, 118 route de Narbonne, F-31062 Toulouse, France
| | - Aurèle Toussaint
- Institute of Ecology and Earth Sciences, University of Tartu, Lai 40, Tartu 51005, Estonia
| | - Jérôme Belliard
- Université Paris-Saclay, INRAE, UR HYCAR, 92160 Antony, France
| | - Marie-Line Merg
- Université Paris-Saclay, INRAE, UR HYCAR, 92160 Antony, France
| | | |
Collapse
|
34
|
Bozdağ A, Dokuz Y, Gökçek ÖB. Spatial prediction of PM 10 concentration using machine learning algorithms in Ankara, Turkey. ENVIRONMENTAL POLLUTION (BARKING, ESSEX : 1987) 2020; 263:114635. [PMID: 33618491 DOI: 10.1016/j.envpol.2020.114635] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/18/2020] [Revised: 04/14/2020] [Accepted: 04/18/2020] [Indexed: 06/12/2023]
Abstract
With the increase in population and industrialization, air pollution has become one of the global problems nowadays. Therefore, air pollutant parameters should be measured at regular intervals, and the necessary measures should be taken by evaluating the results of measurements. In order to prevent air pollution, pollutant parameters must be evaluated within the framework of a model. Recently, in order to obtain objective and more sensitive results with regard to air pollution nowadays, studies, which use machine learning algorithms in artificial intelligence technologies, have been carried out. In this study, PM10 concentrations, which are obtained from 7 stations in Ankara province in Turkey, were trained with machine learning algorithms (LASSO, SVR, RF, kNN, xGBoost, ANN). The PM10 concentrations of the years 2009-2017 of 6 stations in Ankara were given as input, and the PM10 concentrations of the seventh station for the year 2018 were predicted. The model development stage was repeated for each station, and the performance and error rates of the algorithms were determined by comparing the results produced by the algorithms with the actual results. The best results were provided with ANN (R2 = 0.58, RMSE = 20.8, MAE = 14.4). The spatial distribution of the estimated concentration results was provided through Geographic Information System (GIS), and spatial strategies for improving air pollution over land use were established.
Collapse
Affiliation(s)
- Aslı Bozdağ
- Faculty of Engineering, Department of Geomatics Engineering, Nigde Omer Halisdemir University, 51240, Nigde, Turkey
| | - Yeşim Dokuz
- Faculty of Engineering, Department of Computer Engineering, Nigde Omer Halisdemir University, 51240, Nigde, Turkey
| | - Öznur Begüm Gökçek
- Faculty of Engineering, Department of Environmental Engineering, Niğde Ömer Halisdemir University, 51240, Nigde, Turkey.
| |
Collapse
|
35
|
Abstract
Soil erosion is a serious threat to sustainable agriculture, food production, and environmental security. The advancement of accurate models for soil erosion susceptibility and hazard assessment is of utmost importance for enhancing mitigation policies and laws. This paper proposes novel machine learning (ML) models for the susceptibility mapping of the water erosion of soil. The weighted subspace random forest (WSRF), Gaussian process with a radial basis function kernel (Gaussprradial), and naive Bayes (NB) ML methods were used in the prediction of the soil erosion susceptibility. Data included 227 samples of erosion and non-erosion locations through field surveys to advance models of the spatial distribution using predictive factors. In this study, 19 effective factors of soil erosion were considered. The critical factors were selected using simulated annealing feature selection (SAFS). The critical factors included aspect, curvature, slope length, flow accumulation, rainfall erosivity factor, distance from the stream, drainage density, fault density, normalized difference vegetation index (NDVI), hydrologic soil group, soil texture, and lithology. The dataset cells of samples (70% for training and 30% for testing) were randomly prepared to assess the robustness of the different models. The functional relevance between soil erosion and effective factors was computed using the ML models. The ML models were evaluated using different metrics, including accuracy, the kappa coefficient, and the probability of detection (POD). The accuracies of the WSRF, Gaussprradial, and NB methods were 0.91, 0.88, and 0.85, respectively, for the testing data; 0.82, 0.76, and 0.71, respectively, for the kappa coefficient; and 0.94, 0.94, and 0.94, respectively, for POD. However, the ML models, especially the WSRF, had an acceptable performance regarding producing soil erosion susceptibility maps. Maps produced with the most robust models can be a useful tool for sustainable management, watershed conservation, and the reduction of soil and water loss.
Collapse
|
36
|
Hosseini FS, Choubin B, Mosavi A, Nabipour N, Shamshirband S, Darabi H, Haghighi AT. Flash-flood hazard assessment using ensembles and Bayesian-based machine learning models: Application of the simulated annealing feature selection method. THE SCIENCE OF THE TOTAL ENVIRONMENT 2020; 711:135161. [PMID: 31818576 DOI: 10.1016/j.scitotenv.2019.135161] [Citation(s) in RCA: 54] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/16/2019] [Revised: 10/21/2019] [Accepted: 10/23/2019] [Indexed: 05/28/2023]
Abstract
Flash-floods are increasingly recognized as a frequent natural hazard worldwide. Iran has been among the mostdevastated regions affected by the major floods. While the temporal flash-flood forecasting models are mainly developed for warning systems, the models for assessing hazardous areas can greatly contribute to adaptation and mitigation policy-making and disaster risk reduction. Former researches in the flash-flood hazard mapping have heightened the urge for the advancement of more accurate models. Thus, the current research proposes the state-of-the-art ensemble models of boosted generalized linear model (GLMBoost) and random forest (RF), and Bayesian generalized linear model (BayesGLM) methods for higher performance modeling. Furthermore, a pre-processing method, namely simulated annealing (SA), is used to eliminate redundant variables from the modeling process. Results of the modeling based on the hit and miss analysis indicates high performance for both models (accuracy = 90-92%, Kappa = 79-84%, Success ratio = 94-96%, Threat score = 80-84%, and Heidke skill score = 79-84%). The variables of distance from the stream, vegetation, drainage density, land use, and elevation have shown more contribution among others for modeling the flash-flood. The results of this study can significantly facilitate mapping the hazardous areas and further assist watershed managers to control and remediate induced damages of flood in the data-scarce regions.
Collapse
Affiliation(s)
- Farzaneh Sajedi Hosseini
- Department of Reclamation of Arid and Mountainous Regions, Faculty of Natural Resources, University of Tehran, Karaj, Iran
| | - Bahram Choubin
- Soil Conservation and Watershed Management Research Department, West Azarbaijan Agricultural and Natural Resources Research and Education Center, AREEO, Urmia, Iran
| | - Amir Mosavi
- School of the Built Environment, Oxford Brookes University, Oxford OX3 0BP, UK; Kalman Kando Faculty of Electrical Engineering, Obuda University, Budapest, Hungary
| | - Narjes Nabipour
- Institute of Research and Development, Duy Tan University, Da Nang 550000, Viet Nam.
| | - Shahaboddin Shamshirband
- Department for Management of Science and Technology Development, Ton Duc Thang University, Ho Chi Minh City, Viet Nam; Faculty of Information Technology, Ton Duc Thang University, Ho Chi Minh City, Viet Nam.
| | - Hamid Darabi
- Water, Energy and Environmental Engineering Research Unit, University of Oulu, P.O. Box 4300, FIN-90014 Oulu, Finland
| | - Ali Torabi Haghighi
- Water, Energy and Environmental Engineering Research Unit, University of Oulu, P.O. Box 4300, FIN-90014 Oulu, Finland
| |
Collapse
|
37
|
The Role of Urban Morphology Design on Enhancing Physical Activity and Public Health. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2020; 17:ijerph17072359. [PMID: 32244358 PMCID: PMC7178257 DOI: 10.3390/ijerph17072359] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/14/2020] [Revised: 03/26/2020] [Accepted: 03/27/2020] [Indexed: 12/28/2022]
Abstract
Along with environmental pollution, urban planning has been connected to public health. The research indicates that the quality of built environments plays an important role in reducing mental disorders and overall health. The structure and shape of the city are considered as one of the factors influencing happiness and health in urban communities and the type of the daily activities of citizens. The aim of this study was to promote physical activity in the main structure of the city via urban design in a way that the main form and morphology of the city can encourage citizens to move around and have physical activity within the city. Functional, physical, cultural-social, and perceptual-visual features are regarded as the most important and effective criteria in increasing physical activities in urban spaces, based on literature review. The environmental quality of urban spaces and their role in the physical activities of citizens in urban spaces were assessed by using the questionnaire tool and analytical network process (ANP) of structural equation modeling. Further, the space syntax method was utilized to evaluate the role of the spatial integration of urban spaces on improving physical activities. Based on the results, consideration of functional diversity, spatial flexibility and integration, security, and the aesthetic and visual quality of urban spaces plays an important role in improving the physical health of citizens in urban spaces. Further, more physical activities, including motivation for walking and the sense of public health and happiness, were observed in the streets having higher linkage and space syntax indexes with their surrounding texture.
Collapse
|
38
|
A Novel GIS-Based Random Forest Machine Algorithm for the Spatial Prediction of Shallow Landslide Susceptibility. FORESTS 2020. [DOI: 10.3390/f11010118] [Citation(s) in RCA: 34] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
This study developed and verified a new hybrid machine learning model, named random forest machine (RFM), for the spatial prediction of shallow landslides. RFM is a hybridization of two state-of-the-art machine learning algorithms, random forest classifier (RFC) and support vector machine (SVM), in which RFC is used to generate subsets from training data and SVM is used to build decision functions for these subsets. To construct and verify the hybrid RFM model, a shallow landslide database of the Lang Son area (northern Vietnam) was prepared. The database consisted of 101 shallow landslide polygons and 14 conditioning factors. The relevance of these factors for shallow landslide susceptibility modeling was assessed using the ReliefF method. Experimental results pointed out that the proposed RFM can help to achieve the desired prediction with an F1 score of roughly 0.96. The performance of the RFM was better than those of benchmark approaches, including the SVM, RFC, and logistic regression. Thus, the newly developed RFM is a promising tool to help local authorities in shallow landslide hazard mitigations.
Collapse
|
39
|
Extreme Learning Machine-Based Model for Solubility Estimation of Hydrocarbon Gases in Electrolyte Solutions. Processes (Basel) 2020. [DOI: 10.3390/pr8010092] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
Calculating hydrocarbon components solubility of natural gases is known as one of the important issues for operational works in petroleum and chemical engineering. In this work, a novel solubility estimation tool has been proposed for hydrocarbon gases—including methane, ethane, propane, and butane—in aqueous electrolyte solutions based on extreme learning machine (ELM) algorithm. Comparing the ELM outputs with a comprehensive real databank which has 1175 solubility points yielded R-squared values of 0.985 and 0.987 for training and testing phases respectively. Furthermore, the visual comparison of estimated and actual hydrocarbon solubility led to confirm the ability of proposed solubility model. Additionally, sensitivity analysis has been employed on the input variables of model to identify their impacts on hydrocarbon solubility. Such a comprehensive and reliable study can help engineers and scientists to successfully determine the important thermodynamic properties, which are key factors in optimizing and designing different industrial units such as refineries and petrochemical plants.
Collapse
|
40
|
Modeling Pan Evaporation Using Gaussian Process Regression K-Nearest Neighbors Random Forest and Support Vector Machines; Comparative Analysis. ATMOSPHERE 2020. [DOI: 10.3390/atmos11010066] [Citation(s) in RCA: 62] [Impact Index Per Article: 15.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Evaporation is a very important process; it is one of the most critical factors in agricultural, hydrological, and meteorological studies. Due to the interactions of multiple climatic factors, evaporation is considered as a complex and nonlinear phenomenon to model. Thus, machine learning methods have gained popularity in this realm. In the present study, four machine learning methods of Gaussian Process Regression (GPR), K-Nearest Neighbors (KNN), Random Forest (RF) and Support Vector Regression (SVR) were used to predict the pan evaporation (PE). Meteorological data including PE, temperature (T), relative humidity (RH), wind speed (W), and sunny hours (S) collected from 2011 through 2017. The accuracy of the studied methods was determined using the statistical indices of Root Mean Squared Error (RMSE), correlation coefficient (R) and Mean Absolute Error (MAE). Furthermore, the Taylor charts utilized for evaluating the accuracy of the mentioned models. The results of this study showed that at Gonbad-e Kavus, Gorgan and Bandar Torkman stations, GPR with RMSE of 1.521 mm/day, 1.244 mm/day, and 1.254 mm/day, KNN with RMSE of 1.991 mm/day, 1.775 mm/day, and 1.577 mm/day, RF with RMSE of 1.614 mm/day, 1.337 mm/day, and 1.316 mm/day, and SVR with RMSE of 1.55 mm/day, 1.262 mm/day, and 1.275 mm/day had more appropriate performances in estimating PE values. It was found that GPR for Gonbad-e Kavus Station with input parameters of T, W and S and GPR for Gorgan and Bandar Torkmen stations with input parameters of T, RH, W and S had the most accurate predictions and were proposed for precise estimation of PE. The findings of the current study indicated that the PE values may be accurately estimated with few easily measured meteorological parameters.
Collapse
|
41
|
Comparative Analysis of Machine Learning Models for Prediction of Remaining Service Life of Flexible Pavement. MATHEMATICS 2019. [DOI: 10.3390/math7121198] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Prediction of the remaining service life (RSL) of pavement is a challenging task for road maintenance and transportation engineering. The prediction of the RSL estimates the time that a major repair or reconstruction becomes essential. The conventional approach to predict RSL involves using non-destructive tests. These tests, in addition to being costly, interfere with traffic flow and compromise operational safety. In this paper, surface distresses of pavement are used to estimate the RSL to address the aforementioned challenges. To implement the proposed theory, 105 flexible pavement segments are considered. For each pavement segment, the type, severity, and extent of surface damage and the pavement condition index (PCI) were determined. The pavement RSL was then estimated using non-destructive tests include falling weight deflectometer (FWD) and ground-penetrating radar (GPR). After completing the dataset, the modeling was conducted to predict RSL using three techniques include support vector regression (SVR), support vector regression optimized by the fruit fly optimization algorithm (SVR-FOA), and gene expression programming (GEP). All three techniques estimated the RSL of the pavement by selecting the PCI as input. The correlation coefficient (CC), Nash–Sutcliffe efficiency (NSE), scattered index (SI), and Willmott’s index of agreement (WI) criteria were used to examine the performance of the three techniques adopted in this study. In the end, it was found that GEP with values of 0.874, 0.598, 0.601, and 0.807 for CC, SI, NSE, and WI criteria, respectively, had the highest accuracy in predicting the RSL of pavement.
Collapse
|
42
|
Choubin B, Mosavi A, Alamdarloo EH, Hosseini FS, Shamshirband S, Dashtekian K, Ghamisi P. Earth fissure hazard prediction using machine learning models. ENVIRONMENTAL RESEARCH 2019; 179:108770. [PMID: 31577962 DOI: 10.1016/j.envres.2019.108770] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/08/2019] [Revised: 09/19/2019] [Accepted: 09/22/2019] [Indexed: 06/10/2023]
Abstract
Earth fissures are the cracks on the surface of the earth mainly formed in the arid and the semi-arid basins. The excessive withdrawal of groundwater, as well as the other underground natural resources, has been introduced as the significant causing of land subsidence and potentially, the earth fissuring. Fissuring is rapidly turning into the nations' major disasters which are responsible for significant economic, social, and environmental damages with devastating consequences. Modeling the earth fissure hazard is particularly important for identifying the vulnerable groundwater areas for the informed water management, and effectively enforce the groundwater recharge policies toward the sustainable conservation plans to preserve existing groundwater resources. Modeling the formation of earth fissures and ultimately prediction of the hazardous areas has been greatly challenged due to the complexity, and the multidisciplinary involved to predict the earth fissures. This paper aims at proposing novel machine learning models for prediction of earth fissuring hazards. The Simulated annealing feature selection (SAFS) method was applied to identify key features, and the generalized linear model (GLM), multivariate adaptive regression splines (MARS), classification and regression tree (CART), random forest (RF), and support vector machine (SVM) have been used for the first time to build the prediction models. Results indicated that all the models had good accuracy (>86%) and precision (>81%) in the prediction of the earth fissure hazard. The GLM model (as a linear model) had the lowest performance, while the RF model was the best model in the modeling process. Sensitivity analysis indicated that the hazardous class in the study area was mainly related to low elevations with characteristics of high groundwater withdrawal, drop in groundwater level, high well density, high road density, low precipitation, and Quaternary sediments distribution.
Collapse
Affiliation(s)
- Bahram Choubin
- Soil Conservation and Watershed Management Research Department, West Azarbaijan Agricultural and Natural Resources Research and Education Center, AREEO, Urmia, Iran
| | - Amir Mosavi
- School of the Built Environment, Oxford Brookes University, Oxford, OX30BP, UK; Kalman Kando Faculty of Electrical Engineering, Obuda University, Budapest, Hungary
| | - Esmail Heydari Alamdarloo
- Department of Reclamation of Arid and Mountainous Regions, Faculty of Natural Resources, University of Tehran, Karaj, Iran
| | - Farzaneh Sajedi Hosseini
- Department of Reclamation of Arid and Mountainous Regions, Faculty of Natural Resources, University of Tehran, Karaj, Iran
| | - Shahaboddin Shamshirband
- Department for Management of Science and Technology Development, Ton Duc Thang University, Ho Chi Minh City, Viet Nam; Faculty of Information Technology, Ton Duc Thang University, Ho Chi Minh City, Viet Nam.
| | - Kazem Dashtekian
- Yazd Agricultural and Natural Resources Research Center, AREEO, Yazd, Iran
| | - Pedram Ghamisi
- Exploration Devision, Helmholtz Institute Freiberg for Resource Technology, Helmholtz-Zentrum Dresden-Rossendorf Helmholtz Institute Freiberg for Resource Technology, Freiberg, Germany
| |
Collapse
|
43
|
Developing an ANFIS-PSO Model to Predict Mercury Emissions in Combustion Flue Gases. MATHEMATICS 2019. [DOI: 10.3390/math7100965] [Citation(s) in RCA: 31] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Abstract
Accurate prediction of mercury content emitted from fossil-fueled power stations is of the utmost importance for environmental pollution assessment and hazard mitigation. In this paper, mercury content in the output gas of power stations’ boilers was predicted using an adaptive neuro-fuzzy inference system (ANFIS) method integrated with particle swarm optimization (PSO). The input parameters of the model included coal characteristics and the operational parameters of the boilers. The dataset was collected from 82 sample points in power plants and employed to educate and examine the proposed model. To evaluate the performance of the proposed hybrid model of the ANFIS-PSO, the statistical meter of MARE% was implemented, which resulted in 0.003266 and 0.013272 for training and testing, respectively. Furthermore, relative errors between the acquired data and predicted values were between −0.25% and 0.1%, which confirm the accuracy of the model to deal non-linearity and represent the dependency of flue gas mercury content into the specifications of coal and the boiler type.
Collapse
|