Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Zamani Joharestani M, Cao C, Ni X, Bashir B, Talebiesfandarani S. PM2.5 Prediction Based on Random Forest, XGBoost, and Deep Learning Using Multisource Remote Sensing Data. Atmosphere 2019;10:373. [DOI: 10.3390/atmos10070373] [Citation(s) in RCA: 86] [Impact Index Per Article: 17.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]

For:	Zamani Joharestani M, Cao C, Ni X, Bashir B, Talebiesfandarani S. PM2.5 Prediction Based on Random Forest, XGBoost, and Deep Learning Using Multisource Remote Sensing Data. Atmosphere 2019;10:373. [DOI: 10.3390/atmos10070373] [Citation(s) in RCA: 86] [Impact Index Per Article: 17.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]

Number

Cited by Other Article(s)

Zhang Y, Du S, Guan L, Chen X, Lei L, Liu L. Estimating global 0.1° scale gridded anthropogenic CO₂ emissions using TROPOMI NO₂ and a data-driven method. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024;949:175177. [PMID: 39094662 DOI: 10.1016/j.scitotenv.2024.175177] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/24/2023] [Revised: 07/03/2024] [Accepted: 07/29/2024] [Indexed: 08/04/2024]

Fatima M, Ahmad A, Butt I, Arshad S, Kiani B. Geospatial modelling of ambient air pollutants and chronic obstructive pulmonary diseases at regional scale in Pakistan. ENVIRONMENTAL MONITORING AND ASSESSMENT 2024;196:929. [PMID: 39271595 DOI: 10.1007/s10661-024-13105-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/10/2024] [Accepted: 09/06/2024] [Indexed: 09/15/2024]

Zhang K, Lin J, Li Y, Sun Y, Tong W, Li F, Chien LC, Yang Y, Su WC, Tian H, Fu P, Qiao F, Romeiko XX, Lin S, Luo S, Craft E. Unmasking the sky: high-resolution PM_2.5 prediction in Texas using machine learning techniques. JOURNAL OF EXPOSURE SCIENCE & ENVIRONMENTAL EPIDEMIOLOGY 2024;34:814-820. [PMID: 38561475 DOI: 10.1038/s41370-024-00659-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/20/2023] [Revised: 03/06/2024] [Accepted: 03/07/2024] [Indexed: 04/04/2024]

Abstract

BACKGROUND

Although PM2.5 (fine particulate matter with an aerodynamic diameter less than 2.5 µm) is an air pollutant of great concern in Texas, limited regulatory monitors pose a significant challenge for decision-making and environmental studies.

OBJECTIVE

This study aimed to predict PM2.5 concentrations at a fine spatial scale on a daily basis by using novel machine learning approaches and incorporating satellite-derived Aerosol Optical Depth (AOD) and a variety of weather and land use variables.

METHODS

We compiled a comprehensive dataset in Texas from 2013 to 2017, including ground-level PM2.5 concentrations from regulatory monitors; AOD values at 1-km resolution based on images retrieved from the MODIS satellite; and weather, land-use, population density, among others. We built predictive models for each year separately to estimate PM2.5 concentrations using two machine learning approaches called gradient boosted trees and random forest. We evaluated the model prediction performance using in-sample and out-of-sample validations.

RESULTS

Our predictive models demonstrate excellent in-sample model performance, as indicated by high R2 values generated from the gradient boosting models (0.94-0.97) and random forest models (0.81-0.90). However, the out-of-sample R2 values fall within a range of 0.52-0.75 for gradient boosting models and 0.44-0.69 for random forest models. Model performance varies slightly across years. A generally decreasing trend in predicted PM2.5 concentrations over time is observed in Eastern Texas.

IMPACT STATEMENT

We utilized machine learning approaches to predict PM2.5 levels in Texas. Both gradient boosting and random forest models perform well. Gradient boosting models perform slightly better than random forest models. Our models showed excellent in-sample prediction performance (R2 > 0.9).

Collapse

Affiliation(s)

Kai Zhang Department of Environmental Health Sciences, School of Public Health,University at Albany, State University of New York, Rensselaer, NY, USA.
Jeffrey Lin Department of Biostatistics and Data Science, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA
Yuanfei Li Asian Demographic Research Institute, Shanghai University, Shanghai, China
Yue Sun Department of International Development, Community, and Environment, Clark University, Worcester, MA, USA
Weitian Tong Department of Computer Science, Georgia Southern University, Statesboro, GA, USA
Fangyu Li Department of Epidemiology, Human Genetics, and Environmental Sciences, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA
Lung-Chang Chien Department of Epidemiology and Biostatistics, School of Public Health, University of Nevada, Las Vegas, Las Vegas, NV, USA
Yiping Yang Department of Biostatistics and Data Science, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA
Wei-Chung Su Department of Epidemiology, Human Genetics, and Environmental Sciences, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA
Hezhong Tian State Key Joint Laboratory of Environmental Simulation & Pollution Control, School of Environment, Beijing Normal University, Beijing, China Center for Atmospheric Environmental Studies, Beijing Normal University, Beijing, China
Peng Fu Department of Plant Biology, University of Illinois, Urbana, IL, USA Center for Economy, Environment, and Energy, Harrisburg University, Harrisburg, PA, USA
Fengxiang Qiao Innovative Transportation Research Institute, Texas Southern University, Houston, TX, USA
Xiaobo Xue Romeiko Department of Environmental Health Sciences, School of Public Health,University at Albany, State University of New York, Rensselaer, NY, USA
Shao Lin Department of Environmental Health Sciences, School of Public Health,University at Albany, State University of New York, Rensselaer, NY, USA
Sheng Luo Department of Biostatistics & Bioinformatics, Duke University, Durham, NC, USA
Elena Craft Health Effects Institute, Boston, MA, USA

Collapse

Rakholia R, Le Q, Vu K, Ho BQ, Carbajo RS. Accurate PM_2.5 urban air pollution forecasting using multivariate ensemble learning Accounting for evolving target distributions. CHEMOSPHERE 2024;364:143097. [PMID: 39154769 DOI: 10.1016/j.chemosphere.2024.143097] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/19/2024] [Revised: 07/28/2024] [Accepted: 08/13/2024] [Indexed: 08/20/2024]

Zalzal J, Minet L, Brook J, Mihele C, Chen H, Hatzopoulou M. Capturing Exposure Disparities with Chemical Transport Models: Evaluating the Suitability of Downscaling Using Land Use Regression. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2024. [PMID: 39092553 DOI: 10.1021/acs.est.4c03725] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/04/2024]

Mohammadi Dashtaki N, Mirahmadizadeh A, Fararouei M, Mohammadi Dashtaki R, Hoseini M, Nayeb MR. The Lag -Effects of Air Pollutants and Meteorological Factors on COVID-19 Infection Transmission and Severity: Using Machine Learning Techniques. J Res Health Sci 2024;24:e00622. [PMID: 39311105 PMCID: PMC11380733 DOI: 10.34172/jrhs.2024.157] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2024] [Revised: 02/12/2024] [Accepted: 05/20/2024] [Indexed: 09/27/2024] Open

Lim B, Song W. Exploring CrossFit performance prediction and analysis via extensive data and machine learning. J Sports Med Phys Fitness 2024;64:640-649. [PMID: 38916087 DOI: 10.23736/s0022-4707.24.15786-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/26/2024]

Venkatraman Jagatha J, Schneider C, Sauter T. Parsimonious Random-Forest-Based Land-Use Regression Model Using Particulate Matter Sensors in Berlin, Germany. SENSORS (BASEL, SWITZERLAND) 2024;24:4193. [PMID: 39000970 PMCID: PMC11244214 DOI: 10.3390/s24134193] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/16/2024] [Revised: 06/07/2024] [Accepted: 06/21/2024] [Indexed: 07/16/2024]

Abstract

Machine learning (ML) methods are widely used in particulate matter prediction modelling, especially through use of air quality sensor data. Despite their advantages, these methods' black-box nature obscures the understanding of how a prediction has been made. Major issues with these types of models include the data quality and computational intensity. In this study, we employed feature selection methods using recursive feature elimination and global sensitivity analysis for a random-forest (RF)-based land-use regression model developed for the city of Berlin, Germany. Land-use-based predictors, including local climate zones, leaf area index, daily traffic volume, population density, building types, building heights, and street types were used to create a baseline RF model. Five additional models, three using recursive feature elimination method and two using a Sobol-based global sensitivity analysis (GSA), were implemented, and their performance was compared against that of the baseline RF model. The predictors that had a large effect on the prediction as determined using both the methods are discussed. Through feature elimination, the number of predictors were reduced from 220 in the baseline model to eight in the parsimonious models without sacrificing model performance. The model metrics were compared, which showed that the parsimonious_GSA-based model performs better than does the baseline model and reduces the mean absolute error (MAE) from 8.69 µg/m3 to 3.6 µg/m3 and the root mean squared error (RMSE) from 9.86 µg/m3 to 4.23 µg/m3 when applying the trained model to reference station data. The better performance of the GSA_parsimonious model is made possible by the curtailment of the uncertainties propagated through the model via the reduction of multicollinear and redundant predictors. The parsimonious model validated against reference stations was able to predict the PM2.5 concentrations with an MAE of less than 5 µg/m3 for 10 out of 12 locations. The GSA_parsimonious performed best in all model metrics and improved the R2 from 3% in the baseline model to 17%. However, the predictions exhibited a degree of uncertainty, making it unreliable for regional scale modelling. The GSA_parsimonious model can nevertheless be adapted to local scales to highlight the land-use parameters that are indicative of PM2.5 concentrations in Berlin. Overall, population density, leaf area index, and traffic volume are the major predictors of PM2.5, while building type and local climate zones are the less significant predictors. Feature selection based on sensitivity analysis has a large impact on the model performance. Optimising models through sensitivity analysis can enhance the interpretability of the model dynamics and potentially reduce computational costs and time when modelling is performed for larger areas.

Collapse

Jitkajornwanich K, Vijaranakul N, Jaiyen S, Srestasathiern P, Lawawirojwong S. Enhancing risk communication and environmental crisis management through satellite imagery and AI for air quality index estimation. MethodsX 2024;12:102611. [PMID: 38420115 PMCID: PMC10901142 DOI: 10.1016/j.mex.2024.102611] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2023] [Accepted: 02/10/2024] [Indexed: 03/02/2024] Open

Abstract

Due to climate change, the air pollution problem has become more and more prominent [23]. Air pollution has impacts on people globally, and is considered one of the leading risk factors for premature death worldwide; it was ranked as number 4 according to the website [24]. A study, 'The Global Burden of Disease,' reported 4,506,193 deaths were caused by outdoor air pollution in 2019 [22,25]. The air pollution problem is become even more apparent when it comes to developing countries [22], including Thailand, which is considered one of the developing countries [26]. In this research, we focus and analyze the air pollution in Thailand, which has the annual average PM2.5 (particulate matter 2.5) concentration falls in between 15 and 25, classified as the interim target 2 by 2021's WHO AQG (World Health Organization's Air Quality Guidelines) [27]. (The interim targets refer to areas where the air pollutants concentration is high, with 1 being the highest concentration and decreasing down to 4 [27,28]). However, the methodology proposed here can also be adopted in other areas as well. During the winter in Thailand, Bangkok and its surrounding metroplex have been facing the issue of air pollution (e.g., PM2.5) every year. Currently, air quality measurement is done by simply implementing physical air quality measurement devices at designated-but limited number of locations. In this work, we propose a method that allows us to estimate the Air Quality Index (AQI) on a larger scale by utilizing Landsat 8 images with machine learning techniques. We propose and compare hybrid models with pure regression models to enhance AQI prediction based on satellite images. Our hybrid model consists of two parts as follows:•The classification part and the estimation part, whereas the pure regressor model consists of only one part, which is a pure regression model for AQI estimation.•The two parts of the hybrid model work hand in hand such that the classification part classifies data points into each class of air quality standard, which is then passed to the estimation part to estimate the final AQI. From our experiments, after considering all factors and comparing their performances, we conclude that the hybrid model has a slightly better performance than the pure regressor model, although both models can achieve a generally minimum R2 (R2 > 0.7). We also introduced and tested an additional factor, DOY (day of year), and incorporated it into our model. Additional experiments with similar approaches are also performed and compared. And, the results also show that our hybrid model outperform them. Keywords: climate change, air pollution, air quality assessment, air quality index, AQI, machine learning, AI, Landsat 8, satellite imagery analysis, environmental data analysis, natural disaster monitoring and management, crisis and disaster management and communication.

Collapse

Xia Y, McCracken T, Liu T, Chen P, Metcalf A, Fan C. Understanding the Disparities of PM2.5 Air Pollution in Urban Areas via Deep Support Vector Regression. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2024;58:8404-8416. [PMID: 38698567 DOI: 10.1021/acs.est.3c09177] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/05/2024]

Zhao S, Chen K, Xiong B, Guo C, Dang Z. Prediction of adsorption of metal cations by clay minerals using machine learning. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024;924:171733. [PMID: 38492590 DOI: 10.1016/j.scitotenv.2024.171733] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/30/2023] [Revised: 02/24/2024] [Accepted: 03/13/2024] [Indexed: 03/18/2024]

Shi TL, Jia KH, Bao YT, Nie S, Tian XC, Yan XM, Chen ZY, Li ZC, Zhao SW, Ma HY, Zhao Y, Li X, Zhang RG, Guo J, Zhao W, El-Kassaby YA, Müller N, Van de Peer Y, Wang XR, Street NR, Porth I, An X, Mao JF. High-quality genome assembly enables prediction of allele-specific gene expression in hybrid poplar. PLANT PHYSIOLOGY 2024;195:652-670. [PMID: 38412470 PMCID: PMC11060683 DOI: 10.1093/plphys/kiae078] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/20/2023] [Revised: 01/08/2024] [Accepted: 01/09/2024] [Indexed: 02/29/2024]

Affiliation(s)

Tian-Le Shi State Key Laboratory of Tree Genetics and Breeding, National Engineering Research Center of Tree Breeding and Ecological Restoration, Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, National Engineering Laboratory for Tree Breeding, Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, China
Kai-Hua Jia State Key Laboratory of Tree Genetics and Breeding, National Engineering Research Center of Tree Breeding and Ecological Restoration, Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, National Engineering Laboratory for Tree Breeding, Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, China Key Laboratory of Crop Genetic Improvement & Ecology and Physiology, Institute of Crop Germplasm Resources, Shandong Academy of Agricultural Sciences, Ji’nan 250100, China
Yu-Tao Bao State Key Laboratory of Tree Genetics and Breeding, National Engineering Research Center of Tree Breeding and Ecological Restoration, Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, National Engineering Laboratory for Tree Breeding, Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, China
Shuai Nie Rice Research Institute, Guangdong Academy of Agricultural Sciences & Key Laboratory of Genetics and Breeding of High Quality Rice in Southern China (Co-construction by Ministry and Province), Ministry of Agriculture and Rural Affairs & Guangdong Key Laboratory of New Technology in Rice Breeding, Guangzhou 510640, China
Xue-Chan Tian State Key Laboratory of Tree Genetics and Breeding, National Engineering Research Center of Tree Breeding and Ecological Restoration, Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, National Engineering Laboratory for Tree Breeding, Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, China
Xue-Mei Yan State Key Laboratory of Tree Genetics and Breeding, National Engineering Research Center of Tree Breeding and Ecological Restoration, Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, National Engineering Laboratory for Tree Breeding, Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, China
Zhao-Yang Chen State Key Laboratory of Tree Genetics and Breeding, National Engineering Research Center of Tree Breeding and Ecological Restoration, Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, National Engineering Laboratory for Tree Breeding, Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, China
Zhi-Chao Li State Key Laboratory of Tree Genetics and Breeding, National Engineering Research Center of Tree Breeding and Ecological Restoration, Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, National Engineering Laboratory for Tree Breeding, Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, China
Shi-Wei Zhao State Key Laboratory of Tree Genetics and Breeding, National Engineering Research Center of Tree Breeding and Ecological Restoration, Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, National Engineering Laboratory for Tree Breeding, Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, China
Hai-Yao Ma State Key Laboratory of Tree Genetics and Breeding, National Engineering Research Center of Tree Breeding and Ecological Restoration, Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, National Engineering Laboratory for Tree Breeding, Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, China
Ye Zhao State Key Laboratory of Tree Genetics and Breeding, National Engineering Research Center of Tree Breeding and Ecological Restoration, Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, National Engineering Laboratory for Tree Breeding, Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, China
Xiang Li School of Agriculture, Ningxia University, Yinchuan 750021, China
Ren-Gang Zhang Yunnan Key Laboratory for Integrative Conservation of Plant Species with Extremely Small Populations, Key Laboratory for Plant Diversity and Biogeography of East Asia, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming 650201, Yunnan, China
Jing Guo College of Forestry, Shandong Agricultural University, Tai’an 271000, China
Wei Zhao Umeå Plant Science Centre, Department of Ecology and Environmental Science, Umeå University, SE-901 87 Umeå, Sweden
Yousry Aly El-Kassaby Department of Forest and Conservation Sciences, Faculty of Forestry, University of British Columbia, Vancouver, Bc, V6T 1Z4, Canada
Niels Müller Thünen-Institute of Forest Genetics, 22927 Grosshansdorf, Germany
Yves Van de Peer Department of Plant Biotechnology and Bioinformatics, Ghent University, 9052 Ghent, Belgium VIB Center for Plant Systems Biology, 9052 Ghent, Belgium Centre for Microbial Ecology and Genomics, Department of Biochemistry, Genetics and Microbiology, University of Pretoria, Pretoria 0028, South Africa College of Horticulture, Academy for Advanced Interdisciplinary Studies, Nanjing Agricultural University, Nanjing 210095, China
Xiao-Ru Wang Umeå Plant Science Centre, Department of Ecology and Environmental Science, Umeå University, SE-901 87 Umeå, Sweden
Nathaniel Robert Street Umeå Plant Science Centre, Department of Plant Physiology, Umeå University, SE-901 87 Umeå, Sweden
Ilga Porth Départment des Sciences du Bois et de la Forêt, Faculté de Foresterie, de Géographie et Géomatique, Université Laval, Québec, QC G1V 0A6, Canada
Xinmin An State Key Laboratory of Tree Genetics and Breeding, National Engineering Research Center of Tree Breeding and Ecological Restoration, Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, National Engineering Laboratory for Tree Breeding, Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, China
Jian-Feng Mao State Key Laboratory of Tree Genetics and Breeding, National Engineering Research Center of Tree Breeding and Ecological Restoration, Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, National Engineering Laboratory for Tree Breeding, Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, China Umeå Plant Science Centre, Department of Plant Physiology, Umeå University, SE-901 87 Umeå, Sweden

Collapse

Wood DA. Trend-attribute forecasting of hourly PM2.5 trends in fifteen cities of Central England applying optimized machine learning feature selection. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2024;356:120561. [PMID: 38479290 DOI: 10.1016/j.jenvman.2024.120561] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/29/2023] [Revised: 02/18/2024] [Accepted: 03/05/2024] [Indexed: 04/07/2024]

Abstract

Recorded particulate matter (PM2.5) hourly trends are compared for fifteen urban recording sites distributed across central England for the period 2018 to 2022. They include 10 urban-background and five urban-traffic (roadside) sites with some located within the same urban area. The sites all show consistent background and peak distributions with mean annual values and standard deviations higher for 2018 and 2019 than for 2020 to 2022. The objective of this study is to demonstrate that trend attributes extracted from hourly recorded univariate PM2.5 trends at these sites can be used to provide reliable short-term hourly predictions and provide valuable insight into the regional variations in the recorded trends. Fifteen trend attributes extracted from the prior 12 h (t-1 to t-12) of recorded PM2.5 data were compiled and used as input to four supervised machine learning models (SML) to forecast PM2.5 concentrations up to 13 h ahead (t0 to t+12). All recording sites delivered forecasts with similar ranges of error levels for specific hours ahead which are consistent with their PM2.5 recorded ranges. Forecasting results for four representative sites are presented in detail using models trained and cross-validated with 2020 and 2021 hourly data to forecast 2021 and 2022 hourly data, respectively. A novel optimized feature selection procedure using a suite of five optimizers is used to improve the efficiency of the forecasting models. The LASSO and support vector regression models generate the best and most generalizable hourly PM2.5 forecasts from trained and validated SML models with mean average error (MAE) of between ∼1 and ∼3 μg/m3 for t0 to t+3 h ahead. A novel overfitting indicator, exploiting the cross-validation mean values, demonstrates that these two models are not affected by overfitting. Forecasts for t+6 to t+12 h forward generate higher MAE values between ∼3 and ∼4 μg/m3 due to their tendency to underestimate some of the extreme PM2.5 peaks. These findings indicate that further model refinements are required to generate more reliable short-term predictions for the t+6 to t+24 h ahead.

Collapse

Ma Z, Wang B, Luo W, Jiang J, Liu D, Wei H, Luo H. Air pollutant prediction model based on transfer learning two-stage attention mechanism. Sci Rep 2024;14:7385. [PMID: 38548823 PMCID: PMC10978953 DOI: 10.1038/s41598-024-57784-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2023] [Accepted: 03/21/2024] [Indexed: 04/01/2024] Open

Ayinde BO, Musa MR, Ayinde AAO. Application of machine learning models and landsat 8 data for estimating seasonal pm 2.5 concentrations. Environ Anal Health Toxicol 2024;39:e2024011-0. [PMID: 38631403 PMCID: PMC11079408 DOI: 10.5620/eaht.2024011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2023] [Accepted: 03/12/2024] [Indexed: 04/19/2024] Open

Brahimi N, Zhang H, Zaidi SDA, Dai L. A Unified Spatio-Temporal Inference Network for Car-Sharing Serial Prediction. SENSORS (BASEL, SWITZERLAND) 2024;24:1266. [PMID: 38400424 PMCID: PMC10892602 DOI: 10.3390/s24041266] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/18/2024] [Revised: 02/07/2024] [Accepted: 02/14/2024] [Indexed: 02/25/2024]

Guastavino S, Piana M, Benvenuto F. Bad and Good Errors: Value-Weighted Skill Scores in Deep Ensemble Learning. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024;35:1993-2002. [PMID: 35776819 DOI: 10.1109/tnnls.2022.3186068] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]

Verma A, Ranga V, Vishwakarma DK. A novel approach for forecasting PM2.5 pollution in Delhi using CATALYST. ENVIRONMENTAL MONITORING AND ASSESSMENT 2023;195:1457. [PMID: 37950817 DOI: 10.1007/s10661-023-12020-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/01/2023] [Accepted: 10/23/2023] [Indexed: 11/13/2023]

Abstract

Air pollution is one of the main environmental issues in densely populated urban areas like Delhi. Predictions of the PM2.5 concentration must be accurate for pollution reduction strategies and policy actions to succeed. This research article presents a novel approach for forecasting PM2.5 pollution in Delhi by combining a pre-trained CNN model with a transformer-based model called CATALYST (Convolutional and Transformer model for Air Quality Forecasting). This proposed strategy uses a mixture of the two models. To derive attributes of the PM2.5 timeline of data, a pre-existing CNN model is utilized to transform the data into visual representations, which are analyzed subsequently. The CATALYST model is trained to predict future PM2.5 pollution levels using a sliding window training approach on extracted features. The model is utilized for analyzing temporal dependencies in PM2.5 time-series data. This model incorporates the advancements in the transformer-based architecture initially designed for natural language processing applications. CATALYST combines positional encoding with the Transformer architecture to capture intricate patterns and variations resulting from diverse meteorological, geographical, and anthropogenic factors. In addition, an innovative approach is suggested for building input-output couples, intending to address the problem of missing or partial data in environmental time-series datasets while ensuring that all training data blocks are comprehensive. On a PM2.5 dataset, we analyze the proposed CATALYST model and compare its performance with other standard time-series forecasting approaches, such as ARIMA and LSTM. The outcomes of the experiments demonstrate that the suggested model works better than conventional methods and is a potential strategy for accurately forecasting PM2.5 pollution. The applicability of CATALYST to real-world scenarios can be tested by running more experiments on real-world datasets. This can help develop efficient pollution mitigation measures, impacting public health and environmental sustainability.

Collapse

Kieu HT, Pak HY, Trinh HL, Pang DSC, Khoo E, Law AWK. UAV-based remote sensing of turbidity in coastal environment for regulatory monitoring and assessment. MARINE POLLUTION BULLETIN 2023;196:115482. [PMID: 37864857 DOI: 10.1016/j.marpolbul.2023.115482] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/23/2023] [Revised: 08/30/2023] [Accepted: 09/01/2023] [Indexed: 10/23/2023]

Morapedi TD, Obagbuwa IC. Air pollution particulate matter (PM2.5) prediction in South African cities using machine learning techniques. Front Artif Intell 2023;6:1230087. [PMID: 37881653 PMCID: PMC10595005 DOI: 10.3389/frai.2023.1230087] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2023] [Accepted: 09/04/2023] [Indexed: 10/27/2023] Open

Abstract

Background

Air pollution contributes to the most severe environmental and health problems due to industrial emissions and atmosphere contamination, produced by climate and traffic factors, fossil fuel combustion, and industrial characteristics. Because this is a global issue, several nations have established control of air pollution stations in various cities to monitor pollutants like Nitrogen Dioxide (NO2), Ozone (O3), Sulfur Dioxide (SO2), Carbon Monoxide (CO), Particulate Matter (PM2.5, PM10), to notify inhabitants when pollution levels surpass the quality threshold. With the rise in air pollution, it is necessary to construct models to capture data on air pollutant concentrations. Compared to other parts of the world, Africa has a scarcity of reliable air quality sensors for monitoring and predicting Particulate Matter (PM2.5). This demonstrates the possibility of extending research in air pollution control.

Methods

Machine learning techniques were utilized in this study to identify air pollution in terms of time, cost, and efficiency so that different scenarios and systems may select the optimal way for their needs. To assess and forecast the behavior of Particulate Matter (PM2.5), this study presented a Machine Learning approach that includes Cat Boost Regressor, Extreme Gradient Boosting Regressor, Random Forest Classifier, Logistic Regression, Support Vector Machine, K-Nearest Neighbor, and Decision Tree.

Results

Cat Boost Regressor and Extreme Gradient Boosting Regressor were implemented to predict the latest PM2.5 concentrations for South African Cities with recording stations using past dated recordings, then the best performing model between the two is used to predict PM2.5 concentrations for South African Cities with no recording stations and also to predict future PM2.5 concentrations for South African Cities. K-Nearest Neighbor, Logistic Regression, Support Vector Machine, Decision Tree, and Random Forest Classifier were implemented to create a system predicting the Air Quality Index (AQI) Status.

Conclusion

This study investigated various machine learning techniques for air pollution to analyze and predict air pollution behavior regarding air quality and air pollutants, detecting which areas are most affected in South African cities.

Collapse

Guo Q, Zhang H, Zhang Y, Jiang X. Prediction of PM_2.5 concentration based on the CEEMDAN-RLMD-BiLSTM-LEC model. PeerJ 2023;11:e15931. [PMID: 37663301 PMCID: PMC10470446 DOI: 10.7717/peerj.15931] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2023] [Accepted: 07/30/2023] [Indexed: 09/05/2023] Open

Zhen Y, Wang L, Sun H, Liu C. Prediction of microplastic abundance in surface water of the ocean and influencing factors based on ensemble learning. ENVIRONMENTAL POLLUTION (BARKING, ESSEX : 1987) 2023;331:121834. [PMID: 37209894 DOI: 10.1016/j.envpol.2023.121834] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/01/2023] [Revised: 04/18/2023] [Accepted: 05/13/2023] [Indexed: 05/22/2023]

Lu Y, Li K. Multistation collaborative prediction of air pollutants based on the CNN-BiLSTM model. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2023;30:92417-92435. [PMID: 37490250 DOI: 10.1007/s11356-023-28877-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/02/2023] [Accepted: 07/16/2023] [Indexed: 07/26/2023]

Zhang Y, Wu W, Li Y, Li Y. An investigation of PM2.5 concentration changes in Mid-Eastern China before and after COVID-19 outbreak. ENVIRONMENT INTERNATIONAL 2023;175:107941. [PMID: 37146469 PMCID: PMC10119641 DOI: 10.1016/j.envint.2023.107941] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/25/2023] [Revised: 03/24/2023] [Accepted: 04/17/2023] [Indexed: 05/07/2023]

Abstract

With the Chinese government revising ambient air quality standards and strengthening the monitoring and management of pollutants such as PM_2.5, the concentrations of air pollutants in China have gradually decreased in recent years. Meanwhile, the strong control measures taken by the Chinese government in the face of COVID-19 in 2020 have an extremely profound impact on the reduction of pollutants in China. Therefore, investigations of pollutant concentration changes in China before and after COVID-19 outbreak are very necessary and concerning, but the number of monitoring stations is very limited, making it difficult to conduct a high spatial density investigation. In this study, we construct a modern deep learning model based on multi-source data, which includes remotely sensed AOD data products, other reanalysis element data, and ground monitoring station data. Combining satellite remote sensing techniques, we finally realize a high spital density PM_2.5 concentration change investigation method, and analyze the seasonal and annual, the spatial and temporal characteristics of PM_2.5 concentrations in Mid-Eastern China from 2016 to 2021 and the impact of epidemic closure and control measures on regional and provincial PM_2.5 concentrations. We find that PM_2.5 concentrations in Mid-Eastern China during these years is mainly characterized by "north-south superiority and central inferiority", seasonal differences are evident, with the highest in winter, the second highest in autumn and the lowest in summer, and a gradual decrease in overall concentration during the year. According to our experimental results, the annual average PM_2.5 concentration decreases by 3.07 % in 2020, and decreases by 24.53 % during the shutdown period, which is probably caused by China's epidemic control measures. At the same time, some provinces with a large share of secondary industry see PM_2.5 concentrations drop by more than 30 %. By 2021, PM_2.5 concentrations rebound slightly, rising by 10 % in most provinces.

Collapse

Spatio-temporal air quality analysis and PM2.5 prediction over Hyderabad City, India using artificial intelligence techniques. ECOL INFORM 2023. [DOI: 10.1016/j.ecoinf.2023.102067] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/18/2023]

Islam ARMT, Al Awadh M, Mallick J, Pal SC, Chakraborty R, Fattah MA, Ghose B, Kakoli MKA, Islam MA, Naqvi HR, Bilal M, Elbeltagi A. Estimating ground-level PM_2.5 using subset regression model and machine learning algorithms in Asian megacity, Dhaka, Bangladesh. AIR QUALITY, ATMOSPHERE, & HEALTH 2023;16:1117-1139. [PMID: 37303964 PMCID: PMC9961308 DOI: 10.1007/s11869-023-01329-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/19/2022] [Accepted: 02/16/2023] [Indexed: 06/13/2023]

Abstract

Fine particulate matter (PM2.5) has become a prominent pollutant due to rapid economic development, urbanization, industrialization, and transport activities, which has serious adverse effects on human health and the environment. Many studies have employed traditional statistical models and remote-sensing technologies to estimate PM2.5 concentrations. However, statistical models have shown inconsistency in PM2.5 concentration predictions, while machine learning algorithms have excellent predictive capacity, but little research has been done on the complementary advantages of diverse approaches. The present study proposed the best subset regression model and machine learning approaches, including random tree, additive regression, reduced error pruning tree, and random subspace, to estimate the ground-level PM2.5 concentrations over Dhaka. This study used advanced machine learning algorithms to measure the effects of meteorological factors and air pollutants (NOX, SO2, CO, and O3) on the dynamics of PM2.5 in Dhaka from 2012 to 2020. Results showed that the best subset regression model was well-performed for forecasting PM2.5 concentrations for all sites based on the integration of precipitation, relative humidity, temperature, wind speed, SO2, NOX, and O3. Precipitation, relative humidity, and temperature have negative correlations with PM2.5. The concentration levels of pollutants are much higher at the beginning and end of the year. Random subspace is the optimal model for estimating PM2.5 because it has the least statistical error metrics compared to other models. This study suggests ensemble learning models to estimate PM2.5 concentrations. This study will help quantify ground-level PM2.5 concentration exposure and recommend regional government actions to prevent and regulate PM2.5 air pollution.

Supplementary Information

The online version contains supplementary material available at 10.1007/s11869-023-01329-w.

Collapse

Cha GW, Choi SH, Hong WH, Park CW. Developing a Prediction Model of Demolition-Waste Generation-Rate via Principal Component Analysis. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2023;20:3159. [PMID: 36833851 PMCID: PMC9968033 DOI: 10.3390/ijerph20043159] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/13/2023] [Revised: 02/08/2023] [Accepted: 02/09/2023] [Indexed: 06/18/2023]

Bagheri H. Using deep ensemble forest for high-resolution mapping of PM2.5 from MODIS MAIAC AOD in Tehran, Iran. ENVIRONMENTAL MONITORING AND ASSESSMENT 2023;195:377. [PMID: 36757448 DOI: 10.1007/s10661-023-10951-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/03/2022] [Accepted: 01/20/2023] [Indexed: 06/18/2023]

Hikouei IS, Eshleman KN, Saharjo BH, Graham LLB, Applegate G, Cochrane MA. Using machine learning algorithms to predict groundwater levels in Indonesian tropical peatlands. THE SCIENCE OF THE TOTAL ENVIRONMENT 2023;857:159701. [PMID: 36306856 DOI: 10.1016/j.scitotenv.2022.159701] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/23/2022] [Revised: 09/12/2022] [Accepted: 10/20/2022] [Indexed: 06/16/2023]

Fan K, Dhammapala R, Harrington K, Lamb B, Lee Y. Machine learning-based ozone and PM2.5 forecasting: Application to multiple AQS sites in the Pacific Northwest. Front Big Data 2023;6:1124148. [PMID: 36910164 PMCID: PMC9999009 DOI: 10.3389/fdata.2023.1124148] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2022] [Accepted: 02/06/2023] [Indexed: 03/14/2023] Open

Abstract

Air quality in the Pacific Northwest (PNW) of the U.S has generally been good in recent years, but unhealthy events were observed due to wildfires in summer or wood burning in winter. The current air quality forecasting system, which uses chemical transport models (CTMs), has had difficulty forecasting these unhealthy air quality events in the PNW. We developed a machine learning (ML) based forecasting system, which consists of two components, ML1 (random forecast classifiers and multiple linear regression models) and ML2 (two-phase random forest regression model). Our previous study showed that the ML system provides reliable forecasts of O₃ at a single monitoring site in Kennewick, WA. In this paper, we expand the ML forecasting system to predict both O₃ in the wildfire season and PM2.5 in wildfire and cold seasons at all available monitoring sites in the PNW during 2017-2020, and evaluate our ML forecasts against the existing operational CTM-based forecasts. For O₃, both ML1 and ML2 are used to achieve the best forecasts, which was the case in our previous study: ML2 performs better overall (R² = 0.79), especially for low-O₃ events, while ML1 correctly captures more high-O₃ events. Compared to the CTM-based forecast, our O₃ ML forecasts reduce the normalized mean bias (NMB) from 7.6 to 2.6% and normalized mean error (NME) from 18 to 12% when evaluating against the observation. For PM2.5, ML2 performs the best and thus is used for the final forecasts. Compared to the CTM-based PM2.5, ML2 clearly improves PM2.5 forecasts for both wildfire season (May to September) and cold season (November to February): ML2 reduces NMB (-27 to 7.9% for wildfire season; 3.4 to 2.2% for cold season) and NME (59 to 41% for wildfires season; 67 to 28% for cold season) significantly and captures more high-PM2.5 events correctly. Our ML air quality forecast system requires fewer computing resources and fewer input datasets, yet it provides more reliable forecasts than (if not, comparable to) the CTM-based forecast. It demonstrates that our ML system is a low-cost, reliable air quality forecasting system that can support regional/local air quality management.

Collapse

Karimian H, Li Y, Chen Y, Wang Z. Evaluation of different machine learning approaches and aerosol optical depth in PM_2.5 prediction. ENVIRONMENTAL RESEARCH 2023;216:114465. [PMID: 36241075 DOI: 10.1016/j.envres.2022.114465] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/01/2022] [Revised: 09/11/2022] [Accepted: 09/27/2022] [Indexed: 06/16/2023]

Cha GW, Choi SH, Hong WH, Park CW. Development of Machine Learning Model for Prediction of Demolition Waste Generation Rate of Buildings in Redevelopment Areas. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2022;20:107. [PMID: 36612429 PMCID: PMC9819715 DOI: 10.3390/ijerph20010107] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/24/2022] [Revised: 12/14/2022] [Accepted: 12/17/2022] [Indexed: 06/17/2023]

A Review on Pollution Treatment in Cement Industrial Areas: From Prevention Techniques to Python-Based Monitoring and Controlling Models. Processes (Basel) 2022. [DOI: 10.3390/pr10122682] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open

Fallah-Shorshani M, Yin X, McConnell R, Fruin S, Franklin M. Estimating traffic noise over a large urban area: An evaluation of methods. ENVIRONMENT INTERNATIONAL 2022;170:107583. [PMID: 36272254 DOI: 10.1016/j.envint.2022.107583] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/14/2022] [Revised: 09/29/2022] [Accepted: 10/11/2022] [Indexed: 06/16/2023]

Tella A, Balogun AL. GIS-based air quality modelling: spatial prediction of PM10 for Selangor State, Malaysia using machine learning algorithms. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2022;29:86109-86125. [PMID: 34533750 DOI: 10.1007/s11356-021-16150-0] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/11/2021] [Accepted: 08/20/2021] [Indexed: 06/13/2023]

Abstract

Rapid urbanization has caused severe deterioration of air quality globally, leading to increased hospitalization and premature deaths. Therefore, accurate prediction of air quality is crucial for mitigation planning to support urban sustainability and resilience. Although some studies have predicted air pollutants such as particulate matter (PM) using machine learning algorithms (MLAs), there is a paucity of studies on spatial hazard assessment with respect to the air quality index (AQI). Incorporating PM in AQI studies is crucial because of its easily inhalable micro-size which has adverse impacts on ecology, environment, and human health. Accurate and timely prediction of the air quality index can ensure adequate intervention to aid air quality management. Therefore, this study undertakes a spatial hazard assessment of the air quality index using particulate matter with a diameter of 10 μm or lesser (PM10) in Selangor, Malaysia, by developing four machine learning models: eXtreme Gradient Boosting (XGBoost), random forest (RF), K-nearest neighbour (KNN), and Naive Bayes (NB). Spatially processed data such as NDVI, SAVI, BU, LST, Ws, slope, elevation, and road density was used for the modelling. The model was trained with 70% of the dataset, while 30% was used for cross-validation. Results showed that XGBoost has the highest overall accuracy and precision of 0.989 and 0.995, followed by random forest (0.989, 0.993), K-nearest neighbour (0.987, 0.984), and Naive Bayes (0.917, 0.922), respectively. The spatial air quality maps were generated by integrating the geographical information system (GIS) with the four MLAs, which correlated with Malaysia's air pollution index. The maps indicate that air quality in Selangor is satisfactory and posed no threats to health. Nevertheless, the two algorithms with the best performance (XGBoost and RF) indicate that a high percentage of the air quality is moderate. The study concludes that successful air pollution management policies such as green infrastructure practice, improvement of energy efficiency, and restrictions on heavy-duty vehicles can be adopted in Selangor and other Southeast Asian cities to prevent deterioration of air quality in the future.

Collapse

Xu Y, Zhang X, Li H, Zheng H, Zhang J, Olsen MS, Varshney RK, Prasanna BM, Qian Q. Smart breeding driven by big data, artificial intelligence, and integrated genomic-enviromic prediction. MOLECULAR PLANT 2022;15:1664-1695. [PMID: 36081348 DOI: 10.1016/j.molp.2022.09.001] [Citation(s) in RCA: 51] [Impact Index Per Article: 25.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/04/2022] [Revised: 08/20/2022] [Accepted: 09/02/2022] [Indexed: 05/12/2023]

Abstract

The first paradigm of plant breeding involves direct selection-based phenotypic observation, followed by predictive breeding using statistical models for quantitative traits constructed based on genetic experimental design and, more recently, by incorporation of molecular marker genotypes. However, plant performance or phenotype (P) is determined by the combined effects of genotype (G), envirotype (E), and genotype by environment interaction (GEI). Phenotypes can be predicted more precisely by training a model using data collected from multiple sources, including spatiotemporal omics (genomics, phenomics, and enviromics across time and space). Integration of 3D information profiles (G-P-E), each with multidimensionality, provides predictive breeding with both tremendous opportunities and great challenges. Here, we first review innovative technologies for predictive breeding. We then evaluate multidimensional information profiles that can be integrated with a predictive breeding strategy, particularly envirotypic data, which have largely been neglected in data collection and are nearly untouched in model construction. We propose a smart breeding scheme, integrated genomic-enviromic prediction (iGEP), as an extension of genomic prediction, using integrated multiomics information, big data technology, and artificial intelligence (mainly focused on machine and deep learning). We discuss how to implement iGEP, including spatiotemporal models, environmental indices, factorial and spatiotemporal structure of plant breeding data, and cross-species prediction. A strategy is then proposed for prediction-based crop redesign at both the macro (individual, population, and species) and micro (gene, metabolism, and network) scales. Finally, we provide perspectives on translating smart breeding into genetic gain through integrative breeding platforms and open-source breeding initiatives. We call for coordinated efforts in smart breeding through iGEP, institutional partnerships, and innovative technological support.

Collapse

Deep matrix factorization models for estimation of missing data in a low-cost sensor network to measure air quality. ECOL INFORM 2022. [DOI: 10.1016/j.ecoinf.2022.101775] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]

Li J, Kang CM, Wolfson JM, Alahmad B, Al-Hemoud A, Garshick E, Koutrakis P. Estimation of fine particulate matter in an arid area from visibility based on machine learning. JOURNAL OF EXPOSURE SCIENCE & ENVIRONMENTAL EPIDEMIOLOGY 2022;32:926-931. [PMID: 36151455 PMCID: PMC9742157 DOI: 10.1038/s41370-022-00480-3] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/24/2022] [Revised: 09/12/2022] [Accepted: 09/13/2022] [Indexed: 05/04/2023]

Abstract

BACKGROUND

The absence of air pollution monitoring networks makes it difficult to assess historical fine particulate matter (PM_2.5) exposures for countries in the areas, such as Kuwait, which are severe impacted by desert dust and anthropogenic pollution.

OBJECTIVE

We constructed an ensemble machine learning model to predict daily PM_2.5 concentrations for regions lack of PM_2.5 observations.

METHODS

The model was constructed based on daily PM_2.5, visibility, and other meteorological data collected at two sites in Kuwait. Then, our model was applied to predict the daily level of PM_2.5 concentrations for eight airports located in Kuwait and Iraq from 2013 to 2020.

RESULTS

As compared to traditional statistic models, the proposed machine learning methods improved the accuracy in using visibility to predict daily PM_2.5 concentrations with a cross-validation R² of 0.68. The predicted level of daily PM_2.5 concentrations were consistent with previous measurements. The predicted average yearly PM_2.5 concentration for the eight stations is 50.65 µg/m³. For all stations, the monthly average PM_2.5 concentrations reached their maximum in July and their minimum in November.

SIGNIFICANCE

These findings make it possible to retrospectively estimate daily PM_2.5 exposures using the large-scale databases of historical visibility in regions with few particulate matter monitoring stations.

IMPACT STATEMENT

The scarcity of air pollution ground monitoring networks makes it difficult to assess historical fine particulate matter exposures for countries in arid areas such as Kuwait. Visibility is closely related to atmospheric particulate matter concentrations and historical airport visibility records are commonly available in most countries. Our model make it possible to retrospectively estimate daily PM_2.5 exposures using the large-scale databases of historical visibility in arid regions with few particulate matter ground monitoring stations. The product of such models can be critical for environmental risk assessments and population health studies.

Collapse

Xu H, Zhang A, Xu X, Li P, Ji Y. Prediction of Particulate Concentration Based on Correlation Analysis and a Bi-GRU Model. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2022;19:13266. [PMID: 36293843 PMCID: PMC9603264 DOI: 10.3390/ijerph192013266] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/23/2022] [Revised: 10/08/2022] [Accepted: 10/11/2022] [Indexed: 06/16/2023]

Childs ML, Li J, Wen J, Heft-Neal S, Driscoll A, Wang S, Gould CF, Qiu M, Burney J, Burke M. Daily Local-Level Estimates of Ambient Wildfire Smoke PM_2.5 for the Contiguous US. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2022;56:13607-13621. [PMID: 36134580 DOI: 10.1021/acs.est.2c02934] [Citation(s) in RCA: 42] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]

Mapping Dominant Tree Species of German Forests. REMOTE SENSING 2022. [DOI: 10.3390/rs14143330] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Khan N, Kamaruddin MA, Ullah Sheikh U, Zawawi MH, Yusup Y, Bakht MP, Mohamed Noor N. Prediction of Oil Palm Yield Using Machine Learning in the Perspective of Fluctuating Weather and Soil Moisture Conditions: Evaluation of a Generic Workflow. PLANTS (BASEL, SWITZERLAND) 2022;11:1697. [PMID: 35807648 PMCID: PMC9268852 DOI: 10.3390/plants11131697] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/11/2022] [Revised: 06/20/2022] [Accepted: 06/24/2022] [Indexed: 11/19/2022]

Prediction of Air Pollutant Concentrations via RANDOM Forest Regressor Coupled with Uncertainty Analysis—A Case Study in Ningxia. ATMOSPHERE 2022. [DOI: 10.3390/atmos13060960] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]

Environmental Pollution Analysis and Impact Study-A Case Study for the Salton Sea in California. ATMOSPHERE 2022. [DOI: 10.3390/atmos13060914] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]

Xia W, Jiang Y, Chen X, Zhao R. Application of machine learning algorithms in municipal solid waste management: A mini review. WASTE MANAGEMENT & RESEARCH : THE JOURNAL OF THE INTERNATIONAL SOLID WASTES AND PUBLIC CLEANSING ASSOCIATION, ISWA 2022;40:609-624. [PMID: 34269157 PMCID: PMC9016669 DOI: 10.1177/0734242x211033716] [Citation(s) in RCA: 28] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/05/2023]

Wijnands JS, Nice KA, Seneviratne S, Thompson J, Stevenson M. The impact of the COVID-19 pandemic on air pollution: A global assessment using machine learning techniques. ATMOSPHERIC POLLUTION RESEARCH 2022;13:101438. [PMID: 35506000 PMCID: PMC9047632 DOI: 10.1016/j.apr.2022.101438] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/12/2021] [Revised: 04/21/2022] [Accepted: 04/22/2022] [Indexed: 06/14/2023]

Abstract

In response to the COVID-19 pandemic, most countries implemented public health ordinances that resulted in restricted mobility and a resultant change in air quality. This has provided an opportunity to quantify the extent to which carbon-based transport and industrial activity affect air quality. However, quantification of these complex effects has proven to be difficult, depending on the stringency of restrictions, country-specific emission source profiles, long-term trends and meteorological effects on atmospheric chemistry, emission levels and in-flow from nearby countries. In this study, confounding factors were disentangled for a direct comparison of pandemic-related reductions in absolute pollutions levels, globally. The non-linear relationships between atmospheric processes and daily ground-level NO2 , PM10, PM2.5 and O3 measurements were captured in city- and pollutant-specific XGBoost models for over 700 cities, adjusting for weather, seasonality and trends. City-level modelling allowed adaptation to the distinct topography, urban morphology, climate and atmospheric conditions for each city, individually, as the weather variables that were most predictive varied across cities. Pollution forecasts for 2020 in absence of a pandemic were generated based on weather and formed an ensemble for country-level pollution reductions. Findings were robust to modelling assumptions and consistent with various published case studies. NO2 reduced most in China, Europe and India, following severe government restrictions as part of the initial lockdowns. Reductions were highly correlated with changes in mobility levels, especially trips to transit stations, workplaces, retail and recreation venues. Further, NO2 did not fully revert to pre-pandemic levels in 2020. Ambient PM2.5 pollution, which has severe adverse health consequences, reduced most in China and India. Since positive health effects could be offset to some extent by prolonged exposure to indoor pollution, alternative transport initiatives could prove to be an important pathway towards better health outcomes in these countries. Increased O3 levels during initial lockdowns have been documented widely. However, our analyses also found a subsequent reduction in O3 for many countries below what was expected based on meteorological conditions during summer months (e.g., China, United Kingdom, France, Germany, Poland, Turkey). The effects in periods with high O3 levels are especially important for the development of effective mitigation strategies to improve health outcomes.

Collapse

Zhang P, Yang L, Ma W, Wang N, Wen F, Liu Q. Spatiotemporal estimation of the PM_2.5 concentration and human health risks combining the three-dimensional landscape pattern index and machine learning methods to optimize land use regression modeling in Shaanxi, China. ENVIRONMENTAL RESEARCH 2022;208:112759. [PMID: 35077716 DOI: 10.1016/j.envres.2022.112759] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/31/2021] [Revised: 01/05/2022] [Accepted: 01/16/2022] [Indexed: 06/14/2023]

Abstract

PM_2.5 pollution endangers human health and urban sustainable development. Land use regression (LUR) is one of the most important methods to reveal the temporal and spatial heterogeneity of PM_2.5, and the introduction of characteristic variables of geographical factors and the improvement of model construction methods are important research directions for its optimization. However, the complex non-linear correlation between PM_2.5 and influencing indicators is always unrecognized by the traditional regression model. The two-dimensional landscape pattern index is difficult to reflect the real information of the surface, and the research accuracy cannot meet the requirements. As such, a novel integrated three-dimensional landscape pattern index (TDLPI) and machine learning extreme gradient boosting (XGBOOST) improved LUR model (LTX) are developed to estimate the spatiotemporal heterogeneity in the fine particle concentration in Shaanxi, China, and health risks of exposure and inhalation of PM_2.5 were explored. The LTX model performed well with R² = 0.88, RMSE of 8.73 μg/m³ and MAE of 5.85 μg/m³. Our findings suggest that integrated three-dimensional landscape pattern information and XGBOOST approaches can accurately estimate annual and seasonal variations of PM_2.5 pollution The Guanzhong Plain and northern Shaanxi always feature high PM_2.5 values, which exhibit similar distribution trends to those of the observed PM_2.5 pollution. This study demonstrated the outstanding performance of the LTX model, which outperforms most models in past researches. On the whole, LTX approach is reliable and can improve the accuracy of pollutant concentration prediction. The health risks of human exposure to fine particles are relatively high in winter. Central part is a high health risk area, while northern area is low. Our study provides a new method for atmospheric pollutants assessing, which is important for LUR model optimization, high-precision PM_2.5 pollution prediction and landscape pattern planning. These results can also contribute to human health exposure risks and future epidemiological studies of air pollution.

Collapse

Estimating Hourly Surface Solar Irradiance from GK2A/AMI Data Using Machine Learning Approach around Korea. REMOTE SENSING 2022. [DOI: 10.3390/rs14081840] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]

Identification of Smartwatch-Collected Lifelog Variables Affecting Body Mass Index in Middle-Aged People Using Regression Machine Learning Algorithms and SHapley Additive Explanations. APPLIED SCIENCES-BASEL 2022. [DOI: 10.3390/app12083819] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]

Gill M, Anderson R, Hu H, Bennamoun M, Petereit J, Valliyodan B, Nguyen HT, Batley J, Bayer PE, Edwards D. Machine learning models outperform deep learning models, provide interpretation and facilitate feature selection for soybean trait prediction. BMC PLANT BIOLOGY 2022;22:180. [PMID: 35395721 PMCID: PMC8991976 DOI: 10.1186/s12870-022-03559-z] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/30/2021] [Accepted: 03/21/2022] [Indexed: 05/26/2023]