1
|
Venkatraman Jagatha J, Schneider C, Sauter T. Parsimonious Random-Forest-Based Land-Use Regression Model Using Particulate Matter Sensors in Berlin, Germany. SENSORS (BASEL, SWITZERLAND) 2024; 24:4193. [PMID: 39000970 PMCID: PMC11244214 DOI: 10.3390/s24134193] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/16/2024] [Revised: 06/07/2024] [Accepted: 06/21/2024] [Indexed: 07/16/2024]
Abstract
Machine learning (ML) methods are widely used in particulate matter prediction modelling, especially through use of air quality sensor data. Despite their advantages, these methods' black-box nature obscures the understanding of how a prediction has been made. Major issues with these types of models include the data quality and computational intensity. In this study, we employed feature selection methods using recursive feature elimination and global sensitivity analysis for a random-forest (RF)-based land-use regression model developed for the city of Berlin, Germany. Land-use-based predictors, including local climate zones, leaf area index, daily traffic volume, population density, building types, building heights, and street types were used to create a baseline RF model. Five additional models, three using recursive feature elimination method and two using a Sobol-based global sensitivity analysis (GSA), were implemented, and their performance was compared against that of the baseline RF model. The predictors that had a large effect on the prediction as determined using both the methods are discussed. Through feature elimination, the number of predictors were reduced from 220 in the baseline model to eight in the parsimonious models without sacrificing model performance. The model metrics were compared, which showed that the parsimonious_GSA-based model performs better than does the baseline model and reduces the mean absolute error (MAE) from 8.69 µg/m3 to 3.6 µg/m3 and the root mean squared error (RMSE) from 9.86 µg/m3 to 4.23 µg/m3 when applying the trained model to reference station data. The better performance of the GSA_parsimonious model is made possible by the curtailment of the uncertainties propagated through the model via the reduction of multicollinear and redundant predictors. The parsimonious model validated against reference stations was able to predict the PM2.5 concentrations with an MAE of less than 5 µg/m3 for 10 out of 12 locations. The GSA_parsimonious performed best in all model metrics and improved the R2 from 3% in the baseline model to 17%. However, the predictions exhibited a degree of uncertainty, making it unreliable for regional scale modelling. The GSA_parsimonious model can nevertheless be adapted to local scales to highlight the land-use parameters that are indicative of PM2.5 concentrations in Berlin. Overall, population density, leaf area index, and traffic volume are the major predictors of PM2.5, while building type and local climate zones are the less significant predictors. Feature selection based on sensitivity analysis has a large impact on the model performance. Optimising models through sensitivity analysis can enhance the interpretability of the model dynamics and potentially reduce computational costs and time when modelling is performed for larger areas.
Collapse
Affiliation(s)
| | - Christoph Schneider
- Geography Department, Humboldt-Universität zu Berlin, Unter den Linden 6, 10099 Berlin, Germany
| | - Tobias Sauter
- Geography Department, Humboldt-Universität zu Berlin, Unter den Linden 6, 10099 Berlin, Germany
| |
Collapse
|
2
|
Sharma GK, Ghuge VV. How urban growth dynamics impact the air quality? A case of eight Indian metropolitan cities. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024; 930:172399. [PMID: 38631640 DOI: 10.1016/j.scitotenv.2024.172399] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/21/2024] [Revised: 04/02/2024] [Accepted: 04/09/2024] [Indexed: 04/19/2024]
Abstract
Air pollution is a matter of great significance that confronts the sustainable progress of urban areas. Against India's swift urbanization, several urban areas exhibit the coexistence of escalating populace and expansion in developed regions alongside extensive spatial heterogeneity. The interaction mechanism between the growth of urban areas and the expansion of cities holds immense importance for the remediation of air pollution. Henceforth, the present investigation utilizes geographically weighted regression (GWR) to examine the influence of urban expansion and population growth on air quality. The examination will use a decade of data on the variation in PM2.5 levels from 2010 to 2020 in eight Indian metropolitan cities. The study's findings demonstrate a spatial heterogeneity between urban growth dynamics and air pollution levels. Urban growth and the expansion of cities demonstrate notable positive impacts on air quality, although the growth of infilling within expanding urban areas can significantly affect air quality. Given the unique trajectories of urban development in developing countries, this research provides many suggestions for urban administrators to foster sustainable urban growth.
Collapse
Affiliation(s)
- Gajender Kumar Sharma
- Department of Architecture & Planning, Visvesvaraya National Institute of Technology, Nagpur, India.
| | - Vidya V Ghuge
- Department of Architecture & Planning, Visvesvaraya National Institute of Technology, Nagpur, India.
| |
Collapse
|
3
|
Wu H, Guo B, Guo T, Pei L, Jing P, Wang Y, Ma X, Bai H, Wang Z, Xie T, Chen M. A study on identifying synergistic prevention and control regions for PM 2.5 and O 3 and exploring their spatiotemporal dynamic in China. ENVIRONMENTAL POLLUTION (BARKING, ESSEX : 1987) 2024; 341:122880. [PMID: 37944886 DOI: 10.1016/j.envpol.2023.122880] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/30/2023] [Revised: 10/18/2023] [Accepted: 11/04/2023] [Indexed: 11/12/2023]
Abstract
Air pollutants, notably ozone (O3) and fine particulate matter (PM2.5) give rise to evident adverse impacts on public health and the ecotope, prompting extensive global apprehension. Though PM2.5 has been effectively mitigated in China, O3 has been emerging as a primary pollutant, especially in summer. Currently, alleviating PM2.5 and O3 synergistically faces huge challenges. The synergistic prevention and control (SPC) regions of PM2.5 and O3 and their spatiotemporal patterns were still unclear. To address the above issues, this study utilized ground monitoring station data, meteorological data, and auxiliary data to predict the China High-Resolution O3 Dataset (CHROD) via a two-stage model. Furthermore, SPC regions were identified based on a spatial overlay analysis using a Geographic Information System (GIS). The standard deviation ellipse was employed to investigate the spatiotemporal dynamic characteristics of SPC regions. Some outcomes were obtained. The two-stage model significantly improved the accuracy of O3 concentration prediction with acceptable R2 (0.86), and our CHROD presented higher spatiotemporal resolution compared with existing products. SPC regions exhibited significant spatiotemporal variations during the Blue Sky Protection Campaign (BSPC) in China. SPC regions were dominant in spring and autumn, and O3-controlled and PM2.5-dominated zones were detected in summer and winter, respectively. SPC regions were primarily located in the northwest, north, east, and central regions of China, specifically in the Beijing-Tianjin-Hebei urban agglomeration (BTH), Shanxi, Shaanxi, Shandong, Henan, Jiangsu, Xinjiang, and Anhui provinces. The gravity center of SPC regions was distributed in the BTH in winter, and in Xinjiang during spring, summer, and autumn. This study can supply scientific references for the collaborative management of PM2.5 and O3.
Collapse
Affiliation(s)
- Haojie Wu
- College of Geomatics, Xi'an University of Science and Technology, Xi'an, Shaanxi, 710054, China; Shaanxi Key Laboratory of Environmental Monitoring and Forewarning of Trace Pollutants, Xi'an, Shaanxi, 710043, China
| | - Bin Guo
- College of Geomatics, Xi'an University of Science and Technology, Xi'an, Shaanxi, 710054, China.
| | - Tengyue Guo
- Department of Geological Engineering, Qinghai University, Xining, Qinghai, 810016, China
| | - Lin Pei
- School of Exercise and Health Sciences, Xi'an Physical Education University, Xi'an, Shaanxi, 710068, China
| | - Peiqing Jing
- State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Wuhan, Hubei, 430072, China
| | - Yan Wang
- School of Geography and Tourism, Shaanxi Normal University, Xi'an, Shaanxi, 710119, China
| | - Xuying Ma
- College of Geomatics, Xi'an University of Science and Technology, Xi'an, Shaanxi, 710054, China
| | - Haorui Bai
- College of Geomatics, Xi'an University of Science and Technology, Xi'an, Shaanxi, 710054, China
| | - Zheng Wang
- College of Geomatics, Xi'an University of Science and Technology, Xi'an, Shaanxi, 710054, China
| | - Tingting Xie
- College of Geomatics, Xi'an University of Science and Technology, Xi'an, Shaanxi, 710054, China
| | - Miaoyi Chen
- College of Geomatics, Xi'an University of Science and Technology, Xi'an, Shaanxi, 710054, China
| |
Collapse
|
4
|
Robinson DL, Goodman N, Vardoulakis S. Five Years of Accurate PM 2.5 Measurements Demonstrate the Value of Low-Cost PurpleAir Monitors in Areas Affected by Woodsmoke. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2023; 20:7127. [PMID: 38063557 PMCID: PMC10706150 DOI: 10.3390/ijerph20237127] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/29/2023] [Revised: 11/13/2023] [Accepted: 11/22/2023] [Indexed: 12/18/2023]
Abstract
Low-cost optical sensors are used in many countries to monitor fine particulate (PM2.5) air pollution, especially in cities and towns with large spatial and temporal variation due to woodsmoke pollution. Previous peer-reviewed research derived calibration equations for PurpleAir (PA) sensors by co-locating PA units at a government regulatory air pollution monitoring site in Armidale, NSW, Australia, a town where woodsmoke is the main source of PM2.5 pollution. The calibrations enabled the PA sensors to provide accurate estimates of PM2.5 that were almost identical to those from the NSW Government reference equipment and allowed the high levels of wintertime PM2.5 pollution and the substantial spatial and temporal variation from wood heaters to be quantified, as well as the estimated costs of premature mortality exceeding $10,000 per wood heater per year. This follow-up study evaluates eight PA sensors co-located at the same government site to check their accuracy over the following four years, using either the original calibrations, the default woodsmoke equation on the PA website for uncalibrated sensors, or the ALT-34 conversion equation (see text). Minimal calibration drift was observed, with year-round correlations, r = 0.98 ± 0.01, and root mean square error (RMSE) = 2.0 μg/m3 for daily average PA PM2.5 vs. reference equipment. The utitilty of the PA sensors without prior calibration at locations affected by woodsmoke was also demonstrated by the year-round correlations of 0.94 and low RMSE between PA (woodsmoke and ALT-34 conversions) and reference PM2.5 at the NSW Government monitoring sites in Orange and Gunnedah. To ensure the reliability of the PA data, basic quality checks are recommended, including the agreement of the two laser sensors in each PA unit and removing any transient spikes affecting only one sensor. In Armidale, from 2019 to 2022, the continuing high spatial variation in the PM2.5 levels observed during the colder months was many times higher than any discrepancies between the PA and reference measurements. Particularly unhealthy PM2.5 levels were noted in southern and eastern central Armidale. The measurements inside two older weatherboard houses in Armidale showed that high outdoor pollution resulted in high pollution inside the houses within 1-2 h. Daily average PM2.5 concentrations available on the PA website allow air pollution at different sites across regions (and countries) to be compared. Such comparisons revealed major elevations in PA PM2.5 at Gunnedah, Orange, Monash (Australian Capital Territory), and Christchurch (New Zealand) during the wood heating season. The data for Gunnedah and Muswellbrook suggest a slight underestimation of PM2.5 at other times of the year when there are proportionately more dust and other larger particles. A network of appropriately calibrated PA sensors can provide valuable information on the spatial and temporal variation in the air pollution that can be used to identify pollution hotspots, improve estimates of population exposure and health costs, and inform public policy.
Collapse
Affiliation(s)
- Dorothy L. Robinson
- Healthy Environments and Lives (HEAL) National Research Network, Canberra, ACT 2601, Australia; (N.G.); (S.V.)
| | - Nigel Goodman
- Healthy Environments and Lives (HEAL) National Research Network, Canberra, ACT 2601, Australia; (N.G.); (S.V.)
- College of Health and Medicine, The Australian National University, Canberra, ACT 2601, Australia
| | - Sotiris Vardoulakis
- Healthy Environments and Lives (HEAL) National Research Network, Canberra, ACT 2601, Australia; (N.G.); (S.V.)
- College of Health and Medicine, The Australian National University, Canberra, ACT 2601, Australia
| |
Collapse
|
5
|
Liang L, Daniels J, Biancardi M, Zhou Y. Reconstructing aerosol optical depth using spatiotemporal Long Short-Term Memory convolutional autoencoder. Sci Data 2023; 10:842. [PMID: 38036585 PMCID: PMC10689425 DOI: 10.1038/s41597-023-02696-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2023] [Accepted: 10/27/2023] [Indexed: 12/02/2023] Open
Abstract
Aerosol Optical Depth (AOD) is a crucial atmospheric parameter in comprehending climate change, air quality, and its impacts on human health. Satellites offer exceptional spatiotemporal AOD data continuity. However, data quality is influenced by various atmospheric, landscape, and instrumental factors, resulting in data gaps. This study presents a new solution to this challenge by providing a long-term, gapless satellite-derived AOD dataset for Texas from 2010 to 2022, utilizing Moderate Resolution Imaging Spectroradiometer (MODIS) Multi-angle Implementation of Atmospheric Correction (MAIAC) products. Missing AOD data were reconstructed using a spatiotemporal Long Short-Term Memory (LSTM) convolutional autoencoder. Evaluation against an independent test dataset demonstrated the model's effectiveness, with an average Root Mean Square Error (RMSE) of 0.017 and an R2 value of 0.941. Validation against the ground-based AERONET dataset indicated satisfactory agreement, with RMSE values ranging from 0.052 to 0.067. The reconstructed AOD data are available at daily, monthly, quarterly, and yearly scales, providing a valuable resource to advance understanding of the Earth's atmosphere and support decision-making concerning air quality and public health.
Collapse
Affiliation(s)
- Lu Liang
- Department of Geography and the Environment, University of North Texas, Denton, TX, 76203, USA.
| | - Jacob Daniels
- Department of Electrical Engineering, University of North Texas, Denton, TX, 76203, USA
| | - Michael Biancardi
- Department of Computer Science and Engineering, University of North Texas, Denton, TX, 76203, USA
| | - Yuye Zhou
- School of Architecture and Urban Planning, Nanjing University, Nanjing, 210093, China
| |
Collapse
|
6
|
Ray PP. AI in Sensor Research: A Reality Check and the Underestimated Potential of ChatGPT. ACS Sens 2023; 8:2412-2414. [PMID: 37318550 DOI: 10.1021/acssensors.3c01045] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
|
7
|
Kosmopoulos G, Salamalikis V, Wilbert S, Zarzalejo LF, Hanrieder N, Karatzas S, Kazantzidis A. Investigating the Sensitivity of Low-Cost Sensors in Measuring Particle Number Concentrations across Diverse Atmospheric Conditions in Greece and Spain. SENSORS (BASEL, SWITZERLAND) 2023; 23:6541. [PMID: 37514835 PMCID: PMC10383866 DOI: 10.3390/s23146541] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/26/2023] [Revised: 07/10/2023] [Accepted: 07/18/2023] [Indexed: 07/30/2023]
Abstract
Low-cost sensors (LCSs) for particulate matter (PM) concentrations have attracted the interest of researchers, supplementing their efforts to quantify PM in higher spatiotemporal resolution. The precision of PM mass concentration measurements from PMS 5003 sensors has been widely documented, though limited information is available regarding their size selectivity and number concentration measurement accuracy. In this work, PMS 5003 sensors, along with a Federal Referral Methods (FRM) sampler (Grimm spectrometer), were deployed across three sites with different atmospheric profiles, an urban (Germanou) and a background (UPat) site in Patras (Greece), and a semi-arid site in Almería (Spain, PSA). The LCSs particle number concentration measurements were investigated for different size bins. Findings for particles with diameter between 0.3 and 10 μm suggest that particle size significantly affected the LCSs' response. The LCSs could accurately detect number concentrations for particles smaller than 1 μm in the urban (R2 = 0.9) and background sites (R2 = 0.92), while a modest correlation was found with the reference instrument in the semi-arid area (R2 = 0.69). However, their performance was rather poor (R2 < 0.31) for coarser aerosol fractions at all sites. Moreover, during periods when coarse particles were dominant, i.e., dust events, PMS 5003 sensors were unable to report accurate number distributions (R2 values < 0.47) and systematically underestimated particle number concentrations. The results indicate that several questions arise concerning the sensors' capabilities to estimate PM2.5 and PM10 concentrations, since their size distribution did not agree with the reference instruments.
Collapse
Affiliation(s)
- Georgios Kosmopoulos
- Laboratory of Atmospheric Physics, Department of Physics, University of Patras, GR 26500 Patras, Greece
| | | | - Stefan Wilbert
- Institute of Solar Research, German Aerospace Center (DLR), Paseo de Almería 73, 04001 Almería, Spain
| | - Luis F Zarzalejo
- Renewable Energy Division, CIEMAT Energy Department, Avenida Complutense, 40, 28040 Madrid, Spain
| | - Natalie Hanrieder
- Institute of Solar Research, German Aerospace Center (DLR), Paseo de Almería 73, 04001 Almería, Spain
| | - Stylianos Karatzas
- Civil Engineering Department, University of Patras, GR 26500 Patras, Greece
| | - Andreas Kazantzidis
- Laboratory of Atmospheric Physics, Department of Physics, University of Patras, GR 26500 Patras, Greece
| |
Collapse
|