1
|
Ndiaye A, Shen Y, Kyriakou K, Karssenberg D, Schmitz O, Flückiger B, Hoogh KD, Hoek G. Hourly land-use regression modeling for NO 2 and PM 2.5 in the Netherlands. ENVIRONMENTAL RESEARCH 2024; 256:119233. [PMID: 38802030 DOI: 10.1016/j.envres.2024.119233] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/27/2024] [Revised: 05/24/2024] [Accepted: 05/25/2024] [Indexed: 05/29/2024]
Abstract
Annual average land-use regression (LUR) models have been widely used to assess spatial patterns of air pollution exposures. However, they fail to capture diurnal variability in air pollution and consequently might result in biased dynamic exposure assessments. In this study we aimed to model average hourly concentrations for two major pollutants, NO2 and PM2.5, for the Netherlands using the LUR algorithm. We modelled the spatial variation of average hourly concentrations for the years 2016-2019 combined, for two seasons, and for two weekday types. Two modelling approaches were used, supervised linear regression (SLR) and random forest (RF). The potential predictors included population, road, land use, satellite retrievals, and chemical transport model pollution estimates variables with different buffer sizes. We also temporally adjusted hourly concentrations from a 2019 annual model using the hourly monitoring data, to compare its performance with the hourly modelling approach. The results showed that hourly NO2 models performed overall well (5-fold cross validation R2 = 0.50-0.78), while the PM2.5 performed moderately (5-fold cross validation R2 = 0.24-0.62). Both for NO2 and PM2.5 the warm season models performed worse than the cold season ones, and the weekends' worse than weekdays'. The performance of the RF and SLR models was similar for both pollutants. For both SLR and RF, variables with larger buffer sizes representing variation in background concentrations, were selected more often in the weekend models compared to the weekdays, and in the warm season compared to the cold one. Temporal adjustment of annual average models performed overall worse than both modelling approaches (NO2 hourly R2 = 0.35-0.70; PM2.5 hourly R2 = 0.01-0.15). The difference in model performance and selection of variables across hours, seasons, and weekday types documents the benefit to develop independent hourly models when matching it to hourly time activity data.
Collapse
Affiliation(s)
- Aisha Ndiaye
- Division of Environmental Epidemiology, Institute for Risk Assessment Sciences, Utrecht University, Pobox 80125, 3508, TC Utrecht, the Netherlands.
| | - Youchen Shen
- Division of Environmental Epidemiology, Institute for Risk Assessment Sciences, Utrecht University, Pobox 80125, 3508, TC Utrecht, the Netherlands
| | - Kalliopi Kyriakou
- Division of Environmental Epidemiology, Institute for Risk Assessment Sciences, Utrecht University, Pobox 80125, 3508, TC Utrecht, the Netherlands
| | - Derek Karssenberg
- Department of Physical Geography, Faculty of Geosciences, Utrecht University, Princetonlaan 8a, 3584 CB Utrecht, the Netherlands
| | - Oliver Schmitz
- Department of Physical Geography, Faculty of Geosciences, Utrecht University, Princetonlaan 8a, 3584 CB Utrecht, the Netherlands
| | - Benjamin Flückiger
- Swiss Tropical and Public Health Institute, Kreuzstrasse 2 CH-4123 Allschwil, Switzerland; University of Basel, Petersplatz 1, Postfach, 4001, Basel, Switzerland
| | - Kees de Hoogh
- Swiss Tropical and Public Health Institute, Kreuzstrasse 2 CH-4123 Allschwil, Switzerland; University of Basel, Petersplatz 1, Postfach, 4001, Basel, Switzerland
| | - Gerard Hoek
- Division of Environmental Epidemiology, Institute for Risk Assessment Sciences, Utrecht University, Pobox 80125, 3508, TC Utrecht, the Netherlands
| |
Collapse
|
2
|
Sun Z, Cui K, Qi X, Wang J, Han L, Gu X, Lu H. How do drunk-driving events escalate into drunk-driving crashes? An empirical analysis of Beijing from a spatiotemporal perspective. Int J Inj Contr Saf Promot 2024; 31:256-272. [PMID: 38279202 DOI: 10.1080/17457300.2023.2300459] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2023] [Accepted: 12/24/2023] [Indexed: 01/28/2024]
Abstract
Drunk-driving events often escalate into drunk-driving crashes, however, the contributing factors of this progression remain elusive. To mitigate the likelihood of crashes stemming from drunk-driving events, this paper introduces the notion of 'the severity of drunk-driving event' and examines the complex relationship between the severity and its contributing factors, considering spatiotemporal heterogeneity. The study utilizes a Geographically and Temporally Weighted Binary Logistic Regression (GTWBLR) model to conduct spatiotemporal analysis based on police-reported drunk-driving events in Beijing, China. The results show that most factors passed the non-stationary test, indicating their effects on the severity of drunk-driving event vary significantly across different spatial and temporal domains. Notably, during non-workday, drunk-driving events in northeast of Beijing are more likely to escalate into crashes. Furthermore, severe weather during winter in the northwest of Beijing is associated with high risk of drunk-driving crashes. Based on these insights, the authorities can strengthen drunk-driving checks in the northeast region of Beijing, particularly during non-workdays. And it is crucial to promptly clear accumulated snow on the roads during severe winter weather to improve road safety. These insights and recommendations are highly valuable for reducing the risk of drunk-driving crashes.
Collapse
Affiliation(s)
- Zhiyuan Sun
- Beijing Key Laboratory of Traffic Engineering, Beijing University of Technology, Beijing, China
| | - Keqi Cui
- Beijing Key Laboratory of Traffic Engineering, Beijing University of Technology, Beijing, China
| | - Xin Qi
- Beijing Key Laboratory of Traffic Engineering, Beijing University of Technology, Beijing, China
| | - Jianyu Wang
- Beijing Key Laboratory of General Aviation Technology, Beijing University of Civil Engineering and Architecture, Beijing, China
| | - Lu Han
- Beijing Key Laboratory of Traffic Engineering, Beijing University of Technology, Beijing, China
| | - Xin Gu
- Beijing Key Laboratory of Traffic Engineering, Beijing University of Technology, Beijing, China
| | - Huapu Lu
- Institute of Transportation Engineering, Tsinghua University, Beijing, China
| |
Collapse
|
3
|
Wang Y, Ni J, Xu K, Zhang H, Gong X, He C. Intricate synergistic effects between air pollution and carbon emission: An emerging evidence from China. ENVIRONMENTAL POLLUTION (BARKING, ESSEX : 1987) 2024; 349:123851. [PMID: 38527582 DOI: 10.1016/j.envpol.2024.123851] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/16/2023] [Revised: 02/29/2024] [Accepted: 03/22/2024] [Indexed: 03/27/2024]
Abstract
Due to global climate change and intensifying anthropogenic pollution, China confronts the dual challenge of controlling particulate matter 2.5 μm (PM2.5) pollution and reducing carbon emissions. Quantifying the characteristics of PM2.5 concentrations and CO2 emissions, as well as identifying the driving factors and synergistic effects of PM2.5 reduction and CO2 mitigation, are crucial steps in promoting sustainable urban development and achieving the Sustainable Development Goals (SDGs) in China. In this study, we selected 168 cities as our case-study, and quantified spatial characteristics of PM2.5 concentrations and CO2 emissions from 2015 to 2020 in China. Then we analyzed driving factors affecting the spatial heterogeneity of PM2.5 reduction and CO2 mitigation applying Multi-scale Geographically Weighted Regression (MGWR) model. By employing coupling coordination degree (CCD) model, we further detected the spatiotemporal evolution patterns of the synergistic effects between PM2.5 reduction and CO2 mitigation in key Chinese cities. The result showed that: (a) From 2015 to 2020, PM2.5 concentrations experienced a significant reduction from 59.78 μg/m3 to 49.83 μg/m3, while CO2 emissions increased from 44.88 × 106 t in 2015 to 45.77 × 106 t in 2020; (b) Green economy efficiency (gee), government attention (gover), and environmental regulation (envir) demonstrate the most pronounced synergistic effect on pollution reduction and carbon mitigation, with the drivers exhibiting obvious spatial heterogeneity; (c) The overall coupling coordination level of PM2.5 pollution and CO2 emissions in China dropped from 0.49 in 2015 to 0.46 in 2020, and the coupling coordination grade in northern cities was notably higher than that in southern cities. The result enhances our understanding of spatiotemporal patterns of synergistic effects between PM2.5 reduction and CO2 mitigation, and provides the theoretical basis for policy decision-making to realize pollution decrease and carbon neutral and regional environment governance.
Collapse
Affiliation(s)
- Yanwen Wang
- School of Economics and Management, China University of Geosciences, Wuhan, 430074, China
| | - Jinmian Ni
- College of Resources and Environment, Yangtze University, Wuhan, 430100, China; Hubei Key Laboratory of Petroleum Geochemistry and Environment, Yangtze University, Wuhan, 430100, China
| | - Kewei Xu
- School of Economics and Management, China University of Geosciences, Wuhan, 430074, China
| | - Hao Zhang
- School of Economics and Management, China University of Geosciences, Wuhan, 430074, China
| | - Xusheng Gong
- Hubei University of Science and Technology, Xianning, 437100, China
| | - Chao He
- Hubei University of Science and Technology, Xianning, 437100, China.
| |
Collapse
|
4
|
Yuan Z, Shen Y, Hoek G, Vermeulen R, Kerckhoffs J. LUR modeling of long-term average hourly concentrations of NO 2 using hyperlocal mobile monitoring data. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024; 922:171251. [PMID: 38417522 DOI: 10.1016/j.scitotenv.2024.171251] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/08/2023] [Revised: 02/22/2024] [Accepted: 02/22/2024] [Indexed: 03/01/2024]
Abstract
Mobile monitoring campaigns have effectively captured spatial hyperlocal variations in long-term average concentrations of regulated and unregulated air pollutants. However, their application in estimating spatiotemporally varying maps has rarely been investigated. Tackling this gap, we investigated whether mobile measurements can assess long-term average nitrogen dioxide (NO2) concentrations for each hour of the day. Using mobile NO2 data monitored for 10 months in Amsterdam, we examined the performance of two spatiotemporal land use regression (LUR) methods, Spatiotemporal-Kriging and GTWR (Geographical and Temporal Weighted Regression), alongside two classical spatial LUR models developed separately for each hour. We found that mobile measurements follow the general pattern of fixed-site measurements, but with considerable deviations (indicating collection uncertainty). Leveraging heterogeneous spatiotemporal autocorrelations, GTWR smoothed these deviations and achieved an overall performance of an R2 of 0.49 and a Mean Absolute Error of 6.33 μg/m3, validated by long-term fixed-site measurements (out-of-sample). The other models tested were more affected by the collection uncertainty. We highlighted that the spatiotemporal variations captured in mobile measurements can be used to reconstruct long-term average hourly air pollution maps. These maps facilitate dynamic exposure assessments considering spatiotemporal human activity patterns.
Collapse
Affiliation(s)
- Zhendong Yuan
- Institute for Risk Assessment Sciences, Utrecht University, Utrecht, the Netherlands.
| | - Youchen Shen
- Institute for Risk Assessment Sciences, Utrecht University, Utrecht, the Netherlands
| | - Gerard Hoek
- Institute for Risk Assessment Sciences, Utrecht University, Utrecht, the Netherlands
| | - Roel Vermeulen
- Institute for Risk Assessment Sciences, Utrecht University, Utrecht, the Netherlands; Julius Centre for Health Sciences and Primary Care, University Medical Centre, University of Utrecht, the Netherlands
| | - Jules Kerckhoffs
- Institute for Risk Assessment Sciences, Utrecht University, Utrecht, the Netherlands
| |
Collapse
|
5
|
Shen Y, de Hoogh K, Schmitz O, Clinton N, Tuxen-Bettman K, Brandt J, Christensen JH, Frohn LM, Geels C, Karssenberg D, Vermeulen R, Hoek G. Monthly average air pollution models using geographically weighted regression in Europe from 2000 to 2019. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024; 918:170550. [PMID: 38320693 DOI: 10.1016/j.scitotenv.2024.170550] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/19/2023] [Revised: 01/02/2024] [Accepted: 01/27/2024] [Indexed: 02/12/2024]
Abstract
Detailed spatial models of monthly air pollution levels at a very fine spatial resolution (25 m) can help facilitate studies to explore critical time-windows of exposure at intermediate term. Seasonal changes in air pollution may affect both levels and spatial patterns of air pollution across Europe. We built Europe-wide land-use regression (LUR) models to estimate monthly concentrations of regulated air pollutants (NO2, O3, PM10 and PM2.5) between 2000 and 2019. Monthly average concentrations were collected from routine monitoring stations. Including both monthly-fixed and -varying spatial variables, we used supervised linear regression (SLR) to select predictors and geographically weighted regression (GWR) to estimate spatially-varying regression coefficients for each month. Model performance was assessed with 5-fold cross-validation (CV). We also compared the performance of the monthly LUR models with monthly adjusted concentrations. Results revealed significant monthly variations in both estimates and model structure, particularly for O3, PM10, and PM2.5. The 5-fold CV showed generally good performance of the monthly GWR models across months and years (5-fold CV R2: 0.31-0.66 for NO2, 0.4-0.79 for O3, 0.4-0.78 for PM10, 0.46-0.87 for PM2.5). Monthly GWR models slightly outperformed monthly-adjusted models. Correlations between monthly GWR model were generally moderate to high (Pearson correlation >0.6). In conclusion, we are the first to develop robust monthly LUR models for air pollution in Europe. These monthly LUR models, at a 25 m spatial resolution, enhance epidemiologists to better characterize Europe-wide intermediate-term health effects related to air pollution, facilitating investigations into critical exposure time windows in birth cohort studies.
Collapse
Affiliation(s)
- Youchen Shen
- Institute for Risk Assessment Sciences, Utrecht University, Utrecht, the Netherlands.
| | - Kees de Hoogh
- Institute for Risk Assessment Sciences, Utrecht University, Utrecht, the Netherlands; Swiss Tropical and Public Health Institute, Basel, Switzerland; University of Basel, Basel, Switzerland
| | - Oliver Schmitz
- Department of Physical Geography, Faculty of Geosciences, Utrecht University, Utrecht, the Netherlands
| | - Nick Clinton
- Google, Inc, Mountain View, California, United States
| | | | - Jørgen Brandt
- Department of Environmental Science, Aarhus University, Roskilde, Denmark
| | | | - Lise M Frohn
- Department of Environmental Science, Aarhus University, Roskilde, Denmark
| | - Camilla Geels
- Department of Environmental Science, Aarhus University, Roskilde, Denmark
| | - Derek Karssenberg
- Department of Physical Geography, Faculty of Geosciences, Utrecht University, Utrecht, the Netherlands
| | - Roel Vermeulen
- Institute for Risk Assessment Sciences, Utrecht University, Utrecht, the Netherlands; Julius Centre for Health Sciences and Primary Care, University Medical Centre, Utrecht University, Utrecht, the Netherlands
| | - Gerard Hoek
- Institute for Risk Assessment Sciences, Utrecht University, Utrecht, the Netherlands
| |
Collapse
|
6
|
Alari A, Ranzani O, Olmos S, Milà C, Rico A, Ballester J, Basagaña X, Dadvand P, Duarte-Salles T, Nieuwenhuijsen M, Vivanco-Hidalgo RM, Tonne C. Short-term exposure to air pollution and hospital admission after COVID-19 in Catalonia: the COVAIR-CAT study. Int J Epidemiol 2024; 53:dyae041. [PMID: 38514998 DOI: 10.1093/ije/dyae041] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2023] [Accepted: 03/01/2024] [Indexed: 03/23/2024] Open
Abstract
BACKGROUND A growing body of evidence has reported positive associations between long-term exposure to air pollution and poor COVID-19 outcomes. Inconsistent findings have been reported for short-term air pollution, mostly from ecological study designs. Using individual-level data, we studied the association between short-term variation in air pollutants [nitrogen dioxide (NO2), particulate matter with a diameter of <2.5 µm (PM2.5) and a diameter of <10 µm (PM10) and ozone (O3)] and hospital admission among individuals diagnosed with COVID-19. METHODS The COVAIR-CAT (Air pollution in relation to COVID-19 morbidity and mortality: a large population-based cohort study in Catalonia, Spain) cohort is a large population-based cohort in Catalonia, Spain including 240 902 individuals diagnosed with COVID-19 in the primary care system from 1 March until 31 December 2020. Our outcome was hospitalization within 30 days of COVID-19 diagnosis. We used individual residential address to assign daily air-pollution exposure, estimated using machine-learning methods for spatiotemporal prediction. For each pandemic wave, we fitted Cox proportional-hazards models accounting for non-linear-distributed lagged exposure over the previous 7 days. RESULTS Results differed considerably by pandemic wave. During the second wave, an interquartile-range increase in cumulative weekly exposure to air pollution (lag0_7) was associated with a 12% increase (95% CI: 4% to 20%) in COVID-19 hospitalizations for NO2, 8% (95% CI: 1% to 16%) for PM2.5 and 9% (95% CI: 3% to 15%) for PM10. We observed consistent positive associations for same-day (lag0) exposure, whereas lag-specific associations beyond lag0 were generally not statistically significant. CONCLUSIONS Our study suggests positive associations between NO2, PM2.5 and PM10 and hospitalization risk among individuals diagnosed with COVID-19 during the second wave. Cumulative hazard ratios were largely driven by exposure on the same day as hospitalization.
Collapse
Affiliation(s)
- Anna Alari
- Barcelona Institute for Global Health, ISGlobal, Barcelona, Spain
- Universitat Pompeu Fabra (UPF), Barcelona, Spain
- CIBER Epidemiología y Salud Pública (CIBERESP), Madrid, Spain
| | - Otavio Ranzani
- Barcelona Institute for Global Health, ISGlobal, Barcelona, Spain
- Universitat Pompeu Fabra (UPF), Barcelona, Spain
- CIBER Epidemiología y Salud Pública (CIBERESP), Madrid, Spain
| | - Sergio Olmos
- Barcelona Institute for Global Health, ISGlobal, Barcelona, Spain
- Universitat Pompeu Fabra (UPF), Barcelona, Spain
- CIBER Epidemiología y Salud Pública (CIBERESP), Madrid, Spain
| | - Carles Milà
- Barcelona Institute for Global Health, ISGlobal, Barcelona, Spain
- Universitat Pompeu Fabra (UPF), Barcelona, Spain
- CIBER Epidemiología y Salud Pública (CIBERESP), Madrid, Spain
| | - Alex Rico
- Barcelona Institute for Global Health, ISGlobal, Barcelona, Spain
- Universitat Pompeu Fabra (UPF), Barcelona, Spain
- CIBER Epidemiología y Salud Pública (CIBERESP), Madrid, Spain
| | - Joan Ballester
- Barcelona Institute for Global Health, ISGlobal, Barcelona, Spain
| | - Xavier Basagaña
- Barcelona Institute for Global Health, ISGlobal, Barcelona, Spain
- Universitat Pompeu Fabra (UPF), Barcelona, Spain
- CIBER Epidemiología y Salud Pública (CIBERESP), Madrid, Spain
| | - Payam Dadvand
- Barcelona Institute for Global Health, ISGlobal, Barcelona, Spain
- Universitat Pompeu Fabra (UPF), Barcelona, Spain
- CIBER Epidemiología y Salud Pública (CIBERESP), Madrid, Spain
| | - Talita Duarte-Salles
- Fundació Institut Universitari per a la recerca a l'Atenció Primària de Salut Jordi Gol i Gurina (IDIAPJGol), Barcelona, Spain
- Department of Medical Informatics, Erasmus University Medical Center, Rotterdam, The Netherlands
| | - Mark Nieuwenhuijsen
- Barcelona Institute for Global Health, ISGlobal, Barcelona, Spain
- Universitat Pompeu Fabra (UPF), Barcelona, Spain
- CIBER Epidemiología y Salud Pública (CIBERESP), Madrid, Spain
| | | | - Cathryn Tonne
- Barcelona Institute for Global Health, ISGlobal, Barcelona, Spain
- Universitat Pompeu Fabra (UPF), Barcelona, Spain
- CIBER Epidemiología y Salud Pública (CIBERESP), Madrid, Spain
| |
Collapse
|
7
|
Valipour Shokouhi B, de Hoogh K, Gehrig R, Eeftens M. Estimation of historical daily airborne pollen concentrations across Switzerland using a spatio temporal random forest model. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024; 906:167286. [PMID: 37742957 DOI: 10.1016/j.scitotenv.2023.167286] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/09/2023] [Revised: 09/20/2023] [Accepted: 09/20/2023] [Indexed: 09/26/2023]
Abstract
High concentrations of airborne pollen trigger seasonal allergies and possibly more severe adverse respiratory and cardiovascular health events. Predicting pollen concentration accurately is valuable for epidemiological studies, in order to study the effects of pollen exposure. We aimed to develop a spatiotemporal machine learning model predicting daily pollen concentrations at a spatial resolution of 1 × 1 km across Switzerland between 2000 and 2019. Daily pollen concentrations for five common, highly allergenic pollen types (hazel, alder, birch ash, and grasses) were available from fourteen measurement sites across Switzerland. We considered several predictors such as elevation, species distribution, wind speed, wind direction, temperature, precipitation, relative humidity, satellite-observed Normalized Difference Vegetation Index, and land-use (CORINE, Landsat satellite) to explain variation in pollen concentration. We employed feature engineering techniques to encode categorical variables and fill in missing values. We applied a random forest machine learning model with 5-fold cross-validation. The 5th-99th percentiles for concentrations of hazel, alder, birch, ash, and grass pollen at the pollen monitoring stations were 0-298, 0-306, 0-1153, 0-800, and 0-290 pollen grains/m3, respectively. The results of a predictive model for these concentrations yielded overall R2 values of 0.87, 0.84, 0.89, 0.88, and 0.91, and temporal root mean squared errors (RMSEs) of 16.07, 16.72, 69.04, 41.50, and 22.45 pollen grains/m3. An analysis of predictor variable importance indicates that the average national daily pollen concentration is the most important predictor of pollen concentrations for all pollen types. Furthermore, meteorological variables including temperature, total precipitation, humidity, boundary layer height, wind speed, and wind direction, as well as date and satellite features, are important factors in pollen concentration prediction. These spatiotemporal pollen models will serve to estimate individual residential pollen exposure for epidemiological studies. Resulting estimates will enable us to study respiratory and cardiovascular mortality and hospital admissions in Switzerland.
Collapse
Affiliation(s)
- Behzad Valipour Shokouhi
- Swiss Tropical and Public Health Institute, Allschwil, Switzerland; University of Basel, Basel, Switzerland
| | - Kees de Hoogh
- Swiss Tropical and Public Health Institute, Allschwil, Switzerland; University of Basel, Basel, Switzerland
| | - Regula Gehrig
- Federal Office of Meteorology and Climatology MeteoSwiss, Switzerland
| | - Marloes Eeftens
- Swiss Tropical and Public Health Institute, Allschwil, Switzerland; University of Basel, Basel, Switzerland.
| |
Collapse
|
8
|
Tian T, Kwan MP, Vermeulen R, Helbich M. Geographic uncertainties in external exposome studies: A multi-scale approach to reduce exposure misclassification. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024; 906:167637. [PMID: 37816406 DOI: 10.1016/j.scitotenv.2023.167637] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/27/2023] [Revised: 09/15/2023] [Accepted: 10/05/2023] [Indexed: 10/12/2023]
Abstract
BACKGROUND Many studies on environment-health associations have emphasized that the selected buffer size (i.e., the scale of the geographic context when exposures are assigned at people's address location) may affect estimated effect sizes. However, there is limited methodological progress in addressing these buffer size-related uncertainties. AIM We aimed to 1) develop a statistical multi-scale approach to address buffer-related scale effects in cohort studies, and 2) investigate how environment-health associations differ between our multi-scale approach and ad hoc selected buffer sizes. METHODS We used lacunarity analyses to determine the largest meaningful buffer size for multiple high-resolution exposure surfaces (i.e., fine particulate matter [PM2.5], noise, and the normalized difference vegetation index [NDVI]). Exposures were linked to 7.7 million Dutch adults at their home addresses. We assigned exposure estimates based on buffers with fine-grained distance increments until the lacunarity-based upper limit was reached. Bayesian Cox model averaging addressed geographic uncertainties in the estimated exposure effect sizes within the exposure-specific upper buffer limits on mortality. Z-tests assessed statistical differences between averaged effect sizes and those obtained through pre-selected 100, 300, 1200, and 1500 m buffers. RESULTS The estimated lacunarity curves suggested exposure-specific upper buffer size limits; the largest was for NDVI (960 m), followed by noise (910 m) and PM2.5 (450 m). We recorded 845,229 deaths over eight years of follow-up. Our multi-scale approach indicated that higher values of NDVI were health-protectively associated with mortality risk (hazard ratio [HR]: 0.917, 95 % confidence interval [CI]: 0.886-0.948). Increased noise exposure was associated with an increased risk of mortality (HR: 1.003, 95 % CI: 1.002-1.003), while PM2.5 showed null associations (HR:0.998, 95 % CI: 0.997-1.000). Effect sizes of NDVI and noise differed significantly across the averaged and prespecified buffers (p < 0.05). CONCLUSIONS Geographic uncertainties in residential-based exposure assessments may obscure environment-health associations or risk spurious ones. Our multi-scale approach produced more consistent effect estimates and mitigated contextual uncertainties.
Collapse
Affiliation(s)
- Tian Tian
- Department of Human Geography and Spatial Planning, Utrecht University, Princetonlaan 8a, 3584 CB Utrecht, the Netherlands.
| | - Mei-Po Kwan
- Department of Human Geography and Spatial Planning, Utrecht University, Princetonlaan 8a, 3584 CB Utrecht, the Netherlands; Department of Geography and Resource Management and Institute of Space and Earth Information Science, Chinese University of Hong Kong, Hong Kong, China
| | - Roel Vermeulen
- Institute for Risk Assessment Sciences, Utrecht University, Utrecht, the Netherlands; Julius Centre for Health Sciences and Primary Care, University Medical Centre Utrecht, Utrecht, the Netherlands
| | - Marco Helbich
- Department of Human Geography and Spatial Planning, Utrecht University, Princetonlaan 8a, 3584 CB Utrecht, the Netherlands
| |
Collapse
|
9
|
Liu R, Ma Z, Gasparrini A, de la Cruz A, Bi J, Chen K. Integrating Augmented In Situ Measurements and a Spatiotemporal Machine Learning Model To Back Extrapolate Historical Particulate Matter Pollution over the United Kingdom: 1980-2019. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2023; 57:21605-21615. [PMID: 38085698 DOI: 10.1021/acs.est.3c05424] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/27/2023]
Abstract
Historical PM2.5 data are essential for assessing the health effects of air pollution exposure across the life course or early life. However, a lack of high-quality data sources, such as satellite-based aerosol optical depth before 2000, has resulted in a gap in spatiotemporally resolved PM2.5 data for historical periods. Taking the United Kingdom as an example, we leveraged the light gradient boosting model to capture the spatiotemporal association between PM2.5 concentrations and multi-source geospatial predictors. Augmented PM2.5 from PM10 measurements expanded the spatiotemporal representativeness of the ground measurements. Observations before and after 2009 were used to train and test the models, respectively. Our model showed fair prediction accuracy from 2010 to 2019 [the ranges of coefficients of determination (R2) for the grid-based cross-validation are 0.71-0.85] and commendable back extrapolation performance from 1998 to 2009 (the ranges of R2 for the independent external testing are 0.32-0.65) at the daily level. The pollution episodes in the 1980s and pollution levels in the 1990s were also reproduced by our model. The 4-decade PM2.5 estimates demonstrated that most regions in England witnessed significant downward trends in PM2.5 pollution. The methods developed in this study are generalizable to other data-rich regions for historical air pollution exposure assessment.
Collapse
Affiliation(s)
- Riyang Liu
- State Key Laboratory of Pollution Control and Resource Reuse, School of the Environment, Nanjing University, Nanjing, Jiangsu 210023, People's Republic of China
- Department of Environmental Health Sciences, Yale School of Public Health, New Haven, Connecticut 06520, United States
- Yale Center on Climate Change and Health, Yale School of Public Health, New Haven, Connecticut 06520, United States
| | - Zongwei Ma
- State Key Laboratory of Pollution Control and Resource Reuse, School of the Environment, Nanjing University, Nanjing, Jiangsu 210023, People's Republic of China
| | - Antonio Gasparrini
- Environment & Health Modelling (EHM) Lab, Department of Public Health Environments and Society, London School of Hygiene & Tropical Medicine, London WC1H 9SH, United Kingdom
| | - Arturo de la Cruz
- Environment & Health Modelling (EHM) Lab, Department of Public Health Environments and Society, London School of Hygiene & Tropical Medicine, London WC1H 9SH, United Kingdom
| | - Jun Bi
- State Key Laboratory of Pollution Control and Resource Reuse, School of the Environment, Nanjing University, Nanjing, Jiangsu 210023, People's Republic of China
| | - Kai Chen
- Department of Environmental Health Sciences, Yale School of Public Health, New Haven, Connecticut 06520, United States
- Yale Center on Climate Change and Health, Yale School of Public Health, New Haven, Connecticut 06520, United States
| |
Collapse
|
10
|
Alvarado-Molina M, Curto A, Wheeler AJ, Tham R, Cerin E, Nieuwenhuijsen M, Vermeulen R, Donaire-Gonzalez D. Improving traffic-related air pollution estimates by modelling minor road traffic volumes. ENVIRONMENTAL POLLUTION (BARKING, ESSEX : 1987) 2023; 338:122657. [PMID: 37813140 DOI: 10.1016/j.envpol.2023.122657] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/27/2023] [Revised: 09/12/2023] [Accepted: 09/28/2023] [Indexed: 10/11/2023]
Abstract
Accurately estimating annual average daily traffic (AADT) on minor roads is essential for assessing traffic-related air pollution (TRAP) exposure, particularly in areas where most people live. Our study assessed the direct and indirect external validity of three methods used to estimate AADT on minor roads in Melbourne, Australia. We estimated the minor road AADT using a fixed-value approach (assuming 600 vehicles/day) and linear and negative binomial (NB) models. The models were generated using road type, road importance index, AADT and distance of the nearest major road, population density, workplace density, and weighted road density. External measurements of traffic counts, as well as black carbon (BC) and ultrafine particles (UFP), were conducted at 201 sites for direct and indirect validation, respectively. Statistical tests included Akaike information criterion (AIC) to compare models' performance, the concordance correlation coefficient (CCC) for direct validation, and Spearman's correlation coefficient for indirect validation. Results show that 88.5% of the roads in Melbourne are minor, yet only 18.9% have AADT. The performance assessment of minor road models indicated comparable performance for both models (AIC of 1,023,686 vs. 1,058,502). In the direct validation with external traffic measurements, there was no difference between the three methods for overall minor roads. However, for minor roads within residential areas, CCC (95% confidence interval [CI]) values were -0.001 (-0.17; 0.18), 0.47 (0.32; 0.60), and 0.29 (0.18; 0.39) for the fixed-value approach, the linear model, and the NB model, respectively. In the indirect validation, we found differences only on UFP where the Spearman's correlation (95% CI) for both models and fixed-value approach were 0.50 (0.37; 0.62) and 0.34 (0.19; 0.48), respectively. In conclusion, our linear model outperformed the fixed-value approach when compared against traffic and TRAP measurements. The methodology followed in this study is relevant to locations with incomplete minor road AADT data.
Collapse
Affiliation(s)
- Miguel Alvarado-Molina
- Mary Mackillop Institute for Health Research, Australian Catholic University, Melbourne, Australia.
| | - Ariadna Curto
- Mary Mackillop Institute for Health Research, Australian Catholic University, Melbourne, Australia; ISGlobal, Barcelona, Spain; Department of Experimental and Health Sciences, Universitat Pompeu (UPF), Barcelona, Spain; CIBER Epidemiología y Salud Pública (CIBERESP), Madrid, Spain
| | - Amanda J Wheeler
- Commonwealth Scientific and Industrial Research Organisation (CSIRO), Environment, Melbourne, VIC 3195, Australia; Menzies Institute for Medical Research, University of Tasmania, Hobart, TAS 7000, Australia
| | - Rachel Tham
- Melbourne School of Population and Global Health, The University of Melbourne, Melbourne, VIC 3053, Australia
| | - Ester Cerin
- Mary Mackillop Institute for Health Research, Australian Catholic University, Melbourne, Australia; School of Public Health, The University of Hong Kong, 7 Sassoon Rd., Sandy Bay, Hong Kong; Baker Heart and Diabetes Institute, Melbourne, VIC, Australia; Department of Community Medicine, UiT The Artic University of Norway, Tromsø, Norway
| | - Mark Nieuwenhuijsen
- Mary Mackillop Institute for Health Research, Australian Catholic University, Melbourne, Australia; ISGlobal, Barcelona, Spain; Department of Experimental and Health Sciences, Universitat Pompeu (UPF), Barcelona, Spain; CIBER Epidemiología y Salud Pública (CIBERESP), Madrid, Spain
| | - Roel Vermeulen
- Institute for Risk, Assessment Sciences (IRAS), Division of Environmental Epidemiology (EEPI), Utrecht University, Utrecht, the Netherlands; Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, the Netherlands
| | - David Donaire-Gonzalez
- Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, the Netherlands
| |
Collapse
|
11
|
Milà C, Ballester J, Basagaña X, Nieuwenhuijsen MJ, Tonne C. Estimating daily air temperature and pollution in Catalonia: A comprehensive spatiotemporal modelling of multiple exposures. ENVIRONMENTAL POLLUTION (BARKING, ESSEX : 1987) 2023; 337:122501. [PMID: 37690467 DOI: 10.1016/j.envpol.2023.122501] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/06/2023] [Revised: 08/31/2023] [Accepted: 09/01/2023] [Indexed: 09/12/2023]
Abstract
Environmental epidemiology studies require models of multiple exposures to adjust for co-exposure and explore interactions. We estimated spatiotemporal exposure to surface air temperature and pollution (PM2.5, PM10, NO2, O3) at high spatiotemporal resolution (daily, 250 m) for 2018-2020 in Catalonia. Innovations include the use of TROPOMI products, a data split for remote sensing gap-filling evaluation, estimation of prediction uncertainty, and use of explainable machine learning. We compiled meteorological and air quality station measurements, climate and atmospheric composition reanalyses, remote sensing products, and other spatiotemporal data. We performed gap-filling of remotely-sensed products using Random Forest (RF) models and validated them using Out-Of-Bag (OOB) samples and a structured data split. The exposure modelling workflow consisted of: 1) PM2.5 station imputation with PM10 data; 2) quantile RF (QRF) model fitting; and 3) geostatistical residual spatial interpolation. Prediction uncertainty was estimated using QRF. SHAP values were used to examine variable importance and the fitted relationships. Model performance was assessed via nested CV at the station level. Evaluation of the gap-filling models using the structured split showed error underestimation when using OOB. Temperature models had the best performance (R2 =0.98) followed by the gaseous air pollutants (R2 =0.81 for NO2 and 0.86 for O3), while the performance of the PM2.5 and PM10 models was lower (R2 =0.57 and 0.63 respectively). Predicted exposure patterns captured urban heat island effects, dust advection events, and NO2 hotspots. SHAP values estimated a high importance of TROPOMI tropospheric NO2 columns in PM and NO2 models, and confirmed that the fitted associations conformed to prior knowledge. Our work highlights the importance of correctly validating gap-filling models and the potential of TROPOMI measurements. Moderate performance in PM models can be partly explained by the poor station coverage. Our exposure estimates can be used in epidemiological studies potentially accounting for exposure uncertainty.
Collapse
Affiliation(s)
- Carles Milà
- ISGlobal, Barcelona, Spain; Universitat Pompeu Fabra (UPF), Barcelona, Spain
| | | | - Xavier Basagaña
- ISGlobal, Barcelona, Spain; Universitat Pompeu Fabra (UPF), Barcelona, Spain; CIBER Epidemiología y Salud Pública, Barcelona, Spain
| | - Mark J Nieuwenhuijsen
- ISGlobal, Barcelona, Spain; Universitat Pompeu Fabra (UPF), Barcelona, Spain; CIBER Epidemiología y Salud Pública, Barcelona, Spain
| | - Cathryn Tonne
- ISGlobal, Barcelona, Spain; Universitat Pompeu Fabra (UPF), Barcelona, Spain; CIBER Epidemiología y Salud Pública, Barcelona, Spain.
| |
Collapse
|
12
|
Wei L, Kwan MP, Vermeulen R, Helbich M. Measuring environmental exposures in people's activity space: The need to account for travel modes and exposure decay. JOURNAL OF EXPOSURE SCIENCE & ENVIRONMENTAL EPIDEMIOLOGY 2023; 33:954-962. [PMID: 36788269 DOI: 10.1038/s41370-023-00527-z] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/13/2022] [Revised: 01/24/2023] [Accepted: 01/25/2023] [Indexed: 06/18/2023]
Abstract
BACKGROUND Accurately quantifying people's out-of-home environmental exposure is important for identifying disease risk factors. Several activity space-based exposure assessments exist, possibly leading to different exposure estimates, and have neither considered individual travel modes nor exposure-related distance decay effects. OBJECTIVE We aimed (1) to develop an activity space-based exposure assessment approach that included travel modes and exposure-related distance decay effects and (2) to compare the size of such spaces and the exposure estimates derived from them across typically used activity space operationalizations. METHODS We used 7-day-long global positioning system (GPS)-enabled smartphone-based tracking data of 269 Dutch adults. People's GPS trajectory points were classified into passive and active travel modes. Exposure-related distance decay effects were modeled through linear, exponential, and Gaussian decay functions. We performed cross-comparisons on these three functional decay models and an unweighted model in conjunction with four activity space models (i.e., home-based buffers, minimum convex polygons, two standard deviational ellipses, and time-weighted GPS-based buffers). We applied non-parametric Kruskal-Wallis tests, pair-wise Wilcoxon signed-rank tests, and Spearman correlations to assess mean differences in the extent of the activity spaces and correlations across exposures to particulate matter (PM2.5), noise, green space, and blue space. RESULTS Participants spent, on average, 42% of their daily life out-of-home. We observed that including travel modes into activity space delineation resulted in significantly more compact activity spaces. Exposure estimates for PM2.5 and blue space were significantly (p < 0.05) different between exposure estimates that did or did not account for travel modes, unlike noise and green space, for which differences did not reach significance. While the inclusion of distance decay effects significantly affected noise and green space exposure assessments, the decay functions applied appear not to have had any impact on the results. We found that residential exposure estimates appear appropriate for use as proxy values for the overall amount of PM2.5 exposure in people's daily lives, while GPS-based assessments are suitable for noise, green space, and blue space. SIGNIFICANCE For some exposures, the tested activity space definitions, although significantly correlated, exhibited differing exposure estimate results based on inclusion or exclusion of travel modes or distance decay effect. Results only supported using home-based buffer values as proxies for individuals' daily short-term PM2.5 exposure.
Collapse
Affiliation(s)
- Lai Wei
- Department of Human Geography and Spatial Planning, Utrecht University, Utrecht, The Netherlands.
| | - Mei-Po Kwan
- Department of Human Geography and Spatial Planning, Utrecht University, Utrecht, The Netherlands
- Department of Geography and Resource Management and Institute of Space and Earth Information Science, Chinese University of Hong Kong, Hong Kong, China
| | - Roel Vermeulen
- Institute for Risk Assessment Sciences, Utrecht University, Utrecht, The Netherlands
- Julius Centre for Health Sciences and Primary Care, University Medical Centre, Utrecht University, Utrecht, The Netherlands
| | - Marco Helbich
- Department of Human Geography and Spatial Planning, Utrecht University, Utrecht, The Netherlands
| |
Collapse
|
13
|
Yuan Z, Kerckhoffs J, Shen Y, de Hoogh K, Hoek G, Vermeulen R. Integrating large-scale stationary and local mobile measurements to estimate hyperlocal long-term air pollution using transfer learning methods. ENVIRONMENTAL RESEARCH 2023; 228:115836. [PMID: 37028540 DOI: 10.1016/j.envres.2023.115836] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/19/2023] [Revised: 03/31/2023] [Accepted: 04/01/2023] [Indexed: 05/16/2023]
Abstract
Mobile air quality measurements are collected typically for several seconds per road segment and in specific timeslots (e.g., working hours). These short-term and on-road characteristics of mobile measurements become the ubiquitous shortcomings of applying land use regression (LUR) models to estimate long-term concentrations at residential addresses. This issue was previously found to be mitigated by transferring LUR models to the long-term residential domain using routine long-term measurements in the studied region as the transfer target (local scale). However, long-term measurements are generally sparse in individual cities. For this scenario, we propose an alternative by taking long-term measurements collected over a larger geographical area (global scale) as the transfer target and local mobile measurements as the source (Global2Local model). We empirically tested national, airshed countries (i.e., national plus neighboring countries) and Europe as the global scale in developing Global2Local models to map nitrogen dioxide (NO2) concentrations in Amsterdam. The airshed countries scale provided the lowest absolute errors, and the Europe-wide scale had the highest R2. Compared to a "global" LUR model (trained exclusively with European-wide long-term measurements), and a local mobile LUR model (using mobile data from Amsterdam only), the Global2Local model significantly reduced the absolute error of the local mobile LUR model (root-mean-square error, 6.9 vs 12.6 μg/m3) and improved the percentage explained variances compared to the global model (R2, 0.43 vs 0.28, assessed by independent long-term NO2 measurements in Amsterdam, n = 90). The Global2Local method improves the generalizability of mobile measurements in mapping long-term residential concentrations with a fine spatial resolution, which is preferred in environmental epidemiological studies.
Collapse
Affiliation(s)
- Zhendong Yuan
- Institute for Risk Assessment Sciences, Utrecht University, 3584 CK, Utrecht, Netherlands.
| | - Jules Kerckhoffs
- Institute for Risk Assessment Sciences, Utrecht University, 3584 CK, Utrecht, Netherlands
| | - Youchen Shen
- Institute for Risk Assessment Sciences, Utrecht University, 3584 CK, Utrecht, Netherlands
| | - Kees de Hoogh
- Swiss Tropical and Public Health Institute, Kreuzstrasse 2, 4123, Allschwil, Switzerland; University of Basel, Petersplatz 1, Postfach, 4001, Basel, Switzerland
| | - Gerard Hoek
- Institute for Risk Assessment Sciences, Utrecht University, 3584 CK, Utrecht, Netherlands
| | - Roel Vermeulen
- Institute for Risk Assessment Sciences, Utrecht University, 3584 CK, Utrecht, Netherlands; Julius Centre for Health Sciences and Primary Care, University Medical Centre, University of Utrecht, 3584 CK, Utrecht, the Netherlands
| |
Collapse
|
14
|
Kamińska JA, Kajewska-Szkudlarek J. The importance of data splitting in combined NO x concentration modelling. THE SCIENCE OF THE TOTAL ENVIRONMENT 2023; 868:161744. [PMID: 36690101 DOI: 10.1016/j.scitotenv.2023.161744] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/28/2022] [Revised: 01/04/2023] [Accepted: 01/17/2023] [Indexed: 06/17/2023]
Abstract
The polluted air breathed every day by those living in large conurbations poses a significant risk to their health. Through effective modelling (prediction) of concentrations of pollutants and identification of the factors influencing them, it should be possible to obtain advance information on dangers and to plan and implement measures to reduce them. This work describes two different modelling approaches: based on the NOx concentration of the previous hour (C&RT models); and based on meteorological factors, traffic flow, and past (up to two previous hours) NOx and NO2 concentrations (CA models). For each approach, three alternative machine learning methods were applied: artificial neutral network (ANN), random forest (RF), and support vector regression (SVR). The best fits were obtained for the models using ANN and RF (MAPE values in the range 18.3-18.5 %). Poorer fits were found for the SVR models (MAPE equal to 23.4 % for the C&RT approach and 29.3 % for CA). No significant preferences were identified between the C&RT and CA approaches (based on various goodness-of-fit measures). The choice should be determined by the purposes for which the forecast is to be used.
Collapse
Affiliation(s)
- Joanna A Kamińska
- Department of Applied Mathematics, Wroclaw University of Environmental and Life Sciences, Grunwaldzka Street 53, 50-357 Wroclaw, Poland
| | - Joanna Kajewska-Szkudlarek
- Institute of Environmental Engineering, Wroclaw University of Environmental and Life Sciences, Grunwaldzki Square 24, 50-363 Wroclaw, Poland.
| |
Collapse
|
15
|
Wong PY, Su HJ, Lung SCC, Wu CD. An ensemble mixed spatial model in estimating long-term and diurnal variations of PM 2.5 in Taiwan. THE SCIENCE OF THE TOTAL ENVIRONMENT 2023; 866:161336. [PMID: 36603626 DOI: 10.1016/j.scitotenv.2022.161336] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/17/2022] [Revised: 12/13/2022] [Accepted: 12/29/2022] [Indexed: 06/17/2023]
Abstract
Meteorology, human activities, and other emission sources drive diurnal cyclic patterns of air pollution. Previous studies mainly focused on the variation of PM2.5 concentrations during daytime rather than nighttime. In addition, assessing the spatial variations of PM2.5 in large areas is a critical issue for environmental epidemiological studies to clarify the health effects from PM2.5 exposures. In terms of air pollution spatial modelling, using only a single model might lose information in capturing spatial and temporal correlation between predictors and pollutant levels. Hence, this study aimed to propose an ensemble mixed spatial model that incorporated Kriging interpolation, land-use regression (LUR), machine learning, and stacking ensemble approach to estimate long-term PM2.5 variations for nearly three decades in daytime and nighttime. Three steps of model development were applied: 1) linear based LUR and Hybrid Kriging-LUR were used to determine influential predictors; 2) machine learning algorithms were used to enhance model prediction accuracy; 3) predictions from the selected machine learning models were fitted and evaluated again to build the final ensemble mixed spatial model. The results showed that prediction performance increased from 0.514 to 0.895 for daily, 0.478 to 0.879 for daytime, and 0.523 to 0.878 for nighttime when applying the proposed ensemble mixed spatial model compared with LUR. Results of overfitting test and extrapolation ability test confirmed the robustness and reliability of the developed models. The distance to the nearest thermal power plant, density of soil and pebbles fields, and funeral facilities might affect the variation of PM2.5 levels between daytime and nighttime. The PM2.5 level was higher in daytime compared with nighttime with little difference, revealing the importance of estimating nighttime PM2.5 variations. Our findings also clarified the emission sources in daytime and nighttime, which serve as valuable information for air pollution control strategies establishment.
Collapse
Affiliation(s)
- Pei-Yi Wong
- Department of Environmental and Occupational Health, National Cheng Kung University, Tainan, Taiwan
| | - Huey-Jen Su
- Department of Environmental and Occupational Health, National Cheng Kung University, Tainan, Taiwan
| | - Shih-Chun Candice Lung
- Research Center for Environmental Changes, Academia Sinica, Taipei, Taiwan; Department of Atmospheric Sciences, National Taiwan University, Taipei, Taiwan; Institute of Environmental Health, National Taiwan University, Taipei, Taiwan
| | - Chih-Da Wu
- Department of Geomatics, National Cheng Kung University, Tainan, Taiwan; National Institute of Environmental Health Sciences, National Health Research Institutes, Miaoli, Taiwan.
| |
Collapse
|