1
|
Vachon J, Buteau S, Liu Y, Van Ryswyk K, Hatzopoulou M, Smargiassi A. Spatial and spatiotemporal modelling of intra-urban ultrafine particles: A comparison of linear, nonlinear, regularized, and machine learning methods. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024; 954:176523. [PMID: 39326743 DOI: 10.1016/j.scitotenv.2024.176523] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/11/2024] [Revised: 09/09/2024] [Accepted: 09/23/2024] [Indexed: 09/28/2024]
Abstract
BACKGROUND Machine learning methods are proposed to improve the predictions of ambient air pollution, yet few studies have compared ultrafine particles (UFP) models across a broad range of statistical and machine learning approaches, and only one compared spatiotemporal models. Most reported marginal differences between methods. This limits our ability to draw conclusions about the best methods to model ambient UFPs. OBJECTIVE To compare the performance and predictions of statistical and machine learning methods used to model spatial and spatiotemporal ambient UFPs. METHODS Daily and annual models were developed from UFP measurements from a year-long mobile monitoring campaign in Quebec City, Canada, combined with 262 geospatial and six meteorological predictors. Various road segment lengths were considered (100/300/500 m) for UFP data aggregation. Four statistical methods included linear, non-linear, and regularized regressions, whereas eight machine learning regressions utilized tree-based, neural networks, support vector, and kernel ridge algorithms. Nested cross-validation was used for model training, hyperparameter tuning and performance evaluation. RESULTS Mean annual UFP concentrations was 13,335 particles/cm3. Machine learning outperformed statistical methods in predicting UFPs. Tree-based methods performed best across temporal scales and segment lengths, with XGBoost producing the overall best performing models (annual R2 = 0.78-0.86, RMSE = 2163-2169 particles/cm3; daily R2 = 0.47-0.48, RMSE = 8651-11,422 particles/cm3). With 100 m segments, other annual models performed similarly well, but their prediction surfaces of annual mean UFP concentrations showed signs of overfitting. Spatial aggregation of monitoring data significantly impacted model performance. Longer segments yielded lower RMSE in all daily models and for annual statistical models, but not for annual machine learning models. CONCLUSIONS The use of tree-based methods significantly improved spatiotemporal predictions of UFP concentrations, and to a lesser extent annual concentrations. Segment length and hyperparameter tuning had notable impacts on model performance and should be considered in future studies.
Collapse
Affiliation(s)
- Julien Vachon
- Department of Environmental and Occupational Health, School of Public Health, University of Montreal, Montreal, Canada; Center for Public Health Research (CReSP), University of Montreal and CIUSSS du Centre-Sud-de-l'Île-de-Montréal, Montreal, Canada
| | - Stéphane Buteau
- Department of Environmental and Occupational Health, School of Public Health, University of Montreal, Montreal, Canada; Center for Public Health Research (CReSP), University of Montreal and CIUSSS du Centre-Sud-de-l'Île-de-Montréal, Montreal, Canada
| | - Ying Liu
- Department of Environmental and Occupational Health, School of Public Health, University of Montreal, Montreal, Canada
| | - Keith Van Ryswyk
- Air Pollution Exposure Science Section, Water and Air Quality Bureau, Health Canada, Ottawa, Canada
| | | | - Audrey Smargiassi
- Department of Environmental and Occupational Health, School of Public Health, University of Montreal, Montreal, Canada; Center for Public Health Research (CReSP), University of Montreal and CIUSSS du Centre-Sud-de-l'Île-de-Montréal, Montreal, Canada.
| |
Collapse
|
2
|
Yuan Z, Kerckhoffs J, Li H, Khan J, Hoek G, Vermeulen R. Hyperlocal Air Pollution Mapping: A Scalable Transfer Learning LUR Approach for Mobile Monitoring. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2024; 58:14372-14383. [PMID: 39082120 PMCID: PMC11325550 DOI: 10.1021/acs.est.4c06144] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/17/2024]
Abstract
Addressing the challenge of mapping hyperlocal air pollution in areas without local monitoring, we evaluated unsupervised transfer learning-based land-use regression (LUR) models developed using mobile monitoring data from other cities: CORrelation ALignment (Coral) and its inverse distance-weighted modification (IDW_Coral). These models mitigated domain shifts and transferred patterns learned from mobile air quality monitoring campaigns in Copenhagen and Rotterdam to estimate annual average air pollution levels in Amsterdam (50m road segments) without involving any Amsterdam measurements in model development. For nitrogen dioxide (NO2), IDW_Coral outperformed Copenhagen and Rotterdam LUR models directly applied to Amsterdam, achieving MAE (4.47 μg/m3) and RMSE (5.36 μg/m3) comparable to a locally fitted LUR model (AMS_SLR) developed using Amsterdam mobile measurements collected for 160 days. IDW_Coral yielded an R2 of 0.35, similar to that of the AMS_SLR based on 20 collection days, suggesting a minimum requirement of 20-day mobile monitoring to capture city-specific insights. For ultrafine particles (UFP), IDW_Coral's citywide predictions strongly correlated with previously published mixed-effect models fitted with 160-day Amsterdam measurements (Pearson correlation of 0.71 for UFP and 0.72 for NO2). IDW_Coral demands no direct measurements in the target area, showcasing its potential for large-scale applications and offering significant economic efficiencies in executing mobile monitoring campaigns.
Collapse
Affiliation(s)
- Zhendong Yuan
- Institute for Risk Assessment Sciences, Utrecht University, 3584 CM Utrecht, Netherlands
| | - Jules Kerckhoffs
- Institute for Risk Assessment Sciences, Utrecht University, 3584 CM Utrecht, Netherlands
| | - Hao Li
- Professorship of Big Geospatial Data Management, Technical University of Munich, 85521 Ottobrunn, Germany
| | - Jibran Khan
- Department of Environmental Science, Aarhus University, DK-4000 Roskilde, Denmark
- Danish Big Data Centre for Environment and Health (BERTHA), Aarhus University, DK-4000 Roskilde, Denmark
| | - Gerard Hoek
- Institute for Risk Assessment Sciences, Utrecht University, 3584 CM Utrecht, Netherlands
| | - Roel Vermeulen
- Institute for Risk Assessment Sciences, Utrecht University, 3584 CM Utrecht, Netherlands
- Julius Centre for Health Sciences and Primary Care, University Medical Centre, Utrecht University, 3584 CX Utrecht, The Netherlands
| |
Collapse
|
3
|
Zalzal J, Minet L, Brook J, Mihele C, Chen H, Hatzopoulou M. Capturing Exposure Disparities with Chemical Transport Models: Evaluating the Suitability of Downscaling Using Land Use Regression. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2024. [PMID: 39092553 DOI: 10.1021/acs.est.4c03725] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/04/2024]
Abstract
High resolution exposure surfaces are essential to capture disparities in exposure to traffic-related air pollution in urban areas. In this study, we develop an approach to downscale Chemical Transport Model (CTM) simulations to a hyperlocal level (∼100m) in the Greater Toronto Area (GTA) under three scenarios where emissions from cars, trucks and buses are zeroed out, thus capturing the burden of each transportation mode. This proposed approach statistically fuses CTMs with Land-Use Regression using machine learning techniques. With this proposed downscaling approach, changes in air pollutant concentrations under different scenarios are appropriately captured by downscaling factors that are trained to reflect the spatial distribution of emission reductions. Our validation analysis shows that high-resolution models resulted in better performance than coarse models when compared with observations at reference stations. We used this downscaling approach to assess disparities in exposure to nitrogen dioxide (NO2) for populations composed of renters, low-income households, recent immigrants, and visible minorities. Individuals in all four categories were disproportionately exposed to the burden of cars, trucks, and buses. We conducted this analysis at spatial resolutions of 12, 4, 1 km, and 100 m and observed that disparities were significantly underestimated when using coarse spatial resolutions. This reinforces the need for high-spatial resolution exposure surfaces for environmental justice analyses.
Collapse
Affiliation(s)
- Jad Zalzal
- Department of Civil & Mineral Engineering, University of Toronto, 35 St George Street, Toronto, Ontario M5S 1A4, Canada
| | - Laura Minet
- Department of Civil Engineering, University of Victoria, 3800 Finnerty Road, Victoria, British Columbia V8P 5C2, Canada
| | - Jeffrey Brook
- Dalla Lana School of Public Health, University of Toronto, 155 College Street, Toronto, Ontario M5T 3M7, Canada
| | - Cristian Mihele
- Air Quality Research Division, Environment and Climate Change Canada, 4905 Dufferin Street, North York, Ontario M3H 5T4, Canada
| | - Hong Chen
- Dalla Lana School of Public Health, University of Toronto, 155 College Street, Toronto, Ontario M5T 3M7, Canada
- Environmental Health Science and Research Bureau, Health Canada, 50 Colombine Driveway, Ottawa, Ontario K1A 0K9, Canada
- Public Health Ontario, 480 University Avenue, Toronto, Ontario M5G 1 V2, Canada
- ICES, 2075 Bayview Avenue, Toronto, Ontario M4N 3M5, Canada
| | - Marianne Hatzopoulou
- Department of Civil & Mineral Engineering, University of Toronto, 35 St George Street, Toronto, Ontario M5S 1A4, Canada
| |
Collapse
|
4
|
Apte JS, Manchanda C. High-resolution urban air pollution mapping. Science 2024; 385:380-385. [PMID: 39052801 DOI: 10.1126/science.adq3678] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2024] [Accepted: 06/07/2024] [Indexed: 07/27/2024]
Abstract
Variation in urban air pollution arises because of complex spatial, temporal, and chemical processes, which profoundly affect population exposure, human health, and environmental justice. This Review highlights insights from two popular in situ measurement methods-mobile monitoring and dense sensor networks-that have distinct but complementary strengths in characterizing the dynamics and impacts of the multidimensional urban air quality system. Mobile monitoring can measure many pollutants at fine spatial scales, thereby informing about processes and control strategies. Sensor networks excel at providing temporal resolution at many locations. Increasingly sophisticated studies leveraging both methods can vividly identify spatial and temporal patterns that affect exposures and disparities and offer mechanistic insight toward effective interventions. This Review summarizes the strengths and limitations of these methods and discusses their implications for understanding fine-scale processes and impacts.
Collapse
Affiliation(s)
- Joshua S Apte
- Department of Civil and Environmental Engineering, University of California, Berkeley, Berkeley, CA 94720, USA
- School of Public Health, University of California, Berkeley, Berkeley, CA 94720, USA
| | - Chirag Manchanda
- Department of Civil and Environmental Engineering, University of California, Berkeley, Berkeley, CA 94720, USA
| |
Collapse
|
5
|
Yuan Z, Shen Y, Hoek G, Vermeulen R, Kerckhoffs J. LUR modeling of long-term average hourly concentrations of NO 2 using hyperlocal mobile monitoring data. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024; 922:171251. [PMID: 38417522 DOI: 10.1016/j.scitotenv.2024.171251] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/08/2023] [Revised: 02/22/2024] [Accepted: 02/22/2024] [Indexed: 03/01/2024]
Abstract
Mobile monitoring campaigns have effectively captured spatial hyperlocal variations in long-term average concentrations of regulated and unregulated air pollutants. However, their application in estimating spatiotemporally varying maps has rarely been investigated. Tackling this gap, we investigated whether mobile measurements can assess long-term average nitrogen dioxide (NO2) concentrations for each hour of the day. Using mobile NO2 data monitored for 10 months in Amsterdam, we examined the performance of two spatiotemporal land use regression (LUR) methods, Spatiotemporal-Kriging and GTWR (Geographical and Temporal Weighted Regression), alongside two classical spatial LUR models developed separately for each hour. We found that mobile measurements follow the general pattern of fixed-site measurements, but with considerable deviations (indicating collection uncertainty). Leveraging heterogeneous spatiotemporal autocorrelations, GTWR smoothed these deviations and achieved an overall performance of an R2 of 0.49 and a Mean Absolute Error of 6.33 μg/m3, validated by long-term fixed-site measurements (out-of-sample). The other models tested were more affected by the collection uncertainty. We highlighted that the spatiotemporal variations captured in mobile measurements can be used to reconstruct long-term average hourly air pollution maps. These maps facilitate dynamic exposure assessments considering spatiotemporal human activity patterns.
Collapse
Affiliation(s)
- Zhendong Yuan
- Institute for Risk Assessment Sciences, Utrecht University, Utrecht, the Netherlands.
| | - Youchen Shen
- Institute for Risk Assessment Sciences, Utrecht University, Utrecht, the Netherlands
| | - Gerard Hoek
- Institute for Risk Assessment Sciences, Utrecht University, Utrecht, the Netherlands
| | - Roel Vermeulen
- Institute for Risk Assessment Sciences, Utrecht University, Utrecht, the Netherlands; Julius Centre for Health Sciences and Primary Care, University Medical Centre, University of Utrecht, the Netherlands
| | - Jules Kerckhoffs
- Institute for Risk Assessment Sciences, Utrecht University, Utrecht, the Netherlands
| |
Collapse
|
6
|
Amini H, Bergmann ML, Taghavi Shahri SM, Tayebi S, Cole-Hunter T, Kerckhoffs J, Khan J, Meliefste K, Lim YH, Mortensen LH, Hertel O, Reeh R, Gaarde Nielsen C, Loft S, Vermeulen R, Andersen ZJ, Schwartz J. Harnessing AI to unmask Copenhagen's invisible air pollutants: A study on three ultrafine particle metrics. ENVIRONMENTAL POLLUTION (BARKING, ESSEX : 1987) 2024; 346:123664. [PMID: 38431246 DOI: 10.1016/j.envpol.2024.123664] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/30/2023] [Revised: 02/08/2024] [Accepted: 02/25/2024] [Indexed: 03/05/2024]
Abstract
Ultrafine particles (UFPs) are airborne particles with a diameter of less than 100 nm. They are emitted from various sources, such as traffic, combustion, and industrial processes, and can have adverse effects on human health. Long-term mean ambient average particle size (APS) in the UFP range varies over space within cities, with locations near UFP sources having typically smaller APS. Spatial models for lung deposited surface area (LDSA) within urban areas are limited and currently there is no model for APS in any European city. We collected particle number concentration (PNC), LDSA, and APS data over one-year monitoring campaign from May 2021 to May 2022 across 27 locations and estimated annual mean in Copenhagen, Denmark, and obtained additionally annual mean PNC data from 6 state-owned continuous monitors. We developed 94 predictor variables, and machine learning models (random forest and bagged tree) were developed for PNC, LDSA, and APS. The annual mean PNC, LDSA, and APS were, respectively, 5523 pt/cm3, 12.0 μm2/cm3, and 46.1 nm. The final R2 values by random forest (RF) model were 0.93 for PNC, 0.88 for LDSA, and 0.85 for APS. The 10-fold, repeated 10-times cross-validation R2 values were 0.65, 0.67, and 0.60 for PNC, LDSA, and APS, respectively. The root mean square error for final RF models were 296 pt/cm3, 0.48 μm2/cm3, and 1.60 nm for PNC, LDSA, and APS, respectively. Traffic-related variables, such as length of major roads within buffers 100-150 m and distance to streets with various speed limits were amongst the highly-ranked predictors for our models. Overall, our ML models achieved high R2 values and low errors, providing insights into UFP exposure in a European city where average PNC is quite low. These hyperlocal predictions can be used to study health effects of UFPs in the Danish Capital.
Collapse
Affiliation(s)
- Heresh Amini
- Department of Environmental Medicine and Public Health, Institute for Climate Change, Environmental Health, and Exposomics, Icahn School of Medicine at Mount Sinai, New York, United States; Department of Public Health, University of Copenhagen, Copenhagen, Denmark; Department of Environmental Health, Harvard TH Chan School of Public Health, Boston, MA, United States.
| | - Marie L Bergmann
- Department of Public Health, University of Copenhagen, Copenhagen, Denmark
| | | | - Shali Tayebi
- Department of Public Health, University of Copenhagen, Copenhagen, Denmark
| | - Thomas Cole-Hunter
- Department of Public Health, University of Copenhagen, Copenhagen, Denmark
| | - Jules Kerckhoffs
- Institute for Risk Assessment Sciences (IRAS), Utrecht University, the Netherlands
| | - Jibran Khan
- Department of Environmental Science, Aarhus University, Roskilde, Denmark; Danish Big Data Centre for Environment and Health (BERTHA), Aarhus University, Roskilde, Denmark
| | - Kees Meliefste
- Institute for Risk Assessment Sciences (IRAS), Utrecht University, the Netherlands
| | - Youn-Hee Lim
- Department of Public Health, University of Copenhagen, Copenhagen, Denmark
| | - Laust H Mortensen
- Department of Public Health, University of Copenhagen, Copenhagen, Denmark; Statistics Denmark, Copenhagen, Denmark
| | - Ole Hertel
- Faculty of Technical Sciences, Aarhus University, Denmark
| | | | | | - Steffen Loft
- Department of Public Health, University of Copenhagen, Copenhagen, Denmark
| | - Roel Vermeulen
- Institute for Risk Assessment Sciences (IRAS), Utrecht University, the Netherlands
| | - Zorana J Andersen
- Department of Public Health, University of Copenhagen, Copenhagen, Denmark
| | - Joel Schwartz
- Department of Environmental Health, Harvard TH Chan School of Public Health, Boston, MA, United States
| |
Collapse
|
7
|
Kerckhoffs J, Hoek G, Vermeulen R. Mobile monitoring of air pollutants; performance evaluation of a mixed-model land use regression framework in relation to the number of drive days. ENVIRONMENTAL RESEARCH 2024; 240:117457. [PMID: 37865326 DOI: 10.1016/j.envres.2023.117457] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/21/2023] [Revised: 09/29/2023] [Accepted: 10/18/2023] [Indexed: 10/23/2023]
Abstract
We used black carbon data from a mobile monitoring campaign in Oakland, USA measuring street segments up to 40 times and compared a data-only, LUR model and mixed-model approach with a long-term average, represented by the average concentration based on 40 drive days on that street segment. The mixed model outperformed the data-only and LUR model estimates, with 80% explained variance after 5 drive days and 90% after 14 drive days. The data-only approach needed 8 and 15 to achieve an explained variance of 80% and 90%, respectively, The LUR model never achieved an explained variance higher than 70%. The mixed model is a scalable approach, as it can be used before all street segments in a domain are measured by developing a LUR model and adds information with increasing repeats per street segment.
Collapse
Affiliation(s)
- Jules Kerckhoffs
- Institute for Risk Assessment Sciences, Utrecht University, Utrecht, the Netherlands.
| | - Gerard Hoek
- Institute for Risk Assessment Sciences, Utrecht University, Utrecht, the Netherlands
| | - Roel Vermeulen
- Institute for Risk Assessment Sciences, Utrecht University, Utrecht, the Netherlands; Julius Centre for Health Sciences and Primary Care, University Medical Centre, University of Utrecht, the Netherlands
| |
Collapse
|
8
|
von Mikecz A. Elegant Nematodes Improve Our Understanding of Human Neuronal Diseases, the Role of Pollutants and Strategies of Resilience. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2023; 57:16755-16763. [PMID: 37874738 PMCID: PMC10634345 DOI: 10.1021/acs.est.3c04580] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/14/2023] [Revised: 10/05/2023] [Accepted: 10/06/2023] [Indexed: 10/26/2023]
Abstract
The prevalence of neurodegenerative disorders such as Alzheimer's and Parkinson's disease are rising globally. The role of environmental pollution in neurodegeneration is largely unknown. Thus, this perspective advocates exposome research in C. elegans models of human diseases. The models express amyloid proteins such as Aβ, recapitulate the degeneration of specifically vulnerable neurons and allow for correlated neurobehavioral phenotyping throughout the entire life span of the nematode. Neurobehavioral traits like locomotion gaits, rigidity, or cognitive decline are quantifiable and carefully mimic key aspects of the human diseases. Underlying molecular pathways of neurodegeneration are elucidated in pollutant-exposed C. elegans Alzheimer's or Parkinson's models by transcriptomics (RNA-seq), mass spectrometry-based proteomics and omics addressing other biochemical traits. Validation of the identified disease pathways can be achieved by genome-wide association studies in matching human cohorts. A consistent One Health approach includes isolation of nematodes from contaminated sites and their comparative investigation by imaging, neurobehavioral profiling and single worm proteomics. C. elegans models of neurodegenerative diseases are likewise well-suited for high throughput methods that provide a promising strategy to identify resilience pathways of neurosafety and keep up with the number of pollutants, nonchemical exposome factors, and their interactions.
Collapse
Affiliation(s)
- Anna von Mikecz
- IUF − Leibniz Research Institute
of Environmental Medicine GmbH, Auf’m Hennekamp 50, 40225 Duesseldorf, Germany
| |
Collapse
|