Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Girguis MS, Li L, Lurmann F, Wu J, Urman R, Rappaport E, Breton C, Gilliland F, Stram D, Habre R. Exposure measurement error in air pollution studies: A framework for assessing shared, multiplicative measurement error in ensemble learning estimates of nitrogen oxides. Environ Int 2019;125:97-106. [PMID: 30711654 PMCID: PMC6499078 DOI: 10.1016/j.envint.2018.12.025] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/07/2018] [Revised: 12/10/2018] [Accepted: 12/12/2018] [Indexed: 05/22/2023]

For:	Girguis MS, Li L, Lurmann F, Wu J, Urman R, Rappaport E, Breton C, Gilliland F, Stram D, Habre R. Exposure measurement error in air pollution studies: A framework for assessing shared, multiplicative measurement error in ensemble learning estimates of nitrogen oxides. Environ Int 2019;125:97-106. [PMID: 30711654 PMCID: PMC6499078 DOI: 10.1016/j.envint.2018.12.025] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/07/2018] [Revised: 12/10/2018] [Accepted: 12/12/2018] [Indexed: 05/22/2023]

Number

Cited by Other Article(s)

VoPham T, White AJ, Jones RR. Geospatial Science for the Environmental Epidemiology of Cancer in the Exposome Era. Cancer Epidemiol Biomarkers Prev 2024;33:451-460. [PMID: 38566558 PMCID: PMC10996842 DOI: 10.1158/1055-9965.epi-23-1237] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2023] [Revised: 12/11/2023] [Accepted: 01/29/2024] [Indexed: 04/04/2024] Open

Wei Y, Qiu X, Yazdi MD, Shtein A, Shi L, Yang J, Peralta AA, Coull BA, Schwartz JD. The Impact of Exposure Measurement Error on the Estimated Concentration-Response Relationship between Long-Term Exposure to PM2.5 and Mortality. ENVIRONMENTAL HEALTH PERSPECTIVES 2022;130:77006. [PMID: 35904519 PMCID: PMC9337229 DOI: 10.1289/ehp10389] [Citation(s) in RCA: 29] [Impact Index Per Article: 14.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/06/2023]

Abstract

BACKGROUND

Exposure measurement error is a central concern in air pollution epidemiology. Given that studies have been using ambient air pollution predictions as proxy exposure measures, the potential impact of exposure error on health effect estimates needs to be comprehensively assessed.

OBJECTIVES

We aimed to generate wide-ranging scenarios to assess direction and magnitude of bias caused by exposure errors under plausible concentration-response relationships between annual exposure to fine particulate matter [PM ≤2.5μm in aerodynamic diameter (PM2.5)] and all-cause mortality.

METHODS

In this simulation study, we use daily PM2.5 predictions at 1-km2 spatial resolution to estimate annual PM2.5 exposures and their uncertainties for ZIP Codes of residence across the contiguous United States between 2000 and 2016. We consider scenarios in which we vary the error type (classical or Berkson) and the true concentration-response relationship between PM2.5 exposure and mortality (linear, quadratic, or soft-threshold-i.e., a smooth approximation to the hard-threshold model). In each scenario, we generate numbers of deaths using error-free exposures and confounders of concurrent air pollutants and neighborhood-level covariates and perform epidemiological analyses using error-prone exposures under correct specification or misspecification of the concentration-response relationship between PM2.5 exposure and mortality, adjusting for the confounders.

RESULTS

We simulate 1,000 replicates of each of 162 scenarios investigated. In general, both classical and Berkson errors can bias the concentration-response curve toward the null. The biases remain small even when using three times the predicted uncertainty to generate errors and are relatively larger at higher exposure levels.

DISCUSSION

Our findings suggest that the causal determination for long-term PM2.5 exposure and mortality is unlikely to be undermined when using high-resolution ambient predictions given that the estimated effect is generally smaller than the truth. The small magnitude of bias suggests that epidemiological findings are relatively robust against the exposure error. In practice, the use of ambient predictions with a finer spatial resolution will result in smaller bias. https://doi.org/10.1289/EHP10389.

Collapse

Research on statistical characteristics modeling of matching probability and measurement error based on machine learning. INTERNATIONAL JOURNAL OF INFORMATION SYSTEMS IN THE SERVICE SECTOR 2022. [DOI: 10.4018/ijisss.290548] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Habre R, Girguis M, Urman R, Fruin S, Lurmann F, Shafer M, Gorski P, Franklin M, McConnell R, Avol E, Gilliland F. Contribution of tailpipe and non-tailpipe traffic sources to quasi-ultrafine, fine and coarse particulate matter in southern California. JOURNAL OF THE AIR & WASTE MANAGEMENT ASSOCIATION (1995) 2021;71:209-230. [PMID: 32990509 PMCID: PMC8112073 DOI: 10.1080/10962247.2020.1826366] [Citation(s) in RCA: 31] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/29/2020] [Revised: 08/21/2020] [Accepted: 09/09/2020] [Indexed: 05/19/2023]

Abstract

Exposure to traffic-related air pollution (TRAP) in the near-roadway environment is associated with multiple adverse health effects. To characterize the relative contribution of tailpipe and non-tailpipe TRAP sources to particulate matter (PM) in the quasi-ultrafine (PM_0.2), fine (PM_2.5) and coarse (PM_2.5-10) size fractions and identify their spatial determinants in southern California (CA). Month-long integrated PM_0.2, PM_2.5 and PM_2.5-10 samples (n = 461, 265 and 298, respectively) were collected across cool and warm seasons in 8 southern CA communities (2008-9). Concentrations of PM mass, elements, carbons and major ions were obtained. Enrichment ratios (ER) in PM_0.2 and PM₁₀ relative to PM_2.5 were calculated for each element. The Positive Matrix Factorization model was used to resolve and estimate the relative contribution of TRAP sources to PM in three size fractions. Generalized additive models (GAMs) with bivariate loess smooths were used to understand the geographic variation of TRAP sources and identify their spatial determinants. EC, OC, and B had the highest median ER in PM_0.2 relative to PM_2.5. Six, seven and five sources (with characteristic species) were resolved in PM_0.2, PM_2.5 and PM_2.5-10, respectively. Combined tailpipe and non-tailpipe traffic sources contributed 66%, 32% and 18% of PM_0.2, PM_2.5 and PM_2.5-10 mass, respectively. Tailpipe traffic emissions (EC, OC, B) were the largest contributor to PM_0.2 mass (58%). Distinct gasoline and diesel tailpipe traffic sources were resolved in PM_2.5. Others included fuel oil, biomass burning, secondary inorganic aerosol, sea salt, and crustal/soil. CALINE4 dispersion model nitrogen oxides, trucks and intersections were most correlated with TRAP sources. The influence of smaller roadways and intersections became more apparent once Long Beach was excluded. Non-tailpipe emissions constituted ~8%, 11% and 18% of PM_0.2, PM_2.5 and PM_2.5-10, respectively, with important exposure and health implications. Future efforts should consider non-linear relationships amongst predictors when modeling exposures. Implications: Vehicle emissions result in a complex mix of air pollutants with both tailpipe and non-tailpipe components. As mobile source regulations lead to decreased tailpipe emissions, the relative contribution of non-tailpipe traffic emissions to near-roadway exposures is increasing. This study documents the presence of non-tailpipe abrasive vehicular emissions (AVE) from brake and tire wear, catalyst degradation and resuspended road dust in the quasi-ultrafine (PM_0.2), fine and coarse particulate matter size fractions, with contributions reaching up to 30% in PM_0.2 in some southern California communities. These findings have important exposure and policy implications given the high metal content of AVE and the efficiency of PM_0.2 at reaching the alveolar region of the lungs and other organ systems once inhaled. This work also highlights important considerations for building models that can accurately predict tailpipe and non-tailpipe exposures for population health studies.

Collapse

Li L, Girguis M, Lurmann F, Pavlovic N, McClure C, Franklin M, Wu J, Oman LD, Breton C, Gilliland F, Habre R. Ensemble-based deep learning for estimating PM_2.5 over California with multisource big data including wildfire smoke. ENVIRONMENT INTERNATIONAL 2020;145:106143. [PMID: 32980736 PMCID: PMC7643812 DOI: 10.1016/j.envint.2020.106143] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/07/2020] [Revised: 08/14/2020] [Accepted: 09/13/2020] [Indexed: 05/21/2023]

Abstract

INTRODUCTION

Estimating PM2.5 concentrations and their prediction uncertainties at a high spatiotemporal resolution is important for air pollution health effect studies. This is particularly challenging for California, which has high variability in natural (e.g, wildfires, dust) and anthropogenic emissions, meteorology, topography (e.g. desert surfaces, mountains, snow cover) and land use.

METHODS

Using ensemble-based deep learning with big data fused from multiple sources we developed a PM2.5 prediction model with uncertainty estimates at a high spatial (1 km × 1 km) and temporal (weekly) resolution for a 10-year time span (2008-2017). We leveraged autoencoder-based full residual deep networks to model complex nonlinear interrelationships among PM2.5 emission, transport and dispersion factors and other influential features. These included remote sensing data (MAIAC aerosol optical depth (AOD), normalized difference vegetation index, impervious surface), MERRA-2 GMI Replay Simulation (M2GMI) output, wildfire smoke plume dispersion, meteorology, land cover, traffic, elevation, and spatiotemporal trends (geo-coordinates, temporal basis functions, time index). As one of the primary predictors of interest with substantial missing data in California related to bright surfaces, cloud cover and other known interferences, missing MAIAC AOD observations were imputed and adjusted for relative humidity and vertical distribution. Wildfire smoke contribution to PM2.5 was also calculated through HYSPLIT dispersion modeling of smoke emissions derived from MODIS fire radiative power using the Fire Energetics and Emissions Research version 1.0 model.

RESULTS

Ensemble deep learning to predict PM2.5 achieved an overall mean training RMSE of 1.54 μg/m3 (R2: 0.94) and test RMSE of 2.29 μg/m3 (R2: 0.87). The top predictors included M2GMI carbon monoxide mixing ratio in the bottom layer, temporal basis functions, spatial location, air temperature, MAIAC AOD, and PM2.5 sea salt mass concentration. In an independent test using three long-term AQS sites and one short-term non-AQS site, our model achieved a high correlation (>0.8) and a low RMSE (<3 μg/m3). Statewide predictions indicated that our model can capture the spatial distribution and temporal peaks in wildfire-related PM2.5. The coefficient of variation indicated highest uncertainty over deciduous and mixed forests and open water land covers.

CONCLUSION

Our method can be generalized to other regions, including those having a mix of major urban areas, deserts, intensive smoke events, snow cover and complex terrains, where PM2.5 has previously been challenging to predict. Prediction uncertainty estimates can also inform further model development and measurement error evaluations in exposure and health studies.

Collapse

Girguis MS, Li L, Lurmann F, Wu J, Breton C, Gilliland F, Stram D, Habre R. Exposure Measurement Error in Air Pollution Studies: The Impact of Shared, Multiplicative Measurement Error on Epidemiological Health Risk Estimates. AIR QUALITY, ATMOSPHERE, & HEALTH 2020;13:631-643. [PMID: 32601528 PMCID: PMC7323995 DOI: 10.1007/s11869-020-00826-6] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/21/2019] [Accepted: 04/08/2020] [Indexed: 05/29/2023]

Abstract

Spatiotemporal air pollution models are increasingly being used to estimate health effects in epidemiological studies. Although such exposure prediction models typically result in improved spatial and temporal resolution of air pollution predictions, they remain subject to shared measurement error, a type of measurement error common in spatiotemporal exposure models which occurs when measurement error is not independent of exposures. A fundamental challenge of exposure measurement error in air pollution assessment is the strong correlation and sometimes identical (shared) error of exposure estimates across geographic space and time. When exposure estimates with shared measurement error are used to estimate health risk in epidemiological analyses, complex errors are potentially introduced, resulting in biased epidemiological conclusions. We demonstrate the influence of using a three-stage spatiotemporal exposure prediction model and introduce formal methods of shared, multiplicative measurement error (SMME) correction of epidemiological health risk estimates. Using our three-stage, ensemble learning based nitrogen oxides (NOx) exposure prediction model, we quantified SMME. We conducted an epidemiological analysis of wheeze risk in relation to NOx exposure among school-aged children. To demonstrate the incremental influence of exposure modeling stage, we iteratively estimated the health risk using assigned exposure predictions from each stage of the NOx model. We then determined the impact of SMME on the variance of the health risk estimates under various scenarios. Depending on the stage of the spatiotemporal exposure model used, we found that wheeze odds ratio ranged from 1.16 to 1.28 for an interquartile range increase in NOx. With each additional stage of exposure modeling, the health effect estimate moved further away from the null (OR=1). When corrected for observed SMME, the health effects confidence intervals slightly lengthened, but our epidemiological conclusions were not altered. When the variance estimate was corrected for the potential "worst case scenario" of SMME, the standard error further increased, having a meaningful influence on epidemiological conclusions. Our framework can be expanded and used to understand the implications of using exposure predictions subject to shared measurement error in future health investigations.

Collapse

A Robust Deep Learning Approach for Spatiotemporal Estimation of Satellite AOD and PM2.5. REMOTE SENSING 2020. [DOI: 10.3390/rs12020264] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]

Abstract Accurate estimation of fine particulate matter with diameter ≤2.5 μm (PM2.5) at a high spatiotemporal resolution is crucial for the evaluation of its health effects. Previous studies face multiple challenges including limited ground measurements and availability of spatiotemporal covariates. Although the multiangle implementation of atmospheric correction (MAIAC) retrieves satellite aerosol optical depth (AOD) at a high spatiotemporal resolution, massive non-random missingness considerably limits its application in PM2.5 estimation. Here, a deep learning approach, i.e., bootstrap aggregating (bagging) of autoencoder-based residual deep networks, was developed to make robust imputation of MAIAC AOD and further estimate PM2.5 at a high spatial (1 km) and temporal (daily) resolution. The base model consisted of autoencoder-based residual networks where residual connections were introduced to improve learning performance. Bagging of residual networks was used to generate ensemble predictions for better accuracy and uncertainty estimates. As a case study, the proposed approach was applied to impute daily satellite AOD and subsequently estimate daily PM2.5 in the Jing-Jin-Ji metropolitan region of China in 2015. The presented approach achieved competitive performance in AOD imputation (mean test R2: 0.96; mean test RMSE: 0.06) and PM2.5 estimation (test R2: 0.90; test RMSE: 22.3 μg/m3). In the additional independent tests using ground AERONET AOD and PM2.5 measurements at the monitoring station of the U.S. Embassy in Beijing, this approach achieved high R2 (0.82–0.97). Compared with the state-of-the-art machine learning method, XGBoost, the proposed approach generated more reasonable spatial variation for predicted PM2.5 surfaces. Publically available covariates used included meteorology, MERRA2 PBLH and AOD, coordinates, and elevation. Other covariates such as cloud fractions or land-use were not used due to unavailability. The results of validation and independent testing demonstrate the usefulness of the proposed approach in exposure assessment of PM2.5 using satellite AOD having massive missing values. Collapse

Developing an ANFIS-PSO Model to Predict Mercury Emissions in Combustion Flue Gases. MATHEMATICS 2019. [DOI: 10.3390/math7100965] [Citation(s) in RCA: 31] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]