1
|
Ma S, Yu K, Tang ML, Pan J, Härdle WK, Tian M. A Bayesian multistage spatio-temporally dependent model for spatial clustering and variable selection. Stat Med 2023; 42:4794-4823. [PMID: 37652405 DOI: 10.1002/sim.9889] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2021] [Revised: 06/30/2023] [Accepted: 08/13/2023] [Indexed: 09/02/2023]
Abstract
In spatio-temporal epidemiological analysis, it is of critical importance to identify the significant covariates and estimate the associated time-varying effects on the health outcome. Due to the heterogeneity of spatio-temporal data, the subsets of important covariates may vary across space and the temporal trends of covariate effects could be locally different. However, many spatial models neglected the potential local variation patterns, leading to inappropriate inference. Thus, this article proposes a flexible Bayesian hierarchical model to simultaneously identify spatial clusters of regression coefficients with common temporal trends, select significant covariates for each spatial group by introducing binary entry parameters and estimate spatio-temporally varying disease risks. A multistage strategy is employed to reduce the confounding bias caused by spatially structured random components. A simulation study demonstrates the outperformance of the proposed method, compared with several alternatives based on different assessment criteria. The methodology is motivated by two important case studies. The first concerns the low birth weight incidence data in 159 counties of Georgia, USA, for the years 2007 to 2018 and investigates the time-varying effects of potential contributing covariates in different cluster regions. The second concerns the circulatory disease risks across 323 local authorities in England over 10 years and explores the underlying spatial clusters and associated important risk factors.
Collapse
Affiliation(s)
- Shaopei Ma
- School of Statistics, University of International Business and Economics, Beijing, China
| | - Keming Yu
- Mathematical Sciences, Brunel University, Uxbridge, London, UK
| | - Man-Lai Tang
- Mathematical Sciences, Brunel University, Uxbridge, London, UK
| | - Jianxin Pan
- Research Center for Mathematics, Beijing Normal University, Zhuhai, China
- Guangdong Provincial Key Laboratory of Interdisciplinary Research and Application for Data Science, BNU-HKBU United International College, Zhuhai, China
| | - Wolfgang Karl Härdle
- School of Business and Economics, Humboldt-Universität zu Berlin, Berlin, Germany
| | - Maozai Tian
- Center for Applied Statistics, School of Statistics, Renmin University of China, Beijing, China
| |
Collapse
|
2
|
Huang G, Brown P, Shin HH. Multi-pollutant case-crossover models of all-cause and cause-specific mortality and hospital admissions by age group in 47 Canadian cities. ENVIRONMENTAL RESEARCH 2023; 225:115598. [PMID: 36868451 DOI: 10.1016/j.envres.2023.115598] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/20/2022] [Revised: 02/24/2023] [Accepted: 02/28/2023] [Indexed: 06/18/2023]
Abstract
Most of the existing epidemiological studies have investigated adverse health effects of multiple air pollutants for a limited number of cities, thus the evidence of the health impacts is limited and it is challenging to compare these results because of different modeling approaches and potential publication bias. In this paper, we expand the number of Canadian cities, with the use of the most recent available health data. A multi-pollutant model in a case-crossover design is used to investigate the short-term impacts of air pollution on various health outcomes in 47 Canadian main cities, comparing three age groups (all-age, senior (age 66+), non-senior). The main findings are that a 14 ppb increase of O3 was associated with a 0.17%-2.78% (0.62%-1.46%) increase in the odds of all-age respiratory mortality (hospitalization). A 12.8 ppb increase of NO2 was associated with a 0.57%-1.47% (0.68%-1.86%) increase in the odds of all-age (non-senior) respiratory hospitalization. A 7.6 μgm-3 increase of PM2.5 was associated with a 0.19%-0.69% (0.33%-1.1%) increase in the odds of all-age (non-senior) respiratory hospitalization.
Collapse
Affiliation(s)
- Guowen Huang
- Department of Statistical Sciences, University of Toronto, Toronto, ON, Canada; Centre for Global Health Research, St Michael's Hospital, Toronto, ON, Canada
| | - Patrick Brown
- Department of Statistical Sciences, University of Toronto, Toronto, ON, Canada; Centre for Global Health Research, St Michael's Hospital, Toronto, ON, Canada
| | - Hwashin Hyun Shin
- Environmental Health Science and Research Bureau, Health Canada, Ottawa, ON, Canada; Department of Mathematics and Statistics, Queen's University, Kingston, ON, Canada.
| |
Collapse
|
3
|
Jeong J, Kim M, Choi J. Investigating the spatio-temporal variation of hepatitis A in Korea using a Bayesian model. Front Public Health 2023; 10:1085077. [PMID: 36743156 PMCID: PMC9895396 DOI: 10.3389/fpubh.2022.1085077] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2022] [Accepted: 12/06/2022] [Indexed: 01/22/2023] Open
Abstract
Hepatitis A is a water-borne infectious disease that frequently occurs in unsanitary environments. However, paradoxically, those who have spent their infancy in a sanitary environment are more susceptible to hepatitis A because they do not have the opportunity to acquire natural immunity. In Korea, hepatitis A is prevalent because of the distribution of uncooked seafood, especially during hot and humid summers. In general, the transmission of hepatitis A is known to be dynamically affected by socioeconomic, environmental, and weather-related factors and is heterogeneous in time and space. In this study, we aimed to investigate the spatio-temporal variation of hepatitis A and the effects of socioeconomic and weather-related factors in Korea using a flexible spatio-temporal model. We propose a Bayesian Poisson regression model coupled with spatio-temporal variability to estimate the effects of risk factors. We used weekly hepatitis A incidence data across 250 districts in Korea from 2016 to 2019. We found spatial and temporal autocorrelations of hepatitis A indicating that the spatial distribution of hepatitis A varied dynamically over time. From the estimation results, we noticed that the districts with large proportions of males and foreigners correspond to higher incidences. The average temperature was positively correlated with the incidence, which is in agreement with other studies showing that the incidences in Korea are noticeable in spring and summer due to the increased outdoor activity and intake of stale seafood. To the best of our knowledge, this study is the first to suggest a spatio-temporal model for hepatitis A across the entirety of Korean. The proposed model could be useful for predicting, preventing, and controlling the spread of hepatitis A.
Collapse
Affiliation(s)
- Jaehong Jeong
- Department of Mathematics, Hanyang University, Seoul, Republic of Korea,Research Institute for Natural Sciences, Hanyang University, Seoul, Republic of Korea
| | - Mijeong Kim
- Department of Statistics, Ewha Womans University, Seoul, Republic of Korea,*Correspondence: Mijeong Kim ✉
| | - Jungsoon Choi
- Department of Mathematics, Hanyang University, Seoul, Republic of Korea,Research Institute for Natural Sciences, Hanyang University, Seoul, Republic of Korea,Jungsoon Choi ✉
| |
Collapse
|
4
|
Gamerman D, Ippoliti L, Valentini P. A dynamic structural equation approach to estimate the short‐term effects of air pollution on human health. J R Stat Soc Ser C Appl Stat 2022. [DOI: 10.1111/rssc.12554] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Dani Gamerman
- Departamento de Métodos EstatísticosUniversidade Federal do Rio de Janeiro Rio de JaneiroBrazil
| | - Luigi Ippoliti
- Department of EconomicsUniversity G. d'Annunzio, Chieti‐Pescara PescaraItaly
| | - Pasquale Valentini
- Department of EconomicsUniversity G. d'Annunzio, Chieti‐Pescara PescaraItaly
| |
Collapse
|
5
|
Jaya IGNM, Folmer H. Spatiotemporal high-resolution prediction and mapping: methodology and application to dengue disease. JOURNAL OF GEOGRAPHICAL SYSTEMS 2022; 24:527-581. [PMID: 35221792 PMCID: PMC8857957 DOI: 10.1007/s10109-021-00368-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/12/2021] [Accepted: 10/08/2021] [Indexed: 05/16/2023]
Abstract
Dengue disease has become a major public health problem. Accurate and precise identification, prediction and mapping of high-risk areas are crucial elements of an effective and efficient early warning system in countering the spread of dengue disease. In this paper, we present the fusion area-cell spatiotemporal generalized geoadditive-Gaussian Markov random field (FGG-GMRF) framework for joint estimation of an area-cell model, involving temporally varying coefficients, spatially and temporally structured and unstructured random effects, and spatiotemporal interaction of the random effects. The spatiotemporal Gaussian field is applied to determine the unobserved relative risk at cell level. It is transformed to a Gaussian Markov random field using the finite element method and the linear stochastic partial differential equation approach to solve the "big n" problem. Sub-area relative risk estimates are obtained as block averages of the cell outcomes within each sub-area boundary. The FGG-GMRF model is estimated by applying Bayesian Integrated Nested Laplace Approximation. In the application to Bandung city, Indonesia, we combine low-resolution area level (district) spatiotemporal data on population at risk and incidence and high-resolution cell level data on weather variables to obtain predictions of relative risk at subdistrict level. The predicted dengue relative risk at subdistrict level suggests significant fine-scale heterogeneities which are not apparent when examining the area level. The relative risk varies considerably across subdistricts and time, with the latter showing an increase in the period January-July and a decrease in the period August-December. Supplementary Information The online version contains supplementary material available at 10.1007/s10109-021-00368-0.
Collapse
Affiliation(s)
- I. Gede Nyoman Mindra Jaya
- Faculty of Spatial Sciences, University of Groningen, Groningen, The Netherlands
- Statistics Department, Padjadjaran University, Bandung, Indonesia
| | - Henk Folmer
- Faculty of Spatial Sciences, University of Groningen, Groningen, The Netherlands
- Statistics Department, Padjadjaran University, Bandung, Indonesia
| |
Collapse
|
6
|
Baer DR, Lawson AB, Joseph JE. Joint space-time Bayesian disease mapping via quantification of disease risk association. Stat Methods Med Res 2021; 30:35-61. [PMID: 33595403 DOI: 10.1177/0962280220938975] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
Alzheimer's disease is an increasingly prevalent neurological disorder with no effective therapies. Thus, there is a need to characterize the progression of Alzheimer's disease risk in order to preclude its inception in patients. Characterizing Alzheimer's disease risk can be accomplished at the population-level by the space-time modeling of Alzheimer's disease incidence data. In this paper, we develop flexible Bayesian hierarchical models which can borrow risk information from conditions antecedent to Alzheimer's disease, such as mild cognitive impairment, in an effort to better characterize Alzheimer's disease risk over space and time. From an application of these models to real-world Alzheimer's disease and mild cognitive impairment spatiotemporal incidence data, we found that our novel models provided improved model goodness of fit, and via a simulation study, we demonstrated the importance of diagnosing the label-switching problem for our models as well as the importance of model specification in order to best capture the contribution of time in modeling Alzheimer's disease risk.
Collapse
Affiliation(s)
- Daniel R Baer
- Department of Public Health Sciences, Medical University of South Carolina, Charleston, SC, USA
| | - Andrew B Lawson
- Department of Public Health Sciences, Medical University of South Carolina, Charleston, SC, USA
| | - Jane E Joseph
- Department of Neuroscience, Medical University of South Carolina, Charleston, SC, USA
| |
Collapse
|
7
|
Larsen A, Kolpacoff V, McCormack K, Seewaldt V, Hyslop T. Using Latent Class Modeling to Jointly Characterize Economic Stress and Multipollutant Exposure. Cancer Epidemiol Biomarkers Prev 2020; 29:1940-1948. [PMID: 32856601 DOI: 10.1158/1055-9965.epi-19-1365] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2019] [Revised: 06/10/2020] [Accepted: 08/13/2020] [Indexed: 11/16/2022] Open
Abstract
BACKGROUND Work is needed to better understand how joint exposure to environmental and economic factors influence cancer. We hypothesize that environmental exposures vary with socioeconomic status (SES) and urban/rural locations, and areas with minority populations coincide with high economic disadvantage and pollution. METHODS To model joint exposure to pollution and SES, we develop a latent class mixture model (LCMM) with three latent variables (SES Advantage, SES Disadvantage, and Air Pollution) and compare the LCMM fit with K-means clustering. We ran an ANOVA to test for high exposure levels in non-Hispanic black populations. The analysis is at the census tract level for the state of North Carolina. RESULTS The LCMM was a better and more nuanced fit to the data than K-means clustering. Our LCMM had two sublevels (low, high) within each latent class. The worst levels of exposure (high SES disadvantage, low SES advantage, high pollution) are found in 22% of census tracts, while the best levels (low SES disadvantage, high SES advantage, low pollution) are found in 5.7%. Overall, 34.1% of the census tracts exhibit high disadvantage, 66.3% have low advantage, and 59.2% have high mixtures of toxic pollutants. Areas with higher SES disadvantage had significantly higher non-Hispanic black population density (NHBPD; P < 0.001), and NHBPD was higher in areas with higher pollution (P < 0.001). CONCLUSIONS Joint exposure to air toxins and SES varies with rural/urban location and coincides with minority populations. IMPACT Our model can be extended to provide a holistic modeling framework for estimating disparities in cancer survival.See all articles in this CEBP Focus section, "Environmental Carcinogenesis: Pathways to Prevention."
Collapse
Affiliation(s)
- Alexandra Larsen
- Department of Biostatistics and Bioinformatics, Duke University, Durham, North Carolina
| | - Viktoria Kolpacoff
- Department of Biostatistics and Bioinformatics, Duke University, Durham, North Carolina
| | - Kara McCormack
- Department of Biostatistics and Bioinformatics, Duke University, Durham, North Carolina
| | | | - Terry Hyslop
- Department of Biostatistics and Bioinformatics, Duke University, Durham, North Carolina. .,Duke Cancer Institute, Durham, North Carolina
| |
Collapse
|
8
|
Kanankege KST, Alvarez J, Zhang L, Perez AM. An Introductory Framework for Choosing Spatiotemporal Analytical Tools in Population-Level Eco-Epidemiological Research. Front Vet Sci 2020; 7:339. [PMID: 32733923 PMCID: PMC7358365 DOI: 10.3389/fvets.2020.00339] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2020] [Accepted: 05/15/2020] [Indexed: 12/04/2022] Open
Abstract
Spatiotemporal visualization and analytical tools (SATs) are increasingly being applied to risk-based surveillance/monitoring of adverse health events affecting humans, animals, and ecosystems. Different disciplines use diverse SATs to address similar research questions. The juxtaposition of these diverse techniques provides a list of options for researchers who are new to population-level spatial eco-epidemiology. Here, we are conducting a narrative review to provide an overview of the multiple available SATs, and introducing a framework for choosing among them when addressing common research questions across disciplines. The framework is comprised of three stages: (a) pre-hypothesis testing stage, in which hypotheses regarding the spatial dependence of events are generated; (b) primary hypothesis testing stage, in which the existence of spatial dependence and patterns are tested; and (c) secondary-hypothesis testing and spatial modeling stage, in which predictions and inferences were made based on the identified spatial dependences and associated covariates. In this step-wise process, six key research questions are formulated, and the answers to those questions should lead researchers to select one or more methods from four broad categories of SATs: (T1) visualization and descriptive analysis; (T2) spatial/spatiotemporal dependence and pattern recognition; (T3) spatial smoothing and interpolation; and (T4) geographic correlation studies (i.e., spatial modeling and regression). The SATs described here include both those used for decades and also other relatively new tools. Through this framework review, we intend to facilitate the choice among available SATs and promote their interdisciplinary use to support improving human, animal, and ecosystem health.
Collapse
Affiliation(s)
- Kaushi S. T. Kanankege
- Department of Veterinary Population Medicine, College of Veterinary Medicine, University of Minnesota, Saint Paul, MN, United States
| | - Julio Alvarez
- Departamento de Sanidad Animal, Centro de Vigilancia Sanitaria Veterinaria (VISAVET), Facultad de Veterinaria, Universidad Complutense, Madrid, Spain
| | - Lin Zhang
- Division of Biostatistics, School of Public Health, University of Minnesota, Minneapolis, MN, United States
| | - Andres M. Perez
- Department of Veterinary Population Medicine, College of Veterinary Medicine, University of Minnesota, Saint Paul, MN, United States
| |
Collapse
|
9
|
Napier G, Lee D, Robertson C, Lawson A. A Bayesian space-time model for clustering areal units based on their disease trends. Biostatistics 2020; 20:681-697. [PMID: 29917057 PMCID: PMC6797054 DOI: 10.1093/biostatistics/kxy024] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2017] [Revised: 04/13/2018] [Accepted: 04/29/2018] [Indexed: 11/27/2022] Open
Abstract
Population-level disease risk across a set of non-overlapping areal units varies in space and time, and a large research literature has developed methodology for identifying clusters of areal units exhibiting elevated risks. However, almost no research has extended the clustering paradigm to identify groups of areal units exhibiting similar temporal disease trends. We present a novel Bayesian hierarchical mixture model for achieving this goal, with inference based on a Metropolis-coupled Markov chain Monte Carlo ((MC)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{upgreek}
\usepackage{mathrsfs}
\setlength{\oddsidemargin}{-69pt}
\begin{document}
}{}$^3$\end{document}) algorithm. The effectiveness of the (MC)\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{upgreek}
\usepackage{mathrsfs}
\setlength{\oddsidemargin}{-69pt}
\begin{document}
}{}$^3$\end{document} algorithm compared to a standard Markov chain Monte Carlo implementation is demonstrated in a simulation study, and the methodology is motivated by two important case studies in the United Kingdom. The first concerns the impact on measles susceptibility of the discredited paper linking the measles, mumps, and rubella vaccination to an increased risk of Autism and investigates whether all areas in the Scotland were equally affected. The second concerns respiratory hospitalizations and investigates over a 10 year period which parts of Glasgow have shown increased, decreased, and no change in risk.
Collapse
Affiliation(s)
- Gary Napier
- School of Mathematics and Statistics, University of Glasgow, University Place, Glasgow, UK
| | - Duncan Lee
- School of Mathematics and Statistics, University of Glasgow, University Place, Glasgow, UK
| | - Chris Robertson
- Department of Mathematics and Statistics, University of Strathclyde, 26 Richmond Street, Glasgow, UK
| | - Andrew Lawson
- Department of Public Health Sciences, Medical University of South Carolina, South Carolina, USA
| |
Collapse
|
10
|
Kang D, Jang Y, Choi H, Hwang SS, Koo Y, Choi J. Space-Time Relationship between Short-Term Exposure to Fine and Coarse Particles and Mortality in a Nationwide Analysis of Korea: A Bayesian Hierarchical Spatio-Temporal Model. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2019; 16:ijerph16122111. [PMID: 31207896 PMCID: PMC6617003 DOI: 10.3390/ijerph16122111] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/12/2019] [Revised: 06/12/2019] [Accepted: 06/13/2019] [Indexed: 11/30/2022]
Abstract
Previous studies have shown an association between mortality and ambient air pollution in South Korea. However, these studies may have been subject to bias, as they lacked adjustment for spatio-temporal structures. This paper addresses this research gap by examining the association between air pollution and cause-specific mortality in South Korea between 2012 and 2015 using a two-stage Bayesian spatio-temporal model. We used 2012–2014 mortality and air pollution data for parameter estimation (i.e., model fitting) and 2015 data for model validation. Our results suggest that the relative risks of total, cardiovascular, and respiratory mortality were 1.028, 1.047, and 1.045, respectively, with every 10-µg/m3 increase in monthly PM2.5 (fine particulate matter) exposure. These findings warrant protection of populations who experience elevated ambient air pollution exposure to mitigate mortality burden in South Korea.
Collapse
Affiliation(s)
- Dayun Kang
- Department of Applied Statistics, Hanyang University, Seoul 04763, Korea.
| | - Yujin Jang
- Department of Applied Statistics, Hanyang University, Seoul 04763, Korea.
| | - Hyunho Choi
- Department of Applied Statistics, Hanyang University, Seoul 04763, Korea.
| | - Seung-Sik Hwang
- Department of Public Health Sciences, Graduate School of Public Health, Seoul National University, Seoul 08826, Korea.
| | - Younseo Koo
- Department of Environmental Engineering, Anyang University, Anyang 14028, Korea.
| | - Jungsoon Choi
- Department of Mathematics, Hanyang University, Seoul 04763, Korea.
- Research Institute for Natural Sciences, Hanyang University, Seoul 04763, Korea.
| |
Collapse
|
11
|
Baer DR, Lawson AB. Evaluation of Bayesian multiple stage estimation under spatial CAR model variants. J STAT COMPUT SIM 2018. [DOI: 10.1080/00949655.2018.1536755] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
Affiliation(s)
- Daniel R. Baer
- Department of Public Health Sciences, Medical University of South Carolina, Charleston, SC, USA
| | - Andrew B. Lawson
- Department of Public Health Sciences, Medical University of South Carolina, Charleston, SC, USA
| |
Collapse
|
12
|
Lawson AB. Bayesian latent modeling of spatio‐temporal variation in small‐area health data. ACTA ACUST UNITED AC 2018. [DOI: 10.1002/wics.1441] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Affiliation(s)
- Andrew B. Lawson
- Department of Public Health Sciences, Division of Biostatistics and Bioinformatics Medical University of South Carolina Charleston South Carolina
| |
Collapse
|
13
|
Choi J, Lawson AB. A Bayesian two-stage spatially dependent variable selection model for space-time health data. Stat Methods Med Res 2018; 28:2570-2582. [PMID: 29635974 DOI: 10.1177/0962280218767980] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
In space-time epidemiological modeling, most studies have considered the overall variations in relative risk to better estimate the effects of risk factors on health outcomes. However, the associations between risk factors and health outcomes may vary across space and time. Especially, the temporal patterns of the covariate effects may depend on space. Thus, we propose a Bayesian two-stage spatially dependent variable selection approach for space-time health data to determine the spatially varying subsets of regression coefficients with common temporal dependence. The two-stage structure allows reduction of the spatial confounding bias in the estimates of the regression coefficients. A simulation study is conducted to examine the performance of the proposed two-stage model. We apply the proposed model to the number of inpatients with lung cancer in 159 counties of Georgia, USA.
Collapse
Affiliation(s)
- Jungsoon Choi
- 1 Department of Mathematics, College of Natural Sciences, Hanyang University, Seoul, South Korea.,2 Research Institute for Natural Sciences, Hanyang University, Seoul, South Korea
| | - Andrew B Lawson
- 3 Division of Biostatistics and Bioinformatics, Department of Public Health Sciences, Medical University of South Carolina, Charleston, SC, USA
| |
Collapse
|
14
|
Lawson AB, Carroll R, Faes C, Kirby RS, Aregay M, Watjou K. Spatiotemporal multivariate mixture models for Bayesian model selection in disease mapping. ENVIRONMETRICS 2017; 28:e2465. [PMID: 29230091 PMCID: PMC5722237 DOI: 10.1002/env.2465] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
It is often the case that researchers wish to simultaneously explore the behavior of and estimate overall risk for multiple, related diseases with varying rarity while accounting for potential spatial and/or temporal correlation. In this paper, we propose a flexible class of multivariate spatio-temporal mixture models to fill this role. Further, these models offer flexibility with the potential for model selection as well as the ability to accommodate lifestyle, socio-economic, and physical environmental variables with spatial, temporal, or both structures. Here, we explore the capability of this approach via a large scale simulation study and examine a motivating data example involving three cancers in South Carolina. The results which are focused on four model variants suggest that all models possess the ability to recover simulation ground truth and display improved model fit over two baseline Knorr-Held spatio-temporal interaction model variants in a real data application.
Collapse
Affiliation(s)
- AB Lawson
- Department of Public Health Sciences, Medical University of South Carolina
| | - R Carroll
- Department of Public Health Sciences, Medical University of South Carolina
| | - C Faes
- Interuniversity Institute for Statistics and Statistical Bioinformatics, Hasselt University
| | - RS Kirby
- Department of Community and Family Health, University of South Florida
| | - M Aregay
- Department of Public Health Sciences, Medical University of South Carolina
| | - K Watjou
- Interuniversity Institute for Statistics and Statistical Bioinformatics, Hasselt University
| |
Collapse
|
15
|
Pannullo F, Lee D, Neal L, Dalvi M, Agnew P, O’Connor FM, Mukhopadhyay S, Sahu S, Sarran C. Quantifying the impact of current and future concentrations of air pollutants on respiratory disease risk in England. Environ Health 2017; 16:29. [PMID: 28347336 PMCID: PMC5368918 DOI: 10.1186/s12940-017-0237-1] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2016] [Accepted: 03/20/2017] [Indexed: 05/21/2023]
Abstract
BACKGROUND Estimating the long-term health impact of air pollution in a spatio-temporal ecological study requires representative concentrations of air pollutants to be constructed for each geographical unit and time period. Averaging concentrations in space and time is commonly carried out, but little is known about how robust the estimated health effects are to different aggregation functions. A second under researched question is what impact air pollution is likely to have in the future. METHODS We conducted a study for England between 2007 and 2011, investigating the relationship between respiratory hospital admissions and different pollutants: nitrogen dioxide (NO2); ozone (O3); particulate matter, the latter including particles with an aerodynamic diameter less than 2.5 micrometers (PM2.5), and less than 10 micrometers (PM10); and sulphur dioxide (SO2). Bayesian Poisson regression models accounting for localised spatio-temporal autocorrelation were used to estimate the relative risks (RRs) of pollution on disease risk, and for each pollutant four representative concentrations were constructed using combinations of spatial and temporal averages and maximums. The estimated RRs were then used to make projections of the numbers of likely respiratory hospital admissions in the 2050s attributable to air pollution, based on emission projections from a number of Representative Concentration Pathways (RCP). RESULTS NO2 exhibited the largest association with respiratory hospital admissions out of the pollutants considered, with estimated increased risks of between 0.9 and 1.6% for a one standard deviation increase in concentrations. In the future the projected numbers of respiratory hospital admissions attributable to NO2 in the 2050s are lower than present day rates under 3 Representative Concentration Pathways (RCPs): 2.6, 6.0, and 8.5, which is due to projected reductions in future NO2 emissions and concentrations. CONCLUSIONS NO2 concentrations exhibit consistent substantial present-day health effects regardless of how a representative concentration is constructed in space and time. Thus as concentrations are predicted to remain above limits set by European Union Legislation until the 2030s in parts of urban England, it will remain a substantial health risk for some time.
Collapse
Affiliation(s)
- Francesca Pannullo
- School of Mathematics and Statistics, University of Glasgow, Glasgow, G12 8QW UK
| | - Duncan Lee
- School of Mathematics and Statistics, University of Glasgow, Glasgow, G12 8QW UK
| | - Lucy Neal
- Met Office, FitzRoy Road, Exeter, EX1 3PB UK
| | - Mohit Dalvi
- Met Office Hadley Centre, FitzRoy Road, Exeter, EX1 3PB UK
| | - Paul Agnew
- Met Office, FitzRoy Road, Exeter, EX1 3PB UK
| | | | | | - Sujit Sahu
- Mathematical Sciences, University of Southampton, Highfield, Southampton, SO17 1BJ UK
| | | |
Collapse
|
16
|
Anderson C, Ryan LM. A Comparison of Spatio-Temporal Disease Mapping Approaches Including an Application to Ischaemic Heart Disease in New South Wales, Australia. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2017; 14:ijerph14020146. [PMID: 28165383 PMCID: PMC5334700 DOI: 10.3390/ijerph14020146] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/14/2016] [Revised: 01/19/2017] [Accepted: 01/20/2017] [Indexed: 11/16/2022]
Abstract
The field of spatio-temporal modelling has witnessed a recent surge as a result of developments in computational power and increased data collection. These developments allow analysts to model the evolution of health outcomes in both space and time simultaneously. This paper models the trends in ischaemic heart disease (IHD) in New South Wales, Australia over an eight-year period between 2006 and 2013. A number of spatio-temporal models are considered, and we propose a novel method for determining the goodness-of-fit for these models by outlining a spatio-temporal extension of the Moran's I statistic. We identify an overall decrease in the rates of IHD, but note that the extent of this health improvement varies across the state. In particular, we identified a number of remote areas in the north and west of the state where the risk stayed constant or even increased slightly.
Collapse
Affiliation(s)
- Craig Anderson
- School of Mathematical and Physical Sciences, University of Technology Sydney, 15 Broadway, Ultimo, NSW 2007, Australia.
- ARC Centre of Excellence for Mathematical and Statistical Frontiers (ACEMS), Parkville, VIC 3010, Australia.
| | - Louise M Ryan
- School of Mathematical and Physical Sciences, University of Technology Sydney, 15 Broadway, Ultimo, NSW 2007, Australia.
- ARC Centre of Excellence for Mathematical and Statistical Frontiers (ACEMS), Parkville, VIC 3010, Australia.
| |
Collapse
|
17
|
An analysis of temporal and spatial patterns in Italian hospitalization rates for multiple diagnosis. Spat Spatiotemporal Epidemiol 2016; 19:37-45. [PMID: 27839579 DOI: 10.1016/j.sste.2016.04.012] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/13/2015] [Revised: 03/03/2016] [Accepted: 04/27/2016] [Indexed: 11/20/2022]
Abstract
In this paper, the Italian hospitalization database provided by the Ministry of Health is analyzed in terms of the temporal and spatial patterns of the hospitalization rates. The database covers the period 2010-2012 and the rates are evaluated for 110 Italian provinces and 18 diagnosis groups as defined by the ICD-9 classification. The analysis is based on a novel model-based clustering approach which enables clustering of spatially registered time series with respect to latent temporal patterns. The clustering result is analyzed to study the spatial distribution of the latent temporal patterns and their trend in order to identify possible critical areas in terms of increasing rates. Additionally, emerging spatial patterns may help common causes driving the hospitalization rates to be identified.
Collapse
|
18
|
Lee D, Lawson A. Quantifying the Spatial Inequality and Temporal Trends in Maternal Smoking Rates in Glasgow. Ann Appl Stat 2016; 10:1427-1446. [PMID: 28580047 DOI: 10.1214/16-aoas941] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Abstract
Maternal smoking is well known to adversely affect birth outcomes, and there is considerable spatial variation in the rates of maternal smoking in the city of Glasgow, Scotland. This spatial variation is a partial driver of health inequalities between rich and poor communities, and it is of interest to determine the extent to which these inequalities have changed over time. Therefore in this paper we develop a Bayesian hierarchical model for estimating the spatio-temporal pattern in smoking incidence across Glasgow between 2000 and 2013, which can identify the changing geographical extent of clusters of areas exhibiting elevated maternal smoking incidences that partially drive health inequalities. Additionally, we provide freely available software via the R package CARBayesST to allow others to implement the model we have developed. The study period includes the introduction of a ban on smoking in public places in 2006, and the results show an average decline of around 11% in maternal smoking rates over the study period.
Collapse
Affiliation(s)
- Duncan Lee
- School of Mathematics and Statistics, University of Glasgow, Glasgow, UK, G12 8QQ
| | - Andrew Lawson
- Division of Biostatistics and Bioinformatics, Department of Public Health Sciences, Medical University of South Carolina, Charleston, SC, USA, 29401-8350
| |
Collapse
|
19
|
Choi J, Lawson AB. Bayesian spatially dependent variable selection for small area health modeling. Stat Methods Med Res 2016; 27:234-249. [DOI: 10.1177/0962280215627184] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Statistical methods for spatial health data to identify the significant covariates associated with the health outcomes are of critical importance. Most studies have developed variable selection approaches in which the covariates included appear within the spatial domain and their effects are fixed across space. However, the impact of covariates on health outcomes may change across space and ignoring this behavior in spatial epidemiology may cause the wrong interpretation of the relations. Thus, the development of a statistical framework for spatial variable selection is important to allow for the estimation of the space-varying patterns of covariate effects as well as the early detection of disease over space. In this paper, we develop flexible spatial variable selection approaches to find the spatially-varying subsets of covariates with significant effects. A Bayesian hierarchical latent model framework is applied to account for spatially-varying covariate effects. We present a simulation example to examine the performance of the proposed models with the competing models. We apply our models to a county-level low birth weight incidence dataset in Georgia.
Collapse
Affiliation(s)
- Jungsoon Choi
- Department of Mathematics, College of Natural Sciences, Hanyang University, Seoul, South Korea
- Research Institute for Natural Sciences, Hanyang University, Seoul, South Korea
| | - Andrew B Lawson
- Division of Biostatistics and Bioinformatics, Department of Public Health Sciences, Medical University of South Carolina, SC, USA
| |
Collapse
|
20
|
Rushworth A, Lee D, Sarran C. An adaptive spatiotemporal smoothing model for estimating trends and step changes in disease risk. J R Stat Soc Ser C Appl Stat 2016. [DOI: 10.1111/rssc.12155] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
21
|
How robust are the estimated effects of air pollution on health? Accounting for model uncertainty using Bayesian model averaging. Spat Spatiotemporal Epidemiol 2016; 18:53-62. [PMID: 27494960 PMCID: PMC4985538 DOI: 10.1016/j.sste.2016.04.001] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/10/2015] [Revised: 02/16/2016] [Accepted: 04/01/2016] [Indexed: 11/22/2022]
Abstract
We explored the sensitivity of the pollution-health effect to three factors. Estimation of NO2, choice of deprivation and choice of spatial autocorrelation model. Choice of these factors leads to a wide variation in pollution-health effects. BMA is utilised to estimate an overall effect while accounting for model uncertainty. Overall, a positive but borderline pollution-health effect was obtained.
The long-term impact of air pollution on human health can be estimated from small-area ecological studies in which the health outcome is regressed against air pollution concentrations and other covariates, such as socio-economic deprivation. Socio-economic deprivation is multi-factorial and difficult to measure, and includes aspects of income, education, and housing as well as others. However, these variables are potentially highly correlated, meaning one can either create an overall deprivation index, or use the individual characteristics, which can result in a variety of pollution-health effects. Other aspects of model choice may affect the pollution-health estimate, such as the estimation of pollution, and spatial autocorrelation model. Therefore, we propose a Bayesian model averaging approach to combine the results from multiple statistical models to produce a more robust representation of the overall pollution-health effect. We investigate the relationship between nitrogen dioxide concentrations and cardio-respiratory mortality in West Central Scotland between 2006 and 2012.
Collapse
|
22
|
Two-stage Bayesian model to evaluate the effect of air pollution on chronic respiratory diseases using drug prescriptions. Spat Spatiotemporal Epidemiol 2016; 18:1-12. [PMID: 27494955 DOI: 10.1016/j.sste.2016.03.001] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/04/2015] [Revised: 03/01/2016] [Accepted: 03/03/2016] [Indexed: 11/22/2022]
Abstract
Exposure to high levels of air pollutant concentration is known to be associated with respiratory problems which can translate into higher morbidity and mortality rates. The link between air pollution and population health has mainly been assessed considering air quality and hospitalisation or mortality data. However, this approach limits the analysis to individuals characterised by severe conditions. In this paper we evaluate the link between air pollution and respiratory diseases using general practice drug prescriptions for chronic respiratory diseases, which allow to draw conclusions based on the general population. We propose a two-stage statistical approach: in the first stage we specify a space-time model to estimate the monthly NO2 concentration integrating several data sources characterised by different spatio-temporal resolution; in the second stage we link the concentration to the β2-agonists prescribed monthly by general practices in England and we model the prescription rates through a small area approach.
Collapse
|
23
|
Lee D, Sarran C. Controlling for unmeasured confounding and spatial misalignment in long-term air pollution and health studies. ENVIRONMETRICS 2015; 26:477-487. [PMID: 27547047 PMCID: PMC4975605 DOI: 10.1002/env.2348] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/17/2014] [Revised: 03/25/2015] [Accepted: 06/04/2015] [Indexed: 05/22/2023]
Abstract
The health impact of long-term exposure to air pollution is now routinely estimated using spatial ecological studies, owing to the recent widespread availability of spatial referenced pollution and disease data. However, this areal unit study design presents a number of statistical challenges, which if ignored have the potential to bias the estimated pollution-health relationship. One such challenge is how to control for the spatial autocorrelation present in the data after accounting for the known covariates, which is caused by unmeasured confounding. A second challenge is how to adjust the functional form of the model to account for the spatial misalignment between the pollution and disease data, which causes within-area variation in the pollution data. These challenges have largely been ignored in existing long-term spatial air pollution and health studies, so here we propose a novel Bayesian hierarchical model that addresses both challenges and provide software to allow others to apply our model to their own data. The effectiveness of the proposed model is compared by simulation against a number of state-of-the-art alternatives proposed in the literature and is then used to estimate the impact of nitrogen dioxide and particulate matter concentrations on respiratory hospital admissions in a new epidemiological study in England in 2010 at the local authority level. © 2015 The Authors. Environmetrics published by John Wiley & Sons Ltd.
Collapse
Affiliation(s)
- Duncan Lee
- School of Mathematics and StatisticsUniversity of GlasgowGlasgowU.K.
| | | |
Collapse
|
24
|
Huang G, Lee D, Scott M. An integrated Bayesian model for estimating the long-term health effects of air pollution by fusing modelled and measured pollution data: A case study of nitrogen dioxide concentrations in Scotland. Spat Spatiotemporal Epidemiol 2015; 14-15:63-74. [DOI: 10.1016/j.sste.2015.09.002] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/10/2015] [Revised: 07/11/2015] [Accepted: 09/23/2015] [Indexed: 11/24/2022]
|
25
|
Rushworth A, Lee D, Mitchell R. A spatio-temporal model for estimating the long-term effects of air pollution on respiratory hospital admissions in Greater London. Spat Spatiotemporal Epidemiol 2014; 10:29-38. [PMID: 25113589 DOI: 10.1016/j.sste.2014.05.001] [Citation(s) in RCA: 61] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/23/2013] [Revised: 02/05/2014] [Accepted: 05/06/2014] [Indexed: 11/30/2022]
Abstract
It has long been known that air pollution is harmful to human health, as many epidemiological studies have been conducted into its effects. Collectively, these studies have investigated both the acute and chronic effects of pollution, with the latter typically based on individual level cohort designs that can be expensive to implement. As a result of the increasing availability of small-area statistics, ecological spatio-temporal study designs are also being used, with which a key statistical problem is allowing for residual spatio-temporal autocorrelation that remains after the covariate effects have been removed. We present a new model for estimating the effects of air pollution on human health, which allows for residual spatio-temporal autocorrelation, and a study into the long-term effects of air pollution on human health in Greater London, England. The individual and joint effects of different pollutants are explored, via the use of single pollutant models and multiple pollutant indices.
Collapse
Affiliation(s)
- Alastair Rushworth
- School of Mathematics and Statistics, University Gardens, University of Glasgow, Glasgow G12 8QW, UK.
| | - Duncan Lee
- School of Mathematics and Statistics, University Gardens, University of Glasgow, Glasgow G12 8QW, UK
| | | |
Collapse
|
26
|
Lee D, Mitchell R. Controlling for localised spatio-temporal autocorrelation in long-term air pollution and health studies. Stat Methods Med Res 2014; 23:488-506. [PMID: 24648100 PMCID: PMC4272194 DOI: 10.1177/0962280214527384] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Estimating the long-term health impact of air pollution using an ecological spatio-temporal study design is a challenging task, due to the presence of residual spatio-temporal autocorrelation in the health counts after adjusting for the covariate effects. This autocorrelation is commonly modelled by a set of random effects represented by a Gaussian Markov random field (GMRF) prior distribution, as part of a hierarchical Bayesian model. However, GMRF models typically assume the random effects are globally smooth in space and time, and thus are likely to be collinear to any spatially and temporally smooth covariates such as air pollution. Such collinearity leads to poor estimation performance of the estimated fixed effects, and motivated by this epidemiological problem, this paper proposes new GMRF methodology to allow for localised spatio-temporal smoothing. This means random effects that are either geographically or temporally adjacent are allowed to be autocorrelated or conditionally independent, which allows more flexible autocorrelation structures to be represented. This increased flexibility results in improved fixed effects estimation compared with global smoothing models, which is evidenced by our simulation study. The methodology is then applied to the motivating study investigating the long-term effects of air pollution on respiratory ill health in Greater Glasgow, Scotland between 2007 and 2011.
Collapse
Affiliation(s)
- Duncan Lee
- School of Mathematics and Statistics, University of Glasgow, Glasgow, UK
| | - Richard Mitchell
- Institute for Health and Wellbeing, University of Glasgow, Glasgow, UK
| |
Collapse
|