1
|
Janssens A, Vaes B, Van Pottelbergh G, Libin PJK, Neyens T. Model-based disease mapping using primary care registry data. Spat Spatiotemporal Epidemiol 2024; 49:100654. [PMID: 38876557 DOI: 10.1016/j.sste.2024.100654] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/29/2023] [Revised: 03/19/2024] [Accepted: 04/26/2024] [Indexed: 06/16/2024]
Abstract
BACKGROUND Spatial modeling of disease risk using primary care registry data is promising for public health surveillance. However, it remains unclear to which extent challenges such as spatially disproportionate sampling and practice-specific reporting variation affect statistical inference. METHODS Using lower respiratory tract infection data from the INTEGO registry, modeled with a logistic model incorporating patient characteristics, a spatially structured random effect at municipality level, and an unstructured random effect at practice level, we conducted a case and simulation study to assess the impact of these challenges on spatial trend estimation. RESULTS Even with spatial imbalance and practice-specific reporting variation, the model performed well. Performance improved with increasing spatial sample balance and decreasing practice-specific variation. CONCLUSION Our findings indicate that, with correction for reporting efforts, primary care registries are valuable for spatial trend estimation. The diversity of patient locations within practice populations plays an important role.
Collapse
Affiliation(s)
- Arne Janssens
- Academic Centre of General Practice, Department of Public Health and Primary Care, Faculty of Medicine, KU Leuven, Kapucijnenvoer 35, B-3000 Leuven, Belgium.
| | - Bert Vaes
- Academic Centre of General Practice, Department of Public Health and Primary Care, Faculty of Medicine, KU Leuven, Kapucijnenvoer 35, B-3000 Leuven, Belgium.
| | - Gijs Van Pottelbergh
- Academic Centre of General Practice, Department of Public Health and Primary Care, Faculty of Medicine, KU Leuven, Kapucijnenvoer 35, B-3000 Leuven, Belgium.
| | - Pieter J K Libin
- I-BioStat, Data Science Institute, Hasselt University, Martelarenlaan 42, B-3500 Hasselt, Belgium; Artificial Intelligence Lab, Department of Computer Science, Vrije Universiteit Brussel, Brussels, Belgium; Department of Microbiology and Immunology, Rega Institute for Medical Research, Clinical and Epidemiological Virology, KU Leuven, Leuven, Belgium.
| | - Thomas Neyens
- I-BioStat, Data Science Institute, Hasselt University, Martelarenlaan 42, B-3500 Hasselt, Belgium; L-BioStat, Department of Public Health and Primary Care, Faculty of Medicine, KU Leuven, Kapucijnenvoer 35, B-3000 Leuven, Belgium.
| |
Collapse
|
2
|
Rotejanaprasert C, Chinpong K, Lawson AB, Chienwichai P, Maude RJ. Evaluation and comparison of spatial cluster detection methods for improved decision making of disease surveillance: a case study of national dengue surveillance in Thailand. BMC Med Res Methodol 2024; 24:14. [PMID: 38243198 PMCID: PMC10797994 DOI: 10.1186/s12874-023-02135-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2023] [Accepted: 12/21/2023] [Indexed: 01/21/2024] Open
Abstract
BACKGROUND Dengue is a mosquito-borne disease that causes over 300 million infections worldwide each year with no specific treatment available. Effective surveillance systems are needed for outbreak detection and resource allocation. Spatial cluster detection methods are commonly used, but no general guidance exists on the most appropriate method for dengue surveillance. Therefore, a comprehensive study is needed to assess different methods and provide guidance for dengue surveillance programs. METHODS To evaluate the effectiveness of different cluster detection methods for dengue surveillance, we selected and assessed commonly used methods: Getis Ord [Formula: see text], Local Moran, SaTScan, and Bayesian modeling. We conducted a simulation study to compare their performance in detecting clusters, and applied all methods to a case study of dengue surveillance in Thailand in 2019 to further evaluate their practical utility. RESULTS In the simulation study, Getis Ord [Formula: see text] and Local Moran had similar performance, with most misdetections occurring at cluster boundaries and isolated hotspots. SaTScan showed better precision but was less effective at detecting inner outliers, although it performed well on large outbreaks. Bayesian convolution modeling had the highest overall precision in the simulation study. In the dengue case study in Thailand, Getis Ord [Formula: see text] and Local Moran missed most disease clusters, while SaTScan was mostly able to detect a large cluster. Bayesian disease mapping seemed to be the most effective, with adaptive detection of irregularly shaped disease anomalies. CONCLUSIONS Bayesian modeling showed to be the most effective method, demonstrating the best accuracy in adaptively identifying irregularly shaped disease anomalies. In contrast, SaTScan excelled in detecting large outbreaks and regular forms. This study provides empirical evidence for the selection of appropriate tools for dengue surveillance in Thailand, with potential applicability to other disease control programs in similar settings.
Collapse
Affiliation(s)
- Chawarat Rotejanaprasert
- Department of Tropical Hygiene, Faculty of Tropical Medicine, Mahidol University, Bangkok, Thailand.
- Mahidol-Oxford Tropical Medicine Research Unit, Faculty of Tropical Medicine, Mahidol University, Bangkok, Thailand.
| | - Kawin Chinpong
- Chulabhorn Learning and Research Centre, Chulabhorn Royal Academy, Bangkok, Thailand
| | - Andrew B Lawson
- Department of Public Health Sciences, Medical University of South Carolina, Charleston, SC, USA
- Usher Institute, University of Edinburgh, Edinburgh, UK
| | - Peerut Chienwichai
- Princess Srisavangavadhana College of Medicine, Chulabhorn Royal Academy, Bangkok, Thailand
| | - Richard J Maude
- Mahidol-Oxford Tropical Medicine Research Unit, Faculty of Tropical Medicine, Mahidol University, Bangkok, Thailand
- Harvard T.H. Chan School of Public Health, Harvard University, Cambridge, MA, USA
- Centre for Tropical Medicine and Global Health, Nuffield Department of Medicine, University of Oxford, Oxford, UK
- The Open University, Milton Keynes, UK
| |
Collapse
|
3
|
Tang IW, Bartell SM, Vieira VM. Unmatched spatially stratified controls: A simulation study examining efficiency and precision using spatially-diverse controls and generalized additive models. Spat Spatiotemporal Epidemiol 2023; 45:100584. [PMID: 37301599 DOI: 10.1016/j.sste.2023.100584] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/15/2022] [Revised: 03/15/2023] [Accepted: 04/07/2023] [Indexed: 06/12/2023]
Abstract
Unmatched spatially stratified random sampling (SSRS) of non-cases selects geographically balanced controls by dividing the study area into spatial strata and randomly selecting controls from all non-cases within each stratum. The performance of SSRS control selection was evaluated in a case study spatial analysis of preterm birth in Massachusetts. In a simulation study, we fit generalized additive models using controls selected by SSRS or simple random sample (SRS) designs. We compared mean squared error (MSE), bias, relative efficiency (RE), and statistically significant map results to the model results with all non-cases. SSRS designs had lower average MSE (0.0042-0.0044) and higher RE (77-80%) compared to SRS designs (MSE: 0.0072-0.0073; RE across designs: 71%). SSRS map results were more consistent across simulations, reliably identifying statistically significant areas. SSRS designs improved efficiency by selecting controls that are geographically distributed, particularly from low population density areas, and may be more appropriate for spatial analyses.
Collapse
Affiliation(s)
- Ian W Tang
- Department of Environmental and Occupational Health, Program in Public Health, Susan and Henry Samueli College of Health Sciences, University of California, 100 Theory Drive, Suite 100, Irvine, CA 92617, USA.
| | - Scott M Bartell
- Department of Environmental and Occupational Health, Program in Public Health, Susan and Henry Samueli College of Health Sciences, University of California, 100 Theory Drive, Suite 100, Irvine, CA 92617, USA; Department of Statistics, Donald Bren School of Information & Computer Sciences, University of California, Irvine, USA
| | - Verónica M Vieira
- Department of Environmental and Occupational Health, Program in Public Health, Susan and Henry Samueli College of Health Sciences, University of California, 100 Theory Drive, Suite 100, Irvine, CA 92617, USA
| |
Collapse
|
4
|
Briz-Redón Á, Iftimi A, Correcher JF, De Andrés J, Lozano M, Romero-García C. A comparison of multiple neighborhood matrix specifications for spatio-temporal model fitting: a case study on COVID-19 data. STOCHASTIC ENVIRONMENTAL RESEARCH AND RISK ASSESSMENT : RESEARCH JOURNAL 2021; 36:271-282. [PMID: 34421343 PMCID: PMC8371601 DOI: 10.1007/s00477-021-02077-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Accepted: 08/07/2021] [Indexed: 06/13/2023]
Abstract
Establishing proper neighbor relations between a set of spatial units under analysis is essential when carrying out a spatial or spatio-temporal analysis. However, it is usual that researchers choose some of the most typical (and simple) neighborhood structures, such as the first-order contiguity matrix, without exploring other options. In this paper, we compare the performance of different neighborhood matrices in the context of modeling the weekly relative risk of COVID-19 over small areas located in or near Valencia, Spain. Specifically, we construct contiguity-based, distance-based, covariate-based (considering mobility flows and sociodemographic characteristics), and hybrid neighborhood matrices. We evaluate the goodness of fit, the overall predictive quality, the ability to detect high-risk spatio-temporal units, the capability to capture the spatio-temporal autocorrelation in the data, and the goodness of smoothing for a set of spatio-temporal models based on each of the neighborhood matrices. The results show that contiguity-based matrices, some of the distance-based matrices, and those based on sociodemographic characteristics perform better than the matrices based on k-nearest neighbors and those involving mobility flows. In addition, we test the linear combination of some of the constructed neighborhood matrices and the reweighting of these matrices after eliminating weak neighbor relations, without any model improvement.
Collapse
Affiliation(s)
- Álvaro Briz-Redón
- Statistics Office, City Council of Valencia, Carrer de l’Arquebisbe Mayoral, 1, 46002 Valencia, Spain
| | - Adina Iftimi
- Department of Statistics and Operations Research, University of Valencia, Valencia, Spain
| | | | - Jose De Andrés
- Anesthesia Unit - Surgical Specialties Department, University of Valencia, Valencia, Spain
- Department of Anesthesia, Critical Care and Pain Unit, General University Hospital, Valencia, Spain
| | - Manuel Lozano
- Department of Preventive Medicine and Public Health, Food Sciences, Toxicology and Forensic Medicine, University of Valencia, Valencia, Spain
| | - Carolina Romero-García
- Department of Anesthesia, Critical Care and Pain Unit, General University Hospital, Valencia, Spain
- Division of Research Methodology, European University of Valencia, Valencia, Spain
| |
Collapse
|
5
|
Gao F, Kihal W, Le Meur N, Souris M, Deguen S. Does the edge effect impact on the measure of spatial accessibility to healthcare providers? Int J Health Geogr 2017; 16:46. [PMID: 29228961 PMCID: PMC5725922 DOI: 10.1186/s12942-017-0119-3] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2017] [Accepted: 11/26/2017] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Spatial accessibility indices are increasingly applied when investigating inequalities in health. Although most studies are making mentions of potential errors caused by the edge effect, many acknowledge having neglected to consider this concern by establishing spatial analyses within a finite region, settling for hypothesizing that accessibility to facilities will be under-reported. Our study seeks to assess the effect of edge on the accuracy of defining healthcare provider access by comparing healthcare provider accessibility accounting or not for the edge effect, in a real-world application. METHODS This study was carried out in the department of Nord, France. The statistical unit we use is the French census block known as 'IRIS' (Ilot Regroupé pour l'Information Statistique), defined by the National Institute of Statistics and Economic Studies. The geographical accessibility indicator used is the "Index of Spatial Accessibility" (ISA), based on the E2SFCA algorithm. We calculated ISA for the pregnant women population by selecting three types of healthcare providers: general practitioners, gynecologists and midwives. We compared ISA variation when accounting or not edge effect in urban and rural zones. The GIS method was then employed to determine global and local autocorrelation. Lastly, we compared the relationship between socioeconomic distress index and ISA, when accounting or not for the edge effect, to fully evaluate its impact. RESULTS The results revealed that on average ISA when offer and demand beyond the boundary were included is slightly below ISA when not accounting for the edge effect, and we found that the IRIS value was more likely to deteriorate than improve. Moreover, edge effect impact can vary widely by health provider type. There is greater variability within the rural IRIS group than within the urban IRIS group. We found a positive correlation between socioeconomic distress variables and composite ISA. Spatial analysis results (such as Moran's spatial autocorrelation index and local indicators of spatial autocorrelation) are not really impacted. CONCLUSION Our research has revealed minor accessibility variation when edge effect has been considered in a French context. No general statement can be set up because intensity of impact varies according to healthcare provider type, territorial organization and methodology used to measure the accessibility to healthcare. Additional researches are required in order to distinguish what findings are specific to a territory and others common to different countries. It constitute a promising direction to determine more precisely healthcare shortage areas and then to fight against social health inequalities.
Collapse
Affiliation(s)
- Fei Gao
- EHESP Rennes, Sorbonne Paris Cité, Paris, France. .,L'équipe REPERES, Recherche en Pharmaco-épidémiologie et recours aux soins, UPRES EA-7449, Rennes, France. .,Department of Quantitative Methods for Public Health, EHESP School of Public Health, Avenue du Professeur Léon Bernard, 35043, Rennes, France.
| | - Wahida Kihal
- LIVE UMR 7362 CNRS (Laboratoire Image Ville Environnement), University of Strasbourg, 6700, Strasbourg, France
| | - Nolwenn Le Meur
- EHESP Rennes, Sorbonne Paris Cité, Paris, France.,L'équipe REPERES, Recherche en Pharmaco-épidémiologie et recours aux soins, UPRES EA-7449, Rennes, France.,Department of Quantitative Methods for Public Health, EHESP School of Public Health, Avenue du Professeur Léon Bernard, 35043, Rennes, France
| | - Marc Souris
- IRD, UMR_D 190 "Emergence des Pathologies Virales" (IRD French Institute of Research for Development, Aix-Marseille University, EHESP French School of Public Health), Marseille, France
| | - Séverine Deguen
- EHESP Rennes, Sorbonne Paris Cité, Paris, France.,Department of Social Epidemiology, Sorbonne Universités, UPMC Univ Paris 06, INSERM, Institut Pierre Louis d'Epidémiologie et de Santé Publique (UMRS 1136), Paris, France
| |
Collapse
|
6
|
Martin MK, Helm J, Patyk KA. An approach for de-identification of point locations of livestock premises for further use in disease spread modeling. Prev Vet Med 2015; 120:131-140. [PMID: 25944175 DOI: 10.1016/j.prevetmed.2015.04.010] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2014] [Revised: 03/31/2015] [Accepted: 04/17/2015] [Indexed: 11/27/2022]
Abstract
We describe a method for de-identifying point location data used for disease spread modeling to allow data custodians to share data with modeling experts without disclosing individual farm identities. The approach is implemented in an open-source software program that is described and evaluated here. The program allows a data custodian to select a level of de-identification based on the K-anonymity statistic. The program converts a file of true farm locations and attributes into a file appropriate for use in disease spread modeling with the locations randomly modified to prevent re-identification based on location. Important epidemiological relationships such as clustering are preserved to as much as possible to allow modeling similar to those using true identifiable data. The software implementation was verified by visual inspection and basic descriptive spatial analysis of the output. Performance is sufficient to allow de-identification of even large data sets on desktop computers available to any data custodian.
Collapse
Affiliation(s)
- Michael K Martin
- Livestock Poultry Health Division, Clemson University, Columbia, SC 29224, USA.
| | - Julie Helm
- Livestock Poultry Health Division, Clemson University, Columbia, SC 29224, USA
| | - Kelly A Patyk
- U.S Department of Agriculture, Animal and Plant Health Inspection Service, Veterinary Services, Science Technology and Analysis Services, Center for Epidemiology and Animal Health, 2150 Centre Avenue, Building B, Fort Collins, CO 80526, USA
| |
Collapse
|
7
|
Abstract
In this article, we consider the problem of comparing the distribution of observations in a planar region to a pre-specified null distribution. Our motivation is a surveillance setting where we map locations of incident disease, aiming to monitor these data over time, to locate potential areas of high/low incidence so as to direct public health actions. We propose a non-parametric approach to distance-based disease risk mapping inspired by tomographic imaging. We consider several one-dimensional projections via the observed distribution of distances to a chosen fixed point; we then compare this distribution to that expected under the null and average these comparisons across projections to compute a relative-risk-like score at each point in the region. The null distribution can be established from historical data. Scores are displayed on the map using a color scale. In addition, we give a detailed description of the method along with some desirable theoretical properties. To further assess the performance of this method, we compare it to the widely used log ratio of kernel density estimates. As a performance metric, we evaluate the accuracy to locate simulated spatial clusters superimposed on a uniform distribution in the unit disk. Results suggest that both methods can adequately locate this increased risk but each relies on an appropriate choice of parameters. Our proposed method, distance-based mapping (DBM), can also generalize to arbitrary metric spaces and/or high-dimensional data.
Collapse
|
8
|
Neyens T, Faes C, Molenberghs G. A generalized Poisson-gamma model for spatially overdispersed data. Spat Spatiotemporal Epidemiol 2011; 3:185-94. [PMID: 22749204 DOI: 10.1016/j.sste.2011.10.004] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/05/2011] [Revised: 10/18/2011] [Accepted: 10/31/2011] [Indexed: 10/15/2022]
Abstract
Modern disease mapping commonly uses hierarchical Bayesian methods to model overdispersion and spatial correlation. Classical random-effects based solutions include the Poisson-gamma model, which uses the conjugacy between the Poisson and gamma distributions, but which does not model spatial correlation, on the one hand, and the more advanced CAR model, which also introduces a spatial autocorrelation term but without a closed-form posterior distribution on the other. In this paper, a combined model is proposed: an alternative convolution model accounting for both overdispersion and spatial correlation in the data by combining the Poisson-gamma model with a spatially-structured normal CAR random effect. The Limburg Cancer Registry data on kidney and prostate cancer in Limburg were used to compare the conventional and new models. A simulation study confirmed results and interpretations coming from the real datasets. Relative risk maps showed that the combined model provides an intermediate between the non-patterned negative binomial and the sometimes oversmoothed CAR convolution model.
Collapse
Affiliation(s)
- Thomas Neyens
- I-BioStat, Universiteit Hasselt, B-3590 Diepenbeek, Belgium
| | | | | |
Collapse
|
9
|
Fortney J, Rushton G, Wood S, Zhang L, Xu S, Dong F, Rost K. Community-level risk factors for depression hospitalizations. ADMINISTRATION AND POLICY IN MENTAL HEALTH AND MENTAL HEALTH SERVICES RESEARCH 2007; 34:343-52. [PMID: 17294123 DOI: 10.1007/s10488-007-0117-z] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2006] [Accepted: 01/12/2007] [Indexed: 11/27/2022]
Abstract
This study measured geographic variation in depression hospitalizations and identified community-level risk factors. Depression hospitalizations were identified from the Statewide Inpatient Database. The dependent variable was specified as the indirectly standardized hospitalization rate. County-level data for 14 states were collected from federal agencies. The Bayesian spatial regression model included socio-demographic, economic, and health system characteristics as independent variables. There were 8.5 depression hospitalizations per 1,000 residents. 8.8% of counties had hospitalization rates 33% greater than the standardized rate. Significant risk factors included unemployment, poverty, physician supply, and hospital bed supply. Significant protective factors included rurality, economic dependence, and housing stress.
Collapse
Affiliation(s)
- John Fortney
- Health Services Research and Development, Center for Mental Health and Outcomes Research, Central Arkansas Veterans Healthcare System, North Little Rock, AR, USA.
| | | | | | | | | | | | | |
Collapse
|