1
|
French JP, Meysami M, Lipner EM. Prefiltered component-based greedy (PreCoG) scan method. Stat Med 2024; 43:4113-4130. [PMID: 38992939 DOI: 10.1002/sim.10170] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2023] [Revised: 06/22/2024] [Accepted: 06/28/2024] [Indexed: 07/13/2024]
Abstract
The spatial distribution of disease cases can provide important insights into disease spread and its potential risk factors. Identifying disease clusters correctly can help us discover new risk factors and inform interventions to control and prevent the spread of disease as quickly as possible. In this study, we propose a novel scan method, the Prefiltered Component-based Greedy (PreCoG) scan method, which efficiently and accurately detects irregularly shaped clusters using a prefiltered component-based algorithm. The PreCoG scan method's flexibility allows it to perform well in detecting both regularly and irregularly-shaped clusters. Additionally, it is fast to apply while providing high power, sensitivity, and positive predictive value for the detected clusters compared to other scan methods. To confirm the effectiveness of the PreCoG method, we compare its performance to many other scan methods. Additionally, we have implemented this method in the smerc R package to make it publicly available to other researchers. Our proposed PreCoG scan method presents a unique and innovative process for detecting disease clusters and can improve the accuracy of disease surveillance systems.
Collapse
Affiliation(s)
- Joshua P French
- Department of Mathematical and Statistical Sciences, University of Colorado Denver, Denver, Colorado, USA
| | - Mohammad Meysami
- Department of Mathematics, Clarkson University, Potsdam, New York, USA
| | - Ettie M Lipner
- The National Institutes of Health, Stapleton, Maryland, USA
| |
Collapse
|
2
|
Ullah S, Barakzai MAK, Xie T. Performance of a negative binomial-GLM in spatial scan statistic: a case study of low-birth weights in Pakistan. GEOSPATIAL HEALTH 2024; 19. [PMID: 39228273 DOI: 10.4081/gh.2024.1313] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/12/2024] [Accepted: 08/08/2024] [Indexed: 09/05/2024]
Abstract
Spatial cluster analyses of health events are useful for enabling targeted interventions. Spatial scan statistic is the stateof- the-art method for this kind of analysis and the Poisson Generalized Linear Model (GLM) approach to the spatial scan statistic can be used for count data for spatial cluster detection with covariate adjustment. However, its use for modelling is limited due to data over-dispersion. A Generalized Linear Mixed Model (GLMM) has recently been proposed for modelling this kind of over-dispersion by incorporating random effects to model area-specific intrinsic variation not explained by other covariates in the model. However, these random effects may exhibit a geographical correlation, which may lead to a potential spatial cluster being undetected. To handle the over-dispersion in the count data, this study aimed to evaluate the performance of a negative binomial- GLM in spatial scan statistic on real-world data of low birth weights in Khyber-Pakhtunkhwa Province, Pakistan, 2019. The results were compared with the Poisson-GLM and GLMM, showing that the negative binomial-GLM is an ideal choice for spatial scan statistic in the presence of over-dispersed data. With a covariate (maternal anaemia) adjustment, the negative binomial-GLMbased spatial scan statistic detected one significant cluster covering Dir lower district. Without the covariate adjustment, it detected two clusters, each covering one district. The district of Peshawar was seen as the most likely cluster and Battagram as the secondary cluster. However, none of the clusters were detected by GLMM spatial scan statistic, which might be due to the spatial correlation of the random effects in GLMM.
Collapse
Affiliation(s)
- Sami Ullah
- School of Mathematics, Statistics and Mechanics, Beijing University of Technology.
| | | | - Tianfa Xie
- School of Mathematics, Statistics and Mechanics, Beijing University of Technology.
| |
Collapse
|
3
|
Frévent C, Ahmed MS, Dabo-Niang S, Genin M. A Shared-Frailty Spatial Scan Statistic Model for Time-to-Event Data. Biom J 2024; 66:e202300200. [PMID: 38988210 DOI: 10.1002/bimj.202300200] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2023] [Revised: 01/24/2024] [Accepted: 05/04/2024] [Indexed: 07/12/2024]
Abstract
Spatial scan statistics are well-known methods widely used to detect spatial clusters of events. Furthermore, several spatial scan statistics models have been applied to the spatial analysis of time-to-event data. However, these models do not take account of potential correlations between the observations of individuals within the same spatial unit or potential spatial dependence between spatial units. To overcome this problem, we have developed a scan statistic based on a Cox model with shared frailty and that takes account of the spatial dependence between spatial units. In simulation studies, we found that (i) conventional models of spatial scan statistics for time-to-event data fail to maintain the type I error in the presence of a correlation between the observations of individuals within the same spatial unit and (ii) our model performed well in the presence of such correlation and spatial dependence. We have applied our method to epidemiological data and the detection of spatial clusters of mortality in patients with end-stage renal disease in northern France.
Collapse
Affiliation(s)
- Camille Frévent
- Université de Lille, CHU Lille, ULR 2694 - METRICS: Évaluation des technologies de santé et des pratiques médicales, Université de Lille, Lille, France
| | - Mohamed-Salem Ahmed
- Université de Lille, CHU Lille, ULR 2694 - METRICS: Évaluation des technologies de santé et des pratiques médicales, Université de Lille, Lille, France
- Alicante SARL, Lesquin, France
| | - Sophie Dabo-Niang
- CNRS, UMR 8524 - Laboratoire Paul Painlevé, Université de Lille, Lille, France
- MODAL team, INRIA Lille-Nord Europe, Villeneuve-d'Ascq, France
| | - Michaël Genin
- Université de Lille, CHU Lille, ULR 2694 - METRICS: Évaluation des technologies de santé et des pratiques médicales, Université de Lille, Lille, France
| |
Collapse
|
4
|
Boyle J, Ward MH, Cerhan JR, Rothman N, Wheeler DC. Assessing and attenuating the impact of selection bias on spatial cluster detection studies. Spat Spatiotemporal Epidemiol 2024; 49:100659. [PMID: 38876558 PMCID: PMC11180222 DOI: 10.1016/j.sste.2024.100659] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/27/2023] [Revised: 05/09/2024] [Accepted: 05/10/2024] [Indexed: 06/16/2024]
Abstract
Spatial cluster analyses are commonly used in epidemiologic studies of case-control data to detect whether certain areas in a study region have an excess of disease risk. Case-control studies are susceptible to potential biases including selection bias, which can result from non-participation of eligible subjects in the study. However, there has been no systematic evaluation of the effects of non-participation on the findings of spatial cluster analyses. In this paper, we perform a simulation study assessing the effect of non-participation on spatial cluster analysis using the local spatial scan statistic under a variety of scenarios that vary the location and rates of study non-participation and the presence and intensity of a zone of elevated risk for disease for simulated case-control studies. We find that geographic areas of lower participation among controls than cases can greatly inflate false-positive rates for identification of artificial spatial clusters. Additionally, we find that even modest non-participation outside of a true zone of elevated risk can decrease spatial power to identify the true zone. We propose a spatial algorithm to correct for potentially spatially structured non-participation that compares the spatial distributions of the observed sample and underlying population. We demonstrate its ability to markedly decrease false positive rates in the absence of elevated risk and resist decreasing spatial sensitivity to detect true zones of elevated risk. We apply our method to a case-control study of non-Hodgkin lymphoma. Our findings suggest that greater attention should be paid to the potential effects of non-participation in spatial cluster studies.
Collapse
Affiliation(s)
- Joseph Boyle
- Department of Biostatistics, Virginia Commonwealth University, Richmond, VA, USA.
| | - Mary H Ward
- Occupational and Environmental Epidemiology Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, MD, USA
| | - James R Cerhan
- Department of Quantitative Health Sciences, Mayo Clinic, Rochester, MN, USA
| | - Nathaniel Rothman
- Occupational and Environmental Epidemiology Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, MD, USA
| | - David C Wheeler
- Department of Biostatistics, Virginia Commonwealth University, Richmond, VA, USA
| |
Collapse
|
5
|
Moon J, Jung I. A simulation study for geographic cluster detection analysis on population-based health survey data using spatial scan statistics. Int J Health Geogr 2022; 21:11. [PMID: 36085072 PMCID: PMC9463844 DOI: 10.1186/s12942-022-00311-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2022] [Accepted: 08/26/2022] [Indexed: 11/15/2022] Open
Abstract
Background In public health and epidemiology, spatial scan statistics can be used to identify spatial cluster patterns of health-related outcomes from population-based health survey data. Although it is appropriate to consider the complex sample design and sampling weight when analyzing complex sample survey data, the observed survey responses without these considerations are often used in many studies related to spatial cluster detection. Methods We conducted a simulation study to investigate which data type from complex survey data is more suitable for use by comparing the spatial cluster detection results of three approaches: (1) individual-level data, (2) weighted individual-level data, and (3) aggregated data. Results The results of the spatial cluster detection varied depending on the data type. To compare the performance of spatial cluster detection, sensitivity and positive predictive value (PPV) were evaluated over 100 iterations. The average sensitivity was high for all three approaches, but the average PPV was higher when using aggregated data than when using individual-level data with or without sampling weights. Conclusions Through the simulation study, we found that use of aggregate-level data is more appropriate than other types of data, when searching for spatial clusters using spatial scan statistics on population-based health survey data.
Collapse
Affiliation(s)
- Jisu Moon
- Division of Biostatistics, Department of Biomedical Systems Informatics, Yonsei University College of Medicine, 50-1 Yonsei-ro, Seodaemun-gu, Seoul, 03722, Korea
| | - Inkyung Jung
- Division of Biostatistics, Department of Biomedical Systems Informatics, Yonsei University College of Medicine, 50-1 Yonsei-ro, Seodaemun-gu, Seoul, 03722, Korea.
| |
Collapse
|
6
|
Matos de Carvalho D, Amorim do Amaral GJ, De Bastiani F. Spatial scan statistics based on empirical likelihood. COMMUN STAT-SIMUL C 2021. [DOI: 10.1080/03610918.2021.1949470] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
Affiliation(s)
- Daniel Matos de Carvalho
- Statistics Department, Federal Institute of Paraíba, João Pessoa, Paraíba, Brazil
- Statistics Department, Federal University of Pernambuco, Recife, Pernambuco, Brazil
| | | | - Fernanda De Bastiani
- Statistics Department, Federal University of Pernambuco, Recife, Pernambuco, Brazil
| |
Collapse
|
7
|
Sun Y, Xie J, Hu X. Detecting Spatial Clusters of Coronavirus Infection Across London During the Second Wave. APPLIED SPATIAL ANALYSIS AND POLICY 2021; 15:557-571. [PMID: 34367372 PMCID: PMC8330217 DOI: 10.1007/s12061-021-09413-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/21/2020] [Accepted: 06/30/2021] [Indexed: 05/31/2023]
Abstract
The identification of seriously infected areas across a city, region, or country can inform policies and assist in resources allocation. Concentration of coronavirus infection can be identified through applying cluster detection methods to coronavirus cases over space. To enhance the identification of seriously infected areas by relevant studies, this study focused on coronavirus infection by small area across a city during the second wave. Specifically, we firstly explored spatiotemporal patterns of new coronavirus cases. Subsequently, we detected spatial clusters of new coronavirus cases by small area. Empirically, we used the London-wide small-area coronavirus infection data aggregately collected. Methodologically, we applied a fast Bayesian model-based detection method newly developed to new coronavirus cases by small area. As empirical evidence on the association of socioeconomic factors and coronavirus spread have been found, spatial patterns of coronavirus infection are arguably associated with socioeconomic and built environmental characteristics. Therefore, we further investigated the socioeconomic and built environmental characteristics of the clusters detected. As a result, the most significant clusters of new cases during the second wave are likely to occur around the airports. And, lower income or lower healthcare accessibility is associated with concentration of coronavirus infection across London.
Collapse
Affiliation(s)
- Yeran Sun
- Department of Geography, College of Science, Swansea University, Swansea, SA2 8PP UK
| | - Jing Xie
- Faculty of Architecture, The University of Hong Kong, Knowles Building, Pokfulam Road, Hong Kong, 999077 China
| | - Xuke Hu
- Institute of Data Science, German Aerospace Center (DLR), 07745 Jena, Germany
| |
Collapse
|
8
|
Choi YG, Hanrahan LP, Norton D, Zhao YQ. Simultaneous spatial smoothing and outlier detection using penalized regression, with application to childhood obesity surveillance from electronic health records. Biometrics 2020; 78:324-336. [PMID: 33215685 DOI: 10.1111/biom.13404] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2019] [Revised: 09/24/2020] [Accepted: 11/06/2020] [Indexed: 11/28/2022]
Abstract
Electronic health records (EHRs) have become a platform for data-driven granular-level surveillance in recent years. In this paper, we make use of EHRs for early prevention of childhood obesity. The proposed method simultaneously provides smooth disease mapping and outlier information for obesity prevalence that are useful for raising public awareness and facilitating targeted intervention. More precisely, we consider a penalized multilevel generalized linear model. We decompose regional contribution into smooth and sparse signals, which are automatically identified by a combination of fusion and sparse penalties imposed on the likelihood function. In addition, we weigh the proposed likelihood to account for the missingness and potential nonrepresentativeness arising from the EHR data. We develop a novel alternating minimization algorithm, which is computationally efficient, easy to implement, and guarantees convergence. Simulation studies demonstrate superior performance of the proposed method. Finally, we apply our method to the University of Wisconsin Population Health Information Exchange database.
Collapse
Affiliation(s)
- Young-Geun Choi
- Department of Statistics, Sookmyung Women's University, Seoul, South Korea
| | - Lawrence P Hanrahan
- Department of Family Medicine, and Community Health, University of Wisconsin-Madison, Madison, Wisconsin
| | - Derek Norton
- Department of Biostatistics, and Medical Informatics, University of Wisconsin-Madison, Madison, Wisconsin
| | - Ying-Qi Zhao
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, Washington
| |
Collapse
|
9
|
Lee J, Kamenetsky ME, Gangnon RE, Zhu J. Clustered spatio-temporal varying coefficient regression model. Stat Med 2020; 40:465-480. [PMID: 33103247 DOI: 10.1002/sim.8785] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2020] [Revised: 06/28/2020] [Accepted: 08/08/2020] [Indexed: 12/14/2022]
Abstract
In regression analysis for spatio-temporal data, identifying clusters of spatial units over time in a regression coefficient could provide insight into the unique relationship between a response and covariates in certain subdomains of space and time windows relative to the background in other parts of the spatial domain and the time period of interest. In this article, we propose a varying coefficient regression method for spatial data repeatedly sampled over time, with heterogeneity in regression coefficients across both space and over time. In particular, we extend a varying coefficient regression model for spatial-only data to spatio-temporal data with flexible temporal patterns. We consider the detection of a potential cylindrical cluster of regression coefficients based on testing whether the regression coefficient is the same or not over the entire spatial domain for each time point. For multiple clusters, we develop a sequential identification approach. We assess the power and identification of known clusters via a simulation study. Our proposed methodology is illustrated by the analysis of a cancer mortality dataset in the Southeast of the U.S.
Collapse
Affiliation(s)
- Junho Lee
- Statistics Program, CEMSE Division, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
| | - Maria E Kamenetsky
- Department of Population Health Sciences, University of Wisconsin-Madison, Madison, Wisconsin, USA
| | - Ronald E Gangnon
- Department of Population Health Sciences, University of Wisconsin-Madison, Madison, Wisconsin, USA.,Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, Wisconsin, USA
| | - Jun Zhu
- Department of Statistics, University of Wisconsin-Madison, Madison, Wisconsin, USA
| |
Collapse
|
10
|
Discovering Spatio-Temporal Clusters of Road Collisions Using the Method of Fast Bayesian Model-Based Cluster Detection. SUSTAINABILITY 2020. [DOI: 10.3390/su12208681] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Public availability of geo-coded or geo-referenced road collisions (crashes) makes it possible to perform geovisualisation and spatio-temporal analysis of road collisions across a city. This study aims to detect spatio-temporal clusters of road collisions across Greater London between 2010 and 2014. We implemented a fast Bayesian model-based cluster detection method with no covariates and after adjusting for potential covariates respectively. As empirical evidence on the association of street connectivity measures and the occurrence of road collisions had been found, we selected street connectivity measures as the potential covariates in our cluster detection. Results of the most significant cluster and the second most significant cluster during five consecutive years are located around the central areas. Moreover, after adjusting the covariates, the most significant cluster moves from the central areas of London to its peripheral areas, while the second most significant cluster remains unchanged. Additionally, one potential covariate used in this study, length-based road density, exhibits a positive association with the number of road collisions; meanwhile count-based intersection density displays a negative association. Although the covariates (i.e., road density and intersection density) exhibit potential impact on the clusters of road collisions, they are unlikely to contribute to the majority of clusters. Furthermore, the method of fast Bayesian model-based cluster detection is developed to discover spatio-temporal clusters of serious injury collisions. Most of the areas at risk of serious injury collisions overlay those at risk of road collisions. Although not being identified as areas at risk of road collisions, some districts, e.g., City of London, are regarded as areas at risk of serious injury collisions.
Collapse
|
11
|
Ahmed M, Genin M. A functional‐model‐adjusted spatial scan statistic. Stat Med 2020; 39:1025-1040. [DOI: 10.1002/sim.8459] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2019] [Revised: 10/23/2019] [Accepted: 12/04/2019] [Indexed: 11/06/2022]
Affiliation(s)
- Mohamed‐Salem Ahmed
- EA2694 ‐ Santé publique : épidemiologie et qualité des soinsUniversity of Lille Lille France
- CHU Lille Lille France
| | - Michaël Genin
- EA2694 ‐ Santé publique : épidemiologie et qualité des soinsUniversity of Lille Lille France
- CHU Lille Lille France
| |
Collapse
|
12
|
Sugishita Y, Sugawara T, Ohkusa Y, Ishikawa T, Yoshida M, Endo H. Syndromic surveillance using ambulance transfer data in Tokyo, Japan. J Infect Chemother 2019; 26:8-12. [PMID: 31611069 DOI: 10.1016/j.jiac.2019.09.011] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2019] [Revised: 09/08/2019] [Accepted: 09/15/2019] [Indexed: 11/28/2022]
Abstract
Bioterrorism attacks become more probable when important high-profile international or political events are held, such as G7 summit meetings or mass gathering events including Olympic and Paralympic games and FIFA World Cup tournaments. Outbreaks of infectious disease and widespread incidents of food poisoning are also public health concerns at such times. In Japan, the Tokyo Metropolitan Government operates Ambulance Transfer Syndromic Surveillance (ATSS), which can help monitor such incidents. The present study presents and assesses the ATSS framework. During the study period of October 2017 through November 2018, we monitored 33 areas for symptoms of 9 categories: vomiting/nausea, dizziness, palpitation, unconsciousness, breathing disorder, fever, spasm/paralysis, collapse/weakness, and bloody emesis/nasal hemorrhage. Among all symptoms, we found 9929 low-level aberrations, 2537 medium-level aberrations, and 577 high-level aberrations, with respective frequencies of 9.2%, 2.3%, and 0.5%. Of those, Tokyo Metropolitan Institute of Public Health reported the information to Tokyo Metropolitan Government 28 times during the period. Of the 28 identified clusters, Tokyo Metropolitan Government judged the necessity for investigating 7. All of those were investigated at hospitals by the jurisdictional public health center. Because ATSS covers almost the entire Tokyo metropolitan area, with about 13.8 million residents, it is definitely the largest syndromic surveillance in the world.
Collapse
Affiliation(s)
- Yoshiyuki Sugishita
- National Institute of Infectious Diseases, Japan; Bureau of Social Welfare and Public Health, Tokyo Metropolitan Government, Japan.
| | | | | | | | - Michihiko Yoshida
- Bureau of Social Welfare and Public Health, Tokyo Metropolitan Government, Japan
| | | |
Collapse
|
13
|
Takahashi K, Shimadzu H. Multiple-cluster detection test for purely temporal disease clustering: Integration of scan statistics and generalized linear models. PLoS One 2018; 13:e0207821. [PMID: 30462741 PMCID: PMC6249023 DOI: 10.1371/journal.pone.0207821] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2018] [Accepted: 11/05/2018] [Indexed: 11/18/2022] Open
Abstract
The spatial scan statistic is commonly used to detect spatial and/or temporal disease clusters in epidemiological studies. Although multiple clusters in the study space can be thus identified, current theoretical developments are mainly based on detecting a ‘single’ cluster. The standard scan statistic procedure enables the detection of multiple clusters, recursively identifying additional ‘secondary’ clusters. However, their p-values are calculated one at a time, as if each cluster is a primary one. Therefore, a new procedure that can accurately evaluate multiple clusters as a whole is needed. The present study focuses on purely temporal cases and then proposes a new test procedure that evaluates the p-value for multiple clusters, combining generalized linear models with an information criterion approach. This framework encompasses the conventional, currently widely used detection procedure as a special case. An application study adopting the new framework is presented, analysing the Japanese daily incidence of out-of-hospital cardiac arrest cases. The analysis reveals that the number of the incident increases around New Year’s Day in Japan. Further, simulation studies undertaken confirm that the proposed method possesses a consistency property that tends to select the correct number of clusters when the truth is known.
Collapse
Affiliation(s)
- Kunihiko Takahashi
- Department of Biostatistics, Nagoya University Graduate School of Medicine, Nagoya, Japan
- * E-mail:
| | - Hideyasu Shimadzu
- Department of Mathematical Sciences, Loughborough University, Loughborough, Leicestershire, United Kingdom
| |
Collapse
|
14
|
Peprah S, Curreiro FC, Hayes JH, Stern K, Parekh S, D’Souza G. A spatiotemporal analysis of invasive cervical cancer incidence in the state of Maryland between 2003 and 2012. Cancer Causes Control 2018. [DOI: 10.1007/s10552-018-1019-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
|
15
|
A modified generalized lasso algorithm to detect local spatial clusters for count data. ASTA ADVANCES IN STATISTICAL ANALYSIS 2018. [DOI: 10.1007/s10182-018-0318-7] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
16
|
Lin P, Kung Y, Clayton M. Spatial scan statistics for detection of multiple clusters with arbitrary shapes. Biometrics 2016; 72:1226-1234. [DOI: 10.1111/biom.12509] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2014] [Revised: 12/01/2015] [Accepted: 02/01/2016] [Indexed: 11/28/2022]
Affiliation(s)
- Pei‐Sheng Lin
- Division of Biostatistics and Bioinformatics, National Health Research Institutes Taiwan
- Department of Mathematics, National Chung Cheng University Taiwan
| | - Yi‐Hung Kung
- Division of Biostatistics and Bioinformatics, National Health Research Institutes Taiwan
| | - Murray Clayton
- Department of Statistics, University of Wisconsin‐Madison Wisconsin, U.S.A
| |
Collapse
|
17
|
Jung I, Park G. p-value approximations for spatial scan statistics using extreme value distributions. Stat Med 2014; 34:504-14. [PMID: 25345856 DOI: 10.1002/sim.6347] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2014] [Revised: 09/21/2014] [Accepted: 10/07/2014] [Indexed: 01/26/2023]
Abstract
Spatial scan statistics are widely applied to identify spatial clusters in geographic disease surveillance. To evaluate the statistical significance of detected clusters, Monte Carlo hypothesis testing is often used because the null distribution of spatial scan statistics is not known. A drawback of the method is that we have to increase the number of replications to obtain accurate p-values. Gumbel-based p-value approximations for spatial scan statistics have recently been proposed and evaluated for Poisson and Bernoulli models. In this study, we examine the use of a generalized extreme value distribution to approximate the null distribution of spatial scan statistics as well as the Gumbel distribution. Through simulation, p-value approximations using extreme value distributions for spatial scan statistics are assessed for multinomial and ordinal models in addition to Poisson and Bernoulli models.
Collapse
Affiliation(s)
- Inkyung Jung
- Department of Biostatistics, Yonsei University College of Medicine, Seoul, 120-752, Korea
| | | |
Collapse
|
18
|
Luo J. A Spatial Scan Statistic on Trends. JOURNAL OF STATISTICAL THEORY AND PRACTICE 2014. [DOI: 10.1080/15598608.2014.847765] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
|
19
|
Affiliation(s)
- Pei-Sheng Lin
- Division of Biostatistics and Bioinformatics; National Health Research Institutes
- Department of Mathematics; National Chung Cheng University
| |
Collapse
|
20
|
|
21
|
Jung I, Lee H. Spatial cluster detection for ordinal outcome data. Stat Med 2012; 31:4040-8. [DOI: 10.1002/sim.5475] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2011] [Accepted: 04/14/2012] [Indexed: 01/17/2023]
Affiliation(s)
- Inkyung Jung
- Department of Biostatistics; Yonsei University College of Medicine; 250 Seongsanno, Seodaemun-gu; Seoul; 120-752; Korea
| | - Hana Lee
- Department of Biostatistics; Yonsei University College of Medicine; 250 Seongsanno, Seodaemun-gu; Seoul; 120-752; Korea
| |
Collapse
|
22
|
Zhang T, Zhang Z, Lin G. Spatial scan statistics with overdispersion. Stat Med 2011; 31:762-74. [PMID: 22052573 DOI: 10.1002/sim.4404] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2011] [Accepted: 08/13/2011] [Indexed: 11/07/2022]
Abstract
The spatial scan statistic has been widely used in spatial disease surveillance and spatial cluster detection for more than a decade. However, overdispersion often presents in real-world data, causing not only violation of the Poisson assumption but also excessive type I errors or false alarms. In order to account for overdispersion, we extend the Poisson-based spatial scan test to a quasi-Poisson-based test. The simulation shows that the proposed method can substantially reduce type I error probabilities in the presence of overdispersion. In a case study of infant mortality in Jiangxi, China, both tests detect a cluster; however, a secondary cluster is identified by only the Poisson-based test. It is recommended that a cluster detected by the Poisson-based scan test should be interpreted with caution when it is not confirmed by the quasi-Poisson-based test.
Collapse
Affiliation(s)
- Tonglin Zhang
- Department of Statistics, Purdue University, 250 North University Street, West Lafayette, IN 47907-2066, USA.
| | | | | |
Collapse
|
23
|
Chan HP, Tu IP. Log-linear, logistic model fitting and local score statistics for cluster detection with covariate adjustments. Stat Med 2011; 30:91-100. [DOI: 10.1002/sim.4082] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2009] [Accepted: 08/17/2010] [Indexed: 11/12/2022]
|
24
|
Abstract
As a geographical cluster detection analysis tool, the spatial scan statistic has been developed for different types of data such as Bernoulli, Poisson, ordinal, exponential and normal. Another interesting data type is multinomial. For example, one may want to find clusters where the disease-type distribution is statistically significantly different from the rest of the study region when there are different types of disease. In this paper, we propose a spatial scan statistic for such data, which is useful for geographical cluster detection analysis for categorical data without any intrinsic order information. The proposed method is applied to meningitis data consisting of five different disease categories to identify areas with distinct disease-type patterns in two counties in the U.K. The performance of the method is evaluated through a simulation study.
Collapse
Affiliation(s)
- Inkyung Jung
- Department of Biostatistics, Yonsei University College of Medicine, Seoul, Korea.
| | | | | |
Collapse
|
25
|
Gómez-Rubio V, López-Quílez A. Statistical methods for the geographical analysis of rare diseases. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2010; 686:151-71. [PMID: 20824445 DOI: 10.1007/978-90-481-9485-8_10] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/18/2023]
Abstract
In this chapter we provide a summary of different methods for the detection of disease clusters. First of all, we give a summary of methods for computing estimates of the relative risk. These estimates provide smoothed values of the relative risks that can account for its spatial variation. Some methods for assessing spatial autocorrelation and general clustering are also discussed to test for significant spatial variation of the risk. In order to find the actual location of the clusters, scan methods are introduced. The spatial scan statistic is discussed as well as its extension by means of Generalised Linear Models that allows for the inclusion of covariates and cluster effects. In this context, zero-inflated models are introduced to account for the high number of zeros that appear when studying rare diseases. Finally, two applications of these methods are shown using data of Systemic Lupus Erythematosus in Spain and brain cancer in Navarre (Spain).
Collapse
Affiliation(s)
- Virgilio Gómez-Rubio
- Departamento de Matemáticas, Universidad de Castilla-La Mancha, Escuela de Ingenieros Industriales, Albacete, Spain.
| | | |
Collapse
|