1
|
Chen J, Ferguson T, Jorgensen P. Using Scan Statistics for Cluster Detection: Recognizing Real Bandwagons. Methodol Comput Appl Probab 2019. [DOI: 10.1007/s11009-019-09737-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
|
2
|
Affiliation(s)
- Suchismita Goswami
- Computational and Data Sciences, George Mason University, Fairfax, VA, USA
| | - Edward J. Wegman
- Computational and Data Sciences, George Mason University, Fairfax, VA, USA
| |
Collapse
|
3
|
Detection and Analysis of Spikes in a Random Sequence. Methodol Comput Appl Probab 2018. [DOI: 10.1007/s11009-018-9637-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
|
4
|
Lai WT, Chen CH, Hung H, Chen RB, Shete S, Wu CC. Recognizing spatial and temporal clustering patterns of dengue outbreaks in Taiwan. BMC Infect Dis 2018; 18:256. [PMID: 29866173 PMCID: PMC5987425 DOI: 10.1186/s12879-018-3159-9] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2017] [Accepted: 05/23/2018] [Indexed: 01/05/2023] Open
Abstract
BACKGROUND Dengue fever is the most common arboviral infection in humans, with viral transmissions occurring in more than 100 countries in tropical regions. A global strategy for dengue prevention and control was established more than 10 years ago. However, the factors that drive the transmission of the dengue virus and subsequent viral infection continue unabated. The largest dengue outbreaks in Taiwan since World War II occurred in two recent successive years: 2014 and 2015. METHODS We performed a systematic analysis to detect and recognize spatial and temporal clustering patterns of dengue incidence in geographical areas of Taiwan, using the map-based pattern recognition procedure and scan test. Our aim was to recognize geographical heterogeneity patterns of varying dengue incidence intensity and detect hierarchical incidence intensity clusters. RESULTS Using the map-based pattern recognition procedure, we identified and delineated two separate hierarchical dengue incidence intensity clusters that comprise multiple mutually adjacent geographical units with high dengue incidence rates. We also found that that dengue incidence tends to peak simultaneously and homogeneously among the neighboring geographic units with high rates in the same cluster. CONCLUSION Beyond significance testing, this study is particularly desired by and useful for health authorities who require optimal characteristics of disease incidence patterns on maps and over time. Among the integrated components for effective prevention and control of dengue and dengue hemorrhagic fever are active surveillance and community-based integrated mosquito control, for which this study provides valuable inferences. Effective dengue prevention and control programs in Taiwan are critical, and have the added benefit of controlling the potential emergence of Zika.
Collapse
Affiliation(s)
- Wei-Ting Lai
- Department of Statistics, College of Management, National Cheng Kung University, Tainan, Taiwan
| | - Chien-Hsiun Chen
- Institute of Biomedical Sciences, Academia Sinica, Taipei, Taiwan
- School of Chinese Medicine, China Medical University, Taichung, Taiwan
| | - Hsin Hung
- Department of Environmental and Occupational Health, College of Medicine, National Cheng Kung University, 1 University Road Tainan, 701, Tainan, Taiwan
| | - Ray-Bing Chen
- Department of Statistics, College of Management, National Cheng Kung University, Tainan, Taiwan
| | - Sanjay Shete
- Departments of Biostatistics, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
- Departments of Epidemiology, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Chih-Chieh Wu
- Department of Environmental and Occupational Health, College of Medicine, National Cheng Kung University, 1 University Road Tainan, 701, Tainan, Taiwan.
| |
Collapse
|
5
|
Assessing current temporal and space-time anomalies of disease incidence. PLoS One 2017; 12:e0188065. [PMID: 29131869 PMCID: PMC5683561 DOI: 10.1371/journal.pone.0188065] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2017] [Accepted: 10/31/2017] [Indexed: 11/19/2022] Open
Abstract
Approaches used to early and accurately characterize epidemiologic patterns of disease incidence in a temporal and spatial series are becoming increasingly important. Cluster tests are generally designed for retrospective detection of epidemiologic anomalies in a temporal or space-time series. Timely identification of anomalies of disease or poisoning incidence during ongoing surveillance or an outbreak requires the use of sensitive statistical methods that recognize an incidence pattern at the time of occurrence. This report describes 2 novel analytical methods that focus on detecting anomalies of incidence at the time of occurrence in a temporal and space-time series. The first method describes the paucity of incidence at the time of occurrence in an ongoing surveillance and is designed to evaluate whether a decline in incidence occurs on the single current day or during the most recent few days. The second method provides an overall assessment of current clustering or paucity of incidence in a space-time series, allowing for several space regions. We illustrate the application of these methods using a subsample of a temporal series of data on the largest dengue outbreak in Taiwan in 2015 since World War II and demonstrate that they are useful to efficiently monitor incoming data for current clustering and paucity of incidence in a temporal and space-time series. In light of the recent global emergence and resurgence of Zika, dengue, and chikungunya infection, these approaching for detecting current anomalies of incidence in the ongoing surveillance of disease are particularly desired and needed.
Collapse
|
6
|
Glaz J, Naus J, Roos M, Wallenstein S. Poisson approximations for the distribution and moments of ordered m-spacings. J Appl Probab 2016. [DOI: 10.2307/3214961] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
This article investigates the accuracy of approximations for the distribution of ordered m-spacings for i.i.d. uniform observations in the interval (0, 1). Several Poisson approximations and a compound Poisson approximation are studied. The result of a simulation study is included to assess the accuracy of these approximations. A numerical procedure for evaluating the moments of the ordered m-spacings is developed and evaluated for the most accurate approximation.
Collapse
|
7
|
Vives C, Caballero P, Álvarez-Dardet C. [Temporal analysis of mortality due to intimate partner violence in Spain]. GACETA SANITARIA 2015; 18:346-50. [PMID: 15498403 DOI: 10.1016/s0213-9111(04)71843-9] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
OBJECTIVES To analyze the temporal distribution of mortality due to violence by intimate partners (VIP) and to identify possible temporal clusters in women deaths by VIP in Spain. METHODS We performed a descriptive epidemiological study based on the VIP deaths included in the database of the Federation of Divorced and Separated Women (1998-2003). The epidemic index (EI) was calculated as the ratio between the actual number of VIP deaths in a given month from January to July 2003 and the median number in the same month in the five preceding years. A Poisson model was used to analyze the distribution by years (1998-2002), seasons, months, and days. Simple regression analysis was performed with three-monthly means. A temporal cluster analysis was also carried out. RESULTS In 2003, the EI of VIP mortality was high in January (EI = 1.6), March (EI = 1.2), May (EI = 1.5), June (EI = 2), and July (EI = 2.5). Compared with 1998 and Sundays, respectively, mortality due to VIP was significantly increased in 2001 (relative risk, RR = 1.52; 95% confidence interval [CI], 1.05-2.20) and on Mondays (RR = 1.77; 95%CI, 1.13-2.76). The regression analyses confirmed an increase between the first three-month period of 1998 and the last three-month period of 2001. There were no differences between seasons and months. No temporal clusters of deaths were detected. CONCLUSIONS VIP is currently an increasing epidemic in Spain with no clear temporal pattern. Political and legal efforts to reduce this problem do not seem to be successful.
Collapse
Affiliation(s)
- Carmen Vives
- Departamento de Salud Pública. Universidad de Alicante. Alicante. España. Red de Investigación sobre Salud y Género
| | | | | |
Collapse
|
8
|
Bornstein C, Winter D, Barnett-Itzhaki Z, David E, Kadri S, Garber M, Amit I. A negative feedback loop of transcription factors specifies alternative dendritic cell chromatin States. Mol Cell 2014; 56:749-62. [PMID: 25453760 DOI: 10.1016/j.molcel.2014.10.014] [Citation(s) in RCA: 46] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2014] [Revised: 08/13/2014] [Accepted: 10/14/2014] [Indexed: 01/11/2023]
Abstract
During hematopoiesis, cells originating from the same stem cell reservoir differentiate into distinct cell types. The mechanisms enabling common progenitors to differentiate into alternative cell fates are not fully understood. Here, we identify cell-fate-determining transcription factors (TFs) governing dendritic cell (DC) development by annotating the enhancer landscapes of the DC lineage. Combining these analyses with detailed overexpression, knockdown, and ChIP-Seq studies, we show that Irf8 functions as a plasmacytoid DC epigenetic and fate-determining TF, regulating massive, cell-specific chromatin changes in thousands of pDC enhancers. Importantly, Irf8 forms a negative feedback loop with Cebpb, a monocyte-derived DC epigenetic fate-determining TF. We show that using this circuit logic, a pulse of TF expression can stably define epigenetic and transcriptional states, regardless of the microenvironment. More broadly, our study proposes a general paradigm that allows closely related cells with a similar set of signal-dependent factors to generate differential and persistent enhancer landscapes.
Collapse
Affiliation(s)
| | - Deborah Winter
- Department of Immunology, Weizmann Institute, Rehovot 76100, Israel
| | | | - Eyal David
- Department of Immunology, Weizmann Institute, Rehovot 76100, Israel
| | - Sabah Kadri
- Broad Institute, 7 Cambridge Center, Cambridge, MA 02142, USA
| | - Manuel Garber
- Program in Bioinformatics and Integrative Biology and Program in Molecular Biology, University of Massachusetts Medical School, Worcester, MA 01605, USA
| | - Ido Amit
- Department of Immunology, Weizmann Institute, Rehovot 76100, Israel.
| |
Collapse
|
9
|
Loohuis LO, Witzel A, Mishra B. Improving Detection of Driver Genes: Power-Law Null Model of Copy Number Variation in Cancer. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2014; 11:1260-1263. [PMID: 26357061 DOI: 10.1109/tcbb.2014.2351805] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
In this paper, we study Copy Number Variation (CNV) data. The underlying process generating CNV segments is generally assumed to be memory-less, giving rise to an exponential distribution of segment lengths. In this paper, we provide evidence from cancer patient data, which suggests that this generative model is too simplistic, and that segment lengths follow a power-law distribution instead. We conjecture a simple preferential attachment generative model that provides the basis for the observed power-law distribution. We then show how an existing statistical method for detecting cancer driver genes can be improved by incorporating the power-law distribution in the null model.
Collapse
|
10
|
Silva IT, Rosales RA, Holanda AJ, Nussenzweig MC, Jankovic M. Identification of chromosomal translocation hotspots via scan statistics. ACTA ACUST UNITED AC 2014; 30:2551-8. [PMID: 24860160 DOI: 10.1093/bioinformatics/btu351] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
MOTIVATION The detection of genomic regions unusually rich in a given pattern is an important undertaking in the analysis of next-generation sequencing data. Recent studies of chromosomal translocations in activated B lymphocytes have identified regions that are frequently translocated to c-myc oncogene. A quantitative method for the identification of translocation hotspots was crucial to this study. Here we improve this analysis by using a simple probabilistic model and the framework provided by scan statistics to define the number and location of translocation breakpoint hotspots. A key feature of our method is that it provides a global chromosome-wide nominal control level to clustering, as opposed to previous methods based on local criteria. While being motivated by a specific application, the detection of unusual clusters is a widespread problem in bioinformatics. We expect our method to be useful in the analysis of data from other experimental approaches such as of ChIP-seq and 4C-seq. RESULTS The analysis of translocations from B lymphocytes with the method described here reveals the presence of longer hotspots when compared with those defined previously. Further, we show that the hotspot size changes substantially in the absence of DNA repair protein 53BP1. When 53BP1 deficiency is combined with overexpression of activation-induced cytidine deaminase, the hotspot length increases even further. These changes are not detected by previous methods that use local significance criteria for clustering. Our method is also able to identify several exclusive translocation hotspots located in genes of known tumor supressors. AVAILABILITY AND IMPLEMENTATION The detection of translocation hotspots is done with hot_scan, a program implemented in R and Perl. Source code and documentation are freely available for download at https://github.com/itojal/hot_scan.
Collapse
Affiliation(s)
- Israel T Silva
- Laboratory of Molecular Immunology, The Rockefeller University, 1230 York Avenue, New York, NY 10065, USA, Departamento de Computação e Matemática, Universidade de São Paulo. Av. Bandeirantes, 3900, Ribeirão Preto, CEP 14049-901 and National Institute of Science and Technology in Stem Cell and Cell Therapy and Center for Cell Based Therapy. Rua Catão Roxo, 2501, Ribeirão Preto, CEP 14051-140, SP, Brazil Laboratory of Molecular Immunology, The Rockefeller University, 1230 York Avenue, New York, NY 10065, USA, Departamento de Computação e Matemática, Universidade de São Paulo. Av. Bandeirantes, 3900, Ribeirão Preto, CEP 14049-901 and National Institute of Science and Technology in Stem Cell and Cell Therapy and Center for Cell Based Therapy. Rua Catão Roxo, 2501, Ribeirão Preto, CEP 14051-140, SP, Brazil
| | - Rafael A Rosales
- Laboratory of Molecular Immunology, The Rockefeller University, 1230 York Avenue, New York, NY 10065, USA, Departamento de Computação e Matemática, Universidade de São Paulo. Av. Bandeirantes, 3900, Ribeirão Preto, CEP 14049-901 and National Institute of Science and Technology in Stem Cell and Cell Therapy and Center for Cell Based Therapy. Rua Catão Roxo, 2501, Ribeirão Preto, CEP 14051-140, SP, Brazil
| | - Adriano J Holanda
- Laboratory of Molecular Immunology, The Rockefeller University, 1230 York Avenue, New York, NY 10065, USA, Departamento de Computação e Matemática, Universidade de São Paulo. Av. Bandeirantes, 3900, Ribeirão Preto, CEP 14049-901 and National Institute of Science and Technology in Stem Cell and Cell Therapy and Center for Cell Based Therapy. Rua Catão Roxo, 2501, Ribeirão Preto, CEP 14051-140, SP, Brazil
| | - Michel C Nussenzweig
- Laboratory of Molecular Immunology, The Rockefeller University, 1230 York Avenue, New York, NY 10065, USA, Departamento de Computação e Matemática, Universidade de São Paulo. Av. Bandeirantes, 3900, Ribeirão Preto, CEP 14049-901 and National Institute of Science and Technology in Stem Cell and Cell Therapy and Center for Cell Based Therapy. Rua Catão Roxo, 2501, Ribeirão Preto, CEP 14051-140, SP, Brazil
| | - Mila Jankovic
- Laboratory of Molecular Immunology, The Rockefeller University, 1230 York Avenue, New York, NY 10065, USA, Departamento de Computação e Matemática, Universidade de São Paulo. Av. Bandeirantes, 3900, Ribeirão Preto, CEP 14049-901 and National Institute of Science and Technology in Stem Cell and Cell Therapy and Center for Cell Based Therapy. Rua Catão Roxo, 2501, Ribeirão Preto, CEP 14051-140, SP, Brazil
| |
Collapse
|
11
|
Wu CC, Shete S, Jo EJ, Xu Y, Lu EY, Chen WV, Amos CI. Whole-genome detection of disease-associated deletions or excess homozygosity in a case-control study of rheumatoid arthritis. Hum Mol Genet 2012; 22:1249-61. [PMID: 23223014 DOI: 10.1093/hmg/dds512] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
Unlike genome-wide association studies, few comprehensive studies of copy number variation's contribution to complex human disease susceptibility have been performed. Copy number variations are abundant in humans and represent one of the least well-studied classes of genetic variants; in addition, known rheumatoid arthritis susceptibility loci explain only a portion of familial clustering. Therefore, we performed a genome-wide study of association between deletion or excess homozygosity and rheumatoid arthritis using high-density 550 K SNP genotype data from a genome-wide association study. We used a genome-wide statistical method that we recently developed to test each contiguous SNP locus between 868 cases and 1194 controls to detect excess homozygosity or deletion variants that influence susceptibility. Our method is designed to detect statistically significant evidence of deletions or homozygosity at individual SNPs for SNP-by-SNP analyses and to combine the information among neighboring SNPs for cluster analyses. In addition to successfully detecting the known deletion variants on major histocompatibility complex, we identified 4.3 and 28 kb clusters on chromosomes 10p and 13q, respectively, which were significant at a Bonferroni-type-corrected 0.05 nominal significant level. Independently, we performed analyses using PennCNV, an algorithm for identifying and cataloging copy numbers for individuals based on a hidden Markov model, and identified cases and controls that had chromosomal segments with copy number <2. Using Fisher's exact test for comparing the numbers of cases and controls with copy number <2 per SNP, we identified 26 significant SNPs (protective; more controls than cases) aggregating on chromosome 14 with P-values <10(-8).
Collapse
Affiliation(s)
- Chih-Chieh Wu
- Department of Epidemiology, The University of Texas MD Anderson Cancer Center, Houston, TX, USA.
| | | | | | | | | | | | | |
Collapse
|
12
|
Ionita-Laza I, Makarov V, Buxbaum JD. Scan-statistic approach identifies clusters of rare disease variants in LRP2, a gene linked and associated with autism spectrum disorders, in three datasets. Am J Hum Genet 2012; 90:1002-13. [PMID: 22578327 DOI: 10.1016/j.ajhg.2012.04.010] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2012] [Revised: 02/27/2012] [Accepted: 04/19/2012] [Indexed: 01/20/2023] Open
Abstract
Cluster-detection approaches, commonly used in epidemiology and astronomy, can be applied in the context of genetic sequence data for the identification of genetic regions significantly enriched with rare disease-risk variants (DRVs). Unlike existing association tests for sequence data, the goal of cluster-detection methods is to localize significant disease mutation clusters within a gene or region of interest. Here, we focus on a chromosome 2q replicated linkage region that is associated with autism spectrum disorder (ASD) and that has been sequenced in three independent datasets. We found that variants in one gene, LRP2, residing on 2q are associated with ASD in two datasets (the combined variable-threshold-test p value is 1.2 × 10(-5)). Using a cluster-detection method, we show that in the discovery and replication datasets, variants associated with ASD cluster preponderantly in 25 kb windows (adjusted p values are p(1) = 0.003 and p(2) = 0.002), and the two windows are highly overlapping. Furthermore, for the third dataset, a 25 kb region similar to those in the other two datasets shows significant evidence of enrichment of rare DRVs. The region implicated by all three studies is involved in ligand binding, suggesting that subtle alterations in either LRP2 expression or LRP2 primary sequence modulate the uptake of LRP2 ligands. BMP4 is a ligand of particular interest given its role in forebrain development, and modest changes in BMP4 binding, which binds to LRP2 near the mutation cluster, might subtly affect development and could lead to autism-associated phenotypes.
Collapse
|
13
|
von Brandenstein M, Depping R, Schäfer E, Dienes HP, Fries JW. Protein kinase C α regulates nuclear pri-microRNA 15a release as part of endothelin signaling. BIOCHIMICA ET BIOPHYSICA ACTA-MOLECULAR CELL RESEARCH 2011; 1813:1793-802. [DOI: 10.1016/j.bbamcr.2011.06.006] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/21/2011] [Revised: 05/12/2011] [Accepted: 06/09/2011] [Indexed: 01/07/2023]
|
14
|
Preston D, Protopapas P, Brodley C. Discovering arbitrary event types in time series. Stat Anal Data Min 2009. [DOI: 10.1002/sam.10060] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
|
15
|
Wu CC, Shete S, Chen WV, Peng B, Lee AT, Ma J, Gregersen PK, Amos CI. Detection of disease-associated deletions in case-control studies using SNP genotypes with application to rheumatoid arthritis. Hum Genet 2009; 126:303-15. [PMID: 19415332 DOI: 10.1007/s00439-009-0672-3] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2009] [Accepted: 04/12/2009] [Indexed: 10/20/2022]
Abstract
Genomic deletions have long been known to play a causative role in microdeletion syndromes. Recent whole-genome genetic studies have shown that deletions can increase the risk for several psychiatric disorders, suggesting that genomic deletions play an important role in the genetic basis of complex traits. However, the association between genomic deletions and common, complex diseases has not yet been systematically investigated in gene mapping studies. Likelihood-based statistical methods for identifying disease-associated deletions have recently been developed for familial studies of parent-offspring trios. The purpose of this study is to develop statistical approaches for detecting genomic deletions associated with complex disease in case-control studies. Our methods are designed to be used with dense single nucleotide polymorphism (SNP) genotypes to detect deletions in large-scale or whole-genome genetic studies. As more and more SNP genotype data for genome-wide association studies become available, development of sophisticated statistical approaches will be needed that use these data. Our proposed statistical methods are designed to be used in SNP-by-SNP analyses and in cluster analyses based on combined evidence from multiple SNPs. We found that these methods are useful for detecting disease-associated deletions and are robust in the presence of linkage disequilibrium using simulated SNP data sets. Furthermore, we applied the proposed statistical methods to SNP genotype data of chromosome 6p for 868 rheumatoid arthritis patients and 1,197 controls from the North American Rheumatoid Arthritis Consortium. We detected disease-associated deletions within the region of human leukocyte antigen in which genomic deletions were previously discovered in rheumatoid arthritis patients.
Collapse
Affiliation(s)
- Chih-Chieh Wu
- Unit 1340, Department of Epidemiology, The University of Texas M. D. Anderson Cancer Center, Houston, TX 77030, USA.
| | | | | | | | | | | | | | | |
Collapse
|
16
|
Nielsen TD, Huang J, Rogers JM, Killingsworth CR, Ideker RE. Epicardial mapping of ventricular fibrillation over the posterior descending artery and left posterior papillary muscle of the swine heart. J Interv Card Electrophysiol 2008; 24:11-7. [PMID: 18839296 DOI: 10.1007/s10840-008-9310-6] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/30/2008] [Accepted: 08/05/2008] [Indexed: 10/21/2022]
Abstract
BACKGROUND Recent studies suggest that during ventricular fibrillation (VF) epicardial vessels may be a site of conduction block and the posterior papillary muscle (PPM) in the left ventricle (LV) may be the location of a "mother rotor." The goal of this study was to obtain evidence to support or refute these possibilities. METHODS Epicardial activation over the posterior LV and right ventricle (RV) was mapped during the first 20 s of electrically induced VF in six open-chest pigs with a 504 electrode plaque covering a 20 cm(2) area centered over the posterior descending artery (PDA). RESULTS The locations of epicardial breakthrough as well as reentry clustered in time and space during VF. Spatially, reentry occurred significantly more frequently over the LV than the RV in all 48 episodes, and breakthrough clustered near the PPM (p < 0.001). Significant temporal clustering occurred in 79% of breakthrough episodes and 100% of reentry episodes. These temporal clusters occurred at different times so that there was significantly less breakthrough when reentry was present (p < 0.0001). Conduction block occurred significantly more frequently near the PDA than elsewhere. CONCLUSIONS The PDA is a site of epicardial block which may contribute to VF maintenance. Epicardial breakthrough clusters near the PPM. Reentry also clusters in space but at a separate site. The fact that breakthrough and reentry cluster at different locations and at different times supports the possibility of a drifting filament at the PPM so that at times reentry is present on the surface but at other times the reentrant wavefront breaks through to the epicardium.
Collapse
Affiliation(s)
- Thomas D Nielsen
- Department of Medicine, University of Alabama at Birmingham, Birmingham, AL, USA
| | | | | | | | | |
Collapse
|
17
|
Liang Y, Kelemen A. Statistical advances and challenges for analyzing correlated high dimensional SNP data in genomic study for complex diseases. STATISTICS SURVEYS 2008. [DOI: 10.1214/07-ss026] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
18
|
Anderson N, Titterington D. A comparison of two statistics for detecting clustering in one dimension. J STAT COMPUT SIM 2007. [DOI: 10.1080/00949659508811699] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
|
19
|
Sun YV, Levin AM, Boerwinkle E, Robertson H, Kardia SLR. A scan statistic for identifying chromosomal patterns of SNP association. Genet Epidemiol 2007; 30:627-35. [PMID: 16858698 DOI: 10.1002/gepi.20173] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
We have developed a single nucleotide polymorphism (SNP) association scan statistic that takes into account the complex distribution of the human genome variation in the identification of chromosomal regions with significant SNP associations. This scan statistic has wide applicability for genetic analysis, whether to identify important chromosomal regions associated with common diseases based on whole-genome SNP association studies or to identify disease susceptibility genes based on dense SNP positional candidate studies. To illustrate this method, we analyzed patterns of SNP associations on chromosome 19 in a large cohort study. Among 2,944 SNPs, we found seven regions that contained clusters of significantly associated SNPs. The average width of these regions was 35 kb with a range of 10-72 kb. We compared the scan statistic results to Fisher's product method using a sliding window approach, and detected 22 regions with significant clusters of SNP associations. The average width of these regions was 131 kb with a range of 10.1-615 kb. Given that the distances between SNPs are not taken into consideration in the sliding window approach, it is likely that a large fraction of these regions represents false positives. However, all seven regions detected by the scan statistic were also detected by the sliding window approach. The linkage disequilibrium (LD) patterns within the seven regions were highly variable indicating that the clusters of SNP associations were not due to LD alone. The scan statistic developed here can be used to make gene-based or region-based SNP inferences about disease association.
Collapse
Affiliation(s)
- Yan V Sun
- Department of Epidemiology, University of Michigan, 611 Church Street, Ann Arbor, MI 48104, USA
| | | | | | | | | |
Collapse
|
20
|
Smeeton N, Wilkinson G. The detection of annual clusters in individual patterns of parasuicide. J Appl Stat 2006. [DOI: 10.1080/02664768800000024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
Affiliation(s)
- Nigel Smeeton
- a General Practice Research Unit , Institute of Psychiatry , London
| | - Greg Wilkinson
- b General Practice Research Unit , Institute of Psychiatry , London
| |
Collapse
|
21
|
|
22
|
Ionita I, Daruwala RS, Mishra B. Mapping tumor-suppressor genes with multipoint statistics from copy-number-variation data. Am J Hum Genet 2006; 79:13-22. [PMID: 16773561 PMCID: PMC1474131 DOI: 10.1086/504354] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2005] [Accepted: 03/17/2006] [Indexed: 11/03/2022] Open
Abstract
Array-based comparative genomic hybridization (arrayCGH) is a microarray-based comparative genomic hybridization technique that has been used to compare tumor genomes with normal genomes, thus providing rapid genomic assays of tumor genomes in terms of copy-number variations of those chromosomal segments that have been gained or lost. When properly interpreted, these assays are likely to shed important light on genes and mechanisms involved in the initiation and progression of cancer. Specifically, chromosomal segments, deleted in one or both copies of the diploid genomes of a group of patients with cancer, point to locations of tumor-suppressor genes (TSGs) implicated in the cancer. In this study, we focused on automatic methods for reliable detection of such genes and their locations, and we devised an efficient statistical algorithm to map TSGs, using a novel multipoint statistical score function. The proposed algorithm estimates the location of TSGs by analyzing segmental deletions (hemi- or homozygous) in the genomes of patients with cancer and the spatial relation of the deleted segments to any specific genomic interval. The algorithm assigns, to an interval of consecutive probes, a multipoint score that parsimoniously captures the underlying biology. It also computes a P value for every putative TSG by using concepts from the theory of scan statistics. Furthermore, it can identify smaller sets of predictive probes that can be used as biomarkers for diagnosis and therapeutics. We validated our method using different simulated artificial data sets and one real data set, and we report encouraging results. We discuss how, with suitable modifications to the underlying statistical model, this algorithm can be applied generally to a wider class of problems (e.g., detection of oncogenes).
Collapse
Affiliation(s)
- Iuliana Ionita
- Courant Institute of Mathematical Sciences, New York, NY 10012, USA
| | | | | |
Collapse
|
23
|
Abstract
We describe two classes of statistics for testing an arbitrary model of disease incidence over time against an alternative model involving a spike (pulse) superimposed on this background. The statistics are each based on taking the maximum of some function comparing observed and expected numbers of events in a window of width w. One approach applies p-values for scan statistics calculated for a constant background rate to this more general problem. For a fixed window, w, the approach gives a simple formula to determine p-values for retrospective analysis, or to sound an alarm for either continuous or grouped prospective data. The latter application involves a new approximation for the distribution of the maximum number of cases in w consecutive intervals. The second approach based on generalized likelihood ratio tests (GLRTs), sounds an alarm for a higher than anticipated rate of events in a scanning window of fixed length, or for window sizes that lie in a region. GLRTs are constructed for continuous observations, for grouped data, or for a sequence of trials. As for GLRTs used in retrospective evaluations, simulation is required to implement the prospective procedure. For grouped surveillance data, we compare by simulation, operating characteristics of the P-scan with fixed windows (both correctly specified and not), the fixed-window GLRT, the variable-window GLRT, and a variant of the CUSUM. The simulations demonstrate a very high correlation between the P-scan and corresponding fixed-window GLRT.
Collapse
Affiliation(s)
- Joseph Naus
- Department of Statistics, Rutgers University of Piscataway, NJ 08854, USA
| | | |
Collapse
|
24
|
Mellmann A, Friedrich AW, Rosenkötter N, Rothgänger J, Karch H, Reintjes R, Harmsen D. Automated DNA sequence-based early warning system for the detection of methicillin-resistant Staphylococcus aureus outbreaks. PLoS Med 2006; 3:e33. [PMID: 16396609 PMCID: PMC1325475 DOI: 10.1371/journal.pmed.0030033] [Citation(s) in RCA: 129] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/12/2005] [Accepted: 10/28/2005] [Indexed: 11/18/2022] Open
Abstract
BACKGROUND The detection of methicillin-resistant Staphylococcus aureus (MRSA) usually requires the implementation of often rigorous infection-control measures. Prompt identification of an MRSA epidemic is crucial for the control of an outbreak. In this study we evaluated various early warning algorithms for the detection of an MRSA cluster. METHODS AND FINDINGS Between 1998 and 2003, 557 non-replicate MRSA strains were collected from staff and patients admitted to a German tertiary-care university hospital. The repeat region of the S. aureus protein A (spa) gene in each of these strains was sequenced. Using epidemiological and typing information for the period 1998-2002 as reference data, clusters in 2003 were determined by temporal-scan test statistics. Various early warning algorithms (frequency, clonal, and infection control professionals [ICP] alerts) were tested in a prospective analysis for the year 2003. In addition, a newly implemented automated clonal alert system of the Ridom StaphType software was evaluated. A total of 549 of 557 MRSA were typeable using spa sequencing. When analyzed using scan test statistics, 42 out of 175 MRSA in 2003 formed 13 significant clusters (p < 0.05). These clusters were used as the "gold standard" to evaluate the various algorithms. Clonal alerts (spa typing and epidemiological data) were 100% sensitive and 95.2% specific. Frequency (epidemiological data only) and ICP alerts were 100% and 62.1% sensitive and 47.2% and 97.3% specific, respectively. The difference in specificity between clonal and ICP alerts was not significant. Both methods exhibited a positive predictive value above 80%. CONCLUSIONS Rapid MRSA outbreak detection, based on epidemiological and spa typing data, is a suitable alternative for classical approaches and can assist in the identification of potential sources of infection.
Collapse
Affiliation(s)
| | | | | | | | - Helge Karch
- 1Institute for Hygiene, University Hospital Münster, Münster, Germany
| | - Ralf Reintjes
- 2Hamburg University of Applied Sciences, Hamburg, Germany
| | - Dag Harmsen
- 4Department of Periodontology, University Hospital Münster, Münster, Germany
| |
Collapse
|
25
|
Bonaldi C, Daures JP, Molinari N. Authors' Reply. Biometrics 2003. [DOI: 10.1111/1541-0420.00085_2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
26
|
|
27
|
Reiter A, Saussele S, Grimwade D, Wiemels JL, Segal MR, Lafage-Pochitaloff M, Walz C, Weisser A, Hochhaus A, Willer A, Reichert A, Büchner T, Lengfelder E, Hehlmann R, Cross NCP. Genomic anatomy of the specific reciprocal translocation t(15;17) in acute promyelocytic leukemia. Genes Chromosomes Cancer 2003; 36:175-88. [PMID: 12508246 DOI: 10.1002/gcc.10154] [Citation(s) in RCA: 54] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023] Open
Abstract
The genomic breakpoints in the t(15;17)(q22;q21), associated with acute promyelocytic leukemia (APL), are known to occur within three different PML breakpoint cluster regions (bcr) on chromosome 15 and within RARA intron 2 on chromosome 17; however, the precise mechanism by which this translocation arises is unclear. To clarify this mechanism, we (i). assembled the sequence of RARA intron 2, (ii). amplified and sequenced the genomic PML-RARA junction sequences from 37 APL patients, and (iii). amplified and sequenced the reverse RARA-PML genomic fusion in 29 of these cases. Three significant breakpoint microclusters within RARA intron 2 were identified, suggesting that sequence-associated or structural factors play a role in the formation of the t(15;17). There was no evidence that the location of a breakpoint in PML had any relationship to the location of the corresponding breakpoint in RARA. Although some sequence motifs previously implicated in illegitimate recombinations were found in the microcluster regions, these associations were not significant. Comparison of forward and reverse genomic junctions revealed microhomologies, deletions, and/or duplications of either gene in all but one case, in which a complex rearrangement with inversion of the PML-derived sequence was found. These findings are consistent with the hypothesis that the t(15;17) occurs by nonhomologous recombination of DNA after processing of the double-strand breaks by a dysfunctional DNA damage-repair mechanism.
Collapse
MESH Headings
- Chromosome Breakage/genetics
- Chromosome Mapping
- Chromosomes, Human, Pair 15/genetics
- Chromosomes, Human, Pair 17/genetics
- Cloning, Molecular
- Cytogenetic Analysis/methods
- Humans
- Introns/genetics
- Leukemia, Promyelocytic, Acute/genetics
- Mutagenesis, Insertional/genetics
- Neoplasm Proteins/genetics
- Oncogene Proteins, Fusion/genetics
- Receptors, Retinoic Acid/genetics
- Retinoic Acid Receptor alpha
- Reverse Transcriptase Polymerase Chain Reaction
- Sequence Analysis, DNA
- Translocation, Genetic/genetics
Collapse
Affiliation(s)
- Andreas Reiter
- III. Medizinische Universitätsklinik, Klinikum Mannheim, Fakultät für Klinische Medizin der Universität Heidelberg, Germany
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
28
|
|
29
|
Wissow LS, Walkup J, Barlow A, Reid R, Kane S. Cluster and regional influences on suicide in a Southwestern American Indian tribe. Soc Sci Med 2001; 53:1115-24. [PMID: 11556603 DOI: 10.1016/s0277-9536(00)00405-6] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
Suicide is the second leading cause of death among American Indian youth. Elevated rates of suicide in Indian communities have been attributed both to outbreaks and to regional trends. We assessed the contribution of these two factors for a single tribe, and attempted to define a profile of individuals at risk. Data came from the tribe's registry of suicide attempts and completions for 1990-1993 and analysis of death certificates for the period 1985-1996. Using combined tribal and death certificate data, the average annual (age-adjusted) rate of completed suicide among tribal members was 44.7/100,000 for 1990-1993. Within the 45 suicide deaths and serious attempts in this time period, we identified one grouping of seven cases taking place in a 40-day period. All seven involved hanging and youth (13-28 years old). Using death certificate data alone, the average annual rate of suicide death for non-natives in the surrounding county in the period 1985-1996 was 22.7/100,000. Age-adjusted to the county population, the tribal rate for the same period was not significantly different (24.6/100,000). Tribal and county suicide patterns differed by age distribution and method but not by gender. We concluded that both regional trends and clustering contribute to suicide in this community. Further prevention efforts may need to focus on both unique tribal characteristics and shared factors among non-native neighbors.
Collapse
Affiliation(s)
- L S Wissow
- Faculty of Social and Behavioral Sciences, School of Hygiene and Public Health, Johns Hopkins University, Baltimore, MD 21205, USA.
| | | | | | | | | |
Collapse
|
30
|
|
31
|
Abstract
Due to their contagious or point-source nature, ill-health and diseases often cluster in time and/or space. Overlooking this characteristic can lead to a delay in the control or eradication of the health problem. In addition to potentially expediting control efforts, cluster identification techniques enable the researcher or health-care official to identify and adjust for confounding factors and to generate new hypotheses regarding disease transmission. This paper examines a variety of temporal and spatial clustering techniques and focuses on those which have been reported recently in the veterinary literature.
Collapse
Affiliation(s)
- T E Carpenter
- Department of Medicine and Epidemiology, School of Veterinary Medicine, University of California, Davis, CA 95616, USA.
| |
Collapse
|
32
|
Grimson RC, Mendelsohn S. A method for detecting current temporal clusters of toxic events through data monitoring by poison control centers. JOURNAL OF TOXICOLOGY. CLINICAL TOXICOLOGY 2001; 38:761-5. [PMID: 11192463 DOI: 10.1081/clt-100102389] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/03/2022]
Abstract
BACKGROUND Poison control centers have become a widely recognized source of data for chemical and other environmental exposures. Concurrently, increased emphasis on early identification of temporal clustering of toxic exposures has stimulated development of sensitive statistical methods to detect clusters at the time of occurrence. METHOD This paper discusses the scan test which has been applied to retrospective data to detect carbon monoxide poisoning clusters. We propose a more sensitive method, based on the binomial distribution to detect current clusters of only 1 or more days duration during the ongoing data collection and monitoring process. RESULTS Applied to daily carbon monoxide poisoning incidence data on which the scan test has been applied, the new test for current clustering evidently has much more power. For retrospective identification of previous clusters lasting more than 1 day, the scan test is recommended. However, for previous daily clusters, a third method is recommended. CONCLUSION Certain toxic events such as carbon monoxide poisoning occasionally occur in daily clusters, or in clusters lasting a few days. Timely detection of clusters requires application of early intervention strategies fostered by sensitive statistical methods of detection, as presented here.
Collapse
Affiliation(s)
- R C Grimson
- Department of Preventive Medicine, Health Sciences Center, SUNY at Stony Brook, New York 11794-8036, USA.
| | | |
Collapse
|
33
|
Vredevoe LK, Richter PJ, Madigan JE, Kimsey RB. Association of Ixodes pacificus (Acari: ixodidae) with the spatial and temporal distribution of equine granulocytic ehrlichiosis in California. JOURNAL OF MEDICAL ENTOMOLOGY 1999; 36:551-561. [PMID: 10534948 DOI: 10.1093/jmedent/36.5.551] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
This study was conducted to determine if the biology of certain ticks associated with horses regulates the spatial and temporal distribution of equine granulocytic ehrlichiosis (EGE) in California north of Monterey County. We compared the spatial and temporal distribution of EGE cases with the seasons of activity and life histories of ticks that infest horses. Spatially, cases collected from equine veterinarians clustered around each other in a manner different from the way in which control cities of practice were distributed, with foci limited to the Sierra Nevada and coastal foothills. Cases also clustered seasonally: most were diagnosed between November and April. The spatial and temporal pattern of EGE cases closely parallels the well-characterized life history and distribution of Ixodes pacificus Cooley & Kohls, but not other ticks commonly associated with horses. Building on previous studies, there is compelling evidence that this tick has the vectorial capacity to transmit Ehrlichia equi to horses. Based on the life history and distribution of I. pacificus in relation to EGE cases, we reason that this tick is the only biologically plausible vector of E. equi in California, and provide evidence for a tightly linked association between I. pacificus and the epidemiology of EGE.
Collapse
Affiliation(s)
- L K Vredevoe
- Biological Sciences Department, California Polytechnic State University, San Luis Obispo 93407, USA
| | | | | | | |
Collapse
|
34
|
Huffer FW, Lin CT. Approximating the Distribution of the Scan Statistic Using Moments of the Number of Clumps. J Am Stat Assoc 1997. [DOI: 10.1080/01621459.1997.10473668] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
|
35
|
Nelson MA, Thompson FH, Emerson J, Aickin M, Adair L, Trent JM, Leong SP, Taetle R. Clinical implications of cytogenetic abnormalities in melanoma. Surg Clin North Am 1996; 76:1257-71. [PMID: 8977549 DOI: 10.1016/s0039-6109(05)70513-2] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023]
Abstract
Unlike leukemia, in which specific reciprocal translocations are frequently observed, melanomas involve complex recurring chromosome anomalies. Analysis of the constituted genome of melanoma patients should identify cancer susceptibility genes and at-risk individuals in families with a history of melanoma. The first of these genes to be cloned is the cell cycle regulatory protein inhibitor--the p16 gene-- and a second gene locus for melanoma predisposition has been linked to the chromosome 1p36 band region. Detection of the most common somatic genetic alterations in melanoma enhances our understanding of molecular mechanisms of melanoma development and may lead to genetic markers in melanoma. Some alterations may be used to identify interesting subpopulations. Others may be of prognostic value when they are considered in tandem with clinical data.
Collapse
Affiliation(s)
- M A Nelson
- Department of Pathology, Arizona Cancer Center, University of Arizona, Tucson, USA
| | | | | | | | | | | | | | | |
Collapse
|
36
|
Jacquez GM, Grimson R, Waller LA, Wartenberg D. The Analysis of Disease Clusters, Part II: Introduction to Techniques. Infect Control Hosp Epidemiol 1996. [DOI: 10.2307/30141142] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
|
37
|
Wikse SE, Kinsel ML, Field RW, Holland PS. Investigating perinatal calf mortality in beef herds. Vet Clin North Am Food Anim Pract 1994; 10:147-66. [PMID: 8199918 DOI: 10.1016/s0749-0720(15)30595-8] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023] Open
Abstract
This article summarizes the information presented in this issue to identify the reasons for a perinatal calf mortality (PCM) problem through a risk-factor-evaluation approach. The benefits of necropsies in PCM investigations are discussed, and the gross lesions of the major causes of PCM are described. The use of 2 x 2 tables to calculate odds ratios and the Chi-square method to test the validity of the odds ratios are presented as methods to determine suspected risk factors. The article concludes with a discussion of the management plan used to solve a PCM problem.
Collapse
Affiliation(s)
- S E Wikse
- Department of Large Animal Medicine and Surgery, Texas A&M University, College of Veterinary Medicine, College Station
| | | | | | | |
Collapse
|
38
|
Abstract
This article investigates the accuracy of approximations for the distribution of ordered m-spacings for i.i.d. uniform observations in the interval (0, 1). Several Poisson approximations and a compound Poisson approximation are studied. The result of a simulation study is included to assess the accuracy of these approximations. A numerical procedure for evaluating the moments of the ordered m-spacings is developed and evaluated for the most accurate approximation.
Collapse
|
39
|
Abstract
The scan statistic is used to test the hypothesis that the observed events occur at random (uniformly distributed) in time or space versus the hypothesis that they cluster within a moving window of size w. To implement the testing procedure based on the scan statistic its tail probabilities have to be effectively evaluated. In this article a survey of results on the approximations of the distribution of the scan statistic and its moments is presented. Numerical results comparing these approximations are also given. Numerous references with applications in epidemiological studies using the scan statistics are mentioned. Related scan statistics that have been used in many other interesting applications are listed in the reference section. The article concludes with the presentation of unsolved problems related to the scan statistic.
Collapse
Affiliation(s)
- J Glaz
- Department of Statistics, University of Connecticut, Storrs 06269-3120
| |
Collapse
|
40
|
Abstract
The scan statistic is the maximum number of events in an interval of fixed length w as the subinterval moves over the entire time frame. Previous research derived the null distribution of the scan statistic under the conditional model which assumed that the total number of events was fixed, and under the unconditional model which let the total number of events be a random variable. This paper derives approximations for the power of the scan test for a pulse alternative. Under this alternative, the relative risk of disease on a subinterval (tau, tau + w), tau unknown, is theta-fold as high as it is for other subintervals of length w. Two sets of approximations are given for each model. The first approximation gives highly accurate results, but requires use of a personal computer. The second procedure can be performed on a hand-held calculator and appears very accurate for the cases examined.
Collapse
Affiliation(s)
- S Wallenstein
- Department of Biomathematics, Mount Sinai School of Medicine, New York, NY 10029
| | | | | |
Collapse
|
41
|
Sahu SK, Bendel RB, Sison CP. Effect of relative risk and cluster configuration on the power of the one-dimensional scan statistic. Stat Med 1993; 12:1853-65. [PMID: 8272666 DOI: 10.1002/sim.4780121912] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
Abstract
The scan test for clustering in time is based on the maximum number of events in an interval (window) of width w as the window moves across the entire time frame. Power estimates of the scan statistic are simulated for a variety of epidemiologically motivated situations. Two cluster configurations are used: a rectangular pulse, and a triangular pulse designed to emulate environmental contamination. For a rectangular pulse, the relative risk R of disease in the cluster region is R-fold as high as it is for the background region. The power is strongly influenced by the sample size, the relative risk, and the width or duration of the cluster region, whereas the effect of the cluster configuration is small. Using a 5 per cent significance level, a relative risk of 4, a standardized cluster duration of 0.10, a relative window width of 1.5, and a (non-random) sample size of 50, the simulated power is approximately 80 per cent, indicating that the minimum sample size in the cluster region for adequate power is in the 12-32 range for values of the parameters used in this study.
Collapse
Affiliation(s)
- S K Sahu
- Department of Statistics, University of Connecticut, Storrs 06269-4120
| | | | | |
Collapse
|
42
|
Abstract
This paper presents combinatorial (exact) methods that are useful in the analysis of disease cluster data obtained from small environments, such as buildings and neighbourhoods. Maxwell-Boltzmann and Fermi-Dirac occupancy models are compared in terms of appropriateness of representation of disease incidence patterns (space and/or time) in these environments. The methods are illustrated by a statistical analysis of the incidence pattern of bone fractures in a setting wherein fracture clustering was alleged to be occurring. One of the methodological results derived in this paper is the exact distribution of the maximum cell frequency in occupancy models.
Collapse
Affiliation(s)
- R C Grimson
- Department of Preventive Medicine, School of Medicine, SUNY at Stony Brook 11794-8036
| |
Collapse
|
43
|
|
44
|
Abstract
Statistical approaches help in the determination of significant configurations in protein and nucleic acid sequence data. Three recent statistical methods are discussed: (i) score-based sequence analysis that provides a means for characterizing anomalies in local sequence text and for evaluating sequence comparisons; (ii) quantile distributions of amino acid usage that reveal general compositional biases in proteins and evolutionary relations; and (iii) r-scan statistics that can be applied to the analysis of spacings of sequence markers.
Collapse
Affiliation(s)
- S Karlin
- Department of Mathematics, Stanford University, CA 94305
| | | |
Collapse
|
45
|
Grimson RC, Aldrich TE, Drane JW. Clustering in sparse data and an analysis of rhabdomyosarcoma incidence. Stat Med 1992; 11:761-8. [PMID: 1594815 DOI: 10.1002/sim.4780110607] [Citation(s) in RCA: 20] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
Time series of epidemiologic events often contain periods of atypically low or high frequency. Correspondingly, for quite rate diseases there occur instances of long vacuous durations interrupted noticeably by periods of some disease activity. A recent community-based observation of the incidence of rhabdomyosarcoma (RMS), and an investigation of it, yielded sparse data of this general description. We introduce a combinatorial test for patchy time series and apply it to the RMS data. We comment on the prevalent practice of post hoc data analysis of alleged clusters, and on scale effects.
Collapse
Affiliation(s)
- R C Grimson
- Department of Preventive Medicine, SUNY, Stony Brook 11794
| | | | | |
Collapse
|
46
|
Hryhorczuk DO, Frateschi LJ, Lipscomb JW, Zhang R. Use of the scan statistic to detect temporal clustering of poisonings. JOURNAL OF TOXICOLOGY. CLINICAL TOXICOLOGY 1992; 30:459-65. [PMID: 1512818 DOI: 10.3109/15563659209021560] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
The purpose of this study is to explore the use of the scan test to detect temporal clustering of poison control center data. We applied the scan test to a computerized data set consisting of all cases of carbon monoxide poisoning reported to our Poison Control Center in 1988. Using a time window of three days, the scan test identified three statistically significant temporal clusters. A review of case and climatologic data revealed that two of the clusters were due to family outbreaks and one was due to a sudden cold spell. We conclude that the scan test is a potentially useful surveillance tool which can be easily applied to poison control center data.
Collapse
Affiliation(s)
- D O Hryhorczuk
- Rush-Presbyterian-St. Luke's Poison Control Center, Cook County Hospital, University of Illinois, Chicago
| | | | | | | |
Collapse
|
47
|
Gail MH. A bibliography and comments on the use of statistical models in epidemiology in the 1980s. Stat Med 1991; 10:1819-85. [PMID: 1805315 DOI: 10.1002/sim.4780101204] [Citation(s) in RCA: 26] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
This paper reviews developments in statistical modelling in epidemiology in the 1980's, with emphasis on cohort and case-control studies. The central roles of the logistic and proportional hazard models are highlighted, and it is shown how these models lead to a deeper understanding of classical designs and methods of analysis as well as to efficient new designs and analytical procedures. The important area of model misspecification is discussed, including the problems of omitted latent structure, mis-modelling of available measurements, missing data and errors in measurements. Various designs motivated by the logistic model are illustrated numerically, and designs based on the proportional hazards model are discussed, as are papers on sample size determination. There are brief introductions to the literature on other topics, including attributable risk, disease clustering, family studies and genetics, analysis of disease incidence data, infectious disease, longitudinal data, screening and miscellaneous related topics in statistics. An extensive bibliography is indexed according to the outline of the paper.
Collapse
Affiliation(s)
- M H Gail
- National Cancer Institute, Rockville, Maryland 20892
| |
Collapse
|
48
|
Karlin S, Macken C. Some Statistical Problems in the Assessment of Inhomogeneities of DNA Sequence Data. J Am Stat Assoc 1991. [DOI: 10.1080/01621459.1991.10475001] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
|
49
|
|
50
|
Abstract
A straightforward test for detecting clustering in patterns of parasuicide in individual patients is described and applied to data from the Edinburgh Regional Poisoning Treatment Centre.
Collapse
Affiliation(s)
- N Smeeton
- General Practice Research Unit, Institute of Psychiatry, London
| | | |
Collapse
|