1
|
Tang X, Li L. Multivariate Temporal Point Process Regression. J Am Stat Assoc 2021; 118:830-845. [PMID: 37519438 PMCID: PMC10373792 DOI: 10.1080/01621459.2021.1955690] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2020] [Revised: 05/23/2021] [Accepted: 07/05/2021] [Indexed: 10/20/2022]
Abstract
Point process modeling is gaining increasing attention, as point process type data are emerging in a large variety of scientific applications. In this article, motivated by a neuronal spike trains study, we propose a novel point process regression model, where both the response and the predictor can be a high-dimensional point process. We model the predictor effects through the conditional intensities using a set of basis transferring functions in a convolutional fashion. We organize the corresponding transferring coefficients in the form of a three-way tensor, then impose the low-rank, sparsity, and subgroup structures on this coefficient tensor. These structures help reduce the dimensionality, integrate information across different individual processes, and facilitate the interpretation. We develop a highly scalable optimization algorithm for parameter estimation. We derive the large sample error bound for the recovered coefficient tensor, and establish the subgroup identification consistency, while allowing the dimension of the multivariate point process to diverge. We demonstrate the efficacy of our method through both simulations and a cross-area neuronal spike trains analysis in a sensory cortex study.
Collapse
Affiliation(s)
- Xiwei Tang
- Department of Statistics, University of Virginia, Charlottesville, VA
| | - Lexin Li
- Department of Biostatistics and Epidemiology, University of California, Berkeley, CA
| |
Collapse
|
2
|
Garber MD, McCullough LE, Mooney SJ, Kramer MR, Watkins KE, Lobelo RF, Flanders WD. At-risk-measure Sampling in Case-Control Studies with Aggregated Data. Epidemiology 2021; 32:101-110. [PMID: 33093327 PMCID: PMC7707160 DOI: 10.1097/ede.0000000000001268] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2020] [Accepted: 09/23/2020] [Indexed: 11/26/2022]
Abstract
Transient exposures are difficult to measure in epidemiologic studies, especially when both the status of being at risk for an outcome and the exposure change over time and space, as when measuring built-environment risk on transportation injury. Contemporary "big data" generated by mobile sensors can improve measurement of transient exposures. Exposure information generated by these devices typically only samples the experience of the target cohort, so a case-control framework may be useful. However, for anonymity, the data may not be available by individual, precluding a case-crossover approach. We present a method called at-risk-measure sampling. Its goal is to estimate the denominator of an incidence rate ratio (exposed to unexposed measure of the at-risk experience) given an aggregated summary of the at-risk measure from a cohort. Rather than sampling individuals or locations, the method samples the measure of the at-risk experience. Specifically, the method as presented samples person-distance and person-events summarized by location. It is illustrated with data from a mobile app used to record bicycling. The method extends an established case-control sampling principle: sample the at-risk experience of a cohort study such that the sampled exposure distribution approximates that of the cohort. It is distinct from density sampling in that the sample remains in the form of the at-risk measure, which may be continuous, such as person-time or person-distance. This aspect may be both logistically and statistically efficient if such a sample is already available, for example from big-data sources like aggregated mobile-sensor data.
Collapse
Affiliation(s)
- Michael D. Garber
- Department of Epidemiology, Rollins School of Public Health, Emory University, Atlanta, GA
| | - Lauren E. McCullough
- Department of Epidemiology, Rollins School of Public Health, Emory University, Atlanta, GA
| | - Stephen J. Mooney
- Department of Epidemiology, School of Public Health, University of Washington, Seattle, WA
- Harborview Injury Prevention & Research Center, University of Washington, Seattle, WA
| | - Michael R. Kramer
- Department of Epidemiology, Rollins School of Public Health, Emory University, Atlanta, GA
| | - Kari E. Watkins
- School of Civil and Environmental Engineering, Georgia Institute of Technology, Atlanta, GA
| | - R.L. Felipe Lobelo
- Hubert Department of Global Health, Rollins School of Public Health, Emory University, Atlanta, GA
| | - W. Dana Flanders
- Department of Epidemiology, Rollins School of Public Health, Emory University, Atlanta, GA
- Department of Biostatistics and Bioinformatics, Rollins School of Public Health, Emory University, Atlanta, GA
| |
Collapse
|
3
|
Gamerman D. Spatiotemporal point processes: regression, model specifications and future directions. BRAZ J PROBAB STAT 2019. [DOI: 10.1214/19-bjps444] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
4
|
Hagar Y, Hayden M, Wiedinmyer C, Dukic V. Comparison of Models Analyzing a Small Number of Observed Meningitis Cases in Navrongo, Ghana. JOURNAL OF AGRICULTURAL, BIOLOGICAL, AND ENVIRONMENTAL STATISTICS 2017; 22:76-104. [PMID: 38178919 PMCID: PMC10766423 DOI: 10.1007/s13253-016-0270-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/22/2016] [Accepted: 11/17/2016] [Indexed: 01/06/2024]
Abstract
The "meningitis belt" is a region in sub-Saharan Africa where annual outbreaks of meningitis occur, with epidemics observed cyclically. While we know that meningitis is heavily dependent on seasonal trends, the exact pathways for contracting the disease are not fully understood and warrant further investigation. Most previous approaches have used large sample inference to assess impacts of weather on meningitis rates. However, in the case of rare events, the validity of such assumptions is uncertain. This work examines the meningitis trends in the context of rare events, with the specific objective of quantifying the underlying seasonal patterns in meningitis rates. We compare three main classes of models: the Poisson generalized linear model, the Poisson generalized additive model, and a Bayesian hazard model extended to accommodate count data and a changing at-risk population. We compare the accuracy and robustness of the models through the bias, RMSE, and standard deviation of the estimators, and also provide a detailed case study of meningitis patterns for data collected in Navrongo, Ghana.
Collapse
Affiliation(s)
- Y Hagar
- Applied Mathematics, University of Colorado at Boulder, Boulder, Colorado, USA
| | - M Hayden
- National Center of Atmospheric Research (NCAR), Boulder, Colorado, USA
| | - C Wiedinmyer
- National Center of Atmospheric Research (NCAR), Boulder, Colorado, USA
| | - V Dukic
- Applied Mathematics, University of Colorado at Boulder, Boulder, Colorado, USA
| |
Collapse
|
5
|
Pinto Junior JA, Gamerman D, Paez MS, Fonseca Alves RH. Point pattern analysis with spatially varying covariate effects, applied to the study of cerebrovascular deaths. Stat Med 2014; 34:1214-26. [PMID: 25534815 DOI: 10.1002/sim.6389] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2014] [Revised: 11/18/2014] [Accepted: 11/24/2014] [Indexed: 11/11/2022]
Abstract
This article proposes a modeling approach for handling spatial heterogeneity present in the study of the geographical pattern of deaths due to cerebrovascular disease.The framework involvesa point pattern analysis with components exhibiting spatial variation. Preliminary studies indicate that mortality of this disease and the effect of relevant covariates do not exhibit uniform geographic distribution. Our model extends a previously proposed model in the literature that uses spatial and non-spatial variables by allowing for spatial variation of the effect of non-spatial covariates. A number of relative risk indicators are derived by comparing different covariate levels, different geographic locations, or both. The methodology is applied to the study of the geographical death pattern of cerebrovascular deaths in the city of Rio de Janeiro. The results compare well against existing alternatives, including fixed covariate effects. Our model is able to capture and highlight important data information that would not be noticed otherwise, providing information that is required for appropriate health decision-making.
Collapse
|
6
|
Chang X, Waagepetersen R, Yu H, Ma X, Holford TR, Wang R, Guan Y. Disease risk estimation by combining case-control data with aggregated information on the population at risk. Biometrics 2014; 71:114-121. [PMID: 25351292 DOI: 10.1111/biom.12256] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2013] [Revised: 07/01/2014] [Accepted: 09/01/2014] [Indexed: 11/28/2022]
Abstract
We propose a novel statistical framework by supplementing case-control data with summary statistics on the population at risk for a subset of risk factors. Our approach is to first form two unbiased estimating equations, one based on the case-control data and the other on both the case data and the summary statistics, and then optimally combine them to derive another estimating equation to be used for the estimation. The proposed method is computationally simple and more efficient than standard approaches based on case-control data alone. We also establish asymptotic properties of the resulting estimator, and investigate its finite-sample performance through simulation. As a substantive application, we apply the proposed method to investigate risk factors for endometrial cancer, by using data from a recently completed population-based case-control study and summary statistics from the Behavioral Risk Factor Surveillance System, the Population Estimates Program of the US Census Bureau, and the Connecticut Department of Transportation.
Collapse
Affiliation(s)
- Xiaohui Chang
- College of Business, Oregon State University, Corvallis, Oregon, U.S.A
| | - Rasmus Waagepetersen
- Department of Mathematical Sciences, Aalborg University, DK-9220 Aalborg, Denmark
| | - Herbert Yu
- Epidemiology Program, University of Hawaii Cancer Center, Honolulu, Hawaii, U.S.A
| | - Xiaomei Ma
- Yale School of Public Health, New Haven, Connecticut, U.S.A
| | | | - Rong Wang
- Yale School of Public Health, New Haven, Connecticut, U.S.A
| | - Yongtao Guan
- Department of Management Science, University of Miami, Coral Gables, Florida, U.S.A
| |
Collapse
|
7
|
Huang H, Ma X, Waagepetersen R, Holford TR, Wang R, Risch H, Mueller L, Guan Y. A new estimation approach for combining epidemiological data from multiple sources. J Am Stat Assoc 2014; 109:11-23. [PMID: 24683281 PMCID: PMC3964681 DOI: 10.1080/01621459.2013.870904] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
Abstract
We propose a novel two-step procedure to combine epidemiological data obtained from diverse sources with the aim to quantify risk factors affecting the probability that an individual develops certain disease such as cancer. In the first step we derive all possible unbiased estimating functions based on a group of cases and a group of controls each time. In the second step, we combine these estimating functions efficiently in order to make full use of the information contained in data. Our approach is computationally simple and flexible. We illustrate its efficacy through simulation and apply it to investigate pancreatic cancer risks based on data obtained from the Connecticut Tumor Registry, a population-based case-control study, and the Behavioral Risk Factor Surveillance System which is a state-based system of health surveys.
Collapse
Affiliation(s)
- Hui Huang
- Department of Management Science, University of Miami, Coral Gables, FL 33124
| | - Xiaomei Ma
- Yale School of Public Health, New Haven, CT 06520
| | - Rasmus Waagepetersen
- Department of Mathematical Sciences, Aalborg University, Fredrik Bajersvej 7G, DK-9220 Aalborg, Denmark
| | | | - Rong Wang
- Yale School of Public Health, New Haven, CT 06520
| | - Harvey Risch
- Yale School of Public Health, New Haven, CT 06520
| | - Lloyd Mueller
- Connecticut Department of Public Health, 410 Capitol Avenue, MS# 11HCQ, Hartford, CT 06134
| | - Yongtao Guan
- Department of Management Science, University of Miami, Coral Gables, FL 33124
| |
Collapse
|
8
|
Rathbun SL. Optimal estimation of Poisson intensity with partially observed covariates. Biometrika 2012. [DOI: 10.1093/biomet/ass069] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
|
9
|
Abstract
A typical recurrent event dataset consists of an often large number of recurrent event processes, each of which contains multiple event times observed from an individual during a follow-up period. Such data have become increasingly available in medical and epidemiological studies. In this article, we introduce novel procedures to conduct second-order analysis for a flexible class of semiparametric recurrent event processes. Such an analysis can provide useful information regarding the dependence structure within each recurrent event process. Specifically, we will use the proposed procedures to test whether the individual recurrent event processes are all Poisson processes and to suggest sensible alternative models for them if they are not. We apply these procedures to a well-known recurrent event dataset on chronic granulomatous disease and an epidemiological dataset on meningococcal disease cases in Merseyside, United Kingdom to illustrate their practical value.
Collapse
Affiliation(s)
- Yongtao Guan
- Division of Biostatistics, Yale University, New Haven, Connecticut 06520, USA.
| |
Collapse
|