1
|
Engebretsen S, Rø G, de Blasio BF. A compelling demonstration of why traditional statistical regression models cannot be used to identify risk factors from case data on infectious diseases: a simulation study. BMC Med Res Methodol 2022; 22:146. [PMID: 35596137 PMCID: PMC9123765 DOI: 10.1186/s12874-022-01565-1] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2021] [Accepted: 03/03/2022] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Regression models are often used to explain the relative risk of infectious diseases among groups. For example, overrepresentation of immigrants among COVID-19 cases has been found in multiple countries. Several studies apply regression models to investigate whether different risk factors can explain this overrepresentation among immigrants without considering dependence between the cases. METHODS We study the appropriateness of traditional statistical regression methods for identifying risk factors for infectious diseases, by a simulation study. We model infectious disease spread by a simple, population-structured version of an SIR (susceptible-infected-recovered)-model, which is one of the most famous and well-established models for infectious disease spread. The population is thus divided into different sub-groups. We vary the contact structure between the sub-groups of the population. We analyse the relation between individual-level risk of infection and group-level relative risk. We analyse whether Poisson regression estimators can capture the true, underlying parameters of transmission. We assess both the quantitative and qualitative accuracy of the estimated regression coefficients. RESULTS We illustrate that there is no clear relationship between differences in individual characteristics and group-level overrepresentation -small differences on the individual level can result in arbitrarily high overrepresentation. We demonstrate that individual risk of infection cannot be properly defined without simultaneous specification of the infection level of the population. We argue that the estimated regression coefficients are not interpretable and show that it is not possible to adjust for other variables by standard regression methods. Finally, we illustrate that regression models can result in the significance of variables unrelated to infection risk in the constructed simulation example (e.g. ethnicity), particularly when a large proportion of contacts is within the same group. CONCLUSIONS Traditional regression models which are valid for modelling risk between groups for non-communicable diseases are not valid for infectious diseases. By applying such methods to identify risk factors of infectious diseases, one risks ending up with wrong conclusions. Output from such analyses should therefore be treated with great caution.
Collapse
Affiliation(s)
| | - Gunnar Rø
- Department of Method Development and Analytics, Norwegian Institute of Public Health, Oslo, Norway
| | - Birgitte Freiesleben de Blasio
- Department of Method Development and Analytics, Norwegian Institute of Public Health, Oslo, Norway
- Oslo Centre for Biostatistics and Epidemiology, Department of Biostatistics, Institute of Basic Medical Sciences, University of Oslo, Oslo, Norway
| |
Collapse
|
2
|
Domenech de Cellès M, Wong A, Andrea Barrero Guevara L, Rohani P. Immunological heterogeneity informs estimation of the durability of vaccine protection. J R Soc Interface 2022; 19:20220070. [PMID: 35611620 PMCID: PMC9131131 DOI: 10.1098/rsif.2022.0070] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2022] [Accepted: 04/26/2022] [Indexed: 11/30/2022] Open
Abstract
Deciphering the properties of vaccines against an emerging pathogen is essential for optimizing immunization strategies. Early after vaccine roll-out, however, uncertainties about vaccine immunity raise the question of how much time is needed to estimate these properties, particularly the durability of vaccine protection. Here we designed a simulation study, based on a generic transmission model of vaccination, to simulate the impact of a breadth of vaccines with different mean (range: 10 months-2 years) and variability (coefficient of variation range: 50-100%) of the duration of protection. Focusing on the dynamics of SARS-CoV-2 in the year after start of mass immunization in Germany as a case study, we then assessed how confidently the duration of protection could be estimated under a range of epidemiological scenarios. We found that lower mean and higher heterogeneity facilitated estimation of the duration of vaccine protection. Across the vaccines tested, rapid waning and high heterogeneity permitted complete identification of the duration of protection; by contrast, slow waning and low heterogeneity allowed only estimation of the fraction of vaccinees with rapid loss of immunity. These findings suggest that limited epidemiological data can inform the duration of vaccine immunity. More generally, they highlight the need to carefully consider immunological heterogeneity when designing transmission models to evaluate vaccines.
Collapse
Affiliation(s)
| | - Anabelle Wong
- Infectious Disease Epidemiology group, Max Planck Institute for Infection Biology, 10117 Berlin, Germany
- Institute of Public Health, Charité—Universitätsmedizin Berlin, 10117 Berlin, Germany
| | - Laura Andrea Barrero Guevara
- Infectious Disease Epidemiology group, Max Planck Institute for Infection Biology, 10117 Berlin, Germany
- Institute of Public Health, Charité—Universitätsmedizin Berlin, 10117 Berlin, Germany
| | - Pejman Rohani
- Odum School of Ecology, University of Georgia, Athens, GA 30602, USA
- Department of Infectious Diseases, University of Georgia, Athens, GA 30602, USA
- Center for Influenza Disease & Emergence Research (CIDER), University of Georgia, Athens, GA 30602, USA
| |
Collapse
|
3
|
Løchen A, Anderson RM. Dynamic transmission models and economic evaluations of pneumococcal conjugate vaccines: a quality appraisal and limitations. Clin Microbiol Infect 2021. [DOI: 10.1016/j.cmi.2021.07.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
4
|
Olson DR, Lopman BA, Konty KJ, Mathes RW, Papadouka V, Ternier A, Zucker JR, Simonsen L, Grenfell BT, Pitzer VE. Surveillance data confirm multiyear predictions of rotavirus dynamics in New York City. SCIENCE ADVANCES 2020; 6:eaax0586. [PMID: 32133392 PMCID: PMC7043922 DOI: 10.1126/sciadv.aax0586] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/18/2019] [Accepted: 12/06/2019] [Indexed: 05/17/2023]
Abstract
Prediction skill is a key test of models for epidemic dynamics. However, future validation of models against out-of-sample data is rare, partly because of a lack of timely surveillance data. We address this gap by analyzing the response of rotavirus dynamics to infant vaccination. Syndromic surveillance of emergency department visits for diarrhea in New York City reveals a marked decline in diarrheal incidence among infants and young children, in line with data on rotavirus-coded hospitalizations and laboratory-confirmed cases, and a shift from annual to biennial epidemics increasingly affecting older children and adults. A published mechanistic model qualitatively predicted these patterns more than 2 years in advance. Future efforts to increase vaccination coverage may disrupt these patterns and lead to further declines in the incidence of rotavirus-attributable gastroenteritis.
Collapse
Affiliation(s)
- Donald R. Olson
- New York City Department of Health and Mental Hygiene, New York City, NY, USA
- Corresponding author. (D.R.O.); (V.E.P.)
| | - Benjamin A. Lopman
- Centers for Disease Control and Prevention, Atlanta, GA, USA
- Department of Epidemiology, Rollins School of Public Health, Emory University, Atlanta, GA, USA
| | - Kevin J. Konty
- New York City Department of Health and Mental Hygiene, New York City, NY, USA
| | - Robert W. Mathes
- New York City Department of Health and Mental Hygiene, New York City, NY, USA
| | - Vikki Papadouka
- New York City Department of Health and Mental Hygiene, New York City, NY, USA
| | - Alexandra Ternier
- New York City Department of Health and Mental Hygiene, New York City, NY, USA
| | - Jane R. Zucker
- New York City Department of Health and Mental Hygiene, New York City, NY, USA
- Centers for Disease Control and Prevention, Atlanta, GA, USA
| | - Lone Simonsen
- Department of Science and Environment, Roskilde University, Rodskilde, Denmark
- Department of Global Health, George Washington University, Washington, DC, USA
| | - Bryan T. Grenfell
- Department of Ecology and Evolutionary Biology, Princeton University, Princeton, NJ, USA
| | - Virginia E. Pitzer
- Department of Epidemiology of Microbial Diseases, Yale School of Public Health, New Haven, CT, USA
- Corresponding author. (D.R.O.); (V.E.P.)
| |
Collapse
|
5
|
Løchen A, Anderson R. Dynamic transmission models and economic evaluations of pneumococcal conjugate vaccines: a quality appraisal and limitations. Clin Microbiol Infect 2020; 26:60-70. [DOI: 10.1016/j.cmi.2019.04.026] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2019] [Revised: 04/08/2019] [Accepted: 04/22/2019] [Indexed: 02/01/2023]
|
6
|
Morozova O, Cohen T, Crawford FW. Risk ratios for contagious outcomes. J R Soc Interface 2018; 15:20170696. [PMID: 29343627 PMCID: PMC5805970 DOI: 10.1098/rsif.2017.0696] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2017] [Accepted: 12/18/2017] [Indexed: 12/12/2022] Open
Abstract
Epidemiologists commonly use the risk ratio to summarize the relationship between a binary covariate and outcome, even when outcomes may be dependent. Investigations of transmissible diseases in clusters-households, villages or small groups-often report risk ratios. Epidemiologists have warned that risk ratios may be misleading when outcomes are contagious, but the nature of this error is poorly understood. In this study, we assess the meaning of the risk ratio when outcomes are contagious. We provide a mathematical definition of infectious disease transmission within clusters, based on the canonical stochastic susceptible-infective model. From this characterization, we define the individual-level ratio of instantaneous infection risks as the inferential target, and evaluate the properties of the risk ratio as an approximation of this quantity. We exhibit analytically and by simulation the circumstances under which the risk ratio implies an effect whose direction is opposite that of the true effect of the covariate. In particular, the risk ratio can be greater than one even when the covariate reduces both individual-level susceptibility to infection, and transmissibility once infected. We explain these findings in the epidemiologic language of confounding and Simpson's paradox, underscoring the pitfalls of failing to account for transmission when outcomes are contagious.
Collapse
Affiliation(s)
- Olga Morozova
- Department of Epidemiology of Microbial Diseases, Yale School of Public Health, 60 College Street, New Haven, CT 06510, USA
| | - Ted Cohen
- Department of Epidemiology of Microbial Diseases, Yale School of Public Health, 60 College Street, New Haven, CT 06510, USA
| | - Forrest W Crawford
- Department of Biostatistics, Yale School of Public Health, 60 College Street, New Haven, CT 06510, USA
- Department of Ecology and Evolutionary Biology, Yale University, 165 Prospect St, New Haven, CT 06511, USA
- Yale School of Management, 165 Whitney Ave, New Haven, CT 06511, USA
| |
Collapse
|
7
|
Abstract
The average effect of an infectious disease intervention (eg, a vaccine) varies across populations with different degrees of exposure to the pathogen. As a result, many investigators favor a per-exposure effect measure that is considered independent of the population level of exposure and that can be used in simulations to estimate the total disease burden averted by an intervention across different populations. However, while per-exposure effects are frequently estimated, the quantity of interest is often poorly defined, and assumptions in its calculation are typically left implicit. In this article, we build upon work by Halloran and Struchiner (Epidemiology. 1995;6:142-151) to develop a formal definition of the per-exposure effect and discuss conditions necessary for its unbiased estimation. With greater care paid to the parameterization of transmission models, their results can be better understood and can thereby be of greater value to decision-makers.
Collapse
|
8
|
Estimating a Markovian epidemic model using household serial interval data from the early phase of an epidemic. PLoS One 2013; 8:e73420. [PMID: 24023679 PMCID: PMC3758268 DOI: 10.1371/journal.pone.0073420] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2013] [Accepted: 07/22/2013] [Indexed: 11/23/2022] Open
Abstract
The clinical serial interval of an infectious disease is the time between date of symptom onset in an index case and the date of symptom onset in one of its secondary cases. It is a quantity which is commonly collected during a pandemic and is of fundamental importance to public health policy and mathematical modelling. In this paper we present a novel method for calculating the serial interval distribution for a Markovian model of household transmission dynamics. This allows the use of Bayesian MCMC methods, with explicit evaluation of the likelihood, to fit to serial interval data and infer parameters of the underlying model. We use simulated and real data to verify the accuracy of our methodology and illustrate the importance of accounting for household size. The output of our approach can be used to produce posterior distributions of population level epidemic characteristics.
Collapse
|