1
|
Incorporating testing volume into estimation of effective reproduction number dynamics. JOURNAL OF THE ROYAL STATISTICAL SOCIETY. SERIES A, (STATISTICS IN SOCIETY) 2024; 187:436-453. [PMID: 38617598 PMCID: PMC11009926 DOI: 10.1093/jrsssa/qnad128] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/04/2022] [Revised: 09/04/2023] [Accepted: 09/10/2023] [Indexed: 04/16/2024]
Abstract
Branching process inspired models are widely used to estimate the effective reproduction number-a useful summary statistic describing an infectious disease outbreak-using counts of new cases. Case data is a real-time indicator of changes in the reproduction number, but is challenging to work with because cases fluctuate due to factors unrelated to the number of new infections. We develop a new model that incorporates the number of diagnostic tests as a surveillance model covariate. Using simulated data and data from the SARS-CoV-2 pandemic in California, we demonstrate that incorporating tests leads to improved performance over the state of the art.
Collapse
|
2
|
The central role of the identifying assumption in population size estimation. Biometrics 2024; 80:ujad028. [PMID: 38456546 DOI: 10.1093/biomtc/ujad028] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2021] [Revised: 05/12/2022] [Accepted: 12/12/2023] [Indexed: 03/09/2024]
Abstract
The problem of estimating the size of a population based on a subset of individuals observed across multiple data sources is often referred to as capture-recapture or multiple-systems estimation. This is fundamentally a missing data problem, where the number of unobserved individuals represents the missing data. As with any missing data problem, multiple-systems estimation requires users to make an untestable identifying assumption in order to estimate the population size from the observed data. If an appropriate identifying assumption cannot be found for a data set, no estimate of the population size should be produced based on that data set, as models with different identifying assumptions can produce arbitrarily different population size estimates-even with identical observed data fits. Approaches to multiple-systems estimation often do not explicitly specify identifying assumptions. This makes it difficult to decouple the specification of the model for the observed data from the identifying assumption and to provide justification for the identifying assumption. We present a re-framing of the multiple-systems estimation problem that leads to an approach that decouples the specification of the observed-data model from the identifying assumption, and discuss how common models fit into this framing. This approach takes advantage of existing software and facilitates various sensitivity analyses. We demonstrate our approach in a case study estimating the number of civilian casualties in the Kosovo war.
Collapse
|
3
|
Rejoinder to the discussion on "The central role of the identifying assumption in population size estimation". Biometrics 2024; 80:ujad033. [PMID: 38456545 DOI: 10.1093/biomtc/ujad033] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2023] [Revised: 08/23/2023] [Accepted: 12/12/2023] [Indexed: 03/09/2024]
Abstract
We organize the discussants' major comments into the following categories: sensitivity analyses, zero counts, model selection, the marginal no-highest-order interaction (NHOI) assumption, and the usefulness of our proposed framework.
Collapse
|
4
|
Incorporating testing volume into estimation of effective reproduction number dynamics. ARXIV 2023:arXiv:2208.04418v2. [PMID: 35979401 PMCID: PMC9383801] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Branching process inspired models are widely used to estimate the effective reproduction number -- a useful summary statistic describing an infectious disease outbreak -- using counts of new cases. Case data is a real-time indicator of changes in the reproduction number, but is challenging to work with because cases fluctuate due to factors unrelated to the number of new infections. We develop a new model that incorporates the number of diagnostic tests as a surveillance model covariate. Using simulated data and data from the SARS-CoV-2 pandemic in California, we demonstrate that incorporating tests leads to improved performance over the state-of-the-art.
Collapse
|
5
|
A small area model to assess temporal trends and sub-national disparities in healthcare quality. Nat Commun 2023; 14:4555. [PMID: 37507373 PMCID: PMC10382513 DOI: 10.1038/s41467-023-40234-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2022] [Accepted: 07/13/2023] [Indexed: 07/30/2023] Open
Abstract
Monitoring subnational healthcare quality is important for identifying and addressing geographic inequities. Yet, health facility surveys are rarely powered to support the generation of estimates at more local levels. With this study, we propose an analytical approach for estimating both temporal and subnational patterns of healthcare quality indicators from health facility survey data. This method uses random effects to account for differences between survey instruments; space-time processes to leverage correlations in space and time; and covariates to incorporate auxiliary information. We applied this method for three countries in which at least four health facility surveys had been conducted since 1999 - Kenya, Senegal, and Tanzania - and estimated measures of sick-child care quality per WHO Service Availability and Readiness Assessment (SARA) guidelines at programmatic subnational level, between 1999 and 2020. Model performance metrics indicated good out-of-sample predictive validity, illustrating the potential utility of geospatial statistical models for health facility data. This method offers a way to jointly estimate indicators of healthcare quality over space and time, which could then provide insights to decision-makers and health service program managers.
Collapse
|
6
|
Early effects of COVID-19 on maternal and child health service disruption in Mozambique. Front Public Health 2023; 11:1075691. [PMID: 37139385 PMCID: PMC10149948 DOI: 10.3389/fpubh.2023.1075691] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2022] [Accepted: 02/21/2023] [Indexed: 05/05/2023] Open
Abstract
This article is part of the Research Topic 'Health Systems Recovery in the Context of COVID-19 and Protracted Conflict'. Introduction After the World Health Organization declared COVID-19 a pandemic, more than 184 million cases and 4 million deaths had been recorded worldwide by July 2021. These are likely to be underestimates and do not distinguish between direct and indirect deaths resulting from disruptions in health care services. The purpose of our research was to assess the early impact of COVID-19 in 2020 and early 2021 on maternal and child healthcare service delivery at the district level in Mozambique using routine health information system data, and estimate associated excess maternal and child deaths. Methods Using data from Mozambique's routine health information system (SISMA, Sistema de Informação em Saúde para Monitoria e Avaliação), we conducted a time-series analysis to assess changes in nine selected indicators representing the continuum of maternal and child health care service provision in 159 districts in Mozambique. The dataset was extracted as counts of services provided from January 2017 to March 2021. Descriptive statistics were used for district comparisons, and district-specific time-series plots were produced. We used absolute differences or ratios for comparisons between observed data and modeled predictions as a measure of the magnitude of loss in service provision. Mortality estimates were performed using the Lives Saved Tool (LiST). Results All maternal and child health care service indicators that we assessed demonstrated service delivery disruptions (below 10% of the expected counts), with the number of new users of family planing and malaria treatment with Coartem (number of children under five treated) experiencing the largest disruptions. Immediate losses were observed in April 2020 for all indicators, with the exception of treatment of malaria with Coartem. The number of excess deaths estimated in 2020 due to loss of health service delivery were 11,337 (12.8%) children under five, 5,705 (11.3%) neonates, and 387 (7.6%) mothers. Conclusion Findings from our study support existing research showing the negative impact of COVID-19 on maternal and child health services utilization in sub-Saharan Africa. This study offers subnational and granular estimates of service loss that can be useful for health system recovery planning. To our knowledge, it is the first study on the early impacts of COVID-19 on maternal and child health care service utilization conducted in an African Portuguese-speaking country.
Collapse
|
7
|
A Statistical Review of Template Model Builder: A Flexible Tool for Spatial Modelling. Int Stat Rev 2022. [DOI: 10.1111/insr.12534] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
|
8
|
Estimating the stillbirth rate for 195 countries using a Bayesian sparse regression model with temporal smoothing. Ann Appl Stat 2022. [DOI: 10.1214/21-aoas1571] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
9
|
A linear noise approximation for stochastic epidemic models fit to partially observed incidence counts. Biometrics 2022; 78:1530-1541. [PMID: 34374071 DOI: 10.1111/biom.13538] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2020] [Revised: 06/10/2021] [Accepted: 06/17/2021] [Indexed: 12/30/2022]
Abstract
Stochastic epidemic models (SEMs) fit to incidence data are critical to elucidating outbreak dynamics, shaping response strategies, and preparing for future epidemics. SEMs typically represent counts of individuals in discrete infection states using Markov jump processes (MJPs), but are computationally challenging as imperfect surveillance, lack of subject-level information, and temporal coarseness of the data obscure the true epidemic. Analytic integration over the latent epidemic process is impossible, and integration via Markov chain Monte Carlo (MCMC) is cumbersome due to the dimensionality and discreteness of the latent state space. Simulation-based computational approaches can address the intractability of the MJP likelihood, but are numerically fragile and prohibitively expensive for complex models. A linear noise approximation (LNA) that approximates the MJP transition density with a Gaussian density has been explored for analyzing prevalence data in large-population settings, but requires modification for analyzing incidence counts without assuming that the data are normally distributed. We demonstrate how to reparameterize SEMs to appropriately analyze incidence data, and fold the LNA into a data augmentation MCMC framework that outperforms deterministic methods, statistically, and simulation-based methods, computationally. Our framework is computationally robust when the model dynamics are complex and applies to a broad class of SEMs. We evaluate our method in simulations that reflect Ebola, influenza, and SARS-CoV-2 dynamics, and apply our method to national surveillance counts from the 2013-2015 West Africa Ebola outbreak.
Collapse
|
10
|
Bayesian multiresolution modeling of georeferenced data: An extension of ‘LatticeKrig’. Comput Stat Data Anal 2022. [DOI: 10.1016/j.csda.2022.107503] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022]
|
11
|
Combining information to estimate adherence in studies of pre-exposure prophylaxis for HIV prevention: Application to HPTN 067. Stat Med 2022; 41:1120-1136. [PMID: 35080038 PMCID: PMC8881405 DOI: 10.1002/sim.9321] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2021] [Revised: 12/29/2021] [Accepted: 12/30/2021] [Indexed: 11/11/2022]
Abstract
In trials of oral HIV pre-exposure prophylaxis (PrEP), multiple approaches have been used to measure adherence, including self-report, pill counts, electronic dose monitoring devices, and biological measures such as drug levels in plasma, peripheral blood mononuclear cells, hair, and/or dried blood spots. No one of these measures is ideal and each has strengths and weaknesses. However, accurate estimates of adherence to oral PrEP are important as drug efficacy is closely tied to adherence, and secondary analyses of trial data within identified adherent/non-adherent subgroups may yield important insights into real-world drug effectiveness. We develop a statistical approach to combining multiple measures of adherence and show in simulated data that the proposed method provides a more accurate measure of true adherence than self-report. We then apply the method to estimate adherence in the ADAPT study (HPTN 067) in South African women.
Collapse
|
12
|
A FLEXIBLE BAYESIAN FRAMEWORK TO ESTIMATE AGE- AND CAUSE-SPECIFIC CHILD MORTALITY OVER TIME FROM SAMPLE REGISTRATION DATA. Ann Appl Stat 2022; 16:124-143. [PMID: 37621750 PMCID: PMC10448806 DOI: 10.1214/21-aoas1489] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/26/2023]
Abstract
In order to implement disease-specific interventions in young age groups, policy makers in low- and middle-income countries require timely and accurate estimates of age- and cause-specific child mortality. High-quality data is not available in settings where these interventions are most needed, but there is a push to create sample registration systems that collect detailed mortality information. current methods that estimate mortality from this data employ multistage frameworks without rigorous statistical justification that separately estimate all-cause and cause-specific mortality and are not sufficiently adaptable to capture important features of the data. We propose a flexible Bayesian modeling framework to estimate age- and cause-specific child mortality from sample registration data. We provide a theoretical justification for the framework, explore its properties via simulation, and use it to estimate mortality trends using data from the Maternal and Child Health Surveillance System in China.
Collapse
|
13
|
Global, regional, and national trends in under-5 mortality between 1990 and 2019 with scenario-based projections until 2030: a systematic analysis by the UN Inter-agency Group for Child Mortality Estimation. Lancet Glob Health 2022; 10:e195-e206. [PMID: 35063111 PMCID: PMC8789561 DOI: 10.1016/s2214-109x(21)00515-5] [Citation(s) in RCA: 98] [Impact Index Per Article: 49.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2021] [Revised: 09/29/2021] [Accepted: 10/25/2021] [Indexed: 12/25/2022]
Abstract
BACKGROUND The Sustainable Development Goals (SDGs), set in 2015 by the UN General Assembly, call for all countries to reach an under-5 mortality rate (U5MR) of at least as low as 25 deaths per 1000 livebirths and a neonatal mortality rate (NMR) of at least as low as 12 deaths per 1000 livebirths by 2030. We estimated levels and trends in under-5 mortality for 195 countries from 1990 to 2019, and conducted scenario-based projections of the U5MR and NMR from 2020 to 2030 to assess country progress in, and potential for, reaching SDG targets on child survival and the potential under-5 and neonatal deaths over the next decade. METHODS Levels and trends in under-5 mortality are based on the UN Inter-agency Group for Child Mortality Estimation (UN IGME) database on under-5 mortality, which contains around 18 000 country-year datapoints for 195 countries-nearly 10 000 of those datapoints since 1990. The database includes nationally representative mortality data from vital registration systems, sample registration systems, population censuses, and household surveys. As with previous sets of national UN IGME estimates, a Bayesian B-spline bias-reduction model (B3) that considers the systematic biases associated with the different data source types was fitted to these data to generate estimates of under-5 (age 0-4 years) mortality with uncertainty intervals for 1990-2019 for all countries. Levels and trends in the neonatal mortality rate (0-27 days) are modelled separately as the log ratio of the neonatal mortality rate to the under-5 mortality rate using a Bayesian model. Estimated mortality rates are combined with livebirths data to calculate the number of under-5 and neonatal deaths. To assess the regional and global burden of under-5 deaths in the present decade and progress towards SDG targets, we constructed several scenario-based projections of under-5 mortality from 2020 to 2030 and estimated national, regional, and global under-5 mortality trends up to 2030 for each scenario. FINDINGS The global U5MR decreased by 59% (90% uncertainty interval [UI] 56-61) from 93·0 (91·7-94·5) deaths per 1000 livebirths in 1990 to 37·7 (36·1-40·8) in 2019, while the annual number of global under-5 deaths declined from 12·5 (12·3-12·7) million in 1990 to 5·2 (5·0-5·6) million in 2019-a 58% (55-60) reduction. The global NMR decreased by 52% (90% UI 48-55) from 36·6 (35·6-37·8) deaths per 1000 livebirths in 1990, to 17·5 (16·6-19·0) in 2019, and the annual number of global neonatal deaths declined from 5·0 (4·9-5·2) million in 1990, to 2·4 (2·3-2·7) million in 2019, a 51% (47-54) reduction. As of 2019, 122 of 195 countries have achieved the SDG U5MR target, and 20 countries are on track to achieve the target by 2030, while 53 will need to accelerate progress to meet the target by 2030. 116 countries have reached the SDG NMR target with 16 on track, leaving 63 at risk of missing the target. If current trends continue, 48·1 million under-5 deaths are projected to occur between 2020 and 2030, almost half of them projected to occur during the neonatal period. If all countries met the SDG target on under-5 mortality, 11 million under-5 deaths could be averted between 2020 and 2030. INTERPRETATION As a result of effective global health initiatives, millions of child deaths have been prevented since 1990. However, the task of ending all preventable child deaths is not done and millions more deaths could be averted by meeting international targets. Geographical and economic variation demonstrate the possibility of even lower mortality rates for children under age 5 years and point to the regions and countries with highest mortality rates and in greatest need of resources and action. FUNDING Bill & Melinda Gates Foundation, US Agency for International Development.
Collapse
|
14
|
Space-time smoothing models for subnational measles routine immunization coverage estimation with complex survey data. Ann Appl Stat 2021. [DOI: 10.1214/21-aoas1474] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
15
|
Naomi: a new modelling tool for estimating HIV epidemic indicators at the district level in sub-Saharan Africa. J Int AIDS Soc 2021; 24 Suppl 5:e25788. [PMID: 34546657 PMCID: PMC8454682 DOI: 10.1002/jia2.25788] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2021] [Accepted: 07/19/2021] [Indexed: 11/23/2022] Open
Abstract
INTRODUCTION HIV planning requires granular estimates for the number of people living with HIV (PLHIV), antiretroviral treatment (ART) coverage and unmet need, and new HIV infections by district, or equivalent subnational administrative level. We developed a Bayesian small-area estimation model, called Naomi, to estimate these quantities stratified by subnational administrative units, sex, and five-year age groups. METHODS Small-area regressions for HIV prevalence, ART coverage and HIV incidence were jointly calibrated using subnational household survey data on all three indicators, routine antenatal service delivery data on HIV prevalence and ART coverage among pregnant women, and service delivery data on the number of PLHIV receiving ART. Incidence was modelled by district-level HIV prevalence and ART coverage. Model outputs of counts and rates for each indicator were aggregated to multiple geographic and demographic stratifications of interest. The model was estimated in an empirical Bayes framework, furnishing probabilistic uncertainty ranges for all output indicators. Example results were presented using data from Malawi during 2016-2018. RESULTS Adult HIV prevalence in September 2018 ranged from 3.2% to 17.1% across Malawi's districts and was higher in southern districts and in metropolitan areas. ART coverage was more homogenous, ranging from 75% to 82%. The largest number of PLHIV was among ages 35 to 39 for both women and men, while the most untreated PLHIV were among ages 25 to 29 for women and 30 to 34 for men. Relative uncertainty was larger for the untreated PLHIV than the number on ART or total PLHIV. Among clients receiving ART at facilities in Lilongwe city, an estimated 71% (95% CI, 61% to 79%) resided in Lilongwe city, 20% (14% to 27%) in Lilongwe district outside the metropolis, and 9% (6% to 12%) in neighbouring Dowa district. Thirty-eight percent (26% to 50%) of Lilongwe rural residents and 39% (27% to 50%) of Dowa residents received treatment at facilities in Lilongwe city. CONCLUSIONS The Naomi model synthesizes multiple subnational data sources to furnish estimates of key indicators for HIV programme planning, resource allocation, and target setting. Further model development to meet evolving HIV policy priorities and programme need should be accompanied by continued strengthening and understanding of routine health system data.
Collapse
|
16
|
Global, regional, and national estimates and trends in stillbirths from 2000 to 2019: a systematic assessment. Lancet 2021; 398:772-785. [PMID: 34454675 PMCID: PMC8417352 DOI: 10.1016/s0140-6736(21)01112-0] [Citation(s) in RCA: 156] [Impact Index Per Article: 52.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/01/2021] [Revised: 04/30/2021] [Accepted: 05/06/2021] [Indexed: 01/12/2023]
Abstract
BACKGROUND Stillbirths are a major public health issue and a sensitive marker of the quality of care around pregnancy and birth. The UN Global Strategy for Women's, Children's and Adolescents' Health (2016-30) and the Every Newborn Action Plan (led by UNICEF and WHO) call for an end to preventable stillbirths. A first step to prevent stillbirths is obtaining standardised measurement of stillbirth rates across countries. We estimated stillbirth rates and their trends for 195 countries from 2000 to 2019 and assessed progress over time. METHODS For a systematic assessment, we created a dataset of 2833 country-year datapoints from 171 countries relevant to stillbirth rates, including data from registration and health information systems, household-based surveys, and population-based studies. After data quality assessment and exclusions, we used 1531 datapoints to estimate country-specific stillbirth rates for 195 countries from 2000 to 2019 using a Bayesian hierarchical temporal sparse regression model, according to a definition of stillbirth of at least 28 weeks' gestational age. Our model combined covariates with a temporal smoothing process such that estimates were informed by data for country-periods with high quality data, while being based on covariates for country-periods with little or no data on stillbirth rates. Bias and additional uncertainty associated with observations based on alternative stillbirth definitions and source types, and observations that were subject to non-sampling errors, were included in the model. We compared the estimated stillbirth rates and trends to previously reported mortality estimates in children younger than 5 years. FINDINGS Globally in 2019, an estimated 2·0 million babies (90% uncertainty interval [UI] 1·9-2·2) were stillborn at 28 weeks or more of gestation, with a global stillbirth rate of 13·9 stillbirths (90% UI 13·5-15·4) per 1000 total births. Stillbirth rates in 2019 varied widely across regions, from 22·8 stillbirths (19·8-27·7) per 1000 total births in west and central Africa to 2·9 (2·7-3·0) in western Europe. After west and central Africa, eastern and southern Africa and south Asia had the second and third highest stillbirth rates in 2019. The global annual rate of reduction in stillbirth rate was estimated at 2·3% (90% UI 1·7-2·7) from 2000 to 2019, which was lower than the 2·9% (2·5-3·2) annual rate of reduction in neonatal mortality rate (for neonates aged <28 days) and the 4·3% (3·8-4·7) annual rate of reduction in mortality rate among children aged 1-59 months during the same period. Based on the lower bound of the 90% UIs, 114 countries had an estimated decrease in stillbirth rate since 2000, with four countries having a decrease of at least 50·0%, 28 having a decrease of 25·0-49·9%, 50 having a decrease of 10·0-24·9%, and 32 having a decrease of less than 10·0%. For the remaining 81 countries, we found no decrease in stillbirth rate since 2000. Of these countries, 34 were in sub-Saharan Africa, 16 were in east Asia and the Pacific, and 15 were in Latin America and the Caribbean. INTERPRETATION Progress in reducing the rate of stillbirths has been slow compared with decreases in the mortality rate of children younger than 5 years. Accelerated improvements are most needed in the regions and countries with high stillbirth rates, particularly in sub-Saharan Africa. Future prevention of stillbirths needs increased efforts to raise public awareness, improve data collection, assess progress, and understand public health priorities locally, all of which require investment. FUNDING Bill & Melinda Gates Foundation and the UK Foreign, Commonwealth and Development Office.
Collapse
|
17
|
Estimating seroprevalence of SARS-CoV-2 in Ohio: A Bayesian multilevel poststratification approach with multiple diagnostic tests. Proc Natl Acad Sci U S A 2021; 118:e2023947118. [PMID: 34172581 PMCID: PMC8255994 DOI: 10.1073/pnas.2023947118] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open
Abstract
Globally, severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has infected more than 59 million people and killed more than 1.39 million. Designing and monitoring interventions to slow and stop the spread of the virus require knowledge of how many people have been and are currently infected, where they live, and how they interact. The first step is an accurate assessment of the population prevalence of past infections. There are very few population-representative prevalence studies of SARS-CoV-2 infections, and only two states in the United States-Indiana and Connecticut-have reported probability-based sample surveys that characterize statewide prevalence of SARS-CoV-2. One of the difficulties is the fact that tests to detect and characterize SARS-CoV-2 coronavirus antibodies are new, are not well characterized, and generally function poorly. During July 2020, a survey representing all adults in the state of Ohio in the United States collected serum samples and information on protective behavior related to SARS-CoV-2 and coronavirus disease 2019 (COVID-19). Several features of the survey make it difficult to estimate past prevalence: 1) a low response rate; 2) a very low number of positive cases; and 3) the fact that multiple poor-quality serological tests were used to detect SARS-CoV-2 antibodies. We describe a Bayesian approach for analyzing the biomarker data that simultaneously addresses these challenges and characterizes the potential effect of selective response. The model does not require survey sample weights; accounts for multiple imperfect antibody test results; and characterizes uncertainty related to the sample survey and the multiple imperfect, potentially correlated tests.
Collapse
|
18
|
133 Unbound corneocyte lipid envelopes in 12R-lipoxygenase deficiency support a direct role in lipid-protein crosslinking. J Invest Dermatol 2021. [DOI: 10.1016/j.jid.2021.02.152] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
19
|
A linear noise approximation for stochastic epidemic models fit to partially observed incidence counts. ARXIV 2021:arXiv:2001.05099v2. [PMID: 33948449 PMCID: PMC8095205] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Subscribe] [Scholar Register] [Revised: 04/27/2021] [Indexed: 06/12/2023]
Abstract
Stochastic epidemic models (SEMs) fit to incidence data are critical to elucidating outbreak dynamics, shaping response strategies, and preparing for future epidemics. SEMs typically represent counts of individuals in discrete infection states using Markov jump processes (MJPs), but are computationally challenging as imperfect surveillance, lack of subject-level information, and temporal coarseness of the data obscure the true epidemic. Analytic integration over the latent epidemic process is impossible, and integration via Markov chain Monte Carlo (MCMC) is cumbersome due to the dimensionality and discreteness of the latent state space. Simulation-based computational approaches can address the intractability of the MJP likelihood, but are numerically fragile and prohibitively expensive for complex models. A linear noise approximation (LNA) that approximates the MJP transition density with a Gaussian density has been explored for analyzing prevalence data in large-population settings, but requires modification for analyzing incidence counts without assuming that the data are normally distributed. We demonstrate how to reparameterize SEMs to appropriately analyze incidence data, and fold the LNA into a data augmentation MCMC framework that outperforms deterministic methods, statistically, and simulation-based methods, computationally. Our framework is computationally robust when the model dynamics are complex and applies to a broad class of SEMs. We evaluate our method in simulations that reflect Ebola, influenza, and SARS-CoV-2 dynamics, and apply our method to national surveillance counts from the 2013--2015 West Africa Ebola outbreak.
Collapse
|
20
|
Space-time modeling of child mortality at the Admin-2 level in a low and middle income countries context. Stat Med 2021; 40:1593-1638. [PMID: 33586227 PMCID: PMC8055469 DOI: 10.1002/sim.8854] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2019] [Revised: 11/30/2020] [Accepted: 12/02/2020] [Indexed: 11/08/2022]
Abstract
The Sustainable Development Goals call for a total reduction of preventable child mortality before 2030. Further, the goals state the desirability to have subnational mortality estimates. Estimates at this level are required for health interventions at the subnational level. In a low and middle income countries context, the data on mortality typically consist of household surveys, which are carried out with a stratified, cluster design, and census microsamples. Most household surveys collect full birth history (FBH) data on birth and death dates of a mother's children, but censuses collect summary birth history (SBH) data which consist only of the number of children born and the number that died. In previous work, direct (survey-weighted) estimates with associated variances were derived from FBH data and smoothed in space and time. Unfortunately, the FBH data from household surveys are usually not sufficiently abundant to obtain yearly estimates at the Admin-2 level (at which interventions are often made). In this paper we describe four extensions to previous work: (i) combining SBH data with FBH data, (ii) modeling on a yearly scale, to combine data on a yearly scale with data at coarser time scales, (iii) adjusting direct estimates in Admin-2 areas where we do not observe any deaths due to small sample sizes, (iv) acknowledge differences in data sources by modeling potential bias arising from the various data sources. The methods are illustrated using household survey and census data from Kenya and Malawi, to produce mortality estimates from 1980 to the time of the most recent survey, and predictions to 2020.
Collapse
|
21
|
Abstract
There is an increasing focus on reducing inequalities in health outcomes in developing countries. Subnational variation is of particular interest, with geographically-indexed data being used to understand the spatial risk of detrimental outcomes and to identify who is at greatest risk. While some health surveys provide observations with associated geographic coordinates (point data), many others provide data that have their locations masked and instead only report the strata (polygon information) within which the data resides (masked data). How to harmonize these data sources for spatial analysis has been previously considered although only ad hoc methods and comparison of methods is lacking. In this paper, we present a new method for analyzing masked survey data, using a method that is consistent with the data-generating process. In addition, we critique two previously proposed approaches to analyzing masked data and illustrate that they are fundamentally flawed methodologically. To validate our method, we compare our approach with previously formulated solutions in several realistic simulation environments in which the underlying structure of the risk field is known. We simulate samples from spatiotemporal fields in a way that mimics the sampling frame implemented in the most common health surveys in low- and middle-income countries, the Demographic and Health Surveys and Multiple Indicator Cluster Surveys. In simulations, the newly proposed approach outperforms previously proposed approaches in terms of minimizing error while increasing the precision of estimates. The approaches are subsequently compared using child mortality data from the Dominican Republic where our findings are reinforced. The ability to accurately increase precision of child mortality estimates, and health outcomes in general, by leveraging various types of data, improves our ability to implement precision public health initiatives and better understand the landscape of geographic health inequalities.
Collapse
|
22
|
Pointless spatial modeling. Biostatistics 2020; 21:e17-e32. [PMID: 30202860 DOI: 10.1093/biostatistics/kxy041] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2017] [Revised: 07/13/2018] [Accepted: 08/04/2018] [Indexed: 11/14/2022] Open
Abstract
The analysis of area-level aggregated summary data is common in many disciplines including epidemiology and the social sciences. Typically, Markov random field spatial models have been employed to acknowledge spatial dependence and allow data-driven smoothing. In the context of an irregular set of areas, these models always have an ad hoc element with respect to the definition of a neighborhood scheme. In this article, we exploit recent theoretical and computational advances to carry out modeling at the continuous spatial level, which induces a spatial model for the discrete areas. This approach also allows reconstruction of the continuous underlying surface, but the interpretation of such surfaces is delicate since it depends on the quality, extent and configuration of the observed data. We focus on models based on stochastic partial differential equations. We also consider the interesting case in which the aggregate data are supplemented with point data. We carry out Bayesian inference and, in the language of generalized linear mixed models, if the link is linear, an efficient implementation of the model is available via integrated nested Laplace approximations. For nonlinear links, we present two approaches: a fully Bayesian implementation using a Hamiltonian Monte Carlo algorithm and an empirical Bayes implementation, that is much faster and is based on Laplace approximations. We examine the properties of the approach using simulation, and then apply the model to the classic Scottish lip cancer data.
Collapse
|
23
|
Child mortality estimation incorporating summary birth history data. Biometrics 2020; 77:1456-1466. [PMID: 32970318 DOI: 10.1111/biom.13383] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2018] [Revised: 09/08/2020] [Accepted: 09/15/2020] [Indexed: 11/28/2022]
Abstract
The United Nations' Sustainable Development Goal 3.2 aims to reduce under-five child mortality to 25 deaths per 1000 live births by 2030. Child mortality tends to be concentrated in developing regions where information needed to assess achievement of this goal often comes from surveys and censuses. In both, women are asked about their birth histories, but with varying degrees of detail. Full birth history (FBH) data contain the reported dates of births and deaths of every surveyed mother's children. In contrast, summary birth history (SBH) data contain only the total number of children born and total number of children who died for each mother. Specialized methods are needed to accommodate this type of data into analyses of child mortality trends. We develop a data augmentation scheme within a Bayesian framework where for SBH data, birth and death dates are introduced as auxiliary variables. Since we specify a full probability model for the data, many of the well-known biases that exist in this data can be accommodated, along with space-time smoothing on the underlying mortality rates. We illustrate our approach in a simulation, showing robustness to model misspecification and that uncertainty is reduced when incorporating SBH data over simply analyzing all available FBH data. We also apply our approach to data from the Central region of Malawi and compare with the well-known Brass method.
Collapse
|
24
|
What About Equity? Neighborhood Deprivation and Cannabis Retailers in Portland, Oregon. ACTA ACUST UNITED AC 2020. [DOI: 10.26828/cannabis.2020.02.003] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
|
25
|
Ecological inference for infectious disease data, with application to vaccination strategies. Stat Med 2020; 39:220-238. [PMID: 31797425 PMCID: PMC11016350 DOI: 10.1002/sim.8390] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2018] [Revised: 08/05/2019] [Accepted: 09/17/2019] [Indexed: 11/11/2022]
Abstract
Disease surveillance systems provide a rich source of data regarding infectious diseases, aggregated across geographical regions. The analysis of such ecological data is fraught with difficulties, and, unless care and suitable data summaries are available, will lead to biased estimates of individual-level parameters. We consider using surveillance data to study the impacts of vaccination. To catalog the problems of ecological inference, we start with an individual-level model, which contains familiar parameters, and derive an ecologically consistent model for infectious diseases in partially vaccinated populations. We compare with other popular model classes and highlight deficiencies. We explore the properties of the new model through simulation and demonstrate that, under standard assumptions, the ecological model provides less biased estimates. We then fit the new model to data collected on measles outbreaks in Germany from 2005-2007.
Collapse
|
26
|
Abstract
Since 2000, many countries have achieved considerable success in improving child survival, but localized progress remains unclear. To inform efforts towards United Nations Sustainable Development Goal 3.2-to end preventable child deaths by 2030-we need consistently estimated data at the subnational level regarding child mortality rates and trends. Here we quantified, for the period 2000-2017, the subnational variation in mortality rates and number of deaths of neonates, infants and children under 5 years of age within 99 low- and middle-income countries using a geostatistical survival model. We estimated that 32% of children under 5 in these countries lived in districts that had attained rates of 25 or fewer child deaths per 1,000 live births by 2017, and that 58% of child deaths between 2000 and 2017 in these countries could have been averted in the absence of geographical inequality. This study enables the identification of high-mortality clusters, patterns of progress and geographical inequalities to inform appropriate investments and implementations that will help to improve the health of all populations.
Collapse
|
27
|
Abstract
Accurate estimates of the under-five mortality rate in a developing world context are a key barometer of the health of a nation. This paper describes a new model to analyze survey data on mortality in this context. We are interested in both spatial and temporal description, that is wishing to estimate under-five mortality rate across regions and years and to investigate the association between the under-five mortality rate and spatially varying covariate surfaces. We illustrate the methodology by producing yearly estimates for subnational areas in Kenya over the period 1980-2014 using data from the Demographic and Health Surveys, which use stratified cluster sampling. We use a binomial likelihood with fixed effects for the urban/rural strata and random effects for the clustering to account for the complex survey design. Smoothing is carried out using Bayesian hierarchical models with continuous spatial and temporally discrete components. A key component of the model is an offset to adjust for bias due to the effects of HIV epidemics. Substantively, there has been a sharp decline in Kenya in the under-five mortality rate in the period 1980-2014, but large variability in estimated subnational rates remains. A priority for future research is understanding this variability. In exploratory work, we examine whether a variety of spatial covariate surfaces can explain the variability in under-five mortality rate. Temperature, precipitation, a measure of malaria infection prevalence, and a measure of nearness to cities were candidates for inclusion in the covariate model, but the interplay between space, time, and covariates is complex.
Collapse
|
28
|
647 Basis for the link between atopic dermatitis and autism. J Invest Dermatol 2019. [DOI: 10.1016/j.jid.2019.03.723] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
29
|
Geogenomic Segregation and Temporal Trends of Human Pathogenic Escherichia coli O157:H7, Washington, USA, 2005-2014 1. Emerg Infect Dis 2018; 24:32-39. [PMID: 29260688 PMCID: PMC5749469 DOI: 10.3201/eid2401.170851] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
The often-noted and persistent increased incidence of Escherichia
coli O157:H7 infections in rural areas is not well understood. We
used a cohort of E. coli O157:H7 cases reported in Washington,
USA, during 2005–2014, along with phylogenomic characterization of the
infecting isolates, to identify geographic segregation of and temporal trends in
specific phylogenetic lineages of E. coli O157:H7. Kernel
estimation and generalized additive models demonstrated that pathogen lineages
were spatially segregated during the period of analysis and identified a focus
of segregation spanning multiple, predominantly rural, counties for each of the
main clinical lineages, Ib, IIa, and IIb. These results suggest the existence of
local reservoirs from which humans are infected. We also noted a secular
increase in the proportion of lineage IIa and IIb isolates. Spatial segregation
by phylogenetic lineage offers the potential to identify local reservoirs and
intervene to prevent continued transmission.
Collapse
|
30
|
Responding to Mycobacterium chimaera heater-cooler unit contamination: international and national intersectoral collaboration coordinated in the state of Queensland, Australia. J Hosp Infect 2018; 100:e77-e84. [PMID: 30036634 DOI: 10.1016/j.jhin.2018.07.024] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2018] [Accepted: 07/16/2018] [Indexed: 11/26/2022]
Abstract
BACKGROUND The index case of Mycobacterium chimaera infection in a patient following open cardiac surgery in the state of Queensland, Australia prompted a centralized coordinated response to mitigate the risk. AIM To describe the public health response to M. chimaera contamination of heater-cooler units (HCUs) and patient infection. METHODS A public health sector strategy was developed with national and international consultation to respond to the threat of HCUs contaminated with M. chimaera. Data linkage of non-tuberculous mycobacterium notifications and selected procedures was undertaken where potential use of HCUs was identified through hospitalization records. Water sampling and testing protocols were standardized. Public disclosure and patient notification were undertaken. FINDINGS A single case of disseminated M. chimaera infection in a patient has been diagnosed to date in Queensland, Australia. Ten of 12 (83%) LivaNova Stockert 3T HCUs from five hospitals tested positive for M. chimaera. In total, 5650 patients were notified by post of their potential risk of exposure. Use of the telehealth call centre was modest. M. chimaera was also found in extracorporeal membrane oxygenation heater units produced by two other device manufacturers, four of which tested positive prior to commissioning for use. CONCLUSIONS Rapid international collaboration optimized the Queensland Health response to potential M. chimaera exposure during cardiac surgery. State-wide collaboration ensured a transparent, consistent approach to contacting patients and informing the public of the potential risk. A framework for ongoing risk management, clinical awareness and laboratory diagnosis was established. No further cases of M. chimaera infection have been identified in Queensland.
Collapse
|
31
|
Stratified space–time infectious disease modelling, with an application to hand, foot and mouth disease in China. J R Stat Soc Ser C Appl Stat 2018. [DOI: 10.1111/rssc.12284] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
|
32
|
|
33
|
|
34
|
Associations between social capital and depression: A study of adult twins. Health Place 2018; 50:162-167. [PMID: 29459249 DOI: 10.1016/j.healthplace.2018.02.002] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/24/2017] [Revised: 01/30/2018] [Accepted: 02/09/2018] [Indexed: 10/18/2022]
Abstract
Social capital is associated with depression independently of individual-level risk factors. We used a sample of 1586 same-sex twin pairs to test the association between seven measures of social capital and two related measures of neighborhood characteristics with depressive symptoms accounting for uncontrolled selection factors (i.e., genetics and shared environment). All measures of cognitive social capital and neighborhood characteristics were associated with less depressive symptoms in between-twin analysis. However, only measures of cognitive social capital were significantly associated with less depressive symptoms within-pairs. These results demonstrate that cognitive social capital is associated with depressive symptoms free of confounding from genetic and environmental factors shared within twins.
Collapse
|
35
|
Hyak mortality monitoring system: innovative sampling and estimation methods - proof of concept by simulation. Glob Health Epidemiol Genom 2018; 3:e3. [PMID: 29868228 PMCID: PMC5870438 DOI: 10.1017/gheg.2017.15] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2017] [Revised: 10/13/2017] [Accepted: 10/16/2017] [Indexed: 01/21/2023] Open
Abstract
Traditionally health statistics are derived from civil and/or vital registration. Civil registration in low- to middle-income countries varies from partial coverage to essentially nothing at all. Consequently the state of the art for public health information in low- to middle-income countries is efforts to combine or triangulate data from different sources to produce a more complete picture across both time and space - data amalgamation. Data sources amenable to this approach include sample surveys, sample registration systems, health and demographic surveillance systems, administrative records, census records, health facility records and others. We propose a new statistical framework for gathering health and population data - Hyak - that leverages the benefits of sampling and longitudinal, prospective surveillance to create a cheap, accurate, sustainable monitoring platform. Hyak has three fundamental components: Data amalgamation: A sampling and surveillance component that organizes two or more data collection systems to work together: (1) data from HDSS with frequent, intense, linked, prospective follow-up and (2) data from sample surveys conducted in large areas surrounding the Health and Demographic Surveillance System (HDSS) sites using informed sampling so as to capture as many events as possible;Cause of death: Verbal autopsy to characterize the distribution of deaths by cause at the population level; andSocioeconomic status (SES): Measurement of SES in order to characterize poverty and wealth. We conduct a simulation study of the informed sampling component of Hyak based on the Agincourt HDSS site in South Africa. Compared with traditional cluster sampling, Hyak's informed sampling captures more deaths, and when combined with an estimation model that includes spatial smoothing, produces estimates of both mortality counts and mortality rates that have lower variance and small bias.
Collapse
|
36
|
Associations between neighbourhood characteristics and depression: a twin study. J Epidemiol Community Health 2017; 72:202-207. [PMID: 29273630 DOI: 10.1136/jech-2017-209453] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2017] [Revised: 11/10/2017] [Accepted: 11/30/2017] [Indexed: 01/18/2023]
Abstract
BACKGROUND Depression is an important contributor to the global burden of disease. Besides several known individual-level factors that contribute to depression, there is a growing recognition that neighbourhood environment can also profoundly affect mental health. This study assessed associations between three neighbourhood constructs-socioeconomic deprivation, residential instability and income inequality-and depression among adult twin pairs. The twin design is used to examine the association between neighbourhood constructs and depression, controlling for selection factors (ie, genetic and shared environmental factors) that have confounded purported associations. METHODS We used multilevel random-intercept Poisson regression among 3738 same-sex twin pairs from a community-based twin registry to examine the association between neighbourhood constructs and depression. The within-pair association controls for confounding by genetic and environmental factors shared between twins within a pair, and is the main parameter of interest. Models were adjusted for individual-level income, education and marital status, and further by neighbourhood-level population density. RESULTS When twins were analysed as individuals (phenotypic model), all neighbourhood constructs were significantly associated with depression. However, only neighbourhood socioeconomic deprivation showed a significant within-pair association with depression. A 10-unit within-pair difference in neighbourhood socioeconomic deprivation was associated with 6% greater depressive symptoms (1.06, 95% CI 1.01 to 1.11); the association did not substantially change in adjusted models. CONCLUSION This study provides new evidence linking neighbourhood socioeconomic deprivation with greater depression. Future studies should employ longitudinal designs to better test social causation versus social selection.
Collapse
|
37
|
Efficient Data Augmentation for Fitting Stochastic Epidemic Models to Prevalence Data. J Comput Graph Stat 2017; 26:918-929. [PMID: 30515026 PMCID: PMC6275108 DOI: 10.1080/10618600.2017.1328365] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2016] [Revised: 02/01/2017] [Indexed: 10/19/2022]
Abstract
Stochastic epidemic models describe the dynamics of an epidemic as a disease spreads through a population. Typically, only a fraction of cases are observed at a set of discrete times. The absence of complete information about the time evolution of an epidemic gives rise to a complicated latent variable problem in which the state space size of the epidemic grows large as the population size increases. This makes analytically integrating over the missing data infeasible for populations of even moderate size. We present a data augmentation Markov chain Monte Carlo (MCMC) framework for Bayesian estimation of stochastic epidemic model parameters, in which measurements are augmented with subject-level disease histories. In our MCMC algorithm, we propose each new subject-level path, conditional on the data, using a time-inhomogeneous continuous-time Markov process with rates determined by the infection histories of other individuals. The method is general, and may be applied to a broad class of epidemic models with only minimal modifications to the model dynamics and/or emission distribution. We present our algorithm in the context of multiple stochastic epidemic models in which the data are binomially sampled prevalence counts, and apply our method to data from an outbreak of influenza in a British boarding school.
Collapse
|
38
|
Air pollution exposure is associated with MRSA acquisition in young U.S. children with cystic fibrosis. BMC Pulm Med 2017; 17:106. [PMID: 28750627 PMCID: PMC5530959 DOI: 10.1186/s12890-017-0449-8] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2016] [Accepted: 07/19/2017] [Indexed: 01/21/2023] Open
Abstract
Background The role of air pollution in increasing susceptibility to respiratory tract infections in the cystic fibrosis (CF) population has not been well described. We recently demonstrated that chronic PM2.5 exposure is associated with an increased risk of initial Pseudomonas aeruginosa acquisition in young children with CF. The purpose of this study was to determine whether PM2.5 exposure is a risk factor for acquisition of other respiratory pathogens in young children with CF. Methods We conducted a retrospective study of initial acquisition of methicillin susceptible and methicillin resistant Staphylococcus aureus (MSSA and MRSA), Stenotrophomonas maltophilia and Achromobacter xylosoxidans in U.S. children <6 years of age with CF using the CF Foundation Patient Registry, 2003–2009. Multivariable Weibull regression with interval-censored outcomes was used to evaluate the association of PM2.5 concentration in the year prior to birth and risk of acquisition of each organism. Results During follow-up 63%, 17%, 24%, and 5% of children acquired MSSA, MRSA, S. maltophilia, and A. xylosoxidans, respectively. A 10 μg/m3 increase in PM2.5 exposure was associated with a 68% increased risk of MRSA acquisition (Hazard Ratio: 1.68; 95% Confidence Interval: 1.24, 2.27). PM2.5 was not associated with acquisition of other respiratory pathogens. Conclusions Fine particulate matter is an independent risk factor for initial MRSA acquisition in young children with CF. These results support the increasing evidence that air pollution contributes to pulmonary morbidities in the CF community.
Collapse
|
39
|
Seasonality of acquisition of respiratory bacterial pathogens in young children with cystic fibrosis. BMC Infect Dis 2017; 17:411. [PMID: 28599639 PMCID: PMC5466772 DOI: 10.1186/s12879-017-2511-9] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2016] [Accepted: 06/01/2017] [Indexed: 12/30/2022] Open
Abstract
Background Seasonal variations are often observed for respiratory tract infections; however, limited information is available regarding seasonal patterns of acquisition of common cystic fibrosis (CF)-related respiratory pathogens. We previously reported differential seasonal acquisition of Pseudomonas aeruginosa in young children with CF and no such variation for methicillin-susceptible Staphylococcus aureus acquisition. The purpose of this study was to describe and compare the seasonal incidence of acquisition of other respiratory bacterial pathogens in young children with CF. Methods We conducted a retrospective study to describe and compare the seasonal incidence of methicillin-resistant Staphylococcus aureus (MRSA), Stenotrophomonas maltophilia, Achromobacter xylosoxidans, and Haemophilus influenzae acquisition in young CF patients residing in the U.S. using the Cystic Fibrosis Foundation National Patient Registry, 2003-2009. Log-linear overdispersed Poisson regression was used to evaluate seasonal acquisition of each of these pathogens. Results A total of 4552 children met inclusion criteria. During follow-up 910 (20%), 1161 (26%), 228 (5%), and 2148 (47%) children acquired MRSA, S. maltophilia, A. xylosoxidans and H. influenzae, respectively. Compared to winter season, MRSA was less frequently acquired in spring (Incidence Rate Ratio [IRR]: 0.79; 95% Confidence Interval [CI]: 0.65, 0.96) and summer (IRR: 0.69; 95% CI: 0.57, 0.84) seasons. Similarly, a lower rate of A. xylosoxidans acquisition was observed in spring (IRR: 0.59; 95% CI: 0.39, 0.89). For H. influenzae, summer (IRR: 0.88; 95% CI: 0.78, 0.99) and autumn (IRR: 0.78; 95% CI: 0.69, 0.88) seasons were associated with lower acquisition rates compared to winter. No seasonal variation was observed for S. maltophilia acquisition. Conclusion Acquisition of CF-related respiratory pathogens displays seasonal variation in young children with CF, with the highest rate of acquisition for most pathogens occurring in the winter. Investigation of factors underlying these observed associations may contribute to our understanding of the aetiology of these infections and guide future infection control strategies. Electronic supplementary material The online version of this article (doi:10.1186/s12879-017-2511-9) contains supplementary material, which is available to authorized users.
Collapse
|
40
|
|
41
|
Time series modeling of pathogen-specific disease probabilities with subsampled data. Biometrics 2017; 73:283-293. [PMID: 27378138 PMCID: PMC5224700 DOI: 10.1111/biom.12560] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2015] [Revised: 04/01/2016] [Accepted: 05/01/2016] [Indexed: 11/26/2022]
Abstract
Many diseases arise due to exposure to one of multiple possible pathogens. We consider the situation in which disease counts are available over time from a study region, along with a measure of clinical disease severity, for example, mild or severe. In addition, we suppose a subset of the cases are lab tested in order to determine the pathogen responsible for disease. In such a context, we focus interest on modeling the probabilities of disease incidence given pathogen type. The time course of these probabilities is of great interest as is the association with time-varying covariates such as meteorological variables. In this set up, a natural Bayesian approach would be based on imputation of the unsampled pathogen information using Markov Chain Monte Carlo but this is computationally challenging. We describe a practical approach to inference that is easy to implement. We use an empirical Bayes procedure in a first step to estimate summary statistics. We then treat these summary statistics as the observed data and develop a Bayesian generalized additive model. We analyze data on hand, foot, and mouth disease (HFMD) in China in which there are two pathogens of primary interest, enterovirus 71 (EV71) and Coxackie A16 (CA16). We find that both EV71 and CA16 are associated with temperature, relative humidity, and wind speed, with reasonably similar functional forms for both pathogens. The important issue of confounding by time is modeled using a penalized B-spline model with a random effects representation. The level of smoothing is addressed by a careful choice of the prior on the tuning variance.
Collapse
|
42
|
|
43
|
Temporal Trends in Geographic and Sociodemographic Disparities in Colorectal Cancer Among Medicare Patients, 1973-2010. J Rural Health 2016; 33:361-370. [PMID: 27578387 DOI: 10.1111/jrh.12209] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2016] [Revised: 07/08/2016] [Accepted: 07/18/2016] [Indexed: 12/14/2022]
Abstract
PURPOSE Colorectal cancer (CRC) incidence and mortality in the United States have steadily declined since the 1980s, but racial and socioeconomic disparities remain. The influence of geographic factors is poorly understood and may be affected by evolving insurance coverage and screening test uptake. We characterized temporal trends in the association between geographic and sociodemographic factors and CRC outcomes. METHODS We used the 1973-2010 SEER-Medicare files to identify patients aged ≥65 years with and without CRC. Beneficiary residential ZIP codes were used to extract local-level data. We constructed multivariable logistic regression models for CRC incidence and mortality using geographic and sociodemographic variables in 4 time periods: (1) 1973-1997; (2) 1998-2001; (3) 2002-2006; and (4) 2007-2010. FINDINGS We analyzed 1,093,758 records, including 336,321 CRC cases. Compared to urban residence, small rural residence was strongly associated with increased CRC incidence (OR 1.50, 95% CI: 1.43-1.57) and mortality (OR 1.35, 95% CI: 1.26-1.45) in 1973-1997, but the associations diminished by 2007-2010 (OR 1.09, 95% CI: 1.04-1.15 for incidence; OR 1.10, 95% CI: 1.01-1.20 for mortality). The disparity between blacks and whites increased over time for both incidence (OR 1.09, 95% CI: 1.05-1.13 in 1973-1997 vs OR 1.32, 95% CI: 1.27-1.37 in 2007-2010) and mortality (OR 1.22, 95% CI: 1.16-1.28 in 1973-1997 vs OR 1.34, 95% CI: 1.26-1.42 in 2007-2010). High socioeconomic status was associated with greater incidence and mortality in 1973-1997, but it became protective after 1998. CONCLUSIONS Although disparities persist among Medicare beneficiaries, the relationship between geographic and sociodemographic factors and CRC incidence and mortality has evolved over time.
Collapse
|
44
|
Abstract
Despite seasonal cholera outbreaks in Bangladesh, little is known about the relationship between environmental conditions and cholera cases. We seek to develop a predictive model for cholera outbreaks in Bangladesh based on environmental predictors. To do this, we estimate the contribution of environmental variables, such as water depth and water temperature, to cholera outbreaks in the context of a disease transmission model. We implement a method which simultaneously accounts for disease dynamics and environmental variables in a Susceptible-Infected-Recovered-Susceptible (SIRS) model. The entire system is treated as a continuous-time hidden Markov model, where the hidden Markov states are the numbers of people who are susceptible, infected, or recovered at each time point, and the observed states are the numbers of cholera cases reported. We use a Bayesian framework to fit this hidden SIRS model, implementing particle Markov chain Monte Carlo methods to sample from the posterior distribution of the environmental and transmission parameters given the observed data. We test this method using both simulation and data from Mathbaria, Bangladesh. Parameter estimates are used to make short-term predictions that capture the formation and decline of epidemic peaks. We demonstrate that our model can successfully predict an increase in the number of infected individuals in the population weeks before the observed number of cholera cases increases, which could allow for early notification of an epidemic and timely allocation of resources.
Collapse
|
45
|
Using Small-Area Estimation to Calculate the Prevalence of Smoking by Subcounty Geographic Areas in King County, Washington, Behavioral Risk Factor Surveillance System, 2009-2013. Prev Chronic Dis 2016; 13:E59. [PMID: 27149070 PMCID: PMC4858449 DOI: 10.5888/pcd13.150536] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
Introduction King County, Washington, fares well overall in many health indicators. However, county-level data mask disparities among subcounty areas. For disparity-focused assessment, a demand exists for examining health data at subcounty levels such as census tracts and King County health reporting areas (HRAs). Methods We added a “nearest intersection” question to the Behavioral Risk Factor Surveillance System (BRFSS) and geocoded the data for subcounty geographic areas, including census tracts. To overcome small sample size at the census tract level, we used hierarchical Bayesian models to obtain smoothed estimates in cigarette smoking rates at the census tract and HRA levels. We also used multiple imputation to adjust for missing values in census tracts. Results Direct estimation of adult smoking rates at the census tract level ranged from 0% to 56% with a median of 10%. The 90% confidence interval (CI) half-width for census tract with nonzero rates ranged from 1 percentage point to 37 percentage points with a median of 13 percentage points. The smoothed-multiple–imputation rates ranged from 5% to 28% with a median of 12%. The 90% CI half-width ranged from 4 percentage points to 13 percentage points with a median of 8 percentage points. Conclusion The nearest intersection question in the BRFSS provided geocoded data at subcounty levels. The Bayesian model provided estimation with improved precision at the census tract and HRA levels. Multiple imputation can be used to account for missing geographic data. Small-area estimation, which has been used for King County public health programs, has increasingly become a useful tool to meet the demand of presenting data at more granular levels.
Collapse
|
46
|
Abstract
Cluster detection is an important public health endeavor, and in this article, we describe and apply a recently developed Bayesian method. Commonly used approaches are based on so-called scan statistics and suffer from a number of difficulties, which include how to choose a level of significance and how to deal with the possibility of multiple clusters. The basis of our model is to partition the study region into a set of areas that are either "null" or "non-null," the latter corresponding to clusters (excess risk) or anticlusters (reduced risk). We demonstrate the Bayesian method and compare with a popular existing approach, using data on breast, brain, lung, prostate, and colorectal cancer, in the Puget Sound region of Washington State.
Collapse
|
47
|
Excavating Neandertal and Denisovan DNA from the genomes of Melanesian individuals. Science 2016; 352:235-9. [PMID: 26989198 DOI: 10.1126/science.aad9416] [Citation(s) in RCA: 233] [Impact Index Per Article: 29.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2015] [Accepted: 02/29/2016] [Indexed: 12/15/2022]
Abstract
Although Neandertal sequences that persist in the genomes of modern humans have been identified in Eurasians, comparable studies in people whose ancestors hybridized with both Neandertals and Denisovans are lacking. We developed an approach to identify DNA inherited from multiple archaic hominin ancestors and applied it to whole-genome sequences from 1523 geographically diverse individuals, including 35 previously unknown Island Melanesian genomes. In aggregate, we recovered 1.34 gigabases and 303 megabases of the Neandertal and Denisovan genome, respectively. We use these maps of archaic sequences to show that Neandertal admixture occurred multiple times in different non-African populations, characterize genomic regions that are significantly depleted of archaic sequences, and identify signatures of adaptive introgression.
Collapse
|
48
|
|
49
|
Restricted Covariance Priors with Applications in Spatial Statistics. BAYESIAN ANALYSIS 2015; 10:965-990. [PMID: 26753014 PMCID: PMC4705859 DOI: 10.1214/14-ba927] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
We present a Bayesian model for area-level count data that uses Gaussian random effects with a novel type of G-Wishart prior on the inverse variance- covariance matrix. Specifically, we introduce a new distribution called the truncated G-Wishart distribution that has support over precision matrices that lead to positive associations between the random effects of neighboring regions while preserving conditional independence of non-neighboring regions. We describe Markov chain Monte Carlo sampling algorithms for the truncated G-Wishart prior in a disease mapping context and compare our results to Bayesian hierarchical models based on intrinsic autoregression priors. A simulation study illustrates that using the truncated G-Wishart prior improves over the intrinsic autoregressive priors when there are discontinuities in the disease risk surface. The new model is applied to an analysis of cancer incidence data in Washington State.
Collapse
|
50
|
Space-Time Smoothing of Complex Survey Data: Small Area Estimation for Child Mortality. Ann Appl Stat 2015; 9:1889-1905. [PMID: 27468328 DOI: 10.1214/15-aoas872] [Citation(s) in RCA: 41] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Abstract
Many people living in low and middle-income countries are not covered by civil registration and vital statistics systems. Consequently, a wide variety of other types of data including many household sample surveys are used to estimate health and population indicators. In this paper we combine data from sample surveys and demographic surveillance systems to produce small area estimates of child mortality through time. Small area estimates are necessary to understand geographical heterogeneity in health indicators when full-coverage vital statistics are not available. For this endeavor spatio-temporal smoothing is beneficial to alleviate problems of data sparsity. The use of conventional hierarchical models requires careful thought since the survey weights may need to be considered to alleviate bias due to non-random sampling and non-response. The application that motivated this work is estimation of child mortality rates in five-year time intervals in regions of Tanzania. Data come from Demographic and Health Surveys conducted over the period 1991-2010 and two demographic surveillance system sites. We derive a variance estimator of under five years child mortality that accounts for the complex survey weighting. For our application, the hierarchical models we consider include random effects for area, time and survey and we compare models using a variety of measures including the conditional predictive ordinate (CPO). The method we propose is implemented via the fast and accurate integrated nested Laplace approximation (INLA).
Collapse
|