1
|
Wan H, Zhu W, Yan J, Han X, Yu J, Liao Q, Zhang T. Application of compound poisson model to estimate underreported risk of non-communicable diseases in underdeveloped areas. One Health 2024; 19:100889. [PMID: 39314245 PMCID: PMC11417528 DOI: 10.1016/j.onehlt.2024.100889] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2023] [Revised: 08/08/2024] [Accepted: 09/05/2024] [Indexed: 09/25/2024] Open
Abstract
Background Hypertension and diabetes are major components of non-communicable diseases (NCDs), with a substantial number of patients residing in underdeveloped areas. Limited medical resources in these areas often results in underreporting of disease prevalence, masking the true extent of diseases. Taking the underdeveloped Liangshan Yi Autonomous Prefecture in China as an example, this study aimed to correct the underreported prevalence of hypertension and type 2 diabetes so as to provide inspiration for the allocation of medical resources in such areas. Methods Assuming the true number of patients in each area follows a Poisson distribution, we applied a Compound Poisson Model based on Clustering of Data Quality (CPM-CDQ) to estimate the potential true prevalence of hypertension and diabetes, as well as the registration rate of existing patients. Specifically, a hierarchical clustering approach was utilized to group the counties based on the data quality, and then the registration rate of the cluster with the best data quality was used as a priori information for the model. The model parameters were estimated by the maximum likelihood method. Sensitivity analyses were performed to test the robustness of the model. Results The estimated prevalence of hypertension in the entire Liangshan Prefecture from 2018 to 2020 ranged from 24.59 % to 25.28 %, and for diabetes, it ranged from 4.95 % to 8.42 %. The registration rates for hypertension and diabetes were 14.10 % to 24.59 % and 15.98 % to 29.12 %, respectively. Additionally, the accuracy of clustering the counties with the best data quality had a significant impact on the performance of the model. Conclusion Liangshan Prefecture is experiencing a significant high prevalence of hypertension and diabetes, accompanied by a concerningly low registration rate. The CPM-CDQ proved useful for assessing underreporting risks and facilitating targeted interventions for NCDs control and prevention, particularly in underdeveloped areas.
Collapse
Affiliation(s)
- Hongli Wan
- Department of Epidemiology and Health Statistics, West China School of Public Health and West China Fourth Hospital, Sichuan University, Chengdu 610041, Sichuan Province, China
| | - Wenhui Zhu
- Department of Epidemiology and Health Statistics, West China School of Public Health and West China Fourth Hospital, Sichuan University, Chengdu 610041, Sichuan Province, China
| | - Jingmin Yan
- Department of Epidemiology and Health Statistics, West China School of Public Health and West China Fourth Hospital, Sichuan University, Chengdu 610041, Sichuan Province, China
| | - Xinyue Han
- Department of Epidemiology and Health Statistics, West China School of Public Health and West China Fourth Hospital, Sichuan University, Chengdu 610041, Sichuan Province, China
| | - Jie Yu
- Department of Epidemiology and Health Statistics, West China School of Public Health and West China Fourth Hospital, Sichuan University, Chengdu 610041, Sichuan Province, China
| | - Qiang Liao
- Liangshan Prefecture Center for Disease Control and Prevention, Xichang 615000, Sichuan Province, China
| | - Tao Zhang
- Department of Epidemiology and Health Statistics, West China School of Public Health and West China Fourth Hospital, Sichuan University, Chengdu 610041, Sichuan Province, China
| |
Collapse
|
2
|
Gu L, Cai J, Feng Y, Zhan Y, Zhu Z, Liu N, Guan X, Li X. Spatio-temporal pattern and associate factors study on intestinal infectious diseases based on panel model in Zhejiang Province. BMC Public Health 2024; 24:3041. [PMID: 39491019 DOI: 10.1186/s12889-024-20411-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2024] [Accepted: 10/15/2024] [Indexed: 11/05/2024] Open
Abstract
BACKGROUND Intestinal infectious diseases (IIDs) can impact the growth and development of children and weaken adults. This study aimed to establish a spatial panel model to analyze the relationship between factors such as population, economy and health resources, and the incidence of common IIDs. The objective was to provide a scientific basis for the formulation diseases prevention measures. METHODS Data on monthly reported cases of IIDs in each district and county of Zhejiang Province were collected from 2011 to 2021. The spatial distribution trend was plotted, and nine factors related to population, economy and health resources were selected for analysis. A spatial panel model was developed to identify statistically significant spatial patterns of influencing factors (P < 0.05). RESULTS The results revealed that each type of IIDs exhibited a certain level of clustering. Each IIDs had a significant radiation effect, HEV (b = 0.28, P < 0.05), bacillary dysentery (b = 0.38, P < 0.05), typhoid (b = 0.36, P < 0.05), other infectious diarrheas (OIDs) (b = 0.28, P < 0.05) and hand, foot and mouth disease (HFMD) (b = 0.39, P < 0.05), indicating that regions with high morbidity rates spread to neighboring areas. Among the population characteristics, density of population acted as a protective factor for bacillary dysentery (b=-1.81, P < 0.05), sex ratio acted as a protective factor for HFMD (b=-0.07, P < 0.05), and aging rate increased the risk of OIDs (b = 2.39, P < 0.05). Urbanization ratio posed a hazard factor for bacillary dysentery (b = 5.17, P < 0.05) and OIDs (b = 0.64, P < 0.05) while serving as a protective factor for typhoid (b=-1.61, P < 0.05) and HFMD (b=-0.39, P < 0.05). Per capita GDP was a risk factor for typhoid (b = 0.54, P < 0.05), but acted as a protective factor for OIDs (b=-0.45, P < 0.05) and HFMD (b=-0.27, P < 0.05). Additionally, the subsistence allowances ratio was a risk factor for HEV (b = 0.24, P < 0.05). CONCLUSION The incidence of IIDs in Zhejiang Province exhibited a certain degree of clustering, with major hotspots identified in Hangzhou, Shaoxing, and Jinhua. It would be essential to consider the spillover effects from neighboring regions and implement targeted measures to enhance disease prevention based on regional development.
Collapse
Affiliation(s)
- Lanfang Gu
- Department of Big Data in Health Science, Center for Clinical Big Data and Statistics, Second Affiliated Hospital, College of Medicine, Zhejiang University, Hangzhou, China
| | - Jian Cai
- Institute for Communicable Disease Control and Prevention, Zhejiang Provincial Center for Disease Control and Prevention, Hangzhou, China
| | - Yan Feng
- Institute for Communicable Disease Control and Prevention, Zhejiang Provincial Center for Disease Control and Prevention, Hangzhou, China
| | - Yancen Zhan
- Department of Big Data in Health Science, Center for Clinical Big Data and Statistics, Second Affiliated Hospital, College of Medicine, Zhejiang University, Hangzhou, China
| | - Zhixin Zhu
- Department of Big Data in Health Science, Center for Clinical Big Data and Statistics, Second Affiliated Hospital, College of Medicine, Zhejiang University, Hangzhou, China
| | - Nawen Liu
- Department of Big Data in Health Science, Center for Clinical Big Data and Statistics, Second Affiliated Hospital, College of Medicine, Zhejiang University, Hangzhou, China
| | - Xifei Guan
- Department of Big Data in Health Science, Center for Clinical Big Data and Statistics, Second Affiliated Hospital, College of Medicine, Zhejiang University, Hangzhou, China
| | - Xiuyang Li
- Department of Big Data in Health Science, Center for Clinical Big Data and Statistics, Second Affiliated Hospital, College of Medicine, Zhejiang University, Hangzhou, China.
| |
Collapse
|
3
|
Han D, Linares P, Holm RH, Chandran K, Smith T. Wastewater threshold as an indicator of COVID-19 cases in correctional facilities for public health response: A modeling study. WATER RESEARCH 2024; 260:121934. [PMID: 38908309 DOI: 10.1016/j.watres.2024.121934] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/30/2023] [Revised: 06/05/2024] [Accepted: 06/11/2024] [Indexed: 06/24/2024]
Abstract
Although prison facilities are not fully isolated from the communities in which they are located, most of the population is confined and requires high levels of health vigilance and protection. This study aimed to examine the dynamic relationship between facility-level wastewater viral concentrations and the probability of at least one positive COVID-19 case within the facility. The study period was from January 11, 2021 to May 8, 2023. Wastewater samples were collected and analyzed for SARS-CoV-2 (N1) and pepper mild mottle virus (PMMoV) three times weekly across 14 prison facilities in Kentucky (USA). Positive clinical case reports were also provided. A hierarchical Bayesian facility-level temporal model with a latent lagged process was developed. We modeled facility-specific SARS-CoV-2 (N1) normalized by the PMMoV wastewater concentration ratio threshold associated with at least one COVID-19 clinical case at an 80 % probability. The threshold differed among facilities. Across the 14 facilities, our model demonstrates a mean capture rate of 94.95 % via the N1/PMMoV ratio threshold with pts≥0.5. However, as the pts threshold was set higher, such as at ≥0.9, the mean capture rate of the model was reduced to 60 %. This robust performance underscores the effectiveness of the model for accurately detecting the presence of positive COVID-19 cases among incarcerated people. The findings of this study provide a facility-specific threshold model for public health response based on frequent wastewater surveillance.
Collapse
Affiliation(s)
- Dan Han
- Department of Mathematics, College of Arts & Sciences, University of Louisville, 2301 S. 3rd St., Louisville, KY 40292, USA
| | - Pamela Linares
- Department of Mathematics, College of Arts & Sciences, University of Louisville, 2301 S. 3rd St., Louisville, KY 40292, USA
| | - Rochelle H Holm
- Christina Lee Brown Envirome Institute, School of Medicine, University of Louisville, 302 E. Muhammad Ali Blvd., Louisville, KY 40202, USA.
| | - Kartik Chandran
- Department of Earth and Environmental Engineering, Columbia University, 500 W. 120th St., New York, NY 10027, USA
| | - Ted Smith
- Christina Lee Brown Envirome Institute, School of Medicine, University of Louisville, 302 E. Muhammad Ali Blvd., Louisville, KY 40202, USA
| |
Collapse
|
4
|
Arima S, Polettini S, Pasculli G, Gesualdo L, Pesce F, Procaccini DA. A Bayesian nonparametric approach to correct for underreporting in count data. Biostatistics 2024; 25:904-918. [PMID: 37811675 DOI: 10.1093/biostatistics/kxad027] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2022] [Revised: 06/06/2023] [Accepted: 08/21/2023] [Indexed: 10/10/2023] Open
Abstract
We propose a nonparametric compound Poisson model for underreported count data that introduces a latent clustering structure for the reporting probabilities. The latter are estimated with the model's parameters based on experts' opinion and exploiting a proxy for the reporting process. The proposed model is used to estimate the prevalence of chronic kidney disease in Apulia, Italy, based on a unique statistical database covering information on m = 258 municipalities obtained by integrating multisource register information. Accurate prevalence estimates are needed for monitoring, surveillance, and management purposes; yet, counts are deemed to be considerably underreported, especially in some areas of Apulia, one of the most deprived and heterogeneous regions in Italy. Our results agree with previous findings and highlight interesting geographical patterns of the disease. We compare our model to existing approaches in the literature using simulated as well as real data on early neonatal mortality risk in Brazil, described in previous research: the proposed approach proves to be accurate and particularly suitable when partial information about data quality is available.
Collapse
Affiliation(s)
- Serena Arima
- Department of Human and Social Sciences, University of Salento, Via di Valesio, 73100, LECCE, Italy
| | - Silvia Polettini
- Department of Social and Economic Sciences, Sapienza University, P.le Aldo Moro, 5, 00185 ROMA, Italy
| | - Giuseppe Pasculli
- Department of Computer, Control, and Management Engineering "Antonio Ruberti", Sapienza University, Via Ariosto, 25, 00185 Roma RM, Italy
| | - Loreto Gesualdo
- Section of Nephrology, Department of Precision and Regenerative Medicine and Ionian Area (DiMePre-J), Azienda Ospedaliero Universitaria Consorziale Policlinico di Bari, Piazza Giulio Cesare, 11 - 70124 Bari, Italy
| | - Francesco Pesce
- Division of Renal Medicine, "Fatebenefratelli Isola Tiberina-Gemelli Isola", 00186 Rome, Italy
| | - Deni-Aldo Procaccini
- Section of Nephrology, Department of Precision and Regenerative Medicine and Ionian Area (DiMePre-J), Azienda Ospedaliero Universitaria Consorziale Policlinico di Bari, Piazza Giulio Cesare, 11 - 70124 Bari, Italy
| |
Collapse
|
5
|
Lei J, MacNab Y. Bayesian hierarchical spatiotemporal models for prediction of (under)reporting rates and cases: COVID-19 infection among the older people in the United States during the 2020-2022 pandemic. Spat Spatiotemporal Epidemiol 2024; 49:100658. [PMID: 38876569 DOI: 10.1016/j.sste.2024.100658] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/29/2023] [Revised: 03/25/2024] [Accepted: 05/08/2024] [Indexed: 06/16/2024]
Abstract
The gap between the reported and actual COVID-19 infection cases has been an issue of concern. Here, we present Bayesian hierarchical spatiotemporal disease mapping models for state-level predictions of COVID-19 infection risks and (under)reporting rates among people aged 65 and above during the first two years of the pandemic in the United States. With prior elicitation based on recent prevalence studies, the study suggests that the median state-level reporting rate of COVID-19 infection was 90% (interquartile range: [78%, 96%]). Our study uncovers spatiotemporal variations and dynamics in state-level infection risks and (under)reporting rates, suggesting time-varying associations between higher population density, higher percentage of minorities, and higher percentage of vaccination and increased risks of COVID-19 infection, as well as an association between more easily accessible tests and higher reporting rates. With sensitivity analyses, we highlight the impact and importance of incorporating covariates information and objective prior references for evaluating the issue of underreporting.
Collapse
Affiliation(s)
- Jingxin Lei
- School of Public Health, University of British Columbia, 2206 East Mall, Vancouver, V6T 1Z3, BC, Canada.
| | - Ying MacNab
- School of Public Health, University of British Columbia, 2206 East Mall, Vancouver, V6T 1Z3, BC, Canada
| |
Collapse
|
6
|
Lope DJ, Demirhan H. JAGS model specification for spatiotemporal epidemiological modelling. Spat Spatiotemporal Epidemiol 2024; 49:100645. [PMID: 38876555 DOI: 10.1016/j.sste.2024.100645] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/07/2023] [Revised: 01/28/2024] [Accepted: 02/21/2024] [Indexed: 06/16/2024]
Abstract
Bayesian inference in modelling infectious diseases using Bayesian inference using Gibbs Sampling (BUGS) is notable in the last two decades in parallel with the advancements in computing and model development. The ability of BUGS to easily implement the Markov chain Monte Carlo (MCMC) method brought Bayesian analysis to the mainstream of infectious disease modelling. However, with the existing software that runs MCMC to make Bayesian inferences, it is challenging, especially in terms of computational complexity, when infectious disease models become more complex with spatial and temporal components, in addition to the increasing number of parameters and large datasets. This study investigates two alternative subscripting strategies for creating models in Just Another Gibbs Sampler (JAGS) environment and their performance in terms of run times. Our results are useful for practitioners to ensure the efficiency and timely implementation of Bayesian spatiotemporal infectious disease modelling.
Collapse
Affiliation(s)
- Dinah Jane Lope
- School of Science, Mathematical Sciences Discipline, RMIT University, Melbourne, 3000, Victoria, Australia.
| | - Haydar Demirhan
- School of Science, Mathematical Sciences Discipline, RMIT University, Melbourne, 3000, Victoria, Australia.
| |
Collapse
|
7
|
Adhikari B, Abdia Y, Ringa N, Clemens F, Mak S, Rose C, Janjua NZ, Otterstatter M, Irvine MA. Visible minority status and occupation were associated with increased COVID-19 infection in Greater Vancouver British Columbia between June and November 2020: an ecological study. Front Public Health 2024; 12:1336038. [PMID: 38481842 PMCID: PMC10935735 DOI: 10.3389/fpubh.2024.1336038] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2023] [Accepted: 02/16/2024] [Indexed: 05/12/2024] Open
Abstract
Background The COVID-19 pandemic has highlighted health disparities, especially among specific population groups. This study examines the spatial relationship between the proportion of visible minorities (VM), occupation types and COVID-19 infection in the Greater Vancouver region of British Columbia, Canada. Methods Provincial COVID-19 case data between June 24, 2020, and November 7, 2020, were aggregated by census dissemination area and linked with sociodemographic data from the Canadian 2016 census. Bayesian spatial Poisson regression models were used to examine the association between proportion of visible minorities, occupation types and COVID-19 infection. Models were adjusted for COVID-19 testing rates and other sociodemographic factors. Relative risk (RR) and 95% Credible Intervals (95% CrI) were calculated. Results We found an inverse relationship between the proportion of the Chinese population and risk of COVID-19 infection (RR = 0.98 95% CrI = 0.96, 0.99), whereas an increased risk was observed for the proportions of the South Asian group (RR = 1.10, 95% CrI = 1.08, 1.12), and Other Visible Minority group (RR = 1.06, 95% CrI = 1.04, 1.08). Similarly, a higher proportion of frontline workers (RR = 1.05, 95% CrI = 1.04, 1.07) was associated with higher infection risk compared to non-frontline. Conclusion Despite adjustments for testing, housing, occupation, and other social economic status variables, there is still a substantial association between the proportion of visible minorities, occupation types, and the risk of acquiring COVID-19 infection in British Columbia. This ecological analysis highlights the existing disparities in the burden of diseases among different visible minority populations and occupation types.
Collapse
Affiliation(s)
| | | | - Notice Ringa
- BC Centre for Disease Control, Vancouver, BC, Canada
- School of Population and Public Health, University of British Columbia, Vancouver, BC, Canada
| | | | - Sunny Mak
- BC Centre for Disease Control, Vancouver, BC, Canada
| | - Caren Rose
- BC Centre for Disease Control, Vancouver, BC, Canada
- School of Population and Public Health, University of British Columbia, Vancouver, BC, Canada
| | - Naveed Z. Janjua
- BC Centre for Disease Control, Vancouver, BC, Canada
- School of Population and Public Health, University of British Columbia, Vancouver, BC, Canada
| | - Michael Otterstatter
- BC Centre for Disease Control, Vancouver, BC, Canada
- School of Population and Public Health, University of British Columbia, Vancouver, BC, Canada
| | - Michael A. Irvine
- BC Centre for Disease Control, Vancouver, BC, Canada
- Faculty of Health Sciences, Simon Fraser University, Burnaby, BC, Canada
| |
Collapse
|
8
|
Roemer M, Schaefer MB, Pickens GT, Barrett ML. Estimating state-specific population-based hospitalization rates from in-state hospital discharge data. Health Serv Res 2023; 58:1314-1327. [PMID: 37602919 PMCID: PMC10622291 DOI: 10.1111/1475-6773.14216] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/22/2023] Open
Abstract
OBJECTIVE To develop weights to estimate state population-based hospitalization rates for all residents of a state using only data from in-state hospitals which exclude residents treated in other states. DATA SOURCES AND STUDY SETTING Agency for Healthcare Research and Quality, Healthcare Cost and Utilization Project (HCUP), State Inpatient Databases (SID), 2018-2019, 47 states+DC. STUDY DESIGN We identified characteristics for patients hospitalized in each state differentiating movers (discharges for patients hospitalized outside state of residence) from stayers (discharges for patients hospitalized in state of residence) and created weights based on 2018 data informed by these characteristics. We calculated standard errors using a sampling framework and compared weight-based estimates against complete observed values for 2019. DATA COLLECTION/EXTRACTION METHODS SID are based on administrative billing records collected by hospitals, shared with statewide data organizations, and provided to HCUP. PRINCIPAL FINDINGS Of 34,186,766 discharged patients in 2018, 4.2% were movers. A higher share of movers (vs. stayers) lived in state border and rural counties; a lower share had discharges billed to Medicaid or were hospitalized for maternal/neonatal services. The difference between 2019 observed and estimated total discharges for all included states and DC was 9402 (mean absolute percentage error = 0.2%). We overestimated discharges with an expected payer of Medicaid, from the lowest income communities, and for maternal/neonatal care. We underestimated discharges with an expected payer of private insurance, from the highest income communities, and with injury diagnoses and surgical services. Estimates for most subsets were not within a 95% confidence interval, likely due to factors impossible to account for (e.g., hospital closures/openings, shifting consumer preferences). CONCLUSIONS The weights offer a practical solution for researchers with access to only a single state's data to account for movers when calculating population-based hospitalization rates.
Collapse
Affiliation(s)
- Marc Roemer
- Agency for Healthcare Research and QualityRockvilleMarylandUSA
| | - Mary Beth Schaefer
- Affiliation at time of study: IBMSanta BarbaraCaliforniaUSA
- Present address:
American Society of Plastic SurgeonsArlington HeightsIllinoisUSA
| | | | | |
Collapse
|
9
|
Aghabazaz Z, Kazemi I. Under-reported time-varying MINAR(1) process for modeling multivariate count series. Comput Stat Data Anal 2023; 188:107825. [PMID: 37700855 PMCID: PMC10493608 DOI: 10.1016/j.csda.2023.107825] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/14/2023]
Abstract
A time-varying multivariate integer-valued autoregressive of order one (tvMINAR(1)) model is introduced for the non-stationary time series of correlated counts when under-reporting is likely present. A non-diagonal autoregression probability network is structured to preserve the cross-correlation of multivariate series, provide a necessary condition to ease model-fittings computations, and derive the full likelihood using the Viterbi algorithm. The motivating construction applies to fully under-reported counts that rely on a mixture presentation of the random thinning operator. Simulation studies are conducted to examine the proposed model, and the analysis of COVID-19 daily cases is accomplished to highlight its usefulness in applications. Finally, the comparison of models is presented using the posterior predictive checking method.
Collapse
Affiliation(s)
- Zeynab Aghabazaz
- Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, USA
| | - Iraj Kazemi
- Department of Statistics, Faculty of Mathematics & Statistics, University of Isfahan, Iran
| |
Collapse
|
10
|
Lopes de Oliveira G, Ferreira AJ, Teles CADS, Paixao ES, Fiaccone R, Lana R, Aquino R, Cardoso AM, Soares MA, Oliveira dos Santos I, Pereira M, Barreto ML, Ichihara MY. Estimating the real burden of gestational syphilis in Brazil, 2007-2018: a Bayesian modeling study. LANCET REGIONAL HEALTH. AMERICAS 2023; 25:100564. [PMID: 37575963 PMCID: PMC10415804 DOI: 10.1016/j.lana.2023.100564] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/03/2023] [Revised: 07/16/2023] [Accepted: 07/17/2023] [Indexed: 08/15/2023]
Abstract
Background Although several studies have estimated gestational syphilis (GS) incidence in several countries, underreporting correction is rarely considered. This study aimed to estimate the level of under-registration and correct the GS incidence rates in the 557 Brazilian microregions. Methods Brazilian GS notifications between 2007 and 2018 were obtained from the SINAN-Syphilis system. A cluster analysis was performed to group microregions according to the quality of GS notification. A Bayesian hierarchical Poisson regression model was applied to estimate the reporting probabilities among the clusters and to correct the associated incidence rates. Findings We estimate that 45,196 (90%-HPD: 13,299; 79,310) GS cases were underreported in Brazil from 2007 to 2018, representing a coverage of 87.12% (90%-HPD: 79.40%; 95.83%) of registered cases, where HPD stands for the Bayesian highest posterior density credible interval. Underreporting levels differ across the country, with microregions in North and Northeast regions presenting the highest percentage of missed cases. After underreporting correction, Brazil's estimated GS incidence rate increased from 8.74 to 10.02 per 1000 live births in the same period. Interpretation Our findings highlight disparities in the registration level and incidence rate of GS in Brazil, reflecting regional heterogeneity in the quality of syphilis surveillance, access to prenatal care, and childbirth assistance services. This study provides robust evidence to enhance national surveillance systems, guide specific policies for GS detection disease control, and potentially mitigate the harmful consequences of mother-to-child transmission. The methodology might be applied in other regions to correct disease underreporting. Funding National Council for Scientific and Technological Development; The Bill Melinda Gates Foundation and Wellcome Trust.
Collapse
Affiliation(s)
- Guilherme Lopes de Oliveira
- Centre of Data and Knowledge Integration for Health (CIDACS), Instituto Gonçalo Moniz, Fiocruz, Salvador, Bahia, Brazil
- Department of Computing, Federal Centre of Technological Education of Minas Gerais, Belo Horizonte, Minas Gerais, Brazil
| | - Andrêa J.F. Ferreira
- Centre of Data and Knowledge Integration for Health (CIDACS), Instituto Gonçalo Moniz, Fiocruz, Salvador, Bahia, Brazil
- The Ubuntu Center on Racism, Global Movement, Population and Equity, School of Public Health, Drexel University, Pennsylvania, USA
| | - Carlos Antônio de S.S. Teles
- Centre of Data and Knowledge Integration for Health (CIDACS), Instituto Gonçalo Moniz, Fiocruz, Salvador, Bahia, Brazil
| | - Enny S. Paixao
- Centre of Data and Knowledge Integration for Health (CIDACS), Instituto Gonçalo Moniz, Fiocruz, Salvador, Bahia, Brazil
- London School of Hygiene and Tropical Medicine, London, UK
| | - Rosemeire Fiaccone
- Centre of Data and Knowledge Integration for Health (CIDACS), Instituto Gonçalo Moniz, Fiocruz, Salvador, Bahia, Brazil
- Statistics Department, Institute of Mathematics, Federal University of Bahia, Salvador, Bahia, Brazil
| | - Raquel Lana
- Centre of Data and Knowledge Integration for Health (CIDACS), Instituto Gonçalo Moniz, Fiocruz, Salvador, Bahia, Brazil
- Barcelona Supercomputing Center, Catalonia, Spain
| | - Rosana Aquino
- Centre of Data and Knowledge Integration for Health (CIDACS), Instituto Gonçalo Moniz, Fiocruz, Salvador, Bahia, Brazil
- Institute of Collective Health, Federal University of Bahia, Salvador, Bahia, Brazil
| | | | - Maria Auxiliadora Soares
- Centre of Data and Knowledge Integration for Health (CIDACS), Instituto Gonçalo Moniz, Fiocruz, Salvador, Bahia, Brazil
- Institute of Collective Health, Federal University of Bahia, Salvador, Bahia, Brazil
| | - Idália Oliveira dos Santos
- Centre of Data and Knowledge Integration for Health (CIDACS), Instituto Gonçalo Moniz, Fiocruz, Salvador, Bahia, Brazil
- Institute of Collective Health, Federal University of Bahia, Salvador, Bahia, Brazil
| | - Marcos Pereira
- Centre of Data and Knowledge Integration for Health (CIDACS), Instituto Gonçalo Moniz, Fiocruz, Salvador, Bahia, Brazil
- Institute of Collective Health, Federal University of Bahia, Salvador, Bahia, Brazil
| | - Maurício L. Barreto
- Centre of Data and Knowledge Integration for Health (CIDACS), Instituto Gonçalo Moniz, Fiocruz, Salvador, Bahia, Brazil
- Institute of Collective Health, Federal University of Bahia, Salvador, Bahia, Brazil
| | - Maria Yury Ichihara
- Centre of Data and Knowledge Integration for Health (CIDACS), Instituto Gonçalo Moniz, Fiocruz, Salvador, Bahia, Brazil
| |
Collapse
|
11
|
Stoner O, Halliday A, Economou T. Correcting delayed reporting of COVID-19 using the generalized-Dirichlet-multinomial method. Biometrics 2023; 79:2537-2550. [PMID: 36484382 PMCID: PMC9877609 DOI: 10.1111/biom.13810] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2021] [Revised: 11/18/2022] [Accepted: 11/29/2022] [Indexed: 12/14/2022]
Abstract
The COVID-19 pandemic has highlighted delayed reporting as a significant impediment to effective disease surveillance and decision-making. In the absence of timely data, statistical models which account for delays can be adopted to nowcast and forecast cases or deaths. We discuss the four key sources of systematic and random variability in available data for COVID-19 and other diseases, and critically evaluate current state-of-the-art methods with respect to appropriately separating and capturing this variability. We propose a general hierarchical approach to correcting delayed reporting of COVID-19 and apply this to daily English hospital deaths, resulting in a flexible prediction tool which could be used to better inform pandemic decision-making. We compare this approach to competing models with respect to theoretical flexibility and quantitative metrics from a 15-month rolling prediction experiment imitating a realistic operational scenario. Based on consistent leads in predictive accuracy, bias, and precision, we argue that this approach is an attractive option for correcting delayed reporting of COVID-19 and future epidemics.
Collapse
Affiliation(s)
- Oliver Stoner
- School of Mathematics and StatisticsUniversity of GlasgowGlasgowUK
| | - Alba Halliday
- School of Mathematics and StatisticsUniversity of GlasgowGlasgowUK
| | - Theo Economou
- Climate and Atmosphere Research CentreThe Cyprus InstituteAglantziaCyprus
| |
Collapse
|
12
|
Moriña D, Fernández-Fontelo A, Cabaña A, Arratia A, Puig P. Estimated Covid-19 burden in Spain: ARCH underreported non-stationary time series. BMC Med Res Methodol 2023; 23:75. [PMID: 36977977 PMCID: PMC10043853 DOI: 10.1186/s12874-023-01894-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2022] [Accepted: 03/17/2023] [Indexed: 03/30/2023] Open
Abstract
BACKGROUND The problem of dealing with misreported data is very common in a wide range of contexts for different reasons. The current situation caused by the Covid-19 worldwide pandemic is a clear example, where the data provided by official sources were not always reliable due to data collection issues and to the high proportion of asymptomatic cases. In this work, a flexible framework is proposed, with the objective of quantifying the severity of misreporting in a time series and reconstructing the most likely evolution of the process. METHODS The performance of Bayesian Synthetic Likelihood to estimate the parameters of a model based on AutoRegressive Conditional Heteroskedastic time series capable of dealing with misreported information and to reconstruct the most likely evolution of the phenomenon is assessed through a comprehensive simulation study and illustrated by reconstructing the weekly Covid-19 incidence in each Spanish Autonomous Community. RESULTS Only around 51% of the Covid-19 cases in the period 2020/02/23-2022/02/27 were reported in Spain, showing relevant differences in the severity of underreporting across the regions. CONCLUSIONS The proposed methodology provides public health decision-makers with a valuable tool in order to improve the assessment of a disease evolution under different scenarios.
Collapse
Affiliation(s)
- David Moriña
- Department of Econometrics, Statistics and Applied Economics, Riskcenter-IREA, Universitat de Barcelona (UB), Barcelona, Spain.
| | - Amanda Fernández-Fontelo
- Departament de Matemàtiques, Universitat Autònoma de Barcelona (UAB), Cerdanyola del Vallès, Spain
| | - Alejandra Cabaña
- Departament de Matemàtiques, Universitat Autònoma de Barcelona (UAB), Cerdanyola del Vallès, Spain
| | - Argimiro Arratia
- Department of Computer Science, Universitat Politècnica de Catalunya (UPC), Barcelona, Spain
| | - Pedro Puig
- Departament de Matemàtiques, Universitat Autònoma de Barcelona (UAB), Cerdanyola del Vallès, Spain
- Centre de Recerca Matemàtica (CRM), Barcelona, Spain
| |
Collapse
|
13
|
Bravo-Vega C, Renjifo-Ibañez C, Santos-Vega M, León Nuñez LJ, Angarita-Sierra T, Cordovez JM. A generalized framework for estimating snakebite underreporting using statistical models: A study in Colombia. PLoS Negl Trop Dis 2023; 17:e0011117. [PMID: 36745647 PMCID: PMC9934346 DOI: 10.1371/journal.pntd.0011117] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2022] [Revised: 02/16/2023] [Accepted: 01/20/2023] [Indexed: 02/07/2023] Open
Abstract
BACKGROUND Snakebite envenoming is a neglected tropical disease affecting deprived populations, and its burden is underestimated in some regions where patients prefer using traditional medicine, case reporting systems are deficient, or health systems are inaccessible to at-risk populations. Thus, the development of strategies to optimize disease management is a major challenge. We propose a framework that can be used to estimate total snakebite incidence at a fine political scale. METHODOLOGY/PRINCIPAL FINDINGS First, we generated fine-scale snakebite risk maps based on the distribution of venomous snakes in Colombia. We then used a generalized mixed-effect model that estimates total snakebite incidence based on risk maps, poverty, and travel time to the nearest medical center. Finally, we calibrated our model with snakebite data in Colombia from 2010 to 2019 using the Markov-chain-Monte-Carlo algorithm. Our results suggest that 10.19% of total snakebite cases (532.26 yearly envenomings) are not reported and these snakebite victims do not seek medical attention, and that populations in the Orinoco and Amazonian regions are the most at-risk and show the highest percentage of underreporting. We also found that variables such as precipitation of the driest month and mean temperature of the warmest quarter influences the suitability of environments for venomous snakes rather than absolute temperature or rainfall. CONCLUSIONS/SIGNIFICANCE Our framework permits snakebite underreporting to be estimated using data on snakebite incidence and surveillance, presence locations for the most medically significant venomous snake species, and openly available information on population size, poverty, climate, land cover, roads, and the locations of medical centers. Thus, our algorithm could be used in other countries to estimate total snakebite incidence and improve disease management strategies; however, this framework does not serve as a replacement for a surveillance system, which should be made a priority in countries facing similar public health challenges.
Collapse
Affiliation(s)
- Carlos Bravo-Vega
- Grupo de investigación en Biología Matemática y Computacional (BIOMAC), Departamento de Ingeniería Biomédica, Universidad de los Andes, Bogotá, Colombia
| | | | - Mauricio Santos-Vega
- Grupo de investigación en Biología Matemática y Computacional (BIOMAC), Departamento de Ingeniería Biomédica, Universidad de los Andes, Bogotá, Colombia
- Facultad de Medicina, Universidad de los Andes, Bogotá, Colombia
| | - Leonardo Jose León Nuñez
- Observatorio de Salud Pública y epidemiología "José Felix Patiño", Universidad de los Andes, Bogotá, Colombia
| | - Teddy Angarita-Sierra
- Grupo de investigación Biodiversidad para la sociedad, Universidad Nacional de Colombia sede de La Paz, Cesar, Colombia
| | - Juan Manuel Cordovez
- Grupo de investigación en Biología Matemática y Computacional (BIOMAC), Departamento de Ingeniería Biomédica, Universidad de los Andes, Bogotá, Colombia
| |
Collapse
|
14
|
Hepler SA, Kline DM, Bonny A, McKnight E, Waller LA. An integrated abundance model for estimating county-level prevalence of opioid misuse in Ohio. JOURNAL OF THE ROYAL STATISTICAL SOCIETY. SERIES A, (STATISTICS IN SOCIETY) 2023; 186:43-60. [PMID: 37261313 PMCID: PMC10227692 DOI: 10.1093/jrsssa/qnac013] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/02/2023]
Abstract
Opioid misuse is a national epidemic and a significant drug related threat to the United States. While the scale of the problem is undeniable, estimates of the local prevalence of opioid misuse are lacking, despite their importance to policy-making and resource allocation. This is due, in part, to the challenge of directly measuring opioid misuse at a local level. In this paper, we develop a Bayesian hierarchical spatio-temporal abundance model that integrates indirect county-level data on opioid-related outcomes with state-level survey estimates on prevalence of opioid misuse to estimate the latent county-level prevalence and counts of people who misuse opioids. A simulation study shows that our integrated model accurately recovers the latent counts and prevalence. We apply our model to county-level surveillance data on opioid overdose deaths and treatment admissions from the state of Ohio. Our proposed framework can be applied to other applications of small area estimation for hard to reach populations, which is a common occurrence with many health conditions such as those related to illicit behaviors.
Collapse
Affiliation(s)
- Staci A Hepler
- Department of Mathematics and Statistics, Wake Forest University, Winston-Salem, USA
| | - David M Kline
- Department of Biostatistics and Data Science, Wake Forest University School of Medicine, Winston-Salem, USA
| | - Andrea Bonny
- Division of Adolescent Medicine, Nationwide Children's Hospital, Department of Pediatrics, The Ohio State University, Columbus, USA
| | - Erin McKnight
- Division of Adolescent Medicine, Nationwide Children's Hospital, Department of Pediatrics, The Ohio State University, Columbus, USA
| | - Lance A Waller
- Department of Biostatistics and Bioinformatics, Emory University, Atlanta, USA
| |
Collapse
|
15
|
White EB, Hernández-Ramírez RU, Majwala RK, Nalugwa T, Reza T, Cattamanchi A, Katamba A, Davis JL. Is aggregated surveillance data a reliable method for constructing tuberculosis care cascades? A secondary data analysis from Uganda. PLOS GLOBAL PUBLIC HEALTH 2022; 2:e0000716. [PMID: 36962541 PMCID: PMC10045605 DOI: 10.1371/journal.pgph.0000716] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/20/2022] [Accepted: 09/20/2022] [Indexed: 11/24/2022]
Abstract
To accelerate tuberculosis (TB) control and elimination, reliable data is needed to improve the quality of TB care. We assessed agreement between a surveillance dataset routinely collected for Uganda's national TB program and a high-fidelity dataset collected from the same source documents for a research study from 32 health facilities in 2017 and 2019 for six measurements: 1) Smear-positive and 2) GeneXpert-positive diagnoses, 3) bacteriologically confirmed and 4) clinically diagnosed treatment initiations, and the number of people initiating TB treatment who were also 5) living with HIV or 6) taking antiretroviral therapy. We measured agreement as the average difference between the two methods, expressed as the average ratio of the surveillance counts to the research data counts, its 95% limits of agreement (LOA), and the concordance correlation coefficient. We used linear mixed models to investigate whether agreement changed over time or was associated with facility characteristics. We found good overall agreement with some variation in the expected facility-level agreement for the number of smear positive diagnoses (average ratio [95% LOA]: 1.04 [0.38-2.82]; CCC: 0.78), bacteriologically confirmed treatment initiations (1.07 [0.67-1.70]; 0.82), and people living with HIV (1.11 [0.51-2.41]; 0.82). Agreement was poor for Xpert positives, with surveillance data undercounting relative to research data (0.45 [0.099-2.07]; 0.36). Although surveillance data overcounted relative to research data for clinically diagnosed treatment initiations (1.52 [0.71-3.26]) and number of people taking antiretroviral therapy (1.71 [0.71-4.12]), their agreement as assessed by CCC was not poor (0.82 and 0.62, respectively). Average agreement was similar across study years for all six measurements, but facility-level agreement varied from year to year and was not explained by facility characteristics. In conclusion, the agreement of TB surveillance data with high-fidelity research data was highly variable across measurements and facilities. To advance the use of routine TB data as a quality improvement tool, future research should elucidate and address reasons for variability in its quality.
Collapse
Affiliation(s)
- Elizabeth B. White
- Department of Epidemiology of Microbial Diseases, Yale School of Public Health, New Haven, CT, United States of America
- Uganda Tuberculosis Implementation Research Consortium, Makerere University, Kampala, Uganda
| | - Raúl U. Hernández-Ramírez
- Center for Interdisciplinary Research on AIDS, Yale School of Public Health, New Haven, CT, United States of America
- Center for Methods in Implementation and Prevention Science, Department of Biostatistics, Yale School of Public Health, New Haven, CT, United States of America
| | - Robert Kaos Majwala
- National Tuberculosis and Leprosy Program, Ministry of Health, Kampala, Uganda
| | - Talemwa Nalugwa
- Uganda Tuberculosis Implementation Research Consortium, Makerere University, Kampala, Uganda
| | - Tania Reza
- Uganda Tuberculosis Implementation Research Consortium, Makerere University, Kampala, Uganda
- Division of Pulmonary and Critical Care Medicine and Center for Tuberculosis, University of California, San Francisco, San Francisco, CA, United States of America
| | - Adithya Cattamanchi
- Uganda Tuberculosis Implementation Research Consortium, Makerere University, Kampala, Uganda
- Division of Pulmonary and Critical Care Medicine and Center for Tuberculosis, University of California, San Francisco, San Francisco, CA, United States of America
| | - Achilles Katamba
- Uganda Tuberculosis Implementation Research Consortium, Makerere University, Kampala, Uganda
- Department of Medicine, Makerere University College of Health Sciences, Kampala, Uganda
| | - J. Lucian Davis
- Department of Epidemiology of Microbial Diseases, Yale School of Public Health, New Haven, CT, United States of America
- Uganda Tuberculosis Implementation Research Consortium, Makerere University, Kampala, Uganda
- Pulmonary, Critical Care and Sleep Medicine Section, Yale School of Medicine, New Haven, CT, United States of America
| |
Collapse
|
16
|
Lope DJ, Demirhan H. Spatiotemporal Bayesian estimation of the number of under-reported COVID-19 cases in Victoria Australia. PeerJ 2022; 10:e14184. [PMID: 36299511 PMCID: PMC9590417 DOI: 10.7717/peerj.14184] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2022] [Accepted: 09/14/2022] [Indexed: 01/24/2023] Open
Abstract
Having an estimate of the number of under-reported cases is crucial in determining the true burden of a disease. In the COVID-19 pandemic, there is a great need to quantify the true disease burden by capturing the true incidence rate to establish appropriate measures and strategies to combat the disease. This study investigates the under-reporting of COVID-19 cases in Victoria, Australia, during the third wave of the pandemic as a result of variation in geographic area and time. It is aimed to determine potential under-reported areas and generate the true picture of the disease in terms of the number of cases. A two-tiered Bayesian hierarchical model approach is employed to estimate the true incidence and detection rates through Bayesian model averaging. The proposed model goes beyond testing inequality across areas by looking into other covariates such as weather, vaccination rates, and access to vaccination and testing centres, including interactions and variations between space and time. This model aims for parsimony yet allows a broader range of scope to capture the underlying dynamic of the reported COVID-19 cases. Moreover, it is a data-driven, flexible, and generalisable model to a global context such as cross-country estimation and across time points under strict pandemic conditions.
Collapse
Affiliation(s)
- Dinah Jane Lope
- Mathematical Sciences Discipline/School of Science, RMIT University, Melbourne, Victoria, Australia
| | - Haydar Demirhan
- Mathematical Sciences Discipline/School of Science, RMIT University, Melbourne, Victoria, Australia
| |
Collapse
|
17
|
Lope DJ, Demirhan H, Dolgun A. Bayesian estimation of the effect of health inequality in disease detection. Int J Equity Health 2022; 21:118. [PMID: 36030233 PMCID: PMC9419354 DOI: 10.1186/s12939-022-01713-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2021] [Accepted: 08/02/2022] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Measuring health inequality is essential to ensure that everyone has equal accessibility to health care. Studies in the past have continuously presented and showed areas or groups of people affected by various inequality in accessing the health resources and services to help improve this matter. Alongside, disease prevention is as important to minimise the disease burden and improve health and quality of life. These aspects are interlinked and greatly contributes to one's health. METHOD In this study, the Gini coefficient and Lorenz curve are used to give an indication of the overall health inequality. The impact of this inequality in granular level is demonstrated using Bayesian estimation for disease detection. The Bayesian estimation used a two-component modelling approach that separates the case detection process and incidence rate using a mixed Poisson distribution while capturing underlying spatio-temporal characteristics. Bayesian model averaging is used in conjunction with the two-component modelling approach to improve the accuracy of estimates by incorporating many candidate models into the analysis instead of using fixed component models. This method is applied to an infectious disease, influenza, in Victoria, Australia between 2013 and 2016 and the corresponding primary health care of the state. RESULT There is a relatively equal distribution of health resources and services pertaining to general practitioners (GP) and GP clinics in Victoria, Australia. Roughly 80 percent of the population shares 70 percent of the number of GPs and GP clinics. The Bayesian estimation with model averaging revealed that access difficulty to health services impacts both case detection probability and incidence rate. Minimal differences are recorded in the observed and estimated incidence of influenza cases considering social deprivation factors. In most years, areas in Victoria's southwest and eastern parts have potential under-reported cases consistent with their relatively lower number of GP or GP clinics. CONCLUSION The Bayesian model estimated a slight discrepancy between the estimated incidence and the observed cases of influenza in Victoria, Australia in 2013-2016 period. This is consistent with the relatively equal health resources and services in the state. This finding is beneficial in determining areas with potential under-reported cases and under-served health care. The proposed approach in this study provides insight into the impact of health inequality in disease detection without requiring costly and time-extensive surveys and relying mainly on the data at hand. Furthermore, the application of Bayesian model averaging provided a flexible modelling framework that allows covariates to move between case detection and incidence models.
Collapse
Affiliation(s)
- Dinah Jane Lope
- School of Science, Mathematical Sciences Discipline, RMIT University, Melbourne, 3000 Australia
| | - Haydar Demirhan
- School of Science, Mathematical Sciences Discipline, RMIT University, Melbourne, 3000 Australia
| | - Anil Dolgun
- School of Science, Mathematical Sciences Discipline, RMIT University, Melbourne, 3000 Australia
| |
Collapse
|
18
|
Chen J, Song JJ, Stamey JD. A Bayesian Hierarchical Spatial Model to Correct for Misreporting in Count Data: Application to State-Level COVID-19 Data in the United States. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2022; 19:3327. [PMID: 35329019 PMCID: PMC8950980 DOI: 10.3390/ijerph19063327] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/17/2021] [Revised: 02/27/2022] [Accepted: 03/05/2022] [Indexed: 02/01/2023]
Abstract
The COVID-19 pandemic that began at the end of 2019 has caused hundreds of millions of infections and millions of deaths worldwide. COVID-19 posed a threat to human health and profoundly impacted the global economy and people's lifestyles. The United States is one of the countries severely affected by the disease. Evidence shows that the spread of COVID-19 was significantly underestimated in the early stages, which prevented governments from adopting effective interventions promptly to curb the spread of the disease. This paper adopts a Bayesian hierarchical model to study the under-reporting of COVID-19 at the state level in the United States as of the end of April 2020. The model examines the effects of different covariates on the under-reporting and accurate incidence rates and considers spatial dependency. In addition to under-reporting (false negatives), we also explore the impact of over-reporting (false positives). Adjusting for misclassification requires adding additional parameters that are not directly identified by the observed data. Informative priors are required. We discuss prior elicitation and include R functions that convert expert information into the appropriate prior distribution.
Collapse
Affiliation(s)
| | | | - James D. Stamey
- Department of Statistical Science, Baylor University, Waco, TX 76798-7140, USA; (J.C.); (J.J.S.)
| |
Collapse
|
19
|
New statistical model for misreported data with application to current public health challenges. Sci Rep 2021; 11:23321. [PMID: 34857815 PMCID: PMC8640038 DOI: 10.1038/s41598-021-02620-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2020] [Accepted: 11/18/2021] [Indexed: 01/19/2023] Open
Abstract
The main goal of this work is to present a new model able to deal with potentially misreported continuous time series. The proposed model is able to handle the autocorrelation structure in continuous time series data, which might be partially or totally underreported or overreported. Its performance is illustrated through a comprehensive simulation study considering several autocorrelation structures and three real data applications on human papillomavirus incidence in Girona (Catalonia, Spain) and Covid-19 incidence in two regions with very different circumstances: the early days of the epidemic in the Chinese region of Heilongjiang and the most current data from Catalonia.
Collapse
|
20
|
Sengupta D, Roy S, Banerjee T. Testing of Poisson mean with under-reported counts. BRAZ J PROBAB STAT 2021. [DOI: 10.1214/20-bjps493] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Affiliation(s)
- Debjit Sengupta
- Department of Statistics, St. Xavier’s College, 30, Mother Teresa Sarani, Kolkata-700016, India
| | - Surupa Roy
- Department of Statistics, St. Xavier’s College, 30, Mother Teresa Sarani, Kolkata-700016, India
| | | |
Collapse
|
21
|
Moriña D, Fernández-Fontelo A, Cabaña A, Arratia A, Ávalos G, Puig P. Cumulated burden of Covid-19 in Spain from a Bayesian perspective. Eur J Public Health 2021; 31:917-920. [PMID: 34180981 PMCID: PMC8394830 DOI: 10.1093/eurpub/ckab118] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
BACKGROUND The main goal of this work is to estimate the actual number of cases of Covid-19 in Spain in the period 01-31-2020/06-01-2020 by Autonomous Communities. Based on these estimates, this work allows us to accurately re-estimate the lethality of the disease in Spain, taking into account unreported cases. METHODS A hierarchical Bayesian model recently proposed in the literature has been adapted to model the actual number of Covid-19 cases in Spain. RESULTS The results of this work show that the real load of Covid-19 in Spain in the period considered is well above the data registered by the public health system. Specifically, the model estimates show that, cumulatively until June 1st, 2020, there were 2 425 930 cases of Covid-19 in Spain with characteristics similar to those reported (95% credibility interval: 2 148 261 2 813 864), from which were actually registered only 518 664. CONCLUSIONS Considering the results obtained from the second wave of the Spanish seroprevalence study, which estimates 2 350 324 cases of Covid-19 produced in Spain, in the period of time considered, it can be seen that the estimates provided by the model are quite good. This work clearly shows the key importance of having good quality data to optimize decision-making in the critical context of dealing with a pandemic.
Collapse
Affiliation(s)
- David Moriña
- Department of Econometrics, Statistics and Applied Economics, Riskcenter-IREA, Universitat de Barcelona (UB), Barcelona, Spain.,Centre de Recerca Matemàtica (CRM, Barcelona Graduate School of Mathematics (BGSMath), Departament de Matemàtiques, Universitat Autònoma de Barcelona (UAB), Cerdanyola del Vallès, Spain)
| | - Amanda Fernández-Fontelo
- Chair of Statistics, School of Business and Economics, Humboldt-Universität zu Berlin, Berlin, Germany
| | - Alejandra Cabaña
- Centre de Recerca Matemàtica (CRM, Barcelona Graduate School of Mathematics (BGSMath), Departament de Matemàtiques, Universitat Autònoma de Barcelona (UAB), Cerdanyola del Vallès, Spain)
| | - Argimiro Arratia
- Department of Computer Science, Universitat Politècnica de Catalunya (UPC), Barcelona, Spain
| | - Gustavo Ávalos
- Department of Computer Science, Universitat Politècnica de Catalunya (UPC), Barcelona, Spain
| | - Pedro Puig
- Centre de Recerca Matemàtica (CRM, Barcelona Graduate School of Mathematics (BGSMath), Departament de Matemàtiques, Universitat Autònoma de Barcelona (UAB), Cerdanyola del Vallès, Spain)
| |
Collapse
|
22
|
Mena GE, Martinez PP, Mahmud AS, Marquet PA, Buckee CO, Santillana M. Socioeconomic status determines COVID-19 incidence and related mortality in Santiago, Chile. Science 2021; 372:eabg5298. [PMID: 33906968 PMCID: PMC8158961 DOI: 10.1126/science.abg5298] [Citation(s) in RCA: 227] [Impact Index Per Article: 75.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2021] [Accepted: 04/22/2021] [Indexed: 12/22/2022]
Abstract
The COVID-19 pandemic has affected cities particularly hard. Here, we provide an in-depth characterization of disease incidence and mortality and their dependence on demographic and socioeconomic strata in Santiago, a highly segregated city and the capital of Chile. Our analyses show a strong association between socioeconomic status and both COVID-19 outcomes and public health capacity. People living in municipalities with low socioeconomic status did not reduce their mobility during lockdowns as much as those in more affluent municipalities. Testing volumes may have been insufficient early in the pandemic in those places, and both test positivity rates and testing delays were much higher. We find a strong association between socioeconomic status and mortality, measured by either COVID-19-attributed deaths or excess deaths. Finally, we show that infection fatality rates in young people are higher in low-income municipalities. Together, these results highlight the critical consequences of socioeconomic inequalities on health outcomes.
Collapse
Affiliation(s)
- Gonzalo E Mena
- Department of Statistics, University of Oxford, Oxford, UK.
| | - Pamela P Martinez
- Center for Communicable Disease Dynamics, Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA.
- Department of Microbiology, University of Illinois at Urbana-Champaign, Urbana, IL, USA
- Department of Statistics, University of Illinois at Urbana-Champaign, Champaign, IL, USA
- Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Ayesha S Mahmud
- Center for Communicable Disease Dynamics, Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA
- Department of Demography, University of California, Berkeley, CA, USA
| | - Pablo A Marquet
- Departamento de Ecología, Facultad de Ciencias Biológicas, Pontificia Universidad Católica de Chile, Santiago, Chile
- Instituto de Ecología y Biodiversidad (IEB), Santiago, Chile
- The Santa Fe Institute, Santa Fe, NM, USA
- Instituto de Sistema Complejos de Valparaíso (ISCV), Valparaíso, Chile
- Centro de Cambio Global UC, Pontificia Universidad Católica de Chile, Santiago, Chile
| | - Caroline O Buckee
- Center for Communicable Disease Dynamics, Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Mauricio Santillana
- Center for Communicable Disease Dynamics, Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA
- Computational Health Informatics Program, Boston Children's Hospital, Boston, MA, USA
- Department of Pediatrics, Harvard Medical School, Boston, MA, USA
| |
Collapse
|
23
|
Mena G, Martinez PP, Mahmud AS, Marquet PA, Buckee CO, Santillana M. Socioeconomic status determines COVID-19 incidence and related mortality in Santiago, Chile. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2021:2021.01.12.21249682. [PMID: 33469598 PMCID: PMC7814844 DOI: 10.1101/2021.01.12.21249682] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/13/2023]
Abstract
The current coronavirus disease 2019 (COVID-19) pandemic has impacted dense urban populations particularly hard. Here, we provide an in-depth characterization of disease incidence and mortality patterns, and their dependence on demographic and socioeconomic strata in Santiago, a highly segregated city and the capital of Chile. We find that among all age groups, there is a strong association between socioeconomic status and both mortality -measured either by direct COVID-19 attributed deaths or excess deaths- and public health capacity. Specifically, we show that behavioral factors like human mobility, as well as health system factors such as testing volumes, testing delays, and test positivity rates are associated with disease outcomes. These robust patterns suggest multiple possibly interacting pathways that can explain the observed disease burden and mortality differentials: (i) in lower socioeconomic status municipalities, human mobility was not reduced as much as in more affluent municipalities; (ii) testing volumes in these locations were insufficient early in the pandemic and public health interventions were applied too late to be effective; (iii) test positivity and testing delays were much higher in less affluent municipalities, indicating an impaired capacity of the health-care system to contain the spread of the epidemic; and (iv) infection fatality rates appear much higher in the lower end of the socioeconomic spectrum. Together, these findings highlight the exacerbated consequences of health-care inequalities in a large city of the developing world, and provide practical methodological approaches useful for characterizing COVID-19 burden and mortality in other segregated urban centers.
Collapse
|
24
|
Fernández-Fontelo A, Moriña D, Cabaña A, Arratia A, Puig P. Estimating the real burden of disease under a pandemic situation: The SARS-CoV2 case. PLoS One 2020; 15:e0242956. [PMID: 33270713 PMCID: PMC7714127 DOI: 10.1371/journal.pone.0242956] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2020] [Accepted: 11/12/2020] [Indexed: 01/01/2023] Open
Abstract
The present paper introduces a new model used to study and analyse the severe acute respiratory syndrome coronavirus 2 (SARS-CoV2) epidemic-reported-data from Spain. This is a Hidden Markov Model whose hidden layer is a regeneration process with Poisson immigration, Po-INAR(1), together with a mechanism that allows the estimation of the under-reporting in non-stationary count time series. A novelty of the model is that the expectation of the unobserved process's innovations is a time-dependent function defined in such a way that information about the spread of an epidemic, as modelled through a Susceptible-Infectious-Removed dynamical system, is incorporated into the model. In addition, the parameter controlling the intensity of the under-reporting is also made to vary with time to adjust to possible seasonality or trend in the data. Maximum likelihood methods are used to estimate the parameters of the model.
Collapse
Affiliation(s)
- Amanda Fernández-Fontelo
- Chair of Statistics, School of Business and Economics, Humboldt-Universität zu Berlin, Berlin, Germany
| | - David Moriña
- Departament de Matemàtiques, Barcelona Graduate School of Mathematics (BGSMath), Universitat Autònoma de Barcelona, Barcelona, Spain
- Department of Econometrics, Statistics and Applied Economics, Riskcenter-IREA, Universitat de Barcelona, Barcelona, Spain
- Centre de Recerca Matemàtica (CRM), Barcelona, Spain
| | - Alejandra Cabaña
- Departament de Matemàtiques, Barcelona Graduate School of Mathematics (BGSMath), Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Argimiro Arratia
- Department of Computer Science, Universitat Politècnica de Catalunya, Barcelona, Spain
| | - Pere Puig
- Departament de Matemàtiques, Barcelona Graduate School of Mathematics (BGSMath), Universitat Autònoma de Barcelona, Barcelona, Spain
| |
Collapse
|
25
|
de Oliveira ACS, Morita LHM, da Silva EB, Zardo LAR, Fontes CJF, Granzotto DCT. Bayesian modeling of COVID-19 cases with a correction to account for under-reported cases. Infect Dis Model 2020; 5:699-713. [PMID: 32995681 DOI: 10.1101/2020.05.24.20112029] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2020] [Revised: 09/14/2020] [Accepted: 09/20/2020] [Indexed: 05/23/2023] Open
Abstract
The novel of COVID-19 disease started in late 2019 making the worldwide governments came across a high number of critical and death cases, beyond constant fear of the collapse in their health systems. Since the beginning of the pandemic, researchers and authorities are mainly concerned with carrying out quantitative studies (modeling and predictions) overcoming the scarcity of tests that lead us to under-reporting cases. To address these issues, we introduce a Bayesian approach to the SIR model with correction for under-reporting in the analysis of COVID-19 cases in Brazil. The proposed model was enforced to obtain estimates of important quantities such as the reproductive rate and the average infection period, along with the more likely date when the pandemic peak may occur. Several under-reporting scenarios were considered in the simulation study, showing how impacting is the lack of information in the modeling.
Collapse
Affiliation(s)
| | - Lia Hanna Martins Morita
- Departamento de Estatística, Universidade Federal de Mato Grosso - UFMT, CEP: 78060-900, Cuiabá, MT, Brazil
| | - Eveliny Barroso da Silva
- Departamento de Estatística, Universidade Federal de Mato Grosso - UFMT, CEP: 78060-900, Cuiabá, MT, Brazil
| | - Luiz André Ribeiro Zardo
- Departamento de Estatística, Universidade Federal de Mato Grosso - UFMT, CEP: 78060-900, Cuiabá, MT, Brazil
| | | | | |
Collapse
|
26
|
Bracher J, Held L. A marginal moment matching approach for fitting endemic-epidemic models to underreported disease surveillance counts. Biometrics 2020; 77:1202-1214. [PMID: 32920842 DOI: 10.1111/biom.13371] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2019] [Accepted: 09/01/2020] [Indexed: 11/30/2022]
Abstract
Count data are often subject to underreporting, especially in infectious disease surveillance. We propose an approximate maximum likelihood method to fit count time series models from the endemic-epidemic class to underreported data. The approach is based on marginal moment matching where underreported processes are approximated through completely observed processes from the same class. Moreover, the form of the bias when underreporting is ignored or taken into account via multiplication factors is analyzed. Notably, we show that this leads to a downward bias in model-based estimates of the effective reproductive number. A marginal moment matching approach can also be used to account for reporting intervals which are longer than the mean serial interval of a disease. The good performance of the proposed methodology is demonstrated in simulation studies. An extension to time-varying parameters and reporting probabilities is discussed and applied in a case study on weekly rotavirus gastroenteritis counts in Berlin, Germany.
Collapse
Affiliation(s)
- Johannes Bracher
- Epidemiology, Biostatistics and Prevention Institute, University of Zurich, Zurich, Switzerland
| | - Leonhard Held
- Epidemiology, Biostatistics and Prevention Institute, University of Zurich, Zurich, Switzerland
| |
Collapse
|
27
|
Stoner O, Economou T. Multivariate hierarchical frameworks for modeling delayed reporting in count data. Biometrics 2019; 76:789-798. [PMID: 31737902 PMCID: PMC7540263 DOI: 10.1111/biom.13188] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2019] [Revised: 11/06/2019] [Accepted: 11/07/2019] [Indexed: 11/28/2022]
Abstract
In many fields and applications, count data can be subject to delayed reporting. This is where the total count, such as the number of disease cases contracted in a given week, may not be immediately available, instead arriving in parts over time. For short‐term decision making, the statistical challenge lies in predicting the total count based on any observed partial counts, along with a robust quantification of uncertainty. We discuss previous approaches to modeling delayed reporting and present a multivariate hierarchical framework where the count generating process and delay mechanism are modeled simultaneously in a flexible way. This framework can also be easily adapted to allow for the presence of underreporting in the final observed count. To illustrate our approach and to compare it with existing frameworks, we present a case study of reported dengue fever cases in Rio de Janeiro. Based on both within‐sample and out‐of‐sample posterior predictive model checking and arguments of interpretability, adaptability, and computational efficiency, we discuss the relative merits of different approaches.
Collapse
Affiliation(s)
- Oliver Stoner
- Department of Mathematics, University of Exeter, Exeter, UK
| | - Theo Economou
- Department of Mathematics, University of Exeter, Exeter, UK
| |
Collapse
|
28
|
Bastos LS, Economou T, Gomes MFC, Villela DAM, Coelho FC, Cruz OG, Stoner O, Bailey T, Codeço CT. A modelling approach for correcting reporting delays in disease surveillance data. Stat Med 2019; 38:4363-4377. [PMID: 31292995 PMCID: PMC6900153 DOI: 10.1002/sim.8303] [Citation(s) in RCA: 50] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2018] [Revised: 05/13/2019] [Accepted: 06/03/2019] [Indexed: 11/05/2022]
Abstract
One difficulty for real-time tracking of epidemics is related to reporting delay. The reporting delay may be due to laboratory confirmation, logistical problems, infrastructure difficulties, and so on. The ability to correct the available information as quickly as possible is crucial, in terms of decision making such as issuing warnings to the public and local authorities. A Bayesian hierarchical modelling approach is proposed as a flexible way of correcting the reporting delays and to quantify the associated uncertainty. Implementation of the model is fast due to the use of the integrated nested Laplace approximation. The approach is illustrated on dengue fever incidence data in Rio de Janeiro, and severe acute respiratory infection data in the state of Paraná, Brazil.
Collapse
Affiliation(s)
- Leonardo S Bastos
- Scientific Computing Program, Oswaldo Cruz Foundation, Rio de Janeiro, Brazil
| | | | - Marcelo F C Gomes
- Scientific Computing Program, Oswaldo Cruz Foundation, Rio de Janeiro, Brazil
| | - Daniel A M Villela
- Scientific Computing Program, Oswaldo Cruz Foundation, Rio de Janeiro, Brazil
| | - Flavio C Coelho
- School of Applied Mathematics, Getulio Vargas Foundation, Rio de Janeiro, Brazil
| | - Oswaldo G Cruz
- Scientific Computing Program, Oswaldo Cruz Foundation, Rio de Janeiro, Brazil
| | - Oliver Stoner
- Department of Mathematics, University of Exeter, Exeter, UK
| | - Trevor Bailey
- Department of Mathematics, University of Exeter, Exeter, UK
| | - Claudia T Codeço
- Scientific Computing Program, Oswaldo Cruz Foundation, Rio de Janeiro, Brazil
| |
Collapse
|
29
|
Stoner O, Economou T, Drummond Marques da Silva G. A Hierarchical Framework for Correcting Under-Reporting in Count Data. J Am Stat Assoc 2019. [DOI: 10.1080/01621459.2019.1573732] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Affiliation(s)
- Oliver Stoner
- Department of Mathematics, University of Exeter, Exeter, UK
| | - Theo Economou
- Department of Mathematics, University of Exeter, Exeter, UK
| | | |
Collapse
|