1
|
Sharmin M, Manivannan M, Woo D, Sorel O, Auclair JR, Gandhi M, Mujawar I. Cross-sectional Ct distributions from qPCR tests can provide an early warning signal for the spread of COVID-19 in communities. Front Public Health 2023; 11:1185720. [PMID: 37841738 PMCID: PMC10570742 DOI: 10.3389/fpubh.2023.1185720] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2023] [Accepted: 09/07/2023] [Indexed: 10/17/2023] Open
Abstract
Background SARS-CoV-2 PCR testing data has been widely used for COVID-19 surveillance. Existing COVID-19 forecasting models mainly rely on case counts obtained from qPCR results, even though the binary PCR results provide a limited picture of the pandemic trajectory. Most forecasting models have failed to accurately predict the COVID-19 waves before they occur. Recently a model utilizing cross-sectional population cycle threshold (Ct-the number of cycles required for the fluorescent signal to cross the background threshold) values obtained from PCR tests (Ct-based model) was developed to overcome the limitations of using only binary PCR results. In this study, we aimed to improve on COVID-19 forecasting models using features derived from the Ct-based model, to detect epidemic waves earlier than case-based trajectories. Methods PCR data was collected weekly at Northeastern University (NU) between August 2020 and January 2022. Campus and county epidemic trajectories were generated from case counts. A novel forecasting approach was developed by enhancing a recent deep learning model with Ct-based features and applied in Suffolk County and NU campus. For this, cross-sectional Ct values from PCR data were used to generate Ct-based epidemic trajectories, including effective reproductive rate (Rt) and incidence. The improvement in forecasting performance was compared using absolute errors and residual squared errors with respect to actual observed cases at the 7-day and 14-day forecasting horizons. The model was also tested prospectively over the period January 2022 to April 2022. Results Rt curves estimated from the Ct-based model indicated epidemic waves 12 to 14 days earlier than Rt curves from NU campus and Suffolk County cases, with a correlation of 0.57. Enhancing the forecasting models with Ct-based information significantly decreased absolute error (decrease of 49.4 and 221.5 for the 7 and 14-day forecasting horizons) and residual squared error (40.6 and 217.1 for the 7 and 14-day forecasting horizons) compared to the original model without Ct features. Conclusion Ct-based epidemic trajectories can herald an earlier signal for impending epidemic waves in the community and forecast transmission peaks. Moreover, COVID-19 forecasting models can be enhanced using these Ct features to improve their forecasting accuracy. In this study, we make the case that public health agencies should publish Ct values along with the binary positive/negative PCR results. Early and accurate forecasting of epidemic waves can inform public health policies and countermeasures which can mitigate spread.
Collapse
Affiliation(s)
- Mahfuza Sharmin
- Thermo Fisher Scientific, South San Francisco, CA, United States
| | - Mani Manivannan
- Thermo Fisher Scientific, South San Francisco, CA, United States
| | - David Woo
- Thermo Fisher Scientific, South San Francisco, CA, United States
| | - Océane Sorel
- Thermo Fisher Scientific, South San Francisco, CA, United States
| | - Jared R. Auclair
- Department of Chemistry and Chemical Biology, Northeastern University, Burlington, MA, United States
| | - Manoj Gandhi
- Thermo Fisher Scientific, South San Francisco, CA, United States
| | - Imran Mujawar
- Thermo Fisher Scientific, South San Francisco, CA, United States
| |
Collapse
|
2
|
Ben-Nun M, Riley P, Turtle J, Riley S. Consistent pattern of epidemic slowing across many geographies led to longer, flatter initial waves of the COVID-19 pandemic. PLoS Comput Biol 2022; 18:e1010375. [PMID: 35969627 PMCID: PMC9410547 DOI: 10.1371/journal.pcbi.1010375] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2022] [Revised: 08/25/2022] [Accepted: 07/11/2022] [Indexed: 11/22/2022] Open
Abstract
To define appropriate planning scenarios for future pandemics of respiratory pathogens, it is important to understand the initial transmission dynamics of COVID-19 during 2020. Here, we fit an age-stratified compartmental model with a flexible underlying transmission term to daily COVID-19 death data from states in the contiguous U.S. and to national and sub-national data from around the world. The daily death data of the first months of the COVID-19 pandemic was qualitatively categorized into one of four main profile types: "spring single-peak", "summer single-peak", "spring/summer two-peak" and "broad with shoulder". We estimated a reproduction number R as a function of calendar time tc and as a function of time since the first death reported in that population (local pandemic time, tp). Contrary to the diversity of categories and range of magnitudes in death incidence profiles, the R(tp) profiles were much more homogeneous. We found that in both the contiguous U.S. and globally, the initial value of both R(tc) and R(tp) was substantial: at or above two. However, during the early months, pandemic time R(tp) decreased exponentially to a value that hovered around one. This decrease was accompanied by a reduction in the variance of R(tp). For calendar time R(tc), the decrease in magnitude was slower and non-exponential, with a smaller reduction in variance. Intriguingly, similar trends of exponential decrease and reduced variance were not observed in raw death data. Our findings suggest that the combination of specific government responses and spontaneous changes in behaviour ensured that transmissibility dropped, rather than remaining constant, during the initial phases of a pandemic. Future pandemic planning scenarios should include models that assume similar decreases in transmissibility, which lead to longer epidemics with lower peaks when compared with models based on constant transmissibility.
Collapse
Affiliation(s)
- Michal Ben-Nun
- Predictive Science Inc., San Diego California United States of America
- * E-mail:
| | - Pete Riley
- Predictive Science Inc., San Diego California United States of America
| | - James Turtle
- Predictive Science Inc., San Diego California United States of America
| | - Steven Riley
- MRC Centre for Outbreak Analysis and Modelling, Department of Infectious Disease Epidemiology, School of Public Health, Imperial College London, London, United Kingdom
| |
Collapse
|
3
|
Riley P, Riley A, Turtle J, Ben-Nun M. COVID-19 deaths: Which explanatory variables matter the most? PLoS One 2022; 17:e0266330. [PMID: 35446873 PMCID: PMC9022803 DOI: 10.1371/journal.pone.0266330] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2021] [Accepted: 03/20/2022] [Indexed: 12/02/2022] Open
Abstract
More than a year since the appearance of Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2), many questions about the disease COVID-19 have been answered; however, many more remain poorly understood. Although the situation continues to evolve, it is crucial to understand what factors may be driving transmission through different populations, both for potential future waves, as well as the implications for future pandemics. In this report, we compiled a database of more than 28 potentially explanatory variables for each of the 50 U.S. states through early May 2020. Using a combination of traditional statistical and modern machine learning approaches, we identified those variables that were the most statistically significant, and, those that were the most important. These variables were chosen to be fiduciaries of a range of possible drivers for COVID-19 deaths in the USA. We found that population-weighted population density (PWPD), some “stay at home” metrics, monthly temperature and precipitation, race/ethnicity, and chronic low-respiratory death rate, were all statistically significant. Of these, PWPD and mobility metrics dominated. This suggests that the biggest impact on COVID-19 deaths was, at least initially, a function of where you lived, and not what you did. However, clearly, increasing social distancing has the net effect of (at least temporarily) reducing the effective PWPD. Our results strongly support the idea that the loosening of “lock-down” orders should be tailored to the local PWPD. In contrast to these variables, while still statistically significant, race/ethnicity, health, and climate effects could only account for a few percent of the variability in deaths. Where associations were anticipated but were not found, we discuss how limitations in the parameters chosen may mask a contribution that might otherwise be present.
Collapse
Affiliation(s)
- Pete Riley
- Predictive Science Inc., San Diego, California, United States of America
- * E-mail:
| | - Allison Riley
- Predictive Science Inc., San Diego, California, United States of America
| | - James Turtle
- Predictive Science Inc., San Diego, California, United States of America
| | - Michal Ben-Nun
- Predictive Science Inc., San Diego, California, United States of America
| |
Collapse
|
4
|
Wang H, Qiu J, Li C, Wan H, Yang C, Zhang T. Applying the Spatial Transmission Network to the Forecast of Infectious Diseases Across Multiple Regions. Front Public Health 2022; 10:774984. [PMID: 35359784 PMCID: PMC8962516 DOI: 10.3389/fpubh.2022.774984] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2021] [Accepted: 02/02/2022] [Indexed: 11/13/2022] Open
Abstract
Objective Timely and accurate forecast of infectious diseases is essential for achieving precise prevention and control. A good forecasting method of infectious diseases should have the advantages of interpretability, feasibility, and forecasting performance. Since previous research had illustrated that the spatial transmission network (STN) showed good interpretability and feasibility, this study further explored its forecasting performance for infectious diseases across multiple regions. Meanwhile, this study also showed whether the STN could overcome the challenges of model rationality and practical needs. Methods The construction of the STN framework involved three major steps: the spatial kluster analysis by tree edge removal (SKATER) algorithm, structure learning by dynamic Bayesian network (DBN), and parameter learning by the vector autoregressive moving average (VARMA) model. Then, we evaluated the forecasting performance of STN by comparing its accuracy with that of the mechanism models like susceptible-exposed-infectious-recovered-susceptible (SEIRS) and machine-learning algorithm like long-short-term memory (LSTM). At the same time, we assessed the robustness of forecasting performance of STN in high and low incidence seasons. The influenza-like illness (ILI) data in the Sichuan Province of China from 2010 to 2017 were used as an example for illustration. Results The STN model revealed that ILI was likely to spread among multiple cities in Sichuan during the study period. During the whole study period, the forecasting accuracy of the STN (mean absolute percentage error [MAPE] = 31.134) was significantly better than that of the LSTM (MAPE = 41.657) and the SEIRS (MAPE = 62.039). In addition, the forecasting performance of STN was also superior to those of the other two methods in either the high incidence season (MAPE = 24.742) or the low incidence season (MAPE = 26.209), and the superiority was more obvious in the high incidence season. Conclusion This study applied the STN to the forecast of infectious diseases across multiple regions. The results illustrated that the STN not only had good accuracy in forecasting performance but also indicated the spreading directions of infectious diseases among multiple regions to a certain extent. Therefore, the STN is a promising candidate to improve the surveillance work.
Collapse
Affiliation(s)
- Huimin Wang
- Department of Epidemiology and Health Statistics, West China School of Public Health and West China Fourth Hospital, Sichuan University, Chengdu, China
| | - Jianqing Qiu
- Department of Epidemiology and Health Statistics, West China School of Public Health and West China Fourth Hospital, Sichuan University, Chengdu, China
| | - Cheng Li
- Department of Epidemiology and Health Statistics, West China School of Public Health and West China Fourth Hospital, Sichuan University, Chengdu, China
| | - Hongli Wan
- Department of Epidemiology and Health Statistics, West China School of Public Health and West China Fourth Hospital, Sichuan University, Chengdu, China
| | - Changhong Yang
- Sichuan Center for Disease Control and Prevention, Chengdu, China
| | - Tao Zhang
- Department of Epidemiology and Health Statistics, West China School of Public Health and West China Fourth Hospital, Sichuan University, Chengdu, China
- *Correspondence: Tao Zhang
| |
Collapse
|
5
|
Osthus D. Fast and accurate influenza forecasting in the United States with Inferno. PLoS Comput Biol 2022; 18:e1008651. [PMID: 35100253 PMCID: PMC8830797 DOI: 10.1371/journal.pcbi.1008651] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2020] [Revised: 02/10/2022] [Accepted: 01/02/2022] [Indexed: 01/15/2023] Open
Abstract
Infectious disease forecasting is an emerging field and has the potential to improve public health through anticipatory resource allocation, situational awareness, and mitigation planning. By way of exploring and operationalizing disease forecasting, the U.S. Centers for Disease Control and Prevention (CDC) has hosted FluSight since the 2013/14 flu season, an annual flu forecasting challenge. Since FluSight’s onset, forecasters have developed and improved forecasting models in an effort to provide more timely, reliable, and accurate information about the likely progression of the outbreak. While improving the predictive performance of these forecasting models is often the primary objective, it is also important for a forecasting model to run quickly, facilitating further model development and improvement while providing flexibility when deployed in a real-time setting. In this vein I introduce Inferno, a fast and accurate flu forecasting model inspired by Dante, the top performing model in the 2018/19 FluSight challenge. When pseudoprospectively compared to all models that participated in FluSight 2018/19, Inferno would have placed 2nd in the national and regional challenge as well as the state challenge, behind only Dante. Inferno, however, runs in minutes and is trivially parallelizable, while Dante takes hours to run, representing a significant operational improvement with minimal impact to performance. Forecasting challenges like FluSight should continue to monitor and evaluate how they can be modified and expanded to incentivize the development of forecasting models that benefit public health. Infectious disease forecasting, if accurate, timely, and reliable, can assist decision makers with resource allocation planning in an attempt to curb the negative impacts of an outbreak. Forecasting challenges, like the U.S. Centers for Disease Control and Prevention’s flu forecasting challenge, FluSight, provide a space for teams to develop and operationalize real-time forecasting models that benefit public health, with weekly forecasts made at the state-level, Health and Human Services region-level, and the United States. The ultimate goal of these models is to produce accurate forecasts within the constraints of the forecasting challenge. Having a forecasting model that runs quickly is also important for future scalability, model development, and operational flexibility. In this paper, I present a fast and accurate flu forecasting model, Inferno. Through retrospective comparisons with FluSight-participating models, Inferno was shown to be a leading forecasting model in the field. Inferno, however, runs in minutes not hours, as other leading forecasting models do. This reduction in runtime constitutes an advancement in flu forecasting, positioning Inferno to scale to more granular geographic units, like counties or health care providers.
Collapse
Affiliation(s)
- Dave Osthus
- Statistical Sciences Group, Los Alamos National Laboratory, Los Alamos, New Mexico, United States of America
- * E-mail:
| |
Collapse
|
6
|
Childs ML, Kain MP, Harris MJ, Kirk D, Couper L, Nova N, Delwel I, Ritchie J, Becker AD, Mordecai EA. The impact of long-term non-pharmaceutical interventions on COVID-19 epidemic dynamics and control: the value and limitations of early models. Proc Biol Sci 2021; 288:20210811. [PMID: 34428971 PMCID: PMC8385372 DOI: 10.1098/rspb.2021.0811] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2021] [Accepted: 07/26/2021] [Indexed: 12/21/2022] Open
Abstract
Mathematical models of epidemics are important tools for predicting epidemic dynamics and evaluating interventions. Yet, because early models are built on limited information, it is unclear how long they will accurately capture epidemic dynamics. Using a stochastic SEIR model of COVID-19 fitted to reported deaths, we estimated transmission parameters at different time points during the first wave of the epidemic (March-June, 2020) in Santa Clara County, California. Although our estimated basic reproduction number ([Formula: see text]) remained stable from early April to late June (with an overall median of 3.76), our estimated effective reproduction number ([Formula: see text]) varied from 0.18 to 1.02 in April before stabilizing at 0.64 on 27 May. Between 22 April and 27 May, our model accurately predicted dynamics through June; however, the model did not predict rising summer cases after shelter-in-place orders were relaxed in June, which, in early July, was reflected in cases but not yet in deaths. While models are critical for informing intervention policy early in an epidemic, their performance will be limited as epidemic dynamics evolve. This paper is one of the first to evaluate the accuracy of an early epidemiological compartment model over time to understand the value and limitations of models during unfolding epidemics.
Collapse
Affiliation(s)
- Marissa L. Childs
- Emmett Interdisciplinary Program in Environment and Resources, Stanford University, Stanford, CA 94305, USA
| | - Morgan P. Kain
- Department of Biology, Stanford University, Stanford, CA 94305, USA
- Natural Capital Project, Woods Institute for the Environment, Stanford University, Stanford, CA 94305, USA
| | | | - Devin Kirk
- Department of Biology, Stanford University, Stanford, CA 94305, USA
- Department of Zoology, University of British Columbia, Vancouver, British Columbia, Canada V6T 1Z4
| | - Lisa Couper
- Department of Biology, Stanford University, Stanford, CA 94305, USA
| | - Nicole Nova
- Department of Biology, Stanford University, Stanford, CA 94305, USA
| | - Isabel Delwel
- Department of Microbiology and Immunology, Stanford University, Stanford, CA 94305, USA
| | - Jacob Ritchie
- Department of Computer Science, Stanford University, Stanford, CA 94305, USA
| | | | - Erin A. Mordecai
- Department of Biology, Stanford University, Stanford, CA 94305, USA
| |
Collapse
|
7
|
Turtle J, Riley P, Ben-Nun M, Riley S. Accurate influenza forecasts using type-specific incidence data for small geographic units. PLoS Comput Biol 2021; 17:e1009230. [PMID: 34324487 PMCID: PMC8354478 DOI: 10.1371/journal.pcbi.1009230] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2020] [Revised: 08/10/2021] [Accepted: 06/30/2021] [Indexed: 11/24/2022] Open
Abstract
Influenza incidence forecasting is used to facilitate better health system planning and could potentially be used to allow at-risk individuals to modify their behavior during a severe seasonal influenza epidemic or a novel respiratory pandemic. For example, the US Centers for Disease Control and Prevention (CDC) runs an annual competition to forecast influenza-like illness (ILI) at the regional and national levels in the US, based on a standard discretized incidence scale. Here, we use a suite of forecasting models to analyze type-specific incidence at the smaller spatial scale of clusters of nearby counties. We used data from point-of-care (POC) diagnostic machines over three seasons, in 10 clusters, capturing: 57 counties; 1,061,891 total specimens; and 173,909 specimens positive for Influenza A. Total specimens were closely correlated with comparable CDC ILI data. Mechanistic models were substantially more accurate when forecasting influenza A positive POC data than total specimen POC data, especially at longer lead times. Also, models that fit subpopulations of the cluster (individual counties) separately were better able to forecast clusters than were models that directly fit to aggregated cluster data. Public health authorities may wish to consider developing forecasting pipelines for type-specific POC data in addition to ILI data. Simple mechanistic models will likely improve forecast accuracy when applied at small spatial scales to pathogen-specific data before being scaled to larger geographical units and broader syndromic data. Highly local forecasts may enable new public health messaging to encourage at-risk individuals to temporarily reduce their social mixing during seasonal peaks and guide public health intervention policy during potentially severe novel influenza pandemics.
Collapse
Affiliation(s)
- James Turtle
- Infectious Disease Group, Predictive Science Inc., San Diego, California, United States
- * E-mail:
| | - Pete Riley
- Infectious Disease Group, Predictive Science Inc., San Diego, California, United States
| | - Michal Ben-Nun
- Infectious Disease Group, Predictive Science Inc., San Diego, California, United States
| | - Steven Riley
- Infectious Disease Group, Predictive Science Inc., San Diego, California, United States
- MRC Centre for Outbreak Analysis and Modelling, Department of Infectious Disease Epidemiology, School of Public Health, Imperial College London, United Kingdom
| |
Collapse
|
8
|
Abstract
Influenza forecasting in the United States (US) is complex and challenging due to spatial and temporal variability, nested geographic scales of interest, and heterogeneous surveillance participation. Here we present Dante, a multiscale influenza forecasting model that learns rather than prescribes spatial, temporal, and surveillance data structure and generates coherent forecasts across state, regional, and national scales. We retrospectively compare Dante's short-term and seasonal forecasts for previous flu seasons to the Dynamic Bayesian Model (DBM), a leading competitor. Dante outperformed DBM for nearly all spatial units, flu seasons, geographic scales, and forecasting targets. Dante's sharper and more accurate forecasts also suggest greater public health utility. Dante placed 1st in the Centers for Disease Control and Prevention's prospective 2018/19 FluSight challenge in both the national and regional competition and the state competition. The methodology underpinning Dante can be used in other seasonal disease forecasting contexts having nested geographic scales of interest.
Collapse
Affiliation(s)
- Dave Osthus
- Los Alamos National Laboratory, Statistical Sciences Group, Los Alamos, NM, USA.
| | - Kelly R Moran
- Los Alamos National Laboratory, Statistical Sciences Group, Los Alamos, NM, USA.,Department of Statistical Science, Duke University, Durham, NC, USA
| |
Collapse
|
9
|
Pei S, Teng X, Lewis P, Shaman J. Optimizing respiratory virus surveillance networks using uncertainty propagation. Nat Commun 2021; 12:222. [PMID: 33431854 PMCID: PMC7801666 DOI: 10.1038/s41467-020-20399-3] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2019] [Accepted: 12/01/2020] [Indexed: 02/07/2023] Open
Abstract
Infectious disease prevention, control and forecasting rely on sentinel observations; however, many locations lack the capacity for routine surveillance. Here we show that, by using data from multiple sites collectively, accurate estimation and forecasting of respiratory diseases for locations without surveillance is feasible. We develop a framework to optimize surveillance sites that suppresses uncertainty propagation in a networked disease transmission model. Using influenza outbreaks from 35 US states, the optimized system generates better near-term predictions than alternate systems designed using population and human mobility. We also find that monitoring regional population centers serves as a reasonable proxy for the optimized network and could direct surveillance for diseases with limited records. The proxy method is validated using model simulations for 3,108 US counties and historical data for two other respiratory pathogens - human metapneumovirus and seasonal coronavirus - from 35 US states and can be used to guide systemic allocation of surveillance efforts.
Collapse
Affiliation(s)
- Sen Pei
- Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, NY, 10032, USA.
| | - Xian Teng
- School of Computing and Information, University of Pittsburgh, Pittsburgh, PA, 15260, USA
| | - Paul Lewis
- Integrated Biosurveillance Section, Armed Forces Health Surveillance Branch, Silver Spring, MD, 20904, USA
| | - Jeffrey Shaman
- Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, NY, 10032, USA.
| |
Collapse
|
10
|
Pei S, Shaman J. Aggregating forecasts of multiple respiratory pathogens supports more accurate forecasting of influenza-like illness. PLoS Comput Biol 2020; 16:e1008301. [PMID: 33090997 PMCID: PMC7608986 DOI: 10.1371/journal.pcbi.1008301] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2019] [Revised: 11/03/2020] [Accepted: 09/02/2020] [Indexed: 12/12/2022] Open
Abstract
Influenza-like illness (ILI) is a commonly measured syndromic signal representative of a range of acute respiratory infections. Reliable forecasts of ILI can support better preparation for patient surges in healthcare systems. Although ILI is an amalgamation of multiple pathogens with variable seasonal phasing and attack rates, most existing process-based forecasting systems treat ILI as a single infectious agent. Here, using ILI records and virologic surveillance data, we show that ILI signal can be disaggregated into distinct viral components. We generate separate predictions for six contributing pathogens (influenza A/H1, A/H3, B, respiratory syncytial virus, and human parainfluenza virus types 1-2 and 3), and develop a method to forecast ILI by aggregating these predictions. The relative contribution of each pathogen to the total ILI signal is estimated using a Markov Chain Monte Carlo (MCMC) method upon forecast aggregation. We find highly variable overall contributions from influenza type A viruses across seasons, but relatively stable contributions for the other pathogens. Using historical data from 1997 to 2014 at US national and regional levels, the proposed forecasting system generates improved predictions of both seasonal and near-term targets relative to a baseline method that simulates ILI as a single pathogen. The hierarchical forecasting system can generate predictions for each viral component, as well as infer and predict their contributions to ILI, which may additionally help physicians determine the etiological causes of ILI in clinical settings.
Collapse
Affiliation(s)
- Sen Pei
- Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, NY, United States of America
| | - Jeffrey Shaman
- Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, NY, United States of America
| |
Collapse
|
11
|
Aris-Brosou S, Parent L, Ibeh N. Viral Long-Term Evolutionary Strategies Favor Stability over Proliferation. Viruses 2019; 11:v11080677. [PMID: 31344814 PMCID: PMC6722887 DOI: 10.3390/v11080677] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2019] [Revised: 07/12/2019] [Accepted: 07/20/2019] [Indexed: 02/01/2023] Open
Abstract
Viruses are known to have some of the highest and most diverse mutation rates found in any biological replicator, with single-stranded (ss) RNA viruses evolving the fastest, and double-stranded (ds) DNA viruses having rates approaching those of bacteria. As mutation rates are tightly and negatively correlated with genome size, selection is a clear driver of viral evolution. However, the role of intragenomic interactions as drivers of viral evolution is still unclear. To understand how these two processes affect the long-term evolution of viruses infecting humans, we comprehensively analyzed ssRNA, ssDNA, dsRNA, and dsDNA viruses, to find which virus types and which functions show evidence for episodic diversifying selection and correlated evolution. We show that selection mostly affects single stranded viruses, that correlated evolution is more prevalent in DNA viruses, and that both processes, taken independently, mostly affect viral replication. However, the genes that are jointly affected by both processes are involved in key aspects of their life cycle, favoring viral stability over proliferation. We further show that both evolutionary processes are intimately linked at the amino acid level, which suggests that it is the joint action of selection and correlated evolution, and not just selection, that shapes the evolutionary trajectories of viruses—and possibly of their epidemiological potential.
Collapse
Affiliation(s)
- Stéphane Aris-Brosou
- Department of Biology, University of Ottawa, Ottawa, ON K1N 6N5, Canada.
- Department of Mathematics and Statistics, University of Ottawa, Ottawa, ON K1N 6N5, Canada.
| | - Louis Parent
- Department of Biology, University of Ottawa, Ottawa, ON K1N 6N5, Canada
| | - Neke Ibeh
- Department of Biology, University of Ottawa, Ottawa, ON K1N 6N5, Canada
| |
Collapse
|