1
|
Vaughan TG, Scire J, Nadeau SA, Stadler T. Estimates of early outbreak-specific SARS-CoV-2 epidemiological parameters from genomic data. Proc Natl Acad Sci U S A 2024; 121:e2308125121. [PMID: 38175864 PMCID: PMC10786264 DOI: 10.1073/pnas.2308125121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2023] [Accepted: 12/02/2023] [Indexed: 01/06/2024] Open
Abstract
We estimate the basic reproductive number and case counts for 15 distinct Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) outbreaks, distributed across 11 populations (10 countries and one cruise ship), based solely on phylodynamic analyses of genomic data. Our results indicate that, prior to significant public health interventions, the reproductive numbers for 10 (out of 15) of these outbreaks are similar, with median posterior estimates ranging between 1.4 and 2.8. These estimates provide a view which is complementary to that provided by those based on traditional line listing data. The genomic-based view is arguably less susceptible to biases resulting from differences in testing protocols, testing intensity, and import of cases into the community of interest. In the analyses reported here, the genomic data primarily provide information regarding which samples belong to a particular outbreak. We observe that once these outbreaks are identified, the sampling dates carry the majority of the information regarding the reproductive number. Finally, we provide genome-based estimates of the cumulative number of infections for each outbreak. For 7 out of 11 of the populations studied, the number of confirmed cases is much bigger than the cumulative number of infections estimated from the sequence data, a possible explanation being the presence of unsequenced outbreaks in these populations.
Collapse
Affiliation(s)
- Timothy G. Vaughan
- Department of Biosystems Science and Engineering, Eidgenössiche Technische Hochschule Zurich, Basel4058, Switzerland
- Computational Evolution Group, Swiss Institute of Bioinformatics, Lausanne1015, Switzerland
| | - Jérémie Scire
- Department of Biosystems Science and Engineering, Eidgenössiche Technische Hochschule Zurich, Basel4058, Switzerland
- Computational Evolution Group, Swiss Institute of Bioinformatics, Lausanne1015, Switzerland
| | - Sarah A. Nadeau
- Department of Biosystems Science and Engineering, Eidgenössiche Technische Hochschule Zurich, Basel4058, Switzerland
- Computational Evolution Group, Swiss Institute of Bioinformatics, Lausanne1015, Switzerland
| | - Tanja Stadler
- Department of Biosystems Science and Engineering, Eidgenössiche Technische Hochschule Zurich, Basel4058, Switzerland
- Computational Evolution Group, Swiss Institute of Bioinformatics, Lausanne1015, Switzerland
| |
Collapse
|
2
|
Weber A, Översti S, Kühnert D. Reconstructing relative transmission rates in Bayesian phylodynamics: Two-fold transmission advantage of Omicron in Berlin, Germany during December 2021. Virus Evol 2023; 9:vead070. [PMID: 38107332 PMCID: PMC10725310 DOI: 10.1093/ve/vead070] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2023] [Revised: 11/08/2023] [Accepted: 11/27/2023] [Indexed: 12/19/2023] Open
Abstract
Phylodynamic methods have lately played a key role in understanding the spread of infectious diseases. During the coronavirus disease (COVID-19) pandemic, large scale genomic surveillance has further increased the potential of dynamic inference from viral genomes. With the continual emergence of novel severe acute respiratory syndrome coronavirus type 2 (SARS-CoV-2) variants, explicitly allowing transmission rate differences between simultaneously circulating variants in phylodynamic inference is crucial. In this study, we present and empirically validate an extension to the BEAST2 package birth-death skyline model (BDSKY), BDSKY[Formula: see text], which introduces a scaling factor for the transmission rate between independent, jointly inferred trees. In an extensive simulation study, we show that BDSKY[Formula: see text] robustly infers the relative transmission rates under different epidemic scenarios. Using publicly available genome data of SARS-CoV-2, we apply BDSKY[Formula: see text] to quantify the transmission advantage of the Omicron over the Delta variant in Berlin, Germany. We find the overall transmission rate of Omicron to be scaled by a factor of two with pronounced variation between the individual clusters of each variant. These results quantify the transmission advantage of Omicron over the previously circulating Delta variant, in a crucial period of pre-established non-pharmaceutical interventions. By inferring variant- as well as cluster-specific transmission rate scaling factors, we show the differences in transmission dynamics for each variant. This highlights the importance of incorporating lineage-specific transmission differences in phylodynamic inference.
Collapse
Affiliation(s)
- Ariane Weber
- Transmission, Infection, Diversification & Evolution Group (tide), Max Planck Institute of Geoanthropology, Kahlaische Strasse 10, Jena, Thuringia 07745, Germany
- Max Planck Institute for Evolutionary Anthropology, Deutscher Platz 6, Leipzig, Saxony 04103, Germany
| | | | - Denise Kühnert
- Transmission, Infection, Diversification & Evolution Group (tide), Max Planck Institute of Geoanthropology, Kahlaische Strasse 10, Jena, Thuringia 07745, Germany
- Max Planck Institute for Evolutionary Anthropology, Deutscher Platz 6, Leipzig, Saxony 04103, Germany
- Centre for Artificial Intelligence in Public Health Research, Robert Koch Institute, Ludwig-Witthöft-Straße 14, Wildau, Brandenburg 15745, Germany
| |
Collapse
|
3
|
Barido-Sottani J, Morlon H. The ClaDS rate-heterogeneous birth-death prior for full phylogenetic inference in BEAST2. Syst Biol 2023; 72:1180-1187. [PMID: 37161619 PMCID: PMC10627560 DOI: 10.1093/sysbio/syad027] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2022] [Revised: 01/16/2023] [Accepted: 04/24/2023] [Indexed: 05/11/2023] Open
Abstract
Bayesian phylogenetic inference requires a tree prior, which models the underlying diversification process that gives rise to the phylogeny. Existing birth-death diversification models include a wide range of features, for instance, lineage-specific variations in speciation and extinction (SSE) rates. While across-lineage variation in SSE rates is widespread in empirical datasets, few heterogeneous rate models have been implemented as tree priors for Bayesian phylogenetic inference. As a consequence, rate heterogeneity is typically ignored when reconstructing phylogenies, and rate heterogeneity is usually investigated on fixed trees. In this paper, we present a new BEAST2 package implementing the cladogenetic diversification rate shift (ClaDS) model as a tree prior. ClaDS is a birth-death diversification model designed to capture small progressive variations in birth and death rates along a phylogeny. Unlike previous implementations of ClaDS, which were designed to be used with fixed, user-chosen phylogenies, our package is implemented in the BEAST2 framework and thus allows full phylogenetic inference, where the phylogeny and model parameters are co-estimated from a molecular alignment. Our package provides all necessary components of the inference, including a new tree object and operators to propose moves to the Monte-Carlo Markov chain. It also includes a graphical interface through BEAUti. We validate our implementation of the package by comparing the produced distributions to simulated data and show an empirical example of the full inference, using a dataset of cetaceans.
Collapse
Affiliation(s)
- Joëlle Barido-Sottani
- Institut de Biologie de l’ENS (IBENS), École normale supérieure, CNRS, INSERM, Université PSL, 75005 Paris, France
| | - Hélène Morlon
- Institut de Biologie de l’ENS (IBENS), École normale supérieure, CNRS, INSERM, Université PSL, 75005 Paris, France
| |
Collapse
|
4
|
Kopperud BT, Magee AF, Höhna S. Rapidly changing speciation and extinction rates can be inferred in spite of nonidentifiability. Proc Natl Acad Sci U S A 2023; 120:e2208851120. [PMID: 36757894 PMCID: PMC9963352 DOI: 10.1073/pnas.2208851120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2022] [Accepted: 12/01/2022] [Indexed: 02/10/2023] Open
Abstract
The birth-death model is commonly used to infer speciation and extinction rates by fitting the model to phylogenetic trees with exclusively extant taxa. Recently, it was demonstrated that speciation and extinction rates are not identifiable if the rates are allowed to vary freely over time. The group of birth-death models that have the same likelihood is called a congruence class, and there is no statistical evidence to favor one model over the other. This issue has led researchers to question if and what patterns can reliably be inferred from phylogenies of only extant taxa and whether time-variable birth-death models should be fitted at all. We explore the congruence class in the context of several empirical phylogenies as well as hypothetical scenarios. For these empirical phylogenies, we assume that we inferred the true congruence class. Thus, our conclusions apply to any empirical phylogeny for which we robustly inferred the true congruence class. When we summarize shared patterns in the congruence class, we show that strong directional trends in speciation and extinction rates are shared among most models. Therefore, we conclude that the inference of strong directional trends is robust. Conversely, estimates of constant rates or gentle slopes are not robust and must be treated with caution. Interestingly, the space of valid speciation rates is narrower and more limited in contrast to extinction rates, which are less constrained. These results provide further evidence and insights that speciation rates can be estimated more reliably than extinction rates.
Collapse
Affiliation(s)
- Bjørn T. Kopperud
- GeoBio-Center LMU, Ludwig-Maximilians-Universität München, Munich80333, Germany
- Department of Earth and Environmental Sciences, Palaeontology & Geobiology, Ludwig-Maximilians-Universität München, Munich80333, Germany
| | - Andrew F. Magee
- Department of Biostatistics, University of California, Los Angeles, CA90095
| | - Sebastian Höhna
- GeoBio-Center LMU, Ludwig-Maximilians-Universität München, Munich80333, Germany
- Department of Earth and Environmental Sciences, Palaeontology & Geobiology, Ludwig-Maximilians-Universität München, Munich80333, Germany
| |
Collapse
|
5
|
Lam A, Duchene S. The Impacts of Low Diversity Sequence Data on Phylodynamic Inference during an Emerging Epidemic. Viruses 2021; 13:79. [PMID: 33430050 DOI: 10.3390/v13010079] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2020] [Revised: 01/05/2021] [Accepted: 01/05/2021] [Indexed: 01/06/2023] Open
Abstract
Phylodynamic inference is a pivotal tool in understanding transmission dynamics of viral outbreaks. These analyses are strongly guided by the input of an epidemiological model as well as sequence data that must contain sufficient intersequence variability in order to be informative. These criteria, however, may not be met during the early stages of an outbreak. Here we investigate the impact of low diversity sequence data on phylodynamic inference using the birth–death and coalescent exponential models. Through our simulation study, estimating the molecular evolutionary rate required enough sequence diversity and is an essential first step for any phylodynamic inference. Following this, the birth–death model outperforms the coalescent exponential model in estimating epidemiological parameters, when faced with low diversity sequence data due to explicitly exploiting the sampling times. In contrast, the coalescent model requires additional samples and therefore variability in sequence data before accurate estimates can be obtained. These findings were also supported through our empirical data analyses of an Australian and a New Zealand cluster outbreaks of SARS-CoV-2. Overall, the birth–death model is more robust when applied to datasets with low sequence diversity given sampling is specified and this should be considered for future viral outbreak investigations.
Collapse
|
6
|
Leventhal GE, Günthard HF, Bonhoeffer S, Stadler T. Using an epidemiological model for phylogenetic inference reveals density dependence in HIV transmission. Mol Biol Evol 2013; 31:6-17. [PMID: 24085839 PMCID: PMC3879443 DOI: 10.1093/molbev/mst172] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
The control, prediction, and understanding of epidemiological processes require insight into how infectious pathogens transmit in a population. The chain of transmission can in principle be reconstructed with phylogenetic methods which analyze the evolutionary history using pathogen sequence data. The quality of the reconstruction, however, crucially depends on the underlying epidemiological model used in phylogenetic inference. Until now, only simple epidemiological models have been used, which make limiting assumptions such as constant rate parameters, infinite total population size, or deterministically changing population size of infected individuals. Here, we present a novel phylogenetic method to infer parameters based on a classical stochastic epidemiological model. Specifically, we use the susceptible-infected-susceptible model, which accounts for density-dependent transmission rates and finite total population size, leading to a stochastically changing infected population size. We first validate our method by estimating epidemic parameters for simulated data and then apply it to transmission clusters from the Swiss HIV epidemic. Our estimates of the basic reproductive number R0 for the considered Swiss HIV transmission clusters are significantly higher than previous estimates, which were derived assuming infinite population size. This difference in key parameter estimates highlights the importance of careful model choice when doing phylogenetic inference. In summary, this article presents the first fully stochastic implementation of a classical epidemiological model for phylogenetic inference and thereby addresses a key aspect in ongoing efforts to merge phylogenetics and epidemiology.
Collapse
|