1
|
Chowell G, Skums P. Investigating and forecasting infectious disease dynamics using epidemiological and molecular surveillance data. Phys Life Rev 2024; 51:294-327. [PMID: 39488136 DOI: 10.1016/j.plrev.2024.10.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2024] [Accepted: 10/23/2024] [Indexed: 11/04/2024]
Abstract
The integration of viral genomic data into public health surveillance has revolutionized our ability to track and forecast infectious disease dynamics. This review addresses two critical aspects of infectious disease forecasting and monitoring: the methodological workflow for epidemic forecasting and the transformative role of molecular surveillance. We first present a detailed approach for validating epidemic models, emphasizing an iterative workflow that utilizes ordinary differential equation (ODE)-based models to investigate and forecast disease dynamics. We recommend a more structured approach to model validation, systematically addressing key stages such as model calibration, assessment of structural and practical parameter identifiability, and effective uncertainty propagation in forecasts. Furthermore, we underscore the importance of incorporating multiple data streams by applying both simulated and real epidemiological data from the COVID-19 pandemic to produce more reliable forecasts with quantified uncertainty. Additionally, we emphasize the pivotal role of viral genomic data in tracking transmission dynamics and pathogen evolution. By leveraging advanced computational tools such as Bayesian phylogenetics and phylodynamics, researchers can more accurately estimate transmission clusters and reconstruct outbreak histories, thereby improving data-driven modeling and forecasting and informing targeted public health interventions. Finally, we discuss the transformative potential of integrating molecular epidemiology with mathematical modeling to complement and enhance epidemic forecasting and optimize public health strategies.
Collapse
Affiliation(s)
- Gerardo Chowell
- Department of Population Health Sciences, School of Public Health, Georgia State University, Atlanta, GA, USA; Department of Applied Mathematics, Kyung Hee University, Yongin 17104, Korea.
| | - Pavel Skums
- School of Computing, University of Connecticut, Storrs, CT, USA
| |
Collapse
|
2
|
May MR, Rannala B. Early detection of highly transmissible viral variants using phylogenomics. SCIENCE ADVANCES 2024; 10:eadk7623. [PMID: 39141727 PMCID: PMC11323880 DOI: 10.1126/sciadv.adk7623] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/08/2023] [Accepted: 07/09/2024] [Indexed: 08/16/2024]
Abstract
As demonstrated by the SARS-CoV-2 pandemic, the emergence of novel viral strains with increased transmission rates poses a serious threat to global health. Statistical models of genome sequence evolution may provide a critical tool for early detection of these strains. Using a novel stochastic model that links transmission rates to the entire viral genome sequence, we study the utility of phylogenetic methods that use a phylogenetic tree relating viral samples versus count-based methods that use case counts of variants over time exclusively to detect increased transmission rates and identify candidate causative mutations. We find that phylogenies in particular can detect novel transmission-enhancing variants very soon after their origin and may facilitate the development of early detection systems for outbreak surveillance.
Collapse
Affiliation(s)
- Michael R. May
- Department of Evolution and Ecology, University of California Davis, Davis, CA, USA
| | | |
Collapse
|
3
|
Xu P, Liang S, Hahn A, Zhao V, Lo WT‘J, Haller BC, Sobkowiak B, Chitwood MH, Colijn C, Cohen T, Rhee KY, Messer PW, Wells MT, Clark AG, Kim J. e3SIM: epidemiological-ecological-evolutionary simulation framework for genomic epidemiology. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.06.29.601123. [PMID: 39005464 PMCID: PMC11244936 DOI: 10.1101/2024.06.29.601123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/16/2024]
Abstract
Infectious disease dynamics are driven by the complex interplay of epidemiological, ecological, and evolutionary processes. Accurately modeling these interactions is crucial for understanding pathogen spread and informing public health strategies. However, existing simulators often fail to capture the dynamic interplay between these processes, resulting in oversimplified models that do not fully reflect real-world complexities in which the pathogen's genetic evolution dynamically influences disease transmission. We introduce the epidemiological-ecological-evolutionary simulator (e3SIM), an open-source framework that concurrently models the transmission dynamics and molecular evolution of pathogens within a host population while integrating environmental factors. Using an agent-based, discrete-generation, forward-in-time approach, e3SIM incorporates compartmental models, host-population contact networks, and quantitative-trait models for pathogens. This integration allows for realistic simulations of disease spread and pathogen evolution. Key features include a modular and scalable design, flexibility in modeling various epidemiological and population-genetic complexities, incorporation of time-varying environmental factors, and a user-friendly graphical interface. We demonstrate e3SIM's capabilities through simulations of realistic outbreak scenarios with SARS-CoV-2 and Mycobacterium tuberculosis, illustrating its flexibility for studying the genomic epidemiology of diverse pathogen types.
Collapse
Affiliation(s)
- Peiyu Xu
- Department of Molecular Biology & Genetics, Cornell University, Ithaca, NY, USA
| | - Shenni Liang
- Department of Computational Science, Cornell University, Ithaca, NY, USA
| | - Andrew Hahn
- Department of Computational Science, Cornell University, Ithaca, NY, USA
| | - Vivian Zhao
- Department of Computational Science, Cornell University, Ithaca, NY, USA
| | - Wai Tung ‘Jack’ Lo
- Department of Computational Biology, Cornell University, Ithaca, NY, USA
| | - Benjamin C. Haller
- Department of Computational Biology, Cornell University, Ithaca, NY, USA
| | - Benjamin Sobkowiak
- Department of Epidemiology of Microbial Disease, Yale School of Public Health, New Haven, CT, USA
| | - Melanie H. Chitwood
- Department of Epidemiology of Microbial Disease, Yale School of Public Health, New Haven, CT, USA
| | - Caroline Colijn
- Department of Mathematics, Simon Fraser University, Burnaby, BC, Canada
| | - Ted Cohen
- Department of Epidemiology of Microbial Disease, Yale School of Public Health, New Haven, CT, USA
| | - Kyu Y. Rhee
- Department of Medicine, Weill Cornell Medicine, New York, NY, USA
| | - Philipp W. Messer
- Department of Computational Biology, Cornell University, Ithaca, NY, USA
| | - Martin T. Wells
- Department of Statistics and Data Science, Cornell University, Ithaca, NY, USA
| | - Andrew G. Clark
- Department of Molecular Biology & Genetics, Cornell University, Ithaca, NY, USA
- Department of Computational Biology, Cornell University, Ithaca, NY, USA
| | - Jaehee Kim
- Department of Computational Biology, Cornell University, Ithaca, NY, USA
| |
Collapse
|
4
|
May MR, Rannala B. Phylogenies increase power to detect highly transmissible viral genome variants. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.07.28.23293332. [PMID: 37577556 PMCID: PMC10418580 DOI: 10.1101/2023.07.28.23293332] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/15/2023]
Abstract
As demonstrated by the SARS-CoV-2 pandemic, the emergence of novel viral strains with increased transmission rates poses a significant threat to global health. Viral genome sequences, combined with statistical models of sequence evolution, may provide a critical tool for early detection of these strains. Using a novel statistical model that links transmission rates to the entire viral genome sequence, we study the power of phylogenetic methods-using a phylogenetic tree relating viral samples-and count-based methods-using case-counts of variants over time-to detect increased transmission rates, and to identify causative mutations. We find that phylogenies in particular can detect novel variants very soon after their origin, and may facilitate the development of early detection systems for outbreak surveillance.
Collapse
Affiliation(s)
- Michael R May
- Department of Evolution and Ecology, University of California Davis, Davis, CA USA
| | - Bruce Rannala
- Department of Evolution and Ecology, University of California Davis, Davis, CA USA
| |
Collapse
|
5
|
Park Y, Martin MA, Koelle K. Epidemiological inference for emerging viruses using segregating sites. Nat Commun 2023; 14:3105. [PMID: 37248255 PMCID: PMC10226718 DOI: 10.1038/s41467-023-38809-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2022] [Accepted: 05/16/2023] [Indexed: 05/31/2023] Open
Abstract
Epidemiological models are commonly fit to case and pathogen sequence data to estimate parameters and to infer unobserved disease dynamics. Here, we present an inference approach based on sequence data that is well suited for model fitting early on during the expansion of a viral lineage. Our approach relies on a trajectory of segregating sites to infer epidemiological parameters within a Sequential Monte Carlo framework. Using simulated data, we first show that our approach accurately recovers key epidemiological quantities under a single-introduction scenario. We then apply our approach to SARS-CoV-2 sequence data from France, estimating a basic reproduction number of approximately 2.3-2.7 under an epidemiological model that allows for multiple introductions. Our approach presented here indicates that inference approaches that rely on simple population genetic summary statistics can be informative of epidemiological parameters and can be used for reconstructing infectious disease dynamics during the early expansion of a viral lineage.
Collapse
Affiliation(s)
- Yeongseon Park
- Graduate Program in Population Biology, Ecology, and Evolution, Emory University, Atlanta, GA, 30322, USA
| | - Michael A Martin
- Graduate Program in Population Biology, Ecology, and Evolution, Emory University, Atlanta, GA, 30322, USA
- Department of Pathology, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Katia Koelle
- Department of Biology, Emory University, Atlanta, GA, 30322, USA.
- Emory Center of Excellence for Influenza Research and Response (CEIRR), Atlanta, GA, USA.
| |
Collapse
|
6
|
Dragomir D, Allman ES, Rhodes JA. Parameter Identifiability of a Multitype Pure-Birth Model of Speciation. J Comput Biol 2023; 30:277-292. [PMID: 36745414 DOI: 10.1089/cmb.2022.0330] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
Diversification models describe the random growth of evolutionary trees, modeling the historical relationships of species through speciation and extinction events. One class of such models allows for independently changing traits, or types, of the species within the tree, upon which speciation and extinction rates depend. Although identifiability of parameters is necessary to justify parameter estimation with a model, it has not been formally established for these models, despite their adoption for inference. This work establishes generic identifiability up to label swapping for the parameters of one of the simpler forms of such a model, a multitype pure birth model of speciation, from an asymptotic distribution derived from a single tree observation as its depth goes to infinity. Crucially for applications to available data, no observation of types is needed at any internal points in the tree, nor even at the leaves.
Collapse
Affiliation(s)
- Dakota Dragomir
- Department of Mathematics and Statistics, University of Alaska Fairbanks, Fairbanks, Alaska, USA
| | - Elizabeth S Allman
- Department of Mathematics and Statistics, University of Alaska Fairbanks, Fairbanks, Alaska, USA
| | - John A Rhodes
- Department of Mathematics and Statistics, University of Alaska Fairbanks, Fairbanks, Alaska, USA
| |
Collapse
|
7
|
Stockdale JE, Liu P, Colijn C. The potential of genomics for infectious disease forecasting. Nat Microbiol 2022; 7:1736-1743. [PMID: 36266338 DOI: 10.1038/s41564-022-01233-6] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2022] [Accepted: 08/18/2022] [Indexed: 11/09/2022]
Abstract
Genomic technologies have led to tremendous gains in understanding how pathogens function, evolve and interact. Pathogen diversity is now measurable at high precision and resolution, in part because over the past decade, sequencing technologies have increased in speed and capacity, at decreased cost. Alongside this, the use of models that can forecast emergence and size of infectious disease outbreaks has risen, highlighted by the coronavirus disease 2019 pandemic but also due to modelling advances that allow for rapid estimates in emerging outbreaks to inform monitoring, coordination and resource deployment. However, genomics studies have remained largely retrospective. While they contain high-resolution views of pathogen diversification and evolution in the context of selection, they are often not aligned with designing interventions. This is a missed opportunity because pathogen diversification is at the core of the most pressing infectious public health challenges, and interventions need to take the mechanisms of virulence and understanding of pathogen diversification into account. In this Perspective, we assess these converging fields, discuss current challenges facing both surveillance specialists and modellers who want to harness genomic data, and propose next steps for integrating longitudinally sampled genomic data with statistical learning and interpretable modelling to make reliable predictions into the future.
Collapse
Affiliation(s)
- Jessica E Stockdale
- Department of Mathematics, Simon Fraser University, Burnaby, British Columbia, Canada
| | - Pengyu Liu
- Department of Mathematics, Simon Fraser University, Burnaby, British Columbia, Canada
| | - Caroline Colijn
- Department of Mathematics, Simon Fraser University, Burnaby, British Columbia, Canada.
| |
Collapse
|
8
|
Featherstone LA, Zhang JM, Vaughan TG, Duchene S. Epidemiological inference from pathogen genomes: A review of phylodynamic models and applications. Virus Evol 2022; 8:veac045. [PMID: 35775026 PMCID: PMC9241095 DOI: 10.1093/ve/veac045] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2021] [Revised: 05/23/2022] [Accepted: 06/02/2022] [Indexed: 11/24/2022] Open
Abstract
Phylodynamics requires an interdisciplinary understanding of phylogenetics, epidemiology, and statistical inference. It has also experienced more intense application than ever before amid the SARS-CoV-2 pandemic. In light of this, we present a review of phylodynamic models beginning with foundational models and assumptions. Our target audience is public health researchers, epidemiologists, and biologists seeking a working knowledge of the links between epidemiology, evolutionary models, and resulting epidemiological inference. We discuss the assumptions linking evolutionary models of pathogen population size to epidemiological models of the infected population size. We then describe statistical inference for phylodynamic models and list how output parameters can be rearranged for epidemiological interpretation. We go on to cover more sophisticated models and finish by highlighting future directions.
Collapse
Affiliation(s)
- Leo A Featherstone
- Peter Doherty Institute for Infection and Immunity, University of Melbourne, Melbourne, VIC 3000, Australia
| | - Joshua M Zhang
- Peter Doherty Institute for Infection and Immunity, University of Melbourne, Melbourne, VIC 3000, Australia
| | - Timothy G Vaughan
- Department of Biosystems Science and Engineering, ETH Zurich, Basel 4058, Switzerland
- Swiss Institute of Bioinformatics, Geneva 1015, Switzerland
| | - Sebastian Duchene
- Peter Doherty Institute for Infection and Immunity, University of Melbourne, Melbourne, VIC 3000, Australia
| |
Collapse
|
9
|
Kepler L, Hamins-Puertolas M, Rasmussen DA. Decomposing the sources of SARS-CoV-2 fitness variation in the United States. Virus Evol 2021; 7:veab073. [PMID: 34642604 PMCID: PMC8499931 DOI: 10.1093/ve/veab073] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2021] [Revised: 08/13/2021] [Accepted: 08/20/2021] [Indexed: 12/14/2022] Open
Abstract
The fitness of a pathogen is a composite phenotype determined by many different factors influencing growth rates both within and between hosts. Determining what factors shape fitness at the host population-level is especially challenging because both intrinsic factors like pathogen genetics and extrinsic factors such as host behavior influence between-host transmission potential. This challenge has been highlighted by controversy surrounding the population-level fitness effects of mutations in the SARS-CoV-2 genome and their relative importance when compared against non-genetic factors shaping transmission dynamics. Building upon phylodynamic birth-death models, we develop a new framework to learn how hundreds of genetic and non-genetic factors have shaped the fitness of SARS-CoV-2. We estimate the fitness effects of all amino acid variants and several structural variants that have circulated in the United States between February 2020 and March 2021 from viral phylogenies. We also estimate how much fitness variation among pathogen lineages is attributable to genetic versus non-genetic factors such as spatial heterogeneity in transmission rates. Before September 2020, most fitness variation between lineages can be explained by background spatial heterogeneity in transmission rates across geographic regions. Starting in late 2020, genetic variation in fitness increased dramatically with the emergence of several new lineages including B.1.1.7, B.1.427, B.1.429 and B.1.526. Our analysis also indicates that genetic variants in less well-explored genomic regions outside of Spike may be contributing significantly to overall fitness variation in the viral population.
Collapse
Affiliation(s)
- Lenora Kepler
- Bioinformatics Research Center, North Carolina State University, 1 Lampe Drive, Raleigh, NC 27607, USA
| | - Marco Hamins-Puertolas
- Biomathematics Graduate Program, North Carolina State University, Campus Box 8213, Raleigh, NC 27695, USA
| | - David A Rasmussen
- Bioinformatics Research Center, North Carolina State University, 1 Lampe Drive, Raleigh, NC 27607, USA
- Department of Entomology and Plant Pathology, North Carolina State University, Campus Box 7613, Raleigh, NC 27695, USA
| |
Collapse
|
10
|
MacPherson A, Louca S, McLaughlin A, Joy JB, Pennell MW. Unifying Phylogenetic Birth-Death Models in Epidemiology and Macroevolution. Syst Biol 2021; 71:172-189. [PMID: 34165577 PMCID: PMC8972974 DOI: 10.1093/sysbio/syab049] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2021] [Revised: 06/09/2021] [Accepted: 06/21/2021] [Indexed: 11/13/2022] Open
Abstract
Birth–death stochastic processes are the foundations of many phylogenetic models and are
widely used to make inferences about epidemiological and macroevolutionary dynamics. There
are a large number of birth–death model variants that have been developed; these impose
different assumptions about the temporal dynamics of the parameters and about the sampling
process. As each of these variants was individually derived, it has been difficult to
understand the relationships between them as well as their precise biological and
mathematical assumptions. Without a common mathematical foundation, deriving new models is
nontrivial. Here, we unify these models into a single framework, prove that many
previously developed epidemiological and macroevolutionary models are all special cases of
a more general model, and illustrate the connections between these variants. This
unification includes both models where the process is the same for all lineages and those
in which it varies across types. We also outline a straightforward procedure for deriving
likelihood functions for arbitrarily complex birth–death(-sampling) models that will
hopefully allow researchers to explore a wider array of scenarios than was previously
possible. By rederiving existing single-type birth–death sampling models, we clarify and
synthesize the range of explicit and implicit assumptions made by these models.
[Birth–death processes; epidemiology; macroevolution; phylogenetics; statistical
inference.]
Collapse
Affiliation(s)
- Ailene MacPherson
- Department of Zoology and Biodiversity Research Centre, University of British Columbia, Vancouver, Canada.,Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, Canada
| | - Stilianos Louca
- Department of Biology, University of Oregon, USA.,Institute of Ecology and Evolution, University of Oregon, USA
| | - Angela McLaughlin
- British Columbia Centre for Excellence in HIV/AIDS, Vancouver, Canada.,Bioinformatics, University of British Columbia, Vancouver, Canada
| | - Jeffrey B Joy
- British Columbia Centre for Excellence in HIV/AIDS, Vancouver, Canada.,Bioinformatics, University of British Columbia, Vancouver, Canada.,Department of Medicine, University of British Columbia, Vancouver, Canada
| | - Matthew W Pennell
- Department of Zoology and Biodiversity Research Centre, University of British Columbia, Vancouver, Canada
| |
Collapse
|
11
|
Fairweather SJ, Gupta V, Chitsaz M, Booth L, Brown MH, O’Mara ML. Coordination of Substrate Binding and Protonation in the N. gonorrhoeae MtrD Efflux Pump Controls the Functionally Rotating Transport Mechanism. ACS Infect Dis 2021; 7:1833-1847. [PMID: 33980014 DOI: 10.1021/acsinfecdis.1c00149] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Multidrug resistance is a serious problem that threatens the effective treatment of the widespread sexually transmitted disease gonorrhea, caused by the Gram-negative bacterium Neisseria gonorrhoeae. The drug efflux pump primarily implicated in N. gonorrhoeae antimicrobial resistance is the inner membrane transporter MtrD, which forms part of the tripartite multiple transferable resistance (Mtr) CDE efflux system. A structure of MtrD was first solved in 2014 as a symmetrical homotrimer, and then, recently, as an asymmetrical homotrimer. Through a series of molecular dynamics simulations and mutagenesis experiments, we identify the combination of substrate binding and protonation states of the proton relay network that drives the transition from the symmetric to the asymmetric conformation of MtrD. We characterize the allosteric coupling between the functionally important local regions that control conformational changes between the access, binding, and extrusion states and allow for transition to the asymmetric MtrD conformation. We also highlight a significant rotation of the transmembrane helices caused by protonation of the proton relay network, which widens the intermonomeric gap that is a hallmark of the rotational transporter mechanism. This is the first analysis and description of the transport mechanism for the N. gonorrhoeae MtrD transporter and provides evidence that antimicrobial efflux in MtrD follows the functionally rotating transport mechanism seen in protein homologues from the same transport protein superfamily.
Collapse
Affiliation(s)
- Stephen J. Fairweather
- Research School of Chemistry, The Australian National University, Canberra, ACT 0200, Australia
- Research School of Biology, The Australian National University, Canberra, ACT 0200, Australia
| | - Vrinda Gupta
- Research School of Chemistry, The Australian National University, Canberra, ACT 0200, Australia
| | - Mohsen Chitsaz
- College of Science and Engineering, Flinders University, Bedford Park, SA 5042, Australia
| | - Lauren Booth
- Research School of Chemistry, The Australian National University, Canberra, ACT 0200, Australia
| | - Melissa H. Brown
- College of Science and Engineering, Flinders University, Bedford Park, SA 5042, Australia
| | - Megan L. O’Mara
- Research School of Chemistry, The Australian National University, Canberra, ACT 0200, Australia
| |
Collapse
|
12
|
Jara M, Crespo R, Roberts DL, Chapman A, Banda A, Machado G. Development of a Dissemination Platform for Spatiotemporal and Phylogenetic Analysis of Avian Infectious Bronchitis Virus. Front Vet Sci 2021; 8:624233. [PMID: 34017870 PMCID: PMC8129014 DOI: 10.3389/fvets.2021.624233] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2020] [Accepted: 02/27/2021] [Indexed: 11/13/2022] Open
Abstract
Infecting large portions of the global poultry populations, the avian infectious bronchitis virus (IBV) remains a major economic burden in North America. With more than 30 serotypes globally distributed, Arkansas, Connecticut, Delaware, Georgia, and Massachusetts are among the most predominant serotypes in the United States. Even though vaccination is widely used, the high mutation rate exhibited by IBV is continuously triggering the emergence of new viral strains and hindering control and prevention measures. For that reason, targeted strategies based on constantly updated information on the IBV circulation are necessary. Here, we sampled IBV-infected farms from one US state and collected and analyzed 65 genetic sequences coming from three different lineages along with the immunization information of each sampled farm. Phylodynamic analyses showed that IBV dispersal velocity was 12.3 km/year. The majority of IBV infections appeared to have derived from the introduction of the Arkansas DPI serotype, and the Arkansas DPI and Georgia 13 were the predominant serotypes. When analyzed against IBV sequences collected across the United States and deposited in the GenBank database, the most likely viral origin of our sequences was from the states of Alabama, Georgia, and Delaware. Information about vaccination showed that the MILDVAC-MASS+ARK vaccine was applied on 26% of the farms. Using a publicly accessible open-source tool for real-time interactive tracking of pathogen spread and evolution, we analyzed the spatiotemporal spread of IBV and developed an online reporting dashboard. Overall, our work demonstrates how the combination of genetic and spatial information could be used to track the spread and evolution of poultry diseases, providing timely information to the industry. Our results could allow producers and veterinarians to monitor in near-real time the current IBV strain circulating, making it more informative, for example, in vaccination-related decisions.
Collapse
Affiliation(s)
- Manuel Jara
- Department of Population Health and Pathobiology, College of Veterinary Medicine, North Carolina State University, Raleigh, NC, United States
| | - Rocio Crespo
- Department of Population Health and Pathobiology, College of Veterinary Medicine, North Carolina State University, Raleigh, NC, United States
| | - David L Roberts
- Department of Computer Science North Carolina State University, Raleigh, NC, United States
| | - Ashlyn Chapman
- Department of Computer Science North Carolina State University, Raleigh, NC, United States
| | - Alejandro Banda
- Poultry Research and Diagnostic Laboratory, College of Veterinary Medicine, Mississippi State University, Pearl, MS, United States
| | - Gustavo Machado
- Department of Population Health and Pathobiology, College of Veterinary Medicine, North Carolina State University, Raleigh, NC, United States
| |
Collapse
|
13
|
Identifying Potentially Beneficial Genetic Mutations Associated with Monophyletic Selective Sweep and a Proof-of-Concept Study with Viral Genetic Data. mSystems 2021; 6:6/1/e01151-20. [PMID: 33622855 PMCID: PMC8573955 DOI: 10.1128/msystems.01151-20] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Genetic mutations play a central role in evolution. For a significantly beneficial mutation, a one-time mutation event suffices for the species to prosper and predominate through the process called “monophyletic selective sweep.” However, existing methods that rely on counting the number of mutation events to detect selection are unable to find such a mutation in selective sweep. We here introduce a method to detect mutations at the single amino acid/nucleotide level that could be responsible for monophyletic selective sweep evolution. The method identifies a genetic signature associated with selective sweep using the population genetic test statistic Tajima’s D. We applied the algorithm to ebolavirus, influenza A virus, and severe acute respiratory syndrome coronavirus 2 to identify known biologically significant mutations and unrecognized mutations associated with potential selective sweep. The method can detect beneficial mutations, possibly leading to discovery of previously unknown biological functions and mechanisms related to those mutations. IMPORTANCE In biology, research on evolution is important to understand the significance of genetic mutation. When there is a significantly beneficial mutation, a population of species with the mutation prospers and predominates, in a process called “selective sweep.” However, there are few methods that can find such a mutation causing selective sweep from genetic data. We here introduce a novel method to detect such mutations. Applying the method to the genomes of ebolavirus, influenza viruses, and the novel coronavirus, we detected known biologically significant mutations and identified mutations the importance of which is previously unrecognized. The method can deepen our understanding of molecular and evolutionary biology.
Collapse
|
14
|
Lepers C, Billiard S, Porte M, Méléard S, Tran VC. Inference with selection, varying population size, and evolving population structure: application of ABC to a forward-backward coalescent process with interactions. Heredity (Edinb) 2021; 126:335-350. [PMID: 33128035 PMCID: PMC8027416 DOI: 10.1038/s41437-020-00381-x] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2019] [Accepted: 10/15/2020] [Indexed: 11/08/2022] Open
Abstract
Genetic data are often used to infer demographic history and changes or detect genes under selection. Inferential methods are commonly based on models making various strong assumptions: demography and population structures are supposed a priori known, the evolution of the genetic composition of a population does not affect demography nor population structure, and there is no selection nor interaction between and within genetic strains. In this paper, we present a stochastic birth-death model with competitive interactions and asexual reproduction. We develop an inferential procedure for ecological, demographic, and genetic parameters. We first show how genetic diversity and genealogies are related to birth and death rates, and to how individuals compete within and between strains. This leads us to propose an original model of phylogenies, with trait structure and interactions, that allows multiple merging. Second, we develop an Approximate Bayesian Computation framework to use our model for analyzing genetic data. We apply our procedure to simulated data from a toy model, and to real data by analyzing the genetic diversity of microsatellites on Y-chromosomes sampled from Central Asia human populations in order to test whether different social organizations show significantly different fertilities.
Collapse
Affiliation(s)
| | - Sylvain Billiard
- Univ. Lille, CNRS, UMR 819 8 -Evo-Eco-Paleo, F-59000, Lille, France.
| | - Matthieu Porte
- IGN, Institut National de l'Information Géographique et Forestière, F-94165, Saint-Mandé, France.
| | - Sylvie Méléard
- CMAP, CNRS, Ecole Polytechnique, Institut polytechnique de Paris, route de Saclay, 91128, Palaiseau Cedex, France.
| | - Viet Chi Tran
- LAMA, Univ Gustave Eiffel, Univ Paris Est Creteil, CNRS, F-77454, Marne-la-Vallée, France.
| |
Collapse
|
15
|
van Dorp L, Richard D, Tan CCS, Shaw LP, Acman M, Balloux F. No evidence for increased transmissibility from recurrent mutations in SARS-CoV-2. Nat Commun 2020; 11:5986. [PMID: 33239633 PMCID: PMC7688939 DOI: 10.1038/s41467-020-19818-2] [Citation(s) in RCA: 198] [Impact Index Per Article: 39.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2020] [Accepted: 10/30/2020] [Indexed: 02/07/2023] Open
Abstract
COVID-19 is caused by the coronavirus SARS-CoV-2, which jumped into the human population in late 2019 from a currently uncharacterised animal reservoir. Due to this recent association with humans, SARS-CoV-2 may not yet be fully adapted to its human host. This has led to speculations that SARS-CoV-2 may be evolving towards higher transmissibility. The most plausible mutations under putative natural selection are those which have emerged repeatedly and independently (homoplasies). Here, we formally test whether any homoplasies observed in SARS-CoV-2 to date are significantly associated with increased viral transmission. To do so, we develop a phylogenetic index to quantify the relative number of descendants in sister clades with and without a specific allele. We apply this index to a curated set of recurrent mutations identified within a dataset of 46,723 SARS-CoV-2 genomes isolated from patients worldwide. We do not identify a single recurrent mutation in this set convincingly associated with increased viral transmission. Instead, recurrent mutations currently in circulation appear to be evolutionary neutral and primarily induced by the human immune system via RNA editing, rather than being signatures of adaptation. At this stage we find no evidence for significantly more transmissible lineages of SARS-CoV-2 due to recurrent mutations.
Collapse
Affiliation(s)
- Lucy van Dorp
- UCL Genetics Institute, University College London, London, WC1E 6BT, UK.
| | - Damien Richard
- Cirad, UMR PVBMT, F-97410 St Pierre, Réunion, France
- Université de la Réunion, UMR PVBMT, F-97490 St Denis, Réunion, France
| | - Cedric C S Tan
- UCL Genetics Institute, University College London, London, WC1E 6BT, UK
| | - Liam P Shaw
- Nuffield Department of Medicine, John Radcliffe Hospital, University of Oxford, Oxford, OX3 9DU, UK
| | - Mislav Acman
- UCL Genetics Institute, University College London, London, WC1E 6BT, UK
| | - François Balloux
- UCL Genetics Institute, University College London, London, WC1E 6BT, UK.
| |
Collapse
|
16
|
Abstract
There is no doubt that the novel coronavirus SARS-CoV-2 that causes COVID-19 is mutating and thus has the potential to adapt during the current pandemic. Whether this evolution will lead to changes in the transmission, the duration, or the severity of the disease is not clear. This has led to considerable scientific and media debate, from raising alarms about evolutionary change to dismissing it. Here we review what little is currently known about the evolution of SARS-CoV-2 and extend existing evolutionary theory to consider how selection might be acting upon the virus during the COVID-19 pandemic. Although there is currently no definitive evidence that SARS-CoV-2 is undergoing further adaptation, continued evidence-based analysis of evolutionary change is important so that public health measures can be adjusted in response to substantive changes in the infectivity or severity of COVID-19.
Collapse
|