1
|
Shimagaki KS, Barton JP. Efficient epistasis inference via higher-order covariance matrix factorization. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.10.14.618287. [PMID: 39464126 PMCID: PMC11507688 DOI: 10.1101/2024.10.14.618287] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/29/2024]
Abstract
Epistasis can profoundly influence evolutionary dynamics. Temporal genetic data, consisting of sequences sampled repeatedly from a population over time, provides a unique resource to understand how epistasis shapes evolution. However, detecting epistatic interactions from sequence data is technically challenging. Existing methods for identifying epistasis are computationally demanding, limiting their applicability to real-world data. Here, we present a novel computational method for inferring epistasis that significantly reduces computational costs without sacrificing accuracy. We validated our approach in simulations and applied it to study HIV-1 evolution over multiple years in a data set of 16 individuals. There we observed a strong excess of negative epistatic interactions between beneficial mutations, especially mutations involved in immune escape. Our method is general and could be used to characterize epistasis in other large data sets.
Collapse
|
2
|
Arruda B, Baker ALV, Buckley A, Anderson TK, Torchetti M, Bergeson NH, Killian ML, Lantz K. Divergent Pathogenesis and Transmission of Highly Pathogenic Avian Influenza A(H5N1) in Swine. Emerg Infect Dis 2024; 30:738-751. [PMID: 38478379 PMCID: PMC10977838 DOI: 10.3201/eid3004.231141] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/23/2024] Open
Abstract
Highly pathogenic avian influenza (HPAI) viruses have potential to cross species barriers and cause pandemics. Since 2022, HPAI A(H5N1) belonging to the goose/Guangdong 2.3.4.4b hemagglutinin phylogenetic clade have infected poultry, wild birds, and mammals across North America. Continued circulation in birds and infection of multiple mammalian species with strains possessing adaptation mutations increase the risk for infection and subsequent reassortment with influenza A viruses endemic in swine. We assessed the susceptibility of swine to avian and mammalian HPAI H5N1 clade 2.3.4.4b strains using a pathogenesis and transmission model. All strains replicated in the lung of pigs and caused lesions consistent with influenza A infection. However, viral replication in the nasal cavity and transmission was only observed with mammalian isolates. Mammalian adaptation and reassortment may increase the risk for incursion and transmission of HPAI viruses in feral, backyard, or commercial swine.
Collapse
|
3
|
Illingworth CJR, Guerra-Assuncao JA, Gregg S, Charles O, Pang J, Roy S, Abdelnabi R, Neyts J, Breuer J. Genetic consequences of effective and suboptimal dosing with mutagenic drugs in a hamster model of SARS-CoV-2 infection. Virus Evol 2024; 10:veae001. [PMID: 38486802 PMCID: PMC10939363 DOI: 10.1093/ve/veae001] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2023] [Revised: 11/23/2023] [Accepted: 01/03/2024] [Indexed: 03/17/2024] Open
Abstract
Mutagenic antiviral drugs have shown promise against multiple viruses, but concerns have been raised about whether their use might promote the emergence of new and harmful viral variants. Recently, genetic signatures associated with molnupiravir use have been identified in the global SARS-COV-2 population. Here, we examine the consequences of using favipiravir and molnupiravir to treat SARS-CoV-2 infection in a hamster model, comparing viral genome sequence data collected from (1) untreated hamsters, and (2) from hamsters receiving effective and suboptimal doses of treatment. We identify a broadly linear relationship between drug dose and the extent of variation in treated viral populations, with a high proportion of this variation being composed of variants at frequencies of less than 1 per cent, below typical thresholds for variant calling. Treatment with an effective dose of antiviral drug was associated with a gain of between 7 and 10 variants per viral genome relative to drug-free controls: even after a short period of treatment a population founded by a transmitted virus could contain multiple sequence differences to that of the original host. Treatment with a suboptimal dose of drug showed intermediate gains of variants. No dose-dependent signal was identified in the numbers of single-nucleotide variants reaching frequencies in excess of 5 per cent. We did not find evidence to support the emergence of drug resistance or of novel immune phenotypes. Our study suggests that where onward transmission occurs, a short period of treatment with mutagenic drugs may be sufficient to generate a significant increase in the number of viral variants transmitted.
Collapse
Affiliation(s)
| | - Jose A Guerra-Assuncao
- Great Ormond Street Hospital for Children NHS Foundation Trust, Great Ormond Street, London WC1N 3JH, UK
- Infection, Immunity and Inflammation Research and Teaching Department, University College London, Gower Street, London WC1E 6BT, UK
| | - Samuel Gregg
- Infection, Immunity and Inflammation Research and Teaching Department, University College London, Gower Street, London WC1E 6BT, UK
| | - Oscar Charles
- Great Ormond Street Hospital for Children NHS Foundation Trust, Great Ormond Street, London WC1N 3JH, UK
- Infection, Immunity and Inflammation Research and Teaching Department, University College London, Gower Street, London WC1E 6BT, UK
| | - Juanita Pang
- Infection, Immunity and Inflammation Research and Teaching Department, University College London, Gower Street, London WC1E 6BT, UK
| | - Sunando Roy
- Infection, Immunity and Inflammation Research and Teaching Department, University College London, Gower Street, London WC1E 6BT, UK
| | - Rana Abdelnabi
- KU Leuven Department of Microbiology, Immunology and Transplantation, Rega Institute for Medical Research, Laboratory of Virology and Chemotherapy, Herestraat 49, Leuven B-3000, Belgium
- The VirusBank Platform, Gaston Geenslaan, Leuven B-3000, Belgium
| | - Johan Neyts
- KU Leuven Department of Microbiology, Immunology and Transplantation, Rega Institute for Medical Research, Laboratory of Virology and Chemotherapy, Herestraat 49, Leuven B-3000, Belgium
- The VirusBank Platform, Gaston Geenslaan, Leuven B-3000, Belgium
| | - Judith Breuer
- Great Ormond Street Hospital for Children NHS Foundation Trust, Great Ormond Street, London WC1N 3JH, UK
- Infection, Immunity and Inflammation Research and Teaching Department, University College London, Gower Street, London WC1E 6BT, UK
| |
Collapse
|
4
|
Gilbertson B, Duncan M, Subbarao K. Role of the viral polymerase during adaptation of influenza A viruses to new hosts. Curr Opin Virol 2023; 62:101363. [PMID: 37672875 DOI: 10.1016/j.coviro.2023.101363] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2023] [Revised: 08/15/2023] [Accepted: 08/15/2023] [Indexed: 09/08/2023]
Abstract
As a group, influenza-A viruses (IAV) infect a wide range of animal hosts, however, they are constrained to infecting selected host species by species-specific interactions between the host and virus, that are required for efficient replication of the viral RNA genome. When IAV cross the species barrier, they acquire mutations in the viral genome to enable interactions with the new host factors, or to compensate for their loss. The viral polymerase genes polymerase basic 1, polymerase basic 2, and polymerase-acidic are important sites of host adaptation. In this review, we discuss why the viral polymerase is so vital to the process of host adaptation, look at some of the known viral mutations, and host factors involved in adaptation, particularly of avian IAV to mammalian hosts.
Collapse
Affiliation(s)
- Brad Gilbertson
- Department of Microbiology and Immunology, University of Melbourne, at The Peter Doherty Institute for Infection and Immunity, Melbourne, Victoria, Australia
| | - Melanie Duncan
- WHO Collaborating Centre for Reference and Research on Influenza, Victorian Infectious Diseases Reference Laboratory at The Peter Doherty Institute for Infection and Immunity, Melbourne, Victoria, Australia
| | - Kanta Subbarao
- Department of Microbiology and Immunology, University of Melbourne, at The Peter Doherty Institute for Infection and Immunity, Melbourne, Victoria, Australia; WHO Collaborating Centre for Reference and Research on Influenza, Victorian Infectious Diseases Reference Laboratory at The Peter Doherty Institute for Infection and Immunity, Melbourne, Victoria, Australia.
| |
Collapse
|
5
|
Howe-Kerr LI, Grupstra CGB, Rabbitt KM, Conetta D, Coy SR, Klinges JG, Maher RL, McConnell KM, Meiling SS, Messyasz A, Schmeltzer ER, Seabrook S, Sims JA, Veglia AJ, Thurber AR, Thurber RLV, Correa AMS. Viruses of a key coral symbiont exhibit temperature-driven productivity across a reefscape. ISME COMMUNICATIONS 2023; 3:27. [PMID: 37009785 PMCID: PMC10068613 DOI: 10.1038/s43705-023-00227-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/26/2022] [Revised: 02/17/2023] [Accepted: 03/01/2023] [Indexed: 05/31/2023]
Abstract
Viruses can affect coral health by infecting their symbiotic dinoflagellate partners (Symbiodiniaceae). Yet, viral dynamics in coral colonies exposed to environmental stress have not been studied at the reef scale, particularly within individual viral lineages. We sequenced the viral major capsid protein (mcp) gene of positive-sense single-stranded RNA viruses known to infect symbiotic dinoflagellates ('dinoRNAVs') to analyze their dynamics in the reef-building coral, Porites lobata. We repeatedly sampled 54 colonies harboring Cladocopium C15 dinoflagellates, across three environmentally distinct reef zones (fringing reef, back reef, and forereef) around the island of Moorea, French Polynesia over a 3-year period and spanning a reef-wide thermal stress event. By the end of the sampling period, 28% (5/18) of corals in the fringing reef experienced partial mortality versus 78% (14/18) of corals in the forereef. Over 90% (50/54) of colonies had detectable dinoRNAV infections. Reef zone influenced the composition and richness of viral mcp amino acid types ('aminotypes'), with the fringing reef containing the highest aminotype richness. The reef-wide thermal stress event significantly increased aminotype dispersion, and this pattern was strongest in the colonies that experienced partial mortality. These findings demonstrate that dinoRNAV infections respond to environmental fluctuations experienced in situ on reefs. Further, viral productivity will likely increase as ocean temperatures continue to rise, potentially impacting the foundational symbiosis underpinning coral reef ecosystems.
Collapse
Affiliation(s)
| | - Carsten G B Grupstra
- Department of BioSciences, Rice University, Houston, TX, USA
- Department of Biology, Boston University, Boston, MA, USA
| | - Kristen M Rabbitt
- Department of BioSciences, Rice University, Houston, TX, USA
- Department of Marine and Environmental Sciences, Northeastern University, Boston, MA, USA
| | - Dennis Conetta
- Department of BioSciences, Rice University, Houston, TX, USA
| | - Samantha R Coy
- Department of BioSciences, Rice University, Houston, TX, USA
- Department of Oceanography, Texas A & M University, College Station, TX, USA
| | - J Grace Klinges
- Mote Marine Laboratory, Elizabeth Moore International Center for Coral Reef Research & Restoration, Summerland Key, FL, USA
| | - Rebecca L Maher
- Institute of Ecology and Evolution, University of Oregon, Eugene, OR, USA
| | | | - Sonora S Meiling
- University of the Virgin Islands, St. Thomas, US Virgin Islands, USA
| | - Adriana Messyasz
- Rutgers School of Environmental and Biological Sciences, New Brunswick, NJ, USA
| | | | - Sarah Seabrook
- Oregon State University, Corvallis, OR, USA
- National Institute of Water and Atmospheric Research, Wellington, New Zealand
| | - Jordan A Sims
- Department of BioSciences, Rice University, Houston, TX, USA
- Environmental Science and Policy, George Mason University, Fairfax, VA, USA
| | - Alex J Veglia
- Department of BioSciences, Rice University, Houston, TX, USA
| | | | | | | |
Collapse
|
6
|
Sohail MS, Louie RHY, Hong Z, Barton JP, McKay MR. Inferring Epistasis from Genetic Time-series Data. Mol Biol Evol 2022; 39:6710201. [PMID: 36130322 PMCID: PMC9558069 DOI: 10.1093/molbev/msac199] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023] Open
Abstract
Epistasis refers to fitness or functional effects of mutations that depend on the sequence background in which these mutations arise. Epistasis is prevalent in nature, including populations of viruses, bacteria, and cancers, and can contribute to the evolution of drug resistance and immune escape. However, it is difficult to directly estimate epistatic effects from sampled observations of a population. At present, there are very few methods that can disentangle the effects of selection (including epistasis), mutation, recombination, genetic drift, and genetic linkage in evolving populations. Here we develop a method to infer epistasis, along with the fitness effects of individual mutations, from observed evolutionary histories. Simulations show that we can accurately infer pairwise epistatic interactions provided that there is sufficient genetic diversity in the data. Our method also allows us to identify which fitness parameters can be reliably inferred from a particular data set and which ones are unidentifiable. Our approach therefore allows for the inference of more complex models of selection from time-series genetic data, while also quantifying uncertainty in the inferred parameters.
Collapse
Affiliation(s)
- Muhammad Saqib Sohail
- Department of Electronic and Computer Engineering, Hong Kong University of Science and Technology, Hong Kong SAR, People’s Republic of China
| | - Raymond H Y Louie
- The Kirby Institute, University of New South Wales, Sydney, New South Wales, Australia
| | - Zhenchen Hong
- Department of Physics and Astronomy, University of California, Riverside, CA, USA
| | | | | |
Collapse
|
7
|
Autio A, Kettunen J, Nevalainen T, Kimura B, Hurme M. Herpesviruses and their genetic diversity in the blood virome of healthy individuals: effect of aging. Immun Ageing 2022; 19:15. [PMID: 35279192 PMCID: PMC8917371 DOI: 10.1186/s12979-022-00268-x] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2021] [Accepted: 02/11/2022] [Indexed: 06/14/2023]
Abstract
BACKGROUND As we age, the functioning of the human immune system declines. The results of this are increases in morbidity and mortality associated with infectious diseases, cancer, cardiovascular disease, and neurodegenerative disease in elderly individuals, as well as a weakened vaccination response. The aging of the immune system is thought to affect and be affected by the human virome, the collection of all viruses present in an individual. Persistent viral infections, such as those caused by certain herpesviruses, can be present in an individual for long periods of time without any overt pathology, yet are associated with disease in states of compromised immune function. To better understand the effects on human health of such persistent viral infections, we must first understand how the human virome changes with age. We have now analyzed the composition of the whole blood virome of 317 individuals, 21-70 years old, using a metatranscriptomic approach. Use of RNA sequencing data allows for the unbiased detection of RNA viruses and active DNA viruses. RESULTS The data obtained showed that Epstein-Barr virus (EBV) was the most frequently expressed virus, with other detected viruses being herpes simplex virus 1, human cytomegalovirus, torque teno viruses, and papillomaviruses. Of the 317 studied blood samples, 68 (21%) had EBV expression, whereas the other detected viruses were only detected in at most 6 samples (2%). We therefore focused on EBV in our further analyses. Frequency of EBV detection, relative EBV RNA abundance and the genetic diversity of EBV was not significantly different between age groups (21-59 and 60-70 years old). No significant correlation was seen between EBV RNA abundance and age. Deconvolution analysis revealed a significant difference in proportions of activated dendritic cells, macrophages M1, and activated mast cells between EBV expression positive and negative individuals. CONCLUSIONS As it is likely that the EBV RNA quantified in this work is derived from reactivation of the latent EBV virus, these data suggest that age does not affect the rate of reactivation nor the genetic landscape of EBV. These findings offer new insight on the genetic diversity of a persistent EBV infection in the long-term.
Collapse
Affiliation(s)
- Arttu Autio
- Faculty of Medicine and Health Technology, Tampere University, Arvo Ylpön katu 34, 33520 Tampere, Finland
- Gerontology Research Center (GEREC), Tampere, Finland
| | - Jalmari Kettunen
- Faculty of Medicine and Health Technology, Tampere University, Arvo Ylpön katu 34, 33520 Tampere, Finland
| | - Tapio Nevalainen
- Faculty of Medicine and Health Technology, Tampere University, Arvo Ylpön katu 34, 33520 Tampere, Finland
- Gerontology Research Center (GEREC), Tampere, Finland
- Science Centre, Pirkanmaa Hospital District, Tampere, Finland
| | - Bryn Kimura
- Faculty of Medicine and Health Technology, Tampere University, Arvo Ylpön katu 34, 33520 Tampere, Finland
| | - Mikko Hurme
- Faculty of Medicine and Health Technology, Tampere University, Arvo Ylpön katu 34, 33520 Tampere, Finland
- Gerontology Research Center (GEREC), Tampere, Finland
| |
Collapse
|
8
|
Reconstruction of evolving gene variants and fitness from short sequencing reads. Nat Chem Biol 2021; 17:1188-1198. [PMID: 34635842 PMCID: PMC8551035 DOI: 10.1038/s41589-021-00876-6] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2020] [Accepted: 08/09/2021] [Indexed: 12/23/2022]
Abstract
Directed evolution can generate proteins with tailor-made activities. However, full-length genotypes, their frequencies and fitnesses are difficult to measure for evolving gene-length biomolecules using most high-throughput DNA sequencing methods, as short read lengths can lose mutation linkages in haplotypes. Here we present Evoracle, a machine learning method that accurately reconstructs full-length genotypes (R2 = 0.94) and fitness using short-read data from directed evolution experiments, with substantial improvements over related methods. We validate Evoracle on phage-assisted continuous evolution (PACE) and phage-assisted non-continuous evolution (PANCE) of adenine base editors and OrthoRep evolution of drug-resistant enzymes. Evoracle retains strong performance (R2 = 0.86) on data with complete linkage loss between neighboring nucleotides and large measurement noise, such as pooled Sanger sequencing data (~US$10 per timepoint), and broadens the accessibility of training machine learning models on gene variant fitnesses. Evoracle can also identify high-fitness variants, including low-frequency 'rising stars', well before they are identifiable from consensus mutations.
Collapse
|
9
|
Zhu H, Allman BE, Koelle K. Fitness Estimation for Viral Variants in the Context of Cellular Coinfection. Viruses 2021; 13:v13071216. [PMID: 34201862 PMCID: PMC8310006 DOI: 10.3390/v13071216] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2021] [Revised: 06/16/2021] [Accepted: 06/18/2021] [Indexed: 11/16/2022] Open
Abstract
Animal models are frequently used to characterize the within-host dynamics of emerging zoonotic viruses. More recent studies have also deep-sequenced longitudinal viral samples originating from experimental challenges to gain a better understanding of how these viruses may evolve in vivo and between transmission events. These studies have often identified nucleotide variants that can replicate more efficiently within hosts and also transmit more effectively between hosts. Quantifying the degree to which a mutation impacts viral fitness within a host can improve identification of variants that are of particular epidemiological concern and our ability to anticipate viral adaptation at the population level. While methods have been developed to quantify the fitness effects of mutations using observed changes in allele frequencies over the course of a host’s infection, none of the existing methods account for the possibility of cellular coinfection. Here, we develop mathematical models to project variant allele frequency changes in the context of cellular coinfection and, further, integrate these models with statistical inference approaches to demonstrate how variant fitness can be estimated alongside cellular multiplicity of infection. We apply our approaches to empirical longitudinally sampled H5N1 sequence data from ferrets. Our results indicate that previous studies may have significantly underestimated the within-host fitness advantage of viral variants. These findings underscore the importance of considering the process of cellular coinfection when studying within-host viral evolutionary dynamics.
Collapse
Affiliation(s)
- Huisheng Zhu
- Department of Biology, Emory University, Atlanta, GA 30322, USA;
| | - Brent E. Allman
- Graduate Program in Population Biology, Ecology, and Evolution, Emory University, Atlanta, GA 30322, USA;
| | - Katia Koelle
- Department of Biology, Emory University, Atlanta, GA 30322, USA;
- Emory-UGA Center of Excellence for Influenza Research and Surveillance (CEIRS), Atlanta, GA 30322, USA
- Correspondence:
| |
Collapse
|
10
|
Yan AWC, Zhou J, Beauchemin CAA, Russell CA, Barclay WS, Riley S. Quantifying mechanistic traits of influenza viral dynamics using in vitro data. Epidemics 2020; 33:100406. [PMID: 33096342 DOI: 10.1016/j.epidem.2020.100406] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2019] [Revised: 07/10/2020] [Accepted: 09/04/2020] [Indexed: 11/28/2022] Open
Abstract
When analysing in vitro data, growth kinetics of influenza virus strains are often compared by computing their growth rates, which are sometimes used as proxies for fitness. However, analogous to mathematical models for epidemics, the growth rate can be defined as a function of mechanistic traits: the basic reproduction number (the average number of cells each infected cell infects) and the mean generation time (the average length of a replication cycle). Fitting a model to previously published and newly generated data from experiments in human lung cells, we compared estimates of growth rate, reproduction number and generation time for six influenza A strains. Of four strains in previously published data, A/Canada/RV733/2003 (seasonal H1N1) had the lowest basic reproduction number, followed by A/Mexico/INDRE4487/2009 (pandemic H1N1), then A/Indonesia/05/2005 (spill-over H5N1) and A/Anhui/1/2013 (spill-over H7N9). This ordering of strains was preserved for both generation time and growth rate, suggesting a positive biological correlation between these quantities which have not been previously observed. We further investigated these potential correlations using data from reassortant viruses with different internal proteins (from A/England/195/2009 (pandemic H1N1) and A/Turkey/05/2005 (H5N1)), and the same surface proteins (from A/Puerto Rico/8/34 (lab-adapted H1N1)). Similar correlations between traits were observed for these viruses, confirming our initial findings and suggesting that these patterns were related to the degree of human adaptation of internal genes. Also, the model predicted that strains with a smaller basic reproduction number, shorter generation time and slower growth rate underwent more replication cycles by the time of peak viral load, potentially accumulating mutations more quickly. These results illustrate the utility of mathematical models in inferring traits driving observed differences in in vitro growth of influenza strains.
Collapse
Affiliation(s)
- Ada W C Yan
- MRC Centre for Global Infectious Disease Analysis, Department of Infectious Disease Epidemiology, School of Public Health, Imperial College London, London W2 1PG, United Kingdom
| | - Jie Zhou
- Section of Virology, Department of Medicine, Imperial College London, London W2 1PG, United Kingdom
| | - Catherine A A Beauchemin
- Department of Physics, Ryerson University, 350 Victoria Street, Toronto, Ontario, M5B 2K3, Canada; Interdisciplinary Theoretical and Mathematical Sciences (iTHEMS), RIKEN, 2-1 Hirosawa, Wako, Saitama, 351-0198, Japan
| | - Colin A Russell
- Laboratory of Applied Evolutionary Biology, Department of Medical Microbiology, Academic Medical Center, University of Amsterdam, Meibergdreef 9, Amsterdam, 1105 AZ, The Netherlands
| | - Wendy S Barclay
- Section of Virology, Department of Medicine, Imperial College London, London W2 1PG, United Kingdom
| | - Steven Riley
- MRC Centre for Global Infectious Disease Analysis, Department of Infectious Disease Epidemiology, School of Public Health, Imperial College London, London W2 1PG, United Kingdom.
| |
Collapse
|
11
|
Ruark-Seward CL, Bonville B, Kennedy G, Rasmussen DA. Evolutionary dynamics of Tomato spotted wilt virus within and between alternate plant hosts and thrips. Sci Rep 2020; 10:15797. [PMID: 32978446 PMCID: PMC7519039 DOI: 10.1038/s41598-020-72691-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2020] [Accepted: 09/04/2020] [Indexed: 12/12/2022] Open
Abstract
Tomato spotted wilt virus (TSWV) is a generalist pathogen with one of the broadest known host ranges among RNA viruses. To understand how TSWV adapts to different hosts, we experimentally passaged viral populations between two alternate hosts, Emilia sochifolia and Datura stramonium, and an obligate vector in which it also replicates, western flower thrips (Frankliniella occidentalis). Deep sequencing viral populations at multiple time points allowed us to track the evolutionary dynamics of viral populations within and between hosts. High levels of viral genetic diversity were maintained in both plants and thrips between transmission events. Rapid fluctuations in the frequency of amino acid variants indicated strong host-specific selection pressures on proteins involved in viral movement (NSm) and replication (RdRp). While several genetic variants showed opposing fitness effects in different hosts, fitness effects were generally positively correlated between hosts indicating that positive rather than antagonistic pleiotropy is pervasive. These results suggest that high levels of genetic diversity together with the positive pleiotropic effects of mutations have allowed TSWV to rapidly adapt to new hosts and expand its host range.
Collapse
Affiliation(s)
- Casey L Ruark-Seward
- Department of Entomology and Plant Pathology, North Carolina State University, Ricks Hall 312, 1 Lampe Drive, Raleigh, NC, 27607, USA
| | - Brian Bonville
- Department of Entomology and Plant Pathology, North Carolina State University, Ricks Hall 312, 1 Lampe Drive, Raleigh, NC, 27607, USA
| | - George Kennedy
- Department of Entomology and Plant Pathology, North Carolina State University, Ricks Hall 312, 1 Lampe Drive, Raleigh, NC, 27607, USA
| | - David A Rasmussen
- Department of Entomology and Plant Pathology, North Carolina State University, Ricks Hall 312, 1 Lampe Drive, Raleigh, NC, 27607, USA. .,Bioinformatics Research Center, North Carolina State University, Raleigh, NC, USA.
| |
Collapse
|
12
|
Lumby CK, Zhao L, Breuer J, Illingworth CJR. A large effective population size for established within-host influenza virus infection. eLife 2020; 9:e56915. [PMID: 32773034 PMCID: PMC7431133 DOI: 10.7554/elife.56915] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2020] [Accepted: 07/30/2020] [Indexed: 12/13/2022] Open
Abstract
Strains of the influenza virus form coherent global populations, yet exist at the level of single infections in individual hosts. The relationship between these scales is a critical topic for understanding viral evolution. Here we investigate the within-host relationship between selection and the stochastic effects of genetic drift, estimating an effective population size of infection Ne for influenza infection. Examining whole-genome sequence data describing a chronic case of influenza B in a severely immunocompromised child we infer an Ne of 2.5 × 107 (95% confidence range 1.0 × 107 to 9.0 × 107) suggesting that genetic drift is of minimal importance during an established influenza infection. Our result, supported by data from influenza A infection, suggests that positive selection during within-host infection is primarily limited by the typically short period of infection. Atypically long infections may have a disproportionate influence upon global patterns of viral evolution.
Collapse
Affiliation(s)
- Casper K Lumby
- Department of Genetics, University of CambridgeCambridgeUnited Kingdom
| | - Lei Zhao
- Department of Genetics, University of CambridgeCambridgeUnited Kingdom
| | - Judith Breuer
- Great Ormond Street HospitalLondonUnited Kingdom
- Division of Infection and Immunity, University College LondonLondonUnited Kingdom
| | - Christopher JR Illingworth
- Department of Genetics, University of CambridgeCambridgeUnited Kingdom
- Department of Applied Mathematics and Theoretical Physics, University of CambridgeCambridgeUnited Kingdom
- Department of Computer Science, Institute of Biotechnology, University of HelsinkiHelsinkiFinland
| |
Collapse
|
13
|
Inferring Transmission Bottleneck Size from Viral Sequence Data Using a Novel Haplotype Reconstruction Method. J Virol 2020; 94:JVI.00014-20. [PMID: 32295920 PMCID: PMC7307158 DOI: 10.1128/jvi.00014-20] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2020] [Accepted: 04/08/2020] [Indexed: 12/12/2022] Open
Abstract
Viral populations undergo a repeated cycle of within-host growth followed by transmission. Viral evolution is affected by each stage of this cycle. The number of viral particles transmitted from one host to another, known as the transmission bottleneck, is an important factor in determining how the evolutionary dynamics of the population play out, restricting the extent to which the evolved diversity of the population can be passed from one host to another. Previous study of viral sequence data has suggested that the transmission bottleneck size for influenza A transmission between human hosts is small. Reevaluating these data using a novel and improved method, we largely confirm this result, albeit that we infer a slightly higher bottleneck size in some cases, of between 1 and 13 virions. While a tight bottleneck operates in human influenza transmission, it is not extreme in nature; some diversity can be meaningfully retained between hosts. The transmission bottleneck is defined as the number of viral particles that transmit from one host to establish an infection in another. Genome sequence data have been used to evaluate the size of the transmission bottleneck between humans infected with the influenza virus; however, the methods used to make these estimates have some limitations. Specifically, viral allele frequencies, which form the basis of many calculations, may not fully capture a process which involves the transmission of entire viral genomes. Here, we set out a novel approach for inferring viral transmission bottlenecks; our method combines an algorithm for haplotype reconstruction with maximum likelihood methods for bottleneck inference. This approach allows for rapid calculation and performs well when applied to data from simulated transmission events; errors in the haplotype reconstruction step did not adversely affect inferences of the population bottleneck. Applied to data from a previous household transmission study of influenza A infection, we confirm the result that the majority of transmission events involve a small number of viruses, albeit with slightly looser bottlenecks being inferred, with between 1 and 13 particles transmitted in the majority of cases. While influenza A transmission involves a tight population bottleneck, the bottleneck is not so tight as to universally prevent the transmission of within-host viral diversity. IMPORTANCE Viral populations undergo a repeated cycle of within-host growth followed by transmission. Viral evolution is affected by each stage of this cycle. The number of viral particles transmitted from one host to another, known as the transmission bottleneck, is an important factor in determining how the evolutionary dynamics of the population play out, restricting the extent to which the evolved diversity of the population can be passed from one host to another. Previous study of viral sequence data has suggested that the transmission bottleneck size for influenza A transmission between human hosts is small. Reevaluating these data using a novel and improved method, we largely confirm this result, albeit that we infer a slightly higher bottleneck size in some cases, of between 1 and 13 virions. While a tight bottleneck operates in human influenza transmission, it is not extreme in nature; some diversity can be meaningfully retained between hosts.
Collapse
|
14
|
Abstract
The evolutionary dynamics of a virus can differ within hosts and across populations. Studies of within-host evolution provide an important link between experimental studies of virus evolution and large-scale phylodynamic analyses. They can determine the extent to which global processes are recapitulated on local scales and how accurately experimental infections model natural ones. They may also inform epidemiologic models of disease spread and reveal how host-level dynamics contribute to a virus's evolution at a larger scale. Over the last decade, advances in viral sequencing have enabled detailed studies of viral genetic diversity within hosts. I review how within-host diversity is sampled, measured, and expressed, and how comparative studies of viral diversity can be leveraged to elucidate a virus's evolutionary dynamics. These concepts are illustrated with detailed reviews of recent research on the within-host evolution of influenza virus, dengue virus, and cytomegalovirus.
Collapse
Affiliation(s)
- Adam S Lauring
- Division of Infectious Diseases, Department of Internal Medicine, and Department of Microbiology and Immunology, University of Michigan, Ann Arbor, Michigan 48109, USA;
| |
Collapse
|
15
|
A de novo approach to inferring within-host fitness effects during untreated HIV-1 infection. PLoS Pathog 2020; 16:e1008171. [PMID: 32492061 PMCID: PMC7295245 DOI: 10.1371/journal.ppat.1008171] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2019] [Revised: 06/15/2020] [Accepted: 05/11/2020] [Indexed: 12/15/2022] Open
Abstract
In the absence of effective antiviral therapy, HIV-1 evolves in response to the within-host environment, of which the immune system is an important aspect. During the earliest stages of infection, this process of evolution is very rapid, driven by a small number of CTL escape mutations. As the infection progresses, immune escape variants evolve under reduced magnitudes of selection, while competition between an increasing number of polymorphic alleles (i.e., clonal interference) makes it difficult to quantify the magnitude of selection acting upon specific variant alleles. To tackle this complex problem, we developed a novel multi-locus inference method to evaluate the role of selection during the chronic stage of within-host infection. We applied this method to targeted sequence data from the p24 and gp41 regions of HIV-1 collected from 34 patients with long-term untreated HIV-1 infection. We identify a broad distribution of beneficial fitness effects during infection, with a small number of variants evolving under strong selection and very many variants evolving under weaker selection. The uniquely large number of infections analysed granted a previously unparalleled statistical power to identify loci at which selection could be inferred to act with statistical confidence. Our model makes no prior assumptions about the nature of alleles under selection, such that any synonymous or non-synonymous variant may be inferred to evolve under selection. However, the majority of variants inferred with confidence to be under selection were non-synonymous in nature, and in most cases were have previously been associated with either CTL escape in p24 or neutralising antibody escape in gp41. We also identified a putative new CTL escape site (residue 286 in gag), and a region of gp41 (including residues 644, 648, 655 in env) likely to be associated with immune escape. Sites inferred to be under selection in multiple hosts have high within-host and between-host diversity although not all sites with high between-host diversity were inferred to be under selection at the within-host level. Our identification of selection at sites associated with resistance to broadly neutralising antibodies (bNAbs) highlights the need to fully understand the role of selection in untreated individuals when designing bNAb based therapies.
Collapse
|
16
|
Long JS, Mistry B, Haslam SM, Barclay WS. Host and viral determinants of influenza A virus species specificity. Nat Rev Microbiol 2020; 17:67-81. [PMID: 30487536 DOI: 10.1038/s41579-018-0115-z] [Citation(s) in RCA: 329] [Impact Index Per Article: 82.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
Influenza A viruses cause pandemics when they cross between species and an antigenically novel virus acquires the ability to infect and transmit between these new hosts. The timing of pandemics is currently unpredictable but depends on ecological and virological factors. The host range of an influenza A virus is determined by species-specific interactions between virus and host cell factors. These include the ability to bind and enter cells, to replicate the viral RNA genome within the host cell nucleus, to evade host restriction factors and innate immune responses and to transmit between individuals. In this Review, we examine the host barriers that influenza A viruses of animals, especially birds, must overcome to initiate a pandemic in humans and describe how, on crossing the species barrier, the virus mutates to establish new interactions with the human host. This knowledge is used to inform risk assessments for future pandemics and to identify virus-host interactions that could be targeted by novel intervention strategies.
Collapse
Affiliation(s)
- Jason S Long
- Department of Medicine, Imperial College London, London, UK
| | - Bhakti Mistry
- Department of Medicine, Imperial College London, London, UK
| | - Stuart M Haslam
- Department of Life Sciences, Imperial College London, London, UK
| | - Wendy S Barclay
- Department of Medicine, Imperial College London, London, UK.
| |
Collapse
|
17
|
Soli R, Kaabi B, Barhoumi M, Maktouf C, Ahmed SBH. Bayesian phylogenetic analysis of the influenza-A virus genomes isolated in Tunisia, and determination of potential recombination events. Mol Phylogenet Evol 2019; 134:253-268. [DOI: 10.1016/j.ympev.2019.01.019] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2018] [Revised: 12/27/2018] [Accepted: 01/22/2019] [Indexed: 11/24/2022]
|
18
|
Soto W, Travisano M, Tolleson AR, Nishiguchi MK. Symbiont evolution during the free-living phase can improve host colonization. MICROBIOLOGY-SGM 2019; 165:174-187. [PMID: 30648935 PMCID: PMC7003651 DOI: 10.1099/mic.0.000756] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
For micro-organisms cycling between free-living and host-associated stages, where reproduction occurs in both of these lifestyles, an interesting inquiry is whether evolution during the free-living stage can be positively pleiotropic to microbial fitness in a host environment. To address this topic, the squid host Euprymna tasmanica and the marine bioluminescent bacterium Vibrio fischeri were utilized. Microbial ecological diversification in static liquid microcosms was used to simulate symbiont evolution during the free-living stage. Thirteen genetically distinct V. fischeri strains from a broad diversity of ecological sources (e.g. squid light organs, fish light organs and seawater) were examined to see if the results were reproducible in many different genetic settings. Genetic backgrounds that are closely related can be predisposed to considerable differences in how they respond to similar selection pressures. For all strains examined, new mutations with striking and facilitating effects on host colonization arose quickly during microbial evolution in the free-living stage, regardless of the ecological context under consideration for a strain’s genetic background. Microbial evolution outside a host environment promoted host range expansion, improved host colonization for a micro-organism, and diminished the negative correlation between biofilm formation and motility.
Collapse
Affiliation(s)
- William Soto
- 1College of William & Mary, Department of Biology, Integrated Science Center Rm 3035, 540 Landrum Dr Williamsburg, VA 23185, USA
| | - Michael Travisano
- 2Department of Ecology, Evolution, and Behavior, University of Minnesota-Twin Cities, 100 Ecology Building, 1987 Upper Buford Circle, Saint Paul, MN 55108, USA.,3BioTechnology Institute, University of Minnesota-Twin Cities, 140 Gortner Labs, 1479 Gortner Avenue, St Paul, MN 55108, USA
| | - Alexandra Rose Tolleson
- 1College of William & Mary, Department of Biology, Integrated Science Center Rm 3035, 540 Landrum Dr Williamsburg, VA 23185, USA
| | | |
Collapse
|
19
|
Zhao L, Abbasi AB, Illingworth CJR. Mutational load causes stochastic evolutionary outcomes in acute RNA viral infection. Virus Evol 2019; 5:vez008. [PMID: 31024738 PMCID: PMC6476161 DOI: 10.1093/ve/vez008] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
Mutational load is known to be of importance for the evolution of RNA viruses, the combination of a high mutation rate and large population size leading to an accumulation of deleterious mutations. However, while the effects of mutational load on global viral populations have been considered, its quantitative effects at the within-host scale of infection are less well understood. We here show that even on the rapid timescale of acute disease, mutational load has an effect on within-host viral adaptation, reducing the effective selection acting upon beneficial variants by ∼10 per cent. Furthermore, mutational load induces considerable stochasticity in the pattern of evolution, causing a more than five-fold uncertainty in the effective fitness of a transmitted beneficial variant. Our work aims to bridge the gap between classic models from population genetic theory and the biology of viral infection. In an advance on some previous models of mutational load, we replace the assumption of a constant variant fitness cost with an experimentally-derived distribution of fitness effects. Expanding previous frameworks for evolutionary simulation, we introduce the Wright-Fisher model with continuous mutation, which describes a continuum of possible modes of replication within a cell. Our results advance our understanding of adaptation in the context of strong selection and a high mutation rate. Despite viral populations having large absolute sizes, critical events in viral adaptation, including antigenic drift and the onset of drug resistance, arise through stochastic evolutionary processes.
Collapse
Affiliation(s)
- Lei Zhao
- Department of Genetics, University of Cambridge, Cambridge, UK
| | - Ali B Abbasi
- Department of Genetics, University of Cambridge, Cambridge, UK
| | - Christopher J R Illingworth
- Department of Genetics, University of Cambridge, Cambridge, UK
- Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge, UK
| |
Collapse
|
20
|
Zhao L, Illingworth CJR. Measurements of intrahost viral diversity require an unbiased diversity metric. Virus Evol 2019; 5:vey041. [PMID: 30723551 PMCID: PMC6354029 DOI: 10.1093/ve/vey041] [Citation(s) in RCA: 44] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
Viruses exist within hosts at large population sizes and are subject to high rates of mutation. As such, viral populations exhibit considerable sequence diversity. A variety of summary statistics have been developed which describe, in a single number, the extent of diversity in a viral population; such measurements allow the diversities of different populations to be compared, and the effect of evolutionary forces on a population to be assessed. Here we highlight statistical artefacts underlying some common measures of sequence diversity, whereby variation in the depth of genome sequencing may substantially affect the extent of diversity measured in a viral population, making comparisons of population diversity invalid. Specifically, naive estimation of sequence entropy provides a systematically biased metric, a lower read depth being expected to produce a lower estimate of diversity. The number of polymorphic loci per kilobase of genome is more unpredictably affected by read depth, giving potentially flawed results at lower sequencing depths. We show that the nucleotide diversity statistic π provides an unbiased estimate of diversity in the sense that the expected value of the statistic is equal to the correct value of the property being measured. Our results are of importance for studies interpreting genome sequence data; we describe how diversity may be assessed in viral populations in a fair and unbiased manner.
Collapse
Affiliation(s)
- Lei Zhao
- Department of Genetics, University of Cambridge, Downing Street, Cambridge, UK
| | - Christopher J R Illingworth
- Department of Genetics, University of Cambridge, Downing Street, Cambridge, UK
- Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Wilberforce Road, Cambridge, UK
| |
Collapse
|
21
|
Lumby CK, Nene NR, Illingworth CJR. A novel framework for inferring parameters of transmission from viral sequence data. PLoS Genet 2018; 14:e1007718. [PMID: 30325921 PMCID: PMC6203404 DOI: 10.1371/journal.pgen.1007718] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2018] [Revised: 10/26/2018] [Accepted: 09/26/2018] [Indexed: 11/18/2022] Open
Abstract
Transmission between hosts is a critical part of the viral lifecycle. Recent studies of viral transmission have used genome sequence data to evaluate the number of particles transmitted between hosts, and the role of selection as it operates during the transmission process. However, the interpretation of sequence data describing transmission events is a challenging task. We here present a novel and comprehensive framework for using short-read sequence data to understand viral transmission events, designed for influenza virus, but adaptable to other viral species. Our approach solves multiple shortcomings of previous methods for this purpose; for example, we consider transmission as an event involving whole viruses, rather than sets of independent alleles. We demonstrate how selection during transmission and noisy sequence data may each affect naive inferences of the population bottleneck, accounting for these in our framework so as to achieve a correct inference. We identify circumstances in which selection for increased viral transmission may or may not be identified from data. Applying our method to experimental data in which transmission occurs in the presence of strong selection, we show that our framework grants a more quantitative insight into transmission events than previous approaches, inferring the bottleneck in a manner that accounts for selection, both for within-host virulence, and for inherent viral transmissibility. Our work provides new opportunities for studying transmission processes in influenza, and by extension, in other infectious diseases.
Collapse
Affiliation(s)
- Casper K. Lumby
- Department of Genetics, University of Cambridge, Cambridge, United Kingdom
| | - Nuno R. Nene
- Department of Genetics, University of Cambridge, Cambridge, United Kingdom
| | - Christopher J. R. Illingworth
- Department of Genetics, University of Cambridge, Cambridge, United Kingdom
- Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge, United Kingdom
| |
Collapse
|
22
|
Abstract
The rapid global evolution of influenza virus begins with mutations that arise de novo in individual infections, but little is known about how evolution occurs within hosts. We review recent progress in understanding how and why influenza viruses evolve within human hosts. Advances in deep sequencing make it possible to measure within-host genetic diversity in both acute and chronic influenza infections. Factors like antigenic selection, antiviral treatment, tissue specificity, spatial structure, and multiplicity of infection may affect how influenza viruses evolve within human hosts. Studies of within-host evolution can contribute to our understanding of the evolutionary and epidemiological factors that shape influenza virus's global evolution.
Collapse
Affiliation(s)
- Katherine S Xue
- Department of Genome Sciences, University of Washington, Seattle, WA, USA; Division of Basic Sciences and Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - Louise H Moncla
- Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - Trevor Bedford
- Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - Jesse D Bloom
- Department of Genome Sciences, University of Washington, Seattle, WA, USA; Division of Basic Sciences and Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, WA, USA.
| |
Collapse
|
23
|
Inferring Fitness Effects from Time-Resolved Sequence Data with a Delay-Deterministic Model. Genetics 2018; 209:255-264. [PMID: 29500183 PMCID: PMC5937181 DOI: 10.1534/genetics.118.300790] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2018] [Accepted: 02/28/2018] [Indexed: 11/30/2022] Open
Abstract
A broad range of approaches have considered the challenge of inferring selection from time-resolved genome sequence data. Models describing deterministic changes in allele or haplotype frequency have been highlighted as providing accurate and computationally... A common challenge arising from the observation of an evolutionary system over time is to infer the magnitude of selection acting upon a specific genetic variant, or variants, within the population. The inference of selection may be confounded by the effects of genetic drift in a system, leading to the development of inference procedures to account for these effects. However, recent work has suggested that deterministic models of evolution may be effective in capturing the effects of selection even under complex models of demography, suggesting the more general application of deterministic approaches to inference. Responding to this literature, we here note a case in which a deterministic model of evolution may give highly misleading inferences, resulting from the nondeterministic properties of mutation in a finite population. We propose an alternative approach that acts to correct for this error, and which we denote the delay-deterministic model. Applying our model to a simple evolutionary system, we demonstrate its performance in quantifying the extent of selection acting within that system. We further consider the application of our model to sequence data from an evolutionary experiment. We outline scenarios in which our model may produce improved results for the inference of selection, noting that such situations can be easily identified via the use of a regular deterministic model.
Collapse
|
24
|
Illingworth CJR, Roy S, Beale MA, Tutill H, Williams R, Breuer J. On the effective depth of viral sequence data. Virus Evol 2017; 3:vex030. [PMID: 29250429 PMCID: PMC5724399 DOI: 10.1093/ve/vex030] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023] Open
Abstract
Genome sequence data are of great value in describing evolutionary processes in viral populations. However, in such studies, the extent to which data accurately describes the viral population is a matter of importance. Multiple factors may influence the accuracy of a dataset, including the quantity and nature of the sample collected, and the subsequent steps in viral processing. To investigate this phenomenon, we sequenced replica datasets spanning a range of viruses, and in which the point at which samples were split was different in each case, from a dataset in which independent samples were collected from a single patient to another in which all processing steps up to sequencing were applied to a single sample before splitting the sample and sequencing each replicate. We conclude that neither a high read depth nor a high template number in a sample guarantee the precision of a dataset. Measures of consistency calculated from within a single biological sample may also be insufficient; distortion of the composition of a population by the experimental procedure or genuine within-host diversity between samples may each affect the results. Where it is possible, data from replicate samples should be collected to validate the consistency of short-read sequence data.
Collapse
Affiliation(s)
- Christopher J R Illingworth
- Department of Genetics, University of Cambridge, Cambridge, UK.,Department of Applied Maths and Theoretical Physics, Centre for Mathematical Sciences, University of Cambridge, Cambridge, UK
| | - Sunando Roy
- Division of Infection and Immunity, University College London, London, UK
| | | | - Helena Tutill
- Division of Infection and Immunity, University College London, London, UK
| | - Rachel Williams
- Division of Infection and Immunity, University College London, London, UK
| | - Judith Breuer
- Division of Infection and Immunity, University College London, London, UK
| |
Collapse
|
25
|
Sobel Leonard A, McClain MT, Smith GJD, Wentworth DE, Halpin RA, Lin X, Ransier A, Stockwell TB, Das SR, Gilbert AS, Lambkin-Williams R, Ginsburg GS, Woods CW, Koelle K, Illingworth CJR. The effective rate of influenza reassortment is limited during human infection. PLoS Pathog 2017; 13:e1006203. [PMID: 28170438 PMCID: PMC5315410 DOI: 10.1371/journal.ppat.1006203] [Citation(s) in RCA: 30] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2016] [Revised: 02/17/2017] [Accepted: 01/26/2017] [Indexed: 12/31/2022] Open
Abstract
We characterise the evolutionary dynamics of influenza infection described by viral sequence data collected from two challenge studies conducted in human hosts. Viral sequence data were collected at regular intervals from infected hosts. Changes in the sequence data observed across time show that the within-host evolution of the virus was driven by the reversion of variants acquired during previous passaging of the virus. Treatment of some patients with oseltamivir on the first day of infection did not lead to the emergence of drug resistance variants in patients. Using an evolutionary model, we inferred the effective rate of reassortment between viral segments, measuring the extent to which randomly chosen viruses within the host exchange genetic material. We find strong evidence that the rate of effective reassortment is low, such that genetic associations between polymorphic loci in different segments are preserved during the course of an infection in a manner not compatible with epistasis. Combining our evidence with that of previous studies we suggest that spatial heterogeneity in the viral population may reduce the extent to which reassortment is observed. Our results do not contradict previous findings of high rates of viral reassortment in vitro and in small animal studies, but indicate that in human hosts the effective rate of reassortment may be substantially more limited. The influenza virus is an important cause of disease in the human population. During the course of an infection the virus can evolve rapidly. An important mechanism of viral evolution is reassortment, whereby different segments of the influenza genome are shuffled with other segments, producing new viral combinations. Here we study natural selection and reassortment during the course of infections occurring in human hosts. Examining viral genome sequence data from these infections, we note that genetic variants that were acquired during the growth of viruses in culture are selected against in the human host. In addition, we find evidence that the effective rate of reassortment is low. We suggest that the spatial separation between viruses in different parts of the host airway may limit the extent to which genetically distinct segments reassort with one another. Within the global population of influenza viruses, reassortment remains an important factor. However, reassortment is not so rapid as to exclude the possibility of interactions between genome segments affecting the course of influenza evolution during a single infection.
Collapse
Affiliation(s)
- Ashley Sobel Leonard
- Department of Biology, Duke University, Durham, North Carolina, United States of America
| | - Micah T. McClain
- Duke Center for Applied Genomics and Precision Medicine, Durham, North Carolina, United States of America
| | - Gavin J. D. Smith
- Programme in Emerging Infectious Diseases, Duke-NUS Medical School, Singapore
| | - David E. Wentworth
- J. Craig Venter Institute, Rockville, Maryland, United States of America
| | - Rebecca A. Halpin
- J. Craig Venter Institute, Rockville, Maryland, United States of America
| | - Xudong Lin
- J. Craig Venter Institute, Rockville, Maryland, United States of America
| | - Amy Ransier
- J. Craig Venter Institute, Rockville, Maryland, United States of America
| | | | - Suman R. Das
- J. Craig Venter Institute, Rockville, Maryland, United States of America
| | - Anthony S. Gilbert
- hVivo PLC, The QMB Innovation Centre, Queen Mary, University of London, London, United Kingdom
| | - Rob Lambkin-Williams
- hVivo PLC, The QMB Innovation Centre, Queen Mary, University of London, London, United Kingdom
| | - Geoffrey S. Ginsburg
- Duke Center for Applied Genomics and Precision Medicine, Durham, North Carolina, United States of America
| | - Christopher W. Woods
- Duke Center for Applied Genomics and Precision Medicine, Durham, North Carolina, United States of America
| | - Katia Koelle
- Department of Biology, Duke University, Durham, North Carolina, United States of America
| | - Christopher J. R. Illingworth
- Department of Genetics, University of Cambridge, Cambridge, United Kingdom
- Department of Applied Maths and Theoretical Physics, Centre for Mathematical Sciences, Wilberforce Road, University of Cambridge, Cambridge, United Kingdom
- * E-mail:
| |
Collapse
|
26
|
Deep Sequencing of Influenza A Virus from a Human Challenge Study Reveals a Selective Bottleneck and Only Limited Intrahost Genetic Diversification. J Virol 2016; 90:11247-11258. [PMID: 27707932 PMCID: PMC5126380 DOI: 10.1128/jvi.01657-16] [Citation(s) in RCA: 82] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2016] [Accepted: 09/29/2016] [Indexed: 01/06/2023] Open
Abstract
Knowledge of influenza virus evolution at the point of transmission and at the intrahost level remains limited, particularly for human hosts. Here, we analyze a unique viral data set of next-generation sequencing (NGS) samples generated from a human influenza challenge study wherein 17 healthy subjects were inoculated with cell- and egg-passaged virus. Nasal wash samples collected from 7 of these subjects were successfully deep sequenced. From these, we characterized changes in the subjects' viral populations during infection and identified differences between the virus in these samples and the viral stock used to inoculate the subjects. We first calculated pairwise genetic distances between the subjects' nasal wash samples, the viral stock, and the influenza virus A/Wisconsin/67/2005 (H3N2) reference strain used to generate the stock virus. These distances revealed that considerable viral evolution occurred at various points in the human challenge study. Further quantitative analyses indicated that (i) the viral stock contained genetic variants that originated and likely were selected for during the passaging process, (ii) direct intranasal inoculation with the viral stock resulted in a selective bottleneck that reduced nonsynonymous genetic diversity in the viral hemagglutinin and nucleoprotein, and (iii) intrahost viral evolution continued over the course of infection. These intrahost evolutionary dynamics were dominated by purifying selection. Our findings indicate that rapid viral evolution can occur during acute influenza infection in otherwise healthy human hosts when the founding population size of the virus is large, as is the case with direct intranasal inoculation. IMPORTANCE Influenza viruses circulating among humans are known to rapidly evolve over time. However, little is known about how influenza virus evolves across single transmission events and over the course of a single infection. To address these issues, we analyze influenza virus sequences from a human challenge experiment that initiated infection with a cell- and egg-passaged viral stock, which appeared to have adapted during its preparation. We find that the subjects' viral populations differ genetically from the viral stock, with subjects' viral populations having lower representation of the amino-acid-changing variants that arose during viral preparation. We also find that most of the viral evolution occurring over single infections is characterized by further decreases in the frequencies of these amino-acid-changing variants and that only limited intrahost genetic diversification through new mutations is apparent. Our findings indicate that influenza virus populations can undergo rapid genetic changes during acute human infections.
Collapse
|
27
|
Estimating the Effective Population Size from Temporal Allele Frequency Changes in Experimental Evolution. Genetics 2016; 204:723-735. [PMID: 27542959 PMCID: PMC5068858 DOI: 10.1534/genetics.116.191197] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2016] [Accepted: 07/30/2016] [Indexed: 01/22/2023] Open
Abstract
The effective population size (Ne) is a major factor determining allele frequency changes in natural and experimental populations. Temporal methods provide a powerful and simple approach to estimate short-term Ne. They use allele frequency shifts between temporal samples to calculate the standardized variance, which is directly related to Ne. Here we focus on experimental evolution studies that often rely on repeated sequencing of samples in pools (Pool-seq). Pool-seq is cost-effective and often outperforms individual-based sequencing in estimating allele frequencies, but it is associated with atypical sampling properties: Additional to sampling individuals, sequencing DNA in pools leads to a second round of sampling, which increases the variance of allele frequency estimates. We propose a new estimator of Ne, which relies on allele frequency changes in temporal data and corrects for the variance in both sampling steps. In simulations, we obtain accurate Ne estimates, as long as the drift variance is not too small compared to the sampling and sequencing variance. In addition to genome-wide Ne estimates, we extend our method using a recursive partitioning approach to estimate Ne locally along the chromosome. Since the type I error is controlled, our method permits the identification of genomic regions that differ significantly in their Ne estimates. We present an application to Pool-seq data from experimental evolution with Drosophila and provide recommendations for whole-genome data. The estimator is computationally efficient and available as an R package at https://github.com/ThomasTaus/Nest.
Collapse
|
28
|
Illingworth CJR. SAMFIRE: multi-locus variant calling for time-resolved sequence data. Bioinformatics 2016; 32:2208-9. [PMID: 27153641 DOI: 10.1093/bioinformatics/btw205] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2015] [Accepted: 04/10/2016] [Indexed: 11/12/2022] Open
Abstract
UNLABELLED An increasingly common method for studying evolution is the collection of time-resolved short-read sequence data. Such datasets allow for the direct observation of rapid evolutionary processes, as might occur in natural microbial populations and in evolutionary experiments. In many circumstances, evolutionary pressure acting upon single variants can cause genomic changes at multiple nearby loci. SAMFIRE is an open-access software package for processing and analyzing sequence reads from time-resolved data, calling important single- and multi-locus variants over time, identifying alleles potentially affected by selection, calculating linkage disequilibrium statistics, performing haplotype reconstruction and exploiting time-resolved information to estimate the extent of uncertainty in reported genomic data. AVAILABILITY AND IMPLEMENTATION C ++ code may be found at https://github.com/cjri/samfire/ CONTACT chris.illingworth@gen.cam.ac.uk SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- C J R Illingworth
- Department of Genetics, University of Cambridge, Cambridge CB2 3AS, UK
| |
Collapse
|