1
|
Lou J, Liang W, Cao L, Hu I, Zhao S, Chen Z, Chan RWY, Cheung PPH, Zheng H, Liu C, Li Q, Chong MKC, Zhang Y, Yeoh EK, Chan PKS, Zee BCY, Mok CKP, Wang MH. Predictive evolutionary modelling for influenza virus by site-based dynamics of mutations. Nat Commun 2024; 15:2546. [PMID: 38514647 PMCID: PMC10958014 DOI: 10.1038/s41467-024-46918-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2023] [Accepted: 03/12/2024] [Indexed: 03/23/2024] Open
Abstract
Influenza virus continuously evolves to escape human adaptive immunity and generates seasonal epidemics. Therefore, influenza vaccine strains need to be updated annually for the upcoming flu season to ensure vaccine effectiveness. We develop a computational approach, beth-1, to forecast virus evolution and select representative virus for influenza vaccine. The method involves modelling site-wise mutation fitness. Informed by virus genome and population sero-positivity, we calibrate transition time of mutations and project the fitness landscape to future time, based on which beth-1 selects the optimal vaccine strain. In season-to-season prediction in historical data for the influenza A pH1N1 and H3N2 viruses, beth-1 demonstrates superior genetic matching compared to existing approaches. In prospective validations, the model shows superior or non-inferior genetic matching and neutralization against circulating virus in mice immunization experiments compared to the current vaccine. The method offers a promising and ready-to-use tool to facilitate vaccine strain selection for the influenza virus through capturing heterogeneous evolutionary dynamics over genome space-time and linking molecular variants to population immune response.
Collapse
Affiliation(s)
- Jingzhi Lou
- JC School of Public Health and Primary Care (JCSPHPC), The Chinese University of Hong Kong (CUHK), Hong Kong SAR, China
- Beth Bioinformatics Co. Ltd, Hong Kong SAR, China
| | - Weiwen Liang
- HKU-Pasteur Research Pole, School of Public Health, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China
| | - Lirong Cao
- JC School of Public Health and Primary Care (JCSPHPC), The Chinese University of Hong Kong (CUHK), Hong Kong SAR, China
- CUHK Shenzhen Research Institute, Shenzhen, China
| | - Inchi Hu
- Department of Statistics, George Mason University, Fairfax, VA, USA
| | - Shi Zhao
- JC School of Public Health and Primary Care (JCSPHPC), The Chinese University of Hong Kong (CUHK), Hong Kong SAR, China
- School of Public Health, Tianjin Medical University, Tianjin, China
| | - Zigui Chen
- Department of Microbiology, CUHK, Hong Kong SAR, China
| | - Renee Wan Yi Chan
- Department of Paediatrics, CUHK, Hong Kong SAR, China
- Hong Kong Hub of Paediatric Excellence, CUHK, Hong Kong SAR, China
| | | | - Hong Zheng
- JC School of Public Health and Primary Care (JCSPHPC), The Chinese University of Hong Kong (CUHK), Hong Kong SAR, China
| | - Caiqi Liu
- JC School of Public Health and Primary Care (JCSPHPC), The Chinese University of Hong Kong (CUHK), Hong Kong SAR, China
| | - Qi Li
- JC School of Public Health and Primary Care (JCSPHPC), The Chinese University of Hong Kong (CUHK), Hong Kong SAR, China
| | - Marc Ka Chun Chong
- JC School of Public Health and Primary Care (JCSPHPC), The Chinese University of Hong Kong (CUHK), Hong Kong SAR, China
- CUHK Shenzhen Research Institute, Shenzhen, China
| | - Yexian Zhang
- Beth Bioinformatics Co. Ltd, Hong Kong SAR, China
- CUHK Shenzhen Research Institute, Shenzhen, China
| | - Eng-Kiong Yeoh
- JC School of Public Health and Primary Care (JCSPHPC), The Chinese University of Hong Kong (CUHK), Hong Kong SAR, China
- Centre for Health Systems and Policy Research, CUHK, Hong Kong SAR, China
| | - Paul Kay-Sheung Chan
- Department of Microbiology, CUHK, Hong Kong SAR, China
- Stanley Ho Centre for Emerging Infectious Diseases, CUHK, Hong Kong SAR, China
| | - Benny Chung Ying Zee
- JC School of Public Health and Primary Care (JCSPHPC), The Chinese University of Hong Kong (CUHK), Hong Kong SAR, China
- CUHK Shenzhen Research Institute, Shenzhen, China
| | - Chris Ka Pun Mok
- JC School of Public Health and Primary Care (JCSPHPC), The Chinese University of Hong Kong (CUHK), Hong Kong SAR, China.
- Li Ka Shing Institute of Health Sciences, Faculty of Medicine, CUHK, Hong Kong SAR, China.
| | - Maggie Haitian Wang
- JC School of Public Health and Primary Care (JCSPHPC), The Chinese University of Hong Kong (CUHK), Hong Kong SAR, China.
- CUHK Shenzhen Research Institute, Shenzhen, China.
| |
Collapse
|
2
|
Matsvay A, Dyachkova M, Mikhaylov I, Kiselev D, Say A, Burskaia V, Artyushin I, Khafizov K, Shipulin G. Complete Genome Sequence, Molecular Characterization and Phylogenetic Relationships of a Novel Tern Atadenovirus. Microorganisms 2021; 10:31. [PMID: 35056480 PMCID: PMC8781740 DOI: 10.3390/microorganisms10010031] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2021] [Revised: 12/20/2021] [Accepted: 12/22/2021] [Indexed: 01/03/2023] Open
Abstract
Discovery and study of viruses carried by migratory birds are tasks of high importance due to the host's ability to spread infectious diseases over significant distances. With this paper, we present and characterize the first complete genome sequence of atadenovirus from a tern bird (common tern, Sterna hirundo) preliminarily named tern atadenovirus 1 (TeAdV-1). TeAdV-1 genome is a linear double-stranded DNA molecule, 31,334 base pairs which contain 30 methionine-initiated open reading frames with gene structure typical for Atadenovirus genus, and the shortest known inverted terminal repeats (ITRs) within the Atadenovirus genus consisted of 25 bases. The nucleotide composition of the genome is characterized by a low G + C content (33.86%), which is the most AT-rich genome of known avian adenoviruses within Atadenovirus genus. The nucleotide sequence of the TeAdV-1 genome shows high divergence compared to known representatives of the Atadenovirus genus with the highest similarity to the duck atadenovirus 1 (53.7%). Phylogenetic analysis of the protein sequences of core genes confirms the taxonomic affiliation of the new representative to the genus Atadenovirus with the degree of divergence from the known representatives exceeding the interspecies distance within the genus. Thereby we proposed a novel TeAdV-1 to be considered as a separate species.
Collapse
Affiliation(s)
- Alina Matsvay
- Federal State Budgetary Institution "Centre for Strategic Planning and Management of Biomedical Health Risks" of the Federal Medical Biological Agency, 119121 Moscow, Russia
- Moscow Institute of Physics and Technology, National Research University, 115184 Moscow, Russia
| | - Marina Dyachkova
- Federal State Budgetary Institution "Centre for Strategic Planning and Management of Biomedical Health Risks" of the Federal Medical Biological Agency, 119121 Moscow, Russia
| | - Ivan Mikhaylov
- Federal State Budgetary Institution "Centre for Strategic Planning and Management of Biomedical Health Risks" of the Federal Medical Biological Agency, 119121 Moscow, Russia
| | - Daniil Kiselev
- Institute for Neurosciences of Montpellier, University of Montpellier, INSERM, 34091 Montpellier, France
| | - Anna Say
- Federal State Budgetary Institution "Centre for Strategic Planning and Management of Biomedical Health Risks" of the Federal Medical Biological Agency, 119121 Moscow, Russia
| | | | - Ilya Artyushin
- Faculty of Biology, Lomonosov Moscow State University, 119991 Moscow, Russia
| | - Kamil Khafizov
- Moscow Institute of Physics and Technology, National Research University, 115184 Moscow, Russia
| | - German Shipulin
- Federal State Budgetary Institution "Centre for Strategic Planning and Management of Biomedical Health Risks" of the Federal Medical Biological Agency, 119121 Moscow, Russia
| |
Collapse
|
3
|
Mohamad Noordin N, Tan JL, Chong CK, Chem YK, Tajudin N, Abu Bakar RS, Sengol S, Phoon HYP, Che Azid NAM, W Mohd Arifin WNA, Aziz ZA, Hussin H, Ibrahim NS, Omar A, Ravi U, Kamarul Zaman KH, Yamin MA, Ngeow YF. Genomic diversity of SARS-CoV-2 in Malaysia. PeerJ 2021; 9:e12449. [PMID: 34760404 PMCID: PMC8571957 DOI: 10.7717/peerj.12449] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2021] [Accepted: 10/18/2021] [Indexed: 01/17/2023] Open
Abstract
Background More than a year after its first appearance in December 2019, the COVID-19 pandemic is still on a rampage in many parts of the world. Although several vaccines have been approved for emergency use, the emergence and rapid spread of new SARS-CoV-2 variants have sparked fears of vaccine failure due to immune evasion. Massive viral genome sequencing has been recommended to track the genetic changes that could lead to adverse consequences. Methods We sequenced SARS-CoV-2 respiratory isolates from the National Public Health Laboratory, Malaysia and examined them together with viral genomes deposited in GISAID by other Malaysian researchers, to understand the evolutionary trend of the virus circulating in the country. We studied the distribution of virus lineages and site-wise mutations, analysed genetic clustering with the goeBURST full Minimum Spanning Tree algorithm, examined the trend of viral nucleotide diversity over time and performed nucleotide substitution association analyses. Results We identified 22 sub-lineages, 13 clonal complexes, 178 sequence types and seven sites of linkage disequilibrium in 277 SARS-CoV-2 genomes sequenced between January and December 2020. B.1.524 was the largest lineage group. The number of mutations per genome ranged from 0 to 19. The mean genomic diversity value over 12 months was 3.26 × 10-4. Of 359 mutations detected, 60.5% of which were non-synonymous, the most frequent were in the ORF1ab (P4715L), S (D614G and A701V) and N (S194L) genes. Conclusion The SARS-CoV-2 virus accumulated an abundance of mutations in the first year of the COVID-19 pandemic in Malaysia. Its overall genetic diversity, however, is relatively low compared to other Asian countries with larger populations. Continuous genomic and epidemiological surveillance will help to clarify the evolutionary processes determining viral diversity and impacting on human health.
Collapse
Affiliation(s)
- Noorliza Mohamad Noordin
- National Public Health Laboratory, Ministry of Health Malaysia, Sungai Buloh, Selangor, Malaysia
| | - Joon Liang Tan
- Faculty of Information Science and Technology, Multimedia University, Melaka, Melaka, Malaysia
| | | | - Yu Kie Chem
- National Public Health Laboratory, Ministry of Health Malaysia, Sungai Buloh, Selangor, Malaysia
| | - Norazimah Tajudin
- National Public Health Laboratory, Ministry of Health Malaysia, Sungai Buloh, Selangor, Malaysia
| | - Rehan Shuhada Abu Bakar
- National Public Health Laboratory, Ministry of Health Malaysia, Sungai Buloh, Selangor, Malaysia
| | - Selvanesan Sengol
- National Public Health Laboratory, Ministry of Health Malaysia, Sungai Buloh, Selangor, Malaysia
| | - Hannah Yik Phing Phoon
- National Public Health Laboratory, Ministry of Health Malaysia, Sungai Buloh, Selangor, Malaysia
| | | | | | - Zirwatul Adilah Aziz
- National Public Health Laboratory, Ministry of Health Malaysia, Sungai Buloh, Selangor, Malaysia
| | - Hani Hussin
- National Public Health Laboratory, Ministry of Health Malaysia, Sungai Buloh, Selangor, Malaysia
| | - Nurul Syahida Ibrahim
- National Public Health Laboratory, Ministry of Health Malaysia, Sungai Buloh, Selangor, Malaysia
| | - Aziyati Omar
- National Public Health Laboratory, Ministry of Health Malaysia, Sungai Buloh, Selangor, Malaysia
| | - Ushananthiny Ravi
- National Public Health Laboratory, Ministry of Health Malaysia, Sungai Buloh, Selangor, Malaysia
| | | | - Mohd Asri Yamin
- National Public Health Laboratory, Ministry of Health Malaysia, Sungai Buloh, Selangor, Malaysia
| | - Yun Fong Ngeow
- Faculty of Medicine and Health Sciences, Universiti Tunku Abdul Rahman, Bandar Sungai Long, Selangor, Malaysia.,Center for Research on Communincable Diseases, Universiti Tunku Abdul Rahman, Bandar Sungai Long, Selangor, Malaysia
| |
Collapse
|
4
|
Kepler L, Hamins-Puertolas M, Rasmussen DA. Decomposing the sources of SARS-CoV-2 fitness variation in the United States. Virus Evol 2021; 7:veab073. [PMID: 34642604 PMCID: PMC8499931 DOI: 10.1093/ve/veab073] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2021] [Revised: 08/13/2021] [Accepted: 08/20/2021] [Indexed: 12/14/2022] Open
Abstract
The fitness of a pathogen is a composite phenotype determined by many different factors influencing growth rates both within and between hosts. Determining what factors shape fitness at the host population-level is especially challenging because both intrinsic factors like pathogen genetics and extrinsic factors such as host behavior influence between-host transmission potential. This challenge has been highlighted by controversy surrounding the population-level fitness effects of mutations in the SARS-CoV-2 genome and their relative importance when compared against non-genetic factors shaping transmission dynamics. Building upon phylodynamic birth-death models, we develop a new framework to learn how hundreds of genetic and non-genetic factors have shaped the fitness of SARS-CoV-2. We estimate the fitness effects of all amino acid variants and several structural variants that have circulated in the United States between February 2020 and March 2021 from viral phylogenies. We also estimate how much fitness variation among pathogen lineages is attributable to genetic versus non-genetic factors such as spatial heterogeneity in transmission rates. Before September 2020, most fitness variation between lineages can be explained by background spatial heterogeneity in transmission rates across geographic regions. Starting in late 2020, genetic variation in fitness increased dramatically with the emergence of several new lineages including B.1.1.7, B.1.427, B.1.429 and B.1.526. Our analysis also indicates that genetic variants in less well-explored genomic regions outside of Spike may be contributing significantly to overall fitness variation in the viral population.
Collapse
Affiliation(s)
- Lenora Kepler
- Bioinformatics Research Center, North Carolina State University, 1 Lampe Drive, Raleigh, NC 27607, USA
| | - Marco Hamins-Puertolas
- Biomathematics Graduate Program, North Carolina State University, Campus Box 8213, Raleigh, NC 27695, USA
| | - David A Rasmussen
- Bioinformatics Research Center, North Carolina State University, 1 Lampe Drive, Raleigh, NC 27607, USA
- Department of Entomology and Plant Pathology, North Carolina State University, Campus Box 7613, Raleigh, NC 27695, USA
| |
Collapse
|
5
|
Piantham C, Ito K. Modeling the selective advantage of new amino acids on the hemagglutinin of H1N1 influenza viruses using their patient age distributions. Virus Evol 2021; 7:veab049. [PMID: 34285812 PMCID: PMC8286795 DOI: 10.1093/ve/veab049] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
In 2009, a new strain of H1N1 influenza A virus caused a pandemic, and its descendant strains are causing seasonal epidemics worldwide. Given the high mutation rate of influenza viruses, variant strains having different amino acids on hemagglutinin (HA) continuously emerge. To prepare vaccine strains for the next influenza seasons, it is an urgent task to predict which variants will be selected in the viral population. An analysis of 24,681 pairs of an amino acid sequence of HA of H1N1pdm2009 viruses and its patient age showed that the empirical fixation probability of new amino acids on HA significantly differed depending on their frequencies in the population, patient age distributions, and epitope flags. The selective advantage of a variant strain having a new amino acid was modeled by linear combinations of patients age distributions and epitope flags, and then the fixation probability of the new amino acid was modeled using Kimura’s formula for advantageous selection. The parameters of models were estimated from the sequence data and models were tested with four-fold cross validations. The frequency of new amino acids alone can achieve high sensitivity, specificity, and precision in predicting the fixation of a new amino acid of which frequency is more than 0.11. The estimated parameter suggested that viruses with a new amino acid having a frequency in the population higher than 0.11 have a significantly higher selective advantage compared to viruses with the old amino acid at the same position. The model considering the Z-value of patient age rank-sums of new amino acids predicted amino acid substitutions on HA with a sensitivity of 0.78, specificity of 0.86, and precision of 0.83, showing significant improvement compared to the constant selective advantage model, which used only the frequency of the amino acid. These results suggested that H1N1 viruses tend to be selected in the adult population, and frequency of viruses having new amino acids and their patient ages are useful to predict amino acid substitutions on HA.
Collapse
Affiliation(s)
- Chayada Piantham
- Division of Bioinformatics, Graduate School of Infectious Diseases, Hokkaido University, Sapporo 0600818, Japan
| | - Kimihito Ito
- Division of Bioinformatics, International Institute for Zoonosis Control, Hokkaido University, Sapporo 0010020, Japan
| |
Collapse
|
6
|
Barlukova A, Rouzine IM. The evolutionary origin of the universal distribution of mutation fitness effect. PLoS Comput Biol 2021; 17:e1008822. [PMID: 33684109 PMCID: PMC7971868 DOI: 10.1371/journal.pcbi.1008822] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2020] [Revised: 03/18/2021] [Accepted: 02/19/2021] [Indexed: 01/27/2023] Open
Abstract
An intriguing fact long defying explanation is the observation of a universal exponential distribution of beneficial mutations in fitness effect for different microorganisms. To explain this effect, we use a population model including mutation, directional selection, linkage, and genetic drift. The multiple-mutation regime of adaptation at large population sizes (traveling wave regime) is considered. We demonstrate analytically and by simulation that, regardless of the inherent distribution of mutation fitness effect across genomic sites, an exponential distribution of fitness effects emerges in the long term. This result follows from the exponential statistics of the frequency of the less-fit alleles, f, that we predict to evolve, in the long term, for both polymorphic and monomorphic sites. We map the logarithmic slope of the distribution onto the previously derived fixation probability and demonstrate that it increases linearly in time. Our results demonstrate a striking difference between the distribution of fitness effects observed experimentally for naturally occurring mutations, and the "inherent" distribution obtained in a directed-mutagenesis experiment, which can have any shape depending on the organism. Based on these results, we develop a new method to measure the fitness effect of mutations for each variable residue using DNA sequences sampled from adapting populations. This new method is not sensitive to linkage effects and does not require the one-site model assumptions.
Collapse
Affiliation(s)
- Ayuna Barlukova
- Sorbonne Université, Institute de Biologie Paris-Seine, Laboratoire de Biologie Computationnelle et Quantitative, Paris, France
| | - Igor M. Rouzine
- Sorbonne Université, Institute de Biologie Paris-Seine, Laboratoire de Biologie Computationnelle et Quantitative, Paris, France
- * E-mail: ,
| |
Collapse
|
7
|
Barrat-Charlaix P, Huddleston J, Bedford T, Neher RA. Limited Predictability of Amino Acid Substitutions in Seasonal Influenza Viruses. Mol Biol Evol 2021; 38:2767-2777. [PMID: 33749787 PMCID: PMC8233509 DOI: 10.1093/molbev/msab065] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
Seasonal influenza viruses repeatedly infect humans in part because they rapidly change their antigenic properties and evade host immune responses, necessitating frequent updates of the vaccine composition. Accurate predictions of strains circulating in the future could therefore improve the vaccine match. Here, we studied the predictability of frequency dynamics and fixation of amino acid substitutions. Current frequency was the strongest predictor of eventual fixation, as expected in neutral evolution. Other properties, such as occurrence in previously characterized epitopes or high Local Branching Index (LBI) had little predictive power. Parallel evolution was found to be moderately predictive of fixation. Although the LBI had little power to predict frequency dynamics, it was still successful at picking strains representative of future populations. The latter is due to a tendency of the LBI to be high for consensus-like sequences that are closer to the future than the average sequence. Simulations of models of adapting populations, in contrast, show clear signals of predictability. This indicates that the evolution of influenza HA and NA, while driven by strong selection pressure to change, is poorly described by common models of directional selection such as traveling fitness waves.
Collapse
Affiliation(s)
- Pierre Barrat-Charlaix
- Biozentrum, Universität Basel, Basel, Switzerland.,Swiss Institute of Bioinformatics, Basel, Switzerland
| | - John Huddleston
- Molecular and Cellular Biology Program, University of Washington, Seattle, WA, USA.,Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - Trevor Bedford
- Molecular and Cellular Biology Program, University of Washington, Seattle, WA, USA.,Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - Richard A Neher
- Biozentrum, Universität Basel, Basel, Switzerland.,Swiss Institute of Bioinformatics, Basel, Switzerland
| |
Collapse
|
8
|
A de novo approach to inferring within-host fitness effects during untreated HIV-1 infection. PLoS Pathog 2020; 16:e1008171. [PMID: 32492061 PMCID: PMC7295245 DOI: 10.1371/journal.ppat.1008171] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2019] [Revised: 06/15/2020] [Accepted: 05/11/2020] [Indexed: 12/15/2022] Open
Abstract
In the absence of effective antiviral therapy, HIV-1 evolves in response to the within-host environment, of which the immune system is an important aspect. During the earliest stages of infection, this process of evolution is very rapid, driven by a small number of CTL escape mutations. As the infection progresses, immune escape variants evolve under reduced magnitudes of selection, while competition between an increasing number of polymorphic alleles (i.e., clonal interference) makes it difficult to quantify the magnitude of selection acting upon specific variant alleles. To tackle this complex problem, we developed a novel multi-locus inference method to evaluate the role of selection during the chronic stage of within-host infection. We applied this method to targeted sequence data from the p24 and gp41 regions of HIV-1 collected from 34 patients with long-term untreated HIV-1 infection. We identify a broad distribution of beneficial fitness effects during infection, with a small number of variants evolving under strong selection and very many variants evolving under weaker selection. The uniquely large number of infections analysed granted a previously unparalleled statistical power to identify loci at which selection could be inferred to act with statistical confidence. Our model makes no prior assumptions about the nature of alleles under selection, such that any synonymous or non-synonymous variant may be inferred to evolve under selection. However, the majority of variants inferred with confidence to be under selection were non-synonymous in nature, and in most cases were have previously been associated with either CTL escape in p24 or neutralising antibody escape in gp41. We also identified a putative new CTL escape site (residue 286 in gag), and a region of gp41 (including residues 644, 648, 655 in env) likely to be associated with immune escape. Sites inferred to be under selection in multiple hosts have high within-host and between-host diversity although not all sites with high between-host diversity were inferred to be under selection at the within-host level. Our identification of selection at sites associated with resistance to broadly neutralising antibodies (bNAbs) highlights the need to fully understand the role of selection in untreated individuals when designing bNAb based therapies.
Collapse
|
9
|
Abstract
Influenza viruses rapidly diversify within individual human infections. Several recent studies have deep-sequenced clinical influenza infections to identify viral variation within hosts, but it remains unclear how within-host mutations fare at the between-host scale. Here, we compare the genetic variation of H3N2 influenza within and between hosts to link viral evolutionary dynamics across scales. Synonymous sites evolve at similar rates at both scales, indicating that global evolution at these putatively neutral sites results from the accumulation of within-host variation. However, nonsynonymous mutations are depleted between hosts compared to within hosts, suggesting that selection purges many of the protein-altering changes that arise within hosts. The exception is at antigenic sites, where selection detectably favors nonsynonymous mutations at the global scale, but not within hosts. These results suggest that selection against deleterious mutations and selection for antigenic change are the main forces that act on within-host variants of influenza virus as they transmit and circulate between hosts.
Collapse
Affiliation(s)
- Katherine S Xue
- Department of Genome Sciences, University of Washington, Foege Building S-250, Box 3550653720 15th Ave NE, Seattle WA 98195-5065, USA.,Basic Sciences Division, Fred Hutchinson Cancer Research Center, 1100 Fairview Ave N, Seattle, WA 98109-1024, USA.,Department of Biology, Stanford University, Stanford, CA, USA
| | - Jesse D Bloom
- Department of Genome Sciences, University of Washington, Foege Building S-250, Box 3550653720 15th Ave NE, Seattle WA 98195-5065, USA.,Basic Sciences Division, Fred Hutchinson Cancer Research Center, 1100 Fairview Ave N, Seattle, WA 98109-1024, USA.,Howard Hughes Medical Institute, 1100 Fairview Ave N, Seattle, WA 98109-1024, USA
| |
Collapse
|
10
|
Rasmussen DA, Stadler T. Coupling adaptive molecular evolution to phylodynamics using fitness-dependent birth-death models. eLife 2019; 8:45562. [PMID: 31411558 PMCID: PMC6715349 DOI: 10.7554/elife.45562] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2019] [Accepted: 07/26/2019] [Indexed: 12/25/2022] Open
Abstract
Beneficial and deleterious mutations cause the fitness of lineages to vary across a phylogeny and thereby shape its branching structure. While standard phylogenetic models do not allow mutations to feedback and shape trees, birth-death models can account for this feedback by letting the fitness of lineages depend on their type. To date, these multi-type birth-death models have only been applied to cases where a lineage’s fitness is determined by a single character state. We extend these models to track sequence evolution at multiple sites. This approach remains computationally tractable by tracking the genotype and fitness of lineages probabilistically in an approximate manner. Although approximate, we show that we can accurately estimate the fitness of lineages and site-specific mutational fitness effects from phylogenies. We apply this approach to estimate the population-level fitness effects of mutations in Ebola and influenza virus, and compare our estimates with in vitro fitness measurements for these mutations.
Collapse
Affiliation(s)
- David A Rasmussen
- Department of Entomology and Plant Pathology, North Carolina State University, Raleigh, United States.,Bioinformatics Research Center, North Carolina State University, Raleigh, United States
| | - Tanja Stadler
- Department of Biosystems Science and Engineering, ETH Zürich, Basel, Switzerland.,Swiss Institute of Bioinformatics, Lausanne, Switzerland
| |
Collapse
|
11
|
Zhao L, Abbasi AB, Illingworth CJR. Mutational load causes stochastic evolutionary outcomes in acute RNA viral infection. Virus Evol 2019; 5:vez008. [PMID: 31024738 PMCID: PMC6476161 DOI: 10.1093/ve/vez008] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
Mutational load is known to be of importance for the evolution of RNA viruses, the combination of a high mutation rate and large population size leading to an accumulation of deleterious mutations. However, while the effects of mutational load on global viral populations have been considered, its quantitative effects at the within-host scale of infection are less well understood. We here show that even on the rapid timescale of acute disease, mutational load has an effect on within-host viral adaptation, reducing the effective selection acting upon beneficial variants by ∼10 per cent. Furthermore, mutational load induces considerable stochasticity in the pattern of evolution, causing a more than five-fold uncertainty in the effective fitness of a transmitted beneficial variant. Our work aims to bridge the gap between classic models from population genetic theory and the biology of viral infection. In an advance on some previous models of mutational load, we replace the assumption of a constant variant fitness cost with an experimentally-derived distribution of fitness effects. Expanding previous frameworks for evolutionary simulation, we introduce the Wright-Fisher model with continuous mutation, which describes a continuum of possible modes of replication within a cell. Our results advance our understanding of adaptation in the context of strong selection and a high mutation rate. Despite viral populations having large absolute sizes, critical events in viral adaptation, including antigenic drift and the onset of drug resistance, arise through stochastic evolutionary processes.
Collapse
Affiliation(s)
- Lei Zhao
- Department of Genetics, University of Cambridge, Cambridge, UK
| | - Ali B Abbasi
- Department of Genetics, University of Cambridge, Cambridge, UK
| | - Christopher J R Illingworth
- Department of Genetics, University of Cambridge, Cambridge, UK
- Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge, UK
| |
Collapse
|
12
|
Lumby CK, Nene NR, Illingworth CJR. A novel framework for inferring parameters of transmission from viral sequence data. PLoS Genet 2018; 14:e1007718. [PMID: 30325921 PMCID: PMC6203404 DOI: 10.1371/journal.pgen.1007718] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2018] [Revised: 10/26/2018] [Accepted: 09/26/2018] [Indexed: 11/18/2022] Open
Abstract
Transmission between hosts is a critical part of the viral lifecycle. Recent studies of viral transmission have used genome sequence data to evaluate the number of particles transmitted between hosts, and the role of selection as it operates during the transmission process. However, the interpretation of sequence data describing transmission events is a challenging task. We here present a novel and comprehensive framework for using short-read sequence data to understand viral transmission events, designed for influenza virus, but adaptable to other viral species. Our approach solves multiple shortcomings of previous methods for this purpose; for example, we consider transmission as an event involving whole viruses, rather than sets of independent alleles. We demonstrate how selection during transmission and noisy sequence data may each affect naive inferences of the population bottleneck, accounting for these in our framework so as to achieve a correct inference. We identify circumstances in which selection for increased viral transmission may or may not be identified from data. Applying our method to experimental data in which transmission occurs in the presence of strong selection, we show that our framework grants a more quantitative insight into transmission events than previous approaches, inferring the bottleneck in a manner that accounts for selection, both for within-host virulence, and for inherent viral transmissibility. Our work provides new opportunities for studying transmission processes in influenza, and by extension, in other infectious diseases.
Collapse
Affiliation(s)
- Casper K. Lumby
- Department of Genetics, University of Cambridge, Cambridge, United Kingdom
| | - Nuno R. Nene
- Department of Genetics, University of Cambridge, Cambridge, United Kingdom
| | - Christopher J. R. Illingworth
- Department of Genetics, University of Cambridge, Cambridge, United Kingdom
- Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge, United Kingdom
| |
Collapse
|
13
|
Correia V, Abecasis AB, Rebelo-de-Andrade H. Molecular footprints of selective pressure in the neuraminidase gene of currently circulating human influenza subtypes and lineages. Virology 2018; 522:122-130. [PMID: 30029011 DOI: 10.1016/j.virol.2018.07.002] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2017] [Revised: 07/03/2018] [Accepted: 07/04/2018] [Indexed: 12/20/2022]
Abstract
Influenza neuraminidase (NA) is under selective pressure (SP) of both host immune system and drug use. Here, we assembled large datasets of NA sequences of worldwide circulating viruses to estimate the global and site-specific SP acting on all current subtypes/lineages of human influenza NA. An overall negative SP of similar magnitude and a prevalence of negatively selected sites were observed for all subtypes/lineages. Positively selected sites varied according to the subtype/lineage, including N1-NA sites 247 and 275, N2-NA sites 148 and 151, and B/Victoria-NA site 395 associated with drug-resistance or reduced susceptibility. These results evidenced a potential role of positive selection in the low-level spread of A(H1N1)pdm09-H275Y drug-resistant viruses, and alerted for a potential higher risk of spread of a synergistic A(H1N1)pdm09 drug-resistant variant (H275Y/S247N). The positive selection detected at N2-NA sites 148 and 151 was probably an artefact from cell-culture. Overall mapping revealed six potential new druggable regions.
Collapse
Affiliation(s)
- Vanessa Correia
- Infectious Diseases Department, Instituto Nacional de Saúde Doutor Ricardo Jorge, IP, Av. Padre Cruz, 1649-016 Lisbon, Portugal; Host-Pathogen Interaction Unit, Research Institute for Medicines (iMed.ULisboa), Faculdade de Farmácia, Universidade de Lisboa, Av. Professor Gama Pinto, 1649-003 Lisbon, Portugal.
| | - Ana B Abecasis
- Global Health and Tropical Medicine, Instituto de Higiene e Medicina Tropical, Universidade NOVA de Lisboa, Rua da Junqueira 100, 1349-008 Lisbon, Portugal.
| | - Helena Rebelo-de-Andrade
- Infectious Diseases Department, Instituto Nacional de Saúde Doutor Ricardo Jorge, IP, Av. Padre Cruz, 1649-016 Lisbon, Portugal; Host-Pathogen Interaction Unit, Research Institute for Medicines (iMed.ULisboa), Faculdade de Farmácia, Universidade de Lisboa, Av. Professor Gama Pinto, 1649-003 Lisbon, Portugal.
| |
Collapse
|
14
|
Vázquez-García I, Salinas F, Li J, Fischer A, Barré B, Hallin J, Bergström A, Alonso-Perez E, Warringer J, Mustonen V, Liti G. Clonal Heterogeneity Influences the Fate of New Adaptive Mutations. Cell Rep 2018; 21:732-744. [PMID: 29045840 PMCID: PMC5656752 DOI: 10.1016/j.celrep.2017.09.046] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2017] [Revised: 05/25/2017] [Accepted: 09/14/2017] [Indexed: 11/03/2022] Open
Abstract
The joint contribution of pre-existing and de novo genetic variation to clonal adaptation is poorly understood but essential to designing successful antimicrobial or cancer therapies. To address this, we evolve genetically diverse populations of budding yeast, S. cerevisiae, consisting of diploid cells with unique haplotype combinations. We study the asexual evolution of these populations under selective inhibition with chemotherapeutic drugs by time-resolved whole-genome sequencing and phenotyping. All populations undergo clonal expansions driven by de novo mutations but remain genetically and phenotypically diverse. The clones exhibit widespread genomic instability, rendering recessive de novo mutations homozygous and refining pre-existing variation. Finally, we decompose the fitness contributions of pre-existing and de novo mutations by creating a large recombinant library of adaptive mutations in an ensemble of genetic backgrounds. Both pre-existing and de novo mutations substantially contribute to fitness, and the relative fitness of pre-existing variants sets a selective threshold for new adaptive mutations.
Collapse
Affiliation(s)
- Ignacio Vázquez-García
- Wellcome Trust Sanger Institute, Hinxton, Cambridge CB10 1SA, UK; Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge CB3 0WA, UK.
| | | | - Jing Li
- Université Côte d'Azur, INSERM, CNRS, IRCAN, 06107 Nice, France
| | - Andrej Fischer
- Wellcome Trust Sanger Institute, Hinxton, Cambridge CB10 1SA, UK
| | - Benjamin Barré
- Université Côte d'Azur, INSERM, CNRS, IRCAN, 06107 Nice, France
| | - Johan Hallin
- Université Côte d'Azur, INSERM, CNRS, IRCAN, 06107 Nice, France
| | - Anders Bergström
- Wellcome Trust Sanger Institute, Hinxton, Cambridge CB10 1SA, UK; Université Côte d'Azur, INSERM, CNRS, IRCAN, 06107 Nice, France
| | - Elisa Alonso-Perez
- Department of Chemistry and Molecular Biology, University of Gothenburg, Gothenburg, Sweden
| | - Jonas Warringer
- Department of Chemistry and Molecular Biology, University of Gothenburg, Gothenburg, Sweden; Centre for Integrative Genetics (CIGENE), Department of Animal and Aquacultural Sciences, Norwegian University of Life Sciences, Ås, Norway
| | - Ville Mustonen
- Wellcome Trust Sanger Institute, Hinxton, Cambridge CB10 1SA, UK.
| | - Gianni Liti
- Université Côte d'Azur, INSERM, CNRS, IRCAN, 06107 Nice, France.
| |
Collapse
|
15
|
Morris DH, Gostic KM, Pompei S, Bedford T, Łuksza M, Neher RA, Grenfell BT, Lässig M, McCauley JW. Predictive Modeling of Influenza Shows the Promise of Applied Evolutionary Biology. Trends Microbiol 2018; 26:102-118. [PMID: 29097090 PMCID: PMC5830126 DOI: 10.1016/j.tim.2017.09.004] [Citation(s) in RCA: 60] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2017] [Revised: 09/06/2017] [Accepted: 09/19/2017] [Indexed: 01/16/2023]
Abstract
Seasonal influenza is controlled through vaccination campaigns. Evolution of influenza virus antigens means that vaccines must be updated to match novel strains, and vaccine effectiveness depends on the ability of scientists to predict nearly a year in advance which influenza variants will dominate in upcoming seasons. In this review, we highlight a promising new surveillance tool: predictive models. Based on data-sharing and close collaboration between the World Health Organization and academic scientists, these models use surveillance data to make quantitative predictions regarding influenza evolution. Predictive models demonstrate the potential of applied evolutionary biology to improve public health and disease control. We review the state of influenza predictive modeling and discuss next steps and recommendations to ensure that these models deliver upon their considerable biomedical promise.
Collapse
Affiliation(s)
- Dylan H Morris
- Department of Ecology and Evolutionary Biology, Princeton University, Princeton, NJ, USA.
| | - Katelyn M Gostic
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, CA, USA
| | - Simone Pompei
- Institute for Theoretical Physics, University of Cologne, Cologne, Germany
| | - Trevor Bedford
- Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - Marta Łuksza
- Institute for Advanced Study, Princeton, NJ, USA
| | - Richard A Neher
- Biozentrum, University of Basel and Swiss Institute of Bioinformatics, Basel, Switzerland
| | - Bryan T Grenfell
- Department of Ecology and Evolutionary Biology, Princeton University, Princeton, NJ, USA; Fogarty International Center, National Institutes of Health, Bethesda, MD, USA
| | - Michael Lässig
- Institute for Theoretical Physics, University of Cologne, Cologne, Germany
| | - John W McCauley
- Worldwide Influenza Centre, Francis Crick Institute, London, UK
| |
Collapse
|
16
|
Raghwani J, Thompson RN, Koelle K. Selection on non-antigenic gene segments of seasonal influenza A virus and its impact on adaptive evolution. Virus Evol 2017; 3:vex034. [PMID: 29250432 PMCID: PMC5724400 DOI: 10.1093/ve/vex034] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023] Open
Abstract
Most studies on seasonal influenza A/H3N2 virus adaptation have focused on the main antigenic gene, hemagglutinin. However, there is increasing evidence that the genome-wide genetic background of novel antigenic variants can influence these variants’ emergence probabilities and impact their patterns of dominance in the population. This suggests that non-antigenic genes may be important in shaping the viral evolutionary dynamics. To better understand the role of selection on non-antigenic genes in the adaptive evolution of seasonal influenza viruses, we have developed a simple population genetic model that considers a virus with one antigenic and one non-antigenic gene segment. By simulating this model under different regimes of selection and reassortment, we find that the empirical patterns of lineage turnover for the antigenic and non-antigenic gene segments are best captured when there is both limited viral coinfection and selection operating on both gene segments. In contrast, under a scenario of only neutral evolution in the non-antigenic gene segment, we see persistence of multiple lineages for long periods of time in that segment, which is not compatible with observed molecular evolutionary patterns. Further, we find that reassortment, occurring in coinfected individuals, can increase the speed of viral adaptive evolution by primarily reducing selective interference and genetic linkage effects. Together, these findings suggest that, for influenza, with six internal or non-antigenic gene segments, the evolutionary dynamics of novel antigenic variants are likely to be influenced by the genome-wide genetic background as a result of linked selection among both beneficial and deleterious mutations.
Collapse
Affiliation(s)
- Jayna Raghwani
- Department of Zoology, University of Oxford, Oxford, OX1 3SY, UK
| | - Robin N Thompson
- Department of Zoology, University of Oxford, Oxford, OX1 3SY, UK
| | - Katia Koelle
- Department of Biology, Duke University, Durham, NC 27708, USA
| |
Collapse
|
17
|
|
18
|
Sobel Leonard A, McClain MT, Smith GJD, Wentworth DE, Halpin RA, Lin X, Ransier A, Stockwell TB, Das SR, Gilbert AS, Lambkin-Williams R, Ginsburg GS, Woods CW, Koelle K, Illingworth CJR. The effective rate of influenza reassortment is limited during human infection. PLoS Pathog 2017; 13:e1006203. [PMID: 28170438 PMCID: PMC5315410 DOI: 10.1371/journal.ppat.1006203] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2016] [Revised: 02/17/2017] [Accepted: 01/26/2017] [Indexed: 12/31/2022] Open
Abstract
We characterise the evolutionary dynamics of influenza infection described by viral sequence data collected from two challenge studies conducted in human hosts. Viral sequence data were collected at regular intervals from infected hosts. Changes in the sequence data observed across time show that the within-host evolution of the virus was driven by the reversion of variants acquired during previous passaging of the virus. Treatment of some patients with oseltamivir on the first day of infection did not lead to the emergence of drug resistance variants in patients. Using an evolutionary model, we inferred the effective rate of reassortment between viral segments, measuring the extent to which randomly chosen viruses within the host exchange genetic material. We find strong evidence that the rate of effective reassortment is low, such that genetic associations between polymorphic loci in different segments are preserved during the course of an infection in a manner not compatible with epistasis. Combining our evidence with that of previous studies we suggest that spatial heterogeneity in the viral population may reduce the extent to which reassortment is observed. Our results do not contradict previous findings of high rates of viral reassortment in vitro and in small animal studies, but indicate that in human hosts the effective rate of reassortment may be substantially more limited. The influenza virus is an important cause of disease in the human population. During the course of an infection the virus can evolve rapidly. An important mechanism of viral evolution is reassortment, whereby different segments of the influenza genome are shuffled with other segments, producing new viral combinations. Here we study natural selection and reassortment during the course of infections occurring in human hosts. Examining viral genome sequence data from these infections, we note that genetic variants that were acquired during the growth of viruses in culture are selected against in the human host. In addition, we find evidence that the effective rate of reassortment is low. We suggest that the spatial separation between viruses in different parts of the host airway may limit the extent to which genetically distinct segments reassort with one another. Within the global population of influenza viruses, reassortment remains an important factor. However, reassortment is not so rapid as to exclude the possibility of interactions between genome segments affecting the course of influenza evolution during a single infection.
Collapse
Affiliation(s)
- Ashley Sobel Leonard
- Department of Biology, Duke University, Durham, North Carolina, United States of America
| | - Micah T. McClain
- Duke Center for Applied Genomics and Precision Medicine, Durham, North Carolina, United States of America
| | - Gavin J. D. Smith
- Programme in Emerging Infectious Diseases, Duke-NUS Medical School, Singapore
| | - David E. Wentworth
- J. Craig Venter Institute, Rockville, Maryland, United States of America
| | - Rebecca A. Halpin
- J. Craig Venter Institute, Rockville, Maryland, United States of America
| | - Xudong Lin
- J. Craig Venter Institute, Rockville, Maryland, United States of America
| | - Amy Ransier
- J. Craig Venter Institute, Rockville, Maryland, United States of America
| | | | - Suman R. Das
- J. Craig Venter Institute, Rockville, Maryland, United States of America
| | - Anthony S. Gilbert
- hVivo PLC, The QMB Innovation Centre, Queen Mary, University of London, London, United Kingdom
| | - Rob Lambkin-Williams
- hVivo PLC, The QMB Innovation Centre, Queen Mary, University of London, London, United Kingdom
| | - Geoffrey S. Ginsburg
- Duke Center for Applied Genomics and Precision Medicine, Durham, North Carolina, United States of America
| | - Christopher W. Woods
- Duke Center for Applied Genomics and Precision Medicine, Durham, North Carolina, United States of America
| | - Katia Koelle
- Department of Biology, Duke University, Durham, North Carolina, United States of America
| | - Christopher J. R. Illingworth
- Department of Genetics, University of Cambridge, Cambridge, United Kingdom
- Department of Applied Maths and Theoretical Physics, Centre for Mathematical Sciences, Wilberforce Road, University of Cambridge, Cambridge, United Kingdom
- * E-mail:
| |
Collapse
|
19
|
Nshogozabahizi JC, Dench J, Aris-Brosou S. Widespread Historical Contingency in Influenza Viruses. Genetics 2017; 205:409-420. [PMID: 28049709 PMCID: PMC5223518 DOI: 10.1534/genetics.116.193979] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2016] [Accepted: 11/04/2016] [Indexed: 11/18/2022] Open
Abstract
In systems biology and genomics, epistasis characterizes the impact that a substitution at a particular location in a genome can have on a substitution at another location. This phenomenon is often implicated in the evolution of drug resistance or to explain why particular "disease-causing" mutations do not have the same outcome in all individuals. Hence, uncovering these mutations and their locations in a genome is a central question in biology. However, epistasis is notoriously difficult to uncover, especially in fast-evolving organisms. Here, we present a novel statistical approach that replies on a model developed in ecology and that we adapt to analyze genetic data in fast-evolving systems such as the influenza A virus. We validate the approach using a two-pronged strategy: extensive simulations demonstrate a low-to-moderate sensitivity with excellent specificity and precision, while analyses of experimentally validated data recover known interactions, including in a eukaryotic system. We further evaluate the ability of our approach to detect correlated evolution during antigenic shifts or at the emergence of drug resistance. We show that in all cases, correlated evolution is prevalent in influenza A viruses, involving many pairs of sites linked together in chains; a hallmark of historical contingency. Strikingly, interacting sites are separated by large physical distances, which entails either long-range conformational changes or functional tradeoffs, for which we find support with the emergence of drug resistance. Our work paves a new way for the unbiased detection of epistasis in a wide range of organisms by performing whole-genome scans.
Collapse
Affiliation(s)
| | - Jonathan Dench
- Department of Biology, University of Ottawa, Ontario K1N 6N5, Canada
| | - Stéphane Aris-Brosou
- Department of Biology, University of Ottawa, Ontario K1N 6N5, Canada
- Department of Mathematics and Statistics, University of Ottawa, Ontario K1N 6N5, Canada
| |
Collapse
|
20
|
Koelle K, Rasmussen DA. The effects of a deleterious mutation load on patterns of influenza A/H3N2's antigenic evolution in humans. eLife 2015; 4:e07361. [PMID: 26371556 PMCID: PMC4611170 DOI: 10.7554/elife.07361] [Citation(s) in RCA: 48] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2015] [Accepted: 09/14/2015] [Indexed: 11/19/2022] Open
Abstract
Recent phylogenetic analyses indicate that RNA virus populations carry a significant deleterious mutation load. This mutation load has the potential to shape patterns of adaptive evolution via genetic linkage to beneficial mutations. Here, we examine the effect of deleterious mutations on patterns of influenza A subtype H3N2's antigenic evolution in humans. By first analyzing simple models of influenza that incorporate a mutation load, we show that deleterious mutations, as expected, act to slow the virus's rate of antigenic evolution, while making it more punctuated in nature. These models further predict three distinct molecular pathways by which antigenic cluster transitions occur, and we find phylogenetic patterns consistent with each of these pathways in influenza virus sequences. Simulations of a more complex phylodynamic model further indicate that antigenic mutations act in concert with deleterious mutations to reproduce influenza's spindly hemagglutinin phylogeny, co-circulation of antigenic variants, and high annual attack rates. DOI:http://dx.doi.org/10.7554/eLife.07361.001 Each year, up to 15% of the world's population experience symptoms of an influenza infection, also commonly known as flu. The most common culprit is a strain of the virus called influenza type A subtype H3N2. One reason that so many people become infected each year is that this virus evolves rapidly. Within a few years, proteins on the surface of the virus known as antigens become less recognizable to the immune system of a person who has been previously infected. This means that the person can become ill with the virus again because their immune system cannot mount an effective response to the evolved virus strain. Influenza virus strains evolve rapidly because their genetic material accumulates mutations quickly. Although some of these mutations are beneficial to the virus, other mutations are harmful and reduce the ability of the virus to spread. Sometimes beneficial mutations may occur alongside harmful ones, but it is not known how the harmful mutations affect the evolution of the virus. Here, Koelle and Rasmussen used computer models of H3N2 influenza to examine the effect of harmful mutations on the evolution of this virus population. The models show that harmful mutations limit how quickly the antigens can evolve. Also, the presence of these harmful mutations effectively acts as a sieve: they allow only large changes in the antigens to establish in the virus population. The models suggest that there are three routes by which large changes in the antigens on H3N2 viruses may occur. The first is by a single mutation that has a big effect on the antigens in viruses that only carry a few harmful mutations, but these large mutations would not happen very often. Another route may be through more common mutations that have only a small or moderate benefit, which would allow the virus to become more common in the population before it acquires a beneficial mutation with a much greater effect. The third possibility is that a large beneficial mutation may arise in viruses that have many harmful mutations. These harmful mutations may initially limit the ability of the virus to spread, but over time, some of these harmful mutations may then be lost. Koelle and Rasmussen found that the computer models could recreate the patterns of virus evolution that have been observed in real strains of H3N2. Researchers use predictions of influenza evolution to help them decide which virus strains should be included in flu vaccines each year. Koelle and Rasmussen findings indicate that harmful mutations should be considered when making these predictions. DOI:http://dx.doi.org/10.7554/eLife.07361.002
Collapse
Affiliation(s)
- Katia Koelle
- Department of Biology, Duke University, Durham, United States.,Fogarty International Center, National Institutes of Health, Bethesda, United States
| | - David A Rasmussen
- Department of Biology, Duke University, Durham, United States.,Department of Biosystems Science and Engineering, Eidgenössische Technische Hochschule Zürich, Basel, Switzerland
| |
Collapse
|
21
|
Neverov AD, Kryazhimskiy S, Plotkin JB, Bazykin GA. Coordinated Evolution of Influenza A Surface Proteins. PLoS Genet 2015; 11:e1005404. [PMID: 26247472 PMCID: PMC4527594 DOI: 10.1371/journal.pgen.1005404] [Citation(s) in RCA: 51] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2014] [Accepted: 06/30/2015] [Indexed: 11/18/2022] Open
Abstract
The surface proteins hemagglutinin (HA) and neuraminidase (NA) of human influenza A virus evolve under selection pressures to escape adaptive immune responses and antiviral drug treatments. In addition to these external selection pressures, some mutations in HA are known to affect the adaptive landscape of NA, and vice versa, because these two proteins are physiologically interlinked. However, the extent to which evolution of one protein affects the evolution of the other one is unknown. Here we develop a novel phylogenetic method for detecting the signatures of such genetic interactions between mutations in different genes – that is, inter-gene epistasis. Using this method, we show that influenza surface proteins evolve in a coordinated way, with mutations in HA affecting subsequent spread of mutations in NA and vice versa, at many sites. Of particular interest is our finding that the oseltamivir-resistance mutations in NA in subtype H1N1 were likely facilitated by prior mutations in HA. Our results illustrate that the adaptive landscape of a viral protein is remarkably sensitive to its genomic context and, more generally, that the evolution of any single protein must be understood within the context of the entire evolving genome. The fitness of an organism depends on the coordinated function of many genes. Thus, how a mutation in one gene affects fitness often depends on what mutations are present in other genes. This dependence is called “genetic interaction” or “epistasis”. The prevalence and type of such interactions are not well understood. Epistasis can be inferred from time-series sequencing data when a mutation in one gene is observed to facilitate the spread of a mutation in another gene. However, the situation is much more complicated when new combinations of genes are formed by processes such as recombination or reassortment. In such cases, deducing the time and order of genetic changes is difficult. Here, we devise a method to infer pairs of mutations in different genes which closely follow one another in the presence of reassortment. We apply it to evolution of two surface proteins of influenza A virus, hemagglutinin and neuraminidase, which are important targets for the human immune system and drugs. We show that mutations in one of these proteins are often facilitated by prior mutations, or compensated by subsequent mutations, in the other protein. In particular, drug-resistance mutations in neuraminidase were likely made possible by prior mutation in hemagglutinin. Knowledge of such interactions is necessary to fully understand and predict evolution.
Collapse
Affiliation(s)
| | - Sergey Kryazhimskiy
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts, United States of America
- FAS Center for Systems Biology, Harvard University, Cambridge, Massachusetts, United States of America
| | - Joshua B. Plotkin
- Department of Biology, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
| | - Georgii A. Bazykin
- Institute for Information Transmission Problems (Kharkevich Institute) of the Russian Academy of Sciences, Moscow, Russia
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Moscow, Russia
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Moscow, Russia
- Pirogov Russian National Research Medical University, Moscow, Russia
- * E-mail:
| |
Collapse
|
22
|
Illingworth CJR, Fischer A, Mustonen V. Identifying selection in the within-host evolution of influenza using viral sequence data. PLoS Comput Biol 2014; 10:e1003755. [PMID: 25080215 PMCID: PMC4117419 DOI: 10.1371/journal.pcbi.1003755] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2013] [Accepted: 06/13/2014] [Indexed: 02/07/2023] Open
Abstract
The within-host evolution of influenza is a vital component of its epidemiology. A question of particular interest is the role that selection plays in shaping the viral population over the course of a single infection. We here describe a method to measure selection acting upon the influenza virus within an individual host, based upon time-resolved genome sequence data from an infection. Analysing sequence data from a transmission study conducted in pigs, describing part of the haemagglutinin gene (HA1) of an influenza virus, we find signatures of non-neutrality in six of a total of sixteen infections. We find evidence for both positive and negative selection acting upon specific alleles, while in three cases, the data suggest the presence of time-dependent selection. In one infection we observe what is potentially a specific immune response against the virus; a non-synonymous mutation in an epitope region of the virus is found to be under initially positive, then strongly negative selection. Crucially, given the lack of homologous recombination in influenza, our method accounts for linkage disequilibrium between nucleotides at different positions in the haemagglutinin gene, allowing for the analysis of populations in which multiple mutations are present at any given time. Our approach offers a new insight into the dynamics of influenza infection, providing a detailed characterisation of the forces that underlie viral evolution. The evolution of the influenza virus is of great importance for human health. Through evolution, current influenza viruses develop the ability to infect people who have been vaccinated against earlier strains. New strains of influenza that infect birds and pigs could evolve to infect and spread between people, causing a global pandemic. The influenza virus lives within a human or animal host, so that viral evolution happens within, or in the spread between, individuals. As such, what happens to the virus during the course of an infection is a question of great interest. We here describe a statistical method that uses viral genome sequence data to measure how evolution affects the influenza virus within a single host. Studying data from infections transmitted between pigs, we find evidence for evolutionary adaptation in six of sixteen animals for which data were available. In one case, an immune response mounted by a pig against the virus is apparent. Our method provides a statistical framework for using sequence data to study viral evolution on very short timescales, enabling new research into within-host viral evolution.
Collapse
Affiliation(s)
| | - Andrej Fischer
- Wellcome Trust Sanger Institute, Hinxton, Cambridge, United Kingdom
| | - Ville Mustonen
- Wellcome Trust Sanger Institute, Hinxton, Cambridge, United Kingdom
| |
Collapse
|
23
|
Abstract
The seasonal human influenza A/H3N2 virus undergoes rapid evolution, which produces significant year-to-year sequence turnover in the population of circulating strains. Adaptive mutations respond to human immune challenge and occur primarily in antigenic epitopes, the antibody-binding domains of the viral surface protein haemagglutinin. Here we develop a fitness model for haemagglutinin that predicts the evolution of the viral population from one year to the next. Two factors are shown to determine the fitness of a strain: adaptive epitope changes and deleterious mutations outside the epitopes. We infer both fitness components for the strains circulating in a given year, using population-genetic data of all previous strains. From fitness and frequency of each strain, we predict the frequency of its descendent strains in the following year. This fitness model maps the adaptive history of influenza A and suggests a principled method for vaccine selection. Our results call for a more comprehensive epidemiology of influenza and other fast-evolving pathogens that integrates antigenic phenotypes with other viral functions coupled by genetic linkage.
Collapse
|
24
|
Neverov AD, Lezhnina KV, Kondrashov AS, Bazykin GA. Intrasubtype reassortments cause adaptive amino acid replacements in H3N2 influenza genes. PLoS Genet 2014; 10:e1004037. [PMID: 24415946 PMCID: PMC3886890 DOI: 10.1371/journal.pgen.1004037] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2013] [Accepted: 11/01/2013] [Indexed: 01/15/2023] Open
Abstract
Reassortments and point mutations are two major contributors to diversity of Influenza A virus; however, the link between these two processes is unclear. It has been suggested that reassortments provoke a temporary increase in the rate of amino acid changes as the viral proteins adapt to new genetic environment, but this phenomenon has not been studied systematically. Here, we use a phylogenetic approach to infer the reassortment events between the 8 segments of influenza A H3N2 virus since its emergence in humans in 1968. We then study the amino acid replacements that occurred in genes encoded in each segment subsequent to reassortments. In five out of eight genes (NA, M1, HA, PB1 and NS1), the reassortment events led to a transient increase in the rate of amino acid replacements on the descendant phylogenetic branches. In NA and HA, the replacements following reassortments were enriched with parallel and/or reversing replacements; in contrast, the replacements at sites responsible for differences between antigenic clusters (in HA) and at sites under positive selection (in NA) were underrepresented among them. Post-reassortment adaptive walks contribute to adaptive evolution in Influenza A: in NA, an average reassortment event causes at least 2.1 amino acid replacements in a reassorted gene, with, on average, 0.43 amino acid replacements per evolving post-reassortment lineage; and at least ~9% of all amino acid replacements are provoked by reassortments.
Collapse
Affiliation(s)
- Alexey D. Neverov
- Federal Budget Institution of Science “Central Research Institute for Epidemiology”, Moscow, Russia
- Department of Bioengineering and Bioinformatics, M.V. Lomonosov Moscow State University, Moscow, Russia
| | - Ksenia V. Lezhnina
- Department of Bioengineering and Bioinformatics, M.V. Lomonosov Moscow State University, Moscow, Russia
| | - Alexey S. Kondrashov
- Department of Bioengineering and Bioinformatics, M.V. Lomonosov Moscow State University, Moscow, Russia
- Life Sciences Institute and Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Georgii A. Bazykin
- Department of Bioengineering and Bioinformatics, M.V. Lomonosov Moscow State University, Moscow, Russia
- Institute for Information Transmission Problems of the Russian Academy of Sciences (Kharkevich Institute), Moscow, Russia
- * E-mail:
| |
Collapse
|