1
|
Perofsky AC, Huddleston J, Hansen C, Barnes JR, Rowe T, Xu X, Kondor R, Wentworth DE, Lewis N, Whittaker L, Ermetal B, Harvey R, Galiano M, Daniels RS, McCauley JW, Fujisaki S, Nakamura K, Kishida N, Watanabe S, Hasegawa H, Sullivan SG, Barr IG, Subbarao K, Krammer F, Bedford T, Viboud C. Antigenic drift and subtype interference shape A(H3N2) epidemic dynamics in the United States. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2023.10.02.23296453. [PMID: 37873362 PMCID: PMC10593063 DOI: 10.1101/2023.10.02.23296453] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/25/2023]
Abstract
Influenza viruses continually evolve new antigenic variants, through mutations in epitopes of their major surface proteins, hemagglutinin (HA) and neuraminidase (NA). Antigenic drift potentiates the reinfection of previously infected individuals, but the contribution of this process to variability in annual epidemics is not well understood. Here we link influenza A(H3N2) virus evolution to regional epidemic dynamics in the United States during 1997-2019. We integrate phenotypic measures of HA antigenic drift and sequence-based measures of HA and NA fitness to infer antigenic and genetic distances between viruses circulating in successive seasons. We estimate the magnitude, severity, timing, transmission rate, age-specific patterns, and subtype dominance of each regional outbreak and find that genetic distance based on broad sets of epitope sites is the strongest evolutionary predictor of A(H3N2) virus epidemiology. Increased HA and NA epitope distance between seasons correlates with larger, more intense epidemics, higher transmission, greater A(H3N2) subtype dominance, and a greater proportion of cases in adults relative to children, consistent with increased population susceptibility. Based on random forest models, A(H1N1) incidence impacts A(H3N2) epidemics to a greater extent than viral evolution, suggesting that subtype interference is a major driver of influenza A virus infection dynamics, presumably via heterosubtypic cross-immunity.
Collapse
Affiliation(s)
- Amanda C Perofsky
- Fogarty International Center, National Institutes of Health, United States
- Brotman Baty Institute for Precision Medicine, University of Washington, United States
| | - John Huddleston
- Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Center, United States
| | - Chelsea Hansen
- Fogarty International Center, National Institutes of Health, United States
- Brotman Baty Institute for Precision Medicine, University of Washington, United States
| | - John R Barnes
- Virology Surveillance and Diagnosis Branch, Influenza Division, National Center for Immunization and Respiratory Diseases (NCIRD), Centers for Disease Control and Prevention (CDC), United States
| | - Thomas Rowe
- Virology Surveillance and Diagnosis Branch, Influenza Division, National Center for Immunization and Respiratory Diseases (NCIRD), Centers for Disease Control and Prevention (CDC), United States
| | - Xiyan Xu
- Virology Surveillance and Diagnosis Branch, Influenza Division, National Center for Immunization and Respiratory Diseases (NCIRD), Centers for Disease Control and Prevention (CDC), United States
| | - Rebecca Kondor
- Virology Surveillance and Diagnosis Branch, Influenza Division, National Center for Immunization and Respiratory Diseases (NCIRD), Centers for Disease Control and Prevention (CDC), United States
| | - David E Wentworth
- Virology Surveillance and Diagnosis Branch, Influenza Division, National Center for Immunization and Respiratory Diseases (NCIRD), Centers for Disease Control and Prevention (CDC), United States
| | - Nicola Lewis
- WHO Collaborating Centre for Reference and Research on Influenza, Crick Worldwide Influenza Centre, The Francis Crick Institute, United Kingdom
| | - Lynne Whittaker
- WHO Collaborating Centre for Reference and Research on Influenza, Crick Worldwide Influenza Centre, The Francis Crick Institute, United Kingdom
| | - Burcu Ermetal
- WHO Collaborating Centre for Reference and Research on Influenza, Crick Worldwide Influenza Centre, The Francis Crick Institute, United Kingdom
| | - Ruth Harvey
- WHO Collaborating Centre for Reference and Research on Influenza, Crick Worldwide Influenza Centre, The Francis Crick Institute, United Kingdom
| | - Monica Galiano
- WHO Collaborating Centre for Reference and Research on Influenza, Crick Worldwide Influenza Centre, The Francis Crick Institute, United Kingdom
| | - Rodney Stuart Daniels
- WHO Collaborating Centre for Reference and Research on Influenza, Crick Worldwide Influenza Centre, The Francis Crick Institute, United Kingdom
| | - John W McCauley
- WHO Collaborating Centre for Reference and Research on Influenza, Crick Worldwide Influenza Centre, The Francis Crick Institute, United Kingdom
| | - Seiichiro Fujisaki
- Influenza Virus Research Center, National Institute of Infectious Diseases, Japan
| | - Kazuya Nakamura
- Influenza Virus Research Center, National Institute of Infectious Diseases, Japan
| | - Noriko Kishida
- Influenza Virus Research Center, National Institute of Infectious Diseases, Japan
| | - Shinji Watanabe
- Influenza Virus Research Center, National Institute of Infectious Diseases, Japan
| | - Hideki Hasegawa
- Influenza Virus Research Center, National Institute of Infectious Diseases, Japan
| | - Sheena G Sullivan
- WHO Collaborating Centre for Reference and Research on Influenza, The Peter Doherty Institute for Infection and Immunity, Department of Microbiology and Immunology, The University of Melbourne, The Peter Doherty Institute for Infection and Immunity, Australia
| | - Ian G Barr
- WHO Collaborating Centre for Reference and Research on Influenza, The Peter Doherty Institute for Infection and Immunity, Department of Microbiology and Immunology, The University of Melbourne, The Peter Doherty Institute for Infection and Immunity, Australia
| | - Kanta Subbarao
- WHO Collaborating Centre for Reference and Research on Influenza, The Peter Doherty Institute for Infection and Immunity, Department of Microbiology and Immunology, The University of Melbourne, The Peter Doherty Institute for Infection and Immunity, Australia
| | - Florian Krammer
- Center for Vaccine Research and Pandemic Preparedness (C-VaRPP), Icahn School of Medicine at Mount Sinai, United States
- Department of Pathology, Molecular and Cell-Based Medicine, Icahn School of Medicine at Mount Sinai, United States
| | - Trevor Bedford
- Brotman Baty Institute for Precision Medicine, University of Washington, United States
- Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Center, United States
- Department of Genome Sciences, University of Washington, United States
- Howard Hughes Medical Institute, Seattle, United States
| | - Cécile Viboud
- Fogarty International Center, National Institutes of Health, United States
| |
Collapse
|
2
|
Klink GV, Kalinina OV, Bazykin GA. Changing selection on amino acid substitutions in Gag protein between major HIV-1 subtypes. Virus Evol 2024; 10:veae036. [PMID: 38808036 PMCID: PMC11131029 DOI: 10.1093/ve/veae036] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2023] [Revised: 12/27/2023] [Accepted: 04/28/2024] [Indexed: 05/30/2024] Open
Abstract
Amino acid preferences at a protein site depend on the role of this site in protein function and structure as well as on external constraints. All these factors can change in the course of evolution, making amino acid propensities of a site time-dependent. When viral subtypes divergently evolve in different host subpopulations, such changes may depend on genetic, medical, and sociocultural differences between these subpopulations. Here, using our previously developed phylogenetic approach, we describe sixty-nine amino acid sites of the Gag protein of human immunodeficiency virus type 1 (HIV-1) where amino acids have different impact on viral fitness in six major subtypes of the type M. These changes in preferences trigger adaptive evolution; indeed, 32 (46 per cent) of these sites experienced strong positive selection at least in one of the subtypes. At some of the sites, changes in amino acid preferences may be associated with differences in immune escape between subtypes. The prevalence of an amino acid in a protein site within a subtype is only a poor predictor for whether this amino acid is preferred in this subtype according to the phylogenetic analysis. Therefore, attempts to identify the factors of viral evolution from comparative genomics data should integrate across multiple sources of information.
Collapse
Affiliation(s)
- Galya V Klink
- Laboratory of Molecular Evolution, Institute for Information Transmission Problems (Kharkevich Institute) of the Russian Academy of Sciences, Bolshoy Karetny per. 19, build.1, Moscow 127051, Russia
- Center for Molecular and Cellular Biology, Skolkovo Institute of Science and Technology, Bolshoy Boulevard, 30, p.1, Skolkovo 121205, Russia
| | - Olga V Kalinina
- Drug Bioinformatics, Helmholtz Institute for Pharmaceutical Research Saarland (HIPS)/Helmholtz Centre for Infection Research (HZI), Campus E8.1, Saarbrücken 66123, Germany
- Center for Bioinformatics, Saarland University, Campus E2.1, Saarbrücken 66123, Germany
- Medical Faculty, Saarland University, Kirrberger Str. 100, Homburg 66421, Germany
| | - Georgii A Bazykin
- Laboratory of Molecular Evolution, Institute for Information Transmission Problems (Kharkevich Institute) of the Russian Academy of Sciences, Bolshoy Karetny per. 19, build.1, Moscow 127051, Russia
| |
Collapse
|
3
|
Lou J, Liang W, Cao L, Hu I, Zhao S, Chen Z, Chan RWY, Cheung PPH, Zheng H, Liu C, Li Q, Chong MKC, Zhang Y, Yeoh EK, Chan PKS, Zee BCY, Mok CKP, Wang MH. Predictive evolutionary modelling for influenza virus by site-based dynamics of mutations. Nat Commun 2024; 15:2546. [PMID: 38514647 PMCID: PMC10958014 DOI: 10.1038/s41467-024-46918-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2023] [Accepted: 03/12/2024] [Indexed: 03/23/2024] Open
Abstract
Influenza virus continuously evolves to escape human adaptive immunity and generates seasonal epidemics. Therefore, influenza vaccine strains need to be updated annually for the upcoming flu season to ensure vaccine effectiveness. We develop a computational approach, beth-1, to forecast virus evolution and select representative virus for influenza vaccine. The method involves modelling site-wise mutation fitness. Informed by virus genome and population sero-positivity, we calibrate transition time of mutations and project the fitness landscape to future time, based on which beth-1 selects the optimal vaccine strain. In season-to-season prediction in historical data for the influenza A pH1N1 and H3N2 viruses, beth-1 demonstrates superior genetic matching compared to existing approaches. In prospective validations, the model shows superior or non-inferior genetic matching and neutralization against circulating virus in mice immunization experiments compared to the current vaccine. The method offers a promising and ready-to-use tool to facilitate vaccine strain selection for the influenza virus through capturing heterogeneous evolutionary dynamics over genome space-time and linking molecular variants to population immune response.
Collapse
Affiliation(s)
- Jingzhi Lou
- JC School of Public Health and Primary Care (JCSPHPC), The Chinese University of Hong Kong (CUHK), Hong Kong SAR, China
- Beth Bioinformatics Co. Ltd, Hong Kong SAR, China
| | - Weiwen Liang
- HKU-Pasteur Research Pole, School of Public Health, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China
| | - Lirong Cao
- JC School of Public Health and Primary Care (JCSPHPC), The Chinese University of Hong Kong (CUHK), Hong Kong SAR, China
- CUHK Shenzhen Research Institute, Shenzhen, China
| | - Inchi Hu
- Department of Statistics, George Mason University, Fairfax, VA, USA
| | - Shi Zhao
- JC School of Public Health and Primary Care (JCSPHPC), The Chinese University of Hong Kong (CUHK), Hong Kong SAR, China
- School of Public Health, Tianjin Medical University, Tianjin, China
| | - Zigui Chen
- Department of Microbiology, CUHK, Hong Kong SAR, China
| | - Renee Wan Yi Chan
- Department of Paediatrics, CUHK, Hong Kong SAR, China
- Hong Kong Hub of Paediatric Excellence, CUHK, Hong Kong SAR, China
| | | | - Hong Zheng
- JC School of Public Health and Primary Care (JCSPHPC), The Chinese University of Hong Kong (CUHK), Hong Kong SAR, China
| | - Caiqi Liu
- JC School of Public Health and Primary Care (JCSPHPC), The Chinese University of Hong Kong (CUHK), Hong Kong SAR, China
| | - Qi Li
- JC School of Public Health and Primary Care (JCSPHPC), The Chinese University of Hong Kong (CUHK), Hong Kong SAR, China
| | - Marc Ka Chun Chong
- JC School of Public Health and Primary Care (JCSPHPC), The Chinese University of Hong Kong (CUHK), Hong Kong SAR, China
- CUHK Shenzhen Research Institute, Shenzhen, China
| | - Yexian Zhang
- Beth Bioinformatics Co. Ltd, Hong Kong SAR, China
- CUHK Shenzhen Research Institute, Shenzhen, China
| | - Eng-Kiong Yeoh
- JC School of Public Health and Primary Care (JCSPHPC), The Chinese University of Hong Kong (CUHK), Hong Kong SAR, China
- Centre for Health Systems and Policy Research, CUHK, Hong Kong SAR, China
| | - Paul Kay-Sheung Chan
- Department of Microbiology, CUHK, Hong Kong SAR, China
- Stanley Ho Centre for Emerging Infectious Diseases, CUHK, Hong Kong SAR, China
| | - Benny Chung Ying Zee
- JC School of Public Health and Primary Care (JCSPHPC), The Chinese University of Hong Kong (CUHK), Hong Kong SAR, China
- CUHK Shenzhen Research Institute, Shenzhen, China
| | - Chris Ka Pun Mok
- JC School of Public Health and Primary Care (JCSPHPC), The Chinese University of Hong Kong (CUHK), Hong Kong SAR, China.
- Li Ka Shing Institute of Health Sciences, Faculty of Medicine, CUHK, Hong Kong SAR, China.
| | - Maggie Haitian Wang
- JC School of Public Health and Primary Care (JCSPHPC), The Chinese University of Hong Kong (CUHK), Hong Kong SAR, China.
- CUHK Shenzhen Research Institute, Shenzhen, China.
| |
Collapse
|
4
|
Gilbertson B, Subbarao K. What Have We Learned by Resurrecting the 1918 Influenza Virus? Annu Rev Virol 2023; 10:25-47. [PMID: 37774132 DOI: 10.1146/annurev-virology-111821-104408] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/01/2023]
Abstract
The 1918 Spanish influenza pandemic was one of the deadliest infectious disease events in recorded history, resulting in approximately 50-100 million deaths worldwide. The origins of the 1918 virus and the molecular basis for its exceptional virulence remained a mystery for much of the 20th century because the pandemic predated virologic techniques to isolate, passage, and store influenza viruses. In the late 1990s, overlapping fragments of influenza viral RNA preserved in the tissues of several 1918 victims were amplified and sequenced. The use of influenza reverse genetics then permitted scientists to reconstruct the 1918 virus entirely from cloned complementary DNA, leading to new insights into the origin of the virus and its pathogenicity. Here, we discuss some of the advances made by resurrection of the 1918 virus, including the rise of innovative molecular research, which is a topic in the dual use debate.
Collapse
Affiliation(s)
- Brad Gilbertson
- Department of Microbiology and Immunology, The University of Melbourne, Melbourne, Victoria, Australia
| | - Kanta Subbarao
- Department of Microbiology and Immunology, The University of Melbourne, Melbourne, Victoria, Australia
- WHO Collaborating Centre for Reference and Research on Influenza, The Peter Doherty Institute for Infection and Immunity, Melbourne, Victoria, Australia;
| |
Collapse
|
5
|
Servajean R, Bitbol AF. Impact of population size on early adaptation in rugged fitness landscapes. Philos Trans R Soc Lond B Biol Sci 2023; 378:20220045. [PMID: 37004726 PMCID: PMC10067268 DOI: 10.1098/rstb.2022.0045] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/04/2023] Open
Abstract
Owing to stochastic fluctuations arising from finite population size, known as genetic drift, the ability of a population to explore a rugged fitness landscape depends on its size. In the weak mutation regime, while the mean steady-state fitness increases with population size, we find that the height of the first fitness peak encountered when starting from a random genotype displays various behaviours versus population size, even among small and simple rugged landscapes. We show that the accessibility of the different fitness peaks is key to determining whether this height overall increases or decreases with population size. Furthermore, there is often a finite population size that maximizes the height of the first fitness peak encountered when starting from a random genotype. This holds across various classes of model rugged landscapes with sparse peaks, and in some experimental and experimentally inspired ones. Thus, early adaptation in rugged fitness landscapes can be more efficient and predictable for relatively small population sizes than in the large-size limit. This article is part of the theme issue ‘Interdisciplinary approaches to predicting evolutionary biology’.
Collapse
Affiliation(s)
- Richard Servajean
- Institute of Bioengineering, School of Life Sciences, École Polytechnique Fédérale de Lausanne (EPFL), 1015 Lausanne, Switzerland
- SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Anne-Florence Bitbol
- Institute of Bioengineering, School of Life Sciences, École Polytechnique Fédérale de Lausanne (EPFL), 1015 Lausanne, Switzerland
- SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| |
Collapse
|
6
|
Moulana A, Dupic T, Phillips AM, Desai MM. Genotype-phenotype landscapes for immune-pathogen coevolution. Trends Immunol 2023; 44:384-396. [PMID: 37024340 PMCID: PMC10147585 DOI: 10.1016/j.it.2023.03.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2023] [Revised: 03/08/2023] [Accepted: 03/09/2023] [Indexed: 04/07/2023]
Abstract
Our immune systems constantly coevolve with the pathogens that challenge them, as pathogens adapt to evade our defense responses, with our immune repertoires shifting in turn. These coevolutionary dynamics take place across a vast and high-dimensional landscape of potential pathogen and immune receptor sequence variants. Mapping the relationship between these genotypes and the phenotypes that determine immune-pathogen interactions is crucial for understanding, predicting, and controlling disease. Here, we review recent developments applying high-throughput methods to create large libraries of immune receptor and pathogen protein sequence variants and measure relevant phenotypes. We describe several approaches that probe different regions of the high-dimensional sequence space and comment on how combinations of these methods may offer novel insight into immune-pathogen coevolution.
Collapse
Affiliation(s)
- Alief Moulana
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA
| | - Thomas Dupic
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA
| | - Angela M Phillips
- Department of Microbiology and Immunology, University of California at San Francisco, San Francisco, CA 94143, USA
| | - Michael M Desai
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA; Department of Physics, Harvard University, Cambridge, MA 02138, USA; NSF-Simons Center for Mathematical and Statistical Analysis of Biology, Harvard University, Cambridge, MA 02138, USA; Quantitative Biology Initiative, Harvard University, Cambridge, MA 02138, USA.
| |
Collapse
|
7
|
Neverov AD, Fedonin G, Popova A, Bykova D, Bazykin G. Coordinated evolution at amino acid sites of SARS-CoV-2 spike. eLife 2023; 12:82516. [PMID: 36752391 PMCID: PMC9908078 DOI: 10.7554/elife.82516] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2022] [Accepted: 01/15/2023] [Indexed: 02/05/2023] Open
Abstract
SARS-CoV-2 has adapted in a stepwise manner, with multiple beneficial mutations accumulating in a rapid succession at origins of VOCs, and the reasons for this are unclear. Here, we searched for coordinated evolution of amino acid sites in the spike protein of SARS-CoV-2. Specifically, we searched for concordantly evolving site pairs (CSPs) for which changes at one site were rapidly followed by changes at the other site in the same lineage. We detected 46 sites which formed 45 CSP. Sites in CSP were closer to each other in the protein structure than random pairs, indicating that concordant evolution has a functional basis. Notably, site pairs carrying lineage defining mutations of the four VOCs that circulated before May 2021 are enriched in CSPs. For the Alpha VOC, the enrichment is detected even if Alpha sequences are removed from analysis, indicating that VOC origin could have been facilitated by positive epistasis. Additionally, we detected nine discordantly evolving pairs of sites where mutations at one site unexpectedly rarely occurred on the background of a specific allele at another site, for example on the background of wild-type D at site 614 (four pairs) or derived Y at site 501 (three pairs). Our findings hint that positive epistasis between accumulating mutations could have delayed the assembly of advantageous combinations of mutations comprising at least some of the VOCs.
Collapse
Affiliation(s)
- Alexey Dmitrievich Neverov
- HSE UniversityMoscowRussian Federation,Central Research Institute for EpidemiologyMoscowRussian Federation
| | - Gennady Fedonin
- Central Research Institute for EpidemiologyMoscowRussian Federation,Moscow Institute of Physics and Technology (National Research University)MoscowRussian Federation,Institute for Information Transmission Problems (Kharkevich Institute) of the Russian Academy of SciencesMoscowRussian Federation
| | - Anfisa Popova
- Central Research Institute for EpidemiologyMoscowRussian Federation
| | - Daria Bykova
- Central Research Institute for EpidemiologyMoscowRussian Federation,Lomonosov Moscow State UniversityMoscowRussian Federation
| | - Georgii Bazykin
- Institute for Information Transmission Problems (Kharkevich Institute) of the Russian Academy of SciencesMoscowRussian Federation,Skolkovo Institute of Science and TechnologyMoscowRussian Federation
| |
Collapse
|
8
|
Compensatory epistasis maintains ACE2 affinity in SARS-CoV-2 Omicron BA.1. Nat Commun 2022; 13:7011. [PMID: 36384919 PMCID: PMC9668218 DOI: 10.1038/s41467-022-34506-z] [Citation(s) in RCA: 56] [Impact Index Per Article: 28.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2022] [Accepted: 10/26/2022] [Indexed: 11/17/2022] Open
Abstract
The Omicron BA.1 variant emerged in late 2021 and quickly spread across the world. Compared to the earlier SARS-CoV-2 variants, BA.1 has many mutations, some of which are known to enable antibody escape. Many of these antibody-escape mutations individually decrease the spike receptor-binding domain (RBD) affinity for ACE2, but BA.1 still binds ACE2 with high affinity. The fitness and evolution of the BA.1 lineage is therefore driven by the combined effects of numerous mutations. Here, we systematically map the epistatic interactions between the 15 mutations in the RBD of BA.1 relative to the Wuhan Hu-1 strain. Specifically, we measure the ACE2 affinity of all possible combinations of these 15 mutations (215 = 32,768 genotypes), spanning all possible evolutionary intermediates from the ancestral Wuhan Hu-1 strain to BA.1. We find that immune escape mutations in BA.1 individually reduce ACE2 affinity but are compensated by epistatic interactions with other affinity-enhancing mutations, including Q498R and N501Y. Thus, the ability of BA.1 to evade immunity while maintaining ACE2 affinity is contingent on acquiring multiple interacting mutations. Our results implicate compensatory epistasis as a key factor driving substantial evolutionary change for SARS-CoV-2 and are consistent with Omicron BA.1 arising from a chronic infection.
Collapse
|
9
|
Liu T, Wang Y, Tan TJC, Wu NC, Brooke CB. The evolutionary potential of influenza A virus hemagglutinin is highly constrained by epistatic interactions with neuraminidase. Cell Host Microbe 2022; 30:1363-1369.e4. [PMID: 36150395 PMCID: PMC9588755 DOI: 10.1016/j.chom.2022.09.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2022] [Revised: 07/27/2022] [Accepted: 09/02/2022] [Indexed: 11/03/2022]
Abstract
Antigenic evolution of the influenza A virus (IAV) hemagglutinin (HA) gene limits efforts to effectively control the spread of the virus in the population. Efforts to understand the mechanisms governing HA antigenic evolution typically examine the HA gene in isolation. This can ignore the importance of balancing HA receptor binding activities with the receptor-destroying activities of the viral neuraminidase (NA) to maintain viral fitness. We hypothesize that the need to maintain functional balance with NA significantly constrains the evolutionary potential of the HA. We use deep mutational scanning and show that variation in NA activity significantly reshapes the HA fitness landscape by modulating the overall mutational robustness of HA. Consistent with this, we observe that different NA backgrounds support the emergence of distinct repertoires of HA escape variants under neutralizing antibody pressure. Our results reveal a critical role for intersegment epistasis in influencing the evolutionary potential of the HA gene.
Collapse
Affiliation(s)
- Tongyu Liu
- Department of Microbiology, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Yiquan Wang
- Department of Biochemistry, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Timothy J C Tan
- Center for Biophysics and Quantitative Biology, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Nicholas C Wu
- Department of Biochemistry, University of Illinois at Urbana-Champaign, Urbana, IL, USA; Center for Biophysics and Quantitative Biology, University of Illinois at Urbana-Champaign, Urbana, IL, USA; Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL, USA; Carle Illinois College of Medicine, University of Illinois at Urbana-Champaign, Urbana, IL, USA.
| | - Christopher B Brooke
- Department of Microbiology, University of Illinois at Urbana-Champaign, Urbana, IL, USA; Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL, USA.
| |
Collapse
|
10
|
Azbukina N, Zharikova A, Ramensky V. Intragenic compensation through the lens of deep mutational scanning. Biophys Rev 2022; 14:1161-1182. [PMID: 36345285 PMCID: PMC9636336 DOI: 10.1007/s12551-022-01005-w] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2022] [Accepted: 09/26/2022] [Indexed: 12/20/2022] Open
Abstract
A significant fraction of mutations in proteins are deleterious and result in adverse consequences for protein function, stability, or interaction with other molecules. Intragenic compensation is a specific case of positive epistasis when a neutral missense mutation cancels effect of a deleterious mutation in the same protein. Permissive compensatory mutations facilitate protein evolution, since without them all sequences would be extremely conserved. Understanding compensatory mechanisms is an important scientific challenge at the intersection of protein biophysics and evolution. In human genetics, intragenic compensatory interactions are important since they may result in variable penetrance of pathogenic mutations or fixation of pathogenic human alleles in orthologous proteins from related species. The latter phenomenon complicates computational and clinical inference of an allele's pathogenicity. Deep mutational scanning is a relatively new technique that enables experimental studies of functional effects of thousands of mutations in proteins. We review the important aspects of the field and discuss existing limitations of current datasets. We reviewed ten published DMS datasets with quantified functional effects of single and double mutations and described rates and patterns of intragenic compensation in eight of them. Supplementary Information The online version contains supplementary material available at 10.1007/s12551-022-01005-w.
Collapse
Affiliation(s)
- Nadezhda Azbukina
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, 1-73, Leninskie Gory, 119991 Moscow, Russia
| | - Anastasia Zharikova
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, 1-73, Leninskie Gory, 119991 Moscow, Russia
- National Medical Research Center for Therapy and Preventive Medicine, Petroverigsky per., 10, Bld.3, 101000 Moscow, Russia
| | - Vasily Ramensky
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, 1-73, Leninskie Gory, 119991 Moscow, Russia
- National Medical Research Center for Therapy and Preventive Medicine, Petroverigsky per., 10, Bld.3, 101000 Moscow, Russia
| |
Collapse
|
11
|
Shchur V, Spirin V, Sirotkin D, Burovski E, De Maio N, Corbett-Detig R. VGsim: Scalable viral genealogy simulator for global pandemic. PLoS Comput Biol 2022; 18:e1010409. [PMID: 36001646 PMCID: PMC9447924 DOI: 10.1371/journal.pcbi.1010409] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2021] [Revised: 09/06/2022] [Accepted: 07/18/2022] [Indexed: 11/24/2022] Open
Abstract
Accurate simulation of complex biological processes is an essential component of developing and validating new technologies and inference approaches. As an effort to help contain the COVID-19 pandemic, large numbers of SARS-CoV-2 genomes have been sequenced from most regions in the world. More than 5.5 million viral sequences are publicly available as of November 2021. Many studies estimate viral genealogies from these sequences, as these can provide valuable information about the spread of the pandemic across time and space. Additionally such data are a rich source of information about molecular evolutionary processes including natural selection, for example allowing the identification of new variants with transmissibility and immunity evasion advantages. To our knowledge, there is no framework that is both efficient and flexible enough to simulate the pandemic to approximate world-scale scenarios and generate viral genealogies of millions of samples. Here, we introduce a new fast simulator VGsim which addresses the problem of simulation genealogies under epidemiological models. The simulation process is split into two phases. During the forward run the algorithm generates a chain of population-level events reflecting the dynamics of the pandemic using an hierarchical version of the Gillespie algorithm. During the backward run a coalescent-like approach generates a tree genealogy of samples conditioning on the population-level events chain generated during the forward run. Our software can model complex population structure, epistasis and immunity escape. We develop a fast and flexible simulation software package VGsim for modeling epidemiological processes and generating genealogies of large pathogen samples. The software takes into account host population structure, pathogen evolution, host immunity and some other epidemiological aspects. The computational efficiency of the package allows to simulate genealogies of tens of millions of samples, which is important, e.g., for SARS-CoV-2 genome studies.
Collapse
Affiliation(s)
- Vladimir Shchur
- International laboratory of statistical and computational genomics, HSE University, Moscow, Russia
- * E-mail:
| | - Vadim Spirin
- International laboratory of statistical and computational genomics, HSE University, Moscow, Russia
| | - Dmitry Sirotkin
- International laboratory of statistical and computational genomics, HSE University, Moscow, Russia
| | | | - Nicola De Maio
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridgeshire, United Kingdom
| | - Russell Corbett-Detig
- Department of Biomolecular Engineering and Genomics Institute, UC Santa Cruz, California, United States of America
| |
Collapse
|
12
|
Christensen SR, Martin ET, Petrie JG, Monto AS, Hensley SE. The 2009 Pandemic H1N1 Hemagglutinin Stalk Remained Antigenically Stable after Circulating in Humans for a Decade. J Virol 2022; 96:e0220021. [PMID: 35588275 PMCID: PMC9175623 DOI: 10.1128/jvi.02200-21] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2021] [Accepted: 04/21/2022] [Indexed: 11/20/2022] Open
Abstract
An H1N1 influenza virus caused a pandemic in 2009, and descendants of this virus continue to circulate seasonally in humans. Upon infection with the 2009 H1N1 pandemic strain (pH1N1), many humans produced antibodies against epitopes in the hemagglutinin (HA) stalk. HA stalk-focused antibody responses were common among pH1N1-infected individuals because HA stalk epitopes were conserved between the pH1N1 strain and previously circulating H1N1 strains. Here, we completed a series of experiments to determine if the pH1N1 HA stalk has acquired substitutions since 2009 that prevent the binding of human antibodies. We identified several amino acid substitutions that accrued in the pH1N1 HA stalk from 2009 to 2019. We completed enzyme-linked immunosorbent assays, absorption-based binding assays, and surface plasmon resonance experiments to determine if these substitutions affect antibody binding. Using sera collected from 230 humans (aged 21 to 80 years), we found that pH1N1 HA stalk substitutions that have emerged since 2009 do not affect antibody binding. Our data suggest that the HA stalk domain of pH1N1 viruses remained antigenically stable after circulating in humans for a decade. IMPORTANCE In 2009, a new pandemic H1N1 (pH1N1) virus began circulating in humans. Many individuals mounted hemagglutinin (HA) stalk-focused antibody responses upon infection with the 2009 pH1N1 strain, since the HA stalk of this virus was relatively conserved with other seasonal H1N1 strains. Here, we completed a series of studies to determine if the 2009 pH1N1 strain has undergone antigenic drift in the HA stalk domain over the past decade. We found that serum antibodies from 230 humans could not antigenically distinguish the 2009 and 2019 HA stalk. These data suggest that the HA stalk of pH1N1 has remained antigenically stable, despite the presence of high levels of HA stalk antibodies within the human population.
Collapse
Affiliation(s)
- Shannon R. Christensen
- Department of Microbiology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| | - Emily T. Martin
- Department of Epidemiology, University of Michigan School of Public Health, Ann Arbor, Michigan, USA
| | - Joshua G. Petrie
- Department of Epidemiology, University of Michigan School of Public Health, Ann Arbor, Michigan, USA
| | - Arnold S. Monto
- Department of Epidemiology, University of Michigan School of Public Health, Ann Arbor, Michigan, USA
| | - Scott E. Hensley
- Department of Microbiology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| |
Collapse
|
13
|
Stolyarova AV, Neretina TV, Zvyagina EA, Fedotova AV, Kondrashov A, Bazykin GA. Complex fitness landscape shapes variation in a hyperpolymorphic species. eLife 2022; 11:76073. [PMID: 35532122 PMCID: PMC9187340 DOI: 10.7554/elife.76073] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2021] [Accepted: 05/09/2022] [Indexed: 11/13/2022] Open
Abstract
It is natural to assume that patterns of genetic variation in hyperpolymorphic species can reveal large-scale properties of the fitness landscape that are hard to detect by studying species with ordinary levels of genetic variation. Here, we study such patterns in a fungus Schizophyllum commune, the most polymorphic species known. Throughout the genome, short-range linkage disequilibrium (LD) caused by attraction of minor alleles is higher between pairs of nonsynonymous than of synonymous variants. This effect is especially pronounced for pairs of sites that are located within the same gene, especially if a large fraction of the gene is covered by haploblocks, genome segments where the gene pool consists of two highly divergent haplotypes, which is a signature of balancing selection. Haploblocks are usually shorter than 1000 nucleotides, and collectively cover about 10% of the S. commune genome. LD tends to be substantially higher for pairs of nonsynonymous variants encoding amino acids that interact within the protein. There is a substantial correlation between LDs at the same pairs of nonsynonymous mutations in the USA and the Russian populations. These patterns indicate that selection in S. commune involves positive epistasis due to compensatory interactions between nonsynonymous alleles. When less polymorphic species are studied, analogous patterns can be detected only through interspecific comparisons. Changes to DNA known as mutations may alter how the proteins and other components of a cell work, and thus play an important role in allowing living things to evolve new traits and abilities over many generations. Whether a mutation is beneficial or harmful may differ depending on the genetic background of the individual – that is, depending on other mutations present in other positions within the same gene – due to a phenomenon called epistasis. Epistasis is known to affect how various species accumulate differences in their DNA compared to each other over time. For example, a mutation that is rare in humans and known to cause disease may be widespread in other primates because its negative effect is canceled out by another mutation that is standard for these species but absent in humans. However, it remains unclear whether epistasis plays a significant part in shaping genetic differences between individuals of the same species. A type of fungus known as Schizophyllum commune lives on rotting wood and is found across the world. It is one of the most genetically diverse species currently known, so there is a higher chance of pairs of compensatory mutations occurring and persisting for a long time in S. commune than in most other species, providing a unique opportunity to study epistasis. Here, Stolyarova et al. studied two distinct populations of S. commune, one from the USA and one from Russia. The team found that – unlike in humans, flies and other less genetically diverse species – epistasis maintains combinations of mutations in S. commune that individually would be harmful to the fungus but together compensate for each other. For example, pairs of mutations affecting specific molecules known as amino acids – the building blocks of proteins – that physically interact with each other tended to be found together in the same individuals. One potential downside of having pairs of compensatory mutations in the genome is that when the organism reproduces, the process of making sex cells may split up these pairs so that harmful mutations are inherited without their partner mutations. Thus, epistasis may have helped shape the way S. commune and other genetically diverse species have evolved.
Collapse
Affiliation(s)
| | - Tatiana V Neretina
- Biological Faculty, Lomonosov Moscow State University, Moscow, Russian Federation
| | - Elena A Zvyagina
- Biological Faculty, Lomonosov Moscow State University, Moscow, Russian Federation
| | - Anna V Fedotova
- Skolkovo Institute of Science and Technology, Moscow, Russian Federation
| | - Alexey Kondrashov
- Department of Ecology and Evolutionary Biology, University of Michigan-Ann Arbor, Ann Arbor, United States
| | - Georgii A Bazykin
- Institute for Information Transmission Problems (Kharkevich Institute), Russian Academy of Sciences, Moscow, Russian Federation
| |
Collapse
|
14
|
Phylogenetic inference of changes in amino acid propensities with single-position resolution. PLoS Comput Biol 2022; 18:e1009878. [PMID: 35180226 PMCID: PMC9106220 DOI: 10.1371/journal.pcbi.1009878] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2021] [Revised: 05/13/2022] [Accepted: 01/28/2022] [Indexed: 11/19/2022] Open
Abstract
Fitness conferred by the same allele may differ between genotypes and environments, and these differences shape variation and evolution. Changes in amino acid propensities at protein sites over the course of evolution have been inferred from sequence alignments statistically, but the existing methods are data-intensive and aggregate multiple sites. Here, we develop an approach to detect individual amino acids that confer different fitness in different groups of species from combined sequence and phylogenetic data. Using the fact that the probability of a substitution to an amino acid depends on its fitness, our method looks for amino acids such that substitutions to them occur more frequently in one group of lineages than in another. We validate our method using simulated evolution of a protein site under different scenarios and show that it has high specificity for a wide range of assumptions regarding the underlying changes in selection, while its sensitivity differs between scenarios. We apply our method to the env gene of two HIV-1 subtypes, A and B, and to the HA gene of two influenza A subtypes, H1 and H3, and show that the inferred fitness changes are consistent with the fitness differences observed in deep mutational scanning experiments. We find that changes in relative fitness of different amino acid variants within a site do not always trigger episodes of positive selection and therefore may not result in an overall increase in the frequency of substitutions, but can still be detected from changes in relative frequencies of different substitutions.
Collapse
|
15
|
Doelger J, Kardar M, Chakraborty AK. Inferring the intrinsic mutational fitness landscape of influenzalike evolving antigens from temporally ordered sequence data. Phys Rev E 2022; 105:024401. [PMID: 35291059 DOI: 10.1103/physreve.105.024401] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2021] [Accepted: 01/19/2022] [Indexed: 06/14/2023]
Abstract
There still are no effective long-term protective vaccines against viruses that continuously evolve under immune pressure such as seasonal influenza, which has caused, and can cause, devastating epidemics in the human population. To find such a broadly protective immunization strategy, it is useful to know how easily the virus can escape via mutation from specific antibody responses. This information is encoded in the fitness landscape of the viral proteins (i.e., knowledge of the viral fitness as a function of sequence). Here we present a computational method to infer the intrinsic mutational fitness landscape of influenzalike evolving antigens from yearly sequence data. We test inference performance with computer-generated sequence data that are based on stochastic simulations mimicking basic features of immune-driven viral evolution. Although the numerically simulated model does create a phylogeny based on the allowed mutations, the inference scheme does not use this information. This provides a contrast to other methods that rely on reconstruction of phylogenetic trees. Our method just needs a sufficient number of samples over multiple years. With our method, we are able to infer single as well as pairwise mutational fitness effects from the simulated sequence time series for short antigenic proteins. Our fitness inference approach may have potential future use for the design of immunization protocols by identifying intrinsically vulnerable immune target combinations on antigens that evolve under immune-driven selection. In the future, this approach may be applied to influenza and other novel viruses such as SARS-CoV-2, which evolves and, like influenza, might continue to escape the natural and vaccine-mediated immune pressures.
Collapse
Affiliation(s)
- Julia Doelger
- Institute for Medical Engineering and Science, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA
| | - Mehran Kardar
- Department of Physics, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA
| | - Arup K Chakraborty
- Institute for Medical Engineering and Science, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA; Department of Physics, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA; Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA; Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA; and Ragon Institute of MGH, MIT, and Harvard, Cambridge, Massachusetts 02139, USA
| |
Collapse
|
16
|
Downard KM. SEQUENCE-FREE PHYLOGENETICS WITH MASS SPECTROMETRY. MASS SPECTROMETRY REVIEWS 2022; 41:3-14. [PMID: 33169385 DOI: 10.1002/mas.21658] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/16/2020] [Accepted: 08/31/2020] [Indexed: 06/11/2023]
Abstract
An alternative, more rapid, sequence-free approach to build phylogenetic trees has been conceived and implemented. Molecular phylogenetics has continued to mostly focus on improvement in tree construction based on gene sequence alignments. Here protein-based phylogenies are constructed using numerical data sets ("phylonumerics") representing the masses of peptide segments recorded in a mass mapping experiment. This truly sequence-free method requires no gene sequences, nor their alignment, to build the trees affording a considerable time and cost-saving to conventional phylogenetics methods. The approach also calculates single point amino acid mutations from a comparison of mass pairs from different maps in the data set and displays these at branch nodes across the tree together with their frequency. Studies of the consecutive, and near-consecutive, ancestral and descendant mutations across interconnected branches of a mass tree allow putative adaptive, epistatic, and compensatory mutations to be identified in order to investigate mechanisms associated with evolutionary processes and pathways. A side-by-side comparison of this sequence-free approach and conventional gene sequence phylogenetics is discussed.
Collapse
Affiliation(s)
- Kevin M Downard
- Infectious Disease Responses Laboratory, Prince of Wales Clinical Sciences, Medicine, University of New South Wales, Sydney, New South Wales, Australia
| |
Collapse
|
17
|
Wang Y, Lei R, Nourmohammad A, Wu NC. Antigenic evolution of human influenza H3N2 neuraminidase is constrained by charge balancing. eLife 2021; 10:e72516. [PMID: 34878407 PMCID: PMC8683081 DOI: 10.7554/elife.72516] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2021] [Accepted: 12/07/2021] [Indexed: 11/13/2022] Open
Abstract
As one of the main influenza antigens, neuraminidase (NA) in H3N2 virus has evolved extensively for more than 50 years due to continuous immune pressure. While NA has recently emerged as an effective vaccine target, biophysical constraints on the antigenic evolution of NA remain largely elusive. Here, we apply combinatorial mutagenesis and next-generation sequencing to characterize the local fitness landscape in an antigenic region of NA in six different human H3N2 strains that were isolated around 10 years apart. The local fitness landscape correlates well among strains and the pairwise epistasis is highly conserved. Our analysis further demonstrates that local net charge governs the pairwise epistasis in this antigenic region. In addition, we show that residue coevolution in this antigenic region is correlated with the pairwise epistasis between charge states. Overall, this study demonstrates the importance of quantifying epistasis and the underlying biophysical constraint for building a model of influenza evolution.
Collapse
Affiliation(s)
- Yiquan Wang
- Department of Biochemistry, University of Illinois at Urbana-ChampaignUrbanaUnited States
| | - Ruipeng Lei
- Department of Biochemistry, University of Illinois at Urbana-ChampaignUrbanaUnited States
| | - Armita Nourmohammad
- Department of Physics, University of WashingtonSeattleUnited States
- Max Planck Institute for Dynamics and Self-OrganizationGöttingenGermany
- Fred Hutchinson Cancer Research CenterSeattleUnited States
| | - Nicholas C Wu
- Department of Biochemistry, University of Illinois at Urbana-ChampaignUrbanaUnited States
- Center for Biophysics and Quantitative Biology, University of Illinois at Urbana-ChampaignUrbanaUnited States
- Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-ChampaignUrbanaUnited States
- Carle Illinois College of Medicine, University of Illinois at Urbana-ChampaignUrbanaUnited States
| |
Collapse
|
18
|
Behdenna A, Godfroid M, Petot P, Pothier J, Lambert A, Achaz G. A minimal yet flexible likelihood framework to assess correlated evolution. Syst Biol 2021; 71:823-838. [PMID: 34792608 DOI: 10.1093/sysbio/syab092] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2020] [Revised: 11/04/2021] [Accepted: 11/09/2021] [Indexed: 11/14/2022] Open
Abstract
An evolutionary process is reflected in the sequence of changes of any trait (e.g. morphological or molecular) through time. Yet, a better understanding of evolution would be procured by characterizing correlated evolution, or when two or more evolutionary processes interact. Previously developed parametric methods often require significant computing time as they rely on the estimation of many parameters. Here we propose a minimal likelihood framework modelling the joint evolution of two traits on a known phylogenetic tree. The type and strength of correlated evolution is characterized by a few parameters tuning mutation rates of each trait and interdependencies between these rates. The framework can be applied to study any discrete trait or character ranging from nucleotide substitution to gain or loss of a biological function. More specifically, it can be used to 1) test for independence between two evolutionary processes, 2) identify the type of interaction between them and 3) estimate parameter values of the most likely model of interaction. In the current implementation, the method takes as input a phylogenetic tree with discrete evolutionary events mapped on its branches. The method then maximizes the likelihood for one or several chosen scenarios. The strengths and limits of the method, as well as its relative power compared to a few other methods, are assessed using both simulations and data from 16S rRNA sequences in a sample of 54 γ-enterobacteria. We show that, even with datasets of fewer than 100 species, the method performs well in parameter estimation and in evolutionary model selection.
Collapse
Affiliation(s)
- Abdelkader Behdenna
- Institut de Systématique, Évolution, Biodiversité (ISYEB), Muséum National d'Histoire Naturelle, CNRS UMR 7205, Sorbonne Université, École Pratique des Hautes Études, Université des Antilles, 45 rue Buffon, 75005 Paris, France
- SMILE Group, Center for Interdisciplinary Research in Biology (CIRB), Collège de France, CNRS, INSERM, Université PSL, 11, place Marcellin Berthelot, 75005 Paris, France
- Epigene Labs, 7 Square Gabriel Fauré, 75017 Paris, France
| | - Maxime Godfroid
- SMILE Group, Center for Interdisciplinary Research in Biology (CIRB), Collège de France, CNRS, INSERM, Université PSL, 11, place Marcellin Berthelot, 75005 Paris, France
| | - Patrice Petot
- Institut de Systématique, Évolution, Biodiversité (ISYEB), Muséum National d'Histoire Naturelle, CNRS UMR 7205, Sorbonne Université, École Pratique des Hautes Études, Université des Antilles, 45 rue Buffon, 75005 Paris, France
- SMILE Group, Center for Interdisciplinary Research in Biology (CIRB), Collège de France, CNRS, INSERM, Université PSL, 11, place Marcellin Berthelot, 75005 Paris, France
| | - Joël Pothier
- Institut de Systématique, Évolution, Biodiversité (ISYEB), Muséum National d'Histoire Naturelle, CNRS UMR 7205, Sorbonne Université, École Pratique des Hautes Études, Université des Antilles, 45 rue Buffon, 75005 Paris, France
| | - Amaury Lambert
- SMILE Group, Center for Interdisciplinary Research in Biology (CIRB), Collège de France, CNRS, INSERM, Université PSL, 11, place Marcellin Berthelot, 75005 Paris, France
- Laboratoire de Probabilités, Statistique et Modélisation (LPSM), Sorbonne Université, CNRS UMR 8001, Université de Paris, 4, place Jussieu, 75005 Paris, France
| | - Guillaume Achaz
- Institut de Systématique, Évolution, Biodiversité (ISYEB), Muséum National d'Histoire Naturelle, CNRS UMR 7205, Sorbonne Université, École Pratique des Hautes Études, Université des Antilles, 45 rue Buffon, 75005 Paris, France
- SMILE Group, Center for Interdisciplinary Research in Biology (CIRB), Collège de France, CNRS, INSERM, Université PSL, 11, place Marcellin Berthelot, 75005 Paris, France
- Éco-anthropologie, Muséum National d'Histoire Naturelle, CNRS UMR 7206, Université de Paris, place du Trocadéro, 75016 Paris, France
| |
Collapse
|
19
|
Wang MH, Lou J, Cao L, Zhao S, Chan RW, Chan PK, Chan MCW, Chong MK, Wu WK, Wei Y, Zhang H, Zee BC, Yeoh EK. Characterization of key amino acid substitutions and dynamics of the influenza virus H3N2 hemagglutinin. J Infect 2021; 83:671-677. [PMID: 34627840 DOI: 10.1016/j.jinf.2021.09.026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2020] [Revised: 06/10/2021] [Accepted: 09/30/2021] [Indexed: 10/20/2022]
Abstract
The annual epidemics of seasonal influenza is partly attributed to the continued virus evolution. It is challenging to evaluate the effect of influenza virus mutations on evading population immunity. In this study, we introduce a novel statistical and computational approach to measure the dynamic molecular determinants underlying epidemics using effective mutations (EMs), and account for the time of waning mutation advantage against herd immunity by measuring the effective mutation periods (EMPs). Extensive analysis is performed on the sequencing and epidemiology data of H3N2 epidemics in ten regions from season to season. We systematically identified 46 EMs in the hemagglutinin (HA) gene, in which the majority were antigenic sites. Eight EMs were located in immunosubdominant stalk domain, an important target for developing broadly reactive antibodies. The EMs might provide timely information on key substitutions for influenza vaccines antigen design. The EMP suggested that major genetic variants of H3N2 circulated in Southeast Asia for an average duration of 4.5 years (SD 2.4) compared to a significantly shorter 2.0 years (SD 1.0) in temperate regions. The proposed method bridges population epidemics and molecular characteristics of infectious diseases, and would find broad applications in various pathogens mutation estimations.
Collapse
Affiliation(s)
- Maggie Haitian Wang
- JC School of Public Health and Primary Care, The Chinese University of Hong Kong, Shatin, N.T., Hong Kong SAR, China; CUHK Shenzhen Research Institute, Shenzhen, China.
| | - Jingzhi Lou
- JC School of Public Health and Primary Care, The Chinese University of Hong Kong, Shatin, N.T., Hong Kong SAR, China; CUHK Shenzhen Research Institute, Shenzhen, China
| | - Lirong Cao
- JC School of Public Health and Primary Care, The Chinese University of Hong Kong, Shatin, N.T., Hong Kong SAR, China; CUHK Shenzhen Research Institute, Shenzhen, China
| | - Shi Zhao
- JC School of Public Health and Primary Care, The Chinese University of Hong Kong, Shatin, N.T., Hong Kong SAR, China; CUHK Shenzhen Research Institute, Shenzhen, China
| | - Renee Wy Chan
- CUHK-UMCU Joint Research Laboratory of Respiratory Virus & Immunobiology, The Chinese University of Hong Kong, Shatin, N.T., Hong Kong SAR, China; Department of Paediatrics, The Chinese University of Hong Kong, Shatin, N.T., Hong Kong SAR, China
| | - Paul Ks Chan
- Department of Microbiology, Stanley Ho Center for Emerging Infectious Diseases, Li Ka Shing Institute of Health Sciences, The Chinese University of Hong Kong, Shatin, N.T., Hong Kong SAR, China
| | - Martin Chi-Wai Chan
- Department of Microbiology, Stanley Ho Center for Emerging Infectious Diseases, Li Ka Shing Institute of Health Sciences, The Chinese University of Hong Kong, Shatin, N.T., Hong Kong SAR, China
| | - Marc Kc Chong
- JC School of Public Health and Primary Care, The Chinese University of Hong Kong, Shatin, N.T., Hong Kong SAR, China; CUHK Shenzhen Research Institute, Shenzhen, China
| | - William Kk Wu
- Department of Anaesthesia and Intensive Care, The Chinese University of Hong Kong, Shatin, N.T., Hong Kong SAR, China
| | - Yuchen Wei
- Department of Microbiology, Stanley Ho Center for Emerging Infectious Diseases, Li Ka Shing Institute of Health Sciences, The Chinese University of Hong Kong, Shatin, N.T., Hong Kong SAR, China
| | - Haoyang Zhang
- JC School of Public Health and Primary Care, The Chinese University of Hong Kong, Shatin, N.T., Hong Kong SAR, China
| | - Benny Cy Zee
- JC School of Public Health and Primary Care, The Chinese University of Hong Kong, Shatin, N.T., Hong Kong SAR, China; CUHK Shenzhen Research Institute, Shenzhen, China
| | - Eng-Kiong Yeoh
- JC School of Public Health and Primary Care, The Chinese University of Hong Kong, Shatin, N.T., Hong Kong SAR, China.
| |
Collapse
|
20
|
Antigenic cartography reveals complexities of genetic determinants that lead to antigenic differences among pandemic GII.4 noroviruses. Proc Natl Acad Sci U S A 2021; 118:2015874118. [PMID: 33836574 PMCID: PMC7980451 DOI: 10.1073/pnas.2015874118] [Citation(s) in RCA: 27] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
Noroviruses are the predominant cause of acute gastroenteritis, with a single genotype (GII.4) responsible for the majority of infections. This prevalence is characterized by the periodic emergence of new variants that present substitutions at antigenic sites of the major structural protein (VP1), facilitating escape from herd immunity. Notably, the contribution of intravariant mutations to changes in antigenic properties is unknown. We performed a comprehensive antigenic analysis on a virus-like particle panel representing major chronological GII.4 variants to investigate diversification at the inter- and intravariant level. Immunoassays, neutralization data, and cartography analyses showed antigenic similarities between phylogenetically related variants, with major switches to antigenic properties observed over the evolution of GII.4 variants. Genetic analysis indicated that multiple coevolving amino acid changes-primarily at antigenic sites-are associated with the antigenic diversification of GII.4 variants. These data highlight complexities of the genetic determinants and provide a framework for the antigenic characterization of emerging GII.4 noroviruses.
Collapse
|
21
|
Jones JE, Le Sage V, Padovani GH, Calderon M, Wright ES, Lakdawala SS. Parallel evolution between genomic segments of seasonal human influenza viruses reveals RNA-RNA relationships. eLife 2021; 10:66525. [PMID: 34448455 PMCID: PMC8523153 DOI: 10.7554/elife.66525] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2021] [Accepted: 08/23/2021] [Indexed: 11/22/2022] Open
Abstract
The influenza A virus (IAV) genome consists of eight negative-sense viral RNA (vRNA) segments that are selectively assembled into progeny virus particles through RNA-RNA interactions. To explore putative intersegmental RNA-RNA relationships, we quantified similarity between phylogenetic trees comprising each vRNA segment from seasonal human IAV. Intersegmental tree similarity differed between subtype and lineage. While intersegmental relationships were largely conserved over time in H3N2 viruses, they diverged in H1N1 strains isolated before and after the 2009 pandemic. Surprisingly, intersegmental relationships were not driven solely by protein sequence, suggesting that IAV evolution could also be driven by RNA-RNA interactions. Finally, we used confocal microscopy to determine that colocalization of highly coevolved vRNA segments is enriched over other assembly intermediates at the nuclear periphery during productive viral infection. This study illustrates how putative RNA interactions underlying selective assembly of IAV can be interrogated with phylogenetics. The viruses responsible for influenza evolve rapidly during infection. Changes typically emerge in two key ways: through random mutations in the genetic sequence of the virus, or by reassortment. Reassortment can occur when two or more strains infect the same cell. Once in a cell, viral particles ‘open up’ to release their genetic material so it can make copies of itself using the cell’s machinery. The new copies of the genetic material of the virus are used to make new viral particles, which then envelop the genetic material and are released from the cell to infect other cells. If several strains of a virus infect the same cell, a new viral particle may pick up genetic segments from each of the infecting strains, creating a new strain via reassortment. Several factors are known to affect the success of the reassortment process. For example, if the new strain acquires a genetic defect that hinders its replication cycle, it is likely to die out quickly. Other times, this trading of genetic information can create a strain that is more resistant to the human immune system, allowing it to sweep across the globe and cause a deadly pandemic. However, a key part of the reassortment process that still remains unclear is how genome segments from two different influenza strains recognize each other before merging together to create hybrid daughter viruses. To explore this further, Jones et al. used a technique called fluorescence microscopy. They found that genome segments that evolved along similar paths were more likely to cluster in the same area inside infected cells, and therefore, more likely to be reassorted together into a new strain during assembly of daughter viruses. This suggests that assembly may guide the evolutionary path taken by individual genomic segments. Jones et al. also looked at the evolution of different genome segments collected from patients suffering from seasonal influenza, and found that these segments had a distinct evolutionary path to those in pandemic-causing strains. This research provides new insights into the role of reassortment in the evolution of influenza viruses during infection. In particular, it suggests that how the genome segments interact with one another may have a previously unknown and important role in guiding this evolution. These insights could be used to predict future reassortment events based on evolutionary relationships between influenza virus genomic segments, and may in the future be used as part of risk assessment tools to predict the emergence of new pandemic strains.
Collapse
Affiliation(s)
- Jennifer E Jones
- Department of Microbiology & Molecular Genetics, University of Pittsburgh, Pittsburgh, United States.,Center for Evolutionary Biology and Medicine, University of Pittsburgh, Pittsburgh, United States.,Center for Vaccine Research, University of Pittsburgh School of Medicine, Pittsburgh, United States
| | - Valerie Le Sage
- Department of Microbiology & Molecular Genetics, University of Pittsburgh, Pittsburgh, United States.,Center for Vaccine Research, University of Pittsburgh School of Medicine, Pittsburgh, United States
| | - Gabriella H Padovani
- Department of Microbiology & Molecular Genetics, University of Pittsburgh, Pittsburgh, United States.,Center for Vaccine Research, University of Pittsburgh School of Medicine, Pittsburgh, United States
| | - Michael Calderon
- Department of Cell Biology, Center for Biologic Imaging, University of Pittsburgh, Pittsburgh, United States
| | - Erik S Wright
- Center for Evolutionary Biology and Medicine, University of Pittsburgh, Pittsburgh, United States.,Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, United States
| | - Seema S Lakdawala
- Department of Microbiology & Molecular Genetics, University of Pittsburgh, Pittsburgh, United States.,Center for Vaccine Research, University of Pittsburgh School of Medicine, Pittsburgh, United States
| |
Collapse
|
22
|
Rochman ND, Wolf YI, Faure G, Mutz P, Zhang F, Koonin EV. Ongoing global and regional adaptive evolution of SARS-CoV-2. Proc Natl Acad Sci U S A 2021; 118:e2104241118. [PMID: 34292871 PMCID: PMC8307621 DOI: 10.1073/pnas.2104241118] [Citation(s) in RCA: 143] [Impact Index Per Article: 47.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
Understanding the trends in severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) evolution is paramount to control the COVID-19 pandemic. We analyzed more than 300,000 high-quality genome sequences of SARS-CoV-2 variants available as of January 2021. The results show that the ongoing evolution of SARS-CoV-2 during the pandemic is characterized primarily by purifying selection, but a small set of sites appear to evolve under positive selection. The receptor-binding domain of the spike protein and the region of the nucleocapsid protein associated with nuclear localization signals (NLS) are enriched with positively selected amino acid replacements. These replacements form a strongly connected network of apparent epistatic interactions and are signatures of major partitions in the SARS-CoV-2 phylogeny. Virus diversity within each geographic region has been steadily growing for the entirety of the pandemic, but analysis of the phylogenetic distances between pairs of regions reveals four distinct periods based on global partitioning of the tree and the emergence of key mutations. The initial period of rapid diversification into region-specific phylogenies that ended in February 2020 was followed by a major extinction event and global homogenization concomitant with the spread of D614G in the spike protein, ending in March 2020. The NLS-associated variants across multiple partitions rose to global prominence in March to July, during a period of stasis in terms of interregional diversity. Finally, beginning in July 2020, multiple mutations, some of which have since been demonstrated to enable antibody evasion, began to emerge associated with ongoing regional diversification, which might be indicative of speciation.
Collapse
Affiliation(s)
- Nash D Rochman
- National Center for Biotechnology Information, National Library of Medicine, Bethesda, MD 20894;
| | - Yuri I Wolf
- National Center for Biotechnology Information, National Library of Medicine, Bethesda, MD 20894
| | - Guilhem Faure
- Broad Institute of MIT and Harvard, Cambridge, MA 02142
| | - Pascal Mutz
- National Center for Biotechnology Information, National Library of Medicine, Bethesda, MD 20894
| | - Feng Zhang
- Broad Institute of MIT and Harvard, Cambridge, MA 02142;
- HHMI, Massachusetts Institute of Technology, Cambridge, MA 02139
- McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, MA 02139
- Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA 02139
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139
| | - Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine, Bethesda, MD 20894;
| |
Collapse
|
23
|
Pedruzzi G, Rouzine IM. An evolution-based high-fidelity method of epistasis measurement: Theory and application to influenza. PLoS Pathog 2021; 17:e1009669. [PMID: 34153082 PMCID: PMC8248644 DOI: 10.1371/journal.ppat.1009669] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2020] [Revised: 07/01/2021] [Accepted: 05/25/2021] [Indexed: 12/18/2022] Open
Abstract
Linkage effects in a multi-locus population strongly influence its evolution. The models based on the traveling wave approach enable us to predict the average speed of evolution and the statistics of phylogeny. However, predicting statistically the evolution of specific sites and pairs of sites in the multi-locus context remains a mathematical challenge. In particular, the effects of epistasis, the interaction of gene regions contributing to phenotype, is difficult to predict theoretically and detect experimentally in sequence data. A large number of false-positive interactions arises from stochastic linkage effects and indirect interactions, which mask true epistatic interactions. Here we develop a proof-of-principle method to filter out false-positive interactions. We start by demonstrating that the averaging of haplotype frequencies over multiple independent populations is necessary but not sufficient for epistatic detection, because it still leaves high numbers of false-positive interactions. To compensate for the residual stochastic noise, we develop a three-way haplotype method isolating true interactions. The fidelity of the method is confirmed analytically and on simulated genetic sequences evolved with a known epistatic network. The method is then applied to a large sequence database of neurominidase protein of influenza A H1N1 obtained from various geographic locations to infer the epistatic network responsible for the difference between the pre-pandemic virus and the pandemic strain of 2009. These results present a simple and reliable technique to measure epistatic interactions of any sign from sequence data. Interactions between genomic sites create a fitness landscape. The knowledge of topology and strength of interactions is vital for predicting the escape of viruses from drugs and immune response and their passing through fitness valleys. Many efforts have been invested into measuring these interactions from DNA sequence sets. Unfortunately, reproducibility of the results remains low due partly to a very small fraction of interaction pairs and partly to stochastic linkage noise masking true interactions. Here we propose a method to separate stochastic linkage and indirect interactions from epistatic interactions and apply it to influenza virus sequence data.
Collapse
Affiliation(s)
- Gabriele Pedruzzi
- Sorbonne Université, Institute de Biologie Paris-Seine, Laboratoire de Biologie Computationelle et Quantitative LCQB, Paris, France
| | - Igor M. Rouzine
- Sorbonne Université, Institute de Biologie Paris-Seine, Laboratoire de Biologie Computationelle et Quantitative LCQB, Paris, France
- * E-mail:
| |
Collapse
|
24
|
Abstract
Background: Pathogens are often assumed to evolve towards reduced virulence, but counterexamples abound. Faced with a new pathogen, such as SARS-CoV-2, it is crucial to be able to forecast the case fatality rate (CFR) and the overall disease burden. Considerable effort has been invested towards developing a mathematical framework for predicting virulence evolution. Although many approaches accurately recapitulate complex outcomes, most rely on an assumed trade-off between CFR and infection rate. It is often impractical to empirically validate this constraint for human pathogens. Methods: A compartment model with parameters tuning the degree to which symptomatic individuals are isolated and the duration of immunity is constructed and evaluated at both short timescales and at equilibrium. Results: We reveal kinetic constraints whereby variation of multiple parameters in concert leads to decreased CFR and increased pathogen fitness, whereas independent variation of the parameters decreases pathogen fitness. Smallpox, SARS-CoV-2, and influenza are analyzed as diverse representatives of human respiratory viruses. We show that highly virulent viruses, such as smallpox, are often constrained by the host behavior, whereas moderately virulent viruses, such as SARS-CoV-2, appear to be typically constrained by the relationship between the duration of immunity and CFR. Conclusions: Evolution of human respiratory epidemics appears to be often kinetically constrained and a reduction in CFR should not be assumed. These results agree with previous work demonstrating an increase in virulence for smallpox and further predict that SARS-CoV-2 is likely to continue presenting a substantial disease burden. Herd immunity against SARS-CoV-2 and viruses with similar life history traits might be unachievable without vaccination. However, partial isolation of symptomatic individuals can have a major effect on the epidemic dynamics not only by reducing the number of fatalities in the short term but also by changing the evolutionary trajectory of moderate CFR viruses towards reduced CFR.
Collapse
Affiliation(s)
- Nash Rochman
- National Center for Biotechnology Information, National Institutes of Health, Bethesda, MD, 20894, USA
| | - Yuri I Wolf
- National Center for Biotechnology Information, National Institutes of Health, Bethesda, MD, 20894, USA
| | - Eugene V Koonin
- National Center for Biotechnology Information, National Institutes of Health, Bethesda, MD, 20894, USA
| |
Collapse
|
25
|
Abstract
Background: It is often assumed that pathogens evolve towards reduced virulence, but counterexamples abound. Faced with a new pathogen, such as SARS-CoV-2, it is highly desirable to be able to forecast the case fatality rate (CFR) and overall disease burden into the future. Considerable effort has been invested towards the development of a mathematical framework for predicting virulence evolution. Although many approaches accurately recapitulate complex outcomes, most rely on an assumed trade-off between CFR and infection rate. It is often impractical to empirically validate this constraint for human pathogens. Methods: A compartment model with parameters tuning the degree to which symptomatic individuals are isolated and the duration of immunity is constructed and evaluated at both short timescales and at equilibrium (when it exists). Results: We reveal kinetic constraints where the variation of multiple parameters in concert leads to decreased CFR and increased pathogen fitness, whereas independent variation of the parameters decreases pathogen fitness. Smallpox, SARS-CoV-2, and influenza are analyzed as diverse representatives of human respiratory viruses. We show that highly virulent viruses, such as smallpox, are likely often constrained by host behavior, whereas moderately virulent viruses, such as SARS-CoV-2, appear to be typically constrained by the relationship between the duration of immunity and CFR. Conclusions: Evolution of human respiratory epidemics appears to be often kinetically constrained and a reduction in CFR should not be assumed. Our findings imply that, without continued public health intervention, SARS-CoV-2 is likely to continue presenting a substantial disease burden. The existence of a parameter regime admitting endemic equilibrium suggests that herd immunity is unachievable. However, we demonstrate that even partial isolation of symptomatic individuals can have a major effect not only by reducing the number of fatalities in the short term but also by potentially changing the evolutionary trajectory of the virus towards reduced CFR.
Collapse
Affiliation(s)
- Nash Rochman
- National Center for Biotechnology Information, National Institutes of Health, Bethesda, MD, 20894, USA
| | - Yuri I Wolf
- National Center for Biotechnology Information, National Institutes of Health, Bethesda, MD, 20894, USA
| | - Eugene V Koonin
- National Center for Biotechnology Information, National Institutes of Health, Bethesda, MD, 20894, USA
| |
Collapse
|
26
|
Piantham C, Ito K. Modeling the selective advantage of new amino acids on the hemagglutinin of H1N1 influenza viruses using their patient age distributions. Virus Evol 2021; 7:veab049. [PMID: 34285812 PMCID: PMC8286795 DOI: 10.1093/ve/veab049] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
In 2009, a new strain of H1N1 influenza A virus caused a pandemic, and its descendant strains are causing seasonal epidemics worldwide. Given the high mutation rate of influenza viruses, variant strains having different amino acids on hemagglutinin (HA) continuously emerge. To prepare vaccine strains for the next influenza seasons, it is an urgent task to predict which variants will be selected in the viral population. An analysis of 24,681 pairs of an amino acid sequence of HA of H1N1pdm2009 viruses and its patient age showed that the empirical fixation probability of new amino acids on HA significantly differed depending on their frequencies in the population, patient age distributions, and epitope flags. The selective advantage of a variant strain having a new amino acid was modeled by linear combinations of patients age distributions and epitope flags, and then the fixation probability of the new amino acid was modeled using Kimura’s formula for advantageous selection. The parameters of models were estimated from the sequence data and models were tested with four-fold cross validations. The frequency of new amino acids alone can achieve high sensitivity, specificity, and precision in predicting the fixation of a new amino acid of which frequency is more than 0.11. The estimated parameter suggested that viruses with a new amino acid having a frequency in the population higher than 0.11 have a significantly higher selective advantage compared to viruses with the old amino acid at the same position. The model considering the Z-value of patient age rank-sums of new amino acids predicted amino acid substitutions on HA with a sensitivity of 0.78, specificity of 0.86, and precision of 0.83, showing significant improvement compared to the constant selective advantage model, which used only the frequency of the amino acid. These results suggested that H1N1 viruses tend to be selected in the adult population, and frequency of viruses having new amino acids and their patient ages are useful to predict amino acid substitutions on HA.
Collapse
Affiliation(s)
- Chayada Piantham
- Division of Bioinformatics, Graduate School of Infectious Diseases, Hokkaido University, Sapporo 0600818, Japan
| | - Kimihito Ito
- Division of Bioinformatics, International Institute for Zoonosis Control, Hokkaido University, Sapporo 0010020, Japan
| |
Collapse
|
27
|
Magee AF, Hilton SK, DeWitt WS. Robustness of phylogenetic inference to model misspecification caused by pairwise epistasis. Mol Biol Evol 2021; 38:4603-4615. [PMID: 34043795 PMCID: PMC8476159 DOI: 10.1093/molbev/msab163] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
Likelihood-based phylogenetic inference posits a probabilistic model of character state change along branches of a phylogenetic tree. These models typically assume statistical independence of sites in the sequence alignment. This is a restrictive assumption that facilitates computational tractability, but ignores how epistasis, the effect of genetic background on mutational effects, influences the evolution of functional sequences. We consider the effect of using a misspecified site-independent model on the accuracy of Bayesian phylogenetic inference in the setting of pairwise-site epistasis. Previous work has shown that as alignment length increases, tree reconstruction accuracy also increases. Here, we present a simulation study demonstrating that accuracy increases with alignment size even if the additional sites are epistatically coupled. We introduce an alignment-based test statistic that is a diagnostic for pairwise epistasis and can be used in posterior predictive checks.
Collapse
Affiliation(s)
- Andrew F Magee
- Departments of Biology.,Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, WA
| | - Sarah K Hilton
- Departments of Genome Sciences, University of Washington, Seattle, USA.,Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, WA
| | - William S DeWitt
- Departments of Genome Sciences, University of Washington, Seattle, USA.,Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, WA
| |
Collapse
|
28
|
Selection for cooperativity causes epistasis predominately between native contacts and enables epistasis-based structure reconstruction. Proc Natl Acad Sci U S A 2021; 118:2010057118. [PMID: 33879570 DOI: 10.1073/pnas.2010057118] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Epistasis and cooperativity of folding both result from networks of energetic interactions in proteins. Epistasis results from energetic interactions among mutants, whereas cooperativity results from energetic interactions during folding that reduce the presence of intermediate states. The two concepts seem intuitively related, but it is unknown how they are related, particularly in terms of selection. To investigate their relationship, we simulated protein evolution under selection for cooperativity and separately under selection for epistasis. Strong selection for cooperativity created strong epistasis between contacts in the native structure but weakened epistasis between nonnative contacts. In contrast, selection for epistasis increased epistasis in both native and nonnative contacts and reduced cooperativity. Because epistasis can be used to predict protein structure only if it preferentially occurs in native contacts, this result indicates that selection for cooperativity may be key for predicting structure using epistasis. To evaluate this inference, we simulated the evolution of guanine nucleotide-binding protein (GB1) with and without cooperativity. With cooperativity, strong epistatic interactions clearly map out the native GB1 structure, while allowing the presence of intermediate states (low cooperativity) obscured the structure. This indicates that using epistasis measurements to reconstruct protein structure may be inappropriate for proteins with stable intermediates.
Collapse
|
29
|
Evolution of Type B Influenza Virus Using a Mass Spectrometry Based Phylonumerics Approach. Evol Biol 2021. [DOI: 10.1007/s11692-021-09535-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
30
|
Rochman ND, Wolf YI, Faure G, Mutz P, Zhang F, Koonin EV. Ongoing Global and Regional Adaptive Evolution of SARS-CoV-2. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2021:2020.10.12.336644. [PMID: 33083804 PMCID: PMC7574262 DOI: 10.1101/2020.10.12.336644] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
Understanding the trends in SARS-CoV-2 evolution is paramount to control the COVID-19 pandemic. We analyzed more than 300,000 high quality genome sequences of SARS-CoV-2 variants available as of January 2021. The results show that the ongoing evolution of SARS-CoV-2 during the pandemic is characterized primarily by purifying selection, but a small set of sites appear to evolve under positive selection. The receptor-binding domain of the spike protein and the nuclear localization signal (NLS) associated region of the nucleocapsid protein are enriched with positively selected amino acid replacements. These replacements form a strongly connected network of apparent epistatic interactions and are signatures of major partitions in the SARS-CoV-2 phylogeny. Virus diversity within each geographic region has been steadily growing for the entirety of the pandemic, but analysis of the phylogenetic distances between pairs of regions reveals four distinct periods based on global partitioning of the tree and the emergence of key mutations. The initial period of rapid diversification into region-specific phylogenies that ended in February 2020 was followed by a major extinction event and global homogenization concomitant with the spread of D614G in the spike protein, ending in March 2020. The NLS associated variants across multiple partitions rose to global prominence in March-July, during a period of stasis in terms of inter-regional diversity. Finally, beginning July 2020, multiple mutations, some of which have since been demonstrated to enable antibody evasion, began to emerge associated with ongoing regional diversification, which might be indicative of speciation.
Collapse
Affiliation(s)
- Nash D Rochman
- National Center for Biotechnology Information, National Library of Medicine, Bethesda, MD 20894
| | - Yuri I Wolf
- National Center for Biotechnology Information, National Library of Medicine, Bethesda, MD 20894
| | - Guilhem Faure
- Broad Institute of MIT and Harvard, Cambridge, MA 02142
| | - Pascal Mutz
- National Center for Biotechnology Information, National Library of Medicine, Bethesda, MD 20894
| | - Feng Zhang
- Broad Institute of MIT and Harvard, Cambridge, MA 02142
- Howard Hughes Medical Institute, Massachusetts Institute of Technology, Cambridge, MA 02139
- McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, MA 02139
- Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA 02139
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139
| | - Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine, Bethesda, MD 20894
| |
Collapse
|
31
|
Effect of N-linked glycosylation at position 162 of hemagglutinin in influenza A virus A(H1N1)pdm09. Meta Gene 2021. [DOI: 10.1016/j.mgene.2020.100828] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
|
32
|
Neverov AD, Popova AV, Fedonin GG, Cheremukhin EA, Klink GV, Bazykin GA. Episodic evolution of coadapted sets of amino acid sites in mitochondrial proteins. PLoS Genet 2021; 17:e1008711. [PMID: 33493156 PMCID: PMC7861529 DOI: 10.1371/journal.pgen.1008711] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2020] [Revised: 02/04/2021] [Accepted: 12/07/2020] [Indexed: 11/19/2022] Open
Abstract
The rate of evolution differs between protein sites and changes with time. However, the link between these two phenomena remains poorly understood. Here, we design a phylogenetic approach for distinguishing pairs of amino acid sites that evolve concordantly, i.e., such that substitutions at one site trigger subsequent substitutions at the other; and also pairs of sites that evolve discordantly, so that substitutions at one site impede subsequent substitutions at the other. We distinguish groups of amino acid sites that undergo coordinated evolution and evolve discordantly from other such groups. In mitochondrion-encoded proteins of metazoans and fungi, we show that concordantly evolving sites are clustered in protein structures. By analysing the phylogenetic patterns of substitutions at concordantly and discordantly evolving site pairs, we find that concordant evolution has two distinct causes: epistatic interactions between amino acid substitutions and episodes of selection independently affecting substitutions at different sites. The rate of substitutions at concordantly evolving groups of protein sites changes in the course of evolution, indicating episodes of selection limited to some of the lineages. The phylogenetic positions of these changes are consistent between proteins, suggesting common selective forces underlying them. The mode and rate of evolution of a protein site depends on the effect of its mutations on protein fitness. The fitness effect of a mutation itself can change in the course of evolution for at least two reasons. First, it can be modulated by substitutions occurring at other sites, a phenomenon called epistasis. Second, changes in selection can be non-epistatic, affecting sites independently of one another. Here, we analyse substitutions accumulated by the evolving lineages of the five proteins encoded by the mitochondrial genomes of thousands of species of metazoans and fungi. We show that substitutions at different amino acid sites occur in a coordinated fashion, and this coordination is caused both by epistasis and by episodes of selection affecting groups of sites. We partition each protein into several groups of concordantly evolving sites such that evolution of sites from different groups is discordant, and show that the proteins encoded by the mitochondrial genome consist of coevolving structural blocks. Some of these blocks have a clear functional specialization, e.g. are associated with interfaces between proteins composing respiratory complexes. Together, our results reveal a previously unrecognized complexity in the causes of variation in evolutionary rates between protein sites.
Collapse
Affiliation(s)
- Alexey D. Neverov
- Department of Molecular Diagnostics, Central Research Institute for Epidemiology, Moscow, Russia
- * E-mail:
| | - Anfisa V. Popova
- Department of Molecular Diagnostics, Central Research Institute for Epidemiology, Moscow, Russia
| | - Gennady G. Fedonin
- Department of Molecular Diagnostics, Central Research Institute for Epidemiology, Moscow, Russia
- Institute for Information Transmission Problems (Kharkevich Institute), Russian Academy of Sciences, Moscow, Russia
- Moscow Institute of Physics and Technology, Dolgoprudny, Moscow region, Russia
| | | | - Galya V. Klink
- Institute for Information Transmission Problems (Kharkevich Institute), Russian Academy of Sciences, Moscow, Russia
| | - Georgii A. Bazykin
- Institute for Information Transmission Problems (Kharkevich Institute), Russian Academy of Sciences, Moscow, Russia
- Skolkovo Institute of Science and Technology, Skolkovo, Russia
| |
Collapse
|
33
|
Sealy JE, Peacock TP, Sadeyen JR, Chang P, Everest HJ, Bhat S, Iqbal M. Adsorptive mutation and N-linked glycosylation modulate influenza virus antigenicity and fitness. Emerg Microbes Infect 2020; 9:2622-2631. [PMID: 33179567 PMCID: PMC7738305 DOI: 10.1080/22221751.2020.1850180] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
Abstract
Influenza viruses have an error-prone polymerase complex that facilitates a mutagenic environment. Antigenic mutants swiftly arise from this environment with the capacity to persist in both humans and economically important livestock even in the face of vaccination. Furthermore, influenza viruses can adjust the antigenicity of the haemagglutinin (HA) protein, the primary influenza immunogen, using one of four molecular mechanisms. Two prominent mechanisms are: (1) enhancing binding avidity of HA toward cellular receptors to outcompete antibody binding and (2) amino acid substitutions that introduce an N-linked glycan on HA that sterically block antibody binding. In this study we investigate the impact that adsorptive mutation and N-linked glycosylation have on receptor-binding, viral fitness, and antigenicity. We utilize the H9N2 A/chicken/Pakistan/SKP-827/16 virus which naturally contains HA residue T180 that we have previously shown to be an adsorptive mutant relative to virus with T180A. We find that the addition of N-linked glycans can be beneficial or deleterious to virus replication depending on the background receptor binding avidity. We also find that in some cases, an N-linked glycan can trump the effect of an avidity enhancing substitution with respect to antigenicity. Taken together these data shed light on a potential route to the generation of a virus which is "fit" and able to overcome vaccine pressure.
Collapse
Affiliation(s)
| | - Thomas P Peacock
- Department of Infectious Diseases, Imperial College London, London, UK
| | | | | | - Holly J Everest
- Avian Influenza, The Pirbright Institute, Woking, UK.,Nuffield Department of Medicine, University of Oxford, Oxford, UK
| | - Sushant Bhat
- Avian Influenza, The Pirbright Institute, Woking, UK
| | - Munir Iqbal
- Avian Influenza, The Pirbright Institute, Woking, UK
| |
Collapse
|
34
|
Rochman ND, Wolf YI, Koonin EV. Evolution of Human Respiratory Virus Epidemics. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2020:2020.11.23.20237503. [PMID: 33269367 PMCID: PMC7709188 DOI: 10.1101/2020.11.23.20237503] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/21/2023]
Abstract
BACKGROUND While pathogens often evolve towards reduced virulence, many counterexamples are evident. When faced with a new pathogen, such as SARS-CoV-2, it is highly desirable to be able to forecast the case fatality rate (CFR) into the future. Considerable effort has been invested towards the development of a mathematical framework for predicting virulence evolution. Although these approaches accurately recapitulate some complex outcomes, most rely on an assumed trade-off between mortality and infectivity. It is often impractical to empirically validate this constraint for human pathogens. RESULTS Using a compartment model with parameters tuning the degree to which symptomatic individuals are isolated and the duration of immunity, we reveal kinetic constraints where the variation of multiple parameters in concert leads to decreased virulence and increased pathogen fitness, whereas independent variation of the parameters decreases pathogen fitness. Smallpox, SARS-CoV-2, and Influenza are analyzed as diverse representatives of human respiratory viruses. We show that highly virulent viruses, such as Smallpox, are likely often constrained by host behavior, whereas moderately virulent viruses, such as SARS-CoV-2, appear to be typically constrained by the relationship between the duration of immunity and CFR. CONCLUSIONS The evolution of human respiratory epidemics appears to be often kinetically constrained and a reduction in virulence should not be assumed. Our findings imply that, without continued public health intervention, SARS-CoV-2 is likely to continue presenting a substantial disease burden. The existence of a parameter regime admitting endemic equilibrium suggests that herd immunity is unachievable. However, we demonstrate that even partial isolation of symptomatic individuals can have a major effect not only by reducing the number of fatalities in the short term but also by potentially changing the evolutionary trajectory of the virus towards reduced virulence.
Collapse
Affiliation(s)
- Nash D Rochman
- National Center for Biotechnology Information, National Library of Medicine, Bethesda, MD 20894
| | - Yuri I Wolf
- National Center for Biotechnology Information, National Library of Medicine, Bethesda, MD 20894
| | - Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine, Bethesda, MD 20894
| |
Collapse
|
35
|
Escalera-Zamudio M, Golden M, Gutiérrez B, Thézé J, Keown JR, Carrique L, Bowden TA, Pybus OG. Parallel evolution in the emergence of highly pathogenic avian influenza A viruses. Nat Commun 2020; 11:5511. [PMID: 33139731 PMCID: PMC7608645 DOI: 10.1038/s41467-020-19364-x] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2019] [Accepted: 10/12/2020] [Indexed: 01/30/2023] Open
Abstract
Parallel molecular evolution and adaptation are important phenomena commonly observed in viruses. Here, we exploit parallel molecular evolution to understand virulence evolution in avian influenza viruses (AIV). Highly-pathogenic AIVs evolve independently from low-pathogenic ancestors via acquisition of polybasic cleavage sites. Why some AIV lineages but not others evolve in this way is unknown. We hypothesise that the parallel emergence of highly-pathogenic AIV may be facilitated by permissive or compensatory mutations occurring across the viral genome. We combine phylogenetic, statistical and structural approaches to discover parallel mutations in AIV genomes associated with the highly-pathogenic phenotype. Parallel mutations were screened using a statistical test of mutation-phenotype association and further evaluated in the contexts of positive selection and protein structure. Our resulting mutational panel may help to reveal new links between virulence evolution and other traits, and raises the possibility of predicting aspects of AIV evolution.
Collapse
Affiliation(s)
| | - Michael Golden
- Department of Zoology, Oxford University, Parks Rd, Oxford, OX1 3PS, UK
| | | | - Julien Thézé
- Department of Zoology, Oxford University, Parks Rd, Oxford, OX1 3PS, UK
| | - Jeremy Russell Keown
- Division of Structural Biology, Wellcome Centre for Human Genetics, Oxford, OX3 7BN, UK
| | - Loic Carrique
- Division of Structural Biology, Wellcome Centre for Human Genetics, Oxford, OX3 7BN, UK
| | - Thomas A Bowden
- Division of Structural Biology, Wellcome Centre for Human Genetics, Oxford, OX3 7BN, UK
| | - Oliver G Pybus
- Department of Zoology, Oxford University, Parks Rd, Oxford, OX1 3PS, UK.
- Department of Pathobiology and Population Sciences, Royal Veterinary College, London, UK.
| |
Collapse
|
36
|
Stolyarova AV, Nabieva E, Ptushenko VV, Favorov AV, Popova AV, Neverov AD, Bazykin GA. Senescence and entrenchment in evolution of amino acid sites. Nat Commun 2020; 11:4603. [PMID: 32929079 PMCID: PMC7490271 DOI: 10.1038/s41467-020-18366-z] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2019] [Accepted: 08/20/2020] [Indexed: 01/01/2023] Open
Abstract
Amino acid propensities at a site change in the course of protein evolution. This may happen for two reasons. Changes may be triggered by substitutions at epistatically interacting sites elsewhere in the genome. Alternatively, they may arise due to environmental changes that are external to the genome. Here, we design a framework for distinguishing between these alternatives. Using analytical modelling and simulations, we show that they cause opposite dynamics of the fitness of the allele currently occupying the site: it tends to increase with the time since its origin due to epistasis ("entrenchment"), but to decrease due to random environmental fluctuations ("senescence"). By analysing the genomes of vertebrates and insects, we show that the amino acids originating at negatively selected sites experience strong entrenchment. By contrast, the amino acids originating at positively selected sites experience senescence. We propose that senescence of the current allele is a cause of adaptive evolution.
Collapse
Affiliation(s)
- A V Stolyarova
- Center of Life Sciences, Skolkovo Institute of Science and Technology, Skolkovo, 143028, Russia.
| | - E Nabieva
- Center of Life Sciences, Skolkovo Institute of Science and Technology, Skolkovo, 143028, Russia
- Institute for Information Transmission Problems (Kharkevich Institute), Russian Academy of Sciences, Moscow, 127051, Russia
| | - V V Ptushenko
- Department of Photochemistry and Photobiology, N. M. Emanuel Institute of Biochemical Physics of Russian Academy of Sciences, Moscow, 119334, Russia
- A. N. Belozersky Institute of Physical-Chemical Biology, M. V. Lomonosov Moscow State University, Moscow, 119992, Russia
| | - A V Favorov
- Division of Biostatistics and Bioinformatics, Department of Oncology, Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins School of Medicine, Baltimore, MD, 21205, USA
- Laboratory of System Biology and Computational Genetics, Vavilov Institute of General Genetics, Moscow, 119991, Russia
| | - A V Popova
- Department of Molecular Diagnostics, Central Research Institute for Epidemiology, Moscow, 111123, Russia
| | - A D Neverov
- Department of Molecular Diagnostics, Central Research Institute for Epidemiology, Moscow, 111123, Russia
| | - G A Bazykin
- Center of Life Sciences, Skolkovo Institute of Science and Technology, Skolkovo, 143028, Russia
- Institute for Information Transmission Problems (Kharkevich Institute), Russian Academy of Sciences, Moscow, 127051, Russia
| |
Collapse
|
37
|
Wasik BR, Voorhees IEH, Barnard KN, Alford-Lawrence BK, Weichert WS, Hood G, Nogales A, Martínez-Sobrido L, Holmes EC, Parrish CR. Influenza Viruses in Mice: Deep Sequencing Analysis of Serial Passage and Effects of Sialic Acid Structural Variation. J Virol 2019; 93:e01039-19. [PMID: 31511393 PMCID: PMC6854484 DOI: 10.1128/jvi.01039-19] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2019] [Accepted: 09/09/2019] [Indexed: 12/19/2022] Open
Abstract
Influenza A viruses have regularly jumped to new host species to cause epidemics or pandemics, an evolutionary process that involves variation in the viral traits necessary to overcome host barriers and facilitate transmission. Mice are not a natural host for influenza virus but are frequently used as models in studies of pathogenesis, often after multiple passages to achieve higher viral titers that result in clinical disease such as weight loss or death. Here, we examine the processes of influenza A virus infection and evolution in mice by comparing single nucleotide variations of a human H1N1 pandemic virus, a seasonal H3N2 virus, and an H3N2 canine influenza virus during experimental passage. We also compared replication and sequence variation in wild-type mice expressing N-glycolylneuraminic acid (Neu5Gc) with those seen in mice expressing only N-acetylneuraminic acid (Neu5Ac). Viruses derived from plasmids were propagated in MDCK cells and then passaged in mice up to four times. Full-genome deep sequencing of the plasmids, cultured viruses, and viruses from mice at various passages revealed only small numbers of mutational changes. The H3N2 canine influenza virus showed increases in frequency of sporadic mutations in the PB2, PA, and NA segments. The H1N1 pandemic virus grew well in mice, and while it exhibited the maintenance of some minority mutations, there was no clear evidence for adaptive evolution. The H3N2 seasonal virus did not establish in the mice. Finally, there were no clear sequence differences associated with the presence or absence of Neu5Gc.IMPORTANCE Mice are commonly used as a model to study the growth and virulence of influenza A viruses in mammals but are not a natural host and have distinct sialic acid receptor profiles compared to humans. Using experimental infections with different subtypes of influenza A virus derived from different hosts, we found that evolution of influenza A virus in mice did not necessarily proceed through the linear accumulation of host-adaptive mutations, that there was variation in the patterns of mutations detected in each repetition, and that the mutation dynamics depended on the virus examined. In addition, variation in the viral receptor, sialic acid, did not affect influenza virus evolution in this model. Overall, our results show that while mice provide a useful animal model for influenza virus pathology, host passage evolution will vary depending on the specific virus tested.
Collapse
Affiliation(s)
- Brian R Wasik
- Baker Institute for Animal Health, Department of Microbiology and Immunology, College of Veterinary Medicine, Cornell University, Ithaca, New York, USA
| | - Ian E H Voorhees
- Baker Institute for Animal Health, Department of Microbiology and Immunology, College of Veterinary Medicine, Cornell University, Ithaca, New York, USA
| | - Karen N Barnard
- Baker Institute for Animal Health, Department of Microbiology and Immunology, College of Veterinary Medicine, Cornell University, Ithaca, New York, USA
| | - Brynn K Alford-Lawrence
- Baker Institute for Animal Health, Department of Microbiology and Immunology, College of Veterinary Medicine, Cornell University, Ithaca, New York, USA
| | - Wendy S Weichert
- Baker Institute for Animal Health, Department of Microbiology and Immunology, College of Veterinary Medicine, Cornell University, Ithaca, New York, USA
| | - Grace Hood
- Baker Institute for Animal Health, Department of Microbiology and Immunology, College of Veterinary Medicine, Cornell University, Ithaca, New York, USA
- College of Veterinary Medicine, University of Queensland, Gatton, Queensland, Australia
| | - Aitor Nogales
- Department of Microbiology and Immunology, University of Rochester Medical Center, Rochester, New York, USA
| | - Luis Martínez-Sobrido
- Department of Microbiology and Immunology, University of Rochester Medical Center, Rochester, New York, USA
| | - Edward C Holmes
- Marie Bashir Institute for Infectious Diseases and Biosecurity, Charles Perkins Centre, School of Biological Sciences and Sydney Medical School, University of Sydney, Sydney, New South Wales, Australia
| | - Colin R Parrish
- Baker Institute for Animal Health, Department of Microbiology and Immunology, College of Veterinary Medicine, Cornell University, Ithaca, New York, USA
| |
Collapse
|
38
|
Evolution of H5-Type Avian Influenza A Virus Towards Mammalian Tropism in Egypt, 2014 to 2015. Pathogens 2019; 8:pathogens8040224. [PMID: 31703251 PMCID: PMC6963730 DOI: 10.3390/pathogens8040224] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2019] [Revised: 10/31/2019] [Accepted: 11/04/2019] [Indexed: 12/24/2022] Open
Abstract
Highly pathogenic avian influenza viruses (HPAIV) of the H5-subtype have circulated continuously in Egypt since 2006, resulting in numerous poultry outbreaks and considerable sporadic human infections. The extensive circulation and wide spread of these viruses in domestic poultry have resulted in various evolutionary changes with a dramatic impact on viral transmission ability to contact mammals including humans. The transmitted viruses are either (1) adapted well enough in their avian hosts to readily infect mammals, or (2) adapted in the new mammalian hosts to improve their fitness. In both cases, avian influenza viruses (AIVs) acquire various host-specific adaptations. These adaptive variations are not all well-known or thoroughly characterized. In this study, a phylogenetic algorithm based on the informational spectrum method, designated hereafter as ISM, was applied to analyze the affinity of H5-type HA proteins of Egyptian AIV isolates (2006–2015) towards human-type cell receptors. To characterize AIV H5-HA proteins displaying high ISM values reflecting an increased tendency of the HA towards human-type receptors, recombinant IV expressing monobasic, low pathogenic (LP) H5-HA versions in the background of the human influenza virus A/PR/8/1934(H1N1) (LP 7+1), were generated. These viruses were compared with a LP 7+1 expressing a monobasic H5-HA from a human origin virus isolate (human LP-7271), for their receptor binding specificity (ISM), in vitro replication efficiency and in vivo pathogenicity in mammals. Interestingly, using ISM analysis, we identified a LP 7+1 virus (LP-S10739C) expressing the monobasic H5-HA of AIV A/Chicken/Egypt/S10739C/2015(H5N1) that showed high affinity towards human-type receptors. This in silico prediction was reflected by a higher in vitro replication efficiency in mammalian cell cultures and a higher virulence in mice as compared with LP-7271. Sequence comparison between the LP-S10739C and the LP-7271 H5-HA, revealed distinct amino acid changes. Their contribution to the increased mammalian receptor propensity of LP-S10739C demands further investigation to better deduce the molecular determinant behind the reported high morbidity of 2014 to 2015 HPAI H5N1 virus in humans in Egypt. This study provides insights into the evolution of Egyptian H5 HPAIVs and highlights the need to identify the viral evolution in order to recognize emerging AIV with the potential to threaten human and animal populations.
Collapse
|
39
|
Allele-specific nonstationarity in evolution of influenza A virus surface proteins. Proc Natl Acad Sci U S A 2019; 116:21104-21112. [PMID: 31578251 DOI: 10.1073/pnas.1904246116] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
Influenza A virus (IAV) is a major public health problem and a pandemic threat. Its evolution is largely driven by diversifying positive selection so that relative fitness of different amino acid variants changes with time due to changes in herd immunity or genomic context, and novel amino acid variants attain fitness advantage. Here, we hypothesize that diversifying selection also has another manifestation: the fitness associated with a particular amino acid variant should decline with time since its origin, as the herd immunity adapts to it. By tracing the evolution of antigenic sites at IAV surface proteins, we show that an amino acid variant becomes progressively more likely to become replaced by another variant with time since its origin-a phenomenon we call "senescence." Senescence is particularly pronounced at experimentally validated antigenic sites, implying that it is largely driven by host immunity. By contrast, at internal sites, existing variants become more favorable with time, probably due to arising contingent mutations at other epistatically interacting sites. Our findings reveal a previously undescribed facet of adaptive evolution and suggest approaches for prediction of evolutionary dynamics of pathogens.
Collapse
|
40
|
Aris-Brosou S, Parent L, Ibeh N. Viral Long-Term Evolutionary Strategies Favor Stability over Proliferation. Viruses 2019; 11:v11080677. [PMID: 31344814 PMCID: PMC6722887 DOI: 10.3390/v11080677] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2019] [Revised: 07/12/2019] [Accepted: 07/20/2019] [Indexed: 02/01/2023] Open
Abstract
Viruses are known to have some of the highest and most diverse mutation rates found in any biological replicator, with single-stranded (ss) RNA viruses evolving the fastest, and double-stranded (ds) DNA viruses having rates approaching those of bacteria. As mutation rates are tightly and negatively correlated with genome size, selection is a clear driver of viral evolution. However, the role of intragenomic interactions as drivers of viral evolution is still unclear. To understand how these two processes affect the long-term evolution of viruses infecting humans, we comprehensively analyzed ssRNA, ssDNA, dsRNA, and dsDNA viruses, to find which virus types and which functions show evidence for episodic diversifying selection and correlated evolution. We show that selection mostly affects single stranded viruses, that correlated evolution is more prevalent in DNA viruses, and that both processes, taken independently, mostly affect viral replication. However, the genes that are jointly affected by both processes are involved in key aspects of their life cycle, favoring viral stability over proliferation. We further show that both evolutionary processes are intimately linked at the amino acid level, which suggests that it is the joint action of selection and correlated evolution, and not just selection, that shapes the evolutionary trajectories of viruses—and possibly of their epidemiological potential.
Collapse
Affiliation(s)
- Stéphane Aris-Brosou
- Department of Biology, University of Ottawa, Ottawa, ON K1N 6N5, Canada.
- Department of Mathematics and Statistics, University of Ottawa, Ottawa, ON K1N 6N5, Canada.
| | - Louis Parent
- Department of Biology, University of Ottawa, Ottawa, ON K1N 6N5, Canada
| | - Neke Ibeh
- Department of Biology, University of Ottawa, Ottawa, ON K1N 6N5, Canada
| |
Collapse
|
41
|
Abstract
For nearly a century adaptive landscapes have provided overviews of the evolutionary process and yet they remain metaphors. We redefine adaptive landscapes in terms of biological processes rather than descriptive phenomenology. We focus on the underlying mechanisms that generate emergent properties such as epistasis, dominance, trade-offs and adaptive peaks. We illustrate the utility of landscapes in predicting the course of adaptation and the distribution of fitness effects. We abandon aged arguments concerning landscape ruggedness in favor of empirically determining landscape architecture. In so doing, we transform the landscape metaphor into a scientific framework within which causal hypotheses can be tested.
Collapse
Affiliation(s)
- Xiao Yi
- BioTechnology Institute, University of Minnesota, St. Paul, MN
| | - Antony M Dean
- BioTechnology Institute, University of Minnesota, St. Paul, MN.,Department of Ecology, Evolution, and Behavior, University of Minnesota, St. Paul, MN
| |
Collapse
|
42
|
Jalali SAH, Mohammadinezhad R, Mohammadi A, Latifian MH, Talebi M, Soleimanin-Zad S, Golkar P, Hemmatzadeh F. Molecular evolution and selection pressure analysis of infectious hematopoietic necrosis virus (IHNV) revealed the origin and phylogenetic relationship of Iranian isolates in recent epidemics in Iran. Virology 2019; 535:45-58. [PMID: 31272011 DOI: 10.1016/j.virol.2019.06.012] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2019] [Revised: 06/19/2019] [Accepted: 06/19/2019] [Indexed: 12/15/2022]
Abstract
Infectious hematopoietic necrosis virus (IHNV) is the causative agent for a lethal salmonid disease. In this study, we surveyed the IHNV's epidemiology, diversity and the origin of infection in Iran. Phylogenetic analysis revealed that Iranian isolates belonged to one of the two lineages of E genogroup. Subsequently, a combination of phylogenetic, antigenic and structural analysis was performed to investigate the evolution of E genogroup lineages. Site-specific analysis of the viral glycoprotein showed different co-evolving and positively selected sites in each lineage. Most of these sites were mapped to the predicted antigenic patches of the glycoprotein. Further characterization revealed E lineages can be differentiated, in part, by specific mutations at positions 91 and 130, which are located in the structurally flexible regions of the glycoprotein, suggesting a key adaptative role for these sites. These data may assist in better monitoring the emerging isolates in regions infected to IHNV from E genogroup.
Collapse
Affiliation(s)
- Seyed Amir Hossein Jalali
- Research Institute for Biotechnology and Bioengineering, Isfahan University of Technology, Isfahan, 84156-83111, Iran; Department of Natural Resources, Isfahan University of Technology, Isfahan, 84156-83111, Iran.
| | - Rezvan Mohammadinezhad
- Research Institute for Biotechnology and Bioengineering, Isfahan University of Technology, Isfahan, 84156-83111, Iran
| | - Ashraf Mohammadi
- Human Viral vaccine Department, Razi Vaccine and Serum Research Institute (RVSRI), Hessark Karadj Agricultural Research, Education and Extension Organization (AREEO), Tehran, Iran
| | - Mohamad Hassan Latifian
- Department of Agricultural Biotechnology, College of Agriculture, Isfahan University of Technology, Isfahan, 8415683111, Iran
| | - Majid Talebi
- Research Institute for Biotechnology and Bioengineering, Isfahan University of Technology, Isfahan, 84156-83111, Iran; Department of Agricultural Biotechnology, College of Agriculture, Isfahan University of Technology, Isfahan, 8415683111, Iran
| | - Sabihe Soleimanin-Zad
- Research Institute for Biotechnology and Bioengineering, Isfahan University of Technology, Isfahan, 84156-83111, Iran; Department of Food Science and Technology, College of Agriculture, Isfahan University of Technology, Isfahan, Iran
| | - Pouran Golkar
- Research Institute for Biotechnology and Bioengineering, Isfahan University of Technology, Isfahan, 84156-83111, Iran
| | - Farhid Hemmatzadeh
- School of Animal and Veterinary Sciences, The University of Adelaide, Roseworthy, SA, Australia
| |
Collapse
|
43
|
Fisher KJ, Kryazhimskiy S, Lang GI. Detecting genetic interactions using parallel evolution in experimental populations. Philos Trans R Soc Lond B Biol Sci 2019; 374:20180237. [PMID: 31154981 DOI: 10.1098/rstb.2018.0237] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
Eukaryotic genomes contain thousands of genes organized into complex and interconnected genetic interaction networks. Most of our understanding of how genetic variation affects these networks comes from quantitative-trait loci mapping and from the systematic analysis of double-deletion (or knockdown) mutants, primarily in the yeast Saccharomyces cerevisiae. Evolve and re-sequence experiments are an alternative approach for identifying novel functional variants and genetic interactions, particularly between non-loss-of-function mutations. These experiments leverage natural selection to obtain genotypes with functionally important variants and positive genetic interactions. However, no systematic methods for detecting genetic interactions in these data are yet available. Here, we introduce a computational method based on the idea that variants in genes that interact will co-occur in evolved genotypes more often than expected by chance. We apply this method to a previously published yeast experimental evolution dataset. We find that genetic targets of selection are distributed non-uniformly among evolved genotypes, indicating that genetic interactions had a significant effect on evolutionary trajectories. We identify individual gene pairs with a statistically significant genetic interaction score. The strongest interaction is between genes TRK1 and PHO84, genes that have not been reported to interact in previous systematic studies. Our work demonstrates that leveraging parallelism in experimental evolution is useful for identifying genetic interactions that have escaped detection by other methods. This article is part of the theme issue 'Convergent evolution in the genomics era: new insights and directions'.
Collapse
Affiliation(s)
- Kaitlin J Fisher
- 1 Department of Biological Sciences, Lehigh University , Bethlehem, PA 18015 , USA
| | - Sergey Kryazhimskiy
- 2 Division of Biological Sciences, University of California San Diego , La Jolla, CA 92093 , USA
| | - Gregory I Lang
- 1 Department of Biological Sciences, Lehigh University , Bethlehem, PA 18015 , USA
| |
Collapse
|
44
|
Domingo J, Baeza-Centurion P, Lehner B. The Causes and Consequences of Genetic Interactions (Epistasis). Annu Rev Genomics Hum Genet 2019; 20:433-460. [PMID: 31082279 DOI: 10.1146/annurev-genom-083118-014857] [Citation(s) in RCA: 124] [Impact Index Per Article: 24.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The same mutation can have different effects in different individuals. One important reason for this is that the outcome of a mutation can depend on the genetic context in which it occurs. This dependency is known as epistasis. In recent years, there has been a concerted effort to quantify the extent of pairwise and higher-order genetic interactions between mutations through deep mutagenesis of proteins and RNAs. This research has revealed two major components of epistasis: nonspecific genetic interactions caused by nonlinearities in genotype-to-phenotype maps, and specific interactions between particular mutations. Here, we provide an overview of our current understanding of the mechanisms causing epistasis at the molecular level, the consequences of genetic interactions for evolution and genetic prediction, and the applications of epistasis for understanding biology and determining macromolecular structures.
Collapse
Affiliation(s)
- Júlia Domingo
- Systems Biology Program, Centre for Genomic Regulation, Barcelona Institute of Science and Technology, 08003 Barcelona, Spain; , ,
| | - Pablo Baeza-Centurion
- Systems Biology Program, Centre for Genomic Regulation, Barcelona Institute of Science and Technology, 08003 Barcelona, Spain; , ,
| | - Ben Lehner
- Systems Biology Program, Centre for Genomic Regulation, Barcelona Institute of Science and Technology, 08003 Barcelona, Spain; , , .,Universitat Pompeu Fabra, 08003 Barcelona, Spain.,Institució Catalana de Recerca i Estudis Avançats (ICREA), 08010 Barcelona, Spain
| |
Collapse
|
45
|
Koel BF, Burke DF, van der Vliet S, Bestebroer TM, Rimmelzwaan GF, Osterhaus ADME, Smith DJ, Fouchier RAM. Epistatic interactions can moderate the antigenic effect of substitutions in haemagglutinin of influenza H3N2 virus. J Gen Virol 2019; 100:773-777. [PMID: 31017567 DOI: 10.1099/jgv.0.001263] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
We previously showed that single amino acid substitutions at seven positions in haemagglutinin determined major antigenic change of influenza H3N2 virus. Here, the impact of two such substitutions was tested in 11 representative H3 haemagglutinins to investigate context-dependence effects. The antigenic effect of substitutions introduced at haemagglutinin position 145 was fully independent of the amino acid context of the representative haemagglutinins. Antigenic change caused by substitutions introduced at haemagglutinin position 155 was variable and context-dependent. Our results suggest that epistatic interactions with contextual amino acids in the haemagglutinin can moderate the magnitude of antigenic change.
Collapse
Affiliation(s)
- Björn F Koel
- 1 Department of Viroscience, Erasmus MC, Rotterdam, The Netherlands
| | - David F Burke
- 2 Center for Pathogen Evolution, Department of Zoology, University of Cambridge, Cambridge, UK
| | | | | | | | | | - Derek J Smith
- 1 Department of Viroscience, Erasmus MC, Rotterdam, The Netherlands
- 2 Center for Pathogen Evolution, Department of Zoology, University of Cambridge, Cambridge, UK
| | - Ron A M Fouchier
- 1 Department of Viroscience, Erasmus MC, Rotterdam, The Netherlands
| |
Collapse
|
46
|
Beleva Guthrie V, Masica DL, Fraser A, Federico J, Fan Y, Camps M, Karchin R. Network Analysis of Protein Adaptation: Modeling the Functional Impact of Multiple Mutations. Mol Biol Evol 2019. [PMID: 29522102 PMCID: PMC5967520 DOI: 10.1093/molbev/msy036] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
The evolution of new biochemical activities frequently involves complex dependencies between mutations and rapid evolutionary radiation. Mutation co-occurrence and covariation have previously been used to identify compensating mutations that are the result of physical contacts and preserve protein function and fold. Here, we model pairwise functional dependencies and higher order interactions that enable evolution of new protein functions. We use a network model to find complex dependencies between mutations resulting from evolutionary trade-offs and pleiotropic effects. We present a method to construct these networks and to identify functionally interacting mutations in both extant and reconstructed ancestral sequences (Network Analysis of Protein Adaptation). The time ordering of mutations can be incorporated into the networks through phylogenetic reconstruction. We apply NAPA to three distantly homologous β-lactamase protein clusters (TEM, CTX-M-3, and OXA-51), each of which has experienced recent evolutionary radiation under substantially different selective pressures. By analyzing the network properties of each protein cluster, we identify key adaptive mutations, positive pairwise interactions, different adaptive solutions to the same selective pressure, and complex evolutionary trajectories likely to increase protein fitness. We also present evidence that incorporating information from phylogenetic reconstruction and ancestral sequence inference can reduce the number of spurious links in the network, whereas preserving overall network community structure. The analysis does not require structural or biochemical data. In contrast to function-preserving mutation dependencies, which are frequently from structural contacts, gain-of-function mutation dependencies are most commonly between residues distal in protein structure.
Collapse
Affiliation(s)
- Violeta Beleva Guthrie
- Department of Biomedical Engineering and Institute for Computational Medicine, Johns Hopkins University, Baltimore, MD
| | - David L Masica
- Department of Biomedical Engineering and Institute for Computational Medicine, Johns Hopkins University, Baltimore, MD
| | - Andrew Fraser
- Department of Biomedical Engineering and Institute for Computational Medicine, Johns Hopkins University, Baltimore, MD
| | - Joseph Federico
- Department of Biomedical Engineering and Institute for Computational Medicine, Johns Hopkins University, Baltimore, MD
| | - Yunfan Fan
- Department of Biomedical Engineering and Institute for Computational Medicine, Johns Hopkins University, Baltimore, MD
| | - Manel Camps
- Department of Environmental Toxicology, University of California Santa Cruz, Santa Cruz, CA
| | - Rachel Karchin
- Department of Biomedical Engineering and Institute for Computational Medicine, Johns Hopkins University, Baltimore, MD.,Department of Oncology, Johns Hopkins University Medicine, Baltimore, MD
| |
Collapse
|
47
|
Ecological and Evolutionary Processes Shaping Viral Genetic Diversity. Viruses 2019; 11:v11030220. [PMID: 30841497 PMCID: PMC6466605 DOI: 10.3390/v11030220] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2019] [Revised: 02/22/2019] [Accepted: 02/27/2019] [Indexed: 02/07/2023] Open
Abstract
The contemporary genomic diversity of viruses is a result of the continuous and dynamic interaction of past ecological and evolutionary processes. Thus, genome sequences of viruses can be a valuable source of information about these processes. In this review, we first describe the relevant processes shaping viral genomic variation, with a focus on the role of host–virus coevolution and its potential to give rise to eco-evolutionary feedback loops. We further give a brief overview of available methodology designed to extract information about these processes from genomic data. Short generation times and small genomes make viruses ideal model systems to study the joint effect of complex coevolutionary and eco-evolutionary interactions on genetic evolution. This complexity, together with the diverse array of lifetime and reproductive strategies in viruses ask for extensions of existing inference methods, for example by integrating multiple information sources. Such integration can broaden the applicability of genetic inference methods and thus further improve our understanding of the role viruses play in biological communities.
Collapse
|
48
|
Existing Host Range Mutations Constrain Further Emergence of RNA Viruses. J Virol 2019; 93:JVI.01385-18. [PMID: 30463962 DOI: 10.1128/jvi.01385-18] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2018] [Accepted: 11/06/2018] [Indexed: 02/07/2023] Open
Abstract
RNA viruses are capable of rapid host shifting, typically due to a point mutation that confers expanded host range. As additional point mutations are necessary for further expansions, epistasis among host range mutations can potentially affect the mutational neighborhood and frequency of niche expansion. We mapped the mutational neighborhood of host range expansion using three genotypes of the double-stranded RNA (dsRNA) bacteriophage φ6 (wild type and two isogenic host range mutants) on the novel host Pseudomonas syringae pv. atrofaciens. Both Sanger sequencing of 50 P. syringae pv. atrofaciens mutant clones for each genotype and population Illumina sequencing revealed the same high-frequency mutations allowing infection of P. syringae pv. atrofaciens. Wild-type φ6 had at least nine different ways of mutating to enter the novel host, eight of which are in p3 (host attachment protein gene), and 13/50 clones had unchanged p3 genes. However, the two isogenic mutants had dramatically restricted neighborhoods: only one or two mutations, all in p3. Deep sequencing revealed that wild-type clones without mutations in p3 likely had changes in p12 (morphogenic protein), a region that was not polymorphic for the two isogenic host range mutants. Sanger sequencing confirmed that 10/13 of the wild-type φ6 clones had nonsynonymous mutations in p12, and 2 others had point mutations in p9 and p5. None of these genes had previously been associated with host range expansion in φ6. We demonstrate, for the first time, epistatic constraint in an RNA virus due to host range mutations themselves, which has implications for models of serial host range expansion.IMPORTANCE RNA viruses mutate rapidly and frequently expand their host ranges to infect novel hosts, leading to serial host shifts. Using an RNA bacteriophage model system (Pseudomonas phage φ6), we studied the impact of preexisting host range mutations on another host range expansion. Results from both clonal Sanger and Illumina sequencing show that extant host range mutations dramatically narrow the neighborhood of potential host range mutations compared to that of wild-type φ6. This research suggests that serial host-shifting viruses may follow a small number of molecular paths to enter additional novel hosts. We also identified new genes involved in φ6 host range expansion, expanding our knowledge of this important model system in experimental evolution.
Collapse
|
49
|
Shim H. Feature Learning of Virus Genome Evolution With the Nucleotide Skip-Gram Neural Network. Evol Bioinform Online 2019; 15:1176934318821072. [PMID: 30692845 PMCID: PMC6335656 DOI: 10.1177/1176934318821072] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2018] [Accepted: 11/15/2018] [Indexed: 12/14/2022] Open
Abstract
Recent studies reveal that even the smallest genomes such as viruses evolve through complex and stochastic processes, and the assumption of independent alleles is not valid in most applications. Advances in sequencing technologies produce multiple time-point whole-genome data, which enable potential interactions between these alleles to be investigated empirically. To investigate these interactions, we represent alleles as distributed vectors that encode for relationships with other alleles in the course of evolution and apply artificial neural networks to time-sampled whole-genome datasets for feature learning. We build this platform using methods and algorithms derived from natural language processing (NLP), and we denote it as the nucleotide skip-gram neural network. We learn distributed vectors of alleles using the changes in allele frequency of echovirus 11 in the presence or absence of the disinfectant (ClO2) from the experimental evolution data. Results from the training using a new open-source software TensorFlow show that the learned distributed vectors can be clustered using principal component analysis and hierarchical clustering to reveal a list of non-synonymous mutations that arise on the structural protein VP1 in connection to the candidate mutation for ClO2 adaptation. Furthermore, this method can account for recombination rates by setting the extent of interactions as a biological hyper-parameter, and the results show that the most realistic scenario of mid-range interactions across the genome is most consistent with the previous studies.
Collapse
Affiliation(s)
- Hyunjin Shim
- Artificial Intelligence Laboratory, Stanford University, Stanford, CA, USA.,School of Life Sciences, Ecole Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland.,Swiss Institute of Bioinformatics (SIB), Lausanne, Switzerland
| |
Collapse
|
50
|
Fragata I, Blanckaert A, Dias Louro MA, Liberles DA, Bank C. Evolution in the light of fitness landscape theory. Trends Ecol Evol 2019; 34:69-82. [DOI: 10.1016/j.tree.2018.10.009] [Citation(s) in RCA: 84] [Impact Index Per Article: 16.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2018] [Revised: 10/16/2018] [Accepted: 10/17/2018] [Indexed: 01/28/2023]
|