1
|
Rubio A, de Toro M, Pérez-Pulido AJ. The most exposed regions of SARS-CoV-2 structural proteins are subject to strong positive selection and gene overlap may locally modify this behavior. mSystems 2024; 9:e0071323. [PMID: 38095866 PMCID: PMC10804949 DOI: 10.1128/msystems.00713-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2023] [Accepted: 11/10/2023] [Indexed: 12/22/2023] Open
Abstract
The SARS-CoV-2 (severe acute respiratory syndrome coronavirus 2) pandemic that emerged in 2019 has been an unprecedented event in international science, as it has been possible to sequence millions of genomes, tracking their evolution very closely. This has enabled various types of secondary analyses of these genomes, including the measurement of their sequence selection pressure. In this work, we have been able to measure the selective pressure of all the described SARS-CoV-2 genes, even analyzed by sequence regions, and we show how this type of analysis allows us to separate the genes between those subject to positive selection (usually those that code for surface proteins or those exposed to the host immune system) and those subject to negative selection because they require greater conservation of their structure and function. We have also seen that when another gene with an overlapping reading frame appears within a gene sequence, the overlapping sequence between the two genes evolves under a stronger purifying selection than the average of the non-overlapping regions of the main gene. We propose this type of analysis as a useful tool for locating and analyzing all the genes of a viral genome when an adequate number of sequences are available.IMPORTANCEWe have analyzed the selection pressure of all severe acute respiratory syndrome coronavirus 2 genes by means of the nonsynonymous (Ka) to synonymous (Ks) substitution rate. We found that protein-coding genes are exposed to strong positive selection, especially in the regions of interaction with other molecules (host receptor and genome of the virus itself). However, overlapping coding regions are more protected and show negative selection. This suggests that this measure could be used to study viral gene function as well as overlapping genes.
Collapse
Affiliation(s)
- Alejandro Rubio
- Faculty of Experimental Sciences, Genetics Area, Andalusian Centre for Developmental Biology (CABD, UPO-CSIC-JA), University Pablo de Olavide, Sevilla, Spain
| | - Maria de Toro
- Genomics and Bioinformatics Core Facility, Center for Biomedical Research of La Rioja, Logroño, Spain
| | - Antonio J. Pérez-Pulido
- Faculty of Experimental Sciences, Genetics Area, Andalusian Centre for Developmental Biology (CABD, UPO-CSIC-JA), University Pablo de Olavide, Sevilla, Spain
| |
Collapse
|
3
|
Hou M, Shi J, Gong Z, Wen H, Lan Y, Deng X, Fan Q, Li J, Jiang M, Tang X, Wu CI, Li F, Ruan Y. Intra- vs. Interhost Evolution of SARS-CoV-2 Driven by Uncorrelated Selection-The Evolution Thwarted. Mol Biol Evol 2023; 40:msad204. [PMID: 37707487 PMCID: PMC10521905 DOI: 10.1093/molbev/msad204] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2023] [Revised: 09/05/2023] [Accepted: 09/07/2023] [Indexed: 09/15/2023] Open
Abstract
In viral evolution, a new mutation has to proliferate within the host (Stage I) in order to be transmitted and then compete in the host population (Stage II). We now analyze the intrahost single nucleotide variants (iSNVs) in a set of 79 SARS-CoV-2 infected patients with most transmissions tracked. Here, every mutation has two measures: 1) iSNV frequency within each individual host in Stage I; 2) occurrence among individuals ranging from 1 (private), 2-78 (public), to 79 (global) occurrences in Stage II. In Stage I, a small fraction of nonsynonymous iSNVs are sufficiently advantageous to rise to a high frequency, often 100%. However, such iSNVs usually fail to become public mutations. Thus, the selective forces in the two stages of evolution are uncorrelated and, possibly, antagonistic. For that reason, successful mutants, including many variants of concern, have to avoid being eliminated in Stage I when they first emerge. As a result, they may not have the transmission advantage to outcompete the dominant strains and, hence, are rare in the host population. Few of them could manage to slowly accumulate advantageous mutations to compete in Stage II. When they do, they would appear suddenly as in each of the six successive waves of SARS-CoV-2 strains. In conclusion, Stage I evolution, the gate-keeper, may contravene the long-term viral evolution and should be heeded in viral studies.
Collapse
Affiliation(s)
- Mei Hou
- State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-sen University, Guangzhou, China
| | - Jingrong Shi
- Guangzhou Eighth People's Hospital, Guangzhou Medical University, Guangzhou, China
| | - Zanke Gong
- State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-sen University, Guangzhou, China
| | - Haijun Wen
- State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-sen University, Guangzhou, China
| | - Yun Lan
- Guangzhou Eighth People's Hospital, Guangzhou Medical University, Guangzhou, China
| | - Xizi Deng
- Guangzhou Eighth People's Hospital, Guangzhou Medical University, Guangzhou, China
| | - Qinghong Fan
- Guangzhou Eighth People's Hospital, Guangzhou Medical University, Guangzhou, China
| | - Jiaojiao Li
- Guangzhou Eighth People's Hospital, Guangzhou Medical University, Guangzhou, China
| | - Mengling Jiang
- Guangzhou Eighth People's Hospital, Guangzhou Medical University, Guangzhou, China
| | - Xiaoping Tang
- Guangzhou Eighth People's Hospital, Guangzhou Medical University, Guangzhou, China
| | - Chung-I Wu
- State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-sen University, Guangzhou, China
| | - Feng Li
- Guangzhou Eighth People's Hospital, Guangzhou Medical University, Guangzhou, China
| | - Yongsen Ruan
- State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-sen University, Guangzhou, China
| |
Collapse
|
4
|
Rochman ND. In viral games, refs go to the replay. EMBO Rep 2023; 24:e56992. [PMID: 36876587 PMCID: PMC10074127 DOI: 10.15252/embr.202356992] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2023] [Accepted: 02/14/2023] [Indexed: 03/07/2023] Open
Abstract
After more than 2 years of intensive investigation, the direct ancestors of the novel severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) remain unidentified. Molecular epidemiology strongly supports a timeline marked by multiple, independent zoonoses in late 2019 (Pekar et al, 2022) solidifying the consensus hypothesis that close relatives of SARS-CoV-2 with high zoonotic potential were naturally circulating prior to the start of the pandemic (Andersen et al, 2020). Understanding where and when these ancestors acquired the genomic features that resulted in a virus with epidemic potential could enable the identification and mitigation of future pandemic viruses, even before the first human infection.
Collapse
Affiliation(s)
- Nash D Rochman
- National Center for Biotechnology InformationNational Library of MedicineBethesdaMDUSA
| |
Collapse
|
5
|
Correlated substitutions reveal SARS-like coronaviruses recombine frequently with a diverse set of structured gene pools. Proc Natl Acad Sci U S A 2023; 120:e2206945119. [PMID: 36693089 PMCID: PMC9945976 DOI: 10.1073/pnas.2206945119] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023] Open
Abstract
Quantifying SARS-like coronavirus (SL-CoV) evolution is critical to understanding the origins of SARS-CoV-2 and the molecular processes that could underlie future epidemic viruses. While genomic analyses suggest recombination was a factor in the emergence of SARS-CoV-2, few studies have quantified recombination rates among SL-CoVs. Here, we infer recombination rates of SL-CoVs from correlated substitutions in sequencing data using a coalescent model with recombination. Our computationally-efficient, non-phylogenetic method infers recombination parameters of both sampled sequences and the unsampled gene pools with which they recombine. We apply this approach to infer recombination parameters for a range of positive-sense RNA viruses. We then analyze a set of 191 SL-CoV sequences (including SARS-CoV-2) and find that ORF1ab and S genes frequently undergo recombination. We identify which SL-CoV sequence clusters have recombined with shared gene pools, and show that these pools have distinct structures and high recombination rates, with multiple recombination events occurring per synonymous substitution. We find that individual genes have recombined with different viral reservoirs. By decoupling contributions from mutation and recombination, we recover the phylogeny of non-recombined portions for many of these SL-CoVs, including the position of SARS-CoV-2 in this clonal phylogeny. Lastly, by analyzing >400,000 SARS-CoV-2 whole genome sequences, we show current diversity levels are insufficient to infer the within-population recombination rate of the virus since the pandemic began. Our work offers new methods for inferring recombination rates in RNA viruses with implications for understanding recombination in SARS-CoV-2 evolution and the structure of clonal relationships and gene pools shaping its origins.
Collapse
|
6
|
It takes a village to build a virus. Proc Natl Acad Sci U S A 2023; 120:e2219052120. [PMID: 36701364 PMCID: PMC9945952 DOI: 10.1073/pnas.2219052120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023] Open
|
7
|
Junk mail: Viral envelopes promote zoonoses. Proc Natl Acad Sci U S A 2023; 120:e2219962120. [PMID: 36623201 PMCID: PMC9933117 DOI: 10.1073/pnas.2219962120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023] Open
|
8
|
Rochman ND, Wolf YI, Koonin EV. Molecular adaptations during viral epidemics. EMBO Rep 2022; 23:e55393. [PMID: 35848484 PMCID: PMC9346483 DOI: 10.15252/embr.202255393] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2022] [Revised: 06/18/2022] [Accepted: 06/27/2022] [Indexed: 07/20/2023] Open
Abstract
In 1977, the world witnessed both the eradication of smallpox and the beginning of the modern age of genomics. Over the following half-century, 7 epidemic viruses of international concern galvanized virologists across the globe and led to increasingly extensive virus genome sequencing. These sequencing efforts exerted over periods of rapid adaptation of viruses to new hosts, in particular, humans provide insight into the molecular mechanisms underpinning virus evolution. Investment in virus genome sequencing was dramatically increased by the unprecedented support for phylogenomic analyses during the COVID-19 pandemic. In this review, we attempt to piece together comprehensive molecular histories of the adaptation of variola virus, HIV-1 M, SARS, H1N1-SIV, MERS, Ebola, Zika, and SARS-CoV-2 to the human host. Disruption of genes involved in virus-host interaction in animal hosts, recombination including genome segment reassortment, and adaptive mutations leading to amino acid replacements in virus proteins involved in host receptor binding and membrane fusion are identified as the key factors in the evolution of epidemic viruses.
Collapse
Affiliation(s)
- Nash D Rochman
- National Center for Biotechnology InformationNational Library of MedicineBethesdaMDUSA
| | - Yuri I Wolf
- National Center for Biotechnology InformationNational Library of MedicineBethesdaMDUSA
| | - Eugene V Koonin
- National Center for Biotechnology InformationNational Library of MedicineBethesdaMDUSA
| |
Collapse
|