1
Cao B, Wang X, Yin W, Gao Z, Xia B. The human microbiota is a beneficial reservoir for SARS-CoV-2 mutations. mBio 2024; 15:e0318723. [PMID: 38530031 DOI: 10.1128/mbio.03187-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/17/2023] [Accepted: 02/14/2024] [Indexed: 03/27/2024] Open
Abstract
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) mutations are rapidly emerging. In particular, beneficial mutations in the spike (S) protein, which can either increase transmissibility or enable immunological escape, pose a significant obstacle to pandemic prevention and treatment. However, how the virus acquires a high number of beneficial mutations in a short time remains a mystery. We demonstrate here that variants of concern may acquire mutations due in part to the influence of the human microbiome. After defining a viral mutation fragment as a mutation plus its six neighboring amino acids, we searched the National Center for Biotechnology Information database for homologous fragments (HFs). Among the approximately 8,000 HFs obtained, 61 mutations in S and other outer membrane proteins were found in bacteria, accounting for 62% of all mutation sources, which is 12-fold higher than the natural variable proportion. A significant proportion of these bacterial species, roughly 70%, come from the human microbiota, are mainly found in the lung or gut, and share a composition pattern with COVID-19 patients. Importantly, SARS-CoV-2 RNA-dependent RNA polymerase replicates corresponding bacterial mRNAs harboring mutations, producing chimeric RNAs. SARS-CoV-2 may collectively pick up mutations from the human microbiota that change the original virus's binding sites or antigenic determinants. Our study clarifies the evolving mutational mechanisms of SARS-CoV-2.
IMPORTANCE Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) mutations are rapidly emerging, in particular advantageous mutations in the spike (S) protein, which either increase transmissibility or lead to immune escape and are posing a major challenge to pandemic prevention and treatment. However, how the virus acquires a high number of advantageous mutations in a short time remains a mystery. Here, we provide evidence that the human microbiota is a reservoir of advantageous mutations and aids mutational evolution and host adaptation of SARS-CoV-2. Our findings demonstrate a conceptual breakthrough on the mutational evolution mechanisms of SARS-CoV-2 for human adaptation. SARS-CoV-2 may grab advantageous mutations from the widely existing microorganisms in the host, which is undoubtedly an "efficient" manner. Our study might open a new perspective to understand the evolution of virus mutation, which has enormous implications for comprehending the trajectory of the COVID-19 pandemic.
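The fragment-based search described in the abstract can be illustrated with a minimal sketch. It assumes that "six neighboring amino acids" means three residues on each side of the mutated position, and it uses exact substring matching as a stand-in for the database homology search; the function names, the `flank` parameter, and the toy sequences are hypothetical and are not taken from the paper.

```python
def mutation_fragment(protein: str, position: int, mutant: str, flank: int = 3) -> str:
    """Return the mutated residue plus `flank` residues on each side.

    `position` is 1-based. Assumption: six neighbors = three per side.
    """
    if not 1 <= position <= len(protein):
        raise ValueError("position outside the protein sequence")
    i = position - 1
    start = max(0, i - flank)
    end = min(len(protein), i + flank + 1)
    # Swap in the mutant residue while keeping the flanking context.
    return protein[start:i] + mutant + protein[i + 1:end]


def find_homologous(fragment: str, proteins: dict[str, str]) -> list[str]:
    """Names of proteins that contain the fragment exactly (toy homology test)."""
    return [name for name, seq in proteins.items() if fragment in seq]


# Toy data only; these are not real viral or bacterial sequences.
viral = "MKTAYIAKQR"
fragment = mutation_fragment(viral, position=5, mutant="H")
hits = find_homologous(fragment, {"toy_bacterial": "GGKTAHIAKLL",
                                  "toy_other": "GGKTAYIAKLL"})
print(fragment, hits)
```

A real search would use an alignment tool that tolerates mismatches rather than exact matching; the sketch only shows how a mutation-centered window is built and queried.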
Affiliation(s)
- Birong Cao
  - State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, China
  - University of Chinese Academy of Sciences, Beijing, China
  - Guangdong Guangya High School, Guangzhou, China
- Xiaoxi Wang
  - State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, China
  - University of Chinese Academy of Sciences, Beijing, China
- Wanchao Yin
  - State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, China
  - University of Chinese Academy of Sciences, Beijing, China
  - Zhongshan Institute for Drug Discovery, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Zhongshan, China
- Zhaobing Gao
  - State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, China
  - University of Chinese Academy of Sciences, Beijing, China
  - Zhongshan Institute for Drug Discovery, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Zhongshan, China
- Bingqing Xia
  - State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, China
  - University of Chinese Academy of Sciences, Beijing, China
2
Murray JM, Murray DD, Schvoerer E, Akand EH. SARS-CoV-2 Delta and Omicron community transmission networks as added value to contact tracing. J Infect 2024; 88:173-179. [PMID: 38242366 DOI: 10.1016/j.jinf.2024.01.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/09/2023] [Accepted: 01/14/2024] [Indexed: 01/21/2024]
Abstract
OBJECTIVES Calculations of SARS-CoV-2 transmission networks at a population level have been limited. Networks that estimate infections between individuals, and whether each transmission results in a mutation, can be used to evaluate the fitness of a mutational clone by how much it expands in number and to determine the likelihood that a transmission results in a new variant.
METHODS Australian Delta and Omicron SARS-CoV-2 sequences were downloaded from GISAID. Transmission networks of infection between individuals were estimated using a novel mathematical method.
RESULTS Many of the sequences were identical, with clone sizes following power law distributions driven by negative binomial probability distributions for both the number of infections per individual and the number of mutations per transmission (median 0.74 nucleotide changes for Delta and 0.71 for Omicron). Using these distributions, an agent-based model was able to replicate the observed clonal network structure, providing a basis for more detailed COVID-19 modelling. Possible recombination events, tracked by insertion/deletion (indel) patterns, were identified for each variant in these outbreaks.
CONCLUSIONS This modelling approach reveals key transmission characteristics of SARS-CoV-2 and may complement traditional contact tracing. This methodology can also be applied to other diseases as genetic sequencing of viruses becomes more commonplace.
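As a rough illustration of the kind of agent-based model the abstract mentions, the sketch below draws the number of infections per individual and the number of mutations per transmission from negative binomial distributions, and groups cases into clones of identical sequence. All parameter values (`r_inf`, `p_inf`, `r_mut`, `p_mut`, `max_cases`) and function names are hypothetical placeholders, not the fitted values or the method from the paper.

```python
import random
from collections import Counter, deque


def neg_binomial(rng: random.Random, r: int, p: float) -> int:
    """Number of failures before the r-th success (success probability p)."""
    failures = 0
    successes = 0
    while successes < r:
        if rng.random() < p:
            successes += 1
        else:
            failures += 1
    return failures


def simulate_clones(seed=0, max_cases=2000, r_inf=1, p_inf=0.45,
                    r_mut=1, p_mut=0.6) -> Counter:
    """Simulate transmissions and return clone id -> number of cases."""
    rng = random.Random(seed)
    clone_of = [0]        # clone id per case; case 0 founds clone 0
    next_clone = 1
    queue = deque([0])    # cases whose onward infections are still to be drawn
    while queue and len(clone_of) < max_cases:
        case = queue.popleft()
        for _ in range(neg_binomial(rng, r_inf, p_inf)):
            if len(clone_of) >= max_cases:
                break
            if neg_binomial(rng, r_mut, p_mut) == 0:
                clone = clone_of[case]   # identical sequence: same clone
            else:
                clone = next_clone       # any mutation starts a new clone
                next_clone += 1
            clone_of.append(clone)
            queue.append(len(clone_of) - 1)
    return Counter(clone_of)


sizes = simulate_clones(seed=1)
print(len(sizes), "clones;", sum(sizes.values()), "cases")
```

Plotting the resulting clone sizes on log-log axes is one way to check whether a toy run shows the heavy-tailed pattern the abstract reports; the cap on total cases is there only to keep the simulation bounded.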
Affiliation(s)
- John M Murray
  - School of Mathematics and Statistics, UNSW Sydney, NSW 2052, Australia
- Daniel D Murray
  - Centre of Excellence for Health, Immunity and Infections (CHIP), Rigshospitalet, University of Copenhagen, Copenhagen, Denmark
- Evelyne Schvoerer
  - Laboratory of Virology, University Hospital of Nancy Brabois, F-54500 Vandoeuvre-les-Nancy, France; Lorraine University, Laboratory of Physical Chemistry and Microbiology for Materials and the Environment, LCPME UMR 7564, CNRS, 405 Rue de Vandoeuvre, F-54600 Villers-lès-Nancy, France
- Elma H Akand
  - School of Mathematics and Statistics, UNSW Sydney, NSW 2052, Australia
3
Jeronimo PMC, Aksenen CF, Duarte IO, Lins RD, Miyajima F. Evolutionary deletions within the SARS-CoV-2 genome as signature trends for virus fitness and adaptation. J Virol 2024; 98:e0140423. [PMID: 38088350 PMCID: PMC10804945 DOI: 10.1128/jvi.01404-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 01/24/2024] Open
Abstract
Coronaviruses are large RNA viruses that can infect and spread among humans and animals. Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), responsible for coronavirus disease 2019, has evolved since its first detection in December 2019. Deletions are a common occurrence in SARS-CoV-2 evolution, particularly in specific genomic sites, and may be associated with the emergence of highly competent lineages. While deletions typically have a negative impact on viral fitness, some persist and become fixed in viral populations, indicating that they may confer benefits for the virus's adaptive evolution. This work presents a literature review and data analysis on structural losses in the SARS-CoV-2 genome and the potential relevance of specific signatures for enhanced viral fitness and spread.
Affiliation(s)
- Cleber Furtado Aksenen
  - Fiocruz Genomic Network, Oswaldo Cruz Foundation (FIOCRUZ), branch Ceara, Eusebio, Brazil
- Igor Oliveira Duarte
  - Fiocruz Genomic Network, Oswaldo Cruz Foundation (FIOCRUZ), branch Ceara, Eusebio, Brazil
- Roberto D. Lins
  - Fiocruz Genomic Network, Oswaldo Cruz Foundation (FIOCRUZ), branch Pernambuco, Recife, Brazil
- Fabio Miyajima
  - Fiocruz Genomic Network, Oswaldo Cruz Foundation (FIOCRUZ), branch Ceara, Eusebio, Brazil
4
Li X, Yan H, Wong G, Ouyang W, Cui J. Identifying featured indels associated with SARS-CoV-2 fitness. Microbiol Spectr 2023; 11:e0226923. [PMID: 37698427 PMCID: PMC10580940 DOI: 10.1128/spectrum.02269-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/01/2023] [Accepted: 07/14/2023] [Indexed: 09/13/2023] Open
Abstract
As an RNA virus, severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is known for frequent substitution mutations, and substitutions in important genome regions are often associated with viral fitness. However, whether indel mutations are related to viral fitness is generally ignored. Here we developed a computational methodology to investigate indels linked to fitness occurring in over 9 million SARS-CoV-2 genomes. Remarkably, by analyzing 31,642,404 deletion records and 1,981,308 insertion records, our pipeline identified 26,765 deletion types and 21,054 insertion types and discovered 65 indel types with a significant association with Pango lineages. We proposed the concept of featured indels, which, like substitution mutations, represent the population of specific Pango lineages and variants, and termed these 65 indels featured indels. The selective pressure of all indel types was assessed using a Bayesian model to explore the importance of indels. Our results showed that indels, like substitution mutations, are under higher selective pressure, which is important for assessing viral fitness and consistent with previous in vitro studies. Evaluation of the growth rate of each viral lineage indicated that indels play key roles in SARS-CoV-2 evolution and deserve as much attention as substitution mutations.
IMPORTANCE The fitness of indels in pathogen genome evolution has rarely been studied. We developed a computational methodology to investigate severe acute respiratory syndrome coronavirus 2 genomes and systematically analyze over 33 million indel records, ultimately proposing the concept of featured indels that can represent specific Pango lineages and identifying 65 featured indels. A machine learning model based on Bayesian inference and an evaluation of viral lineage growth rates suggest that these featured indels exhibit selection pressure comparable to replacement mutations. In conclusion, indels are not negligible for evaluating viral fitness.
Affiliation(s)
- Xiang Li
  - CAS Key Laboratory of Molecular Virology & Immunology, Shanghai Institute of Immunity and Infection, Chinese Academy of Sciences, Shanghai, China
  - AI for Science, Shanghai Artificial Intelligence Laboratory, Shanghai, China
  - University of Chinese Academy of Sciences, Beijing, China
- Hongliang Yan
  - AI for Science, Shanghai Artificial Intelligence Laboratory, Shanghai, China
- Gary Wong
  - CAS Key Laboratory of Molecular Virology & Immunology, Shanghai Institute of Immunity and Infection, Chinese Academy of Sciences, Shanghai, China
- Wanli Ouyang
  - AI for Science, Shanghai Artificial Intelligence Laboratory, Shanghai, China
- Jie Cui
  - CAS Key Laboratory of Molecular Virology & Immunology, Shanghai Institute of Immunity and Infection, Chinese Academy of Sciences, Shanghai, China
5
Aguilar Rangel M, Dolan PT, Taguwa S, Xiao Y, Andino R, Frydman J. High-resolution mapping reveals the mechanism and contribution of genome insertions and deletions to RNA virus evolution. Proc Natl Acad Sci U S A 2023; 120:e2304667120. [PMID: 37487061 PMCID: PMC10400975 DOI: 10.1073/pnas.2304667120] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Received: 03/21/2023] [Accepted: 06/07/2023] [Indexed: 07/26/2023] Open
Abstract
RNA viruses rapidly adapt to selective conditions due to the high intrinsic mutation rates of their RNA-dependent RNA polymerases (RdRps). Insertions and deletions (indels) in viral genomes are major contributors to both deleterious mutational load and evolutionary novelty, but remain understudied. To characterize the mechanistic details of their formation and evolutionary dynamics during infection, we developed a hybrid experimental-bioinformatic approach. This approach, called MultiMatch, extracts insertions and deletions from ultradeep sequencing experiments, including those occurring at extremely low frequencies, allowing us to map their genomic distribution and quantify the rates at which they occur. Mapping indel mutations in adapting poliovirus and dengue virus populations, we determine the rates of indel generation and identify mechanistic and functional constraints shaping indel diversity. Using poliovirus RdRp variants of distinct fidelity and genome recombination rates, we demonstrate tradeoffs between fidelity and indel generation. Additionally, we show that the need to maintain the translation frame and viral RNA structures constrains the indel landscape and that, due to these significant fitness effects, indels exert a significant deleterious load on adapting viral populations. Conversely, we uncover positively selected indels that modulate RNA structure, generate protein variants, and produce defective interfering genomes in viral populations. Together, our analyses establish the kinetic and mechanistic tradeoffs between misincorporation, recombination, and indel rates and reveal functional principles defining the central role of indels in virus evolution, emergence, and the regulation of viral infection.
Affiliation(s)
- Patrick T. Dolan
  - Department of Biology, Stanford University, Stanford, CA 94305
  - Department of Microbiology and Immunology, University of California, San Francisco, San Francisco, CA 94143
- Shuhei Taguwa
  - Department of Biology, Stanford University, Stanford, CA 94305
  - Research Institute for Microbial Diseases, Osaka University, Yamadaoka, Suita, Osaka 565-0871, Japan
- Yinghong Xiao
  - Department of Microbiology and Immunology, University of California, San Francisco, San Francisco, CA 94143
- Raul Andino
  - Department of Microbiology and Immunology, University of California, San Francisco, San Francisco, CA 94143
- Judith Frydman
  - Department of Biology, Stanford University, Stanford, CA 94305
6
Correlated substitutions reveal SARS-like coronaviruses recombine frequently with a diverse set of structured gene pools. Proc Natl Acad Sci U S A 2023; 120:e2206945119. [PMID: 36693089 PMCID: PMC9945976 DOI: 10.1073/pnas.2206945119] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Indexed: 01/25/2023] Open
Abstract
Quantifying SARS-like coronavirus (SL-CoV) evolution is critical to understanding the origins of SARS-CoV-2 and the molecular processes that could underlie future epidemic viruses. While genomic analyses suggest recombination was a factor in the emergence of SARS-CoV-2, few studies have quantified recombination rates among SL-CoVs. Here, we infer recombination rates of SL-CoVs from correlated substitutions in sequencing data using a coalescent model with recombination. Our computationally-efficient, non-phylogenetic method infers recombination parameters of both sampled sequences and the unsampled gene pools with which they recombine. We apply this approach to infer recombination parameters for a range of positive-sense RNA viruses. We then analyze a set of 191 SL-CoV sequences (including SARS-CoV-2) and find that ORF1ab and S genes frequently undergo recombination. We identify which SL-CoV sequence clusters have recombined with shared gene pools, and show that these pools have distinct structures and high recombination rates, with multiple recombination events occurring per synonymous substitution. We find that individual genes have recombined with different viral reservoirs. By decoupling contributions from mutation and recombination, we recover the phylogeny of non-recombined portions for many of these SL-CoVs, including the position of SARS-CoV-2 in this clonal phylogeny. Lastly, by analyzing >400,000 SARS-CoV-2 whole genome sequences, we show current diversity levels are insufficient to infer the within-population recombination rate of the virus since the pandemic began. Our work offers new methods for inferring recombination rates in RNA viruses with implications for understanding recombination in SARS-CoV-2 evolution and the structure of clonal relationships and gene pools shaping its origins.
7
Miton CM, Tokuriki N. Insertions and Deletions (Indels): A Missing Piece of the Protein Engineering Jigsaw. Biochemistry 2023; 62:148-157. [PMID: 35830609 DOI: 10.1021/acs.biochem.2c00188] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Indexed: 02/08/2023]
Abstract
Over the years, protein engineers have studied nature and borrowed its tricks to accelerate protein evolution in the test tube. While there have been considerable advances, our ability to generate new proteins in the laboratory is seemingly limited. One explanation for these shortcomings may be that insertions and deletions (indels), which frequently arise in nature, are largely overlooked during protein engineering campaigns. The profound effect of indels on protein structures, by way of drastic backbone alterations, could be perceived as "saltation" events that bring about significant phenotypic changes in a single mutational step. Should we leverage these effects to accelerate protein engineering and gain access to unexplored regions of adaptive landscapes? In this Perspective, we describe the role played by indels in the functional diversification of proteins in nature and discuss their untapped potential for protein engineering, despite their often-destabilizing nature. We hope to spark a renewed interest in indels, emphasizing that their wider study and use may prove insightful and shape the future of protein engineering by unlocking unique functional changes that substitutions alone could never achieve.
Affiliation(s)
- Charlotte M Miton
  - Michael Smith Laboratories, University of British Columbia, Vancouver, V6T 1Z4 BC, Canada
- Nobuhiko Tokuriki
  - Michael Smith Laboratories, University of British Columbia, Vancouver, V6T 1Z4 BC, Canada
8
Influence of viral genome properties on polymerase fidelity. Trends Genet 2023; 39:9-14. [PMID: 36402624 DOI: 10.1016/j.tig.2022.10.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/05/2022] [Revised: 10/27/2022] [Accepted: 10/30/2022] [Indexed: 11/17/2022]
Abstract
The first step of viral evolution takes place during genome replication via the error-prone viral polymerase. Among the mutants that arise through this process, only a few well-adapted variants will be selected by natural selection, renewing the viral genome population. Viral polymerase-mediated errors are thought to occur stochastically. However, accumulating evidence suggests that viral polymerase-mediated mutations are heterogeneously distributed throughout the viral genome. Here, we review work that supports this concept and provides mechanistic insights into how specific features of the viral genome could modulate viral polymerase-mediated errors. A predisposition to accumulate viral polymerase-mediated errors at specific loci in the viral genome may guide evolution to specific pathways, thus opening new directions of research to better understand viral evolutionary dynamics.
9
Akaishi T, Fujiwara K, Ishii T. Genetic Recombination Sites Away from the Insertion/Deletion Hotspots in SARS-Related Coronaviruses. Tohoku J Exp Med 2023; 259:17-26. [PMID: 36351613 DOI: 10.1620/tjem.2022.j093] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 10/29/2023]
Affiliation(s)
- Kei Fujiwara
  - Department of Gastroenterology and Metabolism, Nagoya City University
- Tadashi Ishii
  - Department of Education and Support for Regional Medicine, Tohoku University
10
Singh P, Sharma K, Shaw D, Bhargava A, Negi SS. Mosaic Recombination Inflicted Various SARS-CoV-2 Lineages to Emerge into Novel Virus Variants: a Review Update. Indian J Clin Biochem 2022; 38:1-8. [PMID: 36569378 PMCID: PMC9759274 DOI: 10.1007/s12291-022-01109-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/21/2022] [Accepted: 12/08/2022] [Indexed: 12/23/2022]
Abstract
Human coronaviruses (hCoVs) belong to a large and diverse family of positive-sense, non-segmented, single-stranded RNA viruses. RNA viruses are prone to high rates of mutational recombination, resulting in the emergence of evolutionary variants with altered features, including transmissibility and severity. These evolutionary changes affect immune escape and reduce the effectiveness of diagnostic and therapeutic measures, as variants become undetectable by currently available diagnostics and refractory to therapeutics and vaccines. Whole genome sequencing studies from various countries have reported mosaic recombination between different lineages of SARS-CoV-2, whereby the RNA-dependent RNA polymerase (RdRp) gene reconnects with a homologous RNA strand at diverse positions. All of this leads to the evolutionary emergence of new variants/lineages, as evident from the emergence of XBB in India at the time of writing this review. Continuous periodic genomic surveillance is required to understand the various lineages involved in recombination that emerge as hybrid variants. This may further help in assessing virus transmission dynamics, virulence, and severity factors so that health authorities can take appropriate and timely action for the prevention and control of any future COVID-19 outbreak.
Affiliation(s)
- Pushpendra Singh
  - Department of Microbiology, All India Institute of Medical Sciences, Raipur, Chhattisgarh India
- Kuldeep Sharma
  - Department of Microbiology, All India Institute of Medical Sciences, Raipur, Chhattisgarh India
- Dipika Shaw
  - Department of Microbiology, All India Institute of Medical Sciences, Raipur, Chhattisgarh India
- Anudita Bhargava
  - Department of Microbiology, All India Institute of Medical Sciences, Raipur, Chhattisgarh India
- Sanjay Singh Negi
  - Department of Microbiology, All India Institute of Medical Sciences, Raipur, Chhattisgarh India
11
Panzera Y, Cortinas MN, Marandino A, Calleros L, Bormida V, Goñi N, Techera C, Grecco S, Williman J, Ramas V, Coppola L, Mogdasy C, Chiparelli H, Pérez R. Emergence and spreading of the largest SARS-CoV-2 deletion in the Delta AY.20 lineage from Uruguay. Gene Reports 2022; 29:101703. [PMID: 36338321 PMCID: PMC9617655 DOI: 10.1016/j.genrep.2022.101703] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/21/2022] [Revised: 10/24/2022] [Accepted: 10/26/2022] [Indexed: 11/06/2022]
Abstract
The genetic variability of SARS-CoV-2 (genus Betacoronavirus, family Coronaviridae) has been scrutinized since its first detection in December 2019. Although the role of structural variants, particularly deletions, in virus evolution is little explored, these genome changes are extremely frequent. They are associated with relevant processes, including immune escape and attenuation. Deletions commonly occur in accessory ORFs and might even lead to the complete loss of one or more ORFs. This scenario poses an interesting question about the origin and spreading of extreme structural rearrangements that persist without compromising virus viability. Here, we analyze the genome of SARS-CoV-2 in late 2021 in Uruguay and identify a Delta lineage (AY.20) that experienced a large deletion (872 nucleotides according to the reference Wuhan strain) that removes the 7a, 7b, and 8 ORFs. Viruses carrying the deletion coexist with wild-type (without deletion) AY.20 and AY.43 strains. The Uruguayan deletion is like those identified in Delta strains from Poland and Japan but occurs in a different Delta clade. Besides providing proof of the circulation of this large deletion in America, we infer that the 872-nucleotide deletion arises by the consecutive occurrence of a 6-nucleotide deletion, characteristic of Delta strains, and an 866-nucleotide deletion that arose independently in the AY.20 Uruguayan lineage. The largest deletion occurs adjacent to transcription regulatory sequences needed to synthesize the nested set of subgenomic mRNAs that serve as templates for transcription. Our findings support the role of transcription sequences as a hotspot for copy-choice recombination and highlight the remarkable dynamic of SARS-CoV-2 genomes.
Affiliation(s)
- Yanina Panzera
  - Sección Genética Evolutiva, Departamento de Biología Animal, Instituto de Biología, Facultad de Ciencias, Universidad de la República, Iguá 4225, 11400 Montevideo, Uruguay
- María Noel Cortinas
  - Genómica, Departamento de Laboratorios de Salud Pública, Ministerio de Salud Pública, Alfredo Navarro 3051 (entrada N), 11600 Montevideo, Uruguay
- Ana Marandino
  - Sección Genética Evolutiva, Departamento de Biología Animal, Instituto de Biología, Facultad de Ciencias, Universidad de la República, Iguá 4225, 11400 Montevideo, Uruguay
- Lucía Calleros
  - Sección Genética Evolutiva, Departamento de Biología Animal, Instituto de Biología, Facultad de Ciencias, Universidad de la República, Iguá 4225, 11400 Montevideo, Uruguay
- Victoria Bormida
  - Genómica, Departamento de Laboratorios de Salud Pública, Ministerio de Salud Pública, Alfredo Navarro 3051 (entrada N), 11600 Montevideo, Uruguay
- Natalia Goñi
  - Centro Nacional de Referencia de Influenza y otros Virus Respiratorios, Departamento de Laboratorios de Salud Pública, Ministerio de Salud Pública, Alfredo Navarro 3051 (entrada N), 11600 Montevideo, Uruguay
- Claudia Techera
  - Sección Genética Evolutiva, Departamento de Biología Animal, Instituto de Biología, Facultad de Ciencias, Universidad de la República, Iguá 4225, 11400 Montevideo, Uruguay
- Sofía Grecco
  - Sección Genética Evolutiva, Departamento de Biología Animal, Instituto de Biología, Facultad de Ciencias, Universidad de la República, Iguá 4225, 11400 Montevideo, Uruguay
- Joaquín Williman
  - Sección Genética Evolutiva, Departamento de Biología Animal, Instituto de Biología, Facultad de Ciencias, Universidad de la República, Iguá 4225, 11400 Montevideo, Uruguay
- Viviana Ramas
  - Centro Nacional de Referencia de Influenza y otros Virus Respiratorios, Departamento de Laboratorios de Salud Pública, Ministerio de Salud Pública, Alfredo Navarro 3051 (entrada N), 11600 Montevideo, Uruguay
- Leticia Coppola
  - Centro Nacional de Referencia de Influenza y otros Virus Respiratorios, Departamento de Laboratorios de Salud Pública, Ministerio de Salud Pública, Alfredo Navarro 3051 (entrada N), 11600 Montevideo, Uruguay
- Cristina Mogdasy
  - Centro Nacional de Referencia de Influenza y otros Virus Respiratorios, Departamento de Laboratorios de Salud Pública, Ministerio de Salud Pública, Alfredo Navarro 3051 (entrada N), 11600 Montevideo, Uruguay
- Héctor Chiparelli
  - Centro Nacional de Referencia de Influenza y otros Virus Respiratorios, Departamento de Laboratorios de Salud Pública, Ministerio de Salud Pública, Alfredo Navarro 3051 (entrada N), 11600 Montevideo, Uruguay
- Ruben Pérez
  - Sección Genética Evolutiva, Departamento de Biología Animal, Instituto de Biología, Facultad de Ciencias, Universidad de la República, Iguá 4225, 11400 Montevideo, Uruguay
12
On the Origins of Omicron's Unique Spike Gene Insertion. Vaccines (Basel) 2022; 10:1509. [PMID: 36146586 PMCID: PMC9504260 DOI: 10.3390/vaccines10091509] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 06/16/2022] [Revised: 08/26/2022] [Accepted: 08/26/2022] [Indexed: 01/28/2023] Open
Abstract
The emergence of a heavily mutated SARS-CoV-2 variant (Omicron; Pango lineage B.1.1.529 and BA sublineages) and its rapid spread to over 75 countries raised a global public health alarm. Characterizing the mutational profile of Omicron is necessary to interpret its clinical phenotypes which are shared with or distinctive from those of other SARS-CoV-2 variants. We compared the mutations of the initially circulating Omicron variant (now known as BA.1) with prior variants of concern (Alpha, Beta, Gamma, and Delta), variants of interest (Lambda, Mu, Eta, Iota, and Kappa), and ~1500 SARS-CoV-2 lineages constituting ~5.8 million SARS-CoV-2 genomes. Omicron's Spike protein harbors 26 amino acid mutations (23 substitutions, 2 deletions, and 1 insertion) that are distinct compared to other variants of concern. While the substitution and deletion mutations appeared in previous SARS-CoV-2 lineages, the insertion mutation (ins214EPE) was not previously observed in any other SARS-CoV-2 lineage. Here, we consider and discuss various mechanisms through which the nucleotide sequence encoding for ins214EPE could have been acquired, including local duplication, polymerase slippage, and template switching. Although we are not able to definitively determine the mechanism, we highlight the plausibility of template switching. Analysis of the homology of the inserted nucleotide sequence and flanking regions suggests that this template-switching event could have involved the genomes of SARS-CoV-2 variants (e.g., the B.1.1 strain), other human coronaviruses that infect the same host cells as SARS-CoV-2 (e.g., HCoV-OC43 or HCoV-229E), or a human transcript expressed in a host cell that was infected by the Omicron precursor.
13
Lessons Learned and Yet-to-Be Learned on the Importance of RNA Structure in SARS-CoV-2 Replication. Microbiol Mol Biol Rev 2022; 86:e0005721. [PMID: 35862724 PMCID: PMC9491204 DOI: 10.1128/mmbr.00057-21] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/30/2022] Open
Abstract
SARS-CoV-2, the etiological agent responsible for the COVID-19 pandemic, is a member of the virus family Coronaviridae, known for relatively extensive (~30-kb) RNA genomes that not only encode for numerous proteins but are also capable of forming elaborate structures. As highlighted in this review, these structures perform critical functions in various steps of the viral life cycle, ultimately impacting pathogenesis and transmissibility. We examine these elements in the context of coronavirus evolutionary history and future directions for curbing the spread of SARS-CoV-2 and other potential human coronaviruses. While we focus on structures supported by a variety of biochemical, biophysical, and/or computational methods, we also touch here on recent evidence for novel structures in both protein-coding and noncoding regions of the genome, including an assessment of the potential role for RNA structure in the controversial finding of SARS-CoV-2 integration in “long COVID” patients. This review aims to serve as a consolidation of previous works on coronavirus and more recent investigation of SARS-CoV-2, emphasizing the need for improved understanding of the role of RNA structure in the evolution and adaptation of these human viruses.

14
Putative Host-Derived Insertions in the Genomes of Circulating SARS-CoV-2 Variants. mSystems 2022; 7:e0017922. [PMID: 35582907 PMCID: PMC9239191 DOI: 10.1128/msystems.00179-22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/20/2022] Open
Abstract
Insertions in the SARS-CoV-2 genome have the potential to drive viral evolution, but the source of the insertions is often unknown. Recent proposals have suggested that human RNAs could be a source of some insertions, but the small size of many insertions makes this difficult to confirm. Through an analysis of available direct RNA sequencing data from SARS-CoV-2-infected cells, we show that viral-host chimeric RNAs are formed through what are likely stochastic RNA-dependent RNA polymerase template-switching events. Through an analysis of the publicly available GISAID SARS-CoV-2 genome collection, we identified two genomic insertions in circulating SARS-CoV-2 variants that are identical to regions of the human 18S and 28S rRNAs. These results provide direct evidence of the formation of viral-host chimeric sequences and the integration of host genetic material into the SARS-CoV-2 genome, highlighting the potential importance of host-derived insertions in viral evolution. IMPORTANCE Throughout the COVID-19 pandemic, the sequencing of SARS-CoV-2 genomes has revealed the presence of insertions in multiple globally circulating lineages of SARS-CoV-2, including the Omicron variant. The human genome has been suggested to be the source of some of the larger insertions, but evidence for this kind of event occurring is still lacking. Here, we leverage direct RNA sequencing data and SARS-CoV-2 genomes to show that host-viral chimeric RNAs are generated in infected cells and two large genomic insertions have likely been formed through the incorporation of host rRNA fragments into the SARS-CoV-2 genome. These host-derived insertions may increase the genetic diversity of SARS-CoV-2 and expand its strategies to acquire genetic material, potentially enhancing its adaptability, virulence, and spread.

15
Gerdol M, Dishnica K, Giorgetti A. Emergence of a recurrent insertion in the N-terminal domain of the SARS-CoV-2 spike glycoprotein. Virus Res 2022; 310:198674. [PMID: 35021068 PMCID: PMC8743576 DOI: 10.1016/j.virusres.2022.198674] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Received: 09/15/2021] [Revised: 01/06/2022] [Accepted: 01/07/2022] [Indexed: 12/12/2022]
Abstract
Tracking the evolution of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) through genomic surveillance programs is undoubtedly one of the key priorities in the current pandemic situation. Although the genome of SARS-CoV-2 acquires mutations at a slower rate compared with other RNA viruses, evolutionary pressures derived from the widespread circulation of SARS-CoV-2 in the human population have progressively favored the global emergence, through natural selection, of several variants of concern that carry multiple non-synonymous mutations in the spike glycoprotein. These are often placed in key sites within major antibody epitopes and may therefore confer resistance to neutralizing antibodies, leading to partial immune escape, or otherwise compensate for infectivity deficits associated with other non-synonymous substitutions. As previously shown by other authors, several emerging variants carry recurrent deletion regions (RDRs) that display a partial overlap with antibody epitopes located in the spike N-terminal domain (NTD). Comparatively, very little attention had been directed towards spike insertion mutations prior to the emergence of the B.1.1.529 (omicron) lineage. This manuscript describes a single recurrent insertion region (RIR1) in the N-terminal domain of SARS-CoV-2 spike protein, characterized by at least 49 independent acquisitions of 1-8 additional codons between Val213 and Leu216 in different viral lineages. Even though RIR1 is unlikely to confer antibody escape, its association with two distinct formerly widespread lineages (A.2.5 and B.1.214.2), with the quickly spreading omicron and with other VOCs and VOIs warrants further investigation concerning its effects on spike structure and viral infectivity.
Affiliation(s)
- Marco Gerdol
  - University of Trieste, Department of Life Sciences, 34127 Trieste, Italy
- Klevia Dishnica
  - University of Verona, Department of Biotechnology, 37134 Verona, Italy

16
Hassanin A, Rambaud O, Klein D. Genomic Bootstrap Barcodes and Their Application to Study the Evolution of Sarbecoviruses. Viruses 2022; 14:440. [PMID: 35216033 PMCID: PMC8879460 DOI: 10.3390/v14020440] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Received: 01/05/2022] [Revised: 02/07/2022] [Accepted: 02/18/2022] [Indexed: 01/22/2023] Open
Abstract
Recombination creates mosaic genomes containing regions with mixed ancestry, and the accumulation of such events over time can greatly complicate many aspects of evolutionary inference. Here, we developed a sliding window bootstrap (SWB) method to generate genomic bootstrap (GB) barcodes to highlight the regions supporting phylogenetic relationships. The method was applied to an alignment of 56 sarbecoviruses, including SARS-CoV and SARS-CoV-2, responsible for the SARS epidemic and COVID-19 pandemic, respectively. The SWB analyses were also used to construct a consensus tree showing the most reliable relationships and better interpret hidden phylogenetic signals. Our results revealed that most relationships were supported by just a few genomic regions and confirmed that three divergent lineages could be found in bats from Yunnan: SCoVrC, which groups SARS-CoV related coronaviruses from China; SCoV2rC, which includes SARS-CoV-2 related coronaviruses from Southeast Asia and Yunnan; and YunSar, which contains a few highly divergent viruses recently described in Yunnan. The GB barcodes showed evidence for ancient recombination between SCoV2rC and YunSar genomes, as well as more recent recombination events between SCoVrC and SCoV2rC genomes. The recombination and phylogeographic patterns suggest a strong host-dependent selection of the viral RNA-dependent RNA polymerase. In addition, SARS-CoV-2 appears as a mosaic genome composed of regions sharing recent ancestry with three bat SCoV2rCs from Yunnan (RmYN02, RpYN06, and RaTG13) or related to more ancient ancestors in bats from Yunnan and Southeast Asia. Finally, our results suggest that viral circular RNAs may be key molecules for the mechanism of recombination.

17
Malone B, Urakova N, Snijder EJ, Campbell EA. Structures and functions of coronavirus replication-transcription complexes and their relevance for SARS-CoV-2 drug design. Nat Rev Mol Cell Biol 2022; 23:21-39. [PMID: 34824452 PMCID: PMC8613731 DOI: 10.1038/s41580-021-00432-z] [Citation(s) in RCA: 184] [Impact Index Per Article: 92.0] [Accepted: 10/22/2021] [Indexed: 02/08/2023]
Abstract
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has killed millions of people and continues to cause massive global upheaval. Coronaviruses are positive-strand RNA viruses with an unusually large genome of ~30 kb. They express an RNA-dependent RNA polymerase and a cohort of other replication enzymes and supporting factors to transcribe and replicate their genomes. The proteins performing these essential processes are prime antiviral drug targets, but drug discovery is hindered by our incomplete understanding of coronavirus RNA synthesis and processing. In infected cells, the RNA-dependent RNA polymerase must coordinate with other viral and host factors to produce both viral mRNAs and new genomes. Recent research aiming to decipher and contextualize the structures, functions and interplay of the subunits of the SARS-CoV-2 replication and transcription complex proteins has burgeoned. In this Review, we discuss recent advancements in our understanding of the molecular basis and complexity of the coronavirus RNA-synthesizing machinery. Specifically, we outline the mechanisms and regulation of RNA translation, replication and transcription. We also discuss the composition of the replication and transcription complexes and their suitability as targets for antiviral therapy.
Affiliation(s)
- Brandon Malone
  - Laboratory of Molecular Biophysics, The Rockefeller University, New York, NY, USA
- Nadya Urakova
  - Department of Medical Microbiology, Leiden University Medical Center, Leiden, Netherlands
- Eric J. Snijder
  - Department of Medical Microbiology, Leiden University Medical Center, Leiden, Netherlands
- Elizabeth A. Campbell
  - Laboratory of Molecular Biophysics, The Rockefeller University, New York, NY, USA

18
Sonnleitner ST, Sonnleitner S, Hinterbichler E, Halbfurter H, Kopecky DB, Koblmüller S, Sturmbauer C, Posch W, Walder G. The mutational dynamics of the SARS-CoV-2 virus in serial passages in vitro. Virol Sin 2022; 37:198-207. [PMID: 35277373 PMCID: PMC8800542 DOI: 10.1016/j.virs.2022.01.029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/26/2021] [Accepted: 01/21/2022] [Indexed: 11/04/2022] Open
Abstract
Since its outbreak in 2019, Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) keeps surprising the medical community by evolving diverse immune escape mutations in a rapid and effective manner. To gain deeper insight into mutation frequency and dynamics, we isolated ten ancestral strains of SARS-CoV-2 and performed consecutive serial incubation in ten replications in a suitable and common cell line and subsequently analysed them using RT-qPCR and whole genome sequencing. Along those lines we hoped to gain fundamental insights into the evolutionary capacity of SARS-CoV-2 in vitro. Our results identified a series of adaptive genetic changes, ranging from unique convergent substitutional mutations to hitherto undescribed insertions. The region coding for spike proved to be a mutational hotspot, evolving a number of mutational changes including the already known substitutions at positions S:484 and S:501. We discussed the evolution of all specific adaptations as well as possible reasons for the seemingly inhomogeneous potential of SARS-CoV-2 in the adaptation to cell culture. The combination of serial passage in vitro with whole genome sequencing uncovers the immense mutational potential of some SARS-CoV-2 strains. The observed genetic changes of SARS-CoV-2 in vitro could not be explained solely by selectively neutral mutations but possibly resulted from the action of directional selection accumulating favourable genetic changes in the evolving variants, along the path of increasing potency of the strain. Competition among a high number of quasi-species in the SARS-CoV-2 in vitro population gene pool may reinforce directional selection and boost the speed of evolutionary change.
Ten genetically similar strains evolved very differently in serial passage in vitro. Observed mutations included substitutions at important spike positions. The three strains with the highest replication rates developed two convergent mutations. Via directional selection favourable genetic changes are accumulated. Competition among many quasi-species boosts the speed of evolutionary change.

19
Rao RSP, Ahsan N, Xu C, Su L, Verburgt J, Fornelli L, Kihara D, Xu D. Evolutionary Dynamics of Indels in SARS-CoV-2 Spike Glycoprotein. Evol Bioinform Online 2021; 17:11769343211064616. [PMID: 34898980 PMCID: PMC8655444 DOI: 10.1177/11769343211064616] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 08/07/2021] [Accepted: 11/12/2021] [Indexed: 01/28/2023] Open
Abstract
SARS-CoV-2, responsible for the current COVID-19 pandemic that claimed over 5.0 million lives, belongs to a class of enveloped viruses that undergo quick evolutionary adjustments under selection pressure. Numerous variants have emerged in SARS-CoV-2, posing a serious challenge to the global vaccination effort and COVID-19 management. The evolutionary dynamics of this virus are only beginning to be explored. In this work, we have analysed 1.79 million spike glycoprotein sequences of SARS-CoV-2 and found that the virus is fine-tuning the spike with numerous amino acid insertions and deletions (indels). Indels seem to have a selective advantage as the proportions of sequences with indels steadily increased over time, currently at over 89%, with similar trends across countries/variants. There were as many as 420 unique indel positions and 447 unique combinations of indels. Despite their high frequency, indels resulted in only minimal alteration of N-glycosylation sites, including both gain and loss. As indels and point mutations are positively correlated and sequences with indels have significantly more point mutations, they have implications in the evolutionary dynamics of the SARS-CoV-2 spike glycoprotein.
Affiliation(s)
- R Shyama Prasad Rao
  - Biostatistics and Bioinformatics Division, Yenepoya Research Center, Yenepoya University, Mangaluru, Karnataka, India
- Nagib Ahsan
  - Department of Chemistry and Biochemistry, University of Oklahoma, Norman, OK, USA
  - Mass Spectrometry, Proteomics and Metabolomics Core Facility, Stephenson Life Sciences Research Center, University of Oklahoma, Norman, OK, USA
- Chunhui Xu
  - Department of Electrical Engineering and Computer Science, Informatics Institute, and Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, MO, USA
- Lingtao Su
  - Department of Electrical Engineering and Computer Science, Informatics Institute, and Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, MO, USA
- Jacob Verburgt
  - Department of Biological Sciences, Purdue University, West Lafayette, IN, USA
- Luca Fornelli
  - Department of Chemistry and Biochemistry, University of Oklahoma, Norman, OK, USA
  - Department of Biology, University of Oklahoma, Norman, OK, USA
- Daisuke Kihara
  - Department of Biological Sciences, Purdue University, West Lafayette, IN, USA
  - Department of Computer Science, Purdue University, West Lafayette, IN, USA
- Dong Xu
  - Department of Electrical Engineering and Computer Science, Informatics Institute, and Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, MO, USA

20
Garushyants SK, Rogozin IB, Koonin EV. Template switching and duplications in SARS-CoV-2 genomes give rise to insertion variants that merit monitoring. Commun Biol 2021; 4:1343. [PMID: 34848826 PMCID: PMC8632935 DOI: 10.1038/s42003-021-02858-9] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Received: 10/04/2021] [Accepted: 11/01/2021] [Indexed: 12/29/2022] Open
Abstract
The appearance of multiple new SARS-CoV-2 variants during the COVID-19 pandemic is a matter of grave concern. Some of these variants, such as B.1.617.2, B.1.1.7, and B.1.351, manifest higher infectivity and virulence than the earlier SARS-CoV-2 variants, with potential dramatic effects on the course of the pandemic. So far, analysis of new SARS-CoV-2 variants focused primarily on nucleotide substitutions and short deletions that are readily identifiable by comparison to consensus genome sequences. In contrast, insertions have largely escaped the attention of researchers although the furin site insert in the Spike (S) protein is thought to be a determinant of SARS-CoV-2 virulence. Here, we identify 346 unique inserts of different lengths in SARS-CoV-2 genomes and present evidence that these inserts reflect actual virus variance rather than sequencing artifacts. Two principal mechanisms appear to account for the inserts in the SARS-CoV-2 genomes, polymerase slippage and template switch that might be associated with the synthesis of subgenomic RNAs. At least three inserts in the N-terminal domain of the S protein are predicted to lead to escape from neutralizing antibodies, whereas other inserts might result in escape from T-cell immunity. Thus, inserts in the S protein can affect its antigenic properties and merit monitoring.
Affiliation(s)
- Sofya K Garushyants
  - National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
- Igor B Rogozin
  - National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
- Eugene V Koonin
  - National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA

21
Garushyants SK, Rogozin IB, Koonin EV. Insertions in SARS-CoV-2 genome caused by template switch and duplications give rise to new variants that merit monitoring. bioRxiv 2021:2021.04.23.441209. [PMID: 33907754 PMCID: PMC8077628 DOI: 10.1101/2021.04.23.441209] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Indexed: 05/25/2023]
Abstract
The appearance of multiple new SARS-CoV-2 variants during the winter of 2020-2021 is a matter of grave concern. Some of these new variants, such as B.1.617.2, B.1.1.7, and B.1.351, manifest higher infectivity and virulence than the earlier SARS-CoV-2 variants, with potential dramatic effects on the course of the COVID-19 pandemic. So far, analysis of new SARS-CoV-2 variants focused primarily on point nucleotide substitutions and short deletions that are readily identifiable by comparison to consensus genome sequences. In contrast, insertions have largely escaped the attention of researchers although the furin site insert in the spike protein is thought to be a determinant of SARS-CoV-2 virulence and other inserts might have contributed to coronavirus pathogenicity as well. Here, we investigate insertions in SARS-CoV-2 genomes and identify 347 unique inserts of different lengths. We present evidence that these inserts reflect actual virus variance rather than sequencing errors. Two principal mechanisms appear to account for the inserts in the SARS-CoV-2 genomes, polymerase slippage and template switch that might be associated with the synthesis of subgenomic RNAs. We show that inserts in the Spike glycoprotein can affect its antigenic properties and thus merit monitoring. At least three inserts in the N-terminal domain of the Spike (ins245IME, ins246DSWG, and ins248SSLT) that were first detected in 2021 are predicted to lead to escape from neutralizing antibodies, whereas other inserts might result in escape from T-cell immunity.
Affiliation(s)
- Sofya K. Garushyants
  - National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
- Igor B. Rogozin
  - National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
- Eugene V. Koonin
  - National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA