1
|
Fonseca PLC, Braga-Paz I, de Araújo E Santos LCG, Dias RC, de Souza CSA, Carvalho NO, Queiroz DC, Alves HJ, de Araújo JLF, Moreira FRR, Menezes MT, Menezes D, Silva ABPE, Ferreira JGG, Adelino TER, Bernardes AFL, Carobin NV, Carvalho RS, Ferrari CZ, Guimarães NR, Lamounier LO, Souza FG, Vargas LA, Ribeiro MDO, Arruda MB, Alvarez P, Moreira RG, de Oliveira ES, Sabino ADP, de Oliveira JS, Januário JN, Iani FCDM, Souza RPD, Aguiar RS. Retrospective Analysis of Omicron in Minas Gerais, Brazil: Emergence, Dissemination, and Diversification. Microorganisms 2024; 12:1745. [PMID: 39338420 PMCID: PMC11434267 DOI: 10.3390/microorganisms12091745] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2024] [Revised: 08/06/2024] [Accepted: 08/09/2024] [Indexed: 09/30/2024] Open
Abstract
Brazil is one of the countries most affected by COVID-19, with the highest number of deaths recorded. Brazilian Health Institutions have reported four main peaks of positive COVID-19 cases. The last two waves were characterized by the emergence of the VOC Omicron and its sublineages. This study aimed to conduct a retrospective surveillance study illustrating the emergence, dissemination, and diversification of the VOC Omicron in 15 regional health units (RHUs) in MG, the second most populous state in Brazil, by combining epidemiological and genomic data. A total of 5643 confirmed positive COVID-19 samples were genotyped using the panels TaqMan SARS-CoV-2 Mutation and 4Plex SC2/VOC Bio-Manguinhos to define mutations classifying the BA.1, BA.2, BA.4, and BA.5 sublineages. While sublineages BA.1 and BA.2 were more prevalent during the third wave, BA.4 and BA.5 dominated the fourth wave in the state. Epidemiological and viral genome data suggest that age and vaccination with booster doses were the main factors related to clinical outcomes, reducing the number of deaths, irrespective of the Omicron sublineages. Complete genome sequencing of 253 positive samples confirmed the circulation of the BA.1, BA.2, BA.4, and BA.5 subvariants, and phylogenomic analysis demonstrated that the VOC Omicron was introduced through multiple international events, followed by transmission within the state of MG. In addition to the four subvariants, other lineages have been identified at low frequency, including BQ.1.1 and XAG. This integrative study reinforces that the evolution of Omicron sublineages was the most significant factor driving the highest peaks of positive COVID-19 cases without an increase in more severe cases, prevented by vaccination boosters.
Collapse
Affiliation(s)
- Paula Luize Camargos Fonseca
- Laboratório de Biologia Integrativa, Departamento de Genética, Ecologia e Evolução, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte 31270-901, Brazil
| | - Isabela Braga-Paz
- Laboratório de Biologia Integrativa, Departamento de Genética, Ecologia e Evolução, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte 31270-901, Brazil
| | - Luiza Campos Guerra de Araújo E Santos
- Laboratório de Biologia Integrativa, Departamento de Genética, Ecologia e Evolução, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte 31270-901, Brazil
| | - Rillery Calixto Dias
- Laboratório de Biologia Integrativa, Departamento de Genética, Ecologia e Evolução, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte 31270-901, Brazil
| | - Carolina Senra Alves de Souza
- Subsecretaria de Vigilância em Saúde, Secretaria de Estado de Saúde de Minas Gerais, Belo Horizonte 31585-200, Brazil
| | - Nara Oliveira Carvalho
- Núcleo de Ações e Pesquisa em Apoio Diagnóstico-Nupad, Faculdade de Medicina, Universidade Federal de Minas Gerais, Belo Horizonte 30130-100, Brazil
| | - Daniel Costa Queiroz
- Laboratório de Biologia Integrativa, Departamento de Genética, Ecologia e Evolução, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte 31270-901, Brazil
| | - Hugo José Alves
- Laboratório de Biologia Integrativa, Departamento de Genética, Ecologia e Evolução, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte 31270-901, Brazil
| | - João Locke Ferreira de Araújo
- Laboratório de Biologia Integrativa, Departamento de Genética, Ecologia e Evolução, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte 31270-901, Brazil
| | - Filipe Romero Rebello Moreira
- Departamento de Genetica, Instituto de Biologia, Universidade Federal do Rio de Janeiro, Rio de Janeiro 21941-902, Brazil
| | - Mariane Talon Menezes
- Departamento de Genetica, Instituto de Biologia, Universidade Federal do Rio de Janeiro, Rio de Janeiro 21941-902, Brazil
| | - Diego Menezes
- Laboratório de Biologia Integrativa, Departamento de Genética, Ecologia e Evolução, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte 31270-901, Brazil
| | - Aryel Beatriz Paz E Silva
- Laboratório de Biologia Integrativa, Departamento de Genética, Ecologia e Evolução, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte 31270-901, Brazil
| | - Jorge Gomes Goulart Ferreira
- Laboratório de Biologia Integrativa, Departamento de Genética, Ecologia e Evolução, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte 31270-901, Brazil
| | | | | | - Natália Virtude Carobin
- Laboratório Institucional de Pesquisa em Biomarcadores, Laboratório de Hematologia Clínica, Departamento de Análises Clínicas e Toxicológicas; Faculdade de Farmácia, Universidade Federal de Minas Gerais, Belo Horizonte 31270-901, Brazil
| | - Renée Silva Carvalho
- Subsecretaria de Vigilância em Saúde, Secretaria de Estado de Saúde de Minas Gerais, Belo Horizonte 31585-200, Brazil
| | - Carolina Zaniboni Ferrari
- Subsecretaria de Vigilância em Saúde, Secretaria de Estado de Saúde de Minas Gerais, Belo Horizonte 31585-200, Brazil
| | | | | | - Fernanda Gil Souza
- Laboratório de Biologia Integrativa, Departamento de Genética, Ecologia e Evolução, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte 31270-901, Brazil
| | - Luisa Aimeé Vargas
- Subsecretaria de Vigilância em Saúde, Secretaria de Estado de Saúde de Minas Gerais, Belo Horizonte 31585-200, Brazil
| | - Marisa de Oliveira Ribeiro
- Institute of Technology in Immunobiology Bio-Manguinhos, Oswaldo Cruz Foundation/Fiocruz, Rio de Janeiro 21040-900, Brazil
| | - Monica Barcellos Arruda
- Institute of Technology in Immunobiology Bio-Manguinhos, Oswaldo Cruz Foundation/Fiocruz, Rio de Janeiro 21040-900, Brazil
| | - Patricia Alvarez
- Institute of Technology in Immunobiology Bio-Manguinhos, Oswaldo Cruz Foundation/Fiocruz, Rio de Janeiro 21040-900, Brazil
| | - Rennan Garcias Moreira
- Laboratório de Biologia Integrativa, Departamento de Genética, Ecologia e Evolução, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte 31270-901, Brazil
| | | | - Adriano de Paula Sabino
- Laboratório Institucional de Pesquisa em Biomarcadores, Laboratório de Hematologia Clínica, Departamento de Análises Clínicas e Toxicológicas; Faculdade de Farmácia, Universidade Federal de Minas Gerais, Belo Horizonte 31270-901, Brazil
| | - Jaqueline Silva de Oliveira
- Subsecretaria de Vigilância em Saúde, Secretaria de Estado de Saúde de Minas Gerais, Belo Horizonte 31585-200, Brazil
| | - José Nélio Januário
- Núcleo de Ações e Pesquisa em Apoio Diagnóstico-Nupad, Faculdade de Medicina, Universidade Federal de Minas Gerais, Belo Horizonte 30130-100, Brazil
| | | | - Renan Pedra de Souza
- Laboratório de Biologia Integrativa, Departamento de Genética, Ecologia e Evolução, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte 31270-901, Brazil
| | - Renato Santana Aguiar
- Laboratório de Biologia Integrativa, Departamento de Genética, Ecologia e Evolução, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte 31270-901, Brazil
- Instituto D'OR de Pesquisa e Ensino, Rio de Janeiro 22281-100, Brazil
| |
Collapse
|
2
|
Majchrzak M, Madej Ł, Łysek-Gładysińska M, Zarębska-Michaluk D, Zegadło K, Dziuba A, Nogal-Nowak K, Kondziołka W, Sufin I, Myszona-Tarnowska M, Jaśkowski M, Kędzierski M, Maciukajć J, Matykiewicz J, Głuszek S, Adamus-Białek W. The RdRp genotyping of SARS-CoV-2 isolated from patients with different clinical spectrum of COVID-19. BMC Infect Dis 2024; 24:281. [PMID: 38439047 PMCID: PMC10913261 DOI: 10.1186/s12879-024-09146-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2023] [Accepted: 02/16/2024] [Indexed: 03/06/2024] Open
Abstract
BACKGROUND The evolution of SARS-CoV-2 has been observed from the very beginning of the fight against COVID-19, some mutations are indicators of potentially dangerous variants of the virus. However, there is no clear association between the genetic variants of SARS-CoV-2 and the severity of COVID-19. We aimed to analyze the genetic variability of RdRp in correlation with different courses of COVID-19. RESULTS The prospective study included 77 samples of SARS-CoV-2 isolated from outpatients (1st degree of severity) and hospitalized patients (2nd, 3rd and 4th degree of severity). The retrospective analyses included 15,898,266 cases of SARS-CoV-2 genome sequences deposited in the GISAID repository. Single-nucleotide variants were identified based on the four sequenced amplified fragments of SARS-CoV-2. The analysis of the results was performed using appropriate statistical methods, with p < 0.05, considered statistically significant. Additionally, logistic regression analysis was performed to predict the strongest determinants of the observed relationships. The number of mutations was positively correlated with the severity of the COVID-19, and older male patients. We detected four mutations that significantly increased the risk of hospitalization of COVID-19 patients (14676C > T, 14697C > T, 15096 T > C, and 15279C > T), while the 15240C > T mutation was common among strains isolated from outpatients. The selected mutations were searched worldwide in the GISAID database, their presence was correlated with the severity of COVID-19. CONCLUSION Identified mutations have the potential to be used to assess the increased risk of hospitalization in COVID-19 positive patients. Experimental studies and extensive epidemiological data are needed to investigate the association between individual mutations and the severity of COVID-19.
Collapse
Affiliation(s)
- Michał Majchrzak
- Institute of Medical Sciences, Jan Kochanowski University, Kielce, Poland
| | - Łukasz Madej
- Institute of Medical Sciences, Jan Kochanowski University, Kielce, Poland
| | | | | | - Katarzyna Zegadło
- Institute of Medical Sciences, Jan Kochanowski University, Kielce, Poland
| | - Anna Dziuba
- Institute of Medical Sciences, Jan Kochanowski University, Kielce, Poland
| | | | | | | | | | | | | | | | | | - Stanisław Głuszek
- Institute of Medical Sciences, Jan Kochanowski University, Kielce, Poland
| | | |
Collapse
|
3
|
Wu X, Shan K, Zan F, Tang X, Qian Z, Lu J. Optimization and Deoptimization of Codons in SARS-CoV-2 and Related Implications for Vaccine Development. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2023; 10:e2205445. [PMID: 37267926 PMCID: PMC10427376 DOI: 10.1002/advs.202205445] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/20/2022] [Revised: 04/08/2023] [Indexed: 06/04/2023]
Abstract
The spread of coronavirus disease 2019 (COVID-19), caused by severe respiratory syndrome coronavirus 2 (SARS-CoV-2), has progressed into a global pandemic. To date, thousands of genetic variants have been identified among SARS-CoV-2 isolates collected from patients. Sequence analysis reveals that the codon adaptation index (CAI) values of viral sequences have decreased over time but with occasional fluctuations. Through evolution modeling, it is found that this phenomenon may result from the virus's mutation preference during transmission. Using dual-luciferase assays, it is further discovered that the deoptimization of codons in the viral sequence may weaken protein expression during virus evolution, indicating that codon usage may play an important role in virus fitness. Finally, given the importance of codon usage in protein expression and particularly for mRNA vaccines, it is designed several codon-optimized Omicron BA.2.12.1, BA.4/5, and XBB.1.5 spike mRNA vaccine candidates and experimentally validated their high levels of expression. This study highlights the importance of codon usage in virus evolution and provides guidelines for codon optimization in mRNA and DNA vaccine development.
Collapse
Affiliation(s)
- Xinkai Wu
- State Key Laboratory of Protein and Plant Gene ResearchCenter for BioinformaticsSchool of Life SciencesPeking UniversityBeijing100871China
| | - Ke‐jia Shan
- State Key Laboratory of Protein and Plant Gene ResearchCenter for BioinformaticsSchool of Life SciencesPeking UniversityBeijing100871China
| | - Fuwen Zan
- NHC Key Laboratory of Systems Biology of PathogensInstitute of Pathogen BiologyChinese Academy of Medical Sciences and Peking Union Medical CollegeBeijing100176China
| | - Xiaolu Tang
- State Key Laboratory of Protein and Plant Gene ResearchCenter for BioinformaticsSchool of Life SciencesPeking UniversityBeijing100871China
| | - Zhaohui Qian
- NHC Key Laboratory of Systems Biology of PathogensInstitute of Pathogen BiologyChinese Academy of Medical Sciences and Peking Union Medical CollegeBeijing100176China
| | - Jian Lu
- State Key Laboratory of Protein and Plant Gene ResearchCenter for BioinformaticsSchool of Life SciencesPeking UniversityBeijing100871China
| |
Collapse
|
4
|
De La Cruz-Montoya AH, Díaz Velásquez CE, Martínez-Gregorio H, Ruiz-De La Cruz M, Bustos-Arriaga J, Castro-Jiménez TK, Olguín-Hernández JE, Rodríguez-Sosa M, Terrazas-Valdes LI, Jiménez-Alvarez LA, Regino-Zamarripa NE, Ramírez-Martínez G, Cruz-Lagunas A, Peralta-Arrieta I, Armas-López L, Contreras-Garza BM, Palma-Cortés G, Cabello-Gutierrez C, Báez-Saldaña R, Zúñiga J, Ávila-Moreno F, Vaca-Paniagua F. Molecular transition of SARS-CoV-2 from critical patients during the first year of the COVID-19 pandemic in Mexico City. Front Cell Infect Microbiol 2023; 13:1155938. [PMID: 37260697 PMCID: PMC10227454 DOI: 10.3389/fcimb.2023.1155938] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2023] [Accepted: 05/02/2023] [Indexed: 06/02/2023] Open
Abstract
Background The SARS-CoV-2 virus has caused unprecedented mortality since its emergence in late 2019. The continuous evolution of the viral genome through the concerted action of mutational forces has produced distinct variants that became dominant, challenging human immunity and vaccine development. Aim and methods In this work, through an integrative genomic approach, we describe the molecular transition of SARS-CoV-2 by analyzing the viral whole genome sequences from 50 critical COVID-19 patients recruited during the first year of the pandemic in Mexico City. Results Our results revealed differential levels of the evolutionary forces across the genome and specific mutational processes that have shaped the first two epidemiological waves of the pandemic in Mexico. Through phylogenetic analyses, we observed a genomic transition in the circulating SARS-CoV-2 genomes from several lineages prevalent in the first wave to a dominance of the B.1.1.519 variant (defined by T478K, P681H, and T732A mutations in the spike protein) in the second wave. Conclusion This work contributes to a better understanding of the evolutionary dynamics and selective pressures that act at the genomic level, the prediction of more accurate variants of clinical significance, and a better comprehension of the molecular mechanisms driving the evolution of SARS-CoV-2 to improve vaccine and drug development.
Collapse
Affiliation(s)
- Aldo Hugo De La Cruz-Montoya
- Laboratorio Nacional en Salud, Diagnóstico Molecular y Efecto Ambiental en Enfermedades Crónico-Degenerativas, Facultad de Estudios Superiores Iztacala, Tlalnepantla, Mexico
- Unidad de Investigación en Biomedicina, Facultad de Estudios Superiores Iztacala, Universidad Nacional Autónoma de México, Tlalnepantla, Mexico
| | - Clara Estela Díaz Velásquez
- Laboratorio Nacional en Salud, Diagnóstico Molecular y Efecto Ambiental en Enfermedades Crónico-Degenerativas, Facultad de Estudios Superiores Iztacala, Tlalnepantla, Mexico
- Unidad de Investigación en Biomedicina, Facultad de Estudios Superiores Iztacala, Universidad Nacional Autónoma de México, Tlalnepantla, Mexico
| | - Héctor Martínez-Gregorio
- Laboratorio Nacional en Salud, Diagnóstico Molecular y Efecto Ambiental en Enfermedades Crónico-Degenerativas, Facultad de Estudios Superiores Iztacala, Tlalnepantla, Mexico
- Unidad de Investigación en Biomedicina, Facultad de Estudios Superiores Iztacala, Universidad Nacional Autónoma de México, Tlalnepantla, Mexico
| | - Miguel Ruiz-De La Cruz
- Laboratorio Nacional en Salud, Diagnóstico Molecular y Efecto Ambiental en Enfermedades Crónico-Degenerativas, Facultad de Estudios Superiores Iztacala, Tlalnepantla, Mexico
- Unidad de Investigación en Biomedicina, Facultad de Estudios Superiores Iztacala, Universidad Nacional Autónoma de México, Tlalnepantla, Mexico
- Departamento de Infectómica y Patogénesis Molecular, Centro de Investigación y de Estudios Avanzados del Instituto Politécnico Nacional (CINVESTAV-IPN), Avenida Instituto Politécnico Nacional, Colonia San Pedro Zacatenco, Delegación Gustavo A. Madero, Ciudad de México, Mexico
| | - José Bustos-Arriaga
- Unidad de Investigación en Biomedicina, Facultad de Estudios Superiores Iztacala, Universidad Nacional Autónoma de México, Tlalnepantla, Mexico
| | - Tannya Karen Castro-Jiménez
- Unidad de Investigación en Biomedicina, Facultad de Estudios Superiores Iztacala, Universidad Nacional Autónoma de México, Tlalnepantla, Mexico
| | - Jonadab Efraín Olguín-Hernández
- Laboratorio Nacional en Salud, Diagnóstico Molecular y Efecto Ambiental en Enfermedades Crónico-Degenerativas, Facultad de Estudios Superiores Iztacala, Tlalnepantla, Mexico
| | - Miriam Rodríguez-Sosa
- Unidad de Investigación en Biomedicina, Facultad de Estudios Superiores Iztacala, Universidad Nacional Autónoma de México, Tlalnepantla, Mexico
| | - Luis Ignacio Terrazas-Valdes
- Laboratorio Nacional en Salud, Diagnóstico Molecular y Efecto Ambiental en Enfermedades Crónico-Degenerativas, Facultad de Estudios Superiores Iztacala, Tlalnepantla, Mexico
- Unidad de Investigación en Biomedicina, Facultad de Estudios Superiores Iztacala, Universidad Nacional Autónoma de México, Tlalnepantla, Mexico
| | - Luis Armando Jiménez-Alvarez
- Laboratorio de Inmunobiología y Genética y Departamento de Virología, Instituto Nacional de Enfermedades Respiratorias (INER) Ismael Cosio Villegas, Ciudad de México, Mexico
| | - Nora Elemi Regino-Zamarripa
- Laboratorio de Inmunobiología y Genética y Departamento de Virología, Instituto Nacional de Enfermedades Respiratorias (INER) Ismael Cosio Villegas, Ciudad de México, Mexico
- Tecnológico de Monterrey, Escuela de Medicina y Ciencias de la Salud, Ciudad de México, Mexico
| | - Gustavo Ramírez-Martínez
- Laboratorio de Inmunobiología y Genética y Departamento de Virología, Instituto Nacional de Enfermedades Respiratorias (INER) Ismael Cosio Villegas, Ciudad de México, Mexico
| | - Alfredo Cruz-Lagunas
- Laboratorio de Inmunobiología y Genética y Departamento de Virología, Instituto Nacional de Enfermedades Respiratorias (INER) Ismael Cosio Villegas, Ciudad de México, Mexico
| | - Irlanda Peralta-Arrieta
- Instituto Nacional de Enfermedades Respiratorias (INER) Ismael Cosio Villegas, Ciudad de México, Mexico
| | - Leonel Armas-López
- Unidad de Investigación en Biomedicina, Facultad de Estudios Superiores Iztacala, Universidad Nacional Autónoma de México, Tlalnepantla, Mexico
| | | | - Gabriel Palma-Cortés
- Department of Research in Virology and Mycology, Instituto Nacional de Enfermedades Respiratorias (INER) Ismael Cosio Villegas, Ciudad de México, Mexico
| | - Carlos Cabello-Gutierrez
- Department of Research in Virology and Mycology, Instituto Nacional de Enfermedades Respiratorias (INER) Ismael Cosio Villegas, Ciudad de México, Mexico
| | - Renata Báez-Saldaña
- Instituto Nacional de Enfermedades Respiratorias (INER) Ismael Cosio Villegas, Ciudad de México, Mexico
| | - Joaquín Zúñiga
- Laboratorio de Inmunobiología y Genética y Departamento de Virología, Instituto Nacional de Enfermedades Respiratorias (INER) Ismael Cosio Villegas, Ciudad de México, Mexico
- Tecnológico de Monterrey, Escuela de Medicina y Ciencias de la Salud, Ciudad de México, Mexico
- Instituto Nacional de Enfermedades Respiratorias (INER) Ismael Cosio Villegas, Ciudad de México, Mexico
| | - Federico Ávila-Moreno
- Unidad de Investigación en Biomedicina, Facultad de Estudios Superiores Iztacala, Universidad Nacional Autónoma de México, Tlalnepantla, Mexico
- Instituto Nacional de Enfermedades Respiratorias (INER) Ismael Cosio Villegas, Ciudad de México, Mexico
- Laboratorio 12 de Enfermedades Pulmonares y Epigenómica del Cáncer, Unidad de Investigación en Biomedicina (UBIMED), Facultad de Estudios Superiores Iztacala, Universidad Nacional Autónoma de México, Tlalnepantla, Mexico
| | - Felipe Vaca-Paniagua
- Laboratorio Nacional en Salud, Diagnóstico Molecular y Efecto Ambiental en Enfermedades Crónico-Degenerativas, Facultad de Estudios Superiores Iztacala, Tlalnepantla, Mexico
- Unidad de Investigación en Biomedicina, Facultad de Estudios Superiores Iztacala, Universidad Nacional Autónoma de México, Tlalnepantla, Mexico
- Subdirección de Investigación Básica, Instituto Nacional de Cancerología, Ciudad de México, Mexico
| |
Collapse
|
5
|
Cheng Y, Ji C, Zhou HY, Zheng H, Wu A. Web Resources for SARS-CoV-2 Genomic Database, Annotation, Analysis and Variant Tracking. Viruses 2023; 15:1158. [PMID: 37243244 PMCID: PMC10222785 DOI: 10.3390/v15051158] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2023] [Revised: 05/10/2023] [Accepted: 05/10/2023] [Indexed: 05/28/2023] Open
Abstract
The SARS-CoV-2 genomic data continue to grow, providing valuable information for researchers and public health officials. Genomic analysis of these data sheds light on the transmission and evolution of the virus. To aid in SARS-CoV-2 genomic analysis, many web resources have been developed to store, collate, analyze, and visualize the genomic data. This review summarizes web resources used for the SARS-CoV-2 genomic epidemiology, covering data management and sharing, genomic annotation, analysis, and variant tracking. The challenges and further expectations for these web resources are also discussed. Finally, we highlight the importance and need for continued development and improvement of related web resources to effectively track the spread and understand the evolution of the virus.
Collapse
Affiliation(s)
- Yexiao Cheng
- School of Life Science and Technology, China Pharmaceutical University, Nanjing 211100, China
- Institute of Systems Medicine, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing 100005, China
- Suzhou Institute of Systems Medicine, Suzhou 215123, China
| | - Chengyang Ji
- Institute of Systems Medicine, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing 100005, China
- Suzhou Institute of Systems Medicine, Suzhou 215123, China
| | - Hang-Yu Zhou
- Institute of Systems Medicine, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing 100005, China
- Suzhou Institute of Systems Medicine, Suzhou 215123, China
| | - Heng Zheng
- School of Life Science and Technology, China Pharmaceutical University, Nanjing 211100, China
| | - Aiping Wu
- Institute of Systems Medicine, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing 100005, China
- Suzhou Institute of Systems Medicine, Suzhou 215123, China
| |
Collapse
|
6
|
Fumagalli SE, Padhiar NH, Meyer D, Katneni U, Bar H, DiCuccio M, Komar AA, Kimchi-Sarfaty C. Analysis of 3.5 million SARS-CoV-2 sequences reveals unique mutational trends with consistent nucleotide and codon frequencies. Virol J 2023; 20:31. [PMID: 36812119 PMCID: PMC9936480 DOI: 10.1186/s12985-023-01982-8] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2022] [Accepted: 02/02/2023] [Indexed: 02/19/2023] Open
Abstract
BACKGROUND Since the onset of the SARS-CoV-2 pandemic, bioinformatic analyses have been performed to understand the nucleotide and synonymous codon usage features and mutational patterns of the virus. However, comparatively few have attempted to perform such analyses on a considerably large cohort of viral genomes while organizing the plethora of available sequence data for a month-by-month analysis to observe changes over time. Here, we aimed to perform sequence composition and mutation analysis of SARS-CoV-2, separating sequences by gene, clade, and timepoints, and contrast the mutational profile of SARS-CoV-2 to other comparable RNA viruses. METHODS Using a cleaned, filtered, and pre-aligned dataset of over 3.5 million sequences downloaded from the GISAID database, we computed nucleotide and codon usage statistics, including calculation of relative synonymous codon usage values. We then calculated codon adaptation index (CAI) changes and a nonsynonymous/synonymous mutation ratio (dN/dS) over time for our dataset. Finally, we compiled information on the types of mutations occurring for SARS-CoV-2 and other comparable RNA viruses, and generated heatmaps showing codon and nucleotide composition at high entropy positions along the Spike sequence. RESULTS We show that nucleotide and codon usage metrics remain relatively consistent over the 32-month span, though there are significant differences between clades within each gene at various timepoints. CAI and dN/dS values vary substantially between different timepoints and different genes, with Spike gene on average showing both the highest CAI and dN/dS values. Mutational analysis showed that SARS-CoV-2 Spike has a higher proportion of nonsynonymous mutations than analogous genes in other RNA viruses, with nonsynonymous mutations outnumbering synonymous ones by up to 20:1. However, at several specific positions, synonymous mutations were overwhelmingly predominant. CONCLUSIONS Our multifaceted analysis covering both the composition and mutation signature of SARS-CoV-2 gives valuable insight into the nucleotide frequency and codon usage heterogeneity of SARS-CoV-2 over time, and its unique mutational profile compared to other RNA viruses.
Collapse
Affiliation(s)
- Sarah E Fumagalli
- Hemostasis Branch, Division of Plasma Protein Therapeutics, Office of Tissues and Advanced Therapies, Center for Biologics Evaluation and Research, US Food and Drug Administration, Silver Spring, MD, USA
| | - Nigam H Padhiar
- Hemostasis Branch, Division of Plasma Protein Therapeutics, Office of Tissues and Advanced Therapies, Center for Biologics Evaluation and Research, US Food and Drug Administration, Silver Spring, MD, USA
| | - Douglas Meyer
- Hemostasis Branch, Division of Plasma Protein Therapeutics, Office of Tissues and Advanced Therapies, Center for Biologics Evaluation and Research, US Food and Drug Administration, Silver Spring, MD, USA
| | - Upendra Katneni
- Hemostasis Branch, Division of Plasma Protein Therapeutics, Office of Tissues and Advanced Therapies, Center for Biologics Evaluation and Research, US Food and Drug Administration, Silver Spring, MD, USA
| | - Haim Bar
- Department of Statistics, University of Connecticut, Storrs, CT, USA
| | | | - Anton A Komar
- Department of Biological, Geological and Environmental Sciences, Center for Gene Regulation in Health and Disease, Cleveland State University, Cleveland, OH, USA
| | - Chava Kimchi-Sarfaty
- Hemostasis Branch, Division of Plasma Protein Therapeutics, Office of Tissues and Advanced Therapies, Center for Biologics Evaluation and Research, US Food and Drug Administration, Silver Spring, MD, USA.
| |
Collapse
|
7
|
Bai H, Ata G, Sun Q, Rahman SU, Tao S. Natural selection pressure exerted on "Silent" mutations during the evolution of SARS-CoV-2: Evidence from codon usage and RNA structure. Virus Res 2023; 323:198966. [PMID: 36244617 PMCID: PMC9561399 DOI: 10.1016/j.virusres.2022.198966] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2022] [Revised: 10/08/2022] [Accepted: 10/10/2022] [Indexed: 01/25/2023]
Abstract
From the first emergence of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) till now, multiple mutations that caused synonymous and nonsynonymous substitutions have accumulated. Among them, synonymous substitutions were regarded as "silent" mutations that received less attention than nonsynonymous substitutions that cause amino acid variations. However, the importance of synonymous substitutions can not be neglected. This research focuses on synonymous substitutions on SARS-CoV-2 and proves that synonymous substitutions were under purifying selection in its evolution. The evidence of purifying selection is provided by comparing the mutation number per site in coding and non-coding regions. We then study the two forces of purifying selection: synonymous codon usage and RNA secondary structure. Results show that the codon usage optimization leads to an adapted codon usage towards humans. Furthermore, our results show that the maintenance of RNA secondary structure causes the purifying of synonymous substitutions in the structural region. These results explain the selection pressure on synonymous substitutions during the evolution of SARS-CoV-2.
Collapse
Affiliation(s)
- Haoxiang Bai
- College of Life Sciences, Northwest A&F University, Yangling, China; Bioinformatics Center, Northwest A&F University, Yangling, China
| | - Galal Ata
- College of Life Sciences, Northwest A&F University, Yangling, China; Bioinformatics Center, Northwest A&F University, Yangling, China
| | - Qing Sun
- College of Life Sciences, Northwest A&F University, Yangling, China; Bioinformatics Center, Northwest A&F University, Yangling, China
| | - Siddiq Ur Rahman
- Department of Computer Science and Bioinformatics, Khushal Khan Khattak University, Karak, Khyber Pakhtunkhwa, Pakistan
| | - Shiheng Tao
- College of Life Sciences, Northwest A&F University, Yangling, China; Bioinformatics Center, Northwest A&F University, Yangling, China.
| |
Collapse
|
8
|
Liu Y. Attenuation and Degeneration of SARS-CoV-2 Despite Adaptive Evolution. Cureus 2023; 15:e33316. [PMID: 36741655 PMCID: PMC9894646 DOI: 10.7759/cureus.33316] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2022] [Accepted: 01/02/2023] [Indexed: 01/04/2023] Open
Abstract
The evolution of severe acute respiratory syndrome coronavirus 2 (SARS‑CoV‑2) has followed similar trends as other RNA viruses, such as human immunodeficiency virus type 1 and the influenza A virus. Rapid initial diversification was followed by strong competition and a rapid succession of dominant variants. Host-initiated RNA editing has been the primary mechanism for introducing mutations. A significant number of mutations detrimental to viral replication have been quickly purged. Fixed mutations are mostly diversifying mutations selected for host adaptation and immune evasion, with the latter accounting for the majority of the mutations. However, immune evasion often comes at the cost of functionality, and thus, optimal functionality is still far from being accomplished. Instead, selection for antibody-escaping variants and accumulation of near-neutral mutations have led to suboptimal codon usage and reduced replicative capacity, as demonstrated in non-respiratory cell lines. Beneficial adaptation of the virus includes reduced infectivity in lung tissues and increased tropism for the upper airway, resulting in shorter incubation periods, milder diseases, and more efficient transmission between people.
Collapse
Affiliation(s)
- Yingguang Liu
- Molecular and Cellular Sciences, Liberty University College of Osteopathic Medicine, Lynchburg, USA
| |
Collapse
|
9
|
Aroldi A, Angaroni F, D’Aliberti D, Spinelli S, Crespiatico I, Crippa V, Piazza R, Graudenzi A, Ramazzotti D. Characterization of SARS-CoV-2 Mutational Signatures from 1.5+ Million Raw Sequencing Samples. Viruses 2022; 15:7. [PMID: 36680048 PMCID: PMC9864147 DOI: 10.3390/v15010007] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2022] [Revised: 12/01/2022] [Accepted: 12/15/2022] [Indexed: 12/24/2022] Open
Abstract
We present a large-scale analysis of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) substitutions, considering 1,585,456 high-quality raw sequencing samples, aimed at investigating the existence and quantifying the effect of mutational processes causing mutations in SARS-CoV-2 genomes when interacting with the human host. As a result, we confirmed the presence of three well-differentiated mutational processes likely ruled by reactive oxygen species (ROS), apolipoprotein B editing complex (APOBEC), and adenosine deaminase acting on RNA (ADAR). We then evaluated the activity of these mutational processes in different continental groups, showing that some samples from Africa present a significantly higher number of substitutions, most likely due to higher APOBEC activity. We finally analyzed the activity of mutational processes across different SARS-CoV-2 variants, and we found a significantly lower number of mutations attributable to APOBEC activity in samples assigned to the Omicron variant.
Collapse
Affiliation(s)
- Andrea Aroldi
- Hematology and Clinical Research Unit, San Gerardo Hospital, Via G. B. Pergolesi 33, 20900 Monza, Italy
- Department of Medicine and Surgery, Università degli Studi di Milano-Bicocca, Via Cadore 48, 20900 Monza, Italy
| | - Fabrizio Angaroni
- Department of Informatics, Systems and Communication, Università degli Studi di Milano-Bicocca, Viale Sarca 336, 20100 Milano, Italy
- Computational Biology Research Centre, Human Technopole, Viale Rita Levi Montalcini 1, 20157 Milano, Italy
| | - Deborah D’Aliberti
- Department of Medicine and Surgery, Università degli Studi di Milano-Bicocca, Via Cadore 48, 20900 Monza, Italy
| | - Silvia Spinelli
- Department of Medicine and Surgery, Università degli Studi di Milano-Bicocca, Via Cadore 48, 20900 Monza, Italy
| | - Ilaria Crespiatico
- Department of Medicine and Surgery, Università degli Studi di Milano-Bicocca, Via Cadore 48, 20900 Monza, Italy
| | - Valentina Crippa
- Department of Medicine and Surgery, Università degli Studi di Milano-Bicocca, Via Cadore 48, 20900 Monza, Italy
| | - Rocco Piazza
- Department of Medicine and Surgery, Università degli Studi di Milano-Bicocca, Via Cadore 48, 20900 Monza, Italy
- Bicocca Bioinformatics, Biostatistics and Bioimaging Center—B4, Via Follereau 3, 20854 Vedano al Lambro, Italy
| | - Alex Graudenzi
- Department of Informatics, Systems and Communication, Università degli Studi di Milano-Bicocca, Viale Sarca 336, 20100 Milano, Italy
- Bicocca Bioinformatics, Biostatistics and Bioimaging Center—B4, Via Follereau 3, 20854 Vedano al Lambro, Italy
| | - Daniele Ramazzotti
- Department of Medicine and Surgery, Università degli Studi di Milano-Bicocca, Via Cadore 48, 20900 Monza, Italy
| |
Collapse
|
10
|
The Rise and Fall of SARS-CoV-2 Variants and Ongoing Diversification of Omicron. Viruses 2022; 14:v14092009. [PMID: 36146815 PMCID: PMC9505243 DOI: 10.3390/v14092009] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2022] [Revised: 09/06/2022] [Accepted: 09/08/2022] [Indexed: 12/12/2022] Open
Abstract
In late December of 2019, high-throughput sequencing technologies enabled rapid identification of SARS-CoV-2 as the etiological agent of COVID-19, and global sequencing efforts are now a critical tool for monitoring the ongoing spread and evolution of this virus. Here, we provide a short retrospective analysis of SARS-CoV-2 variants by analyzing a subset (n = 97,437) of all publicly available SARS-CoV-2 genomes (n = ~11.9 million) that were randomly selected but equally distributed over the course of the pandemic. We plot the appearance of new variants of concern (VOCs) over time and show that the mutation rates in Omicron (BA.1) and Omicron sub-lineages (BA.2–BA.5) are significantly elevated compared to previously identified SARS-CoV-2 variants. Mutations in Omicron are primarily restricted to the spike and nucleocapsid proteins, while 24 other viral proteins—including those involved in SARS-CoV-2 replication—are generally conserved. Collectively, this suggests that the genetic distinction of Omicron primarily arose from selective pressures on the spike, and that the fidelity of replication of this variant has not been altered.
Collapse
|
11
|
Robins WP, Mekalanos JJ. Covariance predicts conserved protein residue interactions important for the emergence and continued evolution of SARS-CoV-2 as a human pathogen. PLoS One 2022; 17:e0270276. [PMID: 35895734 PMCID: PMC9328546 DOI: 10.1371/journal.pone.0270276] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2022] [Accepted: 06/07/2022] [Indexed: 12/03/2022] Open
Abstract
SARS-CoV-2 is one of three recognized coronaviruses (CoVs) that have caused epidemics or pandemics in the 21st century and that likely emerged from animal reservoirs. Differences in nucleotide and protein sequence composition within related β-coronaviruses are often used to better understand CoV evolution, host adaptation, and their emergence as human pathogens. Here we report the comprehensive analysis of amino acid residue changes that have occurred in lineage B β-coronaviruses that show covariance with each other. This analysis revealed patterns of covariance within conserved viral proteins that potentially define conserved interactions within and between core proteins encoded by SARS-CoV-2 related β-coronaviruses. We identified not only individual pairs but also networks of amino acid residues that exhibited statistically high frequencies of covariance with each other using an independent pair model followed by a tandem model approach. Using 149 different CoV genomes that vary in their relatedness, we identified networks of unique combinations of alleles that can be incrementally traced genome by genome within different phylogenic lineages. Remarkably, covariant residues and their respective regions most abundantly represented are implicated in the emergence of SARS-CoV-2 and are also enriched in dominant SARS-CoV-2 variants.
Collapse
Affiliation(s)
- William P. Robins
- Department of Microbiology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - John J. Mekalanos
- Department of Microbiology, Harvard Medical School, Boston, Massachusetts, United States of America
| |
Collapse
|
12
|
Ramazzotti D, Maspero D, Angaroni F, Spinelli S, Antoniotti M, Piazza R, Graudenzi A. Early detection and improved genomic surveillance of SARS-CoV-2 variants from deep sequencing data. iScience 2022; 25:104487. [PMID: 35677393 PMCID: PMC9162787 DOI: 10.1016/j.isci.2022.104487] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2022] [Revised: 05/06/2022] [Accepted: 05/24/2022] [Indexed: 11/19/2022] Open
Affiliation(s)
- Daniele Ramazzotti
- Department of Medicine and Surgery, University of Milan-Bicocca, Monza, Italy
- Corresponding author
| | - Davide Maspero
- Department of Informatics, Systems and Communication, University of Milan-Bicocca, Milan, Italy
- Institute of Molecular Bioimaging and Physiology, National Research Council (IBFM-CNR), Segrate, Milan, Italy
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona, Spain
| | - Fabrizio Angaroni
- Department of Informatics, Systems and Communication, University of Milan-Bicocca, Milan, Italy
| | - Silvia Spinelli
- Department of Medicine and Surgery, University of Milan-Bicocca, Monza, Italy
| | - Marco Antoniotti
- Department of Informatics, Systems and Communication, University of Milan-Bicocca, Milan, Italy
- Bicocca Bioinformatics, Biostatistics and Bioimaging Centre – B4, Milan, Italy
| | - Rocco Piazza
- Department of Medicine and Surgery, University of Milan-Bicocca, Monza, Italy
- Bicocca Bioinformatics, Biostatistics and Bioimaging Centre – B4, Milan, Italy
| | - Alex Graudenzi
- Department of Informatics, Systems and Communication, University of Milan-Bicocca, Milan, Italy
- Institute of Molecular Bioimaging and Physiology, National Research Council (IBFM-CNR), Segrate, Milan, Italy
- Bicocca Bioinformatics, Biostatistics and Bioimaging Centre – B4, Milan, Italy
- Corresponding author
| |
Collapse
|
13
|
Host adaptation of codon usage in SARS-CoV-2 from mammals indicates potential natural selection and viral fitness. Arch Virol 2022; 167:2677-2688. [PMID: 36166106 PMCID: PMC9514192 DOI: 10.1007/s00705-022-05612-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2021] [Accepted: 08/19/2022] [Indexed: 12/14/2022]
Abstract
SARS-CoV-2 infection, which is the cause of the COVID-19 pandemic, has expanded across various animal hosts, and the virus can be transmitted particularly efficiently in minks. It is still not clear how SARS-CoV-2 is selected and evolves in its hosts, or how mutations affect viral fitness. In this report, sequences of SARS-CoV-2 isolated from human and animal hosts were analyzed, and the binding energy and capacity of the spike protein to bind human ACE2 and the mink receptor were compared. Codon adaptation index (CAI) analysis indicated the optimization of viral codons in some animals such as bats and minks, and a neutrality plot demonstrated that natural selection had a greater influence on some SARS-CoV-2 sequences than mutational pressure. Molecular dynamics simulation results showed that the mutations Y453F and N501T in mink SARS-CoV-2 could enhance the binding of the viral spike to the mink receptor, indicating the involvement of these mutations in natural selection and viral fitness. Receptor binding analysis revealed that the mink SARS-CoV-2 spike interacted more strongly with the mink receptor than the human receptor. Tracking the variations and codon bias of SARS-CoV-2 is helpful for understanding the fitness of the virus in virus transmission, pathogenesis, and immune evasion.
Collapse
|