1
|
Elkin ME, Zhu X. Paying attention to the SARS-CoV-2 dialect : a deep neural network approach to predicting novel protein mutations. Commun Biol 2025; 8:98. [PMID: 39838059 PMCID: PMC11751191 DOI: 10.1038/s42003-024-07262-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2024] [Accepted: 11/13/2024] [Indexed: 01/23/2025] Open
Abstract
Predicting novel mutations has long-lasting impacts on life science research. Traditionally, this problem is addressed through wet-lab experiments, which are often expensive and time consuming. The recent advancement in neural language models has provided stunning results in modeling and deciphering sequences. In this paper, we propose a Deep Novel Mutation Search (DNMS) method, using deep neural networks, to model protein sequence for mutation prediction. We use SARS-CoV-2 spike protein as the target and use a protein language model to predict novel mutations. Different from existing research which is often limited to mutating the reference sequence for prediction, we propose a parent-child mutation prediction paradigm where a parent sequence is modeled for mutation prediction. Because mutations introduce changing context to the underlying sequence, DNMS models three aspects of the protein sequences: semantic changes, grammatical changes, and attention changes, each modeling protein sequence aspects from shifting of semantics, grammar coherence, and amino-acid interactions in latent space. A ranking approach is proposed to combine all three aspects to capture mutations demonstrating evolving traits, in accordance with real-world SARS-CoV-2 spike protein sequence evolution. DNMS can be adopted for an early warning variant detection system, creating public health awareness of future SARS-CoV-2 mutations.
Collapse
Affiliation(s)
- Magdalyn E Elkin
- Dept. Electrical Engineering and Computer Science, Florida Atlantic University, 777 Glades Road, Boca Raton, FL, 33431, USA.
| | - Xingquan Zhu
- Dept. Electrical Engineering and Computer Science, Florida Atlantic University, 777 Glades Road, Boca Raton, FL, 33431, USA.
| |
Collapse
|
2
|
Duesterwald L, Nguyen M, Christensen P, Long SW, Olsen RJ, Musser JM, Davis JJ. Using intrahost single nucleotide variant data to predict SARS-CoV-2 detection cycle threshold values. PLoS One 2024; 19:e0312686. [PMID: 39475880 PMCID: PMC11524481 DOI: 10.1371/journal.pone.0312686] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2024] [Accepted: 10/10/2024] [Indexed: 11/02/2024] Open
Abstract
Over the last four years, each successive wave of the COVID-19 pandemic has been caused by variants with mutations that improve the transmissibility of the virus. Despite this, we still lack tools for predicting clinically important features of the virus. In this study, we show that it is possible to predict the PCR cycle threshold (Ct) values from clinical detection assays using sequence data. Ct values often correspond with patient viral load and the epidemiological trajectory of the pandemic. Using a collection of 36,335 high quality genomes, we built models from SARS-CoV-2 intrahost single nucleotide variant (iSNV) data, computing XGBoost models from the frequencies of A, T, G, C, insertions, and deletions at each position relative to the Wuhan-Hu-1 reference genome. Our best model had an R2 of 0.604 [0.593-0.616, 95% confidence interval] and a Root Mean Square Error (RMSE) of 5.247 [5.156-5.337], demonstrating modest predictive power. Overall, we show that the results are stable relative to an external holdout set of genomes selected from SRA and are robust to patient status and the detection instruments that were used. This study highlights the importance of developing modeling strategies that can be applied to publicly available genome sequence data for use in disease prevention and control.
Collapse
Affiliation(s)
- Lea Duesterwald
- College of Engineering, Cornell University, Ithaca, NY, United States of America
- Northwestern-Argonne Institute for Science and Engineering, Evanston, IL, United States of America
| | - Marcus Nguyen
- Northwestern-Argonne Institute for Science and Engineering, Evanston, IL, United States of America
- Data Science and Learning Division, Argonne National Laboratory, Lemont, IL, United States of America
- Consortium for Advanced Science and Engineering, University of Chicago, Chicago, IL, United States of America
| | - Paul Christensen
- Laboratory of Human Molecular and Translational Human Infectious Diseases, Center for Infectious Diseases, Houston Methodist Research Institute and Department of Pathology and Genomic Medicine, Houston Methodist Hospital, Houston, TX, United States of America
- Department of Pathology and Laboratory Medicine, Weill Cornell Medical College, New York City, NY, United States of America
| | - S. Wesley Long
- Laboratory of Human Molecular and Translational Human Infectious Diseases, Center for Infectious Diseases, Houston Methodist Research Institute and Department of Pathology and Genomic Medicine, Houston Methodist Hospital, Houston, TX, United States of America
- Department of Pathology and Laboratory Medicine, Weill Cornell Medical College, New York City, NY, United States of America
| | - Randall J. Olsen
- Laboratory of Human Molecular and Translational Human Infectious Diseases, Center for Infectious Diseases, Houston Methodist Research Institute and Department of Pathology and Genomic Medicine, Houston Methodist Hospital, Houston, TX, United States of America
- Department of Pathology and Laboratory Medicine, Weill Cornell Medical College, New York City, NY, United States of America
| | - James M. Musser
- Laboratory of Human Molecular and Translational Human Infectious Diseases, Center for Infectious Diseases, Houston Methodist Research Institute and Department of Pathology and Genomic Medicine, Houston Methodist Hospital, Houston, TX, United States of America
- Department of Pathology and Laboratory Medicine, Weill Cornell Medical College, New York City, NY, United States of America
| | - James J. Davis
- Northwestern-Argonne Institute for Science and Engineering, Evanston, IL, United States of America
- Data Science and Learning Division, Argonne National Laboratory, Lemont, IL, United States of America
- Consortium for Advanced Science and Engineering, University of Chicago, Chicago, IL, United States of America
| |
Collapse
|
3
|
Bendix AF, Trentin AB, Vasconcelos MW, Pilonetto JC, Kuhn BC, Leite DCDA, De Barros FRO, Cardoso JMK, Gabiatti NC, Wendt SN, Ghisi NDC. From chaos to clarity: The scientometric breakthrough in COVID-19 research. Diagn Microbiol Infect Dis 2024; 110:116438. [PMID: 39047387 DOI: 10.1016/j.diagmicrobio.2024.116438] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2024] [Revised: 07/02/2024] [Accepted: 07/10/2024] [Indexed: 07/27/2024]
Abstract
BACKGROUND The COVID-19 pandemic paralyzed the world for over three years, generating unprecedented social changes in recent human history. AIMS We aimed to scientometrically summarize a global and temporal overview of publications on COVID-19 in the two worst years of the pandemic and its progression in early 2022, after the start of vaccination. METHODS Using the Web of Science database, this review covered the period from late 2019 to March 2022 and included all publications identified using the following terms: "SARS-CoV-2", "COVID-19", "Coronavirus Disease 19", and "2019-nCoV". We retrieved 268,904 publications, with evident global spreading, demonstrating that the pandemic triggered worldwide scientific research efforts. RESULTS Within the dataset, 195 countries have published about Covid-19. In initial publications, a solid trend in genotyping, sequencing, and detection of the virus was evident; however, in the development of the pandemic, new knowledge and research focus gained relevance, with continental solid trends, revealed by the keywords sustainability (eastern Europe); material sciences (Asia); public and mental health (Africa); information sciences (western Europe); education (Latin America). It identified high-impact research, mainly on diagnosis and vaccines, but also equally essential topics for returning life to the new normal, such as mental health, education, and remote work. The world experienced a highly transmissible infection that proved how fragile we are regarding organization and society. CONCLUSIONS It is necessary to learn from such an event and establish a protocol of actions and measures to be taken and avoided in a health emergency, aiming to act differently from the chaos experienced during the pandemic. Following the One Health approach, humanity must be aware of the need for more sustainable attitudes, given the inseparability of human beings from the environment.
Collapse
Affiliation(s)
- Andre Felipe Bendix
- Programa de Pós-Graduação em Biotecnologia - PPGBIOTEC, Universidade Tecnológica Federal do Paraná, Dois Vizinhos, Brasil; Dois Vizinhos/ Laboratório Multiusuário de Análises Biológicas e Biologia Molecular (BioMol) - UTFPR, Grupo de Pesquisa em Biologia Molecular - UTFPR, Brasil
| | - Alex Batista Trentin
- Programa de Pós-Graduação em Biotecnologia - PPGBIOTEC, Universidade Tecnológica Federal do Paraná, Dois Vizinhos, Brasil; Dois Vizinhos/ Laboratório Multiusuário de Análises Biológicas e Biologia Molecular (BioMol) - UTFPR, Grupo de Pesquisa em Biologia Molecular - UTFPR, Brasil
| | - Marina Wust Vasconcelos
- Dois Vizinhos/ Laboratório Multiusuário de Análises Biológicas e Biologia Molecular (BioMol) - UTFPR, Grupo de Pesquisa em Biologia Molecular - UTFPR, Brasil; Programa de Pós-Graduação em Genética (PPGGEN), Universidade Federal do Paraná, Curitiba, Brasil
| | - Jessica Cousseau Pilonetto
- Programa de Pós-Graduação em Biotecnologia - PPGBIOTEC, Universidade Tecnológica Federal do Paraná, Dois Vizinhos, Brasil; Dois Vizinhos/ Laboratório Multiusuário de Análises Biológicas e Biologia Molecular (BioMol) - UTFPR, Grupo de Pesquisa em Biologia Molecular - UTFPR, Brasil
| | - Betty Cristiane Kuhn
- Coordenação do Curso de Engenharia de Bioprocessos e Biotecnologia, Universidade Tecnológica Federal do Paraná, Brasil; Dois Vizinhos/ Laboratório Multiusuário de Análises Biológicas e Biologia Molecular (BioMol) - UTFPR, Grupo de Pesquisa em Biologia Molecular - UTFPR, Brasil
| | - Deborah Catharine De Assis Leite
- Programa de Pós-Graduação em Biotecnologia - PPGBIOTEC, Universidade Tecnológica Federal do Paraná, Dois Vizinhos, Brasil; Dois Vizinhos/ Laboratório Multiusuário de Análises Biológicas e Biologia Molecular (BioMol) - UTFPR, Grupo de Pesquisa em Biologia Molecular - UTFPR, Brasil; Programa de Pós-Graduação em Tecnologias Computacionais para o Agronegócio-PPGTCA, Universidade Tecnológica Federal do Paraná, Medianeira, Brasil
| | - Flavia Regina Oliveira De Barros
- Coordenação do Curso de Engenharia de Bioprocessos e Biotecnologia, Universidade Tecnológica Federal do Paraná, Brasil; Programa de Pós-Graduação em Zootecnia (PPZ), Universidade Tecnológica Federal do Paraná, Dois Vizinhos, Brasil; Dois Vizinhos/ Laboratório Multiusuário de Análises Biológicas e Biologia Molecular (BioMol) - UTFPR, Grupo de Pesquisa em Biologia Molecular - UTFPR, Brasil
| | - Juliana Morini Küpper Cardoso
- Dois Vizinhos/ Laboratório Multiusuário de Análises Biológicas e Biologia Molecular (BioMol) - UTFPR, Grupo de Pesquisa em Biologia Molecular - UTFPR, Brasil
| | - Naiana Cristine Gabiatti
- Programa de Pós-Graduação em Biotecnologia - PPGBIOTEC, Universidade Tecnológica Federal do Paraná, Dois Vizinhos, Brasil; Dois Vizinhos/ Laboratório Multiusuário de Análises Biológicas e Biologia Molecular (BioMol) - UTFPR, Grupo de Pesquisa em Biologia Molecular - UTFPR, Brasil
| | - Simone Neumann Wendt
- Coordenação do Curso de Engenharia Florestal, Universidade Tecnológica Federal do Paraná, Brasil; Dois Vizinhos/ Laboratório Multiusuário de Análises Biológicas e Biologia Molecular (BioMol) - UTFPR, Grupo de Pesquisa em Biologia Molecular - UTFPR, Brasil
| | - Nédia de Castilhos Ghisi
- Programa de Pós-Graduação em Biotecnologia - PPGBIOTEC, Universidade Tecnológica Federal do Paraná, Dois Vizinhos, Brasil; Dois Vizinhos/ Laboratório Multiusuário de Análises Biológicas e Biologia Molecular (BioMol) - UTFPR, Grupo de Pesquisa em Biologia Molecular - UTFPR, Brasil.
| |
Collapse
|
4
|
Nunes MC, Thommes E, Fröhlich H, Flahault A, Arino J, Baguelin M, Biggerstaff M, Bizel-Bizellot G, Borchering R, Cacciapaglia G, Cauchemez S, Barbier--Chebbah A, Claussen C, Choirat C, Cojocaru M, Commaille-Chapus C, Hon C, Kong J, Lambert N, Lauer KB, Lehr T, Mahe C, Marechal V, Mebarki A, Moghadas S, Niehus R, Opatowski L, Parino F, Pruvost G, Schuppert A, Thiébaut R, Thomas-Bachli A, Viboud C, Wu J, Crépey P, Coudeville L. Redefining pandemic preparedness: Multidisciplinary insights from the CERP modelling workshop in infectious diseases, workshop report. Infect Dis Model 2024; 9:501-518. [PMID: 38445252 PMCID: PMC10912817 DOI: 10.1016/j.idm.2024.02.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2024] [Revised: 02/07/2024] [Accepted: 02/16/2024] [Indexed: 03/07/2024] Open
Abstract
In July 2023, the Center of Excellence in Respiratory Pathogens organized a two-day workshop on infectious diseases modelling and the lessons learnt from the Covid-19 pandemic. This report summarizes the rich discussions that occurred during the workshop. The workshop participants discussed multisource data integration and highlighted the benefits of combining traditional surveillance with more novel data sources like mobility data, social media, and wastewater monitoring. Significant advancements were noted in the development of predictive models, with examples from various countries showcasing the use of machine learning and artificial intelligence in detecting and monitoring disease trends. The role of open collaboration between various stakeholders in modelling was stressed, advocating for the continuation of such partnerships beyond the pandemic. A major gap identified was the absence of a common international framework for data sharing, which is crucial for global pandemic preparedness. Overall, the workshop underscored the need for robust, adaptable modelling frameworks and the integration of different data sources and collaboration across sectors, as key elements in enhancing future pandemic response and preparedness.
Collapse
Affiliation(s)
- Marta C. Nunes
- Center of Excellence in Respiratory Pathogens (CERP), Hospices Civils de Lyon (HCL) and Centre International de Recherche en Infectiologie (CIRI), Équipe Santé Publique, Épidémiologie et Écologie Évolutive des Maladies Infectieuses (PHE3ID), Inserm U1111, CNRS UMR5308, ENS de Lyon, Université Claude Bernard Lyon 1, Lyon, France
- South African Medical Research Council, Vaccines & Infectious Diseases Analytics Research Unit, Faculty of Health Sciences, University of the Witwatersrand, Johannesburg, South Africa
| | - Edward Thommes
- New Products and Innovation (NPI), Sanofi Vaccines (Global), Toronto, Ontario, Canada
- Department of Mathematics and Statistics, University of Guelph, Guelph, Ontario, Canada
| | - Holger Fröhlich
- Fraunhofer Institute for Algorithms and Scientific Computing (SCAI), Department of Bioinformatics, Schloss Birlinghoven, Sankt Augustin, Germany
- University of Bonn, Bonn-Aachen International Center for IT (b-it), Bonn, Germany
| | - Antoine Flahault
- Institute of Global Health, Faculty of Medicine, University of Geneva, Geneva, Switzerland and Swiss School of Public Health, Zürich, Switzerland
| | - Julien Arino
- Department of Mathematics, University of Manitoba, Winnipeg, Manitoba, Canada
| | - Marc Baguelin
- MRC Centre for Global Infectious Disease Analysis, School of Public Health, Imperial College London, London, UK
- Centre for Mathematical Modelling of Infectious Diseases, Department of Infectious Disease Epidemiology, London School of Hygiene & Tropical Medicine, London, UK
| | - Matthew Biggerstaff
- National Center for Immunization and Respiratory Diseases (NCIRD), US Centers for Disease Control and Prevention (CDC), Atlanta, GA, USA
| | - Gaston Bizel-Bizellot
- Departement of Computational Biology, Departement of Global Health, Institut Pasteur, Paris, France
| | - Rebecca Borchering
- National Center for Immunization and Respiratory Diseases (NCIRD), US Centers for Disease Control and Prevention (CDC), Atlanta, GA, USA
| | - Giacomo Cacciapaglia
- Institut de Physique des Deux Infinis de Lyon (IP2I), UMR5822, IN2P3/CNRS, Université Claude Bernard Lyon 1, Villeurbanne, France
| | - Simon Cauchemez
- Mathematical Modelling of Infectious Diseases Unit, Institut Pasteur, Université Paris Cité, UMR2000 CNRS, Paris, France
| | - Alex Barbier--Chebbah
- Decision and Bayesian Computation, Institut Pasteur, Université Paris Cité, CNRS UMR 3571, France
| | - Carsten Claussen
- Fraunhofer-Institute for Translational Medicine and Pharmacology, Hamburg, Germany
| | - Christine Choirat
- Institute of Global Health, Faculty of Medicine, University of Geneva, Switzerland
| | - Monica Cojocaru
- Mathematics & Statistics Department, College of Engineering and Physical Sciences, University of Guelph, Guelph, Ontario, Canada
| | | | - Chitin Hon
- Respiratory Disease AI Laboratory on Epidemic Intelligence and Medical Big Data Instrument Applications, Department of Engineering Science, Faculty of Innovation Engineering, Macau University of Science and Technology, Taipa, Macau, China
| | - Jude Kong
- Africa-Canada Artificial Intelligence and Data Innovation Consortium (ACADIC), Global South Artificial Intelligence for Pandemic and Epidemic Preparedness and Response Network (AI4PEP), Laboratory for Industrial and Applied Mathematics (LIAM), Department of Mathematics and Statistics, York University, Toronto, Ontario, Canada
| | | | | | - Thorsten Lehr
- Clinical Pharmacy, Saarland University, Saarbrücken, Germany
| | | | - Vincent Marechal
- Sorbonne Université, INSERM, Centre de Recherche Saint-Antoine, Paris, France
| | | | - Seyed Moghadas
- Agent-Based Modelling Laboratory, York University, Toronto, Ontario, Canada
| | - Rene Niehus
- European Centre for Disease Prevention and Control (ECDC), Stockholm, Sweden
| | - Lulla Opatowski
- UMR 1018, Team “Anti-infective Evasion and Pharmacoepidemiology”, Université Paris-Saclay, UVSQ, INSERM, France
- Epidemiology and Modelling of Antibiotic Evasion, Institut Pasteur, Université Paris Cité, Paris, France
| | - Francesco Parino
- Sorbonne Université, INSERM, Pierre Louis Institute of Epidemiology and Public Health, Paris, France
| | | | - Andreas Schuppert
- Institute for Computational Biomedicine, RWTH Aachen University, Aachen, Germany
| | - Rodolphe Thiébaut
- Bordeaux University, Department of Public Health, Inserm UMR 1219 Bordeaux Population Health Research Center, Inria SISTM, Bordeaux, France
| | | | - Cecile Viboud
- Fogarty International Center, National Institutes of Health, Bethesda, MD, USA
| | - Jianhong Wu
- York Emergency Mitigation, Engagement, Response, and Governance Institute, Laboratory for Industrial and Applied Mathematics, York University, Toronto, Ontario, Canada
| | - Pascal Crépey
- EHESP, Université de Rennes, CNRS, IEP Rennes, Arènes - UMR 6051, RSMS – Inserm U 1309, Rennes, France
| | | |
Collapse
|
5
|
Alfonsi T, Bernasconi A, Chiara M, Ceri S. Data-driven recombination detection in viral genomes. Nat Commun 2024; 15:3313. [PMID: 38632281 PMCID: PMC11024102 DOI: 10.1038/s41467-024-47464-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2023] [Accepted: 03/25/2024] [Indexed: 04/19/2024] Open
Abstract
Recombination is a key molecular mechanism for the evolution and adaptation of viruses. The first recombinant SARS-CoV-2 genomes were recognized in 2021; as of today, more than ninety SARS-CoV-2 lineages are designated as recombinant. In the wake of the COVID-19 pandemic, several methods for detecting recombination in SARS-CoV-2 have been proposed; however, none could faithfully confirm manual analyses by experts in the field. We hereby present RecombinHunt, an original data-driven method for the identification of recombinant genomes, capable of recognizing recombinant SARS-CoV-2 genomes (or lineages) with one or two breakpoints with high accuracy and within reduced turn-around times. ReconbinHunt shows high specificity and sensitivity, compares favorably with other state-of-the-art methods, and faithfully confirms manual analyses by experts. RecombinHunt identifies recombinant viral genomes from the recent monkeypox epidemic in high concordance with manually curated analyses by experts, suggesting that our approach is robust and can be applied to any epidemic/pandemic virus.
Collapse
Affiliation(s)
- Tommaso Alfonsi
- Department of Electronics, Information, and Bioengineering, Politecnico di Milano, Via Ponzio 34/5, 20133, Milan, Italy
| | - Anna Bernasconi
- Department of Electronics, Information, and Bioengineering, Politecnico di Milano, Via Ponzio 34/5, 20133, Milan, Italy.
| | - Matteo Chiara
- Department of Biosciences, Università degli Studi di Milano, Via Celoria 26, 20133, Milan, Italy
| | - Stefano Ceri
- Department of Electronics, Information, and Bioengineering, Politecnico di Milano, Via Ponzio 34/5, 20133, Milan, Italy
| |
Collapse
|
6
|
Pinoli P, Canakoglu A, Ceri S, Chiara M, Ferrandi E, Minotti L, Bernasconi A. VariantHunter: a method and tool for fast detection of emerging SARS-CoV-2 variants. Database (Oxford) 2023; 2023:baad044. [PMID: 37410916 PMCID: PMC10325486 DOI: 10.1093/database/baad044] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2023] [Revised: 05/31/2023] [Accepted: 06/16/2023] [Indexed: 07/08/2023]
Abstract
With the progression of the COVID-19 pandemic, large datasets of SARS-CoV-2 genome sequences were collected to closely monitor the evolution of the virus and identify the novel variants/strains. By analyzing genome sequencing data, health authorities can 'hunt' novel emerging variants of SARS-CoV-2 as early as possible, and then monitor their evolution and spread. We designed VariantHunter, a highly flexible and user-friendly tool for systematically monitoring the evolution of SARS-CoV-2 at global and regional levels. In VariantHunter, amino acid changes are analyzed over an interval of 4 weeks in an arbitrary geographical area (continent, country, or region); for every week in the interval, the prevalence is computed and changes are ranked based on their increase or decrease in prevalence. VariantHunter supports two main types of analysis: lineage-independent and lineage-specific. The former considers all the available data and aims to discover new viral variants. The latter evaluates specific lineages/viral variants to identify novel candidate designations (sub-lineages and sub-variants). Both analyses use simple statistics and visual representations (diffusion charts and heatmaps) to track viral evolution. A dataset explorer allows users to visualize available data and refine their selection. VariantHunter is a web application free to all users. The two types of supported analysis (lineage-independent and lineage-specific) allow user-friendly monitoring of the viral evolution, empowering genomic surveillance without requiring any computational background. Database URL http://gmql.eu/variant_hunter/.
Collapse
Affiliation(s)
- Pietro Pinoli
- Department of Electronics, Information, and Bioengineering, Politecnico di Milano, Via Ponzio 34/5, Milan 20133, Italy
| | - Arif Canakoglu
- Department of Anesthesia, Critical Care and Emergency, Fondazione IRCCS Ca’ Granda Ospedale Maggiore Policlinico, Via Francesco Sforza 28, Milan 20122, Italy
| | - Stefano Ceri
- Department of Electronics, Information, and Bioengineering, Politecnico di Milano, Via Ponzio 34/5, Milan 20133, Italy
| | - Matteo Chiara
- Department of Biosciences, University of Milan, Via Celoria 26, Milan 20133, Italy
- Institute of Biomembranes, Bioenergetics and Molecular Biotechnology, Consiglio Nazionale delle Ricerche, via Amendola 122/O, Bari 70126, Italy
| | - Erika Ferrandi
- Department of Biosciences, University of Milan, Via Celoria 26, Milan 20133, Italy
- Institute of Biomembranes, Bioenergetics and Molecular Biotechnology, Consiglio Nazionale delle Ricerche, via Amendola 122/O, Bari 70126, Italy
| | - Luca Minotti
- Department of Electronics, Information, and Bioengineering, Politecnico di Milano, Via Ponzio 34/5, Milan 20133, Italy
| | - Anna Bernasconi
- Department of Electronics, Information, and Bioengineering, Politecnico di Milano, Via Ponzio 34/5, Milan 20133, Italy
| |
Collapse
|
7
|
Zhao LP, Cohen S, Zhao M, Madeleine M, Payne TH, Lybrand TP, Geraghty DE, Jerome KR, Corey L. Using Haplotype-Based Artificial Intelligence to Evaluate SARS-CoV-2 Novel Variants and Mutations. JAMA Netw Open 2023; 6:e230191. [PMID: 36809468 PMCID: PMC9945077 DOI: 10.1001/jamanetworkopen.2023.0191] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/25/2022] [Accepted: 01/05/2023] [Indexed: 02/23/2023] Open
Abstract
Importance Earlier detection of emerging novel SARS-COV-2 variants is important for public health surveillance of potential viral threats and for earlier prevention research. Artificial intelligence may facilitate early detection of SARS-CoV2 emerging novel variants based on variant-specific mutation haplotypes and, in turn, be associated with enhanced implementation of risk-stratified public health prevention strategies. Objective To develop a haplotype-based artificial intelligence (HAI) model for identifying novel variants, including mixture variants (MVs) of known variants and new variants with novel mutations. Design, Setting, and Participants This cross-sectional study used serially observed viral genomic sequences globally (prior to March 14, 2022) to train and validate the HAI model and used it to identify variants arising from a prospective set of viruses from March 15 to May 18, 2022. Main Outcomes and Measures Viral sequences, collection dates, and locations were subjected to statistical learning analysis to estimate variant-specific core mutations and haplotype frequencies, which were then used to construct an HAI model to identify novel variants. Results Through training on more than 5 million viral sequences, an HAI model was built, and its identification performance was validated on an independent validation set of more than 5 million viruses. Its identification performance was assessed on a prospective set of 344 901 viruses. In addition to achieving an accuracy of 92.8% (95% CI within 0.1%), the HAI model identified 4 Omicron MVs (Omicron-Alpha, Omicron-Delta, Omicron-Epsilon, and Omicron-Zeta), 2 Delta MVs (Delta-Kappa and Delta-Zeta), and 1 Alpha-Epsilon MV, among which Omicron-Epsilon MVs were most frequent (609/657 MVs [92.7%]). Furthermore, the HAI model found that 1699 Omicron viruses had unidentifiable variants given that these variants acquired novel mutations. Lastly, 524 variant-unassigned and variant-unidentifiable viruses carried 16 novel mutations, 8 of which were increasing in prevalence percentages as of May 2022. Conclusions and Relevance In this cross-sectional study, an HAI model found SARS-COV-2 viruses with MV or novel mutations in the global population, which may require closer examination and monitoring. These results suggest that HAI may complement phylogenic variant assignment, providing additional insights into emerging novel variants in the population.
Collapse
Affiliation(s)
- Lue Ping Zhao
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, Washington
| | - Seth Cohen
- Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, Washington
- Department of Medicine, University of Washington School of Medicine, Seattle
| | | | - Margaret Madeleine
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, Washington
| | - Thomas H. Payne
- Department of Medicine, University of Washington School of Medicine, Seattle
| | - Terry P. Lybrand
- Quintepa Computing LLC, Nashville, Tennessee
- Department of Chemistry, Vanderbilt University; Nashville, Tennessee
| | - Daniel E. Geraghty
- Clinical Research Division, Fred Hutchinson Cancer Research Center, Seattle, Washington
| | - Keith R. Jerome
- Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, Washington
- Department of Medicine, University of Washington School of Medicine, Seattle
| | - Lawrence Corey
- Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, Washington
- Department of Medicine, University of Washington School of Medicine, Seattle
| |
Collapse
|
8
|
Milanesi S, Rosset F, Colaneri M, Giordano G, Pesenti K, Blanchini F, Bolzern P, Colaneri P, Sacchi P, De Nicolao G, Bruno R. Early detection of variants of concern via funnel plots of regional reproduction numbers. Sci Rep 2023; 13:1052. [PMID: 36658143 PMCID: PMC9852294 DOI: 10.1038/s41598-022-27116-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2022] [Accepted: 12/26/2022] [Indexed: 01/20/2023] Open
Abstract
Early detection of the emergence of a new variant of concern (VoC) is essential to develop strategies that contain epidemic outbreaks. For example, knowing in which region a VoC starts spreading enables prompt actions to circumscribe the geographical area where the new variant can spread, by containing it locally. This paper presents 'funnel plots' as a statistical process control method that, unlike tools whose purpose is to identify rises of the reproduction number ([Formula: see text]), detects when a regional [Formula: see text] departs from the national average and thus represents an anomaly. The name of the method refers to the funnel-like shape of the scatter plot that the data take on. Control limits with prescribed false alarm rate are derived from the observation that regional [Formula: see text]'s are normally distributed with variance inversely proportional to the number of infectious cases. The method is validated on public COVID-19 data demonstrating its efficacy in the early detection of SARS-CoV-2 variants in India, South Africa, England, and Italy, as well as of a malfunctioning episode of the diagnostic infrastructure in England, during which the Immensa lab in Wolverhampton gave 43,000 incorrect negative tests relative to South West and West Midlands territories.
Collapse
Affiliation(s)
- Simone Milanesi
- Department of Mathematics, University of Pavia, Pavia, Italy
| | - Francesca Rosset
- Department of Mathematics, Computer Science and Physics, University of Udine, Udine, Italy
| | - Marta Colaneri
- Division of Infectious Diseases I, Fondazione IRCCS Policlinico San Matteo, Pavia, Italy
| | - Giulia Giordano
- Department of Industrial Engineering, University of Trento, Trento, Italy
| | - Kenneth Pesenti
- Department of Surgical Medical and Health Sciences, University of Trieste, Trieste, Italy
| | - Franco Blanchini
- Department of Mathematics, Computer Science and Physics, University of Udine, Udine, Italy
| | - Paolo Bolzern
- Department of Electronics, Information and Bioengineering, Politecnico di Milano, Milan, Italy
| | - Patrizio Colaneri
- Department of Electronics, Information and Bioengineering, Politecnico di Milano, Milan, Italy
- Institute of Electronics, Information Engineering and Telecommunication (IEIIT), Italian National Research Council (CNR), Turin, Italy
| | - Paolo Sacchi
- Division of Infectious Diseases I, Fondazione IRCCS Policlinico San Matteo, Pavia, Italy
| | - Giuseppe De Nicolao
- Division of Infectious Diseases I, Fondazione IRCCS Policlinico San Matteo, Pavia, Italy.
- Department of Electrical, Computer and Biomedical Engineering, University of Pavia, Pavia, Italy.
| | - Raffaele Bruno
- Division of Infectious Diseases I, Fondazione IRCCS Policlinico San Matteo, Pavia, Italy
- Department of Clinical, Surgical, Diagnostic, and Pediatric Sciences, University of Pavia, Pavia, Italy
| |
Collapse
|
9
|
Kumar A, Asghar A, Singh HN, Faiq MA, Kumar S, Narayan RK, Kumar G, Dwivedi P, Sahni C, Jha RK, Kulandhasamy M, Prasoon P, Sesham K, Kant K, Pandey SN. SARS-CoV-2 Omicron Variant Genomic Sequences and Their Epidemiological Correlates Regarding the End of the Pandemic: In Silico Analysis. JMIR BIOINFORMATICS AND BIOTECHNOLOGY 2023; 4:e42700. [PMID: 36688013 PMCID: PMC9843602 DOI: 10.2196/42700] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/15/2022] [Revised: 11/29/2022] [Accepted: 12/16/2022] [Indexed: 06/17/2023]
Abstract
Background Emergence of the new SARS-CoV-2 variant B.1.1.529 worried health policy makers worldwide due to a large number of mutations in its genomic sequence, especially in the spike protein region. The World Health Organization (WHO) designated this variant as a global variant of concern (VOC), which was named "Omicron." Following Omicron's emergence, a surge of new COVID-19 cases was reported globally, primarily in South Africa. Objective The aim of this study was to understand whether Omicron had an epidemiological advantage over existing variants. Methods We performed an in silico analysis of the complete genomic sequences of Omicron available on the Global Initiative on Sharing Avian Influenza Data (GISAID) database to analyze the functional impact of the mutations present in this variant on virus-host interactions in terms of viral transmissibility, virulence/lethality, and immune escape. In addition, we performed a correlation analysis of the relative proportion of the genomic sequences of specific SARS-CoV-2 variants (in the period from October 1 to November 29, 2021) with matched epidemiological data (new COVID-19 cases and deaths) from South Africa. Results Compared with the current list of global VOCs/variants of interest (VOIs), as per the WHO, Omicron bears more sequence variation, specifically in the spike protein and host receptor-binding motif (RBM). Omicron showed the closest nucleotide and protein sequence homology with the Alpha variant for the complete sequence and the RBM. The mutations were found to be primarily condensed in the spike region (n=28-48) of the virus. Further mutational analysis showed enrichment for the mutations decreasing binding affinity to angiotensin-converting enzyme 2 receptor and receptor-binding domain protein expression, and for increasing the propensity of immune escape. An inverse correlation of Omicron with the Delta variant was noted (r=-0.99, P<.001; 95% CI -0.99 to -0.97) in the sequences reported from South Africa postemergence of the new variant, subsequently showing a decrease. There was a steep rise in new COVID-19 cases in parallel with the increase in the proportion of Omicron isolates since the report of the first case (74%-100%). By contrast, the incidence of new deaths did not increase (r=-0.04, P>.05; 95% CI -0.52 to 0.58). Conclusions In silico analysis of viral genomic sequences suggests that the Omicron variant has more remarkable immune-escape ability than existing VOCs/VOIs, including Delta, but reduced virulence/lethality than other reported variants. The higher power for immune escape for Omicron was a likely reason for the resurgence in COVID-19 cases and its rapid rise as the globally dominant strain. Being more infectious but less lethal than the existing variants, Omicron could have plausibly led to widespread unnoticed new, repeated, and vaccine breakthrough infections, raising the population-level immunity barrier against the emergence of new lethal variants. The Omicron variant could have thus paved the way for the end of the pandemic.
Collapse
Affiliation(s)
- Ashutosh Kumar
- Department of Anatomy All India Institute of Medical Sciences-Patna Patna India
- Etiologically Elusive Disorders Research Network New Delhi India
| | - Adil Asghar
- Department of Anatomy All India Institute of Medical Sciences-Patna Patna India
- Etiologically Elusive Disorders Research Network New Delhi India
| | - Himanshu N Singh
- Etiologically Elusive Disorders Research Network New Delhi India
- Department of Systems Biology Columbia University Irving Medical Center New York, NY United States
| | - Muneeb A Faiq
- Etiologically Elusive Disorders Research Network New Delhi India
- New York University Langone Health Center Robert I Grossman School of Medicine New York University New York, NY United States
| | - Sujeet Kumar
- Etiologically Elusive Disorders Research Network New Delhi India
- Center for Proteomics and Drug Discovery Amity Institute of Biotechnology Amity University, Maharashtra Mumbai India
| | - Ravi K Narayan
- Etiologically Elusive Disorders Research Network New Delhi India
- Dr BC Roy Multi-speciality Medical Research Centre Indian Institute of Technology Kharagpur India
| | - Gopichand Kumar
- Department of Anatomy All India Institute of Medical Sciences-Patna Patna India
- Etiologically Elusive Disorders Research Network New Delhi India
| | - Prakhar Dwivedi
- Department of Anatomy All India Institute of Medical Sciences-Patna Patna India
- Etiologically Elusive Disorders Research Network New Delhi India
| | - Chetan Sahni
- Etiologically Elusive Disorders Research Network New Delhi India
- Department of Anatomy Institute of Medical Sciences Banaras Hindu University Varanasi India
| | - Rakesh K Jha
- Department of Anatomy All India Institute of Medical Sciences-Patna Patna India
- Etiologically Elusive Disorders Research Network New Delhi India
| | - Maheswari Kulandhasamy
- Etiologically Elusive Disorders Research Network New Delhi India
- Department of Biochemistry Maulana Azad Medical College New Delhi India
| | - Pranav Prasoon
- Etiologically Elusive Disorders Research Network New Delhi India
- School of Medicine University of Pittsburgh Pittsburgh, PA United States
| | - Kishore Sesham
- Etiologically Elusive Disorders Research Network New Delhi India
- Department of Anatomy All India Institute of Medical Sciences-Mangalagiri Mangalagiri India
| | - Kamla Kant
- Etiologically Elusive Disorders Research Network New Delhi India
- Department of Microbiology All India Institute of Medical Sciences-Bathinda Bathinda India
| | - Sada N Pandey
- Etiologically Elusive Disorders Research Network New Delhi India
- Department of Zoology Banaras Hindu University Varanasi India
| |
Collapse
|
10
|
Chrysostomou AC, Vrancken B, Haralambous C, Alexandrou M, Aristokleous A, Christodoulou C, Gregoriou I, Ioannides M, Kalakouta O, Karagiannis C, Koumbaris G, Loizides C, Mendris M, Papastergiou P, Patsalis PC, Pieridou D, Richter J, Schmitt M, Shammas C, Stylianou DC, Themistokleous G, Lemey P, Kostrikis LG. Genomic Epidemiology of the SARS-CoV-2 Epidemic in Cyprus from November 2020 to October 2021: The Passage of Waves of Alpha and Delta Variants of Concern. Viruses 2022; 15:108. [PMID: 36680148 PMCID: PMC9862594 DOI: 10.3390/v15010108] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2022] [Revised: 12/27/2022] [Accepted: 12/28/2022] [Indexed: 01/03/2023] Open
Abstract
The emergence of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) in December 2019 resulted in the coronavirus disease 2019 (COVID-19) pandemic, which has had devastating repercussions for public health. Over the course of this pandemic, the virus has continuously been evolving, resulting in new, more infectious variants that have frequently led to surges of new SARS-CoV-2 infections. In the present study, we performed detailed genetic, phylogenetic, phylodynamic and phylogeographic analyses to examine the SARS-CoV-2 epidemic in Cyprus using 2352 SARS-CoV-2 sequences from infected individuals in Cyprus during November 2020 to October 2021. During this period, a total of 61 different lineages and sublineages were identified, with most falling into three groups: B.1.258 & sublineages, Alpha (B.1.1.7 & Q. sublineages), and Delta (B.1.617.2 & AY. sublineages), each encompassing a set of S gene mutations that primarily confer increased transmissibility as well as immune evasion. Specifically, these lineages were coupled with surges of new infections in Cyprus, resulting in the following: the second wave of SARS-CoV-2 infections in Cyprus, comprising B.1.258 & sublineages, during late autumn 2020/beginning of winter 2021; the third wave, comprising Alpha (B.1.1.7 & Q. sublineages), during spring 2021; and the fourth wave, comprising Delta (B.1.617.2 & AY. sublineages) during summer 2021. Additionally, it was identified that these lineages were primarily imported from and exported to the UK, Greece, and Sweden; many other migration links were also identified, including Switzerland, Denmark, Russia, and Germany. Taken together, the results of this study indicate that the SARS-CoV-2 epidemic in Cyprus was characterized by successive introduction of new lineages from a plethora of countries, resulting in the generation of waves of infection. Overall, this study highlights the importance of investigating the spatiotemporal evolution of the SARS-CoV-2 epidemic in the context of Cyprus, as well as the impact of protective measures placed to mitigate transmission of the virus, providing necessary information to safeguard public health.
Collapse
Affiliation(s)
| | - Bram Vrancken
- Department of Microbiology, Immunology and Transplantation, Rega Institute, KU Leuven, 3000 Leuven, Belgium
- Spatial Epidemiology Lab (SpELL), Université Libre de Bruxelles, 1050 Bruxelles, Belgium
| | - Christos Haralambous
- Unit for Surveillance and Control of Communicable Diseases, Ministry of Health, Nicosia 1148, Cyprus
| | - Maria Alexandrou
- Microbiology Department, Larnaca General Hospital, Larnaca 6301, Cyprus
| | - Antonia Aristokleous
- Department of Biological Sciences, University of Cyprus, Aglantzia, Nicosia 2109, Cyprus
| | - Christina Christodoulou
- Department of Molecular Virology, Cyprus Institute of Neurology and Genetics, Nicosia 2371, Cyprus
| | - Ioanna Gregoriou
- Unit for Surveillance and Control of Communicable Diseases, Ministry of Health, Nicosia 1148, Cyprus
| | | | - Olga Kalakouta
- Unit for Surveillance and Control of Communicable Diseases, Ministry of Health, Nicosia 1148, Cyprus
| | | | | | | | - Michail Mendris
- Microbiology Department, Limassol General Hospital, Limassol 4131, Cyprus
| | | | - Philippos C. Patsalis
- NIPD Genetics, Nicosia 2409, Cyprus
- Medical School, University of Nicosia, Nicosia 2417, Cyprus
| | - Despo Pieridou
- Microbiology Department, Nicosia General Hospital, Nicosia 2029, Cyprus
| | - Jan Richter
- Department of Molecular Virology, Cyprus Institute of Neurology and Genetics, Nicosia 2371, Cyprus
| | - Markus Schmitt
- Eurofins Genomics Sequencing Europe, 85560 Ebersberg, Germany
| | - Christos Shammas
- S.C.I.N.A Bioanalysis Sciomedical Centre Ltd., Limassol 4040, Cyprus
| | - Dora C. Stylianou
- Department of Biological Sciences, University of Cyprus, Aglantzia, Nicosia 2109, Cyprus
| | | | | | - Philippe Lemey
- Department of Microbiology, Immunology and Transplantation, Rega Institute, KU Leuven, 3000 Leuven, Belgium
| | - Leondios G. Kostrikis
- Department of Biological Sciences, University of Cyprus, Aglantzia, Nicosia 2109, Cyprus
- Cyprus Academy of Sciences, Letters, and Arts, 60-68 Phaneromenis Street, Nicosia 1011, Cyprus
| |
Collapse
|
11
|
Dingemans J, van der Veer BMJW, Gorgels KMF, Hackert V, den Heijer CDJ, Hoebe CJPA, Savelkoul PHM, van Alphen LB. Investigating SARS-CoV-2 breakthrough infections per variant and vaccine type. Front Microbiol 2022; 13:1027271. [PMID: 36504818 PMCID: PMC9729533 DOI: 10.3389/fmicb.2022.1027271] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2022] [Accepted: 11/02/2022] [Indexed: 11/27/2022] Open
Abstract
Breakthrough SARS-CoV-2 infections have been reported in fully vaccinated individuals, in spite of the high efficacy of the currently available vaccines, proven in trials and real-world studies. Several variants of concern (VOC) have been proffered to be associated with breakthrough infections following immunization. In this study, we investigated 378 breakthrough infections recorded between January and July 2021 and compared the distribution of SARS-CoV-2 genotypes identified in 225 fully vaccinated individuals to the frequency of circulating community lineages in the region of South Limburg (The Netherlands) in a week-by-week comparison. Although the proportion of breakthrough infections was relatively low and stable when the Alpha variant was predominant, the rapid emergence of the Delta variant lead to a strong increase in breakthrough infections, with a higher relative proportion of individuals vaccinated with Vaxzevria or Jcovden being infected compared to those immunized with mRNA-based vaccines. A significant difference in median age was observed when comparing fully vaccinated individuals with severe symptoms (83 years) to asymptomatic cases (46.5 years) or individuals with mild-to-moderate symptoms (42 years). There was no association between SARS-CoV-2 genotype or vaccine type and disease symptoms. Furthermore, the majority of adaptive mutations were concentrated in the N-terminal domain of the Spike protein, highlighting its role in immune evasion. Interestingly, symptomatic individuals harbored significantly higher SARS-CoV-2 loads than asymptomatic vaccinated individuals and breakthrough infections caused by the Delta variant were associated with increased viral loads compared to those caused by the Alpha variant. In addition, we investigated the role of the Omicron variant in causing breakthrough infections by analyzing 135 samples that were randomly selected for genomic surveillance during the transition period from Delta to Omicron. We found that the proportion of Omicron vs. Delta infections was significantly higher in individuals who received a booster vaccine compared to both unvaccinated and fully vaccinated individuals. Altogether, these results indicate that the emergence of the Delta variant and in particular Omicron has lowered the efficiency of particular vaccine types to prevent SARS-CoV-2 infections and that, although rare, the elderly are particularly at risk of becoming severely infected as the consequence of a breakthrough infection.
Collapse
Affiliation(s)
- Jozef Dingemans
- Department of Medical Microbiology, Infectious diseases and Infection prevention, Care and Public Health Research Institute (CAPHRI), Faculty of Health, Medicine and Life Sciences, Maastricht University Medical Center (MUMC+), Maastricht, Netherlands,*Correspondence: Jozef Dingemans, ; Brian M. J. W. van der Veer,
| | - Brian M. J. W. van der Veer
- Department of Medical Microbiology, Infectious diseases and Infection prevention, Care and Public Health Research Institute (CAPHRI), Faculty of Health, Medicine and Life Sciences, Maastricht University Medical Center (MUMC+), Maastricht, Netherlands,*Correspondence: Jozef Dingemans, ; Brian M. J. W. van der Veer,
| | - Koen M. F. Gorgels
- Department of Sexual Health, Infectious Diseases and Environmental Health, South Limburg Public Health Service, Heerlen, Netherlands
| | - Volker Hackert
- Department of Sexual Health, Infectious Diseases and Environmental Health, South Limburg Public Health Service, Heerlen, Netherlands,Department of Social Medicine, Care and Public Health Research Institute (CAPHRI), Faculty of Health, Medicine and Life Sciences, Maastricht University, Maastricht, Netherlands
| | - Casper D. J. den Heijer
- Department of Sexual Health, Infectious Diseases and Environmental Health, South Limburg Public Health Service, Heerlen, Netherlands,Department of Social Medicine, Care and Public Health Research Institute (CAPHRI), Faculty of Health, Medicine and Life Sciences, Maastricht University, Maastricht, Netherlands
| | - Christian J. P. A Hoebe
- Department of Medical Microbiology, Infectious diseases and Infection prevention, Care and Public Health Research Institute (CAPHRI), Faculty of Health, Medicine and Life Sciences, Maastricht University Medical Center (MUMC+), Maastricht, Netherlands,Department of Sexual Health, Infectious Diseases and Environmental Health, South Limburg Public Health Service, Heerlen, Netherlands,Department of Social Medicine, Care and Public Health Research Institute (CAPHRI), Faculty of Health, Medicine and Life Sciences, Maastricht University, Maastricht, Netherlands
| | - Paul H. M. Savelkoul
- Department of Medical Microbiology, Infectious diseases and Infection prevention, Care and Public Health Research Institute (CAPHRI), Faculty of Health, Medicine and Life Sciences, Maastricht University Medical Center (MUMC+), Maastricht, Netherlands
| | - Lieke B. van Alphen
- Department of Medical Microbiology, Infectious diseases and Infection prevention, Care and Public Health Research Institute (CAPHRI), Faculty of Health, Medicine and Life Sciences, Maastricht University Medical Center (MUMC+), Maastricht, Netherlands
| |
Collapse
|
12
|
Al Khalaf R, Bernasconi A, Pinoli P, Ceri S. Analysis of co-occurring and mutually exclusive amino acid changes and detection of convergent and divergent evolution events in SARS-CoV-2. Comput Struct Biotechnol J 2022; 20:4238-4250. [PMID: 35945925 PMCID: PMC9352683 DOI: 10.1016/j.csbj.2022.07.051] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2022] [Revised: 07/29/2022] [Accepted: 07/29/2022] [Indexed: 11/28/2022] Open
Abstract
The inflation of SARS-CoV-2 lineages with a high number of accumulated mutations (such as the recent case of Omicron) has risen concerns about the evolutionary capacity of this virus. Here, we propose a computational study to examine non-synonymous mutations gathered within genomes of SARS-CoV-2 from the beginning of the pandemic until February 2022. We provide both qualitative and quantitative descriptions of such corpus, focusing on statistically significant co-occurring and mutually exclusive mutations within single genomes. Then, we examine in depth the distributions of mutations over defined lineages and compare those of frequently co-occurring mutation pairs. Based on this comparison, we study mutations' convergence/divergence on the phylogenetic tree. As a result, we identify 1,818 co-occurring pairs of non-synonymous mutations showing at least one event of convergent evolution and 6,625 co-occurring pairs with at least one event of divergent evolution. Notable examples of both types are shown by means of a tree-based representation of lineages, visually capturing mutations' behaviors. Our method confirms several well-known cases; moreover, the provided evidence suggests that our workflow can explain aspects of the future mutational evolution of SARS-CoV-2.
Collapse
Affiliation(s)
- Ruba Al Khalaf
- Dipartimento di Elettronica, Informazione e Bioingegneria, Politecnico di Milano, Milan, Italy
| | - Anna Bernasconi
- Dipartimento di Elettronica, Informazione e Bioingegneria, Politecnico di Milano, Milan, Italy
| | - Pietro Pinoli
- Dipartimento di Elettronica, Informazione e Bioingegneria, Politecnico di Milano, Milan, Italy
| | - Stefano Ceri
- Dipartimento di Elettronica, Informazione e Bioingegneria, Politecnico di Milano, Milan, Italy
| |
Collapse
|
13
|
Cacciapaglia G, Cot C, de Hoffer A, Hohenegger S, Sannino F, Vatani S. Epidemiological theory of virus variants. PHYSICA A 2022; 596:127071. [PMID: 35185268 PMCID: PMC8848575 DOI: 10.1016/j.physa.2022.127071] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/08/2021] [Revised: 12/15/2021] [Indexed: 05/02/2023]
Abstract
We propose a physics-inspired mathematical model underlying the temporal evolution of competing virus variants that relies on the existence of (quasi) fixed points capturing the large time scale invariance of the dynamics. To motivate our result we first modify the time-honoured compartmental models of the SIR type to account for the existence of competing variants and then show how their evolution can be naturally re-phrased in terms of flow equations ending at quasi fixed points. As the natural next step we employ (near) scale invariance to organise the time evolution of the competing variants within the effective description of the epidemic Renormalisation Group framework. We test the resulting theory against the time evolution of COVID-19 virus variants that validate the theory empirically.
Collapse
Affiliation(s)
- Giacomo Cacciapaglia
- Institut de Physique des 2 Infinis (IP2I) de Lyon, CNRS/IN2P3, UMR5822, 69622 Villeurbanne, France
- Université de Lyon, Université Claude Bernard Lyon 1, 69001 Lyon, France
| | - Corentin Cot
- Institut de Physique des 2 Infinis (IP2I) de Lyon, CNRS/IN2P3, UMR5822, 69622 Villeurbanne, France
- Université de Lyon, Université Claude Bernard Lyon 1, 69001 Lyon, France
| | | | - Stefan Hohenegger
- Institut de Physique des 2 Infinis (IP2I) de Lyon, CNRS/IN2P3, UMR5822, 69622 Villeurbanne, France
- Université de Lyon, Université Claude Bernard Lyon 1, 69001 Lyon, France
| | - Francesco Sannino
- Scuola Superiore Meridionale, Largo S. Marcellino, 10, 80138 Napoli NA, Italy
- Dipartimento di Fisica, E. Pancini, Univ. di Napoli, Federico II and INFN sezione di Napoli, Complesso Universitario di Monte S. Angelo Edificio 6, via Cintia, 80126 Napoli, Italy
- CP-Origins and D-IAS, Univ. of Southern Denmark, Campusvej 55, DK-5230 Odense, Denmark
| | - Shahram Vatani
- Institut de Physique des 2 Infinis (IP2I) de Lyon, CNRS/IN2P3, UMR5822, 69622 Villeurbanne, France
- Université de Lyon, Université Claude Bernard Lyon 1, 69001 Lyon, France
| |
Collapse
|
14
|
Kumar A, Asghar A, Dwivedi P, Kumar G, Narayan RK, Jha RK, Parashar R, Sahni C, Pandey SN. A Bioinformatics Tool for Predicting Future COVID-19 Waves Based on a Retrospective Analysis of the Second Wave in India: Model Development Study. JMIR BIOINFORMATICS AND BIOTECHNOLOGY 2022; 3:e36860. [PMID: 36193192 PMCID: PMC9516867 DOI: 10.2196/36860] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/28/2022] [Revised: 08/26/2022] [Accepted: 09/12/2022] [Indexed: 11/13/2022]
Abstract
Background Since the start of the COVID-19 pandemic, health policymakers globally have been attempting to predict an impending wave of COVID-19. India experienced a devastating second wave of COVID-19 in the late first week of May 2021. We retrospectively analyzed the viral genomic sequences and epidemiological data reflecting the emergence and spread of the second wave of COVID-19 in India to construct a prediction model. Objective We aimed to develop a bioinformatics tool that can predict an impending COVID-19 wave. Methods We analyzed the time series distribution of genomic sequence data for SARS-CoV-2 and correlated it with epidemiological data for new cases and deaths for the corresponding period of the second wave. In addition, we analyzed the phylodynamics of circulating SARS-CoV-2 variants in the Indian population during the study period. Results Our prediction analysis showed that the first signs of the arrival of the second wave could be seen by the end of January 2021, about 2 months before its peak in May 2021. By the end of March 2021, it was distinct. B.1.617 lineage variants powered the wave, most notably B.1.617.2 (Delta variant). Conclusions Based on the observations of this study, we propose that genomic surveillance of SARS-CoV-2 variants, complemented with epidemiological data, can be a promising tool to predict impending COVID-19 waves.
Collapse
Affiliation(s)
- Ashutosh Kumar
- Department of Anatomy All India Institute of Medical Sciences - Patna Patna India
| | - Adil Asghar
- Department of Anatomy All India Institute of Medical Sciences - Patna Patna India
| | - Prakhar Dwivedi
- Department of Anatomy All India Institute of Medical Sciences - Patna Patna India
| | - Gopichand Kumar
- Department of Anatomy All India Institute of Medical Sciences - Patna Patna India
| | - Ravi K Narayan
- Department of Anatomy Dr B C Roy Multispeciality Medical Research Center Indian Institute of Technology-Kharagpur Kharagpur India
| | - Rakesh K Jha
- Department of Anatomy All India Institute of Medical Sciences - Patna Patna India
| | - Rakesh Parashar
- India Health Lead Oxford Policy Management Limited Oxford United Kingdom
| | - Chetan Sahni
- Department of Anatomy Institute of Medical Sciences Banaras Hindu University Varanasi India
| | - Sada N Pandey
- Department of Zoology Banaras Hindu University Varanasi India
| |
Collapse
|