1
|
Li X, Trovão NS, Wertheim JO, Baele G, de Bernardi Schneider A. Optimizing ancestral trait reconstruction of large HIV Subtype C datasets through multiple-trait subsampling. Virus Evol 2023; 9:vead069. [PMID: 38046219 PMCID: PMC10691791 DOI: 10.1093/ve/vead069] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2023] [Revised: 10/29/2023] [Accepted: 11/20/2023] [Indexed: 12/05/2023] Open
Abstract
Large datasets along with sampling bias represent a challenge for phylodynamic reconstructions, particularly when the study data are obtained from various heterogeneous sources and/or through convenience sampling. In this study, we evaluate the presence of unbalanced sampled distribution by collection date, location, and risk group of human immunodeficiency virus Type 1 Subtype C using a comprehensive subsampling strategy and assess their impact on the reconstruction of the viral spatial and risk group dynamics using phylogenetic comparative methods. Our study shows that a most suitable dataset for ancestral trait reconstruction can be obtained through subsampling by all available traits, particularly using multigene datasets. We also demonstrate that sampling bias is inflated when considerable information for a given trait is unavailable or of poor quality, as we observed for the trait risk group. In conclusion, we suggest that, even if traits are not well recorded, including them deliberately optimizes the representativeness of the original dataset rather than completely excluding them. Therefore, we advise the inclusion of as many traits as possible with the aid of subsampling approaches in order to optimize the dataset for phylodynamic analysis while reducing the computational burden. This will benefit research communities investigating the evolutionary and spatio-temporal patterns of infectious diseases.
Collapse
Affiliation(s)
| | - Nídia S Trovão
- Division of International Epidemiology and Population Studies, Fogarty International Center, National Institutes of Health, 31 Center Dr, Bethesda, MA 20892, USA
| | - Joel O Wertheim
- Department of Medicine, University of California, La Jolla, San Diego, CA 92093, USA
| | - Guy Baele
- Department of Microbiology, Immunology and Transplantation, Rega Institute, KU Leuven, Leuven BE-3000, Belgium
| | - Adriano de Bernardi Schneider
- Genomics Institute, University of California Santa Cruz, Santa Cruz, CA 95064, USA
- Ningbo No.2 Hospital, Ningbo 315010, China
- Ningbo Institute of Life and Health Industry, University of Chinese Academy of Sciences, Ningbo 315000, China
| |
Collapse
|
2
|
Lee DW, Kim JM, Park AK, Kim DW, Kim JY, Lim N, Lee H, Kim IH, Kim JA, Lee CY, Kwon JH, Kim EJ. Genomic epidemiology of SARS- CoV-2 Omicron variants in the Republic of Korea. Sci Rep 2022; 12:22414. [PMID: 36575217 PMCID: PMC9793390 DOI: 10.1038/s41598-022-26803-w] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2022] [Accepted: 12/20/2022] [Indexed: 12/28/2022] Open
Abstract
The outbreak of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has caused a global pandemic since 2019. Variants of concern (VOCs) declared by the World Health Organization require continuous monitoring because of their possible changes in transmissibility, virulence, and antigenicity. The Omicron variant, a VOC, has become the dominant variant worldwide since November 2021. In the Republic of Korea (South Korea), the number of confirmed cases increased rapidly after the detection of Omicron VOC on November 24, 2021. In this study, we estimated the underlying epidemiological processes of Omicron VOC in South Korea using time-scaled phylodynamic analysis. Three distinct phylogenetic subgroups (Kor-O1, Kor-O2, and Kor-O3) were detected in South Korea. The Kor-O1 subgroup circulated in the Daegu region, whereas Kor-O2 and Kor-O3 circulated in Incheon and Jeollanam-do, respectively. The viral population size and case number of the Kor-O1 subgroup increased more rapidly than those of the other subgroups, indicating the rapid spread of the virus. The results indicated the multiple introductions of Omicron sub-lineages into South Korea and their subsequent co-circulation. The evolution and transmission of SARS-CoV-2 should be continuously monitored, and control strategies need to be improved to control the multiple variants.
Collapse
Affiliation(s)
- Dong-Wook Lee
- grid.258803.40000 0001 0661 1556College of Veterinary Medicine, Kyungpook National University, Daegu, 41566 Republic of Korea
| | - Jeong-Min Kim
- grid.418967.50000 0004 1763 8617Division of Emerging Infectious Diseases, Bureau of Infectious Disease Diagnosis Control, Korea Disease Control and Prevention Agency, Cheongju-Si, 28159 Republic of Korea
| | - Ae Kyung Park
- grid.418967.50000 0004 1763 8617Division of Emerging Infectious Diseases, Bureau of Infectious Disease Diagnosis Control, Korea Disease Control and Prevention Agency, Cheongju-Si, 28159 Republic of Korea
| | - Da-Won Kim
- grid.258803.40000 0001 0661 1556College of Veterinary Medicine, Kyungpook National University, Daegu, 41566 Republic of Korea
| | - Ji-Yun Kim
- grid.258803.40000 0001 0661 1556College of Veterinary Medicine, Kyungpook National University, Daegu, 41566 Republic of Korea
| | - Noori Lim
- grid.258803.40000 0001 0661 1556College of Veterinary Medicine, Kyungpook National University, Daegu, 41566 Republic of Korea
| | - Hyeokjin Lee
- grid.418967.50000 0004 1763 8617Division of Emerging Infectious Diseases, Bureau of Infectious Disease Diagnosis Control, Korea Disease Control and Prevention Agency, Cheongju-Si, 28159 Republic of Korea
| | - Il-Hwan Kim
- grid.418967.50000 0004 1763 8617Division of Emerging Infectious Diseases, Bureau of Infectious Disease Diagnosis Control, Korea Disease Control and Prevention Agency, Cheongju-Si, 28159 Republic of Korea
| | - Jeong-Ah Kim
- grid.418967.50000 0004 1763 8617Division of Emerging Infectious Diseases, Bureau of Infectious Disease Diagnosis Control, Korea Disease Control and Prevention Agency, Cheongju-Si, 28159 Republic of Korea
| | - Chae young Lee
- grid.418967.50000 0004 1763 8617Division of Emerging Infectious Diseases, Bureau of Infectious Disease Diagnosis Control, Korea Disease Control and Prevention Agency, Cheongju-Si, 28159 Republic of Korea
| | - Jung-Hoon Kwon
- grid.258803.40000 0001 0661 1556College of Veterinary Medicine, Kyungpook National University, Daegu, 41566 Republic of Korea
| | - Eun-Jin Kim
- grid.418967.50000 0004 1763 8617Division of Emerging Infectious Diseases, Bureau of Infectious Disease Diagnosis Control, Korea Disease Control and Prevention Agency, Cheongju-Si, 28159 Republic of Korea
| |
Collapse
|
3
|
Otieno JR, Cherry JL, Spiro DJ, Nelson MI, Trovão NS. Origins and Evolution of Seasonal Human Coronaviruses. Viruses 2022; 14:1551. [PMID: 35891531 PMCID: PMC9320361 DOI: 10.3390/v14071551] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2022] [Revised: 07/11/2022] [Accepted: 07/11/2022] [Indexed: 11/16/2022] Open
Abstract
Four seasonal human coronaviruses (sHCoVs) are endemic globally (229E, NL63, OC43, and HKU1), accounting for 5-30% of human respiratory infections. However, the epidemiology and evolution of these CoVs remain understudied due to their association with mild symptomatology. Using a multigene and complete genome analysis approach, we find the evolutionary histories of sHCoVs to be highly complex, owing to frequent recombination of CoVs including within and between sHCoVs, and uncertain, due to the under sampling of non-human viruses. The recombination rate was highest for 229E and OC43 whereas substitutions per recombination event were highest in NL63 and HKU1. Depending on the gene studied, OC43 may have ungulate, canine, or rabbit CoV ancestors. 229E may have origins in a bat, camel, or an unsampled intermediate host. HKU1 had the earliest common ancestor (1809-1899) but fell into two distinct clades (genotypes A and B), possibly representing two independent transmission events from murine-origin CoVs that appear to be a single introduction due to large gaps in the sampling of CoVs in animals. In fact, genotype B was genetically more diverse than all the other sHCoVs. Finally, we found shared amino acid substitutions in multiple proteins along the non-human to sHCoV host-jump branches. The complex evolution of CoVs and their frequent host switches could benefit from continued surveillance of CoVs across non-human hosts.
Collapse
Affiliation(s)
- James R. Otieno
- Division of International Epidemiology and Population Studies, Fogarty International Center, National Institutes of Health, Bethesda, MD 20892, USA; (J.L.C.); (D.J.S.); (M.I.N.)
| | - Joshua L. Cherry
- Division of International Epidemiology and Population Studies, Fogarty International Center, National Institutes of Health, Bethesda, MD 20892, USA; (J.L.C.); (D.J.S.); (M.I.N.)
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - David J. Spiro
- Division of International Epidemiology and Population Studies, Fogarty International Center, National Institutes of Health, Bethesda, MD 20892, USA; (J.L.C.); (D.J.S.); (M.I.N.)
| | - Martha I. Nelson
- Division of International Epidemiology and Population Studies, Fogarty International Center, National Institutes of Health, Bethesda, MD 20892, USA; (J.L.C.); (D.J.S.); (M.I.N.)
| | - Nídia S. Trovão
- Division of International Epidemiology and Population Studies, Fogarty International Center, National Institutes of Health, Bethesda, MD 20892, USA; (J.L.C.); (D.J.S.); (M.I.N.)
| |
Collapse
|
4
|
Deviatkin AA, Vakulenko YA, Dashian MA, Lukashev AN. Evaluating the Impact of Anthropogenic Factors on the Dissemination of Contemporary Cosmopolitan, Arctic, and Arctic-like Rabies Viruses. Viruses 2021; 14:66. [PMID: 35062270 PMCID: PMC8777955 DOI: 10.3390/v14010066] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2021] [Accepted: 12/29/2021] [Indexed: 11/16/2022] Open
Abstract
Rabies is a globally prevalent viral zoonosis that causes 59,000 deaths per year and has important economic consequences. Most virus spread is associated with the migration of its primary hosts. Anthropogenic dissemination, mainly via the transportation of rabid dogs, shaped virus ecology a few hundred years ago and is responsible for several current outbreaks. A systematic analysis of aberrant long-distance events in the steppe and Arctic-like groups of rabies virus was performed using statistical (Bayesian) phylogeography and plots of genetic vs. geographic distances. The two approaches produced similar results but had some significant differences and complemented each other. No phylogeographic analysis could be performed for the Arctic group because polar foxes transfer the virus across the whole circumpolar region at high velocity, and there was no correlation between genetic and geographic distances in this virus group. In the Arctic-like group and the steppe subgroup of the cosmopolitan group, a significant number of known sequences (15-20%) was associated with rapid long-distance transfers, which mainly occurred within Eurasia. Some of these events have been described previously, while others have not been documented. Most of the recent long-distance transfers apparently did not result in establishing the introduced virus, but a few had important implications for the phylogeographic history of rabies. Thus, human-mediated long-distance transmission of the rabies virus remains a significant threat that needs to be addressed.
Collapse
Affiliation(s)
- Andrei A. Deviatkin
- Laboratory of Molecular Biology and Biochemistry, Institute of Molecular Medicine, Sechenov First Moscow State Medical University, 119435 Moscow, Russia
- The National Medical Research Center for Endocrinology, 117036 Moscow, Russia
| | - Yulia A. Vakulenko
- Martsinovsky Institute of Medical Parasitology, Tropical and Vector Borne Diseases, Sechenov First Moscow State Medical University, 119435 Moscow, Russia; (Y.A.V.); (A.N.L.)
- Department of Virology, Faculty of Biology, Lomonosov Moscow State University, 119234 Moscow, Russia
| | - Mariia A. Dashian
- Faculty of Biomedicine, Pirogov Medical University, 117997 Moscow, Russia;
| | - Alexander N. Lukashev
- Martsinovsky Institute of Medical Parasitology, Tropical and Vector Borne Diseases, Sechenov First Moscow State Medical University, 119435 Moscow, Russia; (Y.A.V.); (A.N.L.)
| |
Collapse
|
5
|
Wang Y, Zhao Y, Pan Q. Advances, challenges and opportunities of phylogenetic and social network analysis using COVID-19 data. Brief Bioinform 2021; 23:6380452. [PMID: 34601563 DOI: 10.1093/bib/bbab406] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2021] [Revised: 08/04/2021] [Accepted: 09/03/2021] [Indexed: 11/15/2022] Open
Abstract
Coronavirus disease 2019 (COVID-19) has attracted research interests from all fields. Phylogenetic and social network analyses based on connectivity between either COVID-19 patients or geographic regions and similarity between syndrome coronavirus 2 (SARS-CoV-2) sequences provide unique angles to answer public health and pharmaco-biological questions such as relationships between various SARS-CoV-2 mutants, the transmission pathways in a community and the effectiveness of prevention policies. This paper serves as a systematic review of current phylogenetic and social network analyses with applications in COVID-19 research. Challenges in current phylogenetic network analysis on SARS-CoV-2 such as unreliable inferences, sampling bias and batch effects are discussed as well as potential solutions. Social network analysis combined with epidemiology models helps to identify key transmission characteristics and measure the effectiveness of prevention and control strategies. Finally, future new directions of network analysis motivated by COVID-19 data are summarized.
Collapse
Affiliation(s)
- Yue Wang
- School of Mathematical and Natural Science, Arizona State University, 4701 W Thunderbird Rd, 85306, Arizona, USA
| | - Yunpeng Zhao
- School of Mathematical and Natural Science, Arizona State University, 4701 W Thunderbird Rd, 85306, Arizona, USA
| | - Qing Pan
- Department of Statistics, George Washington University, 801 22nd St. NW, 20052, Washington DC, USA
| |
Collapse
|
6
|
Deviatkin AA, Kholodilov IS, Vakulenko YA, Karganova GG, Lukashev AN. Tick-Borne Encephalitis Virus: An Emerging Ancient Zoonosis? Viruses 2020; 12:v12020247. [PMID: 32102228 PMCID: PMC7077300 DOI: 10.3390/v12020247] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2019] [Revised: 02/10/2020] [Accepted: 02/19/2020] [Indexed: 12/19/2022] Open
Abstract
Tick-borne encephalitis (TBE) is one of the most important viral zoonosis transmitted by the bite of infected ticks. In this study, all tick-borne encephalitis virus (TBEV) E gene sequences available in GenBank as of June 2019 with known date of isolation (n = 551) were analyzed. Simulation studies showed that a sample bias could significantly affect earlier studies, because small TBEV datasets (n = 50) produced non-overlapping intervals for evolutionary rate estimates. An apparent lack of a temporal signal in TBEV, in general, was found, precluding molecular clock analysis of all TBEV subtypes in one dataset. Within all subtypes and most of the smaller groups in these subtypes, there was evidence of many medium- and long-distance virus transfers. These multiple random events may play a key role in the virus spreading. For some groups, virus diversity within one territory was similar to diversity over the whole geographic range. This is best exemplified by the virus diversity observed in Switzerland or Czech Republic. These two countries yielded most of the known European subtype Eu3 subgroup sequences, and the diversity of viruses found within each of these small countries is comparable to that of the whole Eu3 subgroup, which is prevalent all over Central and Eastern Europe. Most of the deep tree nodes within all three established TBEV subtypes dated less than 300 years back. This could be explained by the recent emergence of most of the known TBEV diversity. Results of bioinformatics analysis presented here, together with multiple field findings, suggest that TBEV may be regarded as an emerging disease.
Collapse
Affiliation(s)
- Andrei A. Deviatkin
- Laboratory of Molecular Biology and Biochemistry, Institute of Molecular Medicine, Sechenov First Moscow State Medical University, 119048 Moscow, Russia;
- Laboratory of Postgenomic Technologies, Izmerov Research Institute of Occupational Health, 105275 Moscow, Russia
- Correspondence: ; Tel.: +7-906-739-0860
| | - Ivan S. Kholodilov
- Laboratory of Biology of Arboviruses, Chumakov Institute of Poliomyelitis and Viral Encephalitides (FSBSI “Chumakov FSC R&D IBP RAS), 108819 Moscow, Russia; (I.S.K.); (G.G.K.)
| | - Yulia A. Vakulenko
- Martsinovsky Institute of Medical Parasitology, Tropical and Vector Borne Diseases, Sechenov First Moscow State Medical University, 119435 Moscow, Russia;
- Department of Virology, Faculty of Biology, Lomonosov Moscow State University, 119234 Moscow, Russia
| | - Galina G. Karganova
- Laboratory of Biology of Arboviruses, Chumakov Institute of Poliomyelitis and Viral Encephalitides (FSBSI “Chumakov FSC R&D IBP RAS), 108819 Moscow, Russia; (I.S.K.); (G.G.K.)
- Department of Organization and Technology of Immunobiological Preparations, Institute for Translational Medicine and Biotechnology, Sechenov First Moscow State Medical University, 119991 Moscow, Russia
| | - Alexander N. Lukashev
- Laboratory of Molecular Biology and Biochemistry, Institute of Molecular Medicine, Sechenov First Moscow State Medical University, 119048 Moscow, Russia;
- Martsinovsky Institute of Medical Parasitology, Tropical and Vector Borne Diseases, Sechenov First Moscow State Medical University, 119435 Moscow, Russia;
| |
Collapse
|
7
|
Susi P. Special Issue "Human Picornaviruses". Viruses 2020; 12:v12010093. [PMID: 31941146 PMCID: PMC7019337 DOI: 10.3390/v12010093] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2020] [Accepted: 01/10/2020] [Indexed: 12/02/2022] Open
Affiliation(s)
- Petri Susi
- Institute of Biomedicine, University of Turku, 20520 Turku, Finland
| |
Collapse
|
8
|
Vakulenko Y, Deviatkin A, Lukashev A. Using Statistical Phylogenetics for Investigation of Enterovirus 71 Genotype A Reintroduction into Circulation. Viruses 2019; 11:E895. [PMID: 31557961 PMCID: PMC6832606 DOI: 10.3390/v11100895] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2019] [Revised: 09/19/2019] [Accepted: 09/23/2019] [Indexed: 02/08/2023] Open
Abstract
Neurovirulent enterovirus 71 (EV-A71) caused a massive epidemic in China in 2008-2011. While subgenotype C4 was the major causative agent, a few isolates were almost identical to the prototype EV-A71 strain and belonged to genotype A. This variant was allegedly extinct since 1970, and its identification in this epidemic suggests reintroduction of the archive virus. Regression analysis of genetic distances (TempEst software) was of moderate utility due to the low resolution of classical phylogenetic methods. Bayesian phylogenetic analysis (BEAST software) suggested artificial introduction event based on highly aberrant phylogenetic tree branch rates that differed by over three standard deviations from the mean substitution rate for EV71. Manual nucleotide-level analysis was used to further explore the virus spread pattern after introduction into circulation. Upon reintroduction, the virus accumulated up to seven substitutions in VP1, most of them non-synonymous and located within the capsid's canyon or at its rims, compatible with readaptation of a lab strain to natural circulation.
Collapse
Affiliation(s)
- Yulia Vakulenko
- Sechenov First Moscow State Medical University, Martsinovsky Institute of Medical Parasitology, Tropical and Vector Borne Diseases, 119435 Moscow, Russia.
- Faculty of Biology, Lomonosov Moscow State University, 119234 Moscow, Russia.
| | - Andrei Deviatkin
- Sechenov First Moscow State Medical University, Institute of Molecular Medicine, 119048 Moscow, Russia.
| | - Alexander Lukashev
- Sechenov First Moscow State Medical University, Martsinovsky Institute of Medical Parasitology, Tropical and Vector Borne Diseases, 119435 Moscow, Russia.
- Sechenov First Moscow State Medical University, Institute of Molecular Medicine, 119048 Moscow, Russia.
| |
Collapse
|