1
Revisiting the recombinant history of HIV-1 group M with dynamic network community detection. Proc Natl Acad Sci U S A 2022; 119:e2108815119. [PMID: 35500121 PMCID: PMC9171507 DOI: 10.1073/pnas.2108815119]
Abstract
Recombination is a major mechanism through which HIV type 1 (HIV-1) maintains genetic diversity and interferes with viral eradication efforts. There is growing evidence demonstrating a recombinant origin of primate lentiviruses including HIV-1 group M (HIV-1/M). Inferring the extent of recombination across the entire HIV-1/M genome is of great importance as it provides deeper insights into the origin, dynamics, and evolution of the global pandemic. Here we propose an alternative method that can reconstruct the extent of genome-wide recombination in HIV-1, uncover reticulate patterns, and serve as a framework for HIV-1 classification. Our method provides an alternative approach for understanding the roles of virus recombination in the early evolutionary history of zoonosis for other emerging viruses.
The prevailing abundance of full-length HIV type 1 (HIV-1) genome sequences provides an opportunity to revisit the standard model of HIV-1 group M (HIV-1/M) diversity that clusters genomes into largely nonrecombinant subtypes, which is not consistent with recent evidence of deep recombinant histories for simian immunodeficiency virus (SIV) and other HIV-1 groups. Here we develop an unsupervised nonparametric clustering approach, which does not rely on predefined nonrecombinant genomes, by adapting a community detection method developed for dynamic social network analysis. We show that this method (dynamic stochastic block model [DSBM]) attains a significantly lower mean error rate in detecting recombinant breakpoints in simulated data (quasibinomial generalized linear model [GLM], P < 8×10⁻⁸), compared to other reference-free recombination detection programs (genetic algorithm for recombination detection [GARD], recombination detection program 4 [RDP4], and RDP5). When this method was applied to a representative sample of n = 525 actual HIV-1 genomes, we determined k = 29 as the optimal number of DSBM clusters and used change-point detection to estimate that at least 95% of these genomes are recombinant. Further, we identified both known and undocumented recombination hotspots in the HIV-1 genome and evidence of intersubtype recombination in HIV-1 subtype reference genomes. We propose that clusters generated by DSBM can provide an informative framework for HIV-1 classification.
2
Ohshima K, Kawakubo S, Muraoka S, Gao F, Ishimaru K, Kayashima T, Fukuda S. Genomic Epidemiology and Evolution of Scallion Mosaic Potyvirus From Asymptomatic Wild Japanese Garlic. Front Microbiol 2021; 12:789596. [PMID: 34956155 PMCID: PMC8692251 DOI: 10.3389/fmicb.2021.789596] [Received: 10/05/2021] [Accepted: 11/11/2021]
Abstract
Scallion mosaic virus (ScaMV) belongs to the turnip mosaic virus phylogenetic group of potyviruses and is known to infect domestic scallion plants (Allium chinense) in China and wild Japanese garlic (Allium macrostemon Bunge) in Japan. Wild Japanese garlic plants showing asymptomatic leaves were collected from different sites in Japan during 2012–2015. We found that 73 of the 277 collected wild Japanese garlic plants were infected with ScaMV, as identified from partial genomic nucleotide sequences of RT-PCR products amplified using potyvirus-specific primer pairs. Sixty-three ScaMV isolates were then chosen, and their full genomic sequences were determined. We carried out evolutionary analyses of the complete polyprotein-coding sequences and four non-recombinogenic regions of partial genomic sequences. We found that 80% of ScaMV samples have a recombination-like genome structure and identified 12 recombination-type patterns in the genomes of the Japanese ScaMV isolates. Furthermore, we found two non-recombinant-type patterns in the Japanese population. Because wild plants and weeds may often serve as reservoirs of viruses, it is important to carry out exploratory investigations of such viruses before they emerge in domestic plants. This is possibly the first epidemiological and evolutionary study of a virus from asymptomatic wild plants.
Affiliation(s)
- Kazusato Ohshima
  - Department of Biological Resource Science, Faculty of Agriculture, Saga University, Saga, Japan; Institute of Wild Onion Science, Saga University, Saga, Japan; The United Graduate School of Agricultural Sciences, Kagoshima University, Kagoshima, Japan
- Shusuke Kawakubo
  - Department of Biological Resource Science, Faculty of Agriculture, Saga University, Saga, Japan
- Satoshi Muraoka
  - Department of Biological Resource Science, Faculty of Agriculture, Saga University, Saga, Japan
- Fangluan Gao
  - Institute of Plant Virology, Fujian Agriculture and Forestry University, Fuzhou, China
- Kanji Ishimaru
  - Department of Biological Resource Science, Faculty of Agriculture, Saga University, Saga, Japan; Institute of Wild Onion Science, Saga University, Saga, Japan; The United Graduate School of Agricultural Sciences, Kagoshima University, Kagoshima, Japan
- Tomoko Kayashima
  - Institute of Wild Onion Science, Saga University, Saga, Japan; Department of School Education Course, Faculty of Education, Saga University, Saga, Japan
- Shinji Fukuda
  - Department of Biological Resource Science, Faculty of Agriculture, Saga University, Saga, Japan; Institute of Wild Onion Science, Saga University, Saga, Japan; The United Graduate School of Agricultural Sciences, Kagoshima University, Kagoshima, Japan; Saga University Center for Education and Research in Agricultural Innovation, Faculty of Agriculture, Saga University, Saga, Japan
3
Rubio-Garrido M, González-Alba JM, Reina G, Ndarabu A, Barquín D, Carlos S, Galán JC, Holguín Á. Current and historic HIV-1 molecular epidemiology in paediatric and adult population from Kinshasa in the Democratic Republic of Congo. Sci Rep 2020; 10:18461. [PMID: 33116151 PMCID: PMC7595211 DOI: 10.1038/s41598-020-74558-z] [Received: 06/05/2020] [Accepted: 09/30/2020]
Abstract
HIV-1 diversity may impact monitoring and vaccine development. We describe the most recent data on HIV-1 variants and their temporal trends in the Democratic Republic of Congo (DRC) from 1976 to 2018 and in Kinshasa from 1983 to 2018. HIV-1 pol sequencing from dried blood collected in Kinshasa during 2016-2018 was done in 340 HIV-infected children/adolescents/adults to identify HIV-1 variants by phylogenetic reconstructions. Recombination events and transmission clusters were also analyzed. Variant distribution and genetic diversity were compared to historical pol sequences from the DRC available in the Los Alamos National Laboratory (LANL) database. We characterized 165 HIV-1 pol variants circulating in Kinshasa (2016-2018) and compared them with 2641 LANL sequences from the DRC (1976-2012) and Kinshasa (1983-2008). During 2016-2018 the main subtypes were A (26.7%), G (9.7%) and C (7.3%). Recombinants accounted for a third of infections (12.7% circulating recombinant forms [CRF] and 23.6% unique recombinant forms [URF]). We identified the first CRF47_BF reported in Africa and four transmission clusters. A significant increase of subtype A and sub-subtype F1 and a significant reduction of sub-subtype A1 and subtype D were observed in Kinshasa during 2016-2018 compared to variants circulating in the city from 1983 to 2008. We provide unique and updated information on the HIV-1 variants currently circulating in Kinshasa, reporting the temporal trends of subtypes/CRF/URF over 43 years in the DRC, and providing the most extensive data on children/adolescents.
Affiliation(s)
- Marina Rubio-Garrido
  - HIV-1 Molecular Epidemiology Laboratory, Microbiology and Parasitology Department, Hospital Ramón y Cajal-IRYCIS and CIBEREsp-RITIP, 28034, Madrid, Spain
- José María González-Alba
  - Virology Section, Microbiology and Parasitology Department, Hospital Ramón y Cajal-IRYCIS and CIBEREsp, 28034, Madrid, Spain
- Gabriel Reina
  - Microbiology Department, Clínica Universidad de Navarra, Navarra Institute for Health Research (IdiSNA), Institute of Tropical Health, Universidad de Navarra (ISTUN), 31008, Pamplona, Spain
- Adolphe Ndarabu
  - Monkole Hospital, Kinshasa, Democratic Republic of the Congo
- David Barquín
  - Microbiology Department, Clínica Universidad de Navarra, Navarra Institute for Health Research (IdiSNA), Institute of Tropical Health, Universidad de Navarra (ISTUN), 31008, Pamplona, Spain
- Silvia Carlos
  - Department of Preventive Medicine and Public Health, Navarra Institute for Health Research (IdiSNA), Institute of Tropical Health, Universidad de Navarra (ISTUN), Pamplona, 31008, Spain
- Juan Carlos Galán
  - Virology Section, Microbiology and Parasitology Department, Hospital Ramón y Cajal-IRYCIS and CIBEREsp, 28034, Madrid, Spain
- África Holguín
  - HIV-1 Molecular Epidemiology Laboratory, Microbiology and Parasitology Department, Hospital Ramón y Cajal-IRYCIS and CIBEREsp-RITIP, 28034, Madrid, Spain
4
Ribeiro JMC, Mans BJ. TickSialoFam (TSFam): A Database That Helps to Classify Tick Salivary Proteins, a Review on Tick Salivary Protein Function and Evolution, With Considerations on the Tick Sialome Switching Phenomenon. Front Cell Infect Microbiol 2020; 10:374. [PMID: 32850476 PMCID: PMC7396615 DOI: 10.3389/fcimb.2020.00374] [Received: 03/28/2020] [Accepted: 06/17/2020]
Abstract
Tick saliva contains a complex mixture of peptides and non-peptides that counteract their hosts' hemostasis, immunity, and tissue-repair reactions. Recent transcriptomic studies have revealed over one thousand different transcripts coding for secreted polypeptides in a single tick species. Not only do these gene products belong to many expanded families, such as the lipocalins, metalloproteases, Antigen-5, cystatins, and apyrases, but also to families that are found exclusively in ticks, such as the evasins, Isac, DAP36, and many others. Phylogenetic analysis of the deduced protein sequences indicates that the salivary genes exhibit an increased rate of evolution due to a lower evolutionary constraint and/or positive selection, allowing for a large diversity of tick salivary proteins. Thus, for each new tick species that has its salivary transcriptome sequenced and assembled, a formidable task of annotation of these transcripts awaits. As of November 2019, there are over 287 thousand coding sequences deposited at the National Center for Biotechnology Information (NCBI) that are derived from tick salivary gland mRNA. Here, from these 287 thousand sequences we identified 45,264 potential secretory proteins which possess a signal peptide and no transmembrane domains on the mature peptide. By using the psiblast tools, position-specific matrices were constructed and assembled into the TickSialoFam (TSF) database. The TSF is an rpsblastable database that can help with the annotation of tick sialotranscriptomes. The TSA database identified 136 tick salivary secreted protein families, as well as 80 families of endosomal-related products, mostly having a protein modification function. As the number of sequences increases, and new annotation details become available, new releases of the TSF database may become available.
Affiliation(s)
- José M. C. Ribeiro
  - Section of Vector Biology, Laboratory of Malaria and Vector Research, National Institute of Allergy and Infectious Diseases, Rockville, MD, United States
- Ben J. Mans
  - Epidemiology, Parasites and Vectors, Agricultural Research Council - Onderstepoort Veterinary Research, Pretoria, South Africa
  - The Department of Veterinary Tropical Diseases, University of Pretoria, Pretoria, South Africa
  - Department of Life and Consumer Sciences, University of South Africa, Pretoria, South Africa
5
Grant HE, Hodcroft EB, Ssemwanga D, Kitayimbwa JM, Yebra G, Esquivel Gomez LR, Frampton D, Gall A, Kellam P, de Oliveira T, Bbosa N, Nsubuga RN, Kibengo F, Kwan TH, Lycett S, Kao R, Robertson DL, Ratmann O, Fraser C, Pillay D, Kaleebu P, Leigh Brown AJ. Pervasive and non-random recombination in near full-length HIV genomes from Uganda. Virus Evol 2020; 6:veaa004. [PMID: 32395255 PMCID: PMC7204518 DOI: 10.1093/ve/veaa004]
Abstract
Recombination is an important feature of HIV evolution, occurring both within and between the major branches of diversity (subtypes). The Ugandan epidemic is primarily composed of two subtypes, A1 and D, that have been co-circulating for 50 years, frequently recombining in dually infected patients. Here, we investigate the frequency of recombinants in this population and the location of breakpoints along the genome. As part of the PANGEA-HIV consortium, 1,472 consensus genome sequences over 5 kb have been obtained from 1,857 samples collected by the MRC/UVRI & LSHTM Research Unit in Uganda, 465 (31.6 per cent) of which were near full-length sequences (>8 kb). Using the subtyping tool SCUEAL, we find that of the near full-length dataset, 233 (50.1 per cent) genomes contained only one subtype, 30.8 per cent A1 (n = 143), 17.6 per cent D (n = 82), and 1.7 per cent C (n = 8), while 49.9 per cent (n = 232) contained more than one subtype (including A1/D (n = 164), A1/C (n = 13), C/D (n = 9), A1/C/D (n = 13), and 33 complex types). K-means clustering of the recombinant A1/D genomes revealed that a section of envelope (C2gp120-TMgp41) is often inherited intact, whilst a generalized linear model was used to demonstrate significantly fewer breakpoints in the gag-pol and envelope C2-TM regions compared with accessory gene regions. Despite similar recombination patterns in many recombinants, no clearly supported circulating recombinant form (CRF) was found, there was limited evidence of the transmission of breakpoints, and the vast majority (153/164; 93 per cent) of the A1/D recombinants appear to be unique recombinant forms. Thus, recombination is pervasive with clear biases in breakpoint location, but CRFs are not a significant feature, which is characteristic of a complex and diverse epidemic.
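The subtype counts quoted in this abstract can be cross-checked with a few lines of arithmetic. The sketch below is only an illustrative consistency check written for this listing, not part of the authors' analysis; the variable names and the `pct` helper are assumptions introduced here, and only the counts themselves come from the abstract.

```python
# Counts copied from the abstract; all names below are illustrative.
TOTAL_CONSENSUS = 1472      # consensus genomes over 5 kb
NEAR_FULL_LENGTH = 465      # near full-length genomes (>8 kb)

# Genomes assigned a single subtype.
single_subtype = {"A1": 143, "D": 82, "C": 8}

# Genomes containing more than one subtype.
recombinant = {"A1/D": 164, "A1/C": 13, "C/D": 9, "A1/C/D": 13, "complex": 33}


def pct(count: int, total: int) -> float:
    """Return count as a percentage of total, rounded to one decimal place."""
    return round(100 * count / total, 1)


n_single = sum(single_subtype.values())
n_recomb = sum(recombinant.values())

print("single-subtype genomes:", n_single, f"({pct(n_single, NEAR_FULL_LENGTH)} per cent)")
print("recombinant genomes:", n_recomb, f"({pct(n_recomb, NEAR_FULL_LENGTH)} per cent)")
for name, count in single_subtype.items():
    print(f"  {name}: {pct(count, NEAR_FULL_LENGTH)} per cent")
print("near full-length share of consensus genomes:",
      pct(NEAR_FULL_LENGTH, TOTAL_CONSENSUS), "per cent")
```

Under these counts the two categories add up to the 465 near full-length genomes, and each rounded percentage matches the figure given in the abstract.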
Affiliation(s)
- Heather E Grant
  - Institute of Evolutionary Biology, University of Edinburgh, Edinburgh, UK
- Emma B Hodcroft
  - Biozentrum, University of Basel, Basel, Switzerland
  - Swiss Institute of Bioinformatics, Basel, Switzerland
- Deogratius Ssemwanga
  - Medical Research Council (MRC)/Uganda Virus Research Institute (UVRI) and London School of Hygiene and Tropical Medicine (LSHTM) Uganda Research Unit, Entebbe, Uganda
  - Uganda Virus Research Institute, Entebbe, Uganda
- Gonzalo Yebra
  - The Roslin Institute, University of Edinburgh, Edinburgh, UK
- Dan Frampton
  - Division of Infection and Immunity, University College London, London, UK
- Astrid Gall
  - European Molecular Biology Laboratory-European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, UK
- Paul Kellam
  - European Molecular Biology Laboratory-European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, UK
- Tulio de Oliveira
  - Nelson R. Mandela School of Medicine, Africa Health Research Institute, Durban, South Africa
- Nicholas Bbosa
  - Medical Research Council (MRC)/Uganda Virus Research Institute (UVRI) and London School of Hygiene and Tropical Medicine (LSHTM) Uganda Research Unit, Entebbe, Uganda
- Rebecca N Nsubuga
  - Medical Research Council (MRC)/Uganda Virus Research Institute (UVRI) and London School of Hygiene and Tropical Medicine (LSHTM) Uganda Research Unit, Entebbe, Uganda
- Freddie Kibengo
  - Medical Research Council (MRC)/Uganda Virus Research Institute (UVRI) and London School of Hygiene and Tropical Medicine (LSHTM) Uganda Research Unit, Entebbe, Uganda
- Tsz Ho Kwan
  - Stanley Ho Centre for Emerging Infectious Diseases, The Chinese University of Hong Kong, Shatin, Hong Kong
- Samantha Lycett
  - The Roslin Institute, University of Edinburgh, Edinburgh, UK
- Rowland Kao
  - The Roslin Institute, University of Edinburgh, Edinburgh, UK
- Oliver Ratmann
  - Department of Mathematics, Imperial College London, London, UK
- Christophe Fraser
  - Nuffield Department of Medicine, Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford, UK
- Deenan Pillay
  - European Molecular Biology Laboratory-European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, UK
  - Nelson R. Mandela School of Medicine, Africa Health Research Institute, Durban, South Africa
- Pontiano Kaleebu
  - Medical Research Council (MRC)/Uganda Virus Research Institute (UVRI) and London School of Hygiene and Tropical Medicine (LSHTM) Uganda Research Unit, Entebbe, Uganda
  - Uganda Virus Research Institute, Entebbe, Uganda
6
Palmer J, Poon AFY. Phylogenetic measures of indel rate variation among the HIV-1 group M subtypes. Virus Evol 2019; 5:vez022. [PMID: 31341641 PMCID: PMC6642732 DOI: 10.1093/ve/vez022]
Abstract
The transmission fitness and pathogenesis of HIV-1 are disproportionately influenced by evolution in the five variable regions (V1–V5) of the surface envelope glycoprotein (gp120). Insertions and deletions (indels) are a significant source of evolutionary change in these regions. However, the rate and composition of indels have not yet been quantified through a large-scale comparative analysis of HIV-1 sequences. Here, we develop and report results from a phylogenetic method to estimate indel rates for the gp120 variable regions across five major subtypes and two circulating recombinant forms (CRFs) of HIV-1 group M. We processed over 26,000 published HIV-1 gp120 sequences, from which we extracted 6,605 sequences for phylogenetic analysis. We reconstructed time-scaled phylogenies by maximum likelihood and fit a binomial-Poisson model to the observed distribution of indels between closely related pairs of sequences in each tree (cherries). By focusing on cherries in each tree, we obtained phylogenetically independent indel reconstructions, and the shorter time scales in cherries reduced the bias due to purifying selection. Rate estimates ranged from 3.0×10⁻⁵ to 1.5×10⁻³ indels/nt/year and varied significantly among variable regions and subtypes. Indel rates were significantly lower in V3 relative to V1, and were also lower in HIV-1 subtype B relative to the 01_AE reference. We also found that V1, V2, and V4 tended to accumulate significantly longer indels. Furthermore, we observed that the nucleotide composition of indels was distinct from the flanking sequence, with higher frequencies of G and lower frequencies of T. Indels affected N-linked glycosylation sites more often in V1 and V2 than expected by chance, consistent with positive selection on glycosylation patterns within these regions. These results represent the first comprehensive measures of indel rates in HIV-1 gp120 across multiple subtypes and CRFs, and identify novel and unexpected patterns for further research in the molecular evolution of HIV-1.
Affiliation(s)
- John Palmer
  - Department of Pathology & Laboratory Medicine, Western University, London, Canada
- Art F Y Poon
  - Department of Pathology & Laboratory Medicine, Western University, London, Canada; Department of Applied Mathematics, Western University, London, Canada; Department of Microbiology & Immunology, Western University, London, Canada