1
|
Bu F, Kagaayi J, Grabowski MK, Ratmann O, Xu J. Inferring HIV transmission patterns from viral deep-sequence data via latent typed point processes. Biometrics 2024; 80:ujad015. [PMID: 38372402 PMCID: PMC10875513 DOI: 10.1093/biomtc/ujad015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2023] [Revised: 10/09/2023] [Accepted: 11/27/2023] [Indexed: 02/20/2024]
Abstract
Viral deep-sequencing data play a crucial role toward understanding disease transmission network flows, providing higher resolution compared to standard Sanger sequencing. To more fully utilize these rich data and account for the uncertainties in outcomes from phylogenetic analyses, we propose a spatial Poisson process model to uncover human immunodeficiency virus (HIV) transmission flow patterns at the population level. We represent pairings of individuals with viral sequence data as typed points, with coordinates representing covariates such as gender and age and point types representing the unobserved transmission statuses (linkage and direction). Points are associated with observed scores on the strength of evidence for each transmission status that are obtained through standard deep-sequence phylogenetic analysis. Our method is able to jointly infer the latent transmission statuses for all pairings and the transmission flow surface on the source-recipient covariate space. In contrast to existing methods, our framework does not require preclassification of the transmission statuses of data points, and instead learns them probabilistically through a fully Bayesian inference scheme. By directly modeling continuous spatial processes with smooth densities, our method enjoys significant computational advantages compared to previous methods that rely on discretization of the covariate space. We demonstrate that our framework can capture age structures in HIV transmission at high resolution, bringing valuable insights in a case study on viral deep-sequencing data from Southern Uganda.
Collapse
Affiliation(s)
- Fan Bu
- Department of Biostatistics, University of California - Los Angeles, Los Angeles, CA 90024, United States
- Department of Biostatistics, University of Michigan, Ann Arbor, MI 48109, United States
| | - Joseph Kagaayi
- School of Public Health, Makerere University, Kampala, Uganda
| | - Mary Kate Grabowski
- School of Medicine, Johns Hopkins University, Baltimore, MD 21218, United States
| | - Oliver Ratmann
- Department of Mathematics, Imperial College London, London SW7 2AZ, United Kingdom
| | - Jason Xu
- Department of Statistical Science, Duke University, Durham, NC 27708, United States
| |
Collapse
|
2
|
Hall M, Golubchik T, Bonsall D, Abeler-Dörner L, Limbada M, Kosloff B, Schaap A, de Cesare M, MacIntyre-Cockett G, Otecko N, Probert W, Ratmann O, Bulas Cruz A, Piwowar-Manning E, Burns DN, Cohen MS, Donnell DJ, Eshleman SH, Simwinga M, Fidler S, Hayes R, Ayles H, Fraser C. Demographics of sources of HIV-1 transmission in Zambia: a molecular epidemiology analysis in the HPTN 071 PopART study. THE LANCET. MICROBE 2024; 5:e62-e71. [PMID: 38081203 PMCID: PMC10789608 DOI: 10.1016/s2666-5247(23)00220-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/11/2023] [Revised: 07/07/2023] [Accepted: 07/14/2023] [Indexed: 01/19/2024]
Abstract
BACKGROUND In the last decade, universally available antiretroviral therapy (ART) has led to greatly improved health and survival of people living with HIV in sub-Saharan Africa, but new infections continue to appear. The design of effective prevention strategies requires the demographic characterisation of individuals acting as sources of infection, which is the aim of this study. METHODS Between 2014 and 2018, the HPTN 071 PopART study was conducted to quantify the public health benefits of ART. Viral samples from 7124 study participants in Zambia were deep-sequenced as part of HPTN 071-02 PopART Phylogenetics, an ancillary study. We used these sequences to identify likely transmission pairs. After demographic weighting of the recipients in these pairs to match the overall HIV-positive population, we analysed the demographic characteristics of the sources to better understand transmission in the general population. FINDINGS We identified a total of 300 likely transmission pairs. 178 (59·4%) were male to female, with 130 (95% CI 110-150; 43·3%) from males aged 25-40 years. Overall, men transmitted 2·09-fold (2·06-2·29) more infections per capita than women, a ratio peaking at 5·87 (2·78-15·8) in the 35-39 years source age group. 40 (26-57; 13·2%) transmissions linked individuals from different communities in the trial. Of 288 sources with recorded information on drug resistance mutations, 52 (38-69; 18·1%) carried viruses resistant to first-line ART. INTERPRETATION HIV-1 transmission in the HPTN 071 study communities comes from a wide range of age and sex groups, and there is no outsized contribution to new infections from importation or drug resistance mutations. Men aged 25-39 years, underserved by current treatment and prevention services, should be prioritised for HIV testing and ART. FUNDING National Institute of Allergy and Infectious Diseases, US President's Emergency Plan for AIDS Relief, International Initiative for Impact Evaluation, Bill & Melinda Gates Foundation, National Institute on Drug Abuse, and National Institute of Mental Health.
Collapse
Affiliation(s)
- Matthew Hall
- Pandemic Sciences Institute and Big Data Institute, Nuffield Department of Medicine, University of Oxford, Oxford, UK
| | - Tanya Golubchik
- Pandemic Sciences Institute and Big Data Institute, Nuffield Department of Medicine, University of Oxford, Oxford, UK; Sydney Infectious Diseases Institute, School of Medical Sciences, Faculty of Medicine and Health, University of Sydney, Sydney, NSW, Australia
| | - David Bonsall
- Pandemic Sciences Institute and Big Data Institute, Nuffield Department of Medicine, University of Oxford, Oxford, UK; Wellcome Centre for Human Genetics, Nuffield Department of Medicine, University of Oxford, Oxford, UK
| | - Lucie Abeler-Dörner
- Pandemic Sciences Institute and Big Data Institute, Nuffield Department of Medicine, University of Oxford, Oxford, UK
| | | | - Barry Kosloff
- Zambart, University of Zambia, Lusaka, Zambia; Department of Clinical Research, London School of Hygiene & Tropical Medicine, London, UK
| | - Ab Schaap
- Zambart, University of Zambia, Lusaka, Zambia
| | - Mariateresa de Cesare
- Pandemic Sciences Institute and Big Data Institute, Nuffield Department of Medicine, University of Oxford, Oxford, UK; Wellcome Centre for Human Genetics, Nuffield Department of Medicine, University of Oxford, Oxford, UK
| | - George MacIntyre-Cockett
- Pandemic Sciences Institute and Big Data Institute, Nuffield Department of Medicine, University of Oxford, Oxford, UK; Wellcome Centre for Human Genetics, Nuffield Department of Medicine, University of Oxford, Oxford, UK
| | - Newton Otecko
- Pandemic Sciences Institute and Big Data Institute, Nuffield Department of Medicine, University of Oxford, Oxford, UK
| | - William Probert
- Pandemic Sciences Institute and Big Data Institute, Nuffield Department of Medicine, University of Oxford, Oxford, UK
| | - Oliver Ratmann
- Department of Mathematics, Imperial College London, London, UK
| | - Ana Bulas Cruz
- Pandemic Sciences Institute and Big Data Institute, Nuffield Department of Medicine, University of Oxford, Oxford, UK
| | | | - David N Burns
- Division of AIDS, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Rockville, MD, USA
| | - Myron S Cohen
- Department of Medicine, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | | | - Susan H Eshleman
- Department of Pathology, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | | | - Sarah Fidler
- Department of Infectious Disease Epidemiology, Imperial College London, London, UK
| | - Richard Hayes
- Department of Infectious Disease Epidemiology, London School of Hygiene & Tropical Medicine, London, UK
| | - Helen Ayles
- Zambart, University of Zambia, Lusaka, Zambia; Department of Clinical Research, London School of Hygiene & Tropical Medicine, London, UK
| | - Christophe Fraser
- Pandemic Sciences Institute and Big Data Institute, Nuffield Department of Medicine, University of Oxford, Oxford, UK.
| |
Collapse
|
3
|
A large population sample of African HIV genomes from the 1980s reveals a reduction in subtype D over time associated with propensity for CXCR4 tropism. Retrovirology 2022; 19:28. [PMID: 36514107 PMCID: PMC9746199 DOI: 10.1186/s12977-022-00612-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2022] [Accepted: 11/12/2022] [Indexed: 12/15/2022] Open
Abstract
We present 109 near full-length HIV genomes amplified from blood serum samples obtained during early 1986 from across Uganda, which to our knowledge is the earliest and largest population sample from the initial phase of the HIV epidemic in Africa. Consensus sequences were made from paired-end Illumina reads with a target-capture approach to amplify HIV material following poor success with standard approaches. In comparisons with a smaller 'intermediate' genome dataset from 1998 to 1999 and a 'modern' genome dataset from 2007 to 2016, the proportion of subtype D was significantly higher initially, dropping from 67% (73/109), to 57% (26/46) to 17% (82/465) respectively (p < 0.0001). Subtype D has previously been shown to have a faster rate of disease progression than other subtypes in East African population studies, and to have a higher propensity to use the CXCR4 co-receptor ("X4 tropism"); associated with a decrease in time to AIDS. Here we find significant differences in predicted tropism between A1 and D subtypes in all three sample periods considered, which is particularly striking the 1986 sample: 66% (53/80) of subtype D env sequences were predicted to be X4 tropic compared with none of the 24 subtype A1. We also analysed the frequency of subtype in the envelope region of inter-subtype recombinants, and found that subtype A1 is over-represented in env, suggesting recombination and selection have acted to remove subtype D env from circulation. The reduction of subtype D frequency over three decades therefore appears to be a result of selective pressure against X4 tropism and its higher virulence. Lastly, we find a subtype D specific codon deletion at position 24 of the V3 loop, which may explain the higher propensity for subtype D to utilise X4 tropism.
Collapse
|
4
|
Pujol-Hodge E, Salazar-Gonzalez JF, Ssemwanga D, Charlebois ED, Ayieko J, Grant HE, Liegler T, Atkins KE, Kaleebu P, Kamya MR, Petersen M, Havlir DV, Leigh Brown AJ. Detection of HIV-1 Transmission Clusters from Dried Blood Spots within a Universal Test-and-Treat Trial in East Africa. Viruses 2022; 14:1673. [PMID: 36016295 PMCID: PMC9414799 DOI: 10.3390/v14081673] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2022] [Revised: 07/15/2022] [Accepted: 07/25/2022] [Indexed: 11/24/2022] Open
Abstract
The Sustainable East Africa Research in Community Health (SEARCH) trial was a universal test-and-treat (UTT) trial in rural Uganda and Kenya, aiming to lower regional HIV-1 incidence. Here, we quantify breakthrough HIV-1 transmissions occurring during the trial from population-based, dried blood spot samples. Between 2013 and 2017, we obtained 549 gag and 488 pol HIV-1 consensus sequences from 745 participants: 469 participants infected prior to trial commencement and 276 SEARCH-incident infections. Putative transmission clusters, with a 1.5% pairwise genetic distance threshold, were inferred from maximum likelihood phylogenies; clusters arising after the start of SEARCH were identified with Bayesian time-calibrated phylogenies. Our phylodynamic approach identified nine clusters arising after the SEARCH start date: eight pairs and one triplet, representing mostly opposite-gender linked (6/9), within-community transmissions (7/9). Two clusters contained individuals with non-nucleoside reverse transcriptase inhibitor (NNRTI) resistance, both linked to intervention communities. The identification of SEARCH-incident, within-community transmissions reveals the role of unsuppressed individuals in sustaining the epidemic in both arms of a UTT trial setting. The presence of transmitted NNRTI resistance, implying treatment failure to the efavirenz-based antiretroviral therapy (ART) used during SEARCH, highlights the need to improve delivery and adherence to up-to-date ART recommendations, to halt HIV-1 transmission.
Collapse
Affiliation(s)
- Emma Pujol-Hodge
- Ashworth Laboratories, School of Biological Sciences, University of Edinburgh, Edinburgh EH9 3FL, UK; (E.P.-H.); (H.E.G.)
| | - Jesus F. Salazar-Gonzalez
- Medical Research Council (MRC)/Uganda Virus Research Institute (UVRI) and London School of Hygiene and Tropical Medicine (LSHTM) Uganda Research Unit, Entebbe P.O. Box 49, Uganda; (J.F.S.-G.); (D.S.); (P.K.)
| | - Deogratius Ssemwanga
- Medical Research Council (MRC)/Uganda Virus Research Institute (UVRI) and London School of Hygiene and Tropical Medicine (LSHTM) Uganda Research Unit, Entebbe P.O. Box 49, Uganda; (J.F.S.-G.); (D.S.); (P.K.)
- Uganda Virus Research Institute, Entebbe P.O. Box 49, Uganda
| | - Edwin D. Charlebois
- Division of Prevention Science, Department of Medicine, University of California, San Francisco, CA 94158, USA;
| | - James Ayieko
- Kenya Medical Research Institute, Nairobi P.O. Box 54840-00200, Kenya;
| | - Heather E. Grant
- Ashworth Laboratories, School of Biological Sciences, University of Edinburgh, Edinburgh EH9 3FL, UK; (E.P.-H.); (H.E.G.)
| | - Teri Liegler
- Division of HIV, Infectious Diseases and Global Medicine, Department of Medicine, University of California, San Francisco, CA 94110, USA; (T.L.); (D.V.H.)
| | - Katherine E. Atkins
- Usher Institute, University of Edinburgh, Edinburgh EH8 9AG, UK;
- Department of Infectious Disease Epidemiology, Faculty of Epidemiology and Population Health, LSHTM, London WC1E 7HT, UK
- Centre for Mathematical Modelling of Infectious Diseases, LSHTM, London WC1E 7HT, UK
| | - Pontiano Kaleebu
- Medical Research Council (MRC)/Uganda Virus Research Institute (UVRI) and London School of Hygiene and Tropical Medicine (LSHTM) Uganda Research Unit, Entebbe P.O. Box 49, Uganda; (J.F.S.-G.); (D.S.); (P.K.)
- Uganda Virus Research Institute, Entebbe P.O. Box 49, Uganda
| | - Moses R. Kamya
- School of Medicine, Makerere University, Kampala P.O. Box 7072, Uganda;
| | - Maya Petersen
- Division of Biostatistics, School of Public Health, University of California, Berkeley, CA 94720, USA;
| | - Diane V. Havlir
- Division of HIV, Infectious Diseases and Global Medicine, Department of Medicine, University of California, San Francisco, CA 94110, USA; (T.L.); (D.V.H.)
| | - Andrew J. Leigh Brown
- Ashworth Laboratories, School of Biological Sciences, University of Edinburgh, Edinburgh EH9 3FL, UK; (E.P.-H.); (H.E.G.)
| |
Collapse
|
5
|
Xi X, Spencer SEF, Hall M, Grabowski MK, Kagaayi J, Ratmann O. Inferring the sources of HIV infection in Africa from deep‐sequence data with semi‐parametric Bayesian Poisson flow models. J R Stat Soc Ser C Appl Stat 2022. [DOI: 10.1111/rssc.12544] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Xiaoyue Xi
- Department of MathematicsImperial College London LondonUK
| | | | - Matthew Hall
- Big Data Institute, Nuffield Department of MedicineUniversity of Oxford OxfordUK
| | - M. Kate Grabowski
- Department of PathologyJohns Hopkins University BaltimoreMDUSA
- Rakai Health Sciences Program KalisizoUganda
| | - Joseph Kagaayi
- Rakai Health Sciences Program KalisizoUganda
- Makerere University School of Public Health KampalaUganda
| | - Oliver Ratmann
- Department of MathematicsImperial College London LondonUK
| | | |
Collapse
|
6
|
Methods Combining Genomic and Epidemiological Data in the Reconstruction of Transmission Trees: A Systematic Review. Pathogens 2022; 11:pathogens11020252. [PMID: 35215195 PMCID: PMC8875843 DOI: 10.3390/pathogens11020252] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2021] [Revised: 02/08/2022] [Accepted: 02/11/2022] [Indexed: 11/17/2022] Open
Abstract
In order to better understand transmission dynamics and appropriately target control and preventive measures, studies have aimed to identify who-infected-whom in actual outbreaks. Numerous reconstruction methods exist, each with their own assumptions, types of data, and inference strategy. Thus, selecting a method can be difficult. Following PRISMA guidelines, we systematically reviewed the literature for methods combing epidemiological and genomic data in transmission tree reconstruction. We identified 22 methods from the 41 selected articles. We defined three families according to how genomic data was handled: a non-phylogenetic family, a sequential phylogenetic family, and a simultaneous phylogenetic family. We discussed methods according to the data needed as well as the underlying sequence mutation, within-host evolution, transmission, and case observation. In the non-phylogenetic family consisting of eight methods, pairwise genetic distances were estimated. In the phylogenetic families, transmission trees were inferred from phylogenetic trees either simultaneously (nine methods) or sequentially (five methods). While a majority of methods (17/22) modeled the transmission process, few (8/22) took into account imperfect case detection. Within-host evolution was generally (7/8) modeled as a coalescent process. These practical and theoretical considerations were highlighted in order to help select the appropriate method for an outbreak.
Collapse
|
7
|
Jiang H, Lan G, Zhu Q, Liang S, Li J, Feng Y, Lin M, Xing H, Shao Y. Non-student young men put students at high risk of human immunodeficiency virus acquisition in Guangxi, China: a phylogenetic analysis of surveillance data. Open Forum Infect Dis 2022; 9:ofac042. [PMID: 35198650 PMCID: PMC8860155 DOI: 10.1093/ofid/ofac042] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2021] [Accepted: 01/24/2022] [Indexed: 11/27/2022] Open
Abstract
Background We sought to identify students and their sexual partners in a molecular transmission network. Methods We obtained 5996 HIV protease and reverse transcriptase gene sequences in Guangxi (165 from students and 5831 from the general populations) and the relevant demographic data. We constructed a molecular transmission network and introduced a permutation test to assess the robust genetic linkages. We calculated the centrality measures to describe the transmission patterns in clusters. Results At the network level, 68 (41.2%) students fell within the network across 43 (8.1%) clusters. Of 141 genetic linkages between students and their partners, only 25 (17.7%) occurred within students. Students were more likely than random permutations to link to other students (odds ratio [OR], 7.2; P < .001), private company employees aged 16–24 years (OR, 3.3; P = .01), private company or government employees aged 25–49 years (OR, 1.7; P = .03), and freelancers or unemployed individuals aged 16–24 years (OR, 5.0; P < .001). At the cluster level, the median age of nonstudents directly linked to students (interquartile range) was 25 (22–30) years, and 80.3% of them had a high school or higher education background. Compared with students, they showed a significantly higher median degree (4.0 vs 2.0; P < .001) but an equivalent median Eigenvector Centrality (0.83 vs 0.81; P = .60). Conclusions The tendency of genetic linkage between students and nonstudent young men and their important position in the HIV transmission network emphasizes the urgent need for 2-pronged public health interventions based on both school and society.
Collapse
Affiliation(s)
- He Jiang
- Guangxi Key Laboratory of Major Infectious Disease Prevention Control and Biosafety Emergency Response, Guangxi Center for Disease Prevention and Control, Nanning, China
- State of Key Laboratory for Infectious Disease Prevention and Control, National Center for AIDS/STD Control and Prevention, Chinese Center for Disease Control and Prevention, Beijing, China
- Guangxi Key Laboratory of AIDS Prevention and Treatment & Guangxi Universities Key Laboratory of Prevention and Control of Highly Prevalent Disease, School of Public Health, Guangxi Medical University, Nanning, Guangxi, China
| | - Guanghua Lan
- Guangxi Key Laboratory of Major Infectious Disease Prevention Control and Biosafety Emergency Response, Guangxi Center for Disease Prevention and Control, Nanning, China
| | - Qiuying Zhu
- Guangxi Key Laboratory of Major Infectious Disease Prevention Control and Biosafety Emergency Response, Guangxi Center for Disease Prevention and Control, Nanning, China
| | - Shujia Liang
- Guangxi Key Laboratory of Major Infectious Disease Prevention Control and Biosafety Emergency Response, Guangxi Center for Disease Prevention and Control, Nanning, China
| | - Jianjun Li
- Guangxi Key Laboratory of Major Infectious Disease Prevention Control and Biosafety Emergency Response, Guangxi Center for Disease Prevention and Control, Nanning, China
| | - Yi Feng
- State of Key Laboratory for Infectious Disease Prevention and Control, National Center for AIDS/STD Control and Prevention, Chinese Center for Disease Control and Prevention, Beijing, China
| | - Mei Lin
- Guangxi Key Laboratory of Major Infectious Disease Prevention Control and Biosafety Emergency Response, Guangxi Center for Disease Prevention and Control, Nanning, China
| | - Hui Xing
- State of Key Laboratory for Infectious Disease Prevention and Control, National Center for AIDS/STD Control and Prevention, Chinese Center for Disease Control and Prevention, Beijing, China
| | - Yiming Shao
- State of Key Laboratory for Infectious Disease Prevention and Control, National Center for AIDS/STD Control and Prevention, Chinese Center for Disease Control and Prevention, Beijing, China
- Guangxi Key Laboratory of AIDS Prevention and Treatment & Guangxi Universities Key Laboratory of Prevention and Control of Highly Prevalent Disease, School of Public Health, Guangxi Medical University, Nanning, Guangxi, China
| |
Collapse
|
8
|
Kayondo HW, Ssekagiri A, Nabakooza G, Bbosa N, Ssemwanga D, Kaleebu P, Mwalili S, Mango JM, Leigh Brown AJ, Saenz RA, Galiwango R, Kitayimbwa JM. Employing phylogenetic tree shape statistics to resolve the underlying host population structure. BMC Bioinformatics 2021; 22:546. [PMID: 34758743 PMCID: PMC8579572 DOI: 10.1186/s12859-021-04465-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2020] [Accepted: 10/29/2021] [Indexed: 12/24/2022] Open
Abstract
Background Host population structure is a key determinant of pathogen and infectious disease transmission patterns. Pathogen phylogenetic trees are useful tools to reveal the population structure underlying an epidemic. Determining whether a population is structured or not is useful in informing the type of phylogenetic methods to be used in a given study. We employ tree statistics derived from phylogenetic trees and machine learning classification techniques to reveal an underlying population structure. Results In this paper, we simulate phylogenetic trees from both structured and non-structured host populations. We compute eight statistics for the simulated trees, which are: the number of cherries; Sackin, Colless and total cophenetic indices; ladder length; maximum depth; maximum width, and width-to-depth ratio. Based on the estimated tree statistics, we classify the simulated trees as from either a non-structured or a structured population using the decision tree (DT), K-nearest neighbor (KNN) and support vector machine (SVM). We incorporate the basic reproductive number (\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$R_0$$\end{document}R0) in our tree simulation procedure. Sensitivity analysis is done to investigate whether the classifiers are robust to different choice of model parameters and to size of trees. Cross-validated results for area under the curve (AUC) for receiver operating characteristic (ROC) curves yield mean values of over 0.9 for most of the classification models. Conclusions Our classification procedure distinguishes well between trees from structured and non-structured populations using the classifiers, the two-sample Kolmogorov-Smirnov, Cucconi and Podgor-Gastwirth tests and the box plots. SVM models were more robust to changes in model parameters and tree size compared to KNN and DT classifiers. Our classification procedure was applied to real -world data and the structured population was revealed with high accuracy of \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$92.3\%$$\end{document}92.3% using SVM-polynomial classifier.
Collapse
Affiliation(s)
- Hassan W Kayondo
- Institute of Basic Sciences, Technology and Innovation (PAUSTI), Pan African University, Nairobi, Kenya. .,Department of Mathematics, Makerere University, Kampala, Uganda.
| | - Alfred Ssekagiri
- Uganda Virus Research Institute (UVRI), Entebbe, Uganda.,Department of Immunology and Molecular Biology, Makerere University, Kampala, Uganda
| | - Grace Nabakooza
- Department of Immunology and Molecular Biology, Makerere University, Kampala, Uganda.,UVRI Centre of Excellence in Infection and Immunity Research and Training (MUII-Plus), Makerere University, Entebbe, Uganda.,Centre for Computational Biology, Uganda Christian University, Mukono, Uganda
| | - Nicholas Bbosa
- Medical Research Council (MRC)/Uganda Virus Research Institute (UVRI) and London School of Hygiene and Tropical Medicine (LSHTM) Uganda Research Unit, Entebbe, Uganda
| | - Deogratius Ssemwanga
- Uganda Virus Research Institute (UVRI), Entebbe, Uganda.,Medical Research Council (MRC)/Uganda Virus Research Institute (UVRI) and London School of Hygiene and Tropical Medicine (LSHTM) Uganda Research Unit, Entebbe, Uganda
| | - Pontiano Kaleebu
- Uganda Virus Research Institute (UVRI), Entebbe, Uganda.,Medical Research Council (MRC)/Uganda Virus Research Institute (UVRI) and London School of Hygiene and Tropical Medicine (LSHTM) Uganda Research Unit, Entebbe, Uganda
| | - Samuel Mwalili
- Department of Statistics and Actuarial Sciences, Jomo Kenyatta University of Agriculture and Technology, Nairobi, Kenya
| | - John M Mango
- Department of Mathematics, Makerere University, Kampala, Uganda
| | | | | | - Ronald Galiwango
- Centre for Computational Biology, Uganda Christian University, Mukono, Uganda
| | - John M Kitayimbwa
- Centre for Computational Biology, Uganda Christian University, Mukono, Uganda
| |
Collapse
|
9
|
Campbell EM, Boyles A, Shankar A, Kim J, Knyazev S, Cintron R, Switzer WM. MicrobeTrace: Retooling molecular epidemiology for rapid public health response. PLoS Comput Biol 2021; 17:e1009300. [PMID: 34492010 PMCID: PMC8491948 DOI: 10.1371/journal.pcbi.1009300] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2021] [Revised: 10/05/2021] [Accepted: 07/23/2021] [Indexed: 12/12/2022] Open
Abstract
Outbreak investigations use data from interviews, healthcare providers, laboratories and surveillance systems. However, integrated use of data from multiple sources requires a patchwork of software that present challenges in usability, interoperability, confidentiality, and cost. Rapid integration, visualization and analysis of data from multiple sources can guide effective public health interventions. We developed MicrobeTrace to facilitate rapid public health responses by overcoming barriers to data integration and exploration in molecular epidemiology. MicrobeTrace is a web-based, client-side, JavaScript application (https://microbetrace.cdc.gov) that runs in Chromium-based browsers and remains fully operational without an internet connection. Using publicly available data, we demonstrate the analysis of viral genetic distance networks and introduce a novel approach to minimum spanning trees that simplifies results. We also illustrate the potential utility of MicrobeTrace in support of contact tracing by analyzing and displaying data from an outbreak of SARS-CoV-2 in South Korea in early 2020. MicrobeTrace is developed and actively maintained by the Centers for Disease Control and Prevention. Users can email microbetrace@cdc.gov for support. The source code is available at https://github.com/cdcgov/microbetrace. Rapid advances in the fields of data science and bioinformatics have significantly improved molecular epidemiology tools used in public health and have led to major changes in the way outbreak investigation and pathogen transmission studies are conducted. However, the need for specialized computer skills often impedes the use of many of these tools in the public heath domain. We bridge this knowledge gap by development of an intuitive, standalone tool called MicrobeTrace to securely integrate, visualize and explore pathogen epidemiologic data. MicrobeTrace is an easy to use browser-based tool which can effectively merge contact tracing and/or microbial genomic data with demographic or behavioral information, resulting in elegant and informative networks as well as multiple customizable visualizations. MicrobeTrace can be used offline, with analyses being performed locally in the field, ensuring secure and confidential use of personally identifiable information (PII). We provide real world examples of how MicrobeTrace has been used in public health, including COVID outbreak investigations.
Collapse
Affiliation(s)
- Ellsworth M Campbell
- Centers for Disease Control and Prevention, Atlanta, Georgia, United States of America
| | - Anthony Boyles
- Northrup Grumman, Atlanta, Georgia, United States of America
| | - Anupama Shankar
- Centers for Disease Control and Prevention, Atlanta, Georgia, United States of America
| | - Jay Kim
- Northrup Grumman, Atlanta, Georgia, United States of America
| | - Sergey Knyazev
- Centers for Disease Control and Prevention, Atlanta, Georgia, United States of America.,Oak Ridge Institute for Science and Education, Oak Ridge, Tennessee, United States of America.,Department of Computer Science, Georgia State University, Atlanta, Georgia, United States of America
| | - Roxana Cintron
- Centers for Disease Control and Prevention, Atlanta, Georgia, United States of America
| | - William M Switzer
- Centers for Disease Control and Prevention, Atlanta, Georgia, United States of America
| |
Collapse
|
10
|
Phylogenetic Networks and Parameters Inferred from HIV Nucleotide Sequences of High-Risk and General Population Groups in Uganda: Implications for Epidemic Control. Viruses 2021; 13:v13060970. [PMID: 34073846 PMCID: PMC8225143 DOI: 10.3390/v13060970] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2021] [Revised: 05/13/2021] [Accepted: 05/18/2021] [Indexed: 12/17/2022] Open
Abstract
Phylogenetic inference is useful in characterising HIV transmission networks and assessing where prevention is likely to have the greatest impact. However, estimating parameters that influence the network structure is still scarce, but important in evaluating determinants of HIV spread. We analyzed 2017 HIV pol sequences (728 Lake Victoria fisherfolk communities (FFCs), 592 female sex workers (FSWs) and 697 general population (GP)) to identify transmission networks on Maximum Likelihood (ML) phylogenetic trees and refined them using time-resolved phylogenies. Network generative models were fitted to the observed degree distributions and network parameters, and corrected Akaike Information Criteria and Bayesian Information Criteria values were estimated. 347 (17.2%) HIV sequences were linked on ML trees (maximum genetic distance ≤4.5%, ≥95% bootstrap support) and, of these, 303 (86.7%) that consisted of pure A1 (n = 168) and D (n = 135) subtypes were analyzed in BEAST v1.8.4. The majority of networks (at least 40%) were found at a time depth of ≤5 years. The waring and yule models fitted best networks of FFCs and FSWs respectively while the negative binomial model fitted best networks in the GP. The network structure in the HIV-hyperendemic FFCs is likely to be scale-free and shaped by preferential attachment, in contrast to the GP. The findings support the targeting of interventions for FFCs in a timely manner for effective epidemic control. Interventions ought to be tailored according to the dynamics of the HIV epidemic in the target population and understanding the network structure is critical in ensuring the success of HIV prevention programs.
Collapse
|
11
|
Reichmuth ML, Chaudron SE, Bachmann N, Nguyen H, Böni J, Metzner KJ, Perreau M, Klimkait T, Yerly S, Hirsch HH, Hauser C, Ramette A, Vernazza P, Cavassini M, Bernasconi E, Günthard HF, Kusejko K, Kouyos RD. Using longitudinally sampled viral nucleotide sequences to characterize the drivers of HIV-1 transmission. HIV Med 2020; 22:346-359. [PMID: 33368946 DOI: 10.1111/hiv.13030] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2020] [Revised: 10/17/2020] [Accepted: 10/21/2020] [Indexed: 12/11/2022]
Abstract
OBJECTIVES Understanding the drivers of HIV-1 transmission is of importance for curbing the ongoing epidemic. Phylogenetic methods based on single viral sequences allow us to assess whether two individuals are part of the same viral outbreak, but cannot on their own assess who potentially transmitted the virus. We developed and assessed a molecular epidemiology method with the main aim to screen cohort studies for and to characterize individuals who are 'potential HIV-1 transmitters', in order to understand the drivers of HIV-1 transmission. METHODS We developed and validated a molecular epidemiology approach using longitudinally sampled viral Sanger sequences to characterize potential HIV-1 transmitters in the Swiss HIV Cohort Study. RESULTS Our method was able to identify 279 potential HIV-1 transmitters and allowed us to determine the main epidemiological and virological drivers of transmission. We found that the directionality of transmission was consistent with infection times for 72.9% of 85 potential HIV-1 transmissions with accurate infection date estimates. Being a potential HIV-1 transmitter was associated with risk factors including viral load [adjusted odds ratiomultivariable (95% confidence interval): 1.86 (1.49-2.32)], syphilis coinfection [1.52 (1.06-2.19)], and recreational drug use [1.45 (1.06-1.98)]. By contrast for the potential HIV-1 recipients, this association was weaker or even absent [1.18 (0.82-1.72), 0.89 (0.52-1.55) and 1.53 (0.98-2.39), respectively], indicating that inferred directionality of transmission is useful at the population level. CONCLUSIONS Our results indicate that longitudinally sampled Sanger sequences do not provide sufficient information to identify transmitters with high certainty at the individual level, but that they allow the drivers of transmission at the population level to be characterized.
Collapse
Affiliation(s)
- M L Reichmuth
- Division of Infectious Diseases and Hospital Epidemiology, University Hospital Zurich, Zurich, Switzerland.,Institute of Medical Virology, University of Zurich, Zurich, Switzerland
| | - S E Chaudron
- Division of Infectious Diseases and Hospital Epidemiology, University Hospital Zurich, Zurich, Switzerland.,Institute of Medical Virology, University of Zurich, Zurich, Switzerland
| | - N Bachmann
- Division of Infectious Diseases and Hospital Epidemiology, University Hospital Zurich, Zurich, Switzerland.,Institute of Medical Virology, University of Zurich, Zurich, Switzerland
| | - H Nguyen
- Division of Infectious Diseases and Hospital Epidemiology, University Hospital Zurich, Zurich, Switzerland.,Institute of Medical Virology, University of Zurich, Zurich, Switzerland
| | - J Böni
- Institute of Medical Virology, University of Zurich, Zurich, Switzerland
| | - K J Metzner
- Division of Infectious Diseases and Hospital Epidemiology, University Hospital Zurich, Zurich, Switzerland.,Institute of Medical Virology, University of Zurich, Zurich, Switzerland
| | - M Perreau
- Service of Immunology and Allergy, Lausanne University Hospital, University of Lausanne, Lausanne, Switzerland
| | - T Klimkait
- Department of Biomedicine, University of Basel, Basel, Switzerland
| | - S Yerly
- Laboratory of Virology, Geneva University Hospital, Geneva, Switzerland
| | - H H Hirsch
- Division of Infectious Diseases and Hospital Epidemiology, University Hospital Basel, Basel, Switzerland.,Clinical Virology, Laboratory Medicine, University Hospital Basel, Basel, Switzerland
| | - C Hauser
- Department of Infectious Diseases, Bern University Hospital, Bern, Switzerland
| | - A Ramette
- Institute for Infectious Diseases, University of Bern, Bern, Switzerland
| | - P Vernazza
- Division of Infectious Diseases, Cantonal Hospital St Gallen, St Gallen, Switzerland
| | - M Cavassini
- Service of Infectious Diseases, Lausanne University Hospital, University of Lausanne, Lausanne, Switzerland
| | - E Bernasconi
- Division of Infectious Diseases, Regional Hospital Lugano, Lugano, Switzerland
| | - H F Günthard
- Division of Infectious Diseases and Hospital Epidemiology, University Hospital Zurich, Zurich, Switzerland.,Institute of Medical Virology, University of Zurich, Zurich, Switzerland
| | - K Kusejko
- Division of Infectious Diseases and Hospital Epidemiology, University Hospital Zurich, Zurich, Switzerland.,Institute of Medical Virology, University of Zurich, Zurich, Switzerland
| | - R D Kouyos
- Division of Infectious Diseases and Hospital Epidemiology, University Hospital Zurich, Zurich, Switzerland.,Institute of Medical Virology, University of Zurich, Zurich, Switzerland
| | | |
Collapse
|
12
|
Ssemwanga D, Bbosa N, Nsubuga RN, Ssekagiri A, Kapaata A, Nannyonjo M, Nassolo F, Karabarinde A, Mugisha J, Seeley J, Yebra G, Leigh Brown A, Kaleebu P. The Molecular Epidemiology and Transmission Dynamics of HIV Type 1 in a General Population Cohort in Uganda. Viruses 2020; 12:v12111283. [PMID: 33182587 PMCID: PMC7697205 DOI: 10.3390/v12111283] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2020] [Revised: 10/01/2020] [Accepted: 10/02/2020] [Indexed: 12/13/2022] Open
Abstract
The General Population Cohort (GPC) in south-western Uganda has a low HIV-1 incidence rate (<1%). However, new infections continue to emerge. In this research, 3796 HIV-1 pol sequences (GPC: n = 1418, non-GPC sites: n = 1223, Central Uganda: n = 1010 and Eastern Uganda: n = 145) generated between 2003–2015 were analysed using phylogenetic methods with demographic data to understand HIV-1 transmission in this cohort and inform the epidemic response. HIV-1 subtype A1 was the most prevalent strain in the GPC area (GPC and non-GPC sites) (39.8%), central (45.9%) and eastern (52.4%) Uganda. However, in the GPC alone, subtype D was the predominant subtype (39.1%). Of the 524 transmission clusters identified by Cluster Picker, all large clusters (≥5 individuals, n = 8) involved individuals from the GPC. In a multivariate analysis, clustering was strongly associated with being female (adjusted Odds Ratio, aOR = 1.28; 95% CI, 1.06–1.54), being >25 years (aOR = 1.52; 95% CI, 1.16–2.0) and being a resident in the GPC (aOR = 6.90; 95% CI, 5.22–9.21). Phylogeographic analysis showed significant viral dissemination (Bayes Factor test, BF > 3) from the GPC without significant viral introductions (BF < 3) into the GPC. The findings suggest localized HIV-1 transmission in the GPC. Intensifying geographically focused combination interventions in the GPC would contribute towards controlling HIV-1 infections.
Collapse
Affiliation(s)
- Deogratius Ssemwanga
- Medical Research Council (MRC)/Uganda Virus Research Institute (UVRI) and London School of Hygiene and Tropical Medicine (LSHTM) Uganda Research Unit, Entebbe 256, Uganda; (N.B.); (R.N.N.); (A.K.); (M.N.); (F.N.); (A.K.); (J.M.); (J.S.); (P.K.)
- Department of General Virology, Uganda Virus Research Institute, Entebbe 256, Uganda;
- Correspondence: ; Tel.: +256-(0)-417-704000
| | - Nicholas Bbosa
- Medical Research Council (MRC)/Uganda Virus Research Institute (UVRI) and London School of Hygiene and Tropical Medicine (LSHTM) Uganda Research Unit, Entebbe 256, Uganda; (N.B.); (R.N.N.); (A.K.); (M.N.); (F.N.); (A.K.); (J.M.); (J.S.); (P.K.)
| | - Rebecca N. Nsubuga
- Medical Research Council (MRC)/Uganda Virus Research Institute (UVRI) and London School of Hygiene and Tropical Medicine (LSHTM) Uganda Research Unit, Entebbe 256, Uganda; (N.B.); (R.N.N.); (A.K.); (M.N.); (F.N.); (A.K.); (J.M.); (J.S.); (P.K.)
| | - Alfred Ssekagiri
- Department of General Virology, Uganda Virus Research Institute, Entebbe 256, Uganda;
| | - Anne Kapaata
- Medical Research Council (MRC)/Uganda Virus Research Institute (UVRI) and London School of Hygiene and Tropical Medicine (LSHTM) Uganda Research Unit, Entebbe 256, Uganda; (N.B.); (R.N.N.); (A.K.); (M.N.); (F.N.); (A.K.); (J.M.); (J.S.); (P.K.)
| | - Maria Nannyonjo
- Medical Research Council (MRC)/Uganda Virus Research Institute (UVRI) and London School of Hygiene and Tropical Medicine (LSHTM) Uganda Research Unit, Entebbe 256, Uganda; (N.B.); (R.N.N.); (A.K.); (M.N.); (F.N.); (A.K.); (J.M.); (J.S.); (P.K.)
| | - Faridah Nassolo
- Medical Research Council (MRC)/Uganda Virus Research Institute (UVRI) and London School of Hygiene and Tropical Medicine (LSHTM) Uganda Research Unit, Entebbe 256, Uganda; (N.B.); (R.N.N.); (A.K.); (M.N.); (F.N.); (A.K.); (J.M.); (J.S.); (P.K.)
| | - Alex Karabarinde
- Medical Research Council (MRC)/Uganda Virus Research Institute (UVRI) and London School of Hygiene and Tropical Medicine (LSHTM) Uganda Research Unit, Entebbe 256, Uganda; (N.B.); (R.N.N.); (A.K.); (M.N.); (F.N.); (A.K.); (J.M.); (J.S.); (P.K.)
| | - Joseph Mugisha
- Medical Research Council (MRC)/Uganda Virus Research Institute (UVRI) and London School of Hygiene and Tropical Medicine (LSHTM) Uganda Research Unit, Entebbe 256, Uganda; (N.B.); (R.N.N.); (A.K.); (M.N.); (F.N.); (A.K.); (J.M.); (J.S.); (P.K.)
| | - Janet Seeley
- Medical Research Council (MRC)/Uganda Virus Research Institute (UVRI) and London School of Hygiene and Tropical Medicine (LSHTM) Uganda Research Unit, Entebbe 256, Uganda; (N.B.); (R.N.N.); (A.K.); (M.N.); (F.N.); (A.K.); (J.M.); (J.S.); (P.K.)
- Department of Global Health and Development, London School of Hygiene and Tropical Medicine, 15-17 Tavistock Place, London WC1H 9SH, UK
| | - Gonzalo Yebra
- The Roslin Institute, Royal (Dick) School of Veterinary Medicine, University of Edinburgh, Easter Bush Campus, Edinburgh EH25 9RG, UK;
| | - Andrew Leigh Brown
- Institute of Evolutionary Biology, University of Edinburgh, Edinburgh EH9 3FL, UK;
| | - Pontiano Kaleebu
- Medical Research Council (MRC)/Uganda Virus Research Institute (UVRI) and London School of Hygiene and Tropical Medicine (LSHTM) Uganda Research Unit, Entebbe 256, Uganda; (N.B.); (R.N.N.); (A.K.); (M.N.); (F.N.); (A.K.); (J.M.); (J.S.); (P.K.)
- Department of General Virology, Uganda Virus Research Institute, Entebbe 256, Uganda;
| |
Collapse
|
13
|
Novitsky V, Steingrimsson JA, Howison M, Gillani FS, Li Y, Manne A, Fulton J, Spence M, Parillo Z, Marak T, Chan PA, Bertrand T, Bandy U, Alexander-Scott N, Dunn CW, Hogan J, Kantor R. Empirical comparison of analytical approaches for identifying molecular HIV-1 clusters. Sci Rep 2020; 10:18547. [PMID: 33122765 PMCID: PMC7596705 DOI: 10.1038/s41598-020-75560-1] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2020] [Accepted: 09/21/2020] [Indexed: 01/10/2023] Open
Abstract
Public health interventions guided by clustering of HIV-1 molecular sequences may be impacted by choices of analytical approaches. We identified commonly-used clustering analytical approaches, applied them to 1886 HIV-1 Rhode Island sequences from 2004-2018, and compared concordance in identifying molecular HIV-1 clusters within and between approaches. We used strict (topological support ≥ 0.95; distance 0.015 substitutions/site) and relaxed (topological support 0.80-0.95; distance 0.030-0.045 substitutions/site) thresholds to reflect different epidemiological scenarios. We found that clustering differed by method and threshold and depended more on distance than topological support thresholds. Clustering concordance analyses demonstrated some differences across analytical approaches, with RAxML having the highest (91%) mean summary percent concordance when strict thresholds were applied, and three (RAxML-, FastTree regular bootstrap- and IQ-Tree regular bootstrap-based) analytical approaches having the highest (86%) mean summary percent concordance when relaxed thresholds were applied. We conclude that different analytical approaches can yield diverse HIV-1 clustering outcomes and may need to be differentially used in diverse public health scenarios. Recognizing the variability and limitations of commonly-used methods in cluster identification is important for guiding clustering-triggered interventions to disrupt new transmissions and end the HIV epidemic.
Collapse
Affiliation(s)
| | | | - Mark Howison
- Research Improving People's Life, Providence, RI, USA
| | | | | | | | | | | | | | | | - Philip A Chan
- Brown University, Providence, RI, USA
- Rhode Island Department of Health, Providence, RI, USA
| | | | - Utpala Bandy
- Rhode Island Department of Health, Providence, RI, USA
| | | | | | | | | |
Collapse
|