1
|
Weaver S, Dávila Conn VM, Ji D, Verdonk H, Ávila-Ríos S, Leigh Brown AJ, Wertheim JO, Kosakovsky Pond SL. AUTO-TUNE: selecting the distance threshold for inferring HIV transmission clusters. FRONTIERS IN BIOINFORMATICS 2024; 4:1400003. [PMID: 39086842 PMCID: PMC11289888 DOI: 10.3389/fbinf.2024.1400003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2024] [Accepted: 05/17/2024] [Indexed: 08/02/2024] Open
Abstract
Molecular surveillance of viral pathogens and inference of transmission networks from genomic data play an increasingly important role in public health efforts, especially for HIV-1. For many methods, the genetic distance threshold used to connect sequences in the transmission network is a key parameter informing the properties of inferred networks. Using a distance threshold that is too high can result in a network with many spurious links, making it difficult to interpret. Conversely, a distance threshold that is too low can result in a network with too few links, which may not capture key insights into clusters of public health concern. Published research using the HIV-TRACE software package frequently uses the default threshold of 0.015 substitutions/site for HIV pol gene sequences, but in many cases, investigators heuristically select other threshold parameters to better capture the underlying dynamics of the epidemic they are studying. Here, we present a general heuristic scoring approach for tuning a distance threshold adaptively, which seeks to prevent the formation of giant clusters. We prioritize the ratio of the sizes of the largest and the second largest cluster, maximizing the number of clusters present in the network. We apply our scoring heuristic to outbreaks with different characteristics, such as regional or temporal variability, and demonstrate the utility of using the scoring mechanism's suggested distance threshold to identify clusters exhibiting risk factors that would have otherwise been more difficult to identify. For example, while we found that a 0.015 substitutions/site distance threshold is typical for US-like epidemics, recent outbreaks like the CRF07_BC subtype among men who have sex with men (MSM) in China have been found to have a lower optimal threshold of 0.005 to better capture the transition from injected drug use (IDU) to MSM as the primary risk factor. Alternatively, in communities surrounding Lake Victoria in Uganda, where there has been sustained heterosexual transmission for many years, we found that a larger distance threshold is necessary to capture a more risk factor-diverse population with sparse sampling over a longer period of time. Such identification may allow for more informed intervention action by respective public health officials.
Collapse
Affiliation(s)
- Steven Weaver
- Center for Viral Evolution, Temple University, Philadelphia, PA, United States
| | - Vanessa M. Dávila Conn
- Center for Research in Infectious Diseases, National Institute of Respiratory Diseases, Mexico City, Mexico
| | - Daniel Ji
- Department of Medicine, University of California San Diego, La Jolla, CA, United States
| | - Hannah Verdonk
- Center for Viral Evolution, Temple University, Philadelphia, PA, United States
| | | | - Andrew J. Leigh Brown
- Department of Medicine, University of California San Diego, La Jolla, CA, United States
| | - Joel O. Wertheim
- Department of Medicine, University of California San Diego, La Jolla, CA, United States
| | | |
Collapse
|
2
|
Weaver S, Dávila-Conn V, Ji D, Verdonk H, Ávila-Ríos S, Leigh Brown AJ, Wertheim JO, Kosakovsky Pond SL. AUTO-TUNE: SELECTING THE DISTANCE THRESHOLD FOR INFERRING HIV TRANSMISSION CLUSTERS. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.03.11.584522. [PMID: 38559140 PMCID: PMC10979987 DOI: 10.1101/2024.03.11.584522] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/04/2024]
Abstract
Molecular surveillance of viral pathogens and inference of transmission networks from genomic data play an increasingly important role in public health efforts, especially for HIV-1. For many methods, the genetic distance threshold used to connect sequences in the transmission network is a key parameter informing the properties of inferred networks. Using a distance threshold that is too high can result in a network with many spurious links, making it difficult to interpret. Conversely, a distance threshold that is too low can result in a network with too few links, which may not capture key insights into clusters of public health concern. Published research using the HIV-TRACE software package frequently uses the default threshold of 0.015 substitutions/site for HIV pol gene sequences, but in many cases, investigators heuristically select other threshold parameters to better capture the underlying dynamics of the epidemic they are studying. Here, we present a general heuristic scoring approach for tuning a distance threshold adaptively, which seeks to prevent the formation of giant clusters. We prioritize the ratio of the sizes of the largest and the second largest cluster, maximizing the number of clusters present in the network. We apply our scoring heuristic to outbreaks with different characteristics, such as regional or temporal variability, and demonstrate the utility of using the scoring mechanism's suggested distance threshold to identify clusters exhibiting risk factors that would have otherwise been more difficult to identify. For example, while we found that a 0.015 substitutions/site distance threshold is typical for US-like epidemics, recent outbreaks like the CRF07_BC subtype among men who have sex with men (MSM) in China have been found to have a lower optimal threshold of 0.005 to better capture the transition from injected drug use (IDU) to MSM as the primary risk factor. Alternatively, in communities surrounding Lake Victoria in Uganda, where there has been sustained hetero-sexual transmission for many years, we found that a larger distance threshold is necessary to capture a more risk factor-diverse population with sparse sampling over a longer period of time. Such identification may allow for more informed intervention action by respective public health officials.
Collapse
Affiliation(s)
- Steven Weaver
- Center for Viral Evolution, Temple University, Philadelphia, PA, USA
| | - Vanessa Dávila-Conn
- Center for Research in Infectious Diseases, National Institute of Respiratory Diseases, Mexico City, Mexico
| | - Daniel Ji
- Department of Computer Science & Engineering, UC San Diego, La Jolla, CA 92093, USA
| | - Hannah Verdonk
- Center for Viral Evolution, Temple University, Philadelphia, PA, USA
| | - Santiago Ávila-Ríos
- Center for Research in Infectious Diseases, National Institute of Respiratory Diseases, Mexico City, Mexico
| | - Andrew J Leigh Brown
- School of Biological Sciences, University of Edinburgh, Edinburgh, Scotland, United Kingdom
| | - Joel O Wertheim
- Department of Medicine, University of California San Diego, La Jolla, CA 92093, USA
| | | |
Collapse
|
3
|
Optimized phylogenetic clustering of HIV-1 sequence data for public health applications. PLoS Comput Biol 2022; 18:e1010745. [PMID: 36449514 PMCID: PMC9744331 DOI: 10.1371/journal.pcbi.1010745] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2022] [Revised: 12/12/2022] [Accepted: 11/17/2022] [Indexed: 12/02/2022] Open
Abstract
Clusters of genetically similar infections suggest rapid transmission and may indicate priorities for public health action or reveal underlying epidemiological processes. However, clusters often require user-defined thresholds and are sensitive to non-epidemiological factors, such as non-random sampling. Consequently the ideal threshold for public health applications varies substantially across settings. Here, we show a method which selects optimal thresholds for phylogenetic (subset tree) clustering based on population. We evaluated this method on HIV-1 pol datasets (n = 14, 221 sequences) from four sites in USA (Tennessee, Washington), Canada (Northern Alberta) and China (Beijing). Clusters were defined by tips descending from an ancestral node (with a minimum bootstrap support of 95%) through a series of branches, each with a length below a given threshold. Next, we used pplacer to graft new cases to the fixed tree by maximum likelihood. We evaluated the effect of varying branch-length thresholds on cluster growth as a count outcome by fitting two Poisson regression models: a null model that predicts growth from cluster size, and an alternative model that includes mean collection date as an additional covariate. The alternative model was favoured by AIC across most thresholds, with optimal (greatest difference in AIC) thresholds ranging 0.007-0.013 across sites. The range of optimal thresholds was more variable when re-sampling 80% of the data by location (IQR 0.008 - 0.016, n = 100 replicates). Our results use prospective phylogenetic cluster growth and suggest that there is more variation in effective thresholds for public health than those typically used in clustering studies.
Collapse
|
4
|
OUP accepted manuscript. Syst Biol 2022; 71:1378-1390. [DOI: 10.1093/sysbio/syac008] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2020] [Revised: 02/05/2022] [Accepted: 02/08/2022] [Indexed: 11/12/2022] Open
|
5
|
Chindelevitch L, Hayati M, Poon AFY, Colijn C. Network science inspires novel tree shape statistics. PLoS One 2021; 16:e0259877. [PMID: 34941890 PMCID: PMC8699983 DOI: 10.1371/journal.pone.0259877] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2021] [Accepted: 10/28/2021] [Indexed: 11/18/2022] Open
Abstract
The shape of phylogenetic trees can be used to gain evolutionary insights. A tree’s shape specifies the connectivity of a tree, while its branch lengths reflect either the time or genetic distance between branching events; well-known measures of tree shape include the Colless and Sackin imbalance, which describe the asymmetry of a tree. In other contexts, network science has become an important paradigm for describing structural features of networks and using them to understand complex systems, ranging from protein interactions to social systems. Network science is thus a potential source of many novel ways to characterize tree shape, as trees are also networks. Here, we tailor tools from network science, including diameter, average path length, and betweenness, closeness, and eigenvector centrality, to summarize phylogenetic tree shapes. We thereby propose tree shape summaries that are complementary to both asymmetry and the frequencies of small configurations. These new statistics can be computed in linear time and scale well to describe the shapes of large trees. We apply these statistics, alongside some conventional tree statistics, to phylogenetic trees from three very different viruses (HIV, dengue fever and measles), from the same virus in different epidemiological scenarios (influenza A and HIV) and from simulation models known to produce trees with different shapes. Using mutual information and supervised learning algorithms, we find that the statistics adapted from network science perform as well as or better than conventional statistics. We describe their distributions and prove some basic results about their extreme values in a tree. We conclude that network science-based tree shape summaries are a promising addition to the toolkit of tree shape features. All our shape summaries, as well as functions to select the most discriminating ones for two sets of trees, are freely available as an R package at http://github.com/Leonardini/treeCentrality.
Collapse
Affiliation(s)
- Leonid Chindelevitch
- MRC Centre for Global Infectious Disease Analysis, Imperial College London, London, United Kingdom
- * E-mail:
| | - Maryam Hayati
- School of Computing Science, Simon Fraser University, Burnaby, BC, Canada
| | - Art F. Y. Poon
- Department of Pathology & Laboratory Medicine, University of Western Ontario, London, ON, Canada
| | - Caroline Colijn
- Department of Mathematics, Simon Fraser University, Burnaby, BC, Canada
| |
Collapse
|
6
|
Sivay MV, Totmenin AV, Zyryanova DP, Osipova IP, Nalimova TM, Gashnikova MP, Ivlev VV, Meshkov IO, Chokmorova UZ, Narmatova E, Motorov U, Akmatova Z, Asybalieva N, Bekbolotov AA, Kadyrbekov UK, Maksutov RA, Gashnikova NM. Characterization of HIV-1 Epidemic in Kyrgyzstan. Front Microbiol 2021; 12:753675. [PMID: 34721358 PMCID: PMC8554114 DOI: 10.3389/fmicb.2021.753675] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2021] [Accepted: 09/14/2021] [Indexed: 11/13/2022] Open
Abstract
Kyrgyzstan has one of the highest rates of HIV-1 spread in Central Asia. In this study, we used molecular–epidemiological approaches to examine the HIV-1 epidemic in Kyrgyzstan. Samples were obtained from HIV-positive individuals who visited HIV/AIDS clinics. Partial pol gene sequences were used to identify HIV-1 subtypes and drug resistance mutations (DRMs) and to perform phylogenetic analysis. Genetic diversity and history reconstruction of the major HIV-1 subtypes were explored using BEAST. This study includes an analysis of 555 HIV-positive individuals. The study population was equally represented by men and women aged 1–72 years. Heterosexual transmission was the most frequent, followed by nosocomial infection. Men were more likely to acquire HIV-1 during injection drug use and while getting clinical services, while women were more likely to be infected through sexual contacts (p < 0.01). Heterosexual transmission was the more prevalent among individuals 25–49 years old; individuals over 49 years old were more likely to be persons who inject drugs (PWID). The major HIV-1 variants were CRF02_AG, CRF63_02A, and sub-subtype A6. Major DRMs were detected in 26.9% of the study individuals; 62.2% of those had DRMs to at least two antiretroviral (ARV) drug classes. Phylogenetic analysis revealed a well-defined structure of CRF02_AG, indicating locally evolving sub-epidemics. The lack of well-defined phylogenetic structure was observed for sub-subtype A6. The estimated origin date of CRF02_AG was January 1997; CRF63_02A, April 2004; and A6, June 1995. A rapid evolutionary dynamic of CRF02_AG and A6 among Kyrgyz population since the mid-1990s was observed. We observed the high levels of HIV-1 genetic diversity and drug resistance in the study population. Complex patterns of HIV-1 phylogenetics in Kyrgyzstan were found. This study highlights the importance of molecular–epidemiological analysis for HIV-1 surveillance and treatment implementation to reduce new HIV-1 infections.
Collapse
Affiliation(s)
- Mariya V Sivay
- Department of Retroviruses, State Research Center of Virology and Biotechnology "Vector", Koltsovo, Russia
| | - Alexei V Totmenin
- Department of Retroviruses, State Research Center of Virology and Biotechnology "Vector", Koltsovo, Russia
| | - Daria P Zyryanova
- Department of Retroviruses, State Research Center of Virology and Biotechnology "Vector", Koltsovo, Russia
| | - Irina P Osipova
- Department of Retroviruses, State Research Center of Virology and Biotechnology "Vector", Koltsovo, Russia
| | - Tatyana M Nalimova
- Department of Retroviruses, State Research Center of Virology and Biotechnology "Vector", Koltsovo, Russia
| | - Mariya P Gashnikova
- Department of Retroviruses, State Research Center of Virology and Biotechnology "Vector", Koltsovo, Russia
| | - Vladimir V Ivlev
- Department of Retroviruses, State Research Center of Virology and Biotechnology "Vector", Koltsovo, Russia
| | | | - Umut Z Chokmorova
- Republican Center of AIDS, Ministry of Health of Kyrgyzstan, Bishkek, Kyrgyzstan
| | - Elmira Narmatova
- Osh Regional Center of AIDS Treatment and Prevention, Osh, Kyrgyzstan
| | - Ulukbek Motorov
- Osh Regional Center of AIDS Treatment and Prevention, Osh, Kyrgyzstan
| | - Zhyldyz Akmatova
- Republican Center of AIDS, Ministry of Health of Kyrgyzstan, Bishkek, Kyrgyzstan
| | - Nazgul Asybalieva
- Republican Center of AIDS, Ministry of Health of Kyrgyzstan, Bishkek, Kyrgyzstan
| | - Aybek A Bekbolotov
- Republican Center of AIDS, Ministry of Health of Kyrgyzstan, Bishkek, Kyrgyzstan
| | - Ulan K Kadyrbekov
- Republican Center of AIDS, Ministry of Health of Kyrgyzstan, Bishkek, Kyrgyzstan
| | - Rinat A Maksutov
- Department of Retroviruses, State Research Center of Virology and Biotechnology "Vector", Koltsovo, Russia
| | - Natalya M Gashnikova
- Department of Retroviruses, State Research Center of Virology and Biotechnology "Vector", Koltsovo, Russia
| |
Collapse
|
7
|
Fujimoto K, Bahl J, Wertheim JO, Del Vecchio N, Hicks JT, Damodaran L, Hallmark CJ, Lavingia R, Mora R, Carr M, Yang B, Schneider JA, Hwang LY, McNeese M. Methodological synthesis of Bayesian phylodynamics, HIV-TRACE, and GEE: HIV-1 transmission epidemiology in a racially/ethnically diverse Southern U.S. context. Sci Rep 2021; 11:3325. [PMID: 33558579 PMCID: PMC7870963 DOI: 10.1038/s41598-021-82673-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2020] [Accepted: 01/22/2021] [Indexed: 12/30/2022] Open
Abstract
This study introduces an innovative methodological approach to identify potential drivers of structuring HIV-1 transmission clustering patterns between different subpopulations in the culturally and racially/ethnically diverse context of Houston, TX, the largest city in the Southern United States. Using 6332 HIV-1 pol sequences from persons newly diagnosed with HIV during the period 2010–2018, we reconstructed HIV-1 transmission clusters, using the HIV-TRAnsmission Cluster Engine (HIV-TRACE); inferred demographic and risk parameters on HIV-1 transmission dynamics by jointly estimating viral transmission rates across racial/ethnic, age, and transmission risk groups; and modeled the degree of network connectivity by using generalized estimating equations (GEE). Our results indicate that Hispanics/Latinos are most vulnerable to the structure of transmission clusters and serve as a bridge population, acting as recipients of transmissions from Whites (3.0 state changes/year) and from Blacks (2.6 state changes/year) as well as sources of transmissions to Whites (1.8 state changes/year) and to Blacks (1.2 state changes/year). There were high rates of transmission and high network connectivity between younger and older Hispanics/Latinos as well as between younger and older Blacks. Prevention and intervention efforts are needed for transmission clusters that involve younger racial/ethnic minorities, in particular Hispanic/Latino youth, to reduce onward transmission of HIV in Houston.
Collapse
Affiliation(s)
- Kayo Fujimoto
- Department of Health Promotion and Behavioral Sciences, The University of Texas Health Science Center at Houston, 7000 Fannin Street, UCT 2514, Houston, TX, 77030, USA.
| | - Justin Bahl
- Department of Infectious Diseases, University of Georgia, Athens, GA, USA
| | - Joel O Wertheim
- Department of Medicine, University of California San Diego, La Jolla, CA, USA
| | - Natascha Del Vecchio
- Department of Biostatistics and Data Science, The University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Joseph T Hicks
- Department of Infectious Diseases, University of Georgia, Athens, GA, USA
| | | | - Camden J Hallmark
- Division of Disease Prevention and Control, Houston Health Department, Houston, TX, USA
| | - Richa Lavingia
- Department of Health Promotion and Behavioral Sciences, The University of Texas Health Science Center at Houston, 7000 Fannin Street, UCT 2514, Houston, TX, 77030, USA
| | - Ricardo Mora
- Division of Disease Prevention and Control, Houston Health Department, Houston, TX, USA
| | - Michelle Carr
- Division of Disease Prevention and Control, Houston Health Department, Houston, TX, USA
| | - Biru Yang
- Division of Disease Prevention and Control, Houston Health Department, Houston, TX, USA
| | | | - Lu-Yu Hwang
- Department of Epidemiology, Human Genetics, and Environmental Science, The University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Marlene McNeese
- Division of Disease Prevention and Control, Houston Health Department, Houston, TX, USA
| |
Collapse
|
8
|
Abstract
PURPOSE OF REVIEW A major goal of public health in relation to HIV/AIDS is to prevent new transmissions in communities. Phylogenetic techniques have improved our understanding of the structure and dynamics of HIV transmissions. However, there is still no consensus about phylogenetic methodology, sampling coverage, gene target and/or minimum fragment size. RECENT FINDINGS Several studies use a combined methodology, which includes both a genetic or patristic distance cut-off and a branching support threshold to identify phylogenetic clusters. However, the choice about these thresholds remains an inherently subjective process, which affects the results of these studies. There is still a lack of consensus about the genomic region and the size of fragments that should be used, although there seems to be emerging a consensus that using longer segments, allied with the use of a realistic model of evolution and a codon alignment, increases the likelihood of inferring true transmission clusters. The pol gene is still the most used genomic region, but recent studies have suggested that whole genomes and/or sequences from nef and gp41 are also good targets for cluster reconstruction. SUMMARY The development and application of standard methodologies for phylogenetic clustering analysis will advance our understanding of factors associated with HIV transmission. This will lead to the design of more precise public health interventions.
Collapse
|
9
|
Chato C, Kalish ML, Poon AFY. Public health in genetic spaces: a statistical framework to optimize cluster-based outbreak detection. Virus Evol 2020; 6:veaa011. [PMID: 32190349 PMCID: PMC7069216 DOI: 10.1093/ve/veaa011] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
Genetic clustering is a popular method for characterizing variation in transmission rates for rapidly evolving viruses, and could potentially be used to detect outbreaks in 'near real time'. However, the statistical properties of clustering are poorly understood in this context, and there are no objective guidelines for setting clustering criteria. Here, we develop a new statistical framework to optimize a genetic clustering method based on the ability to forecast new cases. We analysed the pairwise Tamura-Nei (TN93) genetic distances for anonymized HIV-1 subtype B pol sequences from Seattle (n = 1,653) and Middle Tennessee, USA (n = 2,779), and northern Alberta, Canada (n = 809). Under varying TN93 thresholds, we fit two models to the distributions of new cases relative to clusters of known cases: 1, a null model that assumes cluster growth is strictly proportional to cluster size, i.e. no variation in transmission rates among individuals; and 2, a weighted model that incorporates individual-level covariates, such as recency of diagnosis. The optimal threshold maximizes the difference in information loss between models, where covariates are used most effectively. Optimal TN93 thresholds varied substantially between data sets, e.g. 0.0104 in Alberta and 0.016 in Seattle and Tennessee, such that the optimum for one population would potentially misdirect prevention efforts in another. For a given population, the range of thresholds where the weighted model conferred greater predictive accuracy tended to be narrow (±0.005 units), and the optimal threshold tended to be stable over time. Our framework also indicated that variation in the recency of HIV diagnosis among clusters was significantly more predictive of new cases than sample collection dates (ΔAIC > 50). These results suggest that one cannot rely on historical precedence or convention to configure genetic clustering methods for public health applications, especially when translating methods between settings of low-level and generalized epidemics. Our framework not only enables investigators to calibrate a clustering method to a specific public health setting, but also provides a variable selection procedure to evaluate different predictive models of cluster growth.
Collapse
Affiliation(s)
- Connor Chato
- Department of Pathology and Laboratory Medicine, Western University, Dental Sciences Building DSB4044, London N6A 5C1, Canada
| | - Marcia L Kalish
- Division of Infectious Diseases, Department of Medicine, Vanderbilt University School of Medicine, 1161 21st Ave S, Nashville, TN 37232, USA
| | - Art F Y Poon
- Department of Pathology and Laboratory Medicine, Western University, Dental Sciences Building DSB4044, London N6A 5C1, Canada
- Department of Applied Mathematics, Western University, Middlesex College MC255, London N6A 5B7, Canada
- Department of Microbiology and Immunology, Western University, Dental Science Building DSB3014, London N6A 5C1, Canada
| |
Collapse
|
10
|
Volz EM, Le Vu S, Ratmann O, Tostevin A, Dunn D, Orkin C, O'Shea S, Delpech V, Brown A, Gill N, Fraser C. Molecular Epidemiology of HIV-1 Subtype B Reveals Heterogeneous Transmission Risk: Implications for Intervention and Control. J Infect Dis 2019; 217:1522-1529. [PMID: 29506269 PMCID: PMC5913615 DOI: 10.1093/infdis/jiy044] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2017] [Accepted: 01/22/2018] [Indexed: 11/25/2022] Open
Abstract
Background The impact of HIV pre-exposure prophylaxis (PrEP) depends on infections averted by protecting vulnerable individuals as well as infections averted by preventing transmission by those who would have been infected if not receiving PrEP. Analysis of HIV phylogenies reveals risk factors for transmission, which we examine as potential criteria for allocating PrEP. Methods We analyzed 6912 HIV-1 partial pol sequences from men who have sex with men (MSM) in the United Kingdom combined with global reference sequences and patient-level metadata. Population genetic models were developed that adjust for stage of infection, global migration of HIV lineages, and changing incidence of infection through time. Models were extended to simulate the effects of providing susceptible MSM with PrEP. Results We found that young age <25 years confers higher risk of HIV transmission (relative risk = 2.52 [95% confidence interval, 2.32–2.73]) and that young MSM are more likely to transmit to one another than expected by chance. Simulated interventions indicate that 4-fold more infections can be averted over 5 years by focusing PrEP on young MSM. Conclusions Concentrating PrEP doses on young individuals can avert more infections than random allocation.
Collapse
Affiliation(s)
- Erik M Volz
- Department of Infectious Disease Epidemiology and the National Institute for Health Research Health Protection Research Unit on Modeling Methodology, Imperial College London
| | - Stephane Le Vu
- Department of Infectious Disease Epidemiology and the National Institute for Health Research Health Protection Research Unit on Modeling Methodology, Imperial College London
| | - Oliver Ratmann
- Department of Infectious Disease Epidemiology and the National Institute for Health Research Health Protection Research Unit on Modeling Methodology, Imperial College London
| | - Anna Tostevin
- Institute for Global Health, University College London
| | - David Dunn
- Institute for Global Health, University College London
| | | | - Siobhan O'Shea
- Infection Sciences, Viapath Analytics, Guy's and St Thomas' NHS Foundation Trust, London
| | | | | | | | - Christophe Fraser
- Li Ka Shing Centre for Health Information and Discovery, Oxford University, United Kingdom
| | | |
Collapse
|
11
|
Le Vu S, Ratmann O, Delpech V, Brown AE, Gill ON, Tostevin A, Dunn D, Fraser C, Volz EM. HIV-1 Transmission Patterns in Men Who Have Sex with Men: Insights from Genetic Source Attribution Analysis. AIDS Res Hum Retroviruses 2019; 35:805-813. [PMID: 31280593 PMCID: PMC6735327 DOI: 10.1089/aid.2018.0236] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Near 60% of new HIV infections in the United Kingdom are estimated to occur in men who have sex with men (MSM). Age-disassortative partnerships in MSM have been suggested to spread the HIV epidemics in many Western developed countries and to contribute to ethnic disparities in infection rates. Understanding these mixing patterns in transmission can help to determine which groups are at a greater risk and guide public health interventions. We analyzed combined epidemiological data and viral sequences from MSM diagnosed with HIV at the national level. We applied a phylodynamic source attribution model to infer patterns of transmission between groups of patients. From pair probabilities of transmission between 14,603 MSM patients, we found that potential transmitters of HIV subtype B were on average 8 months older than recipients. We also found a moderate overall assortativity of transmission by ethnic group and a stronger assortativity by region. Our findings suggest that there is only a modest net flow of transmissions from older to young MSM in subtype B epidemics and that young MSM, both for Black or White groups, are more likely to be infected by one another than expected in a sexual network with random mixing.
Collapse
Affiliation(s)
- Stéphane Le Vu
- Department of Infectious Disease Epidemiology, National Institute for Health Research Health Protection Research Unit on Modeling Methodology, Imperial College London, London, United Kingdom
| | - Oliver Ratmann
- Department of Mathematics, Imperial College London, London, United Kingdom
| | - Valerie Delpech
- HIV and STI Department of Public Health England's Center for Infectious Disease Surveillance and Control, London, United Kingdom
| | - Alison E. Brown
- HIV and STI Department of Public Health England's Center for Infectious Disease Surveillance and Control, London, United Kingdom
| | - O. Noel Gill
- HIV and STI Department of Public Health England's Center for Infectious Disease Surveillance and Control, London, United Kingdom
| | - Anna Tostevin
- Institute for Global Health, University College London, London, United Kingdom
| | - David Dunn
- Institute for Global Health, University College London, London, United Kingdom
| | - Christophe Fraser
- Nuffield Department of Medicine, Big Data Institute, Li Ka Shing Center for Health Information and Discovery, University of Oxford, Oxford, United Kingdom
| | - Erik M. Volz
- Department of Infectious Disease Epidemiology, National Institute for Health Research Health Protection Research Unit on Modeling Methodology, Imperial College London, London, United Kingdom
| | | |
Collapse
|
12
|
Hamilton DT, Rosenberg ES, Jenness SM, Sullivan PS, Wang LY, Dunville RL, Barrios LC, Aslam M, Goodreau SM. Modeling the joint effects of adolescent and adult PrEP for sexual minority males in the United States. PLoS One 2019; 14:e0217315. [PMID: 31116802 PMCID: PMC6530873 DOI: 10.1371/journal.pone.0217315] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2019] [Accepted: 05/08/2019] [Indexed: 12/02/2022] Open
Abstract
Background Pre-exposure prophylaxis (PrEP) is an effective and safe intervention approved for use to prevent HIV transmission. PrEP scale-up strategies and clinical practice are currently being informed by modeling studies, which have estimated the impact of PrEP in adult and adolescent MSM populations separately. This partitioning may miss important effects or yield biased estimates by excluding dependencies between populations. Methods We combined two published models of HIV transmission among adults and adolescent MSM. We simulated an HIV epidemic among MSM aged 13–39 without PrEP, with PrEP for adult MSM ages (19–39) and with the addition of PrEP for adolescents ages (16–18), comparing percent of incident infections averted (impact), the number of person-years on PrEP per infection averted (efficiency), and changes in prevalence. Results PrEP use among eligible 19–39 year old MSM averted 29.0% of infections and reduced HIV prevalence from 23.2% to 17.0% over ten years in the population as a whole. Despite being ineligible for PrEP in this scenario, prevalence among sexually active 18 year-olds declined from 6.0% to 4.3% due to reduced transmissions across age cohorts. The addition of PrEP for adolescents ages 16–18 had a small impact on the overall epidemic, further reducing overall prevalence from 17.0% to 16.8%; however prevalence among the sexually active 18 year-olds further declined from 4.3% to 3.8%. Conclusions PrEP use among adults may significantly reduce HIV prevalence among MSM and may also have significant downstream effects on HIV incidence among adolescents; PrEP targeting adolescents remains an important intervention for HIV prevention.
Collapse
Affiliation(s)
- Deven T. Hamilton
- Center for Studies in Demography and Ecology, University of Washington, Seattle, Washington, United States of America
- * E-mail:
| | - Eli S. Rosenberg
- Department of Epidemiology and Biostatistics, University at Albany School of Public Health, State University of New York, Rensselaer, New York, United States of America
| | - Samuel M. Jenness
- Department of Epidemiology, Emory University, Atlanta, Georgia, United States of America
| | - Patrick S. Sullivan
- Department of Epidemiology, Emory University, Atlanta, Georgia, United States of America
- Department of Global Health, Emory University, Atlanta, Georgia, United States of America
| | - Li Yan Wang
- Division of Adolescent and School Health, Centers for Disease Control and Prevention, Atlanta, Georgia, United States of America
| | - Richard L. Dunville
- Division of Adolescent and School Health, Centers for Disease Control and Prevention, Atlanta, Georgia, United States of America
| | - Lisa C. Barrios
- Division of Adolescent and School Health, Centers for Disease Control and Prevention, Atlanta, Georgia, United States of America
| | - Maria Aslam
- Program and Performance Improvement Office National Center for HIV, Viral Hepatitis, STD, and TB Prevention, Centers for Disease Control and Prevention, Atlanta, Georgia, United States of America
| | - Steven M. Goodreau
- Center for Studies in Demography and Ecology, University of Washington, Seattle, Washington, United States of America
- Department of Anthropology, University of Washington, Seattle, Washington, United States of America
| |
Collapse
|
13
|
Insights on transmission of HIV from phylogenetic analysis to locally optimize HIV prevention strategies. Curr Opin HIV AIDS 2019; 13:95-101. [PMID: 29266012 DOI: 10.1097/coh.0000000000000443] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
PURPOSE OF REVIEW Phylogenetic analysis can identify transmission networks by clustering genetically related HIV genotypes that are routinely collected. In this study, we will review phylogenetic insights gained on transmission of HIV and phylogenetically optimized HIV prevention strategies. RECENT FINDINGS Phylogenetic analysis reports that HIV transmission varies by geographical region and by route of transmission. In high-income countries, HIV is predominantly transmitted between recently infected MSM who live in the same country. In rural Uganda, transmission of HIV is frequently between different communities. Age-discrepant transmission has been reported across the world. Four studies have used phylogenetic optimization of HIV prevention. Three studies predict that immediate treatment after diagnosis would have prevented 19-42% of infections, and that preexposure prophylaxis would have prevented 66% of infections. One phylogenetic study guided a public health response to an actively ongoing HIV outbreak. Phylogenetic clustering requires a dense sample of patients and small time-gaps between infection and diagnosis. SUMMARY Phylogenetic analysis can be an important tool to identify a local strategy that prevents most infections. Future studies that use phylogenetic analysis for optimizing HIV prevention strategies should also include cost-effectiveness so that the most cost-effective prevention method is identified.
Collapse
|
14
|
An open-source k-mer based machine learning tool for fast and accurate subtyping of HIV-1 genomes. PLoS One 2018; 13:e0206409. [PMID: 30427878 PMCID: PMC6235296 DOI: 10.1371/journal.pone.0206409] [Citation(s) in RCA: 36] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2018] [Accepted: 10/14/2018] [Indexed: 01/11/2023] Open
Abstract
For many disease-causing virus species, global diversity is clustered into a taxonomy of subtypes with clinical significance. In particular, the classification of infections among the subtypes of human immunodeficiency virus type 1 (HIV-1) is a routine component of clinical management, and there are now many classification algorithms available for this purpose. Although several of these algorithms are similar in accuracy and speed, the majority are proprietary and require laboratories to transmit HIV-1 sequence data over the network to remote servers. This potentially exposes sensitive patient data to unauthorized access, and makes it impossible to determine how classifications are made and to maintain the data provenance of clinical bioinformatic workflows. We propose an open-source supervised and alignment-free subtyping method (Kameris) that operates on k-mer frequencies in HIV-1 sequences. We performed a detailed study of the accuracy and performance of subtype classification in comparison to four state-of-the-art programs. Based on our testing data set of manually curated real-world HIV-1 sequences (n = 2, 784), Kameris obtained an overall accuracy of 97%, which matches or exceeds all other tested software, with a processing rate of over 1,500 sequences per second. Furthermore, our fully standalone general-purpose software provides key advantages in terms of data security and privacy, transparency and reproducibility. Finally, we show that our method is readily adaptable to subtype classification of other viruses including dengue, influenza A, and hepatitis B and C virus.
Collapse
|
15
|
Dennis AM, Volz E, Frost AMSD, Hossain M, Poon AF, Rebeiro PF, Vermund SH, Sterling TR, Kalish ML. HIV-1 Transmission Clustering and Phylodynamics Highlight the Important Role of Young Men Who Have Sex with Men. AIDS Res Hum Retroviruses 2018; 34:879-888. [PMID: 30027754 PMCID: PMC6204570 DOI: 10.1089/aid.2018.0039] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023] Open
Abstract
More persons living with HIV reside in the Southern United States than in any other region, yet little is known about HIV molecular epidemiology in the South. We used cluster and phylodynamic analyses to evaluate HIV transmission patterns in middle Tennessee. We performed cross-sectional analyses of HIV-1 pol sequences and clinical data collected from 2001 to 2015 among persons attending the Vanderbilt Comprehensive Care Clinic. Transmission clusters were identified using maximum likelihood phylogenetics and patristic distance differences. Demographic, risk behavior, and clinical factors were assessed evaluating “active” clusters (clusters including sequences sampled 2011–2015) and associations estimated with logistic regression. Transmission risk ratios for men who have sex with men (MSM) were estimated with phylodynamic models. Among 2915 persons (96% subtype-B sequences), 963 (33%) were members of 292 clusters (distance ≤1.5%, size range 2–39). Most clusters (62%, n = 690 persons) were active, either being newly identified (n = 80) or showing expansion on existing clusters (n = 101). Correlates of active clustering among persons with sequences collected during 2011–2015 included MSM risk and ≤30 years of age. Active clusters were significantly more concentrated in MSM and younger persons than historical clusters. Young MSM (YMSM) (≤26.4 years) had high estimated transmission risk [risk ratio = 4.04 (2.85–5.65) relative to older MSM] and were much more likely to transmit to YMSM. In this Tennessee cohort, transmission clusters over time were more concentrated by MSM and younger age, with high transmission risk among and between YMSM, highlighting the importance of interventions among this group. Detecting active clusters could help direct interventions to disrupt ongoing transmission chains.
Collapse
Affiliation(s)
- Ann M. Dennis
- Division of Infectious Diseases, University of North Carolina, Chapel Hill, North Carolina
| | - Erik Volz
- Department of Infectious Disease Epidemiology and Centre for Outbreak Analysis and Modeling, Imperial College, London, United Kingdom
| | | | - Mukarram Hossain
- Department of Veterinary Medicine, University of Cambridge, Cambridge, United Kingdom
| | - Art F.Y. Poon
- Department of Pathology and Laboratory Medicine, Western University, London, Canada
| | - Peter F. Rebeiro
- Division of Infectious Diseases, Department of Medicine, Vanderbilt University School of Medicine, Nashville, Tennessee
| | - Sten H. Vermund
- Department of Epidemiology of Microbial Diseases, Yale University School of Public Health, New Haven, Connecticut
| | - Timothy R. Sterling
- Division of Infectious Diseases, Department of Medicine, Vanderbilt University School of Medicine, Nashville, Tennessee
| | - Marcia L. Kalish
- Division of Infectious Diseases, Department of Medicine, Vanderbilt University School of Medicine, Nashville, Tennessee
| |
Collapse
|
16
|
Brenner BG, Ibanescu RI, Hardy I, Roger M. Genotypic and Phylogenetic Insights on Prevention of the Spread of HIV-1 and Drug Resistance in "Real-World" Settings. Viruses 2017; 10:v10010010. [PMID: 29283390 PMCID: PMC5795423 DOI: 10.3390/v10010010] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2017] [Revised: 12/22/2017] [Accepted: 12/24/2017] [Indexed: 12/15/2022] Open
Abstract
HIV continues to spread among vulnerable heterosexual (HET), Men-having-Sex with Men (MSM) and intravenous drug user (IDU) populations, influenced by a complex array of biological, behavioral and societal factors. Phylogenetics analyses of large sequence datasets from national drug resistance testing programs reveal the evolutionary interrelationships of viral strains implicated in the dynamic spread of HIV in different regional settings. Viral phylogenetics can be combined with demographic and behavioral information to gain insights on epidemiological processes shaping transmission networks at the population-level. Drug resistance testing programs also reveal emergent mutational pathways leading to resistance to the 23 antiretroviral drugs used in HIV-1 management in low-, middle- and high-income settings. This article describes how genotypic and phylogenetic information from Quebec and elsewhere provide critical information on HIV transmission and resistance, Cumulative findings can be used to optimize public health strategies to tackle the challenges of HIV in “real-world” settings.
Collapse
Affiliation(s)
- Bluma G Brenner
- McGill University AIDS Centre, Lady Davis Institute for Medical Research, Montreal, QC H3T 1E2, Canada.
| | - Ruxandra-Ilinca Ibanescu
- McGill University AIDS Centre, Lady Davis Institute for Medical Research, Montreal, QC H3T 1E2, Canada.
| | - Isabelle Hardy
- Département de Microbiologie et d'Immunologie et Centre de Recherche du Centre Hospitalier de l'Université de Montréal (CHUM), Montreal, QC H2X 0A9, Canada.
| | - Michel Roger
- Département de Microbiologie et d'Immunologie et Centre de Recherche du Centre Hospitalier de l'Université de Montréal (CHUM), Montreal, QC H2X 0A9, Canada.
| |
Collapse
|
17
|
McCloskey RM, Poon AFY. A model-based clustering method to detect infectious disease transmission outbreaks from sequence variation. PLoS Comput Biol 2017; 13:e1005868. [PMID: 29131825 PMCID: PMC5703573 DOI: 10.1371/journal.pcbi.1005868] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2017] [Revised: 11/27/2017] [Accepted: 11/02/2017] [Indexed: 01/07/2023] Open
Abstract
Clustering infections by genetic similarity is a popular technique for identifying potential outbreaks of infectious disease, in part because sequences are now routinely collected for clinical management of many infections. A diverse number of nonparametric clustering methods have been developed for this purpose. These methods are generally intuitive, rapid to compute, and readily scale with large data sets. However, we have found that nonparametric clustering methods can be biased towards identifying clusters of diagnosis—where individuals are sampled sooner post-infection—rather than the clusters of rapid transmission that are meant to be potential foci for public health efforts. We develop a fundamentally new approach to genetic clustering based on fitting a Markov-modulated Poisson process (MMPP), which represents the evolution of transmission rates along the tree relating different infections. We evaluated this model-based method alongside five nonparametric clustering methods using both simulated and actual HIV sequence data sets. For simulated clusters of rapid transmission, the MMPP clustering method obtained higher mean sensitivity (85%) and specificity (91%) than the nonparametric methods. When we applied these clustering methods to published sequences from a study of HIV-1 genetic clusters in Seattle, USA, we found that the MMPP method categorized about half (46%) as many individuals to clusters compared to the other methods. Furthermore, the mean internal branch lengths that approximate transmission rates were significantly shorter in clusters extracted using MMPP, but not by other methods. We determined that the computing time for the MMPP method scaled linearly with the size of trees, requiring about 30 seconds for a tree of 1,000 tips and about 20 minutes for 50,000 tips on a single computer. This new approach to genetic clustering has significant implications for the application of pathogen sequence analysis to public health, where it is critical to robustly and accurately identify clusters for the most cost-effective deployment of outbreak management and prevention resources. Many pathogens evolve so rapidly that they accumulate genetic differences within a host before becoming transmitted to the next host. Consequently, clusters of sampled infections with nearly identical genomes may reveal outbreaks of recent or ongoing transmissions. There is rapidly growing interest in using model-free genetic clustering methods to guide public health responses to epidemics in near real-time, including HIV, Ebola virus and tuberculosis. However, we show that current methods are relatively ineffective at detecting transmission outbreaks; instead, they are predominantly influenced by how infections are sampled from the population. We describe a fundamentally new approach to genetic clustering that is based on modelling changes in transmission rates during the spread of the epidemic. We use simulated and real pathogen sequence data sets to demonstrate that this model-based approach is substantially more effective for detecting transmission outbreaks, and remains fast enough for real-time applications to large sequence databases.
Collapse
Affiliation(s)
| | - Art F. Y. Poon
- Department of Pathology and Laboratory Medicine, Western University, London, Ontario, Canada
- Department of Microbiology and Immunology, Western University, London, Ontario, Canada
- Department of Applied Mathematics, Western University, London, Ontario, Canada
- * E-mail:
| |
Collapse
|