1
|
Segawa T, Masuda K, Hisatsune J, Ishida-Kuroki K, Sugawara Y, Kuwabara M, Nishikawa H, Hiratsuka T, Aota T, Tao Y, Iwahashi Y, Ueda K, Mae K, Masumoto K, Kitagawa H, Komatsuzawa H, Ohge H, Sugai M. Genomic analysis of inter-hospital transmission of vancomycin-resistant Enterococcus faecium sequence type 80 isolated during an outbreak in Hiroshima, Japan. Antimicrob Agents Chemother 2024; 68:e0171623. [PMID: 38506550 PMCID: PMC11064488 DOI: 10.1128/aac.01716-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2023] [Accepted: 03/01/2024] [Indexed: 03/21/2024] Open
Abstract
Outbreaks caused by vancomycin-resistant enterococci that transcend jurisdictional boundaries are occurring worldwide. This study focused on a vancomycin-resistant enterococcus outbreak that occurred between 2018 and 2021 across two cities in Hiroshima, Japan. The study involved genetic and phylogenetic analyses using whole-genome sequencing of 103 isolates of vancomycin-resistant enterococci to identify the source and transmission routes of the outbreak. Phylogenetic analysis was performed using core genome multilocus sequence typing and core single-nucleotide polymorphisms; infection routes between hospitals were inferred using BadTrIP. The outbreak was caused by Enterococcus faecium sequence type (ST) 80 carrying the vanA plasmid, which was derived from strain A10290 isolated in India. Of the 103 isolates, 93 were E. faecium ST80 transmitted across hospitals. The circular vanA plasmid of the Hiroshima isolates was similar to the vanA plasmid of strain A10290 and transferred from E. faecium ST80 to other STs of E. faecium and other Enterococcus species by conjugation. The inferred transmission routes across hospitals suggest the existence of a central hospital serving as a hub, propagating vancomycin-resistant enterococci to multiple hospitals. Our study highlights the importance of early intervention at the key central hospital to prevent the spread of the infection to small medical facilities, such as nursing homes, with limited medical resources and a high number of vulnerable individuals.
Collapse
Affiliation(s)
- Takaya Segawa
- Antimicrobial Resistance Research Center, National Institute of Infectious Diseases, Higashimurayama, Japan
| | - Kanako Masuda
- Hiroshima Prefectural Center for Disease Control and Prevention, Hiroshima, Japan
- Project Research Center for Nosocomial Infectious Diseases, Hiroshima University, Hiroshima, Japan
| | - Junzo Hisatsune
- Antimicrobial Resistance Research Center, National Institute of Infectious Diseases, Higashimurayama, Japan
- Project Research Center for Nosocomial Infectious Diseases, Hiroshima University, Hiroshima, Japan
- Department of Antimicrobial Resistance, Hiroshima University Graduate School of Biomedical & Health Sciences, Hiroshima, Japan
| | - Kasumi Ishida-Kuroki
- Antimicrobial Resistance Research Center, National Institute of Infectious Diseases, Higashimurayama, Japan
| | - Yo Sugawara
- Antimicrobial Resistance Research Center, National Institute of Infectious Diseases, Higashimurayama, Japan
| | - Masao Kuwabara
- Hiroshima Prefectural Center for Disease Control and Prevention, Hiroshima, Japan
| | - Hideki Nishikawa
- Hiroshima Prefectural Center for Disease Control and Prevention, Hiroshima, Japan
| | - Takahiro Hiratsuka
- Hiroshima Prefectural Technology Research Institute, Public Health and Environment Center, Hiroshima, Japan
| | - Tatsuaki Aota
- Hiroshima City Institute of Public Health, Hiroshima, Japan
| | - Yasuo Tao
- Hiroshima City Public Health Center, Hiroshima, Japan
| | | | - Kuniko Ueda
- Hiroshima City Public Health Center, Hiroshima, Japan
| | - Kaori Mae
- Hiroshima City Medical Association Clinical Laboratory, Hiroshima, Japan
| | - Ken Masumoto
- Hiroshima City Medical Association Clinical Laboratory, Hiroshima, Japan
| | - Hiroki Kitagawa
- Project Research Center for Nosocomial Infectious Diseases, Hiroshima University, Hiroshima, Japan
- Department of Infectious Diseases, Hiroshima University Hospital, Hiroshima, Japan
| | - Hitoshi Komatsuzawa
- Project Research Center for Nosocomial Infectious Diseases, Hiroshima University, Hiroshima, Japan
- Department of Bacteriology, Hiroshima University Graduate School of Biomedical and Health Sciences, Hiroshima, Japan
| | - Hiroki Ohge
- Project Research Center for Nosocomial Infectious Diseases, Hiroshima University, Hiroshima, Japan
- Department of Infectious Diseases, Hiroshima University Hospital, Hiroshima, Japan
| | - Motoyuki Sugai
- Antimicrobial Resistance Research Center, National Institute of Infectious Diseases, Higashimurayama, Japan
- Project Research Center for Nosocomial Infectious Diseases, Hiroshima University, Hiroshima, Japan
- Department of Antimicrobial Resistance, Hiroshima University Graduate School of Biomedical & Health Sciences, Hiroshima, Japan
| |
Collapse
|
2
|
Carson J, Keeling M, Wyllie D, Ribeca P, Didelot X. Inference of Infectious Disease Transmission through a Relaxed Bottleneck Using Multiple Genomes Per Host. Mol Biol Evol 2024; 41:msad288. [PMID: 38168711 PMCID: PMC10798190 DOI: 10.1093/molbev/msad288] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2023] [Revised: 12/21/2023] [Accepted: 12/29/2023] [Indexed: 01/05/2024] Open
Abstract
In recent times, pathogen genome sequencing has become increasingly used to investigate infectious disease outbreaks. When genomic data is sampled densely enough amongst infected individuals, it can help resolve who infected whom. However, transmission analysis cannot rely solely on a phylogeny of the genomes but must account for the within-host evolution of the pathogen, which blurs the relationship between phylogenetic and transmission trees. When only a single genome is sampled for each host, the uncertainty about who infected whom can be quite high. Consequently, transmission analysis based on multiple genomes of the same pathogen per host has a clear potential for delivering more precise results, even though it is more laborious to achieve. Here, we present a new methodology that can use any number of genomes sampled from a set of individuals to reconstruct their transmission network. Furthermore, we remove the need for the assumption of a complete transmission bottleneck. We use simulated data to show that our method becomes more accurate as more genomes per host are provided, and that it can infer key infectious disease parameters such as the size of the transmission bottleneck, within-host growth rate, basic reproduction number, and sampling fraction. We demonstrate the usefulness of our method in applications to real datasets from an outbreak of Pseudomonas aeruginosa amongst cystic fibrosis patients and a nosocomial outbreak of Klebsiella pneumoniae.
Collapse
Affiliation(s)
- Jake Carson
- Mathematics Institute, University of Warwick, Coventry CV4 7AL, UK
- School of Life Sciences, University of Warwick, Coventry CV4 7AL, UK
- Zeeman Institute for Systems Biology and Infectious Disease Epidemiology Research (SBIDER), University of Warwick, Coventry CV4 7AL, UK
| | - Matt Keeling
- Mathematics Institute, University of Warwick, Coventry CV4 7AL, UK
- School of Life Sciences, University of Warwick, Coventry CV4 7AL, UK
- Zeeman Institute for Systems Biology and Infectious Disease Epidemiology Research (SBIDER), University of Warwick, Coventry CV4 7AL, UK
| | | | | | - Xavier Didelot
- School of Life Sciences, University of Warwick, Coventry CV4 7AL, UK
- Zeeman Institute for Systems Biology and Infectious Disease Epidemiology Research (SBIDER), University of Warwick, Coventry CV4 7AL, UK
- Department of Statistics, University of Warwick, Coventry CV4 7AL, UK
| |
Collapse
|
3
|
Walter KS, Cohen T, Mathema B, Colijn C, Sobkowiak B, Comas I, Goig GA, Croda J, Andrews JR. Signatures of transmission in within-host M. tuberculosis variation. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.12.28.23300451. [PMID: 38234741 PMCID: PMC10793532 DOI: 10.1101/2023.12.28.23300451] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/19/2024]
Abstract
Background Because M. tuberculosis evolves slowly, transmission clusters often contain multiple individuals with identical consensus genomes, making it difficult to reconstruct transmission chains. Finding additional sources of shared M. tuberculosis variation could help overcome this problem. Previous studies have reported M. tuberculosis diversity within infected individuals; however, whether within-host variation improves transmission inferences remains unclear. Methods To evaluate the transmission information present in within-host M. tuberculosis variation, we re-analyzed publicly available sequence data from three household transmission studies, using household membership as a proxy for transmission linkage between donor-recipient pairs. Findings We found moderate levels of minority variation present in M. tuberculosis sequence data from cultured isolates that varied significantly across studies (mean: 6, 7, and 170 minority variants above a 1% minor allele frequency threshold, outside of PE/PPE genes). Isolates from household members shared more minority variants than did isolates from unlinked individuals in the three studies (mean 98 shared minority variants vs. 10; 0.8 vs. 0.2, and 0.7 vs. 0.2, respectively). Shared within-host variation was significantly associated with household membership (OR: 1.51 [1.30,1.71], for one standard deviation increase in shared minority variants). Models that included shared within-host variation improved the accuracy of predicting household membership in all three studies as compared to models without within-host variation (AUC: 0.95 versus 0.92, 0.99 versus 0.95, and 0.93 versus 0.91). Interpretation Within-host M. tuberculosis variation persists through culture and could enhance the resolution of transmission inferences. The substantial differences in minority variation recovered across studies highlights the need to optimize approaches to recover and incorporate within-host variation into automated phylogenetic and transmission inference. Funding NIAID: 5K01AI173385.
Collapse
Affiliation(s)
| | - Ted Cohen
- Department of Epidemiology of Microbial Diseases, Yale School of Public Health, New Haven, USA
| | - Barun Mathema
- Department of Epidemiology, Columbia University Mailman School of Public Health; New York, United States
| | - Caroline Colijn
- Department of Mathematics, Simon Fraser University; Burnaby, Canada
| | - Benjamin Sobkowiak
- Department of Epidemiology of Microbial Diseases, Yale School of Public Health, New Haven, USA
| | - Iñaki Comas
- Institute of Biomedicine of Valencia (CSIC), Valencia, Spain
| | - Galo A Goig
- Swiss Tropical and Public Health Institute, Allschwil, Switzerland
- University of Basel, Basel, Switzerland
| | - Julio Croda
- Department of Epidemiology of Microbial Diseases, Yale School of Public Health, New Haven, USA
- Federal University of Mato Grosso do Sul - UFMS, Campo Grande, MS, Brazil
- Oswaldo Cruz Foundation Mato Grosso do Sul, Mato Grosso do Sul, Brazil
| | - Jason R Andrews
- Division of Infectious Diseases and Geographic Medicine, Stanford University School of Medicine, Stanford, CA, USA
| |
Collapse
|
4
|
Specht IOA, Petros BA, Moreno GK, Brock-Fisher T, Krasilnikova LA, Schifferli M, Yang K, Cronan P, Glennon O, Schaffner SF, Park DJ, MacInnis BL, Ozonoff A, Fry B, Mitzenmacher MD, Varilly P, Sabeti PC. Inferring Viral Transmission Pathways from Within-Host Variation. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.10.14.23297039. [PMID: 37873325 PMCID: PMC10593003 DOI: 10.1101/2023.10.14.23297039] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/25/2023]
Abstract
Genome sequencing can offer critical insight into pathogen spread in viral outbreaks, but existing transmission inference methods use simplistic evolutionary models and only incorporate a portion of available genetic data. Here, we develop a robust evolutionary model for transmission reconstruction that tracks the genetic composition of within-host viral populations over time and the lineages transmitted between hosts. We confirm that our model reliably describes within-host variant frequencies in a dataset of 134,682 SARS-CoV-2 deep-sequenced genomes from Massachusetts, USA. We then demonstrate that our reconstruction approach infers transmissions more accurately than two leading methods on synthetic data, as well as in a controlled outbreak of bovine respiratory syncytial virus and an epidemiologically-investigated SARS-CoV-2 outbreak in South Africa. Finally, we apply our transmission reconstruction tool to 5,692 outbreaks among the 134,682 Massachusetts genomes. Our methods and results demonstrate the utility of within-host variation for transmission inference of SARS-CoV-2 and other pathogens, and provide an adaptable mathematical framework for tracking within-host evolution.
Collapse
Affiliation(s)
- Ivan O. A. Specht
- The Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Harvard College, Faculty of Arts and Sciences, Harvard University, Cambridge, MA 02138, USA
| | - Brittany A. Petros
- The Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Harvard-MIT Program in Health Sciences and Technology, Cambridge, MA 02139, USA
- Harvard/MIT MD-PhD Program, Boston, MA 02115, USA
- Systems, Synthetic, and Quantitative Biology PhD Program, Department of Systems Biology, Harvard Medical School, Boston, MA 02115, USA
| | - Gage K. Moreno
- The Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Taylor Brock-Fisher
- The Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Department of Organismic and Evolutionary Biology, Faculty of Arts and Sciences, Harvard University, Cambridge, MA 02138, USA
| | - Lydia A. Krasilnikova
- The Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Howard Hughes Medical Institute, Chevy Chase, MD 20815, USA
| | | | | | - Paul Cronan
- Fathom Information Design, Boston, MA 02114, USA
| | | | | | - Daniel J. Park
- The Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Bronwyn L. MacInnis
- The Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Department of Immunology and Infectious Diseases, Harvard T.H. Chan School of Public Health, Harvard University, Boston, MA 02115, USA
- Massachusetts Consortium on Pathogen Readiness, Harvard Medical School, Harvard University, Boston, MA 02115, USA
| | - Al Ozonoff
- The Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Ben Fry
- Fathom Information Design, Boston, MA 02114, USA
| | - Michael D. Mitzenmacher
- Department of Computer Science, School of Engineering and Applied Sciences, Harvard University, Cambridge, MA 02138, USA
| | - Patrick Varilly
- The Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Pardis C. Sabeti
- The Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Department of Organismic and Evolutionary Biology, Faculty of Arts and Sciences, Harvard University, Cambridge, MA 02138, USA
- Department of Immunology and Infectious Diseases, Harvard T.H. Chan School of Public Health, Harvard University, Boston, MA 02115, USA
- Massachusetts Consortium on Pathogen Readiness, Harvard Medical School, Harvard University, Boston, MA 02115, USA
- Howard Hughes Medical Institute, Chevy Chase, MD 20815, USA
| |
Collapse
|
5
|
Torres Ortiz A, Kendall M, Storey N, Hatcher J, Dunn H, Roy S, Williams R, Williams C, Goldstein RA, Didelot X, Harris K, Breuer J, Grandjean L. Within-host diversity improves phylogenetic and transmission reconstruction of SARS-CoV-2 outbreaks. eLife 2023; 12:e84384. [PMID: 37732733 PMCID: PMC10602588 DOI: 10.7554/elife.84384] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2022] [Accepted: 09/20/2023] [Indexed: 09/22/2023] Open
Abstract
Accurate inference of who infected whom in an infectious disease outbreak is critical for the delivery of effective infection prevention and control. The increased resolution of pathogen whole-genome sequencing has significantly improved our ability to infer transmission events. Despite this, transmission inference often remains limited by the lack of genomic variation between the source case and infected contacts. Although within-host genetic diversity is common among a wide variety of pathogens, conventional whole-genome sequencing phylogenetic approaches exclusively use consensus sequences, which consider only the most prevalent nucleotide at each position and therefore fail to capture low-frequency variation within samples. We hypothesized that including within-sample variation in a phylogenetic model would help to identify who infected whom in instances in which this was previously impossible. Using whole-genome sequences from SARS-CoV-2 multi-institutional outbreaks as an example, we show how within-sample diversity is partially maintained among repeated serial samples from the same host, it can transmitted between those cases with known epidemiological links, and how this improves phylogenetic inference and our understanding of who infected whom. Our technique is applicable to other infectious diseases and has immediate clinical utility in infection prevention and control.
Collapse
Affiliation(s)
- Arturo Torres Ortiz
- Department of Infectious Diseases, Imperial College LondonLondonUnited Kingdom
- Department of Infection, Immunity and Inflammation, University College LondonLondonUnited Kingdom
| | - Michelle Kendall
- Department of Statistics, University of WarwickCoventryUnited Kingdom
| | - Nathaniel Storey
- Department of Microbiology, Great Ormond Street HospitalLondonUnited Kingdom
| | - James Hatcher
- Department of Microbiology, Great Ormond Street HospitalLondonUnited Kingdom
| | - Helen Dunn
- Department of Microbiology, Great Ormond Street HospitalLondonUnited Kingdom
| | - Sunando Roy
- Department of Infection, Immunity and Inflammation, University College LondonLondonUnited Kingdom
| | | | | | | | - Xavier Didelot
- Department of Statistics, University of WarwickCoventryUnited Kingdom
| | - Kathryn Harris
- Department of Microbiology, Great Ormond Street HospitalLondonUnited Kingdom
- Department of Virology, East & South East London Pathology Partnership, Royal London Hospital, Barts Health NHS TrustLondonUnited Kingdom
| | - Judith Breuer
- Department of Infection, Immunity and Inflammation, University College LondonLondonUnited Kingdom
| | - Louis Grandjean
- Department of Infection, Immunity and Inflammation, University College LondonLondonUnited Kingdom
| |
Collapse
|
6
|
Ribaud M, Gabriel E, Hughes J, Soubeyrand S. Identifying potential significant factors impacting zero-inflated proportion data. Stat Med 2023; 42:3467-3486. [PMID: 37290435 DOI: 10.1002/sim.9814] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2022] [Revised: 04/03/2023] [Accepted: 05/19/2023] [Indexed: 06/10/2023]
Abstract
Classical supervised methods like linear regression and decision trees are not completely adapted for identifying impacting factors on a response variable corresponding to zero-inflated proportion data (ZIPD) that are dependent, continuous and bounded. In this article we propose a within-block permutation-based methodology to identify factors (discrete or continuous) that are significantly correlated with ZIPD, we propose a performance indicator quantifying the percentage of correlation explained by the subset of significant factors, and we show how to predict the ranks of the response variables conditionally on the observation of these factors. The methodology is illustrated on simulated data and on two real data sets dealing with epidemiology. In the first data set, ZIPD correspond to probabilities of transmission of Influenza between horses. In the second data set, ZIPD correspond to probabilities that geographic entities (eg, states and countries) have the same COVID-19 mortality dynamics.
Collapse
Affiliation(s)
| | | | - Joseph Hughes
- Centre for Virus Research, MRC-University of Glasgow, Glasgow, UK
| | | |
Collapse
|
7
|
Ke Z, Vikalo H. Graph-Based Reconstruction and Analysis of Disease Transmission Networks Using Viral Genomic Data. J Comput Biol 2023. [PMID: 37347892 DOI: 10.1089/cmb.2022.0373] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/24/2023] Open
Abstract
Understanding the patterns of viral disease transmissions helps establish public health policies and aids in controlling and ending a disease outbreak. Classical methods for studying disease transmission dynamics that rely on epidemiological data, such as times of sample collection and duration of exposure intervals, struggle to provide desired insight due to limited informativeness of such data. A more precise characterization of disease transmissions may be acquired from sequencing data that reveal genetic distance between viral genomes in patient samples. Indeed, genetic distance between viral strains present in hosts contains valuable information about transmission history, thus motivating the design of methods that rely on genomic data to reconstruct a directed disease transmission network, detect transmission clusters, and identify significant network nodes (e.g., super-spreaders). In this article, we present a novel end-to-end framework for the analysis of viral transmissions utilizing viral genomic (sequencing) data. The proposed framework groups infected hosts into transmission clusters based on the reconstructed viral strains infecting them; the genetic distance between a pair of hosts is calculated using Earth Mover's Distance, and further used to infer transmission direction between the hosts. To quantify the significance of a host in the transmission network, the importance score is calculated by a graph convolutional autoencoder. The viral transmission network is represented by a directed minimum spanning tree utilizing the Edmond's algorithm modified to incorporate constraints on the importance scores of the hosts. The proposed framework outperforms state-of-the-art techniques for the analysis of viral transmission dynamics in several experiments on semiexperimental as well as experimental data.
Collapse
Affiliation(s)
- Ziqi Ke
- Department of Electrical and Computer Engineering, The University of Texas at Austin, Austin, Texas, USA
| | - Haris Vikalo
- Department of Electrical and Computer Engineering, The University of Texas at Austin, Austin, Texas, USA
| |
Collapse
|
8
|
Walter KS, Kim E, Verma R, Altamirano J, Leary S, Carrington YJ, Jagannathan P, Singh U, Holubar M, Subramanian A, Khosla C, Maldonado Y, Andrews JR. Challenges in Harnessing Shared Within-Host Severe Acute Respiratory Syndrome Coronavirus 2 Variation for Transmission Inference. Open Forum Infect Dis 2023; 10:ofad001. [PMID: 36751652 PMCID: PMC9898879 DOI: 10.1093/ofid/ofad001] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2022] [Accepted: 01/06/2023] [Indexed: 01/09/2023] Open
Abstract
Background The limited variation observed among severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) consensus sequences makes it difficult to reconstruct transmission linkages in outbreak settings. Previous studies have recovered variation within individual SARS-CoV-2 infections but have not yet measured the informativeness of within-host variation for transmission inference. Methods We performed tiled amplicon sequencing on 307 SARS-CoV-2 samples, including 130 samples from 32 individuals in 14 households and 47 longitudinally sampled individuals, from 4 prospective studies with household membership data, a proxy for transmission linkage. Results Consensus sequences from households had limited diversity (mean pairwise distance, 3.06 single-nucleotide polymorphisms [SNPs]; range, 0-40). Most (83.1%, 255 of 307) samples harbored at least 1 intrahost single-nucleotide variant ([iSNV] median, 117; interquartile range [IQR], 17-208), above a minor allele frequency threshold of 0.2%. Pairs in the same household shared significantly more iSNVs (mean, 1.20 iSNVs; 95% confidence interval [CI], 1.02-1.39) than did pairs in different households infected with the same viral clade (mean, 0.31 iSNVs; 95% CI, .28-.34), a signal that decreases with increasingly stringent minor allele frequency thresholds. The number of shared iSNVs was significantly associated with an increased odds of household membership (adjusted odds ratio, 1.35; 95% CI, 1.23-1.49). However, the poor concordance of iSNVs detected across sequencing replicates (24.8% and 35.0% above a 0.2% and 1% threshold) confirms technical concerns that current sequencing and bioinformatic workflows do not consistently recover low-frequency within-host variants. Conclusions Shared within-host variation may augment the information in consensus sequences for predicting transmission linkages. Improving sensitivity and specificity of within-host variant identification will improve the informativeness of within-host variation.
Collapse
Affiliation(s)
- Katharine S Walter
- Correspondence: Katharine S. Walter, PhD, Division of Epidemiology, University of Utah, 295 Chipeta Way, Salt Lake City, UT 84108, USA ()
| | - Eugene Kim
- Division of Infectious Diseases and Geographic Medicine, Stanford University School of Medicine, Stanford, California, USA
| | - Renu Verma
- Division of Infectious Diseases and Geographic Medicine, Stanford University School of Medicine, Stanford, California, USA
| | - Jonathan Altamirano
- Department of Epidemiology and Population Health, Stanford University School of Medicine, Stanford, California, USA
| | - Sean Leary
- Department of Pediatrics, Stanford University School of Medicine, Stanford, California, USA
| | - Yuan J Carrington
- Department of Pediatrics, Stanford University School of Medicine, Stanford, California, USA
| | - Prasanna Jagannathan
- Division of Infectious Diseases and Geographic Medicine, Stanford University School of Medicine, Stanford, California, USA,Department of Microbiology and Immunology, Stanford University School of Medicine, Stanford, California, USA
| | - Upinder Singh
- Division of Infectious Diseases and Geographic Medicine, Stanford University School of Medicine, Stanford, California, USA,Department of Microbiology and Immunology, Stanford University School of Medicine, Stanford, California, USA
| | - Marisa Holubar
- Division of Infectious Diseases and Geographic Medicine, Stanford University School of Medicine, Stanford, California, USA
| | - Aruna Subramanian
- Division of Infectious Diseases and Geographic Medicine, Stanford University School of Medicine, Stanford, California, USA
| | - Chaitan Khosla
- Stanford ChEM-H, Stanford University, Stanford, California, USA,Department of Chemistry and Chemical Engineering, Stanford University, Stanford, California, USA
| | - Yvonne Maldonado
- Department of Epidemiology and Population Health, Stanford University School of Medicine, Stanford, California, USA,Department of Pediatrics, Stanford University School of Medicine, Stanford, California, USA
| | - Jason R Andrews
- Division of Infectious Diseases and Geographic Medicine, Stanford University School of Medicine, Stanford, California, USA
| |
Collapse
|
9
|
Johnson PCD, Hägglund S, Näslund K, Meyer G, Taylor G, Orton RJ, Zohari S, Haydon DT, Valarcher JF. Evaluating the potential of whole-genome sequencing for tracing transmission routes in experimental infections and natural outbreaks of bovine respiratory syncytial virus. Vet Res 2022; 53:107. [PMID: 36510312 PMCID: PMC9746130 DOI: 10.1186/s13567-022-01127-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2022] [Accepted: 09/09/2022] [Indexed: 12/14/2022] Open
Abstract
Bovine respiratory syncytial virus (BRSV) is a major cause of respiratory disease in cattle. Genomic sequencing can resolve phylogenetic relationships between virus populations, which can be used to infer transmission routes and potentially inform the design of biosecurity measures. Sequencing of short (<2000 nt) segments of the 15 000-nt BRSV genome has revealed geographic and temporal clustering of BRSV populations, but insufficient variation to distinguish viruses collected from herds infected close together in space and time. This study investigated the potential for whole-genome sequencing to reveal sufficient genomic variation for inferring transmission routes between herds. Next-generation sequencing (NGS) data were generated from experimental infections and from natural outbreaks in Jämtland and Uppsala counties in Sweden. Sufficient depth of coverage for analysis of consensus and sub-consensus sequence diversity was obtained from 47 to 20 samples respectively. Few (range: 0-6 polymorphisms across the six experiments) consensus-level polymorphisms were observed along experimental transmissions. A much higher level of diversity (146 polymorphic sites) was found among the consensus sequences from the outbreak samples. The majority (144/146) of polymorphisms were between rather than within counties, suggesting that consensus whole-genome sequences show insufficient spatial resolution for inferring direct transmission routes, but might allow identification of outbreak sources at the regional scale. By contrast, within-sample diversity was generally higher in the experimental than the outbreak samples. Analyses to infer known (experimental) and suspected (outbreak) transmission links from within-sample diversity data were uninformative. In conclusion, analysis of the whole-genome sequence of BRSV from experimental samples discriminated between circulating isolates from distant areas, but insufficient diversity was observed between closely related isolates to aid local transmission route inference.
Collapse
Affiliation(s)
- Paul C D Johnson
- School of Biodiversity, One Health and Veterinary Medicine, University of Glasgow, Glasgow, UK.
| | - Sara Hägglund
- HPIG. Unit of Ruminant Medicine. Department of Clinical Sciences, Swedish University of Agricultural Sciences (SLU), Uppsala, Sweden
| | - Katarina Näslund
- Department of Microbiology, National Veterinary Institute, SVA, Uppsala, Sweden
| | - Gilles Meyer
- IHAP, Université de Toulouse, INRAE, ENVT, Toulouse, France
| | | | - Richard J Orton
- MRC-University of Glasgow Centre for Virus Research, Glasgow, UK
| | - Siamak Zohari
- Department of Microbiology, National Veterinary Institute, SVA, Uppsala, Sweden
| | - Daniel T Haydon
- School of Biodiversity, One Health and Veterinary Medicine, University of Glasgow, Glasgow, UK
| | - Jean François Valarcher
- HPIG. Unit of Ruminant Medicine. Department of Clinical Sciences, Swedish University of Agricultural Sciences (SLU), Uppsala, Sweden
| |
Collapse
|
10
|
Chao E, Chato C, Vender R, Olabode AS, Ferreira RC, Poon AFY. Molecular source attribution. PLoS Comput Biol 2022; 18:e1010649. [PMID: 36395093 PMCID: PMC9671344 DOI: 10.1371/journal.pcbi.1010649] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Affiliation(s)
- Elisa Chao
- Department of Pathology and Laboratory Medicine, Western University, London, Ontario, Canada
| | - Connor Chato
- Department of Pathology and Laboratory Medicine, Western University, London, Ontario, Canada
| | - Reid Vender
- Department of Pathology and Laboratory Medicine, Western University, London, Ontario, Canada
- School of Medicine, Queen’s University, Kingston, Ontario, Canada
| | - Abayomi S. Olabode
- Department of Pathology and Laboratory Medicine, Western University, London, Ontario, Canada
| | - Roux-Cil Ferreira
- Department of Pathology and Laboratory Medicine, Western University, London, Ontario, Canada
| | - Art F. Y. Poon
- Department of Pathology and Laboratory Medicine, Western University, London, Ontario, Canada
- * E-mail:
| |
Collapse
|
11
|
Skums P, Mohebbi F, Tsyvina V, Baykal PI, Nemira A, Ramachandran S, Khudyakov Y. SOPHIE: Viral outbreak investigation and transmission history reconstruction in a joint phylogenetic and network theory framework. Cell Syst 2022; 13:844-856.e4. [PMID: 36265470 PMCID: PMC9590096 DOI: 10.1016/j.cels.2022.07.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2022] [Revised: 07/05/2022] [Accepted: 07/19/2022] [Indexed: 01/26/2023]
Abstract
Genomic epidemiology is now widely used for viral outbreak investigations. Still, this methodology faces many challenges. First, few methods account for intra-host viral diversity. Second, maximum parsimony principle continues to be employed for phylogenetic inference of transmission histories, even though maximum likelihood or Bayesian models are usually more consistent. Third, many methods utilize case-specific data, such as sampling times or infection exposure intervals. This impedes study of persistent infections in vulnerable groups, where such information has a limited use. Finally, most methods implicitly assume that transmission events are independent, although common source outbreaks violate this assumption. We propose a maximum likelihood framework, SOPHIE, based on the integration of phylogenetic and random graph models. It infers transmission networks from viral phylogenies and expected properties of inter-host social networks modeled as random graphs with given expected degree distributions. SOPHIE is scalable, accounts for intra-host diversity, and accurately infers transmissions without case-specific epidemiological data.
Collapse
Affiliation(s)
- Pavel Skums
- Department of Computer Science, Georgia State University, Atlanta, GA, USA.
| | - Fatemeh Mohebbi
- Department of Computer Science, Georgia State University, Atlanta, GA, USA
| | - Vyacheslav Tsyvina
- Department of Computer Science, Georgia State University, Atlanta, GA, USA
| | - Pelin Icer Baykal
- Department of Biosystems Science & Engineering, ETH Zurich, Basel, Switzerland
| | - Alina Nemira
- Department of Computer Science, Georgia State University, Atlanta, GA, USA
| | - Sumathi Ramachandran
- Division of Viral Hepatitis, Centers for Disease Control and Prevention, Atlanta, GA, USA
| | - Yury Khudyakov
- Division of Viral Hepatitis, Centers for Disease Control and Prevention, Atlanta, GA, USA
| |
Collapse
|
12
|
Alamil M, Thébaud G, Berthier K, Soubeyrand S. Characterizing viral within-host diversity in fast and non-equilibrium demo-genetic dynamics. Front Microbiol 2022; 13:983938. [PMID: 36274731 PMCID: PMC9581327 DOI: 10.3389/fmicb.2022.983938] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2022] [Accepted: 09/08/2022] [Indexed: 11/13/2022] Open
Abstract
High-throughput sequencing has opened the route for a deep assessment of within-host genetic diversity that can be used, e.g., to characterize microbial communities and to infer transmission links in infectious disease outbreaks. The performance of such characterizations and inferences cannot be analytically assessed in general and are often grounded on computer-intensive evaluations. Then, being able to simulate within-host genetic diversity across time under various demo-genetic assumptions is paramount to assess the performance of the approaches of interest. In this context, we built an original model that can be simulated to investigate the temporal evolution of genotypes and their frequencies under various demo-genetic assumptions. The model describes the growth and the mutation of genotypes at the nucleotide resolution conditional on an overall within-host viral kinetics, and can be tuned to generate fast non-equilibrium demo-genetic dynamics. We ran simulations of this model and computed classic diversity indices to characterize the temporal variation of within-host genetic diversity (from high-throughput amplicon sequences) of virus populations under three demographic kinetic models of viral infection. Our results highlight how demographic (viral load) and genetic (mutation, selection, or drift) factors drive variations in within-host diversity during the course of an infection. In particular, we observed a non-monotonic relationship between pathogen population size and genetic diversity, and a reduction of the impact of mutation on diversity when a non-specific host immune response is activated. The large variation in the diversity patterns generated in our simulations suggests that the underlying model provides a flexible basis to produce very diverse demo-genetic scenarios and test, for instance, methods for the inference of transmission links during outbreaks.
Collapse
Affiliation(s)
- Maryam Alamil
- INRAE, BioSP, Avignon, France
- Department of Mathematics and Computer Science, Alfaisal University, Riyadh, Saudi Arabia
- *Correspondence: Maryam Alamil ;
| | - Gaël Thébaud
- PHIM Plant Health Institute, INRAE, Univ Montpellier, CIRAD, Institut Agro, IRD, Montpellier, France
| | | | | |
Collapse
|
13
|
Brintnell E, Poon A. Traversing missing links in the spread of HIV. eLife 2022; 11:82610. [PMID: 36178092 PMCID: PMC9525057 DOI: 10.7554/elife.82610] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Combining clinical and genetic data can improve the effectiveness of virus tracking with the aim of reducing the number of HIV cases by 2030.
Collapse
Affiliation(s)
- Erin Brintnell
- Department of Pathology and Laboratory Medicine, Western University, London, Canada
| | - Art Poon
- Department of Pathology and Laboratory Medicine, Western University, London, Canada.,Department of Computer Science, Western University, London, Canada
| |
Collapse
|
14
|
Lundgren E, Romero-Severson E, Albert J, Leitner T. Combining biomarker and virus phylogenetic models improves HIV-1 epidemiological source identification. PLoS Comput Biol 2022; 18:e1009741. [PMID: 36026480 PMCID: PMC9455879 DOI: 10.1371/journal.pcbi.1009741] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2021] [Revised: 09/08/2022] [Accepted: 08/02/2022] [Indexed: 01/07/2023] Open
Abstract
To identify and stop active HIV transmission chains new epidemiological techniques are needed. Here, we describe the development of a multi-biomarker augmentation to phylogenetic inference of the underlying transmission history in a local population. HIV biomarkers are measurable biological quantities that have some relationship to the amount of time someone has been infected with HIV. To train our model, we used five biomarkers based on real data from serological assays, HIV sequence data, and target cell counts in longitudinally followed, untreated patients with known infection times. The biomarkers were modeled with a mixed effects framework to allow for patient specific variation and general trends, and fit to patient data using Markov Chain Monte Carlo (MCMC) methods. Subsequently, the density of the unobserved infection time conditional on observed biomarkers were obtained by integrating out the random effects from the model fit. This probabilistic information about infection times was incorporated into the likelihood function for the transmission history and phylogenetic tree reconstruction, informed by the HIV sequence data. To critically test our methodology, we developed a coalescent-based simulation framework that generates phylogenies and biomarkers given a specific or general transmission history. Testing on many epidemiological scenarios showed that biomarker augmented phylogenetics can reach 90% accuracy under idealized situations. Under realistic within-host HIV-1 evolution, involving substantial within-host diversification and frequent transmission of multiple lineages, the average accuracy was at about 50% in transmission clusters involving 5-50 hosts. Realistic biomarker data added on average 16 percentage points over using the phylogeny alone. Using more biomarkers improved the performance. Shorter temporal spacing between transmission events and increased transmission heterogeneity reduced reconstruction accuracy, but larger clusters were not harder to get right. More sequence data per infected host also improved accuracy. We show that the method is robust to incomplete sampling and that adding biomarkers improves reconstructions of real HIV-1 transmission histories. The technology presented here could allow for better prevention programs by providing data for locally informed and tailored strategies.
Collapse
Affiliation(s)
- Erik Lundgren
- Theoretical Biology and Biophysics Group, Los Alamos National Laboratory, Los Alamos, New Mexico, United States of America
| | - Ethan Romero-Severson
- Theoretical Biology and Biophysics Group, Los Alamos National Laboratory, Los Alamos, New Mexico, United States of America
| | - Jan Albert
- Department of Microbiology, Tumor and Cell Biology, Karolinska Institutet, Stockholm, Sweden
- Department of Clinical Microbiology, Karolinska University Hospital, Stockholm, Sweden
| | - Thomas Leitner
- Theoretical Biology and Biophysics Group, Los Alamos National Laboratory, Los Alamos, New Mexico, United States of America
- * E-mail:
| |
Collapse
|
15
|
Carson J, Ledda A, Ferretti L, Keeling M, Didelot X. The bounded coalescent model: Conditioning a genealogy on a minimum root date. J Theor Biol 2022; 548:111186. [PMID: 35697144 DOI: 10.1016/j.jtbi.2022.111186] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2022] [Revised: 05/05/2022] [Accepted: 06/02/2022] [Indexed: 01/27/2023]
Abstract
The coalescent model represents how individuals sampled from a population may have originated from a last common ancestor. The bounded coalescent model is obtained by conditioning the coalescent model such that the last common ancestor must have existed after a certain date. This conditioned model arises in a variety of applications, such as speciation, horizontal gene transfer or transmission analysis, and yet the bounded coalescent model has not been previously analysed in detail. Here we describe a new algorithm to simulate from this model directly, without resorting to rejection sampling. We show that this direct simulation algorithm is more computationally efficient than the rejection sampling approach. We also show how to calculate the probability of the last common ancestor occurring after a given date, which is required to compute the probability density of realisations under the bounded coalescent model. Our results are applicable in both the isochronous (when all samples have the same date) and heterochronous (where samples can have different dates) settings. We explore the effect of setting a bound on the date of the last common ancestor, and show that it affects a number of properties of the resulting phylogenies. All our methods are implemented in a new R package called BoundedCoalescent which is freely available online.
Collapse
Affiliation(s)
- Jake Carson
- Mathematics Institute, University of Warwick, United Kingdom
| | - Alice Ledda
- HCAI, Fungal, AMR, AMU & Sepsis Division, UK Health Security Agency, United Kingdom
| | - Luca Ferretti
- Big Data Institute, University of Oxford, United Kingdom
| | - Matt Keeling
- Mathematics Institute, University of Warwick, United Kingdom
| | - Xavier Didelot
- Department of Statistics and School of Life Sciences, University of Warwick, United Kingdom
| |
Collapse
|
16
|
Ortiz AT, Kendall M, Storey N, Hatcher J, Dunn H, Roy S, Williams R, Williams C, Goldstein RA, Didelot X, Harris K, Breuer J, Grandjean L. Within-host diversity improves phylogenetic and transmission reconstruction of SARS-CoV-2 outbreaks. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2022:2022.06.07.495142. [PMID: 35702156 PMCID: PMC9196117 DOI: 10.1101/2022.06.07.495142] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Accurate inference of who infected whom in an infectious disease outbreak is critical for the delivery of effective infection prevention and control. The increased resolution of pathogen whole-genome sequencing has significantly improved our ability to infer transmission events. Despite this, transmission inference often remains limited by the lack of genomic variation between the source case and infected contacts. Although within-host genetic diversity is common among a wide variety of pathogens, conventional whole-genome sequencing phylogenetic approaches to reconstruct outbreaks exclusively use consensus sequences, which consider only the most prevalent nucleotide at each position and therefore fail to capture low frequency variation within samples. We hypothesized that including within-sample variation in a phylogenetic model would help to identify who infected whom in instances in which this was previously impossible. Using whole-genome sequences from SARS-CoV-2 multi-institutional outbreaks as an example, we show how within-sample diversity is stable among repeated serial samples from the same host, is transmitted between those cases with known epidemiological links, and how this improves phylogenetic inference and our understanding of who infected whom. Our technique is applicable to other infectious diseases and has immediate clinical utility in infection prevention and control.
Collapse
Affiliation(s)
| | - Michelle Kendall
- Department of Statistics, University of Warwick, Coventry, CV4 7AL
| | - Nathaniel Storey
- Department of Microbiology, Great Ormond Street Hospital, London WC1N 3JH
| | - James Hatcher
- Department of Microbiology, Great Ormond Street Hospital, London WC1N 3JH
| | - Helen Dunn
- Department of Microbiology, Great Ormond Street Hospital, London WC1N 3JH
| | - Sunando Roy
- Department of Infection, Immunity and Inflammation, Institute of Child Health, UCL, London WC1N 1EH
| | - Rachel Williams
- UCL Genomics, Institute of Child Health, UCL, London WC1N 1EH
| | | | | | - Xavier Didelot
- Department of Statistics, University of Warwick, Coventry, CV4 7AL
| | - Kathryn Harris
- Department of Microbiology, Great Ormond Street Hospital, London WC1N 3JH
- Department of Virology, East South East London Pathology Partnership, Royal London Hospital, Barts Health NHS Trust, London E12ES
| | - Judith Breuer
- Department of Infection, Immunity and Inflammation, Institute of Child Health, UCL, London WC1N 1EH
| | - Louis Grandjean
- Department of Infection, Immunity and Inflammation, Institute of Child Health, UCL, London WC1N 1EH
| |
Collapse
|
17
|
Waddington C, Carey ME, Boinett CJ, Higginson E, Veeraraghavan B, Baker S. Exploiting genomics to mitigate the public health impact of antimicrobial resistance. Genome Med 2022; 14:15. [PMID: 35172877 PMCID: PMC8849018 DOI: 10.1186/s13073-022-01020-2] [Citation(s) in RCA: 27] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2020] [Accepted: 02/04/2022] [Indexed: 12/13/2022] Open
Abstract
Antimicrobial resistance (AMR) is a major global public health threat, which has been largely driven by the excessive use of antimicrobials. Control measures are urgently needed to slow the trajectory of AMR but are hampered by an incomplete understanding of the interplay between pathogens, AMR encoding genes, and mobile genetic elements at a microbial level. These factors, combined with the human, animal, and environmental interactions that underlie AMR dissemination at a population level, make for a highly complex landscape. Whole-genome sequencing (WGS) and, more recently, metagenomic analyses have greatly enhanced our understanding of these processes, and these approaches are informing mitigation strategies for how we better understand and control AMR. This review explores how WGS techniques have advanced global, national, and local AMR surveillance, and how this improved understanding is being applied to inform solutions, such as novel diagnostic methods that allow antimicrobial use to be optimised and vaccination strategies for better controlling AMR. We highlight some future opportunities for AMR control informed by genomic sequencing, along with the remaining challenges that must be overcome to fully realise the potential of WGS approaches for international AMR control.
Collapse
Affiliation(s)
- Claire Waddington
- Cambridge Institute of Therapeutic Immunology and Infectious Disease, University of Cambridge School of Clinical Medicine, Cambridge Biomedical Campus, Cambridge, CB2 0AW, UK.,Department of Medicine, University of Cambridge School of Clinical Medicine, Cambridge Biomedical Campus, Cambridge, UK
| | - Megan E Carey
- Cambridge Institute of Therapeutic Immunology and Infectious Disease, University of Cambridge School of Clinical Medicine, Cambridge Biomedical Campus, Cambridge, CB2 0AW, UK.,Department of Medicine, University of Cambridge School of Clinical Medicine, Cambridge Biomedical Campus, Cambridge, UK
| | | | - Ellen Higginson
- Cambridge Institute of Therapeutic Immunology and Infectious Disease, University of Cambridge School of Clinical Medicine, Cambridge Biomedical Campus, Cambridge, CB2 0AW, UK.,Department of Medicine, University of Cambridge School of Clinical Medicine, Cambridge Biomedical Campus, Cambridge, UK
| | - Balaji Veeraraghavan
- Department of Microbiology, Christian Medical College, Vellore, Tamil Nadu, India
| | - Stephen Baker
- Cambridge Institute of Therapeutic Immunology and Infectious Disease, University of Cambridge School of Clinical Medicine, Cambridge Biomedical Campus, Cambridge, CB2 0AW, UK. .,Department of Medicine, University of Cambridge School of Clinical Medicine, Cambridge Biomedical Campus, Cambridge, UK.
| |
Collapse
|
18
|
Methods Combining Genomic and Epidemiological Data in the Reconstruction of Transmission Trees: A Systematic Review. Pathogens 2022; 11:pathogens11020252. [PMID: 35215195 PMCID: PMC8875843 DOI: 10.3390/pathogens11020252] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2021] [Revised: 02/08/2022] [Accepted: 02/11/2022] [Indexed: 11/17/2022] Open
Abstract
In order to better understand transmission dynamics and appropriately target control and preventive measures, studies have aimed to identify who-infected-whom in actual outbreaks. Numerous reconstruction methods exist, each with their own assumptions, types of data, and inference strategy. Thus, selecting a method can be difficult. Following PRISMA guidelines, we systematically reviewed the literature for methods combing epidemiological and genomic data in transmission tree reconstruction. We identified 22 methods from the 41 selected articles. We defined three families according to how genomic data was handled: a non-phylogenetic family, a sequential phylogenetic family, and a simultaneous phylogenetic family. We discussed methods according to the data needed as well as the underlying sequence mutation, within-host evolution, transmission, and case observation. In the non-phylogenetic family consisting of eight methods, pairwise genetic distances were estimated. In the phylogenetic families, transmission trees were inferred from phylogenetic trees either simultaneously (nine methods) or sequentially (five methods). While a majority of methods (17/22) modeled the transmission process, few (8/22) took into account imperfect case detection. Within-host evolution was generally (7/8) modeled as a coalescent process. These practical and theoretical considerations were highlighted in order to help select the appropriate method for an outbreak.
Collapse
|
19
|
Senghore M, Chaguza C, Bojang E, Tientcheu PE, Bancroft RE, Lo SW, Gladstone RA, McGee L, Worwui A, Foster-Nyarko E, Ceesay F, Okoi CB, Klugman KP, Breiman RF, Bentley SD, Adegbola R, Antonio M, Hanage WP, Kwambana-Adams BA. Widespread sharing of pneumococcal strains in a rural African setting: proximate villages are more likely to share similar strains that are carried at multiple timepoints. Microb Genom 2022; 8. [PMID: 35119356 PMCID: PMC8942022 DOI: 10.1099/mgen.0.000732] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The transmission dynamics of Streptococcus pneumoniae in sub-Saharan Africa are poorly understood due to a lack of adequate epidemiological and genomic data. Here we leverage a longitudinal cohort from 21 neighbouring villages in rural Africa to study how closely related strains of S. pneumoniae are shared among infants. We analysed 1074 pneumococcal genomes isolated from 102 infants from 21 villages. Strains were designated for unique serotype and sequence-type combinations, and we arbitrarily defined strain sharing where the pairwise genetic distance between strains could be accounted for by the mean within host intra-strain diversity. We used non-parametric statistical tests to assess the role of spatial distance and prolonged carriage on strain sharing using a logistic regression model. We recorded 458 carriage episodes including 318 (69.4 %) where the carried strain was shared with at least one other infant. The odds of strain sharing varied significantly across villages (χ2=47.5, df=21, P-value <0.001). Infants in close proximity to each other were more likely to be involved in strain sharing, but we also show a considerable amount of strain sharing across longer distances. Close geographic proximity (<5 km) between shared strains was associated with a significantly lower pairwise SNP distance compared to strains shared over longer distances (P-value <0.005). Sustained carriage of a shared strain among the infants was significantly more likely to occur if they resided in villages within a 5 km radius of each other (P-value <0.005, OR 3.7). Conversely, where both infants were transiently colonized by the shared strain, they were more likely to reside in villages separated by over 15 km (P-value <0.05, OR 1.5). PCV7 serotypes were rare (13.5 %) and were significantly less likely to be shared (P-value <0.001, OR −1.07). Strain sharing was more likely to occur over short geographical distances, especially where accompanied by sustained colonization. Our results show that strain sharing is a useful proxy for studying transmission dynamics in an under-sampled population with limited genomic data. This article contains data hosted by Microreact.
Collapse
Affiliation(s)
- Madikay Senghore
- WHO Regional Reference Laboratory (RRL), West Africa Strategy and Partnership, Medical Research Council Unit the Gambia at the London School of Hygiene and Tropical Medicine, Atlantic Road, Fajara, The Gambia.,Center for Communicable Disease Dynamics, Department of Epidemiology, Harvard TH Chan School of Public Health, 677 Huntington Ave, Boston, MA 02115, USA
| | - Chrispin Chaguza
- Infection Genomics, Wellcome Sanger Institute, Hinxton, UK.,Darwin College, University of Cambridge, Silver Street, Cambridge, UK.,Department of Clinical Infection, Microbiology and Immunology, Institute of Infection and Global Health, University of Liverpool, Liverpool, UK
| | - Ebrima Bojang
- WHO Regional Reference Laboratory (RRL), West Africa Strategy and Partnership, Medical Research Council Unit the Gambia at the London School of Hygiene and Tropical Medicine, Atlantic Road, Fajara, The Gambia
| | - Peggy-Estelle Tientcheu
- WHO Regional Reference Laboratory (RRL), West Africa Strategy and Partnership, Medical Research Council Unit the Gambia at the London School of Hygiene and Tropical Medicine, Atlantic Road, Fajara, The Gambia
| | - Rowan E Bancroft
- WHO Regional Reference Laboratory (RRL), West Africa Strategy and Partnership, Medical Research Council Unit the Gambia at the London School of Hygiene and Tropical Medicine, Atlantic Road, Fajara, The Gambia
| | - Stephanie W Lo
- Infection Genomics, Wellcome Sanger Institute, Hinxton, UK
| | | | - Lesley McGee
- Respiratory Diseases Branch, Centers for Disease Control and Prevention, Atlanta, GA, USA
| | - Archibald Worwui
- WHO Regional Reference Laboratory (RRL), West Africa Strategy and Partnership, Medical Research Council Unit the Gambia at the London School of Hygiene and Tropical Medicine, Atlantic Road, Fajara, The Gambia
| | - Ebenezer Foster-Nyarko
- WHO Regional Reference Laboratory (RRL), West Africa Strategy and Partnership, Medical Research Council Unit the Gambia at the London School of Hygiene and Tropical Medicine, Atlantic Road, Fajara, The Gambia
| | - Fatima Ceesay
- WHO Regional Reference Laboratory (RRL), West Africa Strategy and Partnership, Medical Research Council Unit the Gambia at the London School of Hygiene and Tropical Medicine, Atlantic Road, Fajara, The Gambia
| | - Catherine Bi Okoi
- WHO Regional Reference Laboratory (RRL), West Africa Strategy and Partnership, Medical Research Council Unit the Gambia at the London School of Hygiene and Tropical Medicine, Atlantic Road, Fajara, The Gambia
| | - Keith P Klugman
- Rollins School Public Health, Emory University, Atlanta, USA
| | - Robert F Breiman
- Hubert Department of Global Health, Rollins School of Public Health, Emory University, Atlanta, GA 30322, USA
| | | | - Richard Adegbola
- Immunisation and Global Health Consulting, RAMBICON, Lagos, Nigeria
| | - Martin Antonio
- WHO Regional Reference Laboratory (RRL), West Africa Strategy and Partnership, Medical Research Council Unit the Gambia at the London School of Hygiene and Tropical Medicine, Atlantic Road, Fajara, The Gambia.,Microbiology and Infection Unit, Warwick Medical School, University of Warwick, Coventry, UK
| | - William P Hanage
- Center for Communicable Disease Dynamics, Department of Epidemiology, Harvard TH Chan School of Public Health, 677 Huntington Ave, Boston, MA 02115, USA
| | - Brenda A Kwambana-Adams
- WHO Regional Reference Laboratory (RRL), West Africa Strategy and Partnership, Medical Research Council Unit the Gambia at the London School of Hygiene and Tropical Medicine, Atlantic Road, Fajara, The Gambia.,NIHR Global Health Research Unit on Mucosal Pathogens, Division of Infection and Immunity, University College London, London, UK
| |
Collapse
|
20
|
Dhar S, Zhang C, Măndoiu II, Bansal MS. TNet: Transmission Network Inference Using Within-Host Strain Diversity and its Application to Geographical Tracking of COVID-19 Spread. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:230-242. [PMID: 34255632 PMCID: PMC8956368 DOI: 10.1109/tcbb.2021.3096455] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/06/2020] [Revised: 07/03/2021] [Accepted: 07/08/2021] [Indexed: 06/13/2023]
Abstract
The inference of disease transmission networks is an important problem in epidemiology. One popular approach for building transmission networks is to reconstruct a phylogenetic tree using sequences from disease strains sampled from infected hosts and infer transmissions based on this tree. However, most existing phylogenetic approaches for transmission network inference are highly computationally intensive and cannot take within-host strain diversity into account. Here, we introduce a new phylogenetic approach for inferring transmission networks, TNet, that addresses these limitations. TNet uses multiple strain sequences from each sampled host to infer transmissions and is simpler and more accurate than existing approaches. Furthermore, TNet is highly scalable and able to distinguish between ambiguous and unambiguous transmission inferences. We evaluated TNet on a large collection of 560 simulated transmission networks of various sizes and diverse host, sequence, and transmission characteristics, as well as on 10 real transmission datasets with known transmission histories. Our results show that TNet outperforms two other recently developed methods, phyloscanner and SharpTNI, that also consider within-host strain diversity. We also applied TNet to a large collection of SARS-CoV-2 genomes sampled from infected individuals in many countries around the world, demonstrating how our inference framework can be adapted to accurately infer geographical transmission networks. TNet is freely available from https://compbio.engr.uconn.edu/software/TNet/.
Collapse
Affiliation(s)
- Saurav Dhar
- Department of Computer Science & EngineeringUniversity of ConnecticutStorrsCT06269USA
| | - Chengchen Zhang
- Department of Computer Science & EngineeringUniversity of ConnecticutStorrsCT06269USA
| | - Ion I. Măndoiu
- Department of Computer Science & EngineeringUniversity of ConnecticutStorrsCT06269USA
| | - Mukul S. Bansal
- Department of Computer Science & EngineeringUniversity of ConnecticutStorrsCT06269USA
| |
Collapse
|
21
|
Tonkin-Hill G, Ling C, Chaguza C, Salter SJ, Hinfonthong P, Nikolaou E, Tate N, Pastusiak A, Turner C, Chewapreecha C, Frost SDW, Corander J, Croucher NJ, Turner P, Bentley SD. Pneumococcal within-host diversity during colonization, transmission and treatment. Nat Microbiol 2022; 7:1791-1804. [PMID: 36216891 PMCID: PMC9613479 DOI: 10.1038/s41564-022-01238-1] [Citation(s) in RCA: 22] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2022] [Accepted: 07/18/2022] [Indexed: 11/13/2022]
Abstract
Characterizing the genetic diversity of pathogens within the host promises to greatly improve surveillance and reconstruction of transmission chains. For bacteria, it also informs our understanding of inter-strain competition and how this shapes the distribution of resistant and sensitive bacteria. Here we study the genetic diversity of Streptococcus pneumoniae within 468 infants and 145 of their mothers by deep sequencing whole pneumococcal populations from 3,761 longitudinal nasopharyngeal samples. We demonstrate that deep sequencing has unsurpassed sensitivity for detecting multiple colonization, doubling the rate at which highly invasive serotype 1 bacteria were detected in carriage compared with gold-standard methods. The greater resolution identified an elevated rate of transmission from mothers to their children in the first year of the child's life. Comprehensive treatment data demonstrated that infants were at an elevated risk of both the acquisition and persistent colonization of a multidrug-resistant bacterium following antimicrobial treatment. Some alleles were enriched after antimicrobial treatment, suggesting that they aided persistence, but generally purifying selection dominated within-host evolution. Rates of co-colonization imply that in the absence of treatment, susceptible lineages outcompeted resistant lineages within the host. These results demonstrate the many benefits of deep sequencing for the genomic surveillance of bacterial pathogens.
Collapse
Affiliation(s)
- Gerry Tonkin-Hill
- grid.10306.340000 0004 0606 5382Parasites and Microbes, Wellcome Sanger Institute, Cambridge, UK ,grid.5510.10000 0004 1936 8921Department of Biostatistics, University of Oslo, Blindern, Norway
| | - Clare Ling
- grid.10223.320000 0004 1937 0490Shoklo Malaria Research Unit, Mahidol-Oxford Tropical Medicine Research Unit, Faculty of Tropical Medicine, Mahidol University, Mae Sot, Thailand ,grid.4991.50000 0004 1936 8948Centre for Tropical Medicine and Global Health, Nuffield Department of Medicine, University of Oxford, Oxford, UK
| | - Chrispin Chaguza
- grid.10306.340000 0004 0606 5382Parasites and Microbes, Wellcome Sanger Institute, Cambridge, UK ,grid.47100.320000000419368710Department of Epidemiology of Microbial Diseases, Yale School of Public Health, Yale University, New Haven, CT USA
| | - Susannah J. Salter
- grid.5335.00000000121885934Department of Veterinary Medicine, University of Cambridge, Cambridge, UK
| | - Pattaraporn Hinfonthong
- grid.10223.320000 0004 1937 0490Shoklo Malaria Research Unit, Mahidol-Oxford Tropical Medicine Research Unit, Faculty of Tropical Medicine, Mahidol University, Mae Sot, Thailand
| | - Elissavet Nikolaou
- grid.48004.380000 0004 1936 9764Department of Clinical Sciences, Liverpool School of Tropical Medicine, Liverpool, UK ,grid.1058.c0000 0000 9442 535XInfection and Immunity, Murdoch Children’s Research Institute, Melbourne, Victoria Australia ,grid.1008.90000 0001 2179 088XDepartment of Microbiology and Immunology, Peter Doherty Institute for Infection and Immunity, University of Melbourne, Melbourne, Victoria Australia
| | - Natalie Tate
- grid.48004.380000 0004 1936 9764Department of Clinical Sciences, Liverpool School of Tropical Medicine, Liverpool, UK
| | | | - Claudia Turner
- grid.4991.50000 0004 1936 8948Centre for Tropical Medicine and Global Health, Nuffield Department of Medicine, University of Oxford, Oxford, UK ,grid.459332.a0000 0004 0418 5364Cambodia-Oxford Medical Research Unit, Angkor Hospital for Children, Siem Reap, Cambodia
| | - Claire Chewapreecha
- grid.10306.340000 0004 0606 5382Parasites and Microbes, Wellcome Sanger Institute, Cambridge, UK ,grid.10223.320000 0004 1937 0490Mahidol-Oxford Tropical Medicine Research Unit, Faculty of Tropical Medicine, Mahidol University, Bangkok, Thailand
| | - Simon D. W. Frost
- grid.419815.00000 0001 2181 3404Microsoft Research, Redmond, WA USA ,grid.8991.90000 0004 0425 469XLondon School of Hygiene and Tropical Medicine, London, UK
| | - Jukka Corander
- grid.10306.340000 0004 0606 5382Parasites and Microbes, Wellcome Sanger Institute, Cambridge, UK ,grid.5510.10000 0004 1936 8921Department of Biostatistics, University of Oslo, Blindern, Norway ,grid.7737.40000 0004 0410 2071Helsinki Institute for Information Technology HIIT, Department of Mathematics and Statistics, University of Helsinki, Helsinki, Finland
| | - Nicholas J. Croucher
- grid.7445.20000 0001 2113 8111MRC Centre for Global Infectious Disease Analysis, Department of Infectious Disease Epidemiology, Imperial College London, London, UK
| | - Paul Turner
- grid.4991.50000 0004 1936 8948Centre for Tropical Medicine and Global Health, Nuffield Department of Medicine, University of Oxford, Oxford, UK ,grid.459332.a0000 0004 0418 5364Cambodia-Oxford Medical Research Unit, Angkor Hospital for Children, Siem Reap, Cambodia
| | - Stephen D. Bentley
- grid.10306.340000 0004 0606 5382Parasites and Microbes, Wellcome Sanger Institute, Cambridge, UK
| |
Collapse
|
22
|
Mäklin T, Kallonen T, Alanko J, Samuelsen Ø, Hegstad K, Mäkinen V, Corander J, Heinz E, Honkela A. Bacterial genomic epidemiology with mixed samples. Microb Genom 2021; 7:000691. [PMID: 34779765 PMCID: PMC8743562 DOI: 10.1099/mgen.0.000691] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2021] [Accepted: 09/13/2021] [Indexed: 11/18/2022] Open
Abstract
Genomic epidemiology is a tool for tracing transmission of pathogens based on whole-genome sequencing. We introduce the mGEMS pipeline for genomic epidemiology with plate sweeps representing mixed samples of a target pathogen, opening the possibility to sequence all colonies on selective plates with a single DNA extraction and sequencing step. The pipeline includes the novel mGEMS read binner for probabilistic assignments of sequencing reads, and the scalable pseudoaligner Themisto. We demonstrate the effectiveness of our approach using closely related samples in a nosocomial setting, obtaining results that are comparable to those based on single-colony picks. Our results lend firm support to more widespread consideration of genomic epidemiology with mixed infection samples.
Collapse
Affiliation(s)
- Tommi Mäklin
- Helsinki Institute for Information Technology HIIT, Department of Mathematics and Statistics, University of Helsinki, Helsinki, Finland
| | - Teemu Kallonen
- Department of Biostatistics, University of Oslo, Oslo, Norway
- Wellcome Sanger Institute, Hinxton, Cambridgeshire, UK
| | - Jarno Alanko
- Helsinki Institute for Information Technology HIIT, Department of Computer Science, University of Helsinki, Helsinki, Finland
| | - Ørjan Samuelsen
- Norwegian National Advisory Unit on Detection of Antimicrobial Resistance, Department of Microbiology and Infection Control, University Hospital of North Norway, Tromsø, Norway
- Department of Pharmacy, UT The Arctic University of Norway, Tromsø, Norway
| | - Kristin Hegstad
- Norwegian National Advisory Unit on Detection of Antimicrobial Resistance, Department of Microbiology and Infection Control, University Hospital of North Norway, Tromsø, Norway
- Research group for Host-Microbe Interactions, Department of Medical Biology, Faculty of Health Sciences, UT The Arctic University of Norway, Tromsø, Norway
| | - Veli Mäkinen
- Helsinki Institute for Information Technology HIIT, Department of Computer Science, University of Helsinki, Helsinki, Finland
| | - Jukka Corander
- Helsinki Institute for Information Technology HIIT, Department of Mathematics and Statistics, University of Helsinki, Helsinki, Finland
- Department of Biostatistics, University of Oslo, Oslo, Norway
- Wellcome Sanger Institute, Hinxton, Cambridgeshire, UK
| | - Eva Heinz
- Department of Biostatistics, University of Oslo, Oslo, Norway
- Liverpool School of Tropical Medicine, Liverpool, UK
| | - Antti Honkela
- Helsinki Institute for Information Technology HIIT, Department of Computer Science, University of Helsinki, Helsinki, Finland
| |
Collapse
|
23
|
Tonkin-Hill G, Martincorena I, Amato R, Lawson ARJ, Gerstung M, Johnston I, Jackson DK, Park N, Lensing SV, Quail MA, Gonçalves S, Ariani C, Spencer Chapman M, Hamilton WL, Meredith LW, Hall G, Jahun AS, Chaudhry Y, Hosmillo M, Pinckert ML, Georgana I, Yakovleva A, Caller LG, Caddy SL, Feltwell T, Khokhar FA, Houldcroft CJ, Curran MD, Parmar S, Alderton A, Nelson R, Harrison EM, Sillitoe J, Bentley SD, Barrett JC, Torok ME, Goodfellow IG, Langford C, Kwiatkowski D. Patterns of within-host genetic diversity in SARS-CoV-2. eLife 2021; 10:e66857. [PMID: 34387545 PMCID: PMC8363274 DOI: 10.7554/elife.66857] [Citation(s) in RCA: 92] [Impact Index Per Article: 30.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2021] [Accepted: 07/22/2021] [Indexed: 12/15/2022] Open
Abstract
Monitoring the spread of SARS-CoV-2 and reconstructing transmission chains has become a major public health focus for many governments around the world. The modest mutation rate and rapid transmission of SARS-CoV-2 prevents the reconstruction of transmission chains from consensus genome sequences, but within-host genetic diversity could theoretically help identify close contacts. Here we describe the patterns of within-host diversity in 1181 SARS-CoV-2 samples sequenced to high depth in duplicate. 95.1% of samples show within-host mutations at detectable allele frequencies. Analyses of the mutational spectra revealed strong strand asymmetries suggestive of damage or RNA editing of the plus strand, rather than replication errors, dominating the accumulation of mutations during the SARS-CoV-2 pandemic. Within- and between-host diversity show strong purifying selection, particularly against nonsense mutations. Recurrent within-host mutations, many of which coincide with known phylogenetic homoplasies, display a spectrum and patterns of purifying selection more suggestive of mutational hotspots than recombination or convergent evolution. While allele frequencies suggest that most samples result from infection by a single lineage, we identify multiple putative examples of co-infection. Integrating these results into an epidemiological inference framework, we find that while sharing of within-host variants between samples could help the reconstruction of transmission chains, mutational hotspots and rare cases of superinfection can confound these analyses.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | - Naomi Park
- Wellcome Sanger InstituteHinxtonUnited Kingdom
| | | | | | | | | | | | | | - Luke W Meredith
- Department of Pathology, University of CambridgeCambridgeUnited Kingdom
| | - Grant Hall
- Department of Pathology, University of CambridgeCambridgeUnited Kingdom
| | - Aminu S Jahun
- Department of Pathology, University of CambridgeCambridgeUnited Kingdom
| | - Yasmin Chaudhry
- Department of Pathology, University of CambridgeCambridgeUnited Kingdom
| | - Myra Hosmillo
- Department of Pathology, University of CambridgeCambridgeUnited Kingdom
| | - Malte L Pinckert
- Department of Pathology, University of CambridgeCambridgeUnited Kingdom
| | - Iliana Georgana
- Department of Pathology, University of CambridgeCambridgeUnited Kingdom
| | - Anna Yakovleva
- Department of Pathology, University of CambridgeCambridgeUnited Kingdom
| | - Laura G Caller
- Department of Pathology, University of CambridgeCambridgeUnited Kingdom
| | - Sarah L Caddy
- Department of Medicine, University of CambridgeCambridgeUnited Kingdom
| | - Theresa Feltwell
- Department of Pathology, University of CambridgeCambridgeUnited Kingdom
| | - Fahad A Khokhar
- Department of Medicine, University of CambridgeCambridgeUnited Kingdom
- Cambridge Institute of Therapeutic Immunology and Infectious Disease, University of CambridgeCambridgeUnited Kingdom
| | | | | | | | | | | | | | - Ewan M Harrison
- Wellcome Sanger InstituteHinxtonUnited Kingdom
- European Bioinformatics InstituteHinxtonUnited Kingdom
| | | | | | | | - M Estee Torok
- Department of Medicine, University of CambridgeCambridgeUnited Kingdom
| | - Ian G Goodfellow
- Department of Pathology, University of CambridgeCambridgeUnited Kingdom
| | | | - Dominic Kwiatkowski
- Wellcome Sanger InstituteHinxtonUnited Kingdom
- Nuffield Department of Medicine, University of OxfordOxfordUnited Kingdom
| | | |
Collapse
|
24
|
Berry IM, Melendrez MC, Pollett S, Figueroa K, Buddhari D, Klungthong C, Nisalak A, Panciera M, Thaisomboonsuk B, Li T, Vallard TG, Macareo L, Yoon IK, Thomas SJ, Endy T, Jarman RG. Precision Tracing of Household Dengue Spread Using Inter- and Intra-Host Viral Variation Data, Kamphaeng Phet, Thailand. Emerg Infect Dis 2021; 27:1637-1644. [PMID: 34013878 PMCID: PMC8153871 DOI: 10.3201/eid2706.204323] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
Dengue control approaches are best informed by granular spatial epidemiology of these viruses, yet reconstruction of inter- and intra-household transmissions is limited when analyzing case count, serologic, or genomic consensus sequence data. To determine viral spread on a finer spatial scale, we extended phylogenomic discrete trait analyses to reconstructions of house-to-house transmissions within a prospective cluster study in Kamphaeng Phet, Thailand. For additional resolution and transmission confirmation, we mapped dengue intra-host single nucleotide variants on the taxa of these time-scaled phylogenies. This approach confirmed 19 household transmissions and revealed that dengue disperses an average of 70 m per day between households in these communities. We describe an evolutionary biology framework for the resolution of dengue transmissions that cannot be differentiated based on epidemiologic and consensus genome data alone. This framework can be used as a public health tool to inform control approaches and enable precise tracing of dengue transmissions.
Collapse
|
25
|
Dawson D, Rasmussen D, Peng X, Lanzas C. Inferring environmental transmission using phylodynamics: a case-study using simulated evolution of an enteric pathogen. J R Soc Interface 2021; 18:20210041. [PMID: 34102084 DOI: 10.1098/rsif.2021.0041] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Indirect (environmental) and direct (host-host) transmission pathways cannot easily be distinguished when they co-occur in epidemics, particularly when they occur on similar time scales. Phylodynamic reconstruction is a potential approach to this problem that combines epidemiological information (temporal, spatial information) with pathogen whole-genome sequencing data to infer transmission trees of epidemics. However, factors such as differences in mutation and transmission rates between host and non-host environments may obscure phylogenetic inference from these methods. In this study, we used a network-based transmission model that explicitly models pathogen evolution to simulate epidemics with both direct and indirect transmission. Epidemics were simulated according to factorial combinations of direct/indirect transmission proportions, host mutation rates and conditions of environmental pathogen growth. Transmission trees were then reconstructed using the phylodynamic approach SCOTTI (structured coalescent transmission tree inference) and evaluated. We found that although insufficient diversity sets a lower bound on when accurate phylodynamic inferences can be made, transmission routes and assumed pathogen lifestyle affected pathogen population structure and subsequently influenced both reconstruction success and the likelihood of direct versus indirect pathways being reconstructed. We conclude that prior knowledge of the likely ecology and population structure of pathogens in host and non-host environments is critical to fully using phylodynamic techniques.
Collapse
Affiliation(s)
- Daniel Dawson
- Department of Population Health and Pathobiology, College of Veterinary Medicine, North Carolina State University, Raleigh, NC, USA
| | - David Rasmussen
- Bioinformatics Research Center, North Carolina State University, Raleigh, NC, USA.,Department of Entomology and Plant Pathology, North Carolina State University, Raleigh, NC, USA
| | - Xinxia Peng
- Bioinformatics Research Center, North Carolina State University, Raleigh, NC, USA.,Department of Molecular Biomedical Sciences, College of Veterinary Medicine, North Carolina State University, Raleigh, NC, USA
| | - Cristina Lanzas
- Department of Population Health and Pathobiology, College of Veterinary Medicine, North Carolina State University, Raleigh, NC, USA
| |
Collapse
|
26
|
Valesano AL, Rumfelt KE, Dimcheff DE, Blair CN, Fitzsimmons WJ, Petrie JG, Martin ET, Lauring AS. Temporal dynamics of SARS-CoV-2 mutation accumulation within and across infected hosts. PLoS Pathog 2021; 17:e1009499. [PMID: 33826681 PMCID: PMC8055005 DOI: 10.1371/journal.ppat.1009499] [Citation(s) in RCA: 71] [Impact Index Per Article: 23.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2021] [Revised: 04/19/2021] [Accepted: 03/24/2021] [Indexed: 01/12/2023] Open
Abstract
Analysis of SARS-CoV-2 genetic diversity within infected hosts can provide insight into the generation and spread of new viral variants and may enable high resolution inference of transmission chains. However, little is known about temporal aspects of SARS-CoV-2 intrahost diversity and the extent to which shared diversity reflects convergent evolution as opposed to transmission linkage. Here we use high depth of coverage sequencing to identify within-host genetic variants in 325 specimens from hospitalized COVID-19 patients and infected employees at a single medical center. We validated our variant calling by sequencing defined RNA mixtures and identified viral load as a critical factor in variant identification. By leveraging clinical metadata, we found that intrahost diversity is low and does not vary by time from symptom onset. This suggests that variants will only rarely rise to appreciable frequency prior to transmission. Although there was generally little shared variation across the sequenced cohort, we identified intrahost variants shared across individuals who were unlikely to be related by transmission. These variants did not precede a rise in frequency in global consensus genomes, suggesting that intrahost variants may have limited utility for predicting future lineages. These results provide important context for sequence-based inference in SARS-CoV-2 evolution and epidemiology.
Collapse
Affiliation(s)
- Andrew L. Valesano
- Division of Infectious Diseases, Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan, United States of America
- Department of Microbiology and Immunology, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Kalee E. Rumfelt
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Derek E. Dimcheff
- Division of Hospital Medicine, Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Christopher N. Blair
- Division of Infectious Diseases, Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan, United States of America
- Department of Microbiology and Immunology, University of Michigan, Ann Arbor, Michigan, United States of America
| | - William J. Fitzsimmons
- Division of Infectious Diseases, Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan, United States of America
- Department of Microbiology and Immunology, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Joshua G. Petrie
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Emily T. Martin
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Adam S. Lauring
- Division of Infectious Diseases, Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan, United States of America
- Department of Microbiology and Immunology, University of Michigan, Ann Arbor, Michigan, United States of America
| |
Collapse
|
27
|
Hufsky F, Lamkiewicz K, Almeida A, Aouacheria A, Arighi C, Bateman A, Baumbach J, Beerenwinkel N, Brandt C, Cacciabue M, Chuguransky S, Drechsel O, Finn RD, Fritz A, Fuchs S, Hattab G, Hauschild AC, Heider D, Hoffmann M, Hölzer M, Hoops S, Kaderali L, Kalvari I, von Kleist M, Kmiecinski R, Kühnert D, Lasso G, Libin P, List M, Löchel HF, Martin MJ, Martin R, Matschinske J, McHardy AC, Mendes P, Mistry J, Navratil V, Nawrocki EP, O’Toole ÁN, Ontiveros-Palacios N, Petrov AI, Rangel-Pineros G, Redaschi N, Reimering S, Reinert K, Reyes A, Richardson L, Robertson DL, Sadegh S, Singer JB, Theys K, Upton C, Welzel M, Williams L, Marz M. Computational strategies to combat COVID-19: useful tools to accelerate SARS-CoV-2 and coronavirus research. Brief Bioinform 2021; 22:642-663. [PMID: 33147627 PMCID: PMC7665365 DOI: 10.1093/bib/bbaa232] [Citation(s) in RCA: 78] [Impact Index Per Article: 26.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2020] [Revised: 07/28/2020] [Accepted: 08/26/2020] [Indexed: 12/16/2022] Open
Abstract
SARS-CoV-2 (severe acute respiratory syndrome coronavirus 2) is a novel virus of the family Coronaviridae. The virus causes the infectious disease COVID-19. The biology of coronaviruses has been studied for many years. However, bioinformatics tools designed explicitly for SARS-CoV-2 have only recently been developed as a rapid reaction to the need for fast detection, understanding and treatment of COVID-19. To control the ongoing COVID-19 pandemic, it is of utmost importance to get insight into the evolution and pathogenesis of the virus. In this review, we cover bioinformatics workflows and tools for the routine detection of SARS-CoV-2 infection, the reliable analysis of sequencing data, the tracking of the COVID-19 pandemic and evaluation of containment measures, the study of coronavirus evolution, the discovery of potential drug targets and development of therapeutic strategies. For each tool, we briefly describe its use case and how it advances research specifically for SARS-CoV-2. All tools are free to use and available online, either through web applications or public code repositories. Contact:evbc@unj-jena.de.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | - Christian Brandt
- Institute of Infectious Disease and Infection Control at Jena University Hospital, Germany
| | - Marco Cacciabue
- Consejo Nacional de Investigaciones Científicas y Tócnicas (CONICET) working on FMDV virology at the Instituto de Agrobiotecnología y Biología Molecular (IABiMo, INTA-CONICET) and at the Departamento de Ciencias Básicas, Universidad Nacional de Luján (UNLu), Argentina
| | | | - Oliver Drechsel
- bioinformatics department at the Robert Koch-Institute, Germany
| | | | - Adrian Fritz
- Computational Biology of Infection Research group of Alice C. McHardy at the Helmholtz Centre for Infection Research, Germany
| | - Stephan Fuchs
- bioinformatics department at the Robert Koch-Institute, Germany
| | - Georges Hattab
- Bioinformatics Division at Philipps-University Marburg, Germany
| | | | - Dominik Heider
- Data Science in Biomedicine at the Philipps-University of Marburg, Germany
| | | | | | - Stefan Hoops
- Biocomplexity Institute and Initiative at the University of Virginia, USA
| | - Lars Kaderali
- Bioinformatics and head of the Institute of Bioinformatics at University Medicine Greifswald, Germany
| | | | - Max von Kleist
- bioinformatics department at the Robert Koch-Institute, Germany
| | - Renó Kmiecinski
- bioinformatics department at the Robert Koch-Institute, Germany
| | | | - Gorka Lasso
- Chandran Lab, Albert Einstein College of Medicine, USA
| | | | | | | | | | | | | | - Alice C McHardy
- Computational Biology of Infection Research Lab at the Helmholtz Centre for Infection Research in Braunschweig, Germany
| | - Pedro Mendes
- Center for Quantitative Medicine of the University of Connecticut School of Medicine, USA
| | | | - Vincent Navratil
- Bioinformatics and Systems Biology at the Rhône Alpes Bioinformatics core facility, Universitó de Lyon, France
| | | | | | | | | | | | - Nicole Redaschi
- Development of the Swiss-Prot group at the SIB for UniProt and SIB resources that cover viral biology (ViralZone)
| | - Susanne Reimering
- Computational Biology of Infection Research group of Alice C. McHardy at the Helmholtz Centre for Infection Research
| | | | | | | | | | - Sepideh Sadegh
- Chair of Experimental Bioinformatics at Technical University of Munich, Germany
| | - Joshua B Singer
- MRC-University of Glasgow Centre for Virus Research, Glasgow, Scotland, UK
| | | | - Chris Upton
- Department of Biochemistry and Microbiology, University of Victoria, Canada
| | | | | | - Manja Marz
- Friedrich Schiller University Jena, Germany
| |
Collapse
|
28
|
Ramazzotti D, Angaroni F, Maspero D, Gambacorti-Passerini C, Antoniotti M, Graudenzi A, Piazza R. VERSO: A comprehensive framework for the inference of robust phylogenies and the quantification of intra-host genomic diversity of viral samples. PATTERNS (NEW YORK, N.Y.) 2021; 2:100212. [PMID: 33728416 PMCID: PMC7953447 DOI: 10.1016/j.patter.2021.100212] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/12/2020] [Revised: 11/30/2020] [Accepted: 01/22/2021] [Indexed: 12/22/2022]
Abstract
We introduce VERSO, a two-step framework for the characterization of viral evolution from sequencing data of viral genomes, which is an improvement on phylogenomic approaches for consensus sequences. VERSO exploits an efficient algorithmic strategy to return robust phylogenies from clonal variant profiles, also in conditions of sampling limitations. It then leverages variant frequency patterns to characterize the intra-host genomic diversity of samples, revealing undetected infection chains and pinpointing variants likely involved in homoplasies. On simulations, VERSO outperforms state-of-the-art tools for phylogenetic inference. Notably, the application to 6,726 amplicon and RNA sequencing samples refines the estimation of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) evolution, while co-occurrence patterns of minor variants unveil undetected infection paths, which are validated with contact tracing data. Finally, the analysis of SARS-CoV-2 mutational landscape uncovers a temporal increase of overall genomic diversity and highlights variants transiting from minor to clonal state and homoplastic variants, some of which fall on the spike gene. Available at: https://github.com/BIMIB-DISCo/VERSO.
Collapse
Affiliation(s)
- Daniele Ramazzotti
- Department of Medicine and Surgery, Università degli Studi di Milano-Bicocca, Monza, Italy
| | - Fabrizio Angaroni
- Department of Informatics, Systems and Communication, Università degli Studi di Milano-Bicocca, Milan, Italy
| | - Davide Maspero
- Department of Informatics, Systems and Communication, Università degli Studi di Milano-Bicocca, Milan, Italy
- Inst. of Molecular Bioimaging and Physiology, Consiglio Nazionale delle Ricerche (IBFM-CNR), Segrate, Milan, Italy
| | | | - Marco Antoniotti
- Department of Informatics, Systems and Communication, Università degli Studi di Milano-Bicocca, Milan, Italy
- Bicocca Bioinformatics, Biostatistics and Bioimaging Centre – B4, Milan, Italy
| | - Alex Graudenzi
- Inst. of Molecular Bioimaging and Physiology, Consiglio Nazionale delle Ricerche (IBFM-CNR), Segrate, Milan, Italy
- Bicocca Bioinformatics, Biostatistics and Bioimaging Centre – B4, Milan, Italy
| | - Rocco Piazza
- Department of Medicine and Surgery, Università degli Studi di Milano-Bicocca, Monza, Italy
| |
Collapse
|
29
|
Dynamic Molecular Epidemiology Reveals Lineage-Associated Single-Nucleotide Variants That Alter RNA Structure in Chikungunya Virus. Genes (Basel) 2021; 12:genes12020239. [PMID: 33567556 PMCID: PMC7914560 DOI: 10.3390/genes12020239] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2021] [Revised: 01/29/2021] [Accepted: 02/04/2021] [Indexed: 01/21/2023] Open
Abstract
Chikungunya virus (CHIKV) is an emerging Alphavirus which causes millions of human infections every year. Outbreaks have been reported in Africa and Asia since the early 1950s, from three CHIKV lineages: West African, East Central South African, and Asian Urban. As new outbreaks occurred in the Americas, individual strains from the known lineages have evolved, creating new monophyletic groups that generated novel geographic-based lineages. Building on a recently updated phylogeny of CHIKV, we report here the availability of an interactive CHIKV phylodynamics dataset, which is based on more than 900 publicly available CHIKV genomes. We provide an interactive view of CHIKV molecular epidemiology built on Nextstrain, a web-based visualization framework for real-time tracking of pathogen evolution. CHIKV molecular epidemiology reveals single nucleotide variants that change the stability and fold of locally stable RNA structures. We propose alternative RNA structure formation in different CHIKV lineages by predicting more than a dozen RNA elements that are subject to perturbation of the structure ensemble upon variation of a single nucleotide.
Collapse
|
30
|
Valesano AL, Rumfelt KE, Dimcheff DE, Blair CN, Fitzsimmons WJ, Petrie JG, Martin ET, Lauring AS. Temporal dynamics of SARS-CoV-2 mutation accumulation within and across infected hosts. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2021:2021.01.19.427330. [PMID: 33501443 PMCID: PMC7836113 DOI: 10.1101/2021.01.19.427330] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Analysis of SARS-CoV-2 genetic diversity within infected hosts can provide insight into the generation and spread of new viral variants and may enable high resolution inference of transmission chains. However, little is known about temporal aspects of SARS-CoV-2 intrahost diversity and the extent to which shared diversity reflects convergent evolution as opposed to transmission linkage. Here we use high depth of coverage sequencing to identify within-host genetic variants in 325 specimens from hospitalized COVID-19 patients and infected employees at a single medical center. We validated our variant calling by sequencing defined RNA mixtures and identified a viral load threshold that minimizes false positives. By leveraging clinical metadata, we found that intrahost diversity is low and does not vary by time from symptom onset. This suggests that variants will only rarely rise to appreciable frequency prior to transmission. Although there was generally little shared variation across the sequenced cohort, we identified intrahost variants shared across individuals who were unlikely to be related by transmission. These variants did not precede a rise in frequency in global consensus genomes, suggesting that intrahost variants may have limited utility for predicting future lineages. These results provide important context for sequence-based inference in SARS-CoV-2 evolution and epidemiology.
Collapse
Affiliation(s)
- Andrew L. Valesano
- Division of Infectious Diseases, Department of Internal Medicine, University of Michigan, Ann Arbor, MI, USA
- Department of Microbiology and Immunology, University of Michigan, Ann Arbor, MI, USA
| | - Kalee E. Rumfelt
- Division of Infectious Diseases, Department of Internal Medicine, University of Michigan, Ann Arbor, MI, USA
- Department of Microbiology and Immunology, University of Michigan, Ann Arbor, MI, USA
| | - Derek E. Dimcheff
- Division of Hospital Medicine, Department of Internal Medicine, University of Michigan, Ann Arbor, MI, USA
| | - Christopher N. Blair
- Division of Infectious Diseases, Department of Internal Medicine, University of Michigan, Ann Arbor, MI, USA
- Department of Microbiology and Immunology, University of Michigan, Ann Arbor, MI, USA
| | - William J. Fitzsimmons
- Division of Infectious Diseases, Department of Internal Medicine, University of Michigan, Ann Arbor, MI, USA
- Department of Microbiology and Immunology, University of Michigan, Ann Arbor, MI, USA
| | - Joshua G. Petrie
- Department of Epidemiology, University of Michigan, Ann Arbor, MI, USA
| | - Emily T. Martin
- Department of Epidemiology, University of Michigan, Ann Arbor, MI, USA
| | - Adam S. Lauring
- Division of Infectious Diseases, Department of Internal Medicine, University of Michigan, Ann Arbor, MI, USA
- Department of Microbiology and Immunology, University of Michigan, Ann Arbor, MI, USA
| |
Collapse
|
31
|
Volz EM, Carsten W, Grad YH, Frost SDW, Dennis AM, Didelot X. Identification of Hidden Population Structure in Time-Scaled Phylogenies. Syst Biol 2021; 69:884-896. [PMID: 32049340 PMCID: PMC8559910 DOI: 10.1093/sysbio/syaa009] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2019] [Revised: 01/09/2020] [Accepted: 01/23/2020] [Indexed: 11/13/2022] Open
Abstract
Population structure influences genealogical patterns, however, data pertaining to how populations are structured are often unavailable or not directly observable. Inference of population structure is highly important in molecular epidemiology where pathogen phylogenetics is increasingly used to infer transmission patterns and detect outbreaks. Discrepancies between observed and idealized genealogies, such as those generated by the coalescent process, can be quantified, and where significant differences occur, may reveal the action of natural selection, host population structure, or other demographic and epidemiological heterogeneities. We have developed a fast non-parametric statistical test for detection of cryptic population structure in time-scaled phylogenetic trees. The test is based on contrasting estimated phylogenies with the theoretically expected phylodynamic ordering of common ancestors in two clades within a coalescent framework. These statistical tests have also motivated the development of algorithms which can be used to quickly screen a phylogenetic tree for clades which are likely to share a distinct demographic or epidemiological history. Epidemiological applications include identification of outbreaks in vulnerable host populations or rapid expansion of genotypes with a fitness advantage. To demonstrate the utility of these methods for outbreak detection, we applied the new methods to large phylogenies reconstructed from thousands of HIV-1 partial pol sequences. This revealed the presence of clades which had grown rapidly in the recent past and was significantly concentrated in young men, suggesting recent and rapid transmission in that group. Furthermore, to demonstrate the utility of these methods for the study of antimicrobial resistance, we applied the new methods to a large phylogeny reconstructed from whole genome Neisseria gonorrhoeae sequences. We find that population structure detected using these methods closely overlaps with the appearance and expansion of mutations conferring antimicrobial resistance. [Antimicrobial resistance; coalescent; HIV; population structure.].
Collapse
Affiliation(s)
- Erik M Volz
- Department of Infectious Disease Epidemiology and MRC Centre for Global Infectious Disease Analysis, Imperial College London, Norfolk Place, W2 1PG London, UK
| | - Wiuf Carsten
- Department of Mathematical Sciences, University of Copenhagen, Universitetsparken 5, DK-2100 Copenhagen, Denmark
| | - Yonatan H Grad
- Department of Immunology and Infectious Diseases, TH Chan School of Public Health, Harvard University, 677 Huntington Ave, Boston, MA 02115, USA
| | - Simon D W Frost
- Department of Veterinary Medicine, University of Cambridge, Madingley Rd, Cambridge CB3 0ES, UK.,The Alan Turing Institute, 96 Euston Rd, London NW1 2DB, London, UK
| | - Ann M Dennis
- Department of Medicine, University of North Carolina Chapel Hill, 321 S Columbia St, Chapel Hill, NC 27516, USA
| | - Xavier Didelot
- School of Life Sciences and Department of Statistics, University of Warwick, Coventry, CV4 7AL, UK
| |
Collapse
|
32
|
Identifying likely transmissions in Mycobacterium bovis infected populations of cattle and badgers using the Kolmogorov Forward Equations. Sci Rep 2020; 10:21980. [PMID: 33319838 PMCID: PMC7738532 DOI: 10.1038/s41598-020-78900-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2020] [Accepted: 11/20/2020] [Indexed: 11/16/2022] Open
Abstract
Established methods for whole-genome-sequencing (WGS) technology allow for the detection of single-nucleotide polymorphisms (SNPs) in the pathogen genomes sourced from host samples. The information obtained can be used to track the pathogen’s evolution in time and potentially identify ‘who-infected-whom’ with unprecedented accuracy. Successful methods include ‘phylodynamic approaches’ that integrate evolutionary and epidemiological data. However, they are typically computationally intensive, require extensive data, and are best applied when there is a strong molecular clock signal and substantial pathogen diversity. To determine how much transmission information can be inferred when pathogen genetic diversity is low and metadata limited, we propose an analytical approach that combines pathogen WGS data and sampling times from infected hosts. It accounts for ‘between-scale’ processes, in particular within-host pathogen evolution and between-host transmission. We applied this to a well-characterised population with an endemic Mycobacterium bovis (the causative agent of bovine/zoonotic tuberculosis, bTB) infection. Our results show that, even with such limited data and low diversity, the computation of the transmission probability between host pairs can help discriminate between likely and unlikely infection pathways and therefore help to identify potential transmission networks. However, the method can be sensitive to assumptions about within-host evolution.
Collapse
|
33
|
Bull RA, Adikari TN, Ferguson JM, Hammond JM, Stevanovski I, Beukers AG, Naing Z, Yeang M, Verich A, Gamaarachchi H, Kim KW, Luciani F, Stelzer-Braid S, Eden JS, Rawlinson WD, van Hal SJ, Deveson IW. Analytical validity of nanopore sequencing for rapid SARS-CoV-2 genome analysis. Nat Commun 2020; 11:6272. [PMID: 33298935 PMCID: PMC7726558 DOI: 10.1038/s41467-020-20075-6] [Citation(s) in RCA: 151] [Impact Index Per Article: 37.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2020] [Accepted: 11/11/2020] [Indexed: 01/15/2023] Open
Abstract
Viral whole-genome sequencing (WGS) provides critical insight into the transmission and evolution of Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2). Long-read sequencing devices from Oxford Nanopore Technologies (ONT) promise significant improvements in turnaround time, portability and cost, compared to established short-read sequencing platforms for viral WGS (e.g., Illumina). However, adoption of ONT sequencing for SARS-CoV-2 surveillance has been limited due to common concerns around sequencing accuracy. To address this, here we perform viral WGS with ONT and Illumina platforms on 157 matched SARS-CoV-2-positive patient specimens and synthetic RNA controls, enabling rigorous evaluation of analytical performance. We report that, despite the elevated error rates observed in ONT sequencing reads, highly accurate consensus-level sequence determination was achieved, with single nucleotide variants (SNVs) detected at >99% sensitivity and >99% precision above a minimum ~60-fold coverage depth, thereby ensuring suitability for SARS-CoV-2 genome analysis. ONT sequencing also identified a surprising diversity of structural variation within SARS-CoV-2 specimens that were supported by evidence from short-read sequencing on matched samples. However, ONT sequencing failed to accurately detect short indels and variants at low read-count frequencies. This systematic evaluation of analytical performance for SARS-CoV-2 WGS will facilitate widespread adoption of ONT sequencing within local, national and international COVID-19 public health initiatives.
Collapse
Affiliation(s)
- Rowena A Bull
- The Kirby Institute for Infection and Immunity, University of New South Wales, Sydney, NSW, Australia.,School of Medical Sciences, Faculty of Medicine, University of New South Wales, Sydney, NSW, Australia
| | - Thiruni N Adikari
- The Kirby Institute for Infection and Immunity, University of New South Wales, Sydney, NSW, Australia.,School of Medical Sciences, Faculty of Medicine, University of New South Wales, Sydney, NSW, Australia
| | - James M Ferguson
- Kinghorn Centre for Clinical Genomics, Garvan Institute of Medical Research, Sydney, NSW, Australia
| | - Jillian M Hammond
- Kinghorn Centre for Clinical Genomics, Garvan Institute of Medical Research, Sydney, NSW, Australia
| | - Igor Stevanovski
- Kinghorn Centre for Clinical Genomics, Garvan Institute of Medical Research, Sydney, NSW, Australia
| | - Alicia G Beukers
- NSW Health Pathology, Department of Infectious Diseases and Microbiology, Royal Prince Alfred Hospital, Sydney, NSW, Australia
| | - Zin Naing
- School of Medical Sciences, Faculty of Medicine, University of New South Wales, Sydney, NSW, Australia.,Virology Research Laboratory, Serology and Virology Division (SAViD), NSW Health Pathology, Prince of Wales Hospital, Sydney, NSW, Australia
| | - Malinna Yeang
- School of Medical Sciences, Faculty of Medicine, University of New South Wales, Sydney, NSW, Australia.,Virology Research Laboratory, Serology and Virology Division (SAViD), NSW Health Pathology, Prince of Wales Hospital, Sydney, NSW, Australia
| | - Andrey Verich
- The Kirby Institute for Infection and Immunity, University of New South Wales, Sydney, NSW, Australia
| | - Hasindu Gamaarachchi
- Kinghorn Centre for Clinical Genomics, Garvan Institute of Medical Research, Sydney, NSW, Australia.,School of Computer Science and Engineering, University of New South Wales, Sydney, NSW, Australia
| | - Ki Wook Kim
- Virology Research Laboratory, Serology and Virology Division (SAViD), NSW Health Pathology, Prince of Wales Hospital, Sydney, NSW, Australia.,School of Women's and Children's Health, Faculty of Medicine, University of New South Wales, Sydney, NSW, Australia
| | - Fabio Luciani
- The Kirby Institute for Infection and Immunity, University of New South Wales, Sydney, NSW, Australia.,School of Medical Sciences, Faculty of Medicine, University of New South Wales, Sydney, NSW, Australia
| | - Sacha Stelzer-Braid
- School of Medical Sciences, Faculty of Medicine, University of New South Wales, Sydney, NSW, Australia.,Virology Research Laboratory, Serology and Virology Division (SAViD), NSW Health Pathology, Prince of Wales Hospital, Sydney, NSW, Australia
| | - John-Sebastian Eden
- Marie Bashir Institute for Infectious Diseases and Biosecurity & Sydney Medical School, The University of Sydney, Sydney, NSW, Australia.,Centre for Virus Research, Westmead Institute for Medical Research, Sydney, NSW, Australia
| | - William D Rawlinson
- School of Medical Sciences, Faculty of Medicine, University of New South Wales, Sydney, NSW, Australia.,Virology Research Laboratory, Serology and Virology Division (SAViD), NSW Health Pathology, Prince of Wales Hospital, Sydney, NSW, Australia.,School of Women's and Children's Health, Faculty of Medicine, University of New South Wales, Sydney, NSW, Australia.,School of Biotechnology and Biomolecular Sciences, Faculty of Science, University of New South Wales, Sydney, NSW, Australia
| | - Sebastiaan J van Hal
- NSW Health Pathology, Department of Infectious Diseases and Microbiology, Royal Prince Alfred Hospital, Sydney, NSW, Australia.,Central Clinical School, University of Sydney, Sydney, NSW, Australia
| | - Ira W Deveson
- Kinghorn Centre for Clinical Genomics, Garvan Institute of Medical Research, Sydney, NSW, Australia. .,St Vincent's Clinical School, Faculty of Medicine, University of New South Wales, Sydney, NSW, Australia.
| |
Collapse
|
34
|
Lequime S, Bastide P, Dellicour S, Lemey P, Baele G. nosoi: A stochastic agent-based transmission chain simulation framework in r. Methods Ecol Evol 2020; 11:1002-1007. [PMID: 32983401 PMCID: PMC7496779 DOI: 10.1111/2041-210x.13422] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2020] [Accepted: 05/13/2020] [Indexed: 12/22/2022]
Abstract
The transmission process of an infectious agent creates a connected chain of hosts linked by transmission events, known as a transmission chain. Reconstructing transmission chains remains a challenging endeavour, except in rare cases characterized by intense surveillance and epidemiological inquiry. Inference frameworks attempt to estimate or approximate these transmission chains but the accuracy and validity of such methods generally lack formal assessment on datasets for which the actual transmission chain was observed.We here introduce nosoi, an open-source r package that offers a complete, tunable and expandable agent-based framework to simulate transmission chains under a wide range of epidemiological scenarios for single-host and dual-host epidemics. nosoi is accessible through GitHub and CRAN, and is accompanied by extensive documentation, providing help and practical examples to assist users in setting up their own simulations.Once infected, each host or agent can undergo a series of events during each time step, such as moving (between locations) or transmitting the infection, all of these being driven by user-specified rules or data, such as travel patterns between locations. nosoi is able to generate a multitude of epidemic scenarios, that can-for example-be used to validate a wide range of reconstruction methods, including epidemic modelling and phylodynamic analyses. nosoi also offers a comprehensive framework to leverage empirically acquired data, allowing the user to explore how variations in parameters can affect epidemic potential. Aside from research questions, nosoi can provide lecturers with a complete teaching tool to offer students a hands-on exploration of the dynamics of epidemiological processes and the factors that impact it. Because the package does not rely on mathematical formalism but uses a more intuitive algorithmic approach, even extensive changes of the entire model can be easily and quickly implemented.
Collapse
Affiliation(s)
- Sebastian Lequime
- Department of Microbiology, Immunology and TransplantationRega InstituteKU LeuvenLeuvenBelgium
- Cluster of Microbial EcologyGroningen Institute for Evolutionary Life SciencesUniversity of GroningenGroningenThe Netherlands
| | - Paul Bastide
- Department of Microbiology, Immunology and TransplantationRega InstituteKU LeuvenLeuvenBelgium
- IMAGCNRSUniversity of MontpellierMontpellierFrance
| | - Simon Dellicour
- Department of Microbiology, Immunology and TransplantationRega InstituteKU LeuvenLeuvenBelgium
- Spatial Epidemiology Lab (SpELL)Université Libre de BruxellesBrusselsBelgium
| | - Philippe Lemey
- Department of Microbiology, Immunology and TransplantationRega InstituteKU LeuvenLeuvenBelgium
| | - Guy Baele
- Department of Microbiology, Immunology and TransplantationRega InstituteKU LeuvenLeuvenBelgium
| |
Collapse
|
35
|
Abstract
MOTIVATION The combination of genomic and epidemiological data holds the potential to enable accurate pathogen transmission history inference. However, the inference of outbreak transmission histories remains challenging due to various factors such as within-host pathogen diversity and multi-strain infections. Current computational methods ignore within-host diversity and/or multi-strain infections, often failing to accurately infer the transmission history. Thus, there is a need for efficient computational methods for transmission tree inference that accommodate the complexities of real data. RESULTS We formulate the direct transmission inference (DTI) problem for inferring transmission trees that support multi-strain infections given a timed phylogeny and additional epidemiological data. We establish hardness for the decision and counting version of the DTI problem. We introduce Transmission Tree Uniform Sampler (TiTUS), a method that uses SATISFIABILITY to almost uniformly sample from the space of transmission trees. We introduce criteria that prioritize parsimonious transmission trees that we subsequently summarize using a novel consensus tree approach. We demonstrate TiTUS's ability to accurately reconstruct transmission trees on simulated data as well as a documented HIV transmission chain. AVAILABILITY AND IMPLEMENTATION https://github.com/elkebir-group/TiTUS. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Palash Sashittal
- Department of Aerospace Engineering, University of Illinois at Urbana-Champaign, Urbama, IL 61801, USA
| | - Mohammed El-Kebir
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbama, IL 61801, USA
| |
Collapse
|
36
|
Cassidy R, Kypraios T, O'Neill PD. Modelling, Bayesian inference, and model assessment for nosocomial pathogens using whole-genome-sequence data. Stat Med 2020; 39:1746-1765. [PMID: 32142587 PMCID: PMC7217057 DOI: 10.1002/sim.8510] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2019] [Revised: 01/15/2020] [Accepted: 01/31/2020] [Indexed: 12/28/2022]
Abstract
Whole‐genome sequencing of pathogens in outbreaks of infectious disease provides the potential to reconstruct transmission pathways and enhance the information contained in conventional epidemiological data. In recent years, there have been numerous new methods and models developed to exploit such high‐resolution genetic data. However, corresponding methods for model assessment have been largely overlooked. In this article, we develop both new modelling methods and new model assessment methods, specifically by building on the work of Worby et al. Although the methods are generic in nature, we focus specifically on nosocomial pathogens and analyze a dataset collected during an outbreak of MRSA in a hospital setting.
Collapse
Affiliation(s)
- Rosanna Cassidy
- School of Mathematical Sciences, University of Nottingham, Nottingham, UK
| | - Theodore Kypraios
- School of Mathematical Sciences, University of Nottingham, Nottingham, UK
| | - Philip D O'Neill
- School of Mathematical Sciences, University of Nottingham, Nottingham, UK
| |
Collapse
|
37
|
Alamil M, Hughes J, Berthier K, Desbiez C, Thébaud G, Soubeyrand S. Inferring epidemiological links from deep sequencing data: a statistical learning approach for human, animal and plant diseases. Philos Trans R Soc Lond B Biol Sci 2020; 374:20180258. [PMID: 31056055 PMCID: PMC6553606 DOI: 10.1098/rstb.2018.0258] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
Pathogen sequence data have been exploited to infer who infected whom, by using empirical and model-based approaches. Most of these approaches exploit one pathogen sequence per infected host (e.g. individual, household, field). However, modern sequencing techniques can reveal the polymorphic nature of within-host populations of pathogens. Thus, these techniques provide a subsample of the pathogen variants that were present in the host at the sampling time. Such data are expected to give more insight on epidemiological links than a single sequence per host. In general, a mechanistic viewpoint to transmission and micro-evolution has been followed to infer epidemiological links from these data. Here, we investigate an alternative approach grounded on statistical learning. The idea consists of learning the structure of epidemiological links with a pseudo-evolutionary model applied to training data obtained from contact tracing, for example, and using this initial stage to infer links for the whole dataset. Such an approach has the potential to be particularly valuable in the case of a risk of erroneous mechanistic assumptions, it is sufficiently parsimonious to allow the handling of big datasets in the future, and it is versatile enough to be applied to very different contexts from animal, human and plant epidemiology. This article is part of the theme issue ‘Modelling infectious disease outbreaks in humans, animals and plants: approaches and important themes’. This issue is linked with the subsequent theme issue ‘Modelling infectious disease outbreaks in humans, animals and plants: epidemic forecasting and control’.
Collapse
Affiliation(s)
- M Alamil
- 1 BioSP, INRA, 84914 Avignon , France
| | - J Hughes
- 2 MRC-University of Glasgow Centre for Virus Research , Glasgow G61 1QH , UK
| | - K Berthier
- 3 Pathologie Végétale, INRA , 84140 Montfavet , France
| | - C Desbiez
- 3 Pathologie Végétale, INRA , 84140 Montfavet , France
| | - G Thébaud
- 4 BGPI, INRA, Univ. Montpellier , SupAgro, Cirad, 34398 Montpellier , France
| | | |
Collapse
|
38
|
de Bernardi Schneider A, Ford CT, Hostager R, Williams J, Cioce M, Çatalyürek ÜV, Wertheim JO, Janies D. StrainHub: a phylogenetic tool to construct pathogen transmission networks. Bioinformatics 2020; 36:945-947. [PMID: 31418766 PMCID: PMC8215912 DOI: 10.1093/bioinformatics/btz646] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2019] [Revised: 08/06/2019] [Accepted: 08/14/2019] [Indexed: 01/30/2023] Open
Abstract
SUMMARY In exploring the epidemiology of infectious diseases, networks have been used to reconstruct contacts among individuals and/or populations. Summarizing networks using pathogen metadata (e.g. host species and place of isolation) and a phylogenetic tree is a nascent, alternative approach. In this paper, we introduce a tool for reconstructing transmission networks in arbitrary space from phylogenetic information and metadata. Our goals are to provide a means of deriving new insights and infection control strategies based on the dynamics of the pathogen lineages derived from networks and centrality metrics. We created a web-based application, called StrainHub, in which a user can input a phylogenetic tree based on genetic or other data along with characters derived from metadata using their preferred tree search method. StrainHub generates a transmission network based on character state changes in metadata, such as place or source of isolation, mapped on the phylogenetic tree. The user has the option to calculate centrality metrics on the nodes including betweenness, closeness, degree and a new metric, the source/hub ratio. The outputs include the network with values for metrics on its nodes and the tree with characters reconstructed. All of these results can be exported for further analysis. AVAILABILITY AND IMPLEMENTATION strainhub.io and https://github.com/abschneider/StrainHub.
Collapse
Affiliation(s)
| | - Colby T Ford
- Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, Charlotte, NC 28223, USA
| | - Reilly Hostager
- Department of Medicine, University of California, San Diego, San Diego, CA 92103, USA
| | - John Williams
- Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, Charlotte, NC 28223, USA
| | - Michael Cioce
- Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, Charlotte, NC 28223, USA
| | - Ümit V Çatalyürek
- School of Computational Science and Engineering, Georgia Institute of Technology, Atlanta, GA 30332, USA
| | - Joel O Wertheim
- Department of Medicine, University of California, San Diego, San Diego, CA 92103, USA
| | - Daniel Janies
- Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, Charlotte, NC 28223, USA
| |
Collapse
|
39
|
Mak L, Perera D, Lang R, Kossinna P, He J, Gill MJ, Long Q, van Marle G. Evaluation of A Phylogenetic Pipeline to Examine Transmission Networks in A Canadian HIV Cohort. Microorganisms 2020; 8:E196. [PMID: 32023939 PMCID: PMC7074708 DOI: 10.3390/microorganisms8020196] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2019] [Revised: 01/23/2020] [Accepted: 01/29/2020] [Indexed: 01/08/2023] Open
Abstract
Keywords: HIV; Canada; molecular phylogenetics; viral evolution; person-to-person transmission inference; transmission network; summary statistics.
Collapse
Affiliation(s)
- Lauren Mak
- Department of Biochemistry & Molecular Biology, Alberta Children’s Hospital Research Institute, University of Calgary, Calgary, AB T2N 4N1, Canada (P.K.)
| | - Deshan Perera
- Department of Biochemistry & Molecular Biology, Alberta Children’s Hospital Research Institute, University of Calgary, Calgary, AB T2N 4N1, Canada (P.K.)
| | - Raynell Lang
- Department of Medicine, Cumming School of Medicine, University of Calgary and Alberta Health Services, Calgary, AB T2N 4N1, Canada
| | - Pathum Kossinna
- Department of Biochemistry & Molecular Biology, Alberta Children’s Hospital Research Institute, University of Calgary, Calgary, AB T2N 4N1, Canada (P.K.)
| | - Jingni He
- Department of Biochemistry & Molecular Biology, Alberta Children’s Hospital Research Institute, University of Calgary, Calgary, AB T2N 4N1, Canada (P.K.)
| | - M. John Gill
- Department of Medicine, Cumming School of Medicine, University of Calgary and Alberta Health Services, Calgary, AB T2N 4N1, Canada
| | - Quan Long
- Department of Biochemistry & Molecular Biology, Alberta Children’s Hospital Research Institute, University of Calgary, Calgary, AB T2N 4N1, Canada (P.K.)
- Department of Medical Genetics, and Mathematics & Statistics, Alberta Children’s Hospital Research Institute, O’Brien Institute for Public Health, University of Calgary, Calgary, AB T2N 4N1, Canada
- Department of Mathematics & Statistics, University of Calgary, Calgary, AB T2N 1N4, Canada
| | - Guido van Marle
- Department of Microbiology, Immunology, and Infectious Diseases, Cumming School of Medicine, University of Calgary, Calgary, AB T2N 4N1, Canada
| |
Collapse
|
40
|
Crispell J, Benton CH, Balaz D, De Maio N, Ahkmetova A, Allen A, Biek R, Presho EL, Dale J, Hewinson G, Lycett SJ, Nunez-Garcia J, Skuce RA, Trewby H, Wilson DJ, Zadoks RN, Delahay RJ, Kao RR. Combining genomics and epidemiology to analyse bi-directional transmission of Mycobacterium bovis in a multi-host system. eLife 2019; 8:e45833. [PMID: 31843054 PMCID: PMC6917503 DOI: 10.7554/elife.45833] [Citation(s) in RCA: 40] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2019] [Accepted: 10/15/2019] [Indexed: 01/02/2023] Open
Abstract
Quantifying pathogen transmission in multi-host systems is difficult, as exemplified in bovine tuberculosis (bTB) systems, but is crucial for control. The agent of bTB, Mycobacterium bovis, persists in cattle populations worldwide, often where potential wildlife reservoirs exist. However, the relative contribution of different host species to bTB persistence is generally unknown. In Britain, the role of badgers in infection persistence in cattle is highly contentious, despite decades of research and control efforts. We applied Bayesian phylogenetic and machine-learning approaches to bacterial genome data to quantify the roles of badgers and cattle in M. bovis infection dynamics in the presence of data biases. Our results suggest that transmission occurs more frequently from badgers to cattle than vice versa (10.4x in the most likely model) and that within-species transmission occurs at higher rates than between-species transmission for both. If representative, our results suggest that control operations should target both cattle and badgers.
Collapse
Affiliation(s)
- Joseph Crispell
- School of Veterinary Medicine, Veterinary Sciences CentreUniversity College DublinDublinIreland
| | - Clare H Benton
- National Wildlife Management CentreAnimal & Plant Health Agency (APHA)LondonUnited Kingdom
| | - Daniel Balaz
- Roslin InstituteUniversity of EdinburghEdinburghUnited Kingdom
| | - Nicola De Maio
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI)CambridgeUnited Kingdom
| | - Assel Ahkmetova
- Institute of Biodiversity, Animal Health & Comparative Medicine, College of Medical, Veterinary & Life SciencesUniversity of GlasgowGlasgowUnited Kingdom
| | - Adrian Allen
- Agri-Food & Biosciences Institute Northern Ireland (AFBNI)BelfastUnited Kingdom
| | - Roman Biek
- Institute of Biodiversity, Animal Health & Comparative Medicine, College of Medical, Veterinary & Life SciencesUniversity of GlasgowGlasgowUnited Kingdom
| | - Eleanor L Presho
- Agri-Food & Biosciences Institute Northern Ireland (AFBNI)BelfastUnited Kingdom
| | - James Dale
- Animal & Plant Health Agency (APHA)LondonUnited Kingdom
| | - Glyn Hewinson
- Centre for Bovine Tuberculosis, Institute of Biological, Environmental and Rural SciencesUniversity of AberystwythAberystwythUnited Kingdom
| | | | | | - Robin A Skuce
- Agri-Food & Biosciences Institute Northern Ireland (AFBNI)BelfastUnited Kingdom
| | | | - Daniel J Wilson
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, Nuffield Department of Population HealthUniversity of OxfordOxfordUnited Kingdom
| | - Ruth N Zadoks
- Institute of Biodiversity, Animal Health & Comparative Medicine, College of Medical, Veterinary & Life SciencesUniversity of GlasgowGlasgowUnited Kingdom
| | - Richard J Delahay
- National Wildlife Management CentreAnimal & Plant Health Agency (APHA)LondonUnited Kingdom
| | - Rowland Raymond Kao
- Roslin InstituteUniversity of EdinburghEdinburghUnited Kingdom
- Royal (Dick) School of Veterinary StudiesUniversity of EdinburghEdinburghUnited Kingdom
| |
Collapse
|
41
|
Fujikura Y, Hamamoto T, Kanayama A, Kaku K, Yamagishi J, Kawana A. Bayesian reconstruction of a vancomycin-resistant Enterococcus transmission route using epidemiologic data and genomic variants from whole genome sequencing. J Hosp Infect 2019; 103:395-403. [PMID: 31425718 DOI: 10.1016/j.jhin.2019.08.011] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2019] [Accepted: 08/12/2019] [Indexed: 11/16/2022]
Abstract
BACKGROUND Outbreaks of vancomycin-resistant enterococcus (VRE) are a serious problem in hospitals. Inferring the transmission route is an important factor to institute appropriate infection control measures; however, the methodology has not been fully established. AIM To reconstruct and evaluate the transmission model using sequence variants extracted from whole genome sequencing (WGS) data and epidemiological information from patients involved in a VRE outbreak. METHODS During a VRE outbreak in our hospital, 23 samples were collected from patients and environmental surfaces and analysed using WGS. By combining genome alignment information with patient epidemiological data, the VRE transmission route was reconstructed using a Bayesian approach. With the transmission model, evaluation and further analyses were performed to identify risk factors that contributed to the outbreak. FINDINGS All VREs were identified as Enterococcus faecium belonging to sequence type 17, which consisted of two VRE genotypes: vanA (N = 8, including one environmental sample) and vanB (N = 15). The reconstruction model using the Bayesian approach showed the transmission direction with posterior probability and revealed transmission through an environmental surface. In addition, some cases acting as VRE spreaders were identified, which can interfere with appropriate infection control. Vancomycin administration was identified as a significant risk factor for spreaders. CONCLUSION A Bayesian approach for transmission route reconstruction using epidemiologic data and genomic variants from WGS can be applied in actual VRE outbreaks. This may contribute to the design and implementation of effective infection control measures.
Collapse
Affiliation(s)
- Y Fujikura
- Department of Medical Risk Management and Infection Control, National Defense Medical College Hospital, Saitama, Japan; Division of Infectious Diseases and Respiratory Medicine, Department of Internal Medicine, National Defense Medical College, Saitama, Japan.
| | - T Hamamoto
- Department of Clinical Laboratory, National Defense Medical College Hospital, Saitama, Japan
| | - A Kanayama
- Division of Infectious Diseases Epidemiology and Control, National Defense Medical College Research Institute, Saitama, Japan
| | - K Kaku
- Division of Infectious Diseases Epidemiology and Control, National Defense Medical College Research Institute, Saitama, Japan
| | - J Yamagishi
- Research Center for Zoonosis Control, Hokkaido University, Sapporo, Japan
| | - A Kawana
- Division of Infectious Diseases and Respiratory Medicine, Department of Internal Medicine, National Defense Medical College, Saitama, Japan
| |
Collapse
|
42
|
Whole genome sequencing of Mycobacterium tuberculosis: current standards and open issues. Nat Rev Microbiol 2019; 17:533-545. [DOI: 10.1038/s41579-019-0214-5] [Citation(s) in RCA: 155] [Impact Index Per Article: 31.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
|
43
|
Bouckaert R, Vaughan TG, Barido-Sottani J, Duchêne S, Fourment M, Gavryushkina A, Heled J, Jones G, Kühnert D, De Maio N, Matschiner M, Mendes FK, Müller NF, Ogilvie HA, du Plessis L, Popinga A, Rambaut A, Rasmussen D, Siveroni I, Suchard MA, Wu CH, Xie D, Zhang C, Stadler T, Drummond AJ. BEAST 2.5: An advanced software platform for Bayesian evolutionary analysis. PLoS Comput Biol 2019; 15:e1006650. [PMID: 30958812 PMCID: PMC6472827 DOI: 10.1371/journal.pcbi.1006650] [Citation(s) in RCA: 1610] [Impact Index Per Article: 322.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2018] [Revised: 04/18/2019] [Accepted: 02/04/2019] [Indexed: 11/18/2022] Open
Abstract
Elaboration of Bayesian phylogenetic inference methods has continued at pace in recent years with major new advances in nearly all aspects of the joint modelling of evolutionary data. It is increasingly appreciated that some evolutionary questions can only be adequately answered by combining evidence from multiple independent sources of data, including genome sequences, sampling dates, phenotypic data, radiocarbon dates, fossil occurrences, and biogeographic range information among others. Including all relevant data into a single joint model is very challenging both conceptually and computationally. Advanced computational software packages that allow robust development of compatible (sub-)models which can be composed into a full model hierarchy have played a key role in these developments. Developing such software frameworks is increasingly a major scientific activity in its own right, and comes with specific challenges, from practical software design, development and engineering challenges to statistical and conceptual modelling challenges. BEAST 2 is one such computational software platform, and was first announced over 4 years ago. Here we describe a series of major new developments in the BEAST 2 core platform and model hierarchy that have occurred since the first release of the software, culminating in the recent 2.5 release.
Collapse
Affiliation(s)
- Remco Bouckaert
- Centre of Computational Evolution, University of Auckland, Auckland, New Zealand
- Max Planck Institute for the Science of Human History, Jena, Germany
| | - Timothy G. Vaughan
- ETH Zürich, Department of Biosystems Science and Engineering, 4058 Basel, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Joëlle Barido-Sottani
- ETH Zürich, Department of Biosystems Science and Engineering, 4058 Basel, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Sebastián Duchêne
- Department of Biochemistry and Molecular Biology, University of Melbourne, Melbourne, Victoria, Australia
| | - Mathieu Fourment
- ithree institute, University of Technology Sydney, Sydney, Australia
| | | | | | - Graham Jones
- Department of Biological and Environmental Sciences, University of Gothenburg, Box 461, SE 405 30 Göteborg, Sweden
| | - Denise Kühnert
- Max Planck Institute for the Science of Human History, Jena, Germany
| | - Nicola De Maio
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridgeshire, UK
| | - Michael Matschiner
- Department of Environmental Sciences, University of Basel, 4051 Basel, Switzerland
| | - Fábio K. Mendes
- Centre of Computational Evolution, University of Auckland, Auckland, New Zealand
| | - Nicola F. Müller
- ETH Zürich, Department of Biosystems Science and Engineering, 4058 Basel, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Huw A. Ogilvie
- Department of Computer Science, Rice University, Houston, TX 77005-1892, USA
| | - Louis du Plessis
- Department of Zoology, University of Oxford, Oxford, OX1 3PS, UK
| | - Alex Popinga
- Centre of Computational Evolution, University of Auckland, Auckland, New Zealand
| | - Andrew Rambaut
- Institute of Evolutionary Biology, University of Edinburgh, Ashworth Laboratories, Edinburgh, EH9 3FL UK
| | - David Rasmussen
- Department of Entomology and Plant Pathology, North Carolina State University, Raleigh, NC 27695, USA
| | - Igor Siveroni
- Department of Infectious Disease Epidemiology, Imperial College London, Norfolk Place, W2 1PG, UK
| | - Marc A. Suchard
- Department of Biomathematics, David Geffen School of Medicine, University of California, Los Angeles, CA, USA
| | - Chieh-Hsi Wu
- Department of Statistics, University of Oxford, OX1 3LB, UK
| | - Dong Xie
- Centre of Computational Evolution, University of Auckland, Auckland, New Zealand
| | - Chi Zhang
- Institute of Vertebrate Paleontology and Paleoanthropology, Chinese Academy of Sciences, Beijing, China
| | - Tanja Stadler
- ETH Zürich, Department of Biosystems Science and Engineering, 4058 Basel, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Alexei J. Drummond
- Centre of Computational Evolution, University of Auckland, Auckland, New Zealand
| |
Collapse
|
44
|
Abstract
OBJECTIVES Molecular epidemiology is applied to various aspects of HIV transmission analyses. With ultradeep sequencing (UDS), in-depth characterization of transmission episodes involving minority variants is permitted. We explored HIV-1 epidemiological linkage and evaluated characteristics of transmission dynamics and transmitted drug resistance (TDR) detection through the added value of UDS. DESIGN HIV pol gene fragments were sequenced by UDS and Sanger sequencing on samples of 70 HIV-1-infected, treatment-naive recently diagnosed MSM. METHODS Pairwise genetic distances and maximum likelihood phylogenies were computed. Transmission events were identified as clades with branch support at least 70% and intraclade genetic difference less than 4.5%. TDR mutations were recognized from the TDR consensus list. Transmission directionality, directness and inoculum size were inferred from tree topologies. RESULTS Both datasets concurred in the identification of seven transmission pairs and one cluster of three patients. With UDS, direction of transmission was inferred in four out of eight chains. Evidence for multiple founder viruses was found in two out of eight chains. No transmission of minority-resistant variants was evidenced. TDR mutations prevalence in protease and reverse transcriptase fragments was 4.3% with Sanger sequencing and 18.6% with UDS. CONCLUSION Although Sanger sequencing and UDS identified the same transmission chains, UDS provided additional information on founder viruses, direction of transmission and levels of TDR. Nevertheless, topology of clusters was not always consistent across gene fragments, calling for a cautious interpretation of the data. Moreover, unobserved intermediary links cannot be excluded. Phylogenetic analysis use as a forensic technique for HIV transmission investigations is risky.
Collapse
|
45
|
Martin MA, Lee RS, Cowley LA, Gardy JL, Hanage WP. Within-host Mycobacterium tuberculosis diversity and its utility for inferences of transmission. Microb Genom 2018; 4. [PMID: 30303479 PMCID: PMC6249434 DOI: 10.1099/mgen.0.000217] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Whole genome sequencing in conjunction with traditional epidemiology has been used to reconstruct transmission networks of Mycobacterium tuberculosis during outbreaks. Given its low mutation rate, genetic diversity within M. tuberculosis outbreaks can be extremely limited - making it difficult to determine precisely who transmitted to whom. In addition to consensus SNPs (cSNPs), examining heterogeneous alleles (hSNPs) has been proposed to improve resolution. However, few studies have examined the potential biases in detecting these hSNPs. Here, we analysed genome sequence data from 25 specimens from British Columbia, Canada. Specimens were sequenced to a depth of 112-296×. We observed biases in read depth, base quality, strand distribution and read placement where possible hSNPs were initially identified, so we applied conservative filters to reduce false positives. Overall, there was phylogenetic concordance between the observed 2542 cSNP and 63 hSNP loci. Furthermore, we identified hSNPs shared exclusively by epidemiologically linked patients, supporting their use in transmission inferences. We conclude that hSNPs may add resolution to transmission networks, particularly where the overall genetic diversity is low.
Collapse
Affiliation(s)
- Michael A Martin
- 1Center for Communicable Disease Dynamics, Department of Epidemiology, Harvard T. H. Chan School of Public Health, Boston, MA, USA
| | - Robyn S Lee
- 2Department of Epidemiology, Harvard University, Boston, MA 02115, USA
| | - Lauren A Cowley
- 1Center for Communicable Disease Dynamics, Department of Epidemiology, Harvard T. H. Chan School of Public Health, Boston, MA, USA
| | - Jennifer L Gardy
- 3School of Population and Public Health, University of British Columbia, Vancouver, Canada.,4British Columbia Centre for Disease Control, Vancouver, Canada
| | - William P Hanage
- 1Center for Communicable Disease Dynamics, Department of Epidemiology, Harvard T. H. Chan School of Public Health, Boston, MA, USA
| |
Collapse
|