51
|
Eyre DW, Lumley SF, O'Donnell D, Stoesser NE, Matthews PC, Howarth A, Hatch SB, Marsden BD, Cox S, James T, Cornall RJ, Stuart DI, Screaton G, Ebner D, Crook DW, Conlon CP, Jeffery K, Walker TM, Peto TEA. Stringent thresholds in SARS-CoV-2 IgG assays lead to under-detection of mild infections. BMC Infect Dis 2021; 21:187. [PMID: 33602152 PMCID: PMC7889711 DOI: 10.1186/s12879-021-05878-2] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2020] [Accepted: 01/22/2021] [Indexed: 12/29/2022] Open
Abstract
BACKGROUND Thresholds for SARS-CoV-2 antibody assays have typically been determined using samples from symptomatic, often hospitalised, patients. In this setting the sensitivity and specificity of the best performing assays can both exceed 98%. However, antibody assay performance following mild infection is less clear. METHODS We assessed quantitative IgG responses in a cohort of healthcare workers in Oxford, UK, with a high pre-test probability of Covid-19, in particular the 991/11,475(8.6%) who reported loss of smell/taste. We use anosmia/ageusia and other risk factors as probes for Covid-19 infection potentially undiagnosed by immunoassays by investigating their relationship with antibody readings either side of assay thresholds. RESULTS The proportion of healthcare workers reporting anosmia/ageusia increased at antibody readings below diagnostic thresholds using an in-house ELISA (n = 9324) and the Abbott Architect chemiluminescent microparticle immunoassay (CMIA; n = 11,324): 426/906 (47%) reported anosmia/ageusia with a positive ELISA, 59/449 (13.1%) with high-negative and 326/7969 (4.1%) with low-negative readings. Similarly, by CMIA, 518/1093 (47.4%) with a positive result reported anosmia/ageusia, 106/686 (15.5%) with a high-negative and 358/9563 (3.7%) with a low-negative result. Adjusting for the proportion of staff reporting anosmia/ageusia suggests the sensitivity of both assays in mild infection is lower than previously reported: Oxford ELISA 89.8% (95%CI 86.6-92.8%) and Abbott CMIA 79.3% (75.9-82.7%). CONCLUSION Following mild SARS-CoV-2 infection 10-30% of individuals may have negative immunoassay results. While lowered diagnostic thresholds may result in unacceptable specificity, our findings have implications for epidemiological analyses and result interpretation in individuals with a high pre-test probability. Samples from mild PCR-confirmed infections should be included in SARS-CoV-2 immunoassay evaluations.
Collapse
|
52
|
Lumley SF, O'Donnell D, Stoesser NE, Matthews PC, Howarth A, Hatch SB, Marsden BD, Cox S, James T, Warren F, Peck LJ, Ritter TG, de Toledo Z, Warren L, Axten D, Cornall RJ, Jones EY, Stuart DI, Screaton G, Ebner D, Hoosdally S, Chand M, Crook DW, O'Donnell AM, Conlon CP, Pouwels KB, Walker AS, Peto TEA, Hopkins S, Walker TM, Jeffery K, Eyre DW. Antibody Status and Incidence of SARS-CoV-2 Infection in Health Care Workers. N Engl J Med 2021; 384:533-540. [PMID: 33369366 PMCID: PMC7781098 DOI: 10.1056/nejmoa2034545] [Citation(s) in RCA: 609] [Impact Index Per Article: 203.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
BACKGROUND The relationship between the presence of antibodies to severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) and the risk of subsequent reinfection remains unclear. METHODS We investigated the incidence of SARS-CoV-2 infection confirmed by polymerase chain reaction (PCR) in seropositive and seronegative health care workers attending testing of asymptomatic and symptomatic staff at Oxford University Hospitals in the United Kingdom. Baseline antibody status was determined by anti-spike (primary analysis) and anti-nucleocapsid IgG assays, and staff members were followed for up to 31 weeks. We estimated the relative incidence of PCR-positive test results and new symptomatic infection according to antibody status, adjusting for age, participant-reported gender, and changes in incidence over time. RESULTS A total of 12,541 health care workers participated and had anti-spike IgG measured; 11,364 were followed up after negative antibody results and 1265 after positive results, including 88 in whom seroconversion occurred during follow-up. A total of 223 anti-spike-seronegative health care workers had a positive PCR test (1.09 per 10,000 days at risk), 100 during screening while they were asymptomatic and 123 while symptomatic, whereas 2 anti-spike-seropositive health care workers had a positive PCR test (0.13 per 10,000 days at risk), and both workers were asymptomatic when tested (adjusted incidence rate ratio, 0.11; 95% confidence interval, 0.03 to 0.44; P = 0.002). There were no symptomatic infections in workers with anti-spike antibodies. Rate ratios were similar when the anti-nucleocapsid IgG assay was used alone or in combination with the anti-spike IgG assay to determine baseline status. CONCLUSIONS The presence of anti-spike or anti-nucleocapsid IgG antibodies was associated with a substantially reduced risk of SARS-CoV-2 reinfection in the ensuing 6 months. (Funded by the U.K. Government Department of Health and Social Care and others.).
Collapse
|
53
|
Lipworth S, Vihta KD, Chau KK, Kavanagh J, Davies T, George S, Barker L, Vaughan A, Andersson M, Jeffery K, Oakley S, Morgan M, Peto TEA, Crook DW, Walker AS, Stoesser N. Ten years of population-level genomic Escherichia coli and Klebsiella pneumoniae serotype surveillance informs vaccine development for invasive infections. Clin Infect Dis 2021; 73:2276-2282. [PMID: 33411882 PMCID: PMC8677521 DOI: 10.1093/cid/ciab006] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2020] [Indexed: 11/14/2022] Open
Abstract
BACKGROUND The incidence of bloodstream infections (BSIs) caused by Escherichia coli and Klebsiella pneumoniae is increasing, with substantial associated morbidity, mortality and antimicrobial resistance. Unbiased serotyping studies to guide vaccine target selection are limited. METHODS We conducted unselected, population-level genomic surveillance of bloodstream E. coli and Klebsiella pneumoniae isolates from 2008-2018 in Oxfordshire, UK. We supplemented this with an analysis of publicly available global sequencing data (n=3678). RESULTS We sequenced 3478 E. coli isolates (3278 passed quality control) and 556 K. pneumoniae isolates (535 (K-antigen) and 549 (O-antigen) passed quality control). The four most common E. coli O-antigens (O1/O2/O6/O25) were identified in 1499/3278 isolates; the incidence of these O-types increased over time (IRRy=1.14, 95% CI:1.11-1.16). These O-types accounted for 616/1434 multidrug resistant (MDR) and 173/256 extended-spectrum beta-lactamase(ESBL)-resistant isolates in Oxfordshire, but only 19/90 carbapenem-resistant isolates across all studies. For Klebsiella pneumoniae, the most common O-antigens (O2v2/O1v1/O3b/O1v2) accounted for 410/549 isolates; the incidence of BSIs caused by these also increased annually (IRRy=1.09; 95% CI:1.05-1.12). These O-types accounted for 122/148 MDR and 106/123 ESBL isolates in Oxfordshire and 557/734 carbapenem-resistant isolates across all studies. Conversely we observed substantial capsular antigen diversity. Analysis of 3678 isolates from global studies demonstrated the generalisability of these findings. For E. coli, based on serotyping, the ExPEC4V and ExPEC10V vaccines under investigation would cover 46% and 72% of Oxfordshire isolates respectively, and 47% and 71% of MDR isolates. CONCLUSIONS O-antigen targeted vaccines may be useful in reducing the morbidity, mortality and antimicrobial resistance associated with E. coli and K. pneumoniae BSIs.
Collapse
|
54
|
Ainsworth M, Andersson M, Auckland K, Baillie JK, Barnes E, Beer S, Beveridge A, Bibi S, Blackwell L, Borak M, Bown A, Brooks T, Burgess-Brown NA, Camara S, Catton M, Chau KK, Christott T, Clutterbuck E, Coker J, Cornall RJ, Cox S, Crawford-Jones D, Crook DW, D'Arcangelo S, Dejnirattsai W, Dequaire JMM, Dimitriadis S, Dingle KE, Doherty G, Dold C, Dong T, Dunachie SJ, Ebner D, Emmenegger M, Espinosa A, Eyre DW, Fairhead R, Fassih S, Feehily C, Felle S, Fernandez-Cid A, Fernandez Mendoza M, Foord TH, Fordwoh T, Fox McKee D, Frater J, Gallardo Sanchez V, Gent N, Georgiou D, Groves CJ, Hallis B, Hammond PM, Hatch SB, Harvala HJ, Hill J, Hoosdally SJ, Horsington B, Howarth A, James T, Jeffery K, Jones E, Justice A, Karpe F, Kavanagh J, Kim DS, Kirton R, Klenerman P, Knight JC, Koukouflis L, Kwok A, Leuschner U, Levin R, Linder A, Lockett T, Lumley SF, Marinou S, Marsden BD, Martinez J, Martins Ferreira L, Mason L, Matthews PC, Mentzer AJ, Mobbs A, Mongkolsapaya J, Morrow J, Mukhopadhyay SMM, Neville MJ, Oakley S, Oliveira M, Otter A, Paddon K, Pascoe J, Peng Y, Perez E, Perumal PK, Peto TEA, Pickford H, Ploeg RJ, Pollard AJ, Richardson A, Ritter TG, Roberts DJ, Rodger G, Rollier CS, Rowe C, Rudkin JK, Screaton G, Semple MG, Sienkiewicz A, Silva-Reyes L, Skelly DT, Sobrino Diaz A, Stafford L, Stockdale L, Stoesser N, Street T, Stuart DI, Sweed A, Taylor A, Thraves H, Tsang HP, Verheul MK, Vipond R, Walker TM, Wareing S, Warren Y, Wells C, Wilson C, Withycombe K, Young RK. Performance characteristics of five immunoassays for SARS-CoV-2: a head-to-head benchmark comparison. THE LANCET. INFECTIOUS DISEASES 2020; 20:1390-1400. [PMID: 32979318 PMCID: PMC7511171 DOI: 10.1016/s1473-3099(20)30634-4] [Citation(s) in RCA: 260] [Impact Index Per Article: 65.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/07/2020] [Revised: 07/29/2020] [Accepted: 08/03/2020] [Indexed: 01/19/2023]
Abstract
BACKGROUND Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has caused a global pandemic in 2020. Testing is crucial for mitigating public health and economic effects. Serology is considered key to population-level surveillance and potentially individual-level risk assessment. However, immunoassay performance has not been compared on large, identical sample sets. We aimed to investigate the performance of four high-throughput commercial SARS-CoV-2 antibody immunoassays and a novel 384-well ELISA. METHODS We did a head-to-head assessment of SARS-CoV-2 IgG assay (Abbott, Chicago, IL, USA), LIAISON SARS-CoV-2 S1/S2 IgG assay (DiaSorin, Saluggia, Italy), Elecsys Anti-SARS-CoV-2 assay (Roche, Basel, Switzerland), SARS-CoV-2 Total assay (Siemens, Munich, Germany), and a novel 384-well ELISA (the Oxford immunoassay). We derived sensitivity and specificity from 976 pre-pandemic blood samples (collected between Sept 4, 2014, and Oct 4, 2016) and 536 blood samples from patients with laboratory-confirmed SARS-CoV-2 infection, collected at least 20 days post symptom onset (collected between Feb 1, 2020, and May 31, 2020). Receiver operating characteristic (ROC) curves were used to assess assay thresholds. FINDINGS At the manufacturers' thresholds, for the Abbott assay sensitivity was 92·7% (95% CI 90·2-94·8) and specificity was 99·9% (99·4-100%); for the DiaSorin assay sensitivity was 96·2% (94·2-97·7) and specificity was 98·9% (98·0-99·4); for the Oxford immunoassay sensitivity was 99·1% (97·8-99·7) and specificity was 99·0% (98·1-99·5); for the Roche assay sensitivity was 97·2% (95·4-98·4) and specificity was 99·8% (99·3-100); and for the Siemens assay sensitivity was 98·1% (96·6-99·1) and specificity was 99·9% (99·4-100%). All assays achieved a sensitivity of at least 98% with thresholds optimised to achieve a specificity of at least 98% on samples taken 30 days or more post symptom onset. INTERPRETATION Four commercial, widely available assays and a scalable 384-well ELISA can be used for SARS-CoV-2 serological testing to achieve sensitivity and specificity of at least 98%. The Siemens assay and Oxford immunoassay achieved these metrics without further optimisation. This benchmark study in immunoassay assessment should enable refinements of testing strategies and the best use of serological testing resource to benefit individuals and population health. FUNDING Public Health England and UK National Institute for Health Research.
Collapse
|
55
|
Lipworth S, Pickford H, Sanderson N, Chau KK, Kavanagh J, Barker L, Vaughan A, Swann J, Andersson M, Jeffery K, Morgan M, Peto TEA, Crook DW, Stoesser N, Walker AS. Optimized use of Oxford Nanopore flowcells for hybrid assemblies. Microb Genom 2020; 6:mgen000453. [PMID: 33174830 PMCID: PMC7725331 DOI: 10.1099/mgen.0.000453] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2020] [Accepted: 09/25/2020] [Indexed: 01/16/2023] Open
Abstract
Hybrid assemblies are highly valuable for studies of Enterobacteriaceae due to their ability to fully resolve the structure of mobile genetic elements, such as plasmids, which are involved in the carriage of clinically important genes (e.g. those involved in antimicrobial resistance/virulence). The widespread application of this technique is currently primarily limited by cost. Recent data have suggested that non-inferior, and even superior, hybrid assemblies can be produced using a fraction of the total output from a multiplexed nanopore [Oxford Nanopore Technologies (ONT)] flowcell run. In this study we sought to determine the optimal minimal running time for flowcells when acquiring reads for hybrid assembly. We then evaluated whether the ONT wash kit might allow users to exploit shorter running times by sequencing multiple libraries per flowcell. After 24 h of sequencing, most chromosomes and plasmids had circularized and there was no benefit associated with longer running times. Quality was similar at 12 h, suggesting that shorter running times are likely to be acceptable for certain applications (e.g. plasmid genomics). The ONT wash kit was highly effective in removing DNA between libraries. Contamination between libraries did not appear to affect subsequent hybrid assemblies, even when the same barcodes were used successively on a single flowcell. Utilizing shorter run times in combination with between-library nuclease washes allows at least 36 Enterobacteriaceae isolates to be sequenced per flowcell, significantly reducing the per-isolate sequencing cost. Ultimately this will facilitate large-scale studies utilizing hybrid assembly, advancing our understanding of the genomics of key human pathogens.
Collapse
|
56
|
Lumley SF, Eyre DW, McNaughton AL, Howarth A, Hoosdally S, Hatch SB, Kavanagh J, Chau KK, Downs LO, Cox S, Dunn L, Justice A, Wareing S, Dingle K, Rudkin J, Auckland K, Fyfe A, Bolton J, Paton R, Mentzer AJ, Jeffery K, Andersson MI, James T, Peto TEA, Marsden BD, Screaton G, Cornall RJ, Klenerman P, Ebner D, Stuart DI, Crook DW, Stoesser N, Kennedy SH, Thompson C, Gupta S, Matthews PC. SARS-CoV-2 antibody prevalence, titres and neutralising activity in an antenatal cohort, United Kingdom, 14 April to 15 June 2020. Euro Surveill 2020; 25:2001721. [PMID: 33094717 PMCID: PMC7651878 DOI: 10.2807/1560-7917.es.2020.25.41.2001721] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2020] [Accepted: 10/22/2020] [Indexed: 12/29/2022] Open
Abstract
SARS-CoV-2 IgG screening of 1,000 antenatal serum samples in the Oxford area, United Kingdom, between 14 April and 15 June 2020, yielded a 5.3% seroprevalence, mirroring contemporaneous regional data. Among the 53 positive samples, 39 showed in vitro neutralisation activity, correlating with IgG titre (Pearson's correlation p<0.0001). While SARS-CoV-2 seroprevalence in pregnancy cohorts could potentially inform population surveillance, clinical correlates of infection and immunity in pregnancy, and antenatal epidemiology evolution over time need further study.
Collapse
|
57
|
Janezic S, Dingle K, Alvin J, Accetto T, Didelot X, Crook DW, Lacy DB, Rupnik M. Comparative genomics of Clostridioides difficile toxinotypes identifies module-based toxin gene evolution. Microb Genom 2020; 6:mgen000449. [PMID: 33030421 PMCID: PMC7660249 DOI: 10.1099/mgen.0.000449] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2020] [Accepted: 09/14/2020] [Indexed: 12/16/2022] Open
Abstract
Clostridioides difficile is a common cause of nosocomial diarrhoea. Toxins TcdA and TcdB are considered to be the main virulence factors and are encoded by the PaLoc region, while the binary toxin encoded in the CdtLoc region also contributes to pathogenicity. Variant toxinotypes reflect the genetic diversity of a key toxin-encoding 19 kb genetic element (the PaLoc). Here, we present analysis of a comprehensive collection of all known major C. difficile toxinotypes to address the evolutionary relationships of the toxin gene variants, the mechanisms underlying the origin and development of variability in toxin genes and the PaLoc, and the relationship between structure and function in TcdB variants. The structure of both toxin genes is modular, composed of interspersed blocks of sequences corresponding to functional domains and having different evolutionary histories, as shown by the distribution of mutations along the toxin genes and by incongruences of domain phylogenies compared to overall C. difficile cluster organization. In TcdB protein, four mutation patterns could be differentiated, which correlated very well with the type of TcdB cytopathic effect (CPE) on cultured cells. Mapping these mutations to the three-dimensional structure of the TcdB showed that the majority of the variation occurs in surface residues and that point mutation at residue 449 in alpha helix 16 differentiated strains with different types of CPE. In contrast to the PaLoc, phylogenetic trees of the CdtLoc were more consistent with the core genome phylogenies, but there were clues that CdtLoc can also be exchanged between strains.
Collapse
|
58
|
Eyre DW, Lumley SF, O'Donnell D, Campbell M, Sims E, Lawson E, Warren F, James T, Cox S, Howarth A, Doherty G, Hatch SB, Kavanagh J, Chau KK, Fowler PW, Swann J, Volk D, Yang-Turner F, Stoesser N, Matthews PC, Dudareva M, Davies T, Shaw RH, Peto L, Downs LO, Vogt A, Amini A, Young BC, Drennan PG, Mentzer AJ, Skelly DT, Karpe F, Neville MJ, Andersson M, Brent AJ, Jones N, Martins Ferreira L, Christott T, Marsden BD, Hoosdally S, Cornall R, Crook DW, Stuart DI, Screaton G, Peto TEA, Holthof B, O'Donnell AM, Ebner D, Conlon CP, Jeffery K, Walker TM. Differential occupational risks to healthcare workers from SARS-CoV-2 observed during a prospective observational study. eLife 2020; 9:e60675. [PMID: 32820721 PMCID: PMC7486122 DOI: 10.7554/elife.60675] [Citation(s) in RCA: 175] [Impact Index Per Article: 43.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2020] [Accepted: 08/18/2020] [Indexed: 12/16/2022] Open
Abstract
We conducted voluntary Covid-19 testing programmes for symptomatic and asymptomatic staff at a UK teaching hospital using naso-/oro-pharyngeal PCR testing and immunoassays for IgG antibodies. 1128/10,034 (11.2%) staff had evidence of Covid-19 at some time. Using questionnaire data provided on potential risk-factors, staff with a confirmed household contact were at greatest risk (adjusted odds ratio [aOR] 4.82 [95%CI 3.45-6.72]). Higher rates of Covid-19 were seen in staff working in Covid-19-facing areas (22.6% vs. 8.6% elsewhere) (aOR 2.47 [1.99-3.08]). Controlling for Covid-19-facing status, risks were heterogenous across the hospital, with higher rates in acute medicine (1.52 [1.07-2.16]) and sporadic outbreaks in areas with few or no Covid-19 patients. Covid-19 intensive care unit staff were relatively protected (0.44 [0.28-0.69]), likely by a bundle of PPE-related measures. Positive results were more likely in Black (1.66 [1.25-2.21]) and Asian (1.51 [1.28-1.77]) staff, independent of role or working location, and in porters and cleaners (2.06 [1.34-3.15]).
Collapse
|
59
|
Constantinides B, Chau KK, Quan TP, Rodger G, Andersson MI, Jeffery K, Lipworth S, Gweon HS, Peniket A, Pike G, Millo J, Byukusenge M, Holdaway M, Gibbons C, Mathers AJ, Crook DW, Peto TE, Walker AS, Stoesser N. Genomic surveillance of Escherichia coli and Klebsiella spp. in hospital sink drains and patients. Microb Genom 2020; 6:mgen000391. [PMID: 32553019 PMCID: PMC7478627 DOI: 10.1099/mgen.0.000391] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2020] [Accepted: 05/24/2020] [Indexed: 12/20/2022] Open
Abstract
Escherichia coli and Klebsiella spp. are important human pathogens that cause a wide spectrum of clinical disease. In healthcare settings, sinks and other wastewater sites have been shown to be reservoirs of antimicrobial-resistant E. coli and Klebsiella spp., particularly in the context of outbreaks of resistant strains amongst patients. Without focusing exclusively on resistance markers or a clinical outbreak, we demonstrate that many hospital sink drains are abundantly and persistently colonized with diverse populations of E. coli, Klebsiella pneumoniae and Klebsiella oxytoca, including both antimicrobial-resistant and susceptible strains. Using whole-genome sequencing of 439 isolates, we show that environmental bacterial populations are largely structured by ward and sink, with only a handful of lineages, such as E. coli ST635, being widely distributed, suggesting different prevailing ecologies, which may vary as a result of different inputs and selection pressures. Whole-genome sequencing of 46 contemporaneous patient isolates identified one (2 %; 95 % CI 0.05-11 %) E. coli urine infection-associated isolate with high similarity to a prior sink isolate, suggesting that sinks may contribute to up to 10 % of infections caused by these organisms in patients on the ward over the same timeframe. Using metagenomics from 20 sink-timepoints, we show that sinks also harbour many clinically relevant antimicrobial resistance genes including blaCTX-M, blaSHV and mcr, and may act as niches for the exchange and amplification of these genes. Our study reinforces the potential role of sinks in contributing to Enterobacterales infection and antimicrobial resistance in hospital patients, something that could be amenable to intervention. This article contains data hosted by Microreact.
Collapse
|
60
|
Armstrong J, Rudkin JK, Allen N, Crook DW, Wilson DJ, Wyllie DH, O’Connell AM. Dynamic linkage of COVID-19 test results between Public Health England's Second Generation Surveillance System and UK Biobank. Microb Genom 2020; 6:mgen000397. [PMID: 32553051 PMCID: PMC7478634 DOI: 10.1099/mgen.0.000397] [Citation(s) in RCA: 81] [Impact Index Per Article: 20.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2020] [Accepted: 06/02/2020] [Indexed: 12/21/2022] Open
Abstract
UK Biobank (UKB) is an international health resource enabling research into the genetic and lifestyle determinants of common diseases of middle and older age. It comprises 500 000 participants. Public Health England's Second Generation Surveillance System is a centralized microbiology database covering English clinical diagnostics laboratories that provides national surveillance of legally notifiable infections, bacterial isolations and antimicrobial resistance. We previously developed secure, pseudonymized, individual-level linkage of these systems. In this study, we implemented rapid dynamic linkage, which allows us to provide a regular feed of new COVID-19 (SARS-CoV-2) test results to UKB to facilitate rapid and urgent research into the epidemiological and human genetic risk factors for severe infection in the cohort. Here, we have characterized the first 1352 cases of COVID-19 in UKB participants, of whom 895 met our working definition of severe COVID-19 as inpatients hospitalized on or after 16 March 2020. We found that the incidence of severe COVID-19 among UKB cases was 27.4 % lower than the general population in England, although this difference varied significantly by age and sex. The total number of UKB cases could be estimated as 0.6 % of the publicly announced number of cases in England. We considered how increasing case numbers will affect the power of genome-wide association studies. This new dynamic linkage system has further potential to facilitate the investigation of other infections and the prospective collection of microbiological cultures to create a microbiological biobank (bugbank) for studying the interaction of environment, human and microbial genetics on infection in the UKB cohort.
Collapse
|
61
|
Bush SJ, Connor TR, Peto TE, Crook DW, Walker AS. Evaluation of methods for detecting human reads in microbial sequencing datasets. Microb Genom 2020; 6:mgen000393. [PMID: 32558637 PMCID: PMC7478626 DOI: 10.1099/mgen.0.000393] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2019] [Accepted: 05/25/2020] [Indexed: 12/14/2022] Open
Abstract
Sequencing data from host-associated microbes can often be contaminated by the body of the investigator or research subject. Human DNA is typically removed from microbial reads either by subtractive alignment (dropping all reads that map to the human genome) or by using a read classification tool to predict those of human origin, and then discarding them. To inform best practice guidelines, we benchmarked eight alignment-based and two classification-based methods of human read detection using simulated data from 10 clinically prevalent bacteria and three viruses, into which contaminating human reads had been added. While the majority of methods successfully detected >99 % of the human reads, they were distinguishable by variance. The most precise methods, with negligible variance, were Bowtie2 and SNAP, both of which misidentified few, if any, bacterial reads (and no viral reads) as human. While correctly detecting a similar number of human reads, methods based on taxonomic classification, such as Kraken2 and Centrifuge, could misclassify bacterial reads as human, although the extent of this was species-specific. Among the most sensitive methods of human read detection was BWA, although this also made the greatest number of false positive classifications. Across all methods, the set of human reads not identified as such, although often representing <0.1 % of the total reads, were non-randomly distributed along the human genome with many originating from the repeat-rich sex chromosomes. For viral reads and longer (>300 bp) bacterial reads, the highest performing approaches were classification-based, using Kraken2 or Centrifuge. For shorter (c. 150 bp) bacterial reads, combining multiple methods of human read detection maximized the recovery of human reads from contaminated short read datasets without being compromised by false positives. A particularly high-performance approach with shorter bacterial reads was a two-stage classification using Bowtie2 followed by SNAP. Using this approach, we re-examined 11 577 publicly archived bacterial read sets for hitherto undetected human contamination. We were able to extract a sufficient number of reads to call known human SNPs, including those with clinical significance, in 6 % of the samples. These results show that phenotypically distinct human sequence is detectable in publicly archived microbial read datasets.
Collapse
|
62
|
Adams ER, Ainsworth M, Anand R, Andersson MI, Auckland K, Baillie JK, Barnes E, Beer S, Bell JI, Berry T, Bibi S, Carroll M, Chinnakannan SK, Clutterbuck E, Cornall RJ, Crook DW, de Silva T, Dejnirattisai W, Dingle KE, Dold C, Espinosa A, Eyre DW, Farmer H, Fernandez Mendoza M, Georgiou D, Hoosdally SJ, Hunter A, Jefferey K, Kelly DF, Klenerman P, Knight J, Knowles C, Kwok AJ, Leuschner U, Levin R, Liu C, López-Camacho C, Martinez J, Matthews PC, McGivern H, Mentzer AJ, Milton J, Mongkolsapaya J, Moore SC, Oliveira MS, Pereira F, Perez E, Peto T, Ploeg RJ, Pollard A, Prince T, Roberts DJ, Rudkin JK, Sanchez V, Screaton GR, Semple MG, Slon-Campos J, Skelly DT, Smith EN, Sobrinodiaz A, Staves J, Stuart DI, Supasa P, Surik T, Thraves H, Tsang P, Turtle L, Walker AS, Wang B, Washington C, Watkins N, Whitehouse J. Antibody testing for COVID-19: A report from the National COVID Scientific Advisory Panel. Wellcome Open Res 2020; 5:139. [PMID: 33748431 PMCID: PMC7941096 DOI: 10.12688/wellcomeopenres.15927.1] [Citation(s) in RCA: 129] [Impact Index Per Article: 32.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/04/2020] [Indexed: 01/26/2023] Open
Abstract
Background: The COVID-19 pandemic caused >1 million infections during January-March 2020. There is an urgent need for reliable antibody detection approaches to support diagnosis, vaccine development, safe release of individuals from quarantine, and population lock-down exit strategies. We set out to evaluate the performance of ELISA and lateral flow immunoassay (LFIA) devices. Methods: We tested plasma for COVID (severe acute respiratory syndrome coronavirus 2; SARS-CoV-2) IgM and IgG antibodies by ELISA and using nine different LFIA devices. We used a panel of plasma samples from individuals who have had confirmed COVID infection based on a PCR result (n=40), and pre-pandemic negative control samples banked in the UK prior to December-2019 (n=142). Results: ELISA detected IgM or IgG in 34/40 individuals with a confirmed history of COVID infection (sensitivity 85%, 95%CI 70-94%), vs. 0/50 pre-pandemic controls (specificity 100% [95%CI 93-100%]). IgG levels were detected in 31/31 COVID-positive individuals tested ≥10 days after symptom onset (sensitivity 100%, 95%CI 89-100%). IgG titres rose during the 3 weeks post symptom onset and began to fall by 8 weeks, but remained above the detection threshold. Point estimates for the sensitivity of LFIA devices ranged from 55-70% versus RT-PCR and 65-85% versus ELISA, with specificity 95-100% and 93-100% respectively. Within the limits of the study size, the performance of most LFIA devices was similar. Conclusions: Currently available commercial LFIA devices do not perform sufficiently well for individual patient applications. However, ELISA can be calibrated to be specific for detecting and quantifying SARS-CoV-2 IgM and IgG and is highly sensitive for IgG from 10 days following first symptoms.
Collapse
|
63
|
Roope LSJ, Tonkin-Crine S, Herd N, Michie S, Pouwels KB, Castro-Sanchez E, Sallis A, Hopkins S, Robotham JV, Crook DW, Peto T, Peters M, Butler CC, Walker AS, Wordsworth S. Reducing expectations for antibiotics in primary care: a randomised experiment to test the response to fear-based messages about antimicrobial resistance. BMC Med 2020; 18:110. [PMID: 32321478 PMCID: PMC7178623 DOI: 10.1186/s12916-020-01553-6] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/07/2020] [Accepted: 03/09/2020] [Indexed: 12/02/2022] Open
Abstract
BACKGROUND To reduce inappropriate antibiotic use, public health campaigns often provide fear-based information about antimicrobial resistance (AMR). Meta-analyses have found that fear-based campaigns in other contexts are likely to be ineffective unless respondents feel confident they can carry out the recommended behaviour ('self-efficacy'). This study aimed to test the likely impact of fear-based messages, with and without empowering self-efficacy elements, on patient consultations/antibiotic requests for influenza-like illnesses, using a randomised design. METHODS We hypothesised that fear-based messages containing empowering information about self-management without antibiotics would be more effective than fear alone, particularly in a pre-specified subgroup with low AMR awareness. Four thousand respondents from an online panel, representative of UK adults, were randomised to receive three different messages about antibiotic use and AMR, designed to induce fear about AMR to varying degrees. Two messages (one 'strong-fear', one 'mild-fear') also contained empowering information regarding influenza-like symptoms being easily self-managed without antibiotics. The main outcome measures were self-reported effect of information on likelihood of visiting a doctor and requesting antibiotics, for influenza-like illness, analysed separately according to whether or not the AMR information was 'very/somewhat new' to respondents, pre-specified based on a previous (non-randomised) survey. RESULTS The 'fear-only' message was 'very/somewhat new' to 285/1000 (28.5%) respondents, 'mild-fear-plus-empowerment' to 336/1500 (22.4%), and 'strong-fear-plus-empowerment' to 388/1500 (25.9%) (p = 0.002). Of those for whom the respective information was 'very/somewhat new', only those given the 'strong-fear-plus-empowerment' message said they would be less likely to request antibiotics if they visited a doctor for an influenza-like illness (p < 0.0001; 182/388 (46.9%) 'much less likely'/'less likely', versus 116/336 (34.5%) with 'mild-fear-plus-empowerment' versus 85/285 (29.8%) with 'fear-alone'). Those for whom the respective information was not 'very/somewhat new' said they would be less likely to request antibiotics for influenza-like illness (p < 0.0001) across all messages (interaction p < 0.0001 versus 'very/somewhat new' subgroup). The three messages had analogous self-reported effects on likelihood of visiting a doctor and in subgroups defined by believing antibiotics would 'definitely/probably' help an influenza-like illness. Results were reproduced in an independent randomised survey (additional 4000 adults). CONCLUSIONS Fear could be effective in public campaigns to reduce inappropriate antibiotic use, but should be combined with messages empowering patients to self-manage symptoms effectively without antibiotics.
Collapse
|
64
|
Kouchaki S, Yang Y, Lachapelle A, Walker TM, Walker AS, Peto TEA, Crook DW, Clifton DA. Multi-Label Random Forest Model for Tuberculosis Drug Resistance Classification and Mutation Ranking. Front Microbiol 2020; 11:667. [PMID: 32390972 PMCID: PMC7188832 DOI: 10.3389/fmicb.2020.00667] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2020] [Accepted: 03/24/2020] [Indexed: 12/12/2022] Open
Abstract
Resistance prediction and mutation ranking are important tasks in the analysis of Tuberculosis sequence data. Due to standard regimens for the use of first-line antibiotics, resistance co-occurrence, in which samples are resistant to multiple drugs, is common. Analysing all drugs simultaneously should therefore enable patterns reflecting resistance co-occurrence to be exploited for resistance prediction. Here, multi-label random forest (MLRF) models are compared with single-label random forest (SLRF) for both predicting phenotypic resistance from whole genome sequences and identifying important mutations for better prediction of four first-line drugs in a dataset of 13402 Mycobacterium tuberculosis isolates. Results confirmed that MLRFs can improve performance compared to conventional clinical methods (by 18.10%) and SLRFs (by 0.91%). In addition, we identified a list of candidate mutations that are important for resistance prediction or that are related to resistance co-occurrence. Moreover, we found that retraining our analysis to a subset of top-ranked mutations was sufficient to achieve satisfactory performance. The source code can be found at http://www.robots.ox.ac.uk/~davidc/code.php.
Collapse
|
65
|
Kizny Gordon A, Phan HTT, Lipworth SI, Cheong E, Gottlieb T, George S, Peto TEA, Mathers AJ, Walker AS, Crook DW, Stoesser N. Genomic dynamics of species and mobile genetic elements in a prolonged blaIMP-4-associated carbapenemase outbreak in an Australian hospital. J Antimicrob Chemother 2020; 75:873-882. [PMID: 31960024 PMCID: PMC7069471 DOI: 10.1093/jac/dkz526] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2019] [Revised: 11/16/2019] [Accepted: 11/27/2019] [Indexed: 12/19/2022] Open
Abstract
BACKGROUND Hospital outbreaks of carbapenemase-producing organisms, such as blaIMP-4-containing organisms, are an increasing threat to patient safety. OBJECTIVES To investigate the genomic dynamics of a 10 year (2006-15) outbreak of blaIMP-4-containing organisms in a burns unit in a hospital in Sydney, Australia. METHODS All carbapenem-non-susceptible or MDR clinical isolates (2006-15) and a random selection of equivalent or ESBL-producing environmental isolates (2012-15) were sequenced [short-read (Illumina), long-read (Oxford Nanopore Technology)]. Sequence data were used to assess genetic relatedness of isolates (Mash; mapping and recombination-adjusted phylogenies), perform in silico typing (MLST, resistance genes and plasmid replicons) and reconstruct a subset of blaIMP plasmids for comparative plasmid genomics. RESULTS A total of 46/58 clinical and 67/96 environmental isolates contained blaIMP-4. All blaIMP-4-positive organisms contained five or more other resistance genes. Enterobacter cloacae was the predominant organism, with 12 other species mainly found in either the environment or patients, some persisting despite several cleaning methods. On phylogenetic analysis there were three genetic clusters of E. cloacae containing both clinical and environmental isolates, and an additional four clusters restricted to either reservoir. blaIMP-4 was mostly found as part of a cassette array (blaIMP-4-qacG2-aacA4-catB3) in a class 1 integron within a previously described IncM2 plasmid (pEl1573), with almost complete conservation of this cassette across the species over the 10 years. Several other plasmids were also implicated, including an IncF plasmid backbone not previously widely described in association with blaIMP-4. CONCLUSIONS Genetic backgrounds disseminating blaIMP-4 can persist, diversify and evolve amongst both human and environmental reservoirs during a prolonged outbreak despite intensive prevention efforts.
Collapse
|
66
|
Bush SJ, Foster D, Eyre DW, Clark EL, De Maio N, Shaw LP, Stoesser N, Peto TEA, Crook DW, Walker AS. Genomic diversity affects the accuracy of bacterial single-nucleotide polymorphism-calling pipelines. Gigascience 2020; 9:giaa007. [PMID: 32025702 PMCID: PMC7002876 DOI: 10.1093/gigascience/giaa007] [Citation(s) in RCA: 61] [Impact Index Per Article: 15.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2019] [Revised: 12/02/2019] [Accepted: 01/15/2020] [Indexed: 02/06/2023] Open
Abstract
BACKGROUND Accurately identifying single-nucleotide polymorphisms (SNPs) from bacterial sequencing data is an essential requirement for using genomics to track transmission and predict important phenotypes such as antimicrobial resistance. However, most previous performance evaluations of SNP calling have been restricted to eukaryotic (human) data. Additionally, bacterial SNP calling requires choosing an appropriate reference genome to align reads to, which, together with the bioinformatic pipeline, affects the accuracy and completeness of a set of SNP calls obtained. This study evaluates the performance of 209 SNP-calling pipelines using a combination of simulated data from 254 strains of 10 clinically common bacteria and real data from environmentally sourced and genomically diverse isolates within the genera Citrobacter, Enterobacter, Escherichia, and Klebsiella. RESULTS We evaluated the performance of 209 SNP-calling pipelines, aligning reads to genomes of the same or a divergent strain. Irrespective of pipeline, a principal determinant of reliable SNP calling was reference genome selection. Across multiple taxa, there was a strong inverse relationship between pipeline sensitivity and precision, and the Mash distance (a proxy for average nucleotide divergence) between reads and reference genome. The effect was especially pronounced for diverse, recombinogenic bacteria such as Escherichia coli but less dominant for clonal species such as Mycobacterium tuberculosis. CONCLUSIONS The accuracy of SNP calling for a given species is compromised by increasing intra-species diversity. When reads were aligned to the same genome from which they were sequenced, among the highest-performing pipelines was Novoalign/GATK. By contrast, when reads were aligned to particularly divergent genomes, the highest-performing pipelines often used the aligners NextGenMap or SMALT, and/or the variant callers LoFreq, mpileup, or Strelka.
Collapse
|
67
|
Kong LY, Eyre DW, Corbeil J, Raymond F, Walker AS, Wilcox MH, Crook DW, Michaud S, Toye B, Frost E, Dendukuri N, Schiller I, Bourgault AM, Dascal A, Oughton M, Longtin Y, Poirier L, Brassard P, Turgeon N, Gilca R, Loo VG. Clostridium difficile: Investigating Transmission Patterns Between Infected and Colonized Patients Using Whole Genome Sequencing. Clin Infect Dis 2020; 68:204-209. [PMID: 29846557 DOI: 10.1093/cid/ciy457] [Citation(s) in RCA: 41] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2018] [Accepted: 05/25/2018] [Indexed: 12/30/2022] Open
Abstract
Background Whole genome sequencing (WGS) studies can enhance our understanding of the role of patients with asymptomatic Clostridium difficile colonization in transmission. Methods Isolates obtained from patients with Clostridium difficile infection (CDI) and colonization identified in a study conducted during 2006-2007 at 6 Canadian hospitals underwent typing by pulsed-field gel electrophoresis, multilocus sequence typing, and WGS. Isolates from incident CDI cases not in the initial study were also sequenced where possible. Ward movement and typing data were combined to identify plausible donors for each CDI case, as defined by shared time and space within predefined limits. Proportions of plausible donors for CDI cases that were colonized, infected, or both were examined. Results Five hundred fifty-four isolates were sequenced successfully, 353 from colonized patients and 201 from CDI cases. The NAP1/027/ST1 strain was the most common strain, found in 124 (62%) of infected and 92 (26%) of colonized patients. A donor with a plausible ward link was found for 81 CDI cases (40%) using WGS with a threshold of ≤2 single nucleotide polymorphisms to determine relatedness. Sixty-five (32%) CDI cases could be linked to both infected and colonized donors. Exclusive linkages to infected and colonized donors were found for 28 (14%) and 12 (6%) CDI cases, respectively. Conclusions Colonized patients contribute to transmission, but CDI cases are more likely linked to other infected patients than colonized patients in this cohort with high rates of the NAP1/027/ST1 strain, highlighting the importance of local prevalence of virulent strains in determining transmission dynamics.
Collapse
|
68
|
Adams ER, Ainsworth M, Anand R, Andersson MI, Auckland K, Baillie JK, Barnes E, Beer S, Bell JI, Berry T, Bibi S, Carroll M, Chinnakannan SK, Clutterbuck E, Cornall RJ, Crook DW, de Silva T, Dejnirattisai W, Dingle KE, Dold C, Espinosa A, Eyre DW, Farmer H, Fernandez Mendoza M, Georgiou D, Hoosdally SJ, Hunter A, Jefferey K, Kelly DF, Klenerman P, Knight J, Knowles C, Kwok AJ, Leuschner U, Levin R, Liu C, López-Camacho C, Martinez J, Matthews PC, McGivern H, Mentzer AJ, Milton J, Mongkolsapaya J, Moore SC, Oliveira MS, Pereira F, Perez E, Peto T, Ploeg RJ, Pollard A, Prince T, Roberts DJ, Rudkin JK, Sanchez V, Screaton GR, Semple MG, Slon-Campos J, Skelly DT, Smith EN, Sobrinodiaz A, Staves J, Stuart DI, Supasa P, Surik T, Thraves H, Tsang P, Turtle L, Walker AS, Wang B, Washington C, Watkins N, Whitehouse J. Antibody testing for COVID-19: A report from the National COVID Scientific Advisory Panel. Wellcome Open Res 2020. [PMID: 33748431 DOI: 10.12688/wellcomeopenres10.12688/wellcomeopenres.15927.1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/10/2023] Open
Abstract
Background: The COVID-19 pandemic caused >1 million infections during January-March 2020. There is an urgent need for reliable antibody detection approaches to support diagnosis, vaccine development, safe release of individuals from quarantine, and population lock-down exit strategies. We set out to evaluate the performance of ELISA and lateral flow immunoassay (LFIA) devices. Methods: We tested plasma for COVID (severe acute respiratory syndrome coronavirus 2; SARS-CoV-2) IgM and IgG antibodies by ELISA and using nine different LFIA devices. We used a panel of plasma samples from individuals who have had confirmed COVID infection based on a PCR result (n=40), and pre-pandemic negative control samples banked in the UK prior to December-2019 (n=142). Results: ELISA detected IgM or IgG in 34/40 individuals with a confirmed history of COVID infection (sensitivity 85%, 95%CI 70-94%), vs. 0/50 pre-pandemic controls (specificity 100% [95%CI 93-100%]). IgG levels were detected in 31/31 COVID-positive individuals tested ≥10 days after symptom onset (sensitivity 100%, 95%CI 89-100%). IgG titres rose during the 3 weeks post symptom onset and began to fall by 8 weeks, but remained above the detection threshold. Point estimates for the sensitivity of LFIA devices ranged from 55-70% versus RT-PCR and 65-85% versus ELISA, with specificity 95-100% and 93-100% respectively. Within the limits of the study size, the performance of most LFIA devices was similar. Conclusions: Currently available commercial LFIA devices do not perform sufficiently well for individual patient applications. However, ELISA can be calibrated to be specific for detecting and quantifying SARS-CoV-2 IgM and IgG and is highly sensitive for IgG from 10 days following first symptoms.
Collapse
|
69
|
Eyre DW, Peto TEA, Crook DW, Walker AS, Wilcox MH. Hash-Based Core Genome Multilocus Sequence Typing for Clostridium difficile. J Clin Microbiol 2019; 58:e01037-19. [PMID: 31666367 PMCID: PMC6935933 DOI: 10.1128/jcm.01037-19] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2019] [Accepted: 10/24/2019] [Indexed: 12/27/2022] Open
Abstract
Pathogen whole-genome sequencing has huge potential as a tool to better understand infection transmission. However, rapidly identifying closely related genomes among a background of thousands of other genomes is challenging. Here, we describe a refinement to core genome multilocus sequence typing (cgMLST) in which alleles at each gene are reproducibly converted to a unique hash, or short string of letters (hash-cgMLST). This avoids the resource-intensive need for a single centralized database of sequentially numbered alleles. We test the reproducibility and discriminatory power of cgMLST/hash-cgMLST compared to those of mapping-based approaches in Clostridium difficile, using repeated sequencing of the same isolates (replicates) and data from consecutive infection isolates from six English hospitals. Hash-cgMLST provided the same results as standard cgMLST, with minimal performance penalty. Comparing 272 replicate sequence pairs using reference-based mapping, there were 0, 1, or 2 single-nucleotide polymorphisms (SNPs) between 262 (96%), 5 (2%), and 1 (<1%) of the pairs, respectively. Using hash-cgMLST, 218 (80%) of replicate pairs assembled with SPAdes had zero gene differences, and 31 (11%), 5 (2%), and 18 (7%) pairs had 1, 2, and >2 differences, respectively. False gene differences were clustered in specific genes and associated with fragmented assemblies, but were reduced using the SKESA assembler. Considering 412 pairs of infections with ≤2 SNPS, i.e., consistent with recent transmission, 376 (91%) had ≤2 gene differences and 16 (4%) had ≥4. Comparing a genome to 100,000 others took <1 min using hash-cgMLST. Hash-cgMLST is an effective surveillance tool for rapidly identifying clusters of related genomes. However, cgMLST/hash-cgMLST generate more false variants than mapping-based approaches. Follow-up mapping-based analyses are likely required to precisely define close genetic relationships.
Collapse
|
70
|
Lewandowski K, Xu Y, Pullan ST, Lumley SF, Foster D, Sanderson N, Vaughan A, Morgan M, Bright N, Kavanagh J, Vipond R, Carroll M, Marriott AC, Gooch KE, Andersson M, Jeffery K, Peto TEA, Crook DW, Walker AS, Matthews PC. Metagenomic Nanopore Sequencing of Influenza Virus Direct from Clinical Respiratory Samples. J Clin Microbiol 2019; 58:e00963-19. [PMID: 31666364 PMCID: PMC6935926 DOI: 10.1128/jcm.00963-19] [Citation(s) in RCA: 91] [Impact Index Per Article: 18.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2019] [Accepted: 10/21/2019] [Indexed: 01/11/2023] Open
Abstract
Influenza is a major global public health threat as a result of its highly pathogenic variants, large zoonotic reservoir, and pandemic potential. Metagenomic viral sequencing offers the potential for a diagnostic test for influenza virus which also provides insights on transmission, evolution, and drug resistance and simultaneously detects other viruses. We therefore set out to apply the Oxford Nanopore Technologies sequencing method to metagenomic sequencing of respiratory samples. We generated influenza virus reads down to a limit of detection of 102 to 103 genome copies/ml in pooled samples, observing a strong relationship between the viral titer and the proportion of influenza virus reads (P = 4.7 × 10-5). Applying our methods to clinical throat swabs, we generated influenza virus reads for 27/27 samples with mid-to-high viral titers (cycle threshold [CT ] values, <30) and 6/13 samples with low viral titers (CT values, 30 to 40). No false-positive reads were generated from 10 influenza virus-negative samples. Thus, Nanopore sequencing operated with 83% sensitivity (95% confidence interval [CI], 67 to 93%) and 100% specificity (95% CI, 69 to 100%) compared to the current diagnostic standard. Coverage of full-length virus was dependent on sample composition, being negatively influenced by increased host and bacterial reads. However, at high influenza virus titers, we were able to reconstruct >99% complete sequences for all eight gene segments. We also detected a human coronavirus coinfection in one clinical sample. While further optimization is required to improve sensitivity, this approach shows promise for the Nanopore platform to be used in the diagnosis and genetic analysis of influenza virus and other respiratory viruses.
Collapse
|
71
|
Hunt M, Bradley P, Lapierre SG, Heys S, Thomsit M, Hall MB, Malone KM, Wintringer P, Walker TM, Cirillo DM, Comas I, Farhat MR, Fowler P, Gardy J, Ismail N, Kohl TA, Mathys V, Merker M, Niemann S, Omar SV, Sintchenko V, Smith G, van Soolingen D, Supply P, Tahseen S, Wilcox M, Arandjelovic I, Peto TEA, Crook DW, Iqbal Z. Antibiotic resistance prediction for Mycobacterium tuberculosis from genome sequence data with Mykrobe. Wellcome Open Res 2019; 4:191. [PMID: 32055708 PMCID: PMC7004237 DOI: 10.12688/wellcomeopenres.15603.1] [Citation(s) in RCA: 62] [Impact Index Per Article: 12.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/22/2019] [Indexed: 01/08/2023] Open
Abstract
Two billion people are infected with Mycobacterium tuberculosis, leading to 10 million new cases of active tuberculosis and 1.5 million deaths annually. Universal access to drug susceptibility testing (DST) has become a World Health Organization priority. We previously developed a software tool, Mykrobe predictor, which provided offline species identification and drug resistance predictions for M. tuberculosis from whole genome sequencing (WGS) data. Performance was insufficient to support the use of WGS as an alternative to conventional phenotype-based DST, due to mutation catalogue limitations. Here we present a new tool, Mykrobe, which provides the same functionality based on a new software implementation. Improvements include i) an updated mutation catalogue giving greater sensitivity to detect pyrazinamide resistance, ii) support for user-defined resistance catalogues, iii) improved identification of non-tuberculous mycobacterial species, and iv) an updated statistical model for Oxford Nanopore Technologies sequencing data. Mykrobe is released under MIT license at https://github.com/mykrobe-tools/mykrobe. We incorporate mutation catalogues from the CRyPTIC consortium et al. (2018) and from Walker et al. (2015), and make improvements based on performance on an initial set of 3206 and an independent set of 5845 M. tuberculosis Illumina sequences. To give estimates of error rates, we use a prospectively collected dataset of 4362 M. tuberculosis isolates. Using culture based DST as the reference, we estimate Mykrobe to be 100%, 95%, 82%, 99% sensitive and 99%, 100%, 99%, 99% specific for rifampicin, isoniazid, pyrazinamide and ethambutol resistance prediction respectively. We benchmark against four other tools on 10207 (=5845+4362) samples, and also show that Mykrobe gives concordant results with nanopore data. We measure the ability of Mykrobe-based DST to guide personalized therapeutic regimen design in the context of complex drug susceptibility profiles, showing 94% concordance of implied regimen with that driven by phenotypic DST, higher than all other benchmarked tools.
Collapse
|
72
|
Peto L, Fawcett NJ, Crook DW, Peto TEA, Llewelyn MJ, Walker AS. Selective culture enrichment and sequencing of feces to enhance detection of antimicrobial resistance genes in third-generation cephalosporin resistant Enterobacteriaceae. PLoS One 2019; 14:e0222831. [PMID: 31703058 PMCID: PMC6839868 DOI: 10.1371/journal.pone.0222831] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2019] [Accepted: 09/07/2019] [Indexed: 01/02/2023] Open
Abstract
Metagenomic sequencing of fecal DNA can usefully characterise an individual's intestinal resistome but is limited by its inability to detect important pathogens that may be present at low abundance, such as carbapenemase or extended-spectrum beta-lactamase producing Enterobacteriaceae. Here we aimed to develop a hybrid protocol to improve detection of resistance genes in Enterobacteriaceae by using a short period of culture enrichment prior to sequencing of DNA extracted directly from the enriched sample. Volunteer feces were spiked with carbapenemase-producing Enterobacteriaceae and incubated in selective broth culture for 6 hours before sequencing. Different DNA extraction methods were compared, including a plasmid extraction protocol to increase the detection of plasmid-associated resistance genes. Although enrichment prior to sequencing increased the detection of carbapenemase genes, the differing growth characteristics of the spike organisms precluded accurate quantification of their concentration prior to culture. Plasmid extraction increased detection of resistance genes present on plasmids, but the effects were heterogeneous and dependent on plasmid size. Our results demonstrate methods of improving the limit of detection of selected resistance mechanisms in a fecal resistome assay, but they also highlight the difficulties in using these techniques for accurate quantification and should inform future efforts to achieve this goal.
Collapse
|
73
|
Martin JSH, Eyre DW, Fawley WN, Griffiths D, Davies K, Mawer DPC, Peto TEA, Crook DW, Walker AS, Wilcox MH. Patient and Strain Characteristics Associated With Clostridium difficile Transmission and Adverse Outcomes. Clin Infect Dis 2019; 67:1379-1387. [PMID: 29659753 PMCID: PMC6186849 DOI: 10.1093/cid/ciy302] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2017] [Accepted: 05/23/2018] [Indexed: 01/05/2023] Open
Abstract
Background No study has used whole-genome sequencing (WGS) to investigate risk factors for Clostridium difficile (CD) transmission between cases, or assessed the impact of recent acquisition on patient outcome. Methods This 20 month retrospective cohort study included consecutive cytotoxin-positive diarrheal samples, which underwent culture, ribotyping, and WGS (Illumina). Sequenced isolates were compared using single nucleotide variants (SNVs). Independent predictors of acquisition from another case, onward transmission, 120-day recurrence, and 30-day mortality were identified using logistic regression with backwards elimination. Results Of 660 CD cases, 640 (97%) were sequenced, of which 567 (89%) shared a ribotype with a prior case, but only 227 (35%) were ≤2 SNVs from a prior case, supporting recent acquisition. Plausible (<2 SNVs) recent ward-based acquisition from a symptomatic case was more frequent in certain ribotypes; 64% (67/105) for ribotype-027 cases, compared with 11% (6/57) for ribotype-078. Independent risk factors (adjusted P < .05) for CD acquisition included older age, longer inpatient duration, and ribotype; these factors, and male sex, increased onward transmission. Patients with a plausible donor had a greater risk of recurrence (adjusted P = .001) and trended towards greater 30-day mortality (adjusted P = .06). Ribotype had no additional mortality or recurrence impact after adjusting for acquisition (P > .1). Conclusions Greater transmission of certain lineages suggests CD may have different reservoirs and modes of transmission. Acquiring CD from a recent case is associated with poorer clinical outcomes. Clinical characteristics associated with increased healthcare-associated CD transmission could be used to target preventative interventions.
Collapse
|
74
|
Eyre DW, Davies KA, Davis G, Fawley WN, Dingle KE, De Maio N, Karas A, Crook DW, Peto TEA, Walker AS, Wilcox MH. Two Distinct Patterns of Clostridium difficile Diversity Across Europe Indicating Contrasting Routes of Spread. Clin Infect Dis 2019; 67:1035-1044. [PMID: 29659747 PMCID: PMC6137122 DOI: 10.1093/cid/ciy252] [Citation(s) in RCA: 47] [Impact Index Per Article: 9.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2017] [Accepted: 04/05/2018] [Indexed: 01/05/2023] Open
Abstract
Background Rates of Clostridium difficile infection vary widely across Europe, as do prevalent ribotypes. The extent of Europe-wide diversity within each ribotype, however, is unknown. Methods Inpatient diarrheal fecal samples submitted on a single day in summer and winter (2012–2013) to laboratories in 482 European hospitals were cultured for C. difficile, and isolates the 10 most prevalent ribotypes were whole-genome sequenced. Within each ribotype, country-based sequence clustering was assessed using the ratio of the median number of single-nucleotide polymorphisms between isolates within versus across different countries, using permutation tests. Time-scaled Bayesian phylogenies were used to reconstruct the historical location of each lineage. Results Sequenced isolates (n = 624) were from 19 countries. Five ribotypes had within-country clustering: ribotype 356, only in Italy; ribotype 018, predominantly in Italy; ribotype 176, with distinct Czech and German clades; ribotype 001/072, including distinct German, Slovakian, and Spanish clades; and ribotype 027, with multiple predominantly country-specific clades including in Hungary, Italy, Germany, Romania, and Poland. By contrast, we found no within-country clustering for ribotypes 078, 015, 002, 014, and 020, consistent with a Europe-wide distribution. Fluoroquinolone resistance was significantly more common in within-country clustered ribotypes (P = .009). Fluoroquinolone-resistant isolates were also more tightly clustered geographically with a median (interquartile range) of 43 (0–213) miles between each isolate and the most closely genetically related isolate, versus 421 (204–680) miles in nonresistant pairs (P < .001). Conclusions Two distinct patterns of C. difficile ribotype spread were observed, consistent with either predominantly healthcare-associated acquisition or Europe-wide dissemination via other routes/sources, for example, the food chain.
Collapse
|
75
|
Gweon HS, Shaw LP, Swann J, De Maio N, AbuOun M, Niehus R, Hubbard ATM, Bowes MJ, Bailey MJ, Peto TEA, Hoosdally SJ, Walker AS, Sebra RP, Crook DW, Anjum MF, Read DS, Stoesser N. The impact of sequencing depth on the inferred taxonomic composition and AMR gene content of metagenomic samples. ENVIRONMENTAL MICROBIOME 2019; 14:7. [PMID: 33902704 PMCID: PMC8204541 DOI: 10.1186/s40793-019-0347-1] [Citation(s) in RCA: 44] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/18/2019] [Accepted: 09/28/2019] [Indexed: 05/18/2023]
Abstract
BACKGROUND Shotgun metagenomics is increasingly used to characterise microbial communities, particularly for the investigation of antimicrobial resistance (AMR) in different animal and environmental contexts. There are many different approaches for inferring the taxonomic composition and AMR gene content of complex community samples from shotgun metagenomic data, but there has been little work establishing the optimum sequencing depth, data processing and analysis methods for these samples. In this study we used shotgun metagenomics and sequencing of cultured isolates from the same samples to address these issues. We sampled three potential environmental AMR gene reservoirs (pig caeca, river sediment, effluent) and sequenced samples with shotgun metagenomics at high depth (~ 200 million reads per sample). Alongside this, we cultured single-colony isolates of Enterobacteriaceae from the same samples and used hybrid sequencing (short- and long-reads) to create high-quality assemblies for comparison to the metagenomic data. To automate data processing, we developed an open-source software pipeline, 'ResPipe'. RESULTS Taxonomic profiling was much more stable to sequencing depth than AMR gene content. 1 million reads per sample was sufficient to achieve < 1% dissimilarity to the full taxonomic composition. However, at least 80 million reads per sample were required to recover the full richness of different AMR gene families present in the sample, and additional allelic diversity of AMR genes was still being discovered in effluent at 200 million reads per sample. Normalising the number of reads mapping to AMR genes using gene length and an exogenous spike of Thermus thermophilus DNA substantially changed the estimated gene abundance distributions. While the majority of genomic content from cultured isolates from effluent was recoverable using shotgun metagenomics, this was not the case for pig caeca or river sediment. CONCLUSIONS Sequencing depth and profiling method can critically affect the profiling of polymicrobial animal and environmental samples with shotgun metagenomics. Both sequencing of cultured isolates and shotgun metagenomics can recover substantial diversity that is not identified using the other methods. Particular consideration is required when inferring AMR gene content or presence by mapping metagenomic reads to a database. ResPipe, the open-source software pipeline we have developed, is freely available ( https://gitlab.com/hsgweon/ResPipe ).
Collapse
|