1
|
Trisakul K, Hinwan Y, Eisiri J, Salao K, Chaiprasert A, Kamolwat P, Tongsima S, Campino S, Phelan J, Clark TG, Faksri K. Comparisons of genome assembly tools for characterization of Mycobacterium tuberculosis genomes using hybrid sequencing technologies. PeerJ 2024; 12:e17964. [PMID: 39221271 PMCID: PMC11366230 DOI: 10.7717/peerj.17964] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2024] [Accepted: 08/01/2024] [Indexed: 09/04/2024] Open
Abstract
Background Next-generation sequencing of Mycobacterium tuberculosis, the infectious agent causing tuberculosis, is improving the understanding of genomic diversity of circulating lineages and strain-types, and informing knowledge of drug resistance mutations. An increasingly popular approach to characterizing M. tuberculosis genomes (size: 4.4 Mbp) and variants (e.g., single nucleotide polymorphisms (SNPs)) involves the de novo assembly of sequence data. Methods We compared the performance of genome assembly tools (Unicycler, RagOut, and RagTag) on sequence data from nine drug resistant M. tuberculosis isolates (multi-drug (MDR) n = 1; pre-extensively-drug (pre-XDR) n = 8) generated using Illumina HiSeq, Oxford Nanopore Technology (ONT) PromethION, and PacBio platforms. Results Our investigation found that Unicycler-based assemblies had significantly higher genome completeness (~98.7%; p values = 0.01) compared to other assembler tools (RagOut = 98.6%, and RagTag = 98.6%). The genome assembly sizes (bp) across isolates and sequencers based on RagOut was significantly longer (p values < 0.001) (4,418,574 ± 8,824 bp) than Unicycler and RagTag assemblies (Unicycler = 4,377,642 ± 55,257 bp, and RagTag = 4,380,711 ± 51,164 bp). RagOut-based assemblies had the fewest contigs (~32) and the longest genome size (4,418,574 bp; vs. H37Rv reference size 4,411,532 bp) and therefore were chosen for downstream analysis. Pan-genome analysis of Illumina and PacBio hybrid assemblies revealed the greatest number of detected genes (4,639 genes; H37Rv reference contains 3,976 genes), while Illumina and ONT hybrid assemblies produced the highest number of SNPs. The number of genes from hybrid assemblies with ONT and PacBio long-reads (mean: 4,620 genes) was greater than short-read assembly alone (4,478 genes). All nine RagOut hybrid genome assemblies detected known mutations in genes associated with MDR-TB and pre-XDR-TB. Conclusions Unicycler software performed the best in terms of achieving contiguous genomes, whereas RagOut improved the quality of Unicycler's genome assemblies by providing a longer genome size. Overall, our approach has demonstrated that short-read and long-read hybrid assembly can provide a more complete genome assembly than short-read assembly alone by detecting pan-genomes and more genes, including IS6110, and SNPs.
Collapse
Affiliation(s)
- Kanwara Trisakul
- Department of Microbiology, Faculty of Medicine, Khon Kaen University, Khon Kaen, Thailand
- Research and Diagnostic Center for Emerging Infectious Diseases (RCEID), Khon Kaen University, Khon Kaen, Thailand
| | - Yothin Hinwan
- Department of Microbiology, Faculty of Medicine, Khon Kaen University, Khon Kaen, Thailand
- Research and Diagnostic Center for Emerging Infectious Diseases (RCEID), Khon Kaen University, Khon Kaen, Thailand
| | - Jukgarin Eisiri
- Research and Diagnostic Center for Emerging Infectious Diseases (RCEID), Khon Kaen University, Khon Kaen, Thailand
| | - Kanin Salao
- Department of Microbiology, Faculty of Medicine, Khon Kaen University, Khon Kaen, Thailand
- Research and Diagnostic Center for Emerging Infectious Diseases (RCEID), Khon Kaen University, Khon Kaen, Thailand
| | - Angkana Chaiprasert
- Office for Research and Development, Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok, Thailand
| | - Phalin Kamolwat
- Division of Tuberculosis, Department of Disease Control, Ministry of Public Health, Bangkok, Thailand
| | - Sissades Tongsima
- National Biobank of Thailand, National Center for Genetics Engineering and Biotechnology, Pathum Thani, Thailand
| | - Susana Campino
- Faculty of Infectious and Tropical Diseases, London School of Hygiene & Tropical Medicine, University of London, London, United Kingdom
| | - Jody Phelan
- Faculty of Infectious and Tropical Diseases, London School of Hygiene & Tropical Medicine, University of London, London, United Kingdom
| | - Taane G. Clark
- Faculty of Infectious and Tropical Diseases, London School of Hygiene & Tropical Medicine, University of London, London, United Kingdom
- Faculty of Epidemiology and Population Health, London School of Hygiene & Tropical Medicine, University of London, London, United Kingdom
| | - Kiatichai Faksri
- Department of Microbiology, Faculty of Medicine, Khon Kaen University, Khon Kaen, Thailand
- Research and Diagnostic Center for Emerging Infectious Diseases (RCEID), Khon Kaen University, Khon Kaen, Thailand
| |
Collapse
|
2
|
Djeghout B, Le-Viet T, Martins LDO, Savva GM, Evans R, Baker D, Page A, Elumogo N, Wain J, Janecko N. Capturing clinically relevant Campylobacter attributes through direct whole genome sequencing of stool. Microb Genom 2024; 10. [PMID: 39213166 DOI: 10.1099/mgen.0.001284] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/04/2024] Open
Abstract
Campylobacter is the leading bacterial cause of infectious intestinal disease, but the pathogen typically accounts for a very small proportion of the overall stool microbiome in each patient. Diagnosis is even more difficult due to the fastidious nature of Campylobacter in the laboratory setting. This has, in part, driven a change in recent years, from culture-based to rapid PCR-based diagnostic assays which have improved diagnostic detection, whilst creating a knowledge gap in our clinical and epidemiological understanding of Campylobacter genotypes - no isolates to sequence. In this study, direct metagenomic sequencing approaches were used to assess the possibility of replacing genome sequences with metagenome sequences; metagenomic sequencing outputs were used to describe clinically relevant attributes of Campylobacter genotypes. A total of 37 diarrhoeal stool samples with Campylobacter and five samples with an unknown pathogen result were collected and processed with and without filtration, DNA was extracted, and metagenomes were sequenced by short-read sequencing. Culture-based methods were used to validate Campylobacter metagenome-derived genome (MDG) results. Sequence output metrics were assessed for Campylobacter genome quality and accuracy of characterization. Of the 42 samples passing quality checks for analysis, identification of Campylobacter to the genus and species level was dependent on Campylobacter genome read count, coverage and genome completeness. A total of 65% (24/37) of samples were reliably identified to the genus level through Campylobacter MDG, 73% (27/37) by culture and 97% (36/37) by qPCR. The Campylobacter genomes with a genome completeness of over 60% (n=21) were all accurately identified at the species level (100%). Of those, 72% (15/21) were identified to sequence types (STs), and 95% (20/21) accurately identified antimicrobial resistance (AMR) gene determinants. Filtration of stool samples enhanced Campylobacter MDG recovery and genome quality metrics compared to the corresponding unfiltered samples, which improved the identification of STs and AMR profiles. The phylogenetic analysis in this study demonstrated the clustering of the metagenome-derived with culture-derived genomes and revealed the reliability of genomes from direct stool sequencing. Furthermore, Campylobacter genome spiking percentages ranging from 0 to 2% total metagenome abundance in the ONT MinION sequencer, configured to adaptive sequencing, exhibited better assembly quality and accurate identification of STs, particularly in the analysis of metagenomes containing 2 and 1% of Campylobacter jejuni genomes. Direct sequencing of Campylobacter from stool samples provides clinically relevant and epidemiologically important genomic information without the reliance on cultured genomes.
Collapse
Affiliation(s)
- Bilal Djeghout
- Quadram Institute Bioscience, Norwich Research Park, Norwich NR4 7UQ, UK
| | - Thanh Le-Viet
- Quadram Institute Bioscience, Norwich Research Park, Norwich NR4 7UQ, UK
| | | | - George M Savva
- Quadram Institute Bioscience, Norwich Research Park, Norwich NR4 7UQ, UK
| | - Rhiannon Evans
- Quadram Institute Bioscience, Norwich Research Park, Norwich NR4 7UQ, UK
| | - David Baker
- Quadram Institute Bioscience, Norwich Research Park, Norwich NR4 7UQ, UK
| | - Andrew Page
- Quadram Institute Bioscience, Norwich Research Park, Norwich NR4 7UQ, UK
| | - Ngozi Elumogo
- Quadram Institute Bioscience, Norwich Research Park, Norwich NR4 7UQ, UK
- Eastern Pathology Alliance, Norfolk and Norwich University Hospital, Norwich NR4 7UY, UK
| | - John Wain
- Quadram Institute Bioscience, Norwich Research Park, Norwich NR4 7UQ, UK
- Norwich Medical School, University of East Anglia, Norwich NR4 7TJ, UK
| | - Nicol Janecko
- Quadram Institute Bioscience, Norwich Research Park, Norwich NR4 7UQ, UK
| |
Collapse
|
3
|
Dippenaar A, Ismail N, Heupink TH, Grobbelaar M, Loubser J, Van Rie A, Warren RM. Droplet based whole genome amplification for sequencing minute amounts of purified Mycobacterium tuberculosis DNA. Sci Rep 2024; 14:9931. [PMID: 38689002 PMCID: PMC11061190 DOI: 10.1038/s41598-024-60545-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2023] [Accepted: 04/24/2024] [Indexed: 05/02/2024] Open
Abstract
Implementation of whole genome sequencing (WGS) for patient care is hindered by limited Mycobacterium tuberculosis (Mtb) in clinical specimens and slow Mtb growth. We evaluated droplet multiple displacement amplification (dMDA) for amplification of minute amounts of Mtb DNA to enable WGS as an alternative to other Mtb enrichment methods. Purified genomic Mtb-DNA (0.1, 0.5, 1, and 5 pg) was encapsulated and amplified using the Samplix Xdrop-instrument and sequenced alongside a control sample using standard Illumina protocols followed by MAGMA-analysis. The control and 5 pg input dMDA samples underwent nanopore sequencing followed by Nanoseq and TB-profiler analysis. dMDA generated 105-2400 ng DNA from the 0.1-5 pg input DNA, respectively. Followed by Illumina WGS, dMDA raised mean sequencing depth from 7 × for 0.1 pg input DNA to ≥ 60 × for 5 pg input and the control sample. Bioinformatic analysis revealed a high number of false positive and false negative variants when amplifying ≤ 0.5 pg input DNA. Nanopore sequencing of the 5 pg dMDA sample presented excellent coverage depth, breadth, and accurate strain characterization, albeit elevated false positive and false negative variants compared to Illumina-sequenced dMDA sample with identical Mtb DNA input. dMDA coupled with Illumina WGS for samples with ≥ 5 pg purified Mtb DNA, equating to approximately 1000 copies of the Mtb genome, offers precision for drug resistance, phylogeny, and transmission insights.
Collapse
Affiliation(s)
- Anzaan Dippenaar
- Tuberculosis Omics Research Consortium, Department of Family Medicine and Population Health, Global Health Institute, Faculty of Medicine and Health Sciences, University of Antwerp, Antwerp, Belgium.
| | - Nabila Ismail
- South African Medical Research Council Centre for Tuberculosis Research, Division of Molecular Biology and Human Genetics, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa
| | - Tim H Heupink
- Tuberculosis Omics Research Consortium, Department of Family Medicine and Population Health, Global Health Institute, Faculty of Medicine and Health Sciences, University of Antwerp, Antwerp, Belgium
| | - Melanie Grobbelaar
- South African Medical Research Council Centre for Tuberculosis Research, Division of Molecular Biology and Human Genetics, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa
| | - Johannes Loubser
- South African Medical Research Council Centre for Tuberculosis Research, Division of Molecular Biology and Human Genetics, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa
| | - Annelies Van Rie
- Tuberculosis Omics Research Consortium, Department of Family Medicine and Population Health, Global Health Institute, Faculty of Medicine and Health Sciences, University of Antwerp, Antwerp, Belgium
| | - Robin M Warren
- South African Medical Research Council Centre for Tuberculosis Research, Division of Molecular Biology and Human Genetics, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa
| |
Collapse
|
4
|
Patiño LH, Ballesteros N, Muñoz M, Jaimes J, Castillo-Castañeda AC, Madigan R, Paniz-Mondolfi A, Ramírez JD. Validation of Oxford nanopore sequencing for improved New World Leishmania species identification via analysis of 70-kDA heat shock protein. Parasit Vectors 2023; 16:458. [PMID: 38111024 PMCID: PMC10726620 DOI: 10.1186/s13071-023-06073-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2023] [Accepted: 11/29/2023] [Indexed: 12/20/2023] Open
Abstract
BACKGROUND Leishmaniasis is a parasitic disease caused by obligate intracellular protozoa of the genus Leishmania. This infection is characterized by a wide range of clinical manifestations, with symptoms greatly dependent on the causal parasitic species. Here we present the design and application of a new 70-kDa heat shock protein gene (hsp70)-based marker of 771 bp (HSP70-Long). We evaluated its sensitivity, specificity and diagnostic performance employing an amplicon-based MinION™ DNA sequencing assay to identify different Leishmania species in clinical samples from humans and reservoirs with cutaneous leishmaniasis (CL) and visceral leishmaniasis (VL). We also conducted a comparative analysis between our novel marker and a previously published HSP70 marker known as HSP70-Short, which spans 330 bp. METHODS A dataset of 27 samples from Colombia, Venezuela and the USA was assembled, of which 26 samples were collected from humans, dogs and cats affected by CL and one sample was collected from a dog with VL in the USA (but originally from Greece). DNA was extracted from each sample and underwent conventional PCR amplification utilizing two distinct HSP70 markers: HSP70-Short and HSP70-Long. The subsequent products were then sequenced using the MinION™ sequencing platform. RESULTS The results highlight the distinct characteristics of the newly devised HSP70-Long primer, showcasing the notable specificity of this primer, although its sensitivity is lower than that of the HSP70-Short marker. Notably, both markers demonstrated strong discriminatory capabilities, not only in distinguishing between different species within the Leishmania genus but also in identifying instances of coinfection. CONCLUSIONS This study underscores the outstanding specificity and effectiveness of HSP70-based MinION™ sequencing, in successfully discriminating between diverse Leishmania species and identifying coinfection events within samples sourced from leishmaniasis cases.
Collapse
Affiliation(s)
- Luz Helena Patiño
- Centro de Investigaciones en Microbiología y Biotecnología-UR (CIMBIUR), Facultad de Ciencias Naturales, Universidad del Rosario, Bogotá, Colombia
| | - Nathalia Ballesteros
- Centro de Investigaciones en Microbiología y Biotecnología-UR (CIMBIUR), Facultad de Ciencias Naturales, Universidad del Rosario, Bogotá, Colombia
| | - Marina Muñoz
- Centro de Investigaciones en Microbiología y Biotecnología-UR (CIMBIUR), Facultad de Ciencias Naturales, Universidad del Rosario, Bogotá, Colombia
| | - Jesús Jaimes
- Centro de Investigaciones en Microbiología y Biotecnología-UR (CIMBIUR), Facultad de Ciencias Naturales, Universidad del Rosario, Bogotá, Colombia
| | - Adriana C Castillo-Castañeda
- Centro de Investigaciones en Microbiología y Biotecnología-UR (CIMBIUR), Facultad de Ciencias Naturales, Universidad del Rosario, Bogotá, Colombia
| | - Roy Madigan
- Animal Hospital of Smithson Valley, 286 Singing Oaks, Ste 113, Spring Branch, TX, 78070, USA
| | - Alberto Paniz-Mondolfi
- Molecular Microbiology Laboratory, Department of Pathology, Molecular and Cell-Based Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
| | - Juan David Ramírez
- Centro de Investigaciones en Microbiología y Biotecnología-UR (CIMBIUR), Facultad de Ciencias Naturales, Universidad del Rosario, Bogotá, Colombia.
- Molecular Microbiology Laboratory, Department of Pathology, Molecular and Cell-Based Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA.
| |
Collapse
|
5
|
Ghielmetti G, Loubser J, Kerr TJ, Stuber T, Thacker T, Martin LC, O'Hare MA, Mhlophe SK, Okunola A, Loxton AG, Warren RM, Moseley MH, Miller MA, Goosen WJ. Advancing animal tuberculosis surveillance using culture-independent long-read whole-genome sequencing. Front Microbiol 2023; 14:1307440. [PMID: 38075895 PMCID: PMC10699144 DOI: 10.3389/fmicb.2023.1307440] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2023] [Accepted: 10/23/2023] [Indexed: 02/12/2024] Open
Abstract
Animal tuberculosis is a significant infectious disease affecting both livestock and wildlife populations worldwide. Effective disease surveillance and characterization of Mycobacterium bovis (M. bovis) strains are essential for understanding transmission dynamics and implementing control measures. Currently, sequencing of genomic information has relied on culture-based methods, which are time-consuming, resource-demanding, and concerning in terms of biosafety. This study explores the use of culture-independent long-read whole-genome sequencing (WGS) for a better understanding of M. bovis epidemiology in African buffaloes (Syncerus caffer). By comparing two sequencing approaches, we evaluated the efficacy of Illumina WGS performed on culture extracts and culture-independent Oxford Nanopore adaptive sampling (NAS). Our objective was to assess the potential of NAS to detect genomic variants without sample culture. In addition, culture-independent amplicon sequencing, targeting mycobacterial-specific housekeeping and full-length 16S rRNA genes, was applied to investigate the presence of microorganisms, including nontuberculous mycobacteria. The sequencing quality obtained from DNA extracted directly from tissues using NAS is comparable to the sequencing quality of reads generated from culture-derived DNA using both NAS and Illumina technologies. We present a new approach that provides complete and accurate genome sequence reconstruction, culture independently, and using an economically affordable technique.
Collapse
Affiliation(s)
- Giovanni Ghielmetti
- Division of Molecular Biology and Human Genetics, South African Medical Research Council Centre for Tuberculosis Research, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa
- Section of Veterinary Bacteriology, Institute for Food Safety and Hygiene, Vetsuisse Faculty, University of Zurich, Zurich, Switzerland
| | - Johannes Loubser
- Division of Molecular Biology and Human Genetics, South African Medical Research Council Centre for Tuberculosis Research, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa
| | - Tanya J. Kerr
- Division of Molecular Biology and Human Genetics, South African Medical Research Council Centre for Tuberculosis Research, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa
| | - Tod Stuber
- National Veterinary Services Laboratories, Veterinary Services, Animal and Plant Health Inspection Service, United States Department of Agriculture, Ames, IA, United States
| | - Tyler Thacker
- National Veterinary Services Laboratories, Veterinary Services, Animal and Plant Health Inspection Service, United States Department of Agriculture, Ames, IA, United States
| | - Lauren C. Martin
- Division of Molecular Biology and Human Genetics, South African Medical Research Council Centre for Tuberculosis Research, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa
| | - Michaela A. O'Hare
- Division of Molecular Biology and Human Genetics, South African Medical Research Council Centre for Tuberculosis Research, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa
| | - Sinegugu K. Mhlophe
- Division of Molecular Biology and Human Genetics, South African Medical Research Council Centre for Tuberculosis Research, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa
| | - Abisola Okunola
- Division of Molecular Biology and Human Genetics, South African Medical Research Council Centre for Tuberculosis Research, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa
| | - Andre G. Loxton
- Division of Molecular Biology and Human Genetics, South African Medical Research Council Centre for Tuberculosis Research, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa
| | - Robin M. Warren
- Division of Molecular Biology and Human Genetics, South African Medical Research Council Centre for Tuberculosis Research, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa
| | - Mark H. Moseley
- School of Biological Sciences, University of Aberdeen, Aberdeen, United Kingdom
| | - Michele A. Miller
- Division of Molecular Biology and Human Genetics, South African Medical Research Council Centre for Tuberculosis Research, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa
| | - Wynand J. Goosen
- Division of Molecular Biology and Human Genetics, South African Medical Research Council Centre for Tuberculosis Research, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa
| |
Collapse
|