1
|
Kim D, Shin JI, Yoo IY, Jo S, Chu J, Cho WY, Shin SH, Chung YJ, Park YJ, Jung SH. GenoMycAnalyzer: a web-based tool for species and drug resistance prediction for Mycobacterium genomes. BMC Genomics 2024; 25:387. [PMID: 38643090 PMCID: PMC11031912 DOI: 10.1186/s12864-024-10320-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2024] [Accepted: 04/17/2024] [Indexed: 04/22/2024] Open
Abstract
BACKGROUND Drug-resistant tuberculosis (TB) is a major threat to global public health. Whole-genome sequencing (WGS) is a useful tool for species identification and drug resistance prediction, and many clinical laboratories are transitioning to WGS as a routine diagnostic tool. However, user-friendly and high-confidence automated bioinformatics tools are needed to rapidly identify M. tuberculosis complex (MTBC) and non-tuberculous mycobacteria (NTM), detect drug resistance, and further guide treatment options. RESULTS We developed GenoMycAnalyzer, a web-based software that integrates functions for identifying MTBC and NTM species, lineage and spoligotype prediction, variant calling, annotation, drug-resistance determination, and data visualization. The accuracy of GenoMycAnalyzer for genotypic drug susceptibility testing (gDST) was evaluated using 5,473 MTBC isolates that underwent phenotypic DST (pDST). The GenoMycAnalyzer database was built to predict the gDST for 15 antituberculosis drugs using the World Health Organization mutational catalogue. Compared to pDST, the sensitivity of drug susceptibilities by the GenoMycAnalyzer for first-line drugs ranged from 95.9% for rifampicin (95% CI 94.8-96.7%) to 79.6% for pyrazinamide (95% CI 76.9-82.2%), whereas those for second-line drugs ranged from 98.2% for levofloxacin (95% CI 90.1-100.0%) to 74.9% for capreomycin (95% CI 69.3-80.0%). Notably, the integration of large deletions of the four resistance-conferring genes increased gDST sensitivity. The specificity of drug susceptibilities by the GenoMycAnalyzer ranged from 98.7% for amikacin (95% CI 97.8-99.3%) to 79.5% for ethionamide (95% CI 76.4-82.3%). The incorporated Kraken2 software identified 1,284 mycobacterial species with an accuracy of 98.8%. GenoMycAnalyzer also perfectly predicted lineages for 1,935 MTBC and spoligotypes for 54 MTBC. CONCLUSIONS GenoMycAnalyzer offers both web-based and graphical user interfaces, which can help biologists with limited access to high-performance computing systems or limited bioinformatics skills. By streamlining the interpretation of WGS data, the GenoMycAnalyzer has the potential to significantly impact TB management and contribute to global efforts to combat this infectious disease. GenoMycAnalyzer is available at http://www.mycochase.org .
Collapse
Affiliation(s)
- Doyoung Kim
- Department of Biomedicine & Health Sciences, College of Medicine, The Catholic University of Korea, Seoul, Korea
| | - Jeong-Ih Shin
- Department of Biomedicine & Health Sciences, College of Medicine, The Catholic University of Korea, Seoul, Korea
- Integrated Research Center for Genomic Polymorphism, Precision Medicine Research Center, College of Medicine, The Catholic University of Korea, Seoul, Korea
| | - In Young Yoo
- Department of Laboratory Medicine, Seoul St. Mary's Hospital, College of Medicine, The Catholic University of Korea, Seoul, Korea
| | - Sungjin Jo
- Department of Laboratory Medicine, Eunpyeong St. Mary's Hospital, College of Medicine, The Catholic University of Korea, Seoul, Korea
| | - Jiyon Chu
- Department of Biomedicine & Health Sciences, College of Medicine, The Catholic University of Korea, Seoul, Korea
| | | | | | - Yeun-Jun Chung
- Department of Biomedicine & Health Sciences, College of Medicine, The Catholic University of Korea, Seoul, Korea
- Integrated Research Center for Genomic Polymorphism, Precision Medicine Research Center, College of Medicine, The Catholic University of Korea, Seoul, Korea
- Departments of Microbiology, College of Medicine, The Catholic University of Korea, Seoul, Korea
| | - Yeon-Joon Park
- Department of Laboratory Medicine, Seoul St. Mary's Hospital, College of Medicine, The Catholic University of Korea, Seoul, Korea
| | - Seung-Hyun Jung
- Department of Biomedicine & Health Sciences, College of Medicine, The Catholic University of Korea, Seoul, Korea.
- Integrated Research Center for Genomic Polymorphism, Precision Medicine Research Center, College of Medicine, The Catholic University of Korea, Seoul, Korea.
- Departments of Biochemistry, College of Medicine, The Catholic University of Korea, 222 Banpo-daero, Seoch-Gu, Seoul, 06591, Republic of Korea.
| |
Collapse
|
2
|
Bakuła Z, Dziurzyński M, Decewicz P, Bakonytė D, Vasiliauskaitė L, Nakčerienė B, Krenke R, Stakėnas P, Jagielski T. Spoligotyping of Mycobacterium tuberculosis - Comparing in vitro and in silico approaches. INFECTION, GENETICS AND EVOLUTION : JOURNAL OF MOLECULAR EPIDEMIOLOGY AND EVOLUTIONARY GENETICS IN INFECTIOUS DISEASES 2023; 115:105508. [PMID: 37757901 DOI: 10.1016/j.meegid.2023.105508] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/09/2023] [Revised: 09/20/2023] [Accepted: 09/25/2023] [Indexed: 09/29/2023]
Abstract
Spoligotyping is one of the molecular typing methods widely used for exploring the genetic variety of Mycobacterium tuberculosis. The aim of this study was to compare the spoligoprofiles of M. tuberculosis clinical isolates, obtained using in vitro and in silico approaches. The study included 230 M. tuberculosis isolates, recovered from Poland and Lithuania between 2018 and 2021. Spoligotyping in vitro was performed with a commercially available kit. Whole genome sequencing (WGS) was done with Illumina NovaSeq 6000 sequencer. Spoligotype International Types (SITs) were assigned according to the SITVIT2 database or using three different in silico tools, and based on WGS data, namely SpoTyping, SpolPred, and lorikeet. Upon in vitro spoligotyping, the isolates produced 65 different spoligotypes. Spoligotypes inferred from the WGS data were congruent with in vitro generated patterns in 81.7% (188/230) for lorikeet and 81.3% (187/230) for SpolPred and SpoTyping. Spacers 18 and 31 produced the highest ratio of discrepant results between in vitro and in silico approaches, with their signals discordantly assigned for 15 (6.5%) and 9 (3.9%) isolates, respectively. All three in silico approaches used were similarly efficient for M. tuberculosis spoligotype prediction. However, only SpoTyping could predict spoligotypes without a need for manual curation. Thus, we consider it as the most accurate tool. Its use is further advocated by the shortest time of analysis. A relatively high (ca. 20%) discordance between in vitro and in silico spoligotyping results was observed. While we discourage comparing conventional spoligotyping with in silico equivalents, we advise the use of the latter, as it improves the accuracy of spoligopatterns, and thus depicts the relatedness between the isolates more reliably.
Collapse
Affiliation(s)
- Zofia Bakuła
- Department of Medical Microbiology, Institute of Microbiology, Faculty of Biology, University of Warsaw, Poland.
| | - Mikołaj Dziurzyński
- Department of Biology (DBIO), University of Florence, via Madonna del Piano 10, Sesto Fiorentino 50019, Italy.
| | - Przemysław Decewicz
- Department of Environmental Microbiology and Biotechnology, Institute of Microbiology, Faculty of Biology, University of Warsaw, Poland.
| | - Daiva Bakonytė
- Department of Immunology and Cell Biology, Institute of Biotechnology, Life Sciences Center, Vilnius University, Lithuania.
| | - Laima Vasiliauskaitė
- Department of Physiology, Biochemistry, Microbiology and Laboratory Medicine, Institute of Biomedical Sciences, Vilnius University, Lithuania; Institute of Biotechnology, Life Sciences Center, Vilnius University, Lithuania; Centre of Laboratory Medicine, Laboratory of Infectious Diseases and Tuberculosis, Vilnius University Hospital Santaros klinikos, Lithuania.
| | - Birutė Nakčerienė
- Institute of Biotechnology, Life Sciences Center, Vilnius University, Lithuania; Department of Programs and State Tuberculosis Information System, Vilnius University Hospital Santaros klinikos, Vilnius, Lithuania.
| | - Rafał Krenke
- Department of Internal Medicine, Pulmonary Diseases & Allergy, Medical University of Warsaw, Warsaw, Poland.
| | - Petras Stakėnas
- Department of Immunology and Cell Biology, Institute of Biotechnology, Life Sciences Center, Vilnius University, Lithuania.
| | - Tomasz Jagielski
- Department of Medical Microbiology, Institute of Microbiology, Faculty of Biology, University of Warsaw, Poland.
| |
Collapse
|
3
|
Reis AC, Salvador LCM, Robbe-Austerman S, Tenreiro R, Botelho A, Albuquerque T, Cunha MV. Whole Genome Sequencing Refines Knowledge on the Population Structure of Mycobacterium bovis from a Multi-Host Tuberculosis System. Microorganisms 2021; 9:1585. [PMID: 34442664 PMCID: PMC8401292 DOI: 10.3390/microorganisms9081585] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2021] [Revised: 07/22/2021] [Accepted: 07/22/2021] [Indexed: 12/30/2022] Open
Abstract
Classical molecular analyses of Mycobacterium bovis based on spoligotyping and Variable Number Tandem Repeat (MIRU-VNTR) brought the first insights into the epidemiology of animal tuberculosis (TB) in Portugal, showing high genotypic diversity of circulating strains that mostly cluster within the European 2 clonal complex. Previous surveillance provided valuable information on the prevalence and spatial occurrence of TB and highlighted prevalent genotypes in areas where livestock and wild ungulates are sympatric. However, links at the wildlife-livestock interfaces were established mainly via classical genotype associations. Here, we apply whole genome sequencing (WGS) to cattle, red deer and wild boar isolates to reconstruct the M. bovis population structure in a multi-host, multi-region disease system and to explore links at a fine genomic scale between M. bovis from wildlife hosts and cattle. Whole genome sequences of 44 representative M. bovis isolates, obtained between 2003 and 2015 from three TB hotspots, were compared through single nucleotide polymorphism (SNP) variant calling analyses. Consistent with previous results combining classical genotyping with Bayesian population admixture modelling, SNP-based phylogenies support the branching of this M. bovis population into five genetic clades, three with apparent geographic specificities, as well as the establishment of an SNP catalogue specific to each clade, which may be explored in the future as phylogenetic markers. The core genome alignment of SNPs was integrated within a spatiotemporal metadata framework to further structure this M. bovis population by host species and TB hotspots, providing a baseline for network analyses in different epidemiological and disease control contexts. WGS of M. bovis isolates from Portugal is reported for the first time in this pilot study, refining the spatiotemporal context of TB at the wildlife-livestock interface and providing further support to the key role of red deer and wild boar on disease maintenance. The SNP diversity observed within this dataset supports the natural circulation of M. bovis for a long time period, as well as multiple introduction events of the pathogen in this Iberian multi-host system.
Collapse
Affiliation(s)
- Ana C. Reis
- Centre for Ecology, Evolution and Environmental Changes (cE3c), Faculdade de Ciências da Universidade de Lisboa, 1749-016 Lisboa, Portugal;
- Biosystems & Integrative Sciences Institute (BioISI), Faculdade de Ciências da Universidade de Lisboa, 1749-016 Lisboa, Portugal;
| | - Liliana C. M. Salvador
- Department of Infectious Diseases, College of Veterinary Medicine, University of Georgia, Athens, GA 30602, USA;
- Institute of Bioinformatics, University of Georgia, Athens, GA 30602, USA
- Center for the Ecology of Infectious Diseases, University of Georgia, Athens, GA 30602, USA
| | | | - Rogério Tenreiro
- Biosystems & Integrative Sciences Institute (BioISI), Faculdade de Ciências da Universidade de Lisboa, 1749-016 Lisboa, Portugal;
| | - Ana Botelho
- INIAV, IP-National Institute for Agrarian and Veterinary Research, 2780-157 Oeiras, Portugal; (A.B.); (T.A.)
| | - Teresa Albuquerque
- INIAV, IP-National Institute for Agrarian and Veterinary Research, 2780-157 Oeiras, Portugal; (A.B.); (T.A.)
| | - Mónica V. Cunha
- Centre for Ecology, Evolution and Environmental Changes (cE3c), Faculdade de Ciências da Universidade de Lisboa, 1749-016 Lisboa, Portugal;
- Biosystems & Integrative Sciences Institute (BioISI), Faculdade de Ciências da Universidade de Lisboa, 1749-016 Lisboa, Portugal;
| |
Collapse
|
4
|
Asare P, Asante-Poku A, Osei-Wusu S, Otchere ID, Yeboah-Manu D. The Relevance of Genomic Epidemiology for Control of Tuberculosis in West Africa. Front Public Health 2021; 9:706651. [PMID: 34368069 PMCID: PMC8342769 DOI: 10.3389/fpubh.2021.706651] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2021] [Accepted: 06/29/2021] [Indexed: 12/30/2022] Open
Abstract
Tuberculosis (TB), an airborne infectious disease caused by Mycobacterium tuberculosis complex (MTBC), remains a global health problem. West Africa has a unique epidemiology of TB that is characterized by medium- to high-prevalence. Moreover, the geographical restriction of M. africanum to the sub-region makes West Africa have an extra burden to deal with a two-in-one pathogen. The region is also burdened with low case detection, late reporting, poor treatment adherence leading to development of drug resistance and relapse. Sporadic studies conducted within the subregion report higher burden of drug resistant TB (DRTB) than previously thought. The need for more sensitive and robust tools for routine surveillance as well as to understand the mechanisms of DRTB and transmission dynamics for the design of effective control tools, cannot be overemphasized. The advancement in molecular biology tools including traditional fingerprinting and next generation sequencing (NGS) technologies offer reliable tools for genomic epidemiology. Genomic epidemiology provides in-depth insight of the nature of pathogens, circulating strains and their spread as well as prompt detection of the emergence of new strains. It also offers the opportunity to monitor treatment and evaluate interventions. Furthermore, genomic epidemiology can be used to understand potential emergence and spread of drug resistant strains and resistance mechanisms allowing the design of simple but rapid tools. In this review, we will describe the local epidemiology of MTBC, highlight past and current investigations toward understanding their biology and spread as well as discuss the relevance of genomic epidemiology studies to TB control in West Africa.
Collapse
Affiliation(s)
- Prince Asare
- College of Health Sciences, Noguchi Memorial Institute for Medical Research, University of Ghana, Accra, Ghana
| | - Adwoa Asante-Poku
- College of Health Sciences, Noguchi Memorial Institute for Medical Research, University of Ghana, Accra, Ghana
| | - Stephen Osei-Wusu
- College of Health Sciences, Noguchi Memorial Institute for Medical Research, University of Ghana, Accra, Ghana
| | - Isaac Darko Otchere
- College of Health Sciences, Noguchi Memorial Institute for Medical Research, University of Ghana, Accra, Ghana
| | - Dorothy Yeboah-Manu
- College of Health Sciences, Noguchi Memorial Institute for Medical Research, University of Ghana, Accra, Ghana
| |
Collapse
|
5
|
Le Hang NT, Hijikata M, Maeda S, Miyabayashi A, Wakabayashi K, Seto S, Diem NTK, Yen NTT, Van Duc L, Thuong PH, Van Huan H, Hoang NP, Mitarai S, Keicho N, Kato S. Phenotypic and genotypic features of the Mycobacterium tuberculosis lineage 1 subgroup in central Vietnam. Sci Rep 2021; 11:13609. [PMID: 34193941 PMCID: PMC8245516 DOI: 10.1038/s41598-021-92984-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2021] [Accepted: 06/15/2021] [Indexed: 11/09/2022] Open
Abstract
Mycobacterium tuberculosis (Mtb) has different features depending on different geographic areas. We collected Mtb strains from patients with smear-positive pulmonary tuberculosis in Da Nang, central Vietnam. Using a whole genome sequencing platform, including genome assembly complemented by long-read-sequencing data, genomic characteristics were studied. Of 181 Mtb isolates, predominant Vietnamese EAI4_VNM and EAI4-like spoligotypes (31.5%), ZERO strains (5.0%), and part of EAI5 (11.1%) were included in a lineage-1 (L1) sublineage, i.e., L1.1.1.1. These strains were found less often in younger people, and they genetically clustered less frequently than other modern strains. Patients infected with ZERO strains demonstrated less lung infiltration. A region in RD2bcg spanning six loci, i.e., PE_PGRS35, cfp21, Rv1985c, Rv1986, Rv1987, and erm(37), was deleted in EAI4_VNM, EAI4-like, and ZERO strains, whereas another 118 bp deletion in furA was specific only to ZERO strains. L1.1.1.1-sublineage-specific deletions in PE_PGRS4 and PE_PGRS22 were also identified. RD900, seen in ancestral lineages, was present in majority of the L1 members. All strains without IS6110 (5.0%) had the ZERO spoligo-pattern. Distinctive features of the ancestral L1 strains provide a basis for investigation of the modern versus ancestral Mtb lineages and allow consideration of countermeasures against this heterogeneous pathogen.
Collapse
Affiliation(s)
| | - Minako Hijikata
- Department of Pathophysiology and Host Defense, The Research Institute of Tuberculosis, JATA, Tokyo, Japan
| | - Shinji Maeda
- Faculty of Pharmaceutical Sciences, Hokkaido University of Science, Hokkaido, Japan
| | - Akiko Miyabayashi
- Department of Pathophysiology and Host Defense, The Research Institute of Tuberculosis, JATA, Tokyo, Japan
| | - Keiko Wakabayashi
- Department of Pathophysiology and Host Defense, The Research Institute of Tuberculosis, JATA, Tokyo, Japan
| | - Shintaro Seto
- Department of Pathophysiology and Host Defense, The Research Institute of Tuberculosis, JATA, Tokyo, Japan
| | | | | | - Le Van Duc
- Da Nang General Hospital, Da Nang, Vietnam
| | | | | | | | - Satoshi Mitarai
- Department of Mycobacterium Reference and Research, The Research Institute of Tuberculosis, JATA, Tokyo, Japan
| | - Naoto Keicho
- The Research Institute of Tuberculosis, JATA, Japan Anti-Tuberculosis Association, 3-1-24 Matsuyama, Kiyose, Tokyo, 204-8533, Japan. .,National Center for Global Health and Medicine, Tokyo, Japan.
| | - Seiya Kato
- The Research Institute of Tuberculosis, JATA, Japan Anti-Tuberculosis Association, 3-1-24 Matsuyama, Kiyose, Tokyo, 204-8533, Japan
| |
Collapse
|
6
|
Genotyping of Mycobacterium tuberculosis spreading in Hanoi, Vietnam using conventional and whole genome sequencing methods. INFECTION GENETICS AND EVOLUTION 2019; 78:104107. [PMID: 31706080 DOI: 10.1016/j.meegid.2019.104107] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/10/2019] [Revised: 09/24/2019] [Accepted: 11/04/2019] [Indexed: 12/22/2022]
Abstract
Hanoi is the capital of Vietnam, one of the 30 countries with a high tuberculosis (TB) burden. Fundamental data on the molecular epidemiology of the disease is required for future TB management. To identify lineages and genotypes of Mycobacterium tuberculosis (Mtb), conventional genotyping data from clinical isolates of the Hanoi area was compared with whole genome sequencing (WGS) analysis from 332 of 470 samples. It was obtained from lineage-specific single nucleotide variants (SNVs), large sequence polymorphisms, spoligotyping, and variable number of tandem repeats (VNTR) analysis using mycobacterial interspersed repetitive unit (MIRU) and Japan anti-tuberculosis association (JATA) locus sets. This information was directly compared with results obtained from WGS. Mini-satellite repeat unit variants were identified using BLAST search against concatenated short read sequences, the RepUnitTyping tool. WGS analysis revealed that the Mtb strains tested are diverse and classified into lineage (L) 1, 2 and 4 (24.7, 57.2 and 18.1% respectively). The majority of the L2 strains were further divided into ancient and modern Beijing genotypes, and most of the L1 group were EAI4_VNM strains. Although conventional PCR-based genotyping results were mostly consistent with information obtained through WGS analysis, in-depth analysis identified aberrant deletions and spacers that may cause discordance. JATA-VNTR sets, including hypervariable loci, separated large Beijing genotypic clusters generated by MIRU15 into smaller groups. The distribution of repeat unit variants observed within 33 VNTR loci showed clear variation depending on the three lineages. WGS-based pairwise-SNV differences within VNTR-defined genotypic clusters were greater in L1 than in L2 and L4 (P = .001). Direct comparisons between results of PCR-based genotyping and in silico analysis of WGS data would bridge a gap between classical and modern technologies during this transition period, and provide further information on Mtb genotypes in specific geographical areas.
Collapse
|
7
|
Whole genome sequencing, analyses of drug resistance-conferring mutations, and correlation with transmission of Mycobacterium tuberculosis carrying katG-S315T in Hanoi, Vietnam. Sci Rep 2019; 9:15354. [PMID: 31653940 PMCID: PMC6814805 DOI: 10.1038/s41598-019-51812-7] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2019] [Accepted: 10/08/2019] [Indexed: 12/13/2022] Open
Abstract
Drug-resistant tuberculosis (TB) is a serious global problem, and pathogen factors involved in the transmission of isoniazid (INH)-resistant TB have not been fully investigated. We performed whole genome sequencing of 332 clinical Mycobacterium tuberculosis (Mtb) isolates collected from patients newly diagnosed with smear-positive pulmonary TB in Hanoi, Vietnam. Using a bacterial genome-wide approach based on linear mixed models, we investigated the associations between 31-bp k-mers and clustered strains harboring katG-S315T, a major INH-resistance mutation in the present cohort and in the second panel previously published in South Africa. Five statistically significant genes, namely, PPE18/19, gid, emrB, Rv1588c, and pncA, were shared by the two panels. We further identified variants of the genes responsible for these k-mers, which are relevant to the spread of INH-resistant strains. Phylogenetic convergence test showed that variants relevant to PPE46/47-like chimeric genes were significantly associated with the same phenotype in Hanoi. The associations were further confirmed after adjustment for the confounders. These findings suggest that genomic variations of the pathogen facilitate the expansion of INH-resistance TB, at least in part, and our study provides a new insight into the mechanisms by which drug-resistant Mtb maintains fitness and spreads in Asia and Africa.
Collapse
|