Chen Y, Zhang X, Liang J, Jiang Q, Peierdun M, Xu P, Takiff HE, Gao Q. Advantages of updated WHO mutation catalog combined with existing whole-genome sequencing-based approaches for Mycobacterium tuberculosis resistance prediction.
Genome Med 2025;
17:31. [PMID:
40140944 PMCID:
PMC11938600 DOI:
10.1186/s13073-025-01458-0]
[Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2024] [Accepted: 03/13/2025] [Indexed: 03/28/2025] Open
Abstract
BACKGROUND
The WHO recently released a second edition of the mutation catalog for predicting drug resistance in Mycobacterium tuberculosis (MTB). This study evaluated its effectiveness compared to existing whole-genome sequencing (WGS)-based prediction methods and proposes a novel approach for its optimization.
METHODS
We tested the accuracy of five tools-the WHO catalog, TB Profiler, SAM-TB, GenTB, and MD-CNN-for predicting drug susceptibility on a global dataset of 36,385 MTB isolates with high-quality phenotypic drug susceptibility testing (DST) and WGS data. By integrating the genotypic DST predictions of these five tools in an ensemble machine learning framework, we developed an improved computational model for MTB drug susceptibility prediction. We then validated the ensemble model on 860 MTB isolates with phenotypic and WGS data collected in Shenzhen, China (2013-2019) and Valencia, Spain (2014-2016).
RESULTS
Among the five genotypic DST tools for predicting susceptibility to ten drugs, MD-CNN exhibited the highest overall performance (AUC 92.1%; 95% CI 89.8-94.4%). The WHO catalog demonstrated the highest specificity of 97.3% (95% CI 95.8-98.4%), while TB Profiler had the best sensitivity at 79.5% (95% CI 71.8-86.2%). The ensemble machine learning model (AUC 93.4%; 95% CI 91.4-95.4%) outperformed all of the five individual tools, with a specificity of 95.4% (95% CI 93.0-97.6%) and a sensitivity of 84.1% (95% CI 78.8-88.8%), principally due to considerable improvements in second-line drug resistance predictions (AUC 91.8%; 95% CI 89.6-94.0%).
CONCLUSIONS
The second edition of the WHO MTB mutation catalog does not, by itself, perform better than existing tools for predicting MTB drug resistance. An integrative approach combining the WHO catalog with other genotypic DST methods significantly enhances prediction accuracy.
Collapse