Liu L, Jiang J, Wu L, Zeng DM, Yan C, Liang L, Shi J, Xie Q. Assessing the risk of concurrent mycoplasma pneumoniae pneumonia in children with tracheobronchial tuberculosis: retrospective study.
PeerJ 2024;
12:e17164. [PMID:
38560467 PMCID:
PMC10979740 DOI:
10.7717/peerj.17164]
[Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2023] [Accepted: 03/06/2024] [Indexed: 04/04/2024] Open
Abstract
Objective
This study aimed to create a predictive model based on machine learning to identify the risk for tracheobronchial tuberculosis (TBTB) occurring alongside Mycoplasma pneumoniae pneumonia in pediatric patients.
Methods
Clinical data from 212 pediatric patients were examined in this retrospective analysis. This cohort included 42 individuals diagnosed with TBTB and Mycoplasma pneumoniae pneumonia (combined group) and 170 patients diagnosed with lobar pneumonia alone (pneumonia group). Three predictive models, namely XGBoost, decision tree, and logistic regression, were constructed, and their performances were assessed using the receiver's operating characteristic (ROC) curve, precision-recall curve (PR), and decision curve analysis (DCA). The dataset was divided into a 7:3 ratio to test the first and second groups, utilizing them to validate the XGBoost model and to construct the nomogram model.
Results
The XGBoost highlighted eight significant signatures, while the decision tree and logistic regression models identified six and five signatures, respectively. The ROC analysis revealed an area under the curve (AUC) of 0.996 for XGBoost, significantly outperforming the other models (p < 0.05). Similarly, the PR curve demonstrated the superior predictive capability of XGBoost. DCA further confirmed that XGBoost offered the highest AIC (43.226), the highest average net benefit (0.764), and the best model fit. Validation efforts confirmed the robustness of the findings, with the validation groups 1 and 2 showing ROC and PR curves with AUC of 0.997, indicating a high net benefit. The nomogram model was shown to possess significant clinical value.
Conclusion
Compared to machine learning approaches, the XGBoost model demonstrated superior predictive efficacy in identifying pediatric patients at risk of concurrent TBTB and Mycoplasma pneumoniae pneumonia. The model's identification of critical signatures provides valuable insights into the pathogenesis of these conditions.
Collapse