Stratifying non-small cell lung cancer patients using an inverse of the treatment decision rules: validation using electronic health records with application to an administrative database.
BMC Med Inform Decis Mak 2023;
23:3. [PMID:
36609301 PMCID:
PMC9825000 DOI:
10.1186/s12911-022-02088-x]
[Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2021] [Accepted: 12/15/2022] [Indexed: 01/09/2023] Open
Abstract
BACKGROUND
To validate a stratification method using an inverse of treatment decision rules that can classify non-small cell lung cancer (NSCLC) patients in real-world treatment records.
METHODS
(1) To validate the index classifier against the TNM 7th edition, we analyzed electronic health records of NSCLC patients diagnosed from 2011 to 2015 in a tertiary referral hospital in Seoul, Korea. Predictive accuracy, stage-specific sensitivity, specificity, positive predictive value, negative predictive value, F1 score, and c-statistic were measured. (2) To apply the index classifier in an administrative database, we analyzed NSCLC patients in Korean National Health Insurance Database, 2002-2013. Differential survival rates among the classes were examined with the log-rank test, and class-specific survival rates were compared with the reference survival rates.
RESULTS
(1) In the validation study (N = 1375), the overall accuracy was 93.8% (95% CI: 92.5-95.0%). Stage-specific c-statistic was the highest for stage I (0.97, 95% CI: 0.96-0.98) and the lowest for stage III (0.82, 95% CI: 0.77-0.87). (2) In the application study (N = 71,593), the index classifier showed a tendency for differentiating survival probabilities among classes. Compared to the reference TNM survival rates, the index classification under-estimated the survival probability for stages IA, IIIB, and IV, and over-estimated it for stages IIA and IIB.
CONCLUSION
The inverse of the treatment decision rules has a potential to supplement a routinely collected database with information encoded in the treatment decision rules to classify NSCLC patients. It requires further validation and replication in multiple clinical settings.
Collapse