Handa P, Goel N, Indu S, Gunjan D. AI-KODA score application for cleanliness assessment in video capsule endoscopy frames.
MINIM INVASIV THER 2024;
33:311-320. [PMID:
39138994 DOI:
10.1080/13645706.2024.2390879]
[Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2024] [Accepted: 07/03/2024] [Indexed: 08/15/2024]
Abstract
BACKGROUND
Currently, there is no automated method for assessing cleanliness in video capsule endoscopy (VCE). Our objectives were to automate the process of evaluating and collecting medical scores of VCE frames according to the existing KOrea-CanaDA (KODA) scoring system by developing an easy-to-use mobile application called artificial intelligence-KODA (AI-KODA) score, as well as to determine the inter-rater and intra-rater reliability of the KODA score among three readers for prospective AI applications, and check the efficacy of the application.
METHOD
From the 28 patient capsule videos considered, 1539 sequential frames were selected at five-minute intervals, and 634 random frames were selected at random intervals during small bowel transit. The frames were processed and shifted to AI-KODA. Three readers (gastroenterology fellows), who had been trained in reading VCE, rated 2173 frames in duplicate four weeks apart after completing the training module on AI-KODA. The scores were saved automatically in real time. Reliability was assessed for each video using estimate of intra-class correlation coefficients (ICCs). Then, the AI dataset was developed using the frames and their respective scores, and it was subjected to automatic classification of the scores via the random forest and the k-nearest neighbors classifiers.
RESULTS
For sequential frames, ICCs for inter-rater variability were 'excellent' to 'good' among the three readers, while ICCs for intra-rater variability were 'good' to 'moderate'. For random frames, ICCs for inter-rater and intra-rater variability were 'excellent' among the three readers. The overall accuracy achieved was up to 61% for the random forest classifier and 62.38% for the k-nearest neighbors classifier.
CONCLUSIONS
AI-KODA automates the process of scoring VCE frames based on the existing KODA score. It saves time in cleanliness assessment and is user-friendly for research and clinical use. Comprehensive benchmarking of the AI dataset is in process.
Collapse