1
|
Dubois C, Eigen D, Simon F, Couloigner V, Gormish M, Chalumeau M, Schmoll L, Cohen JF. Development and validation of a smartphone-based deep-learning-enabled system to detect middle-ear conditions in otoscopic images. NPJ Digit Med 2024; 7:162. [PMID: 38902477 PMCID: PMC11189910 DOI: 10.1038/s41746-024-01159-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2023] [Accepted: 06/10/2024] [Indexed: 06/22/2024] Open
Abstract
Middle-ear conditions are common causes of primary care visits, hearing impairment, and inappropriate antibiotic use. Deep learning (DL) may assist clinicians in interpreting otoscopic images. This study included patients over 5 years old from an ambulatory ENT practice in Strasbourg, France, between 2013 and 2020. Digital otoscopic images were obtained using a smartphone-attached otoscope (Smart Scope, Karl Storz, Germany) and labeled by a senior ENT specialist across 11 diagnostic classes (reference standard). An Inception-v2 DL model was trained using 41,664 otoscopic images, and its diagnostic accuracy was evaluated by calculating class-specific estimates of sensitivity and specificity. The model was then incorporated into a smartphone app called i-Nside. The DL model was evaluated on a validation set of 3,962 images and a held-out test set comprising 326 images. On the validation set, all class-specific estimates of sensitivity and specificity exceeded 98%. On the test set, the DL model achieved a sensitivity of 99.0% (95% confidence interval: 94.5-100) and a specificity of 95.2% (91.5-97.6) for the binary classification of normal vs. abnormal images; wax plugs were detected with a sensitivity of 100% (94.6-100) and specificity of 97.7% (95.0-99.1); other class-specific estimates of sensitivity and specificity ranged from 33.3% to 92.3% and 96.0% to 100%, respectively. We present an end-to-end DL-enabled system able to achieve expert-level diagnostic accuracy for identifying normal tympanic aspects and wax plugs within digital otoscopic images. However, the system's performance varied for other middle-ear conditions. Further prospective validation is necessary before wider clinical deployment.
Collapse
Affiliation(s)
| | | | - François Simon
- Department of Pediatric Otolaryngology, Necker-Enfants malades Hospital, APHP, Université Paris Cité, Paris, France
| | - Vincent Couloigner
- Department of Pediatric Otolaryngology, Necker-Enfants malades Hospital, APHP, Université Paris Cité, Paris, France
| | | | - Martin Chalumeau
- Inserm UMR1153 (CRESS), Université Paris Cité, Paris, France
- Department of General Pediatrics and Pediatric Infectious Diseases, Necker-Enfants malades Hospital, APHP, Université Paris Cité, Paris, France
| | | | - Jérémie F Cohen
- Inserm UMR1153 (CRESS), Université Paris Cité, Paris, France.
- Department of General Pediatrics and Pediatric Infectious Diseases, Necker-Enfants malades Hospital, APHP, Université Paris Cité, Paris, France.
| |
Collapse
|
2
|
Noda M, Yoshimura H, Okubo T, Koshu R, Uchiyama Y, Nomura A, Ito M, Takumi Y. Feasibility of Multimodal Artificial Intelligence Using GPT-4 Vision for the Classification of Middle Ear Disease: Qualitative Study and Validation. JMIR AI 2024; 3:e58342. [PMID: 38875669 PMCID: PMC11179042 DOI: 10.2196/58342] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/13/2024] [Revised: 04/23/2024] [Accepted: 05/07/2024] [Indexed: 06/16/2024]
Abstract
BACKGROUND The integration of artificial intelligence (AI), particularly deep learning models, has transformed the landscape of medical technology, especially in the field of diagnosis using imaging and physiological data. In otolaryngology, AI has shown promise in image classification for middle ear diseases. However, existing models often lack patient-specific data and clinical context, limiting their universal applicability. The emergence of GPT-4 Vision (GPT-4V) has enabled a multimodal diagnostic approach, integrating language processing with image analysis. OBJECTIVE In this study, we investigated the effectiveness of GPT-4V in diagnosing middle ear diseases by integrating patient-specific data with otoscopic images of the tympanic membrane. METHODS The design of this study was divided into two phases: (1) establishing a model with appropriate prompts and (2) validating the ability of the optimal prompt model to classify images. In total, 305 otoscopic images of 4 middle ear diseases (acute otitis media, middle ear cholesteatoma, chronic otitis media, and otitis media with effusion) were obtained from patients who visited Shinshu University or Jichi Medical University between April 2010 and December 2023. The optimized GPT-4V settings were established using prompts and patients' data, and the model created with the optimal prompt was used to verify the diagnostic accuracy of GPT-4V on 190 images. To compare the diagnostic accuracy of GPT-4V with that of physicians, 30 clinicians completed a web-based questionnaire consisting of 190 images. RESULTS The multimodal AI approach achieved an accuracy of 82.1%, which is superior to that of certified pediatricians at 70.6%, but trailing behind that of otolaryngologists at more than 95%. The model's disease-specific accuracy rates were 89.2% for acute otitis media, 76.5% for chronic otitis media, 79.3% for middle ear cholesteatoma, and 85.7% for otitis media with effusion, which highlights the need for disease-specific optimization. Comparisons with physicians revealed promising results, suggesting the potential of GPT-4V to augment clinical decision-making. CONCLUSIONS Despite its advantages, challenges such as data privacy and ethical considerations must be addressed. Overall, this study underscores the potential of multimodal AI for enhancing diagnostic accuracy and improving patient care in otolaryngology. Further research is warranted to optimize and validate this approach in diverse clinical settings.
Collapse
Affiliation(s)
- Masao Noda
- Department of Otolaryngology, Head and Neck Surgery, Jichi Medical University, Shimotsuke, Japan
| | - Hidekane Yoshimura
- Department of Otolaryngology - Head and Neck Surgery, Shinshu University, Matsumoto, Japan
| | - Takuya Okubo
- Department of Otolaryngology - Head and Neck Surgery, Shinshu University, Matsumoto, Japan
| | - Ryota Koshu
- Department of Otolaryngology, Head and Neck Surgery, Jichi Medical University, Shimotsuke, Japan
| | - Yuki Uchiyama
- Department of Otolaryngology - Head and Neck Surgery, Shinshu University, Matsumoto, Japan
| | - Akihiro Nomura
- College of Transdisciplinary Sciences for Innovation, Kanazawa University, Kanazawa, Japan
| | - Makoto Ito
- Department of Otolaryngology, Head and Neck Surgery, Jichi Medical University, Shimotsuke, Japan
| | - Yutaka Takumi
- Department of Otolaryngology - Head and Neck Surgery, Shinshu University, Matsumoto, Japan
| |
Collapse
|
3
|
Shaikh N, Conway SJ, Kovačević J, Condessa F, Shope TR, Haralam MA, Campese C, Lee MC, Larsson T, Cavdar Z, Hoberman A. Development and Validation of an Automated Classifier to Diagnose Acute Otitis Media in Children. JAMA Pediatr 2024; 178:401-407. [PMID: 38436941 PMCID: PMC10985552 DOI: 10.1001/jamapediatrics.2024.0011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/29/2023] [Accepted: 11/17/2023] [Indexed: 03/05/2024]
Abstract
Importance Acute otitis media (AOM) is a frequently diagnosed illness in children, yet the accuracy of diagnosis has been consistently low. Multiple neural networks have been developed to recognize the presence of AOM with limited clinical application. Objective To develop and internally validate an artificial intelligence decision-support tool to interpret videos of the tympanic membrane and enhance accuracy in the diagnosis of AOM. Design, Setting, and Participants This diagnostic study analyzed otoscopic videos of the tympanic membrane captured using a smartphone during outpatient clinic visits at 2 sites in Pennsylvania between 2018 and 2023. Eligible participants included children who presented for sick visits or wellness visits. Exposure Otoscopic examination. Main Outcomes and Measures Using the otoscopic videos that were annotated by validated otoscopists, a deep residual-recurrent neural network was trained to predict both features of the tympanic membrane and the diagnosis of AOM vs no AOM. The accuracy of this network was compared with a second network trained using a decision tree approach. A noise quality filter was also trained to prompt users that the video segment acquired may not be adequate for diagnostic purposes. Results Using 1151 videos from 635 children (majority younger than 3 years of age), the deep residual-recurrent neural network had almost identical diagnostic accuracy as the decision tree network. The finalized deep residual-recurrent neural network algorithm classified tympanic membrane videos into AOM vs no AOM categories with a sensitivity of 93.8% (95% CI, 92.6%-95.0%) and specificity of 93.5% (95% CI, 92.8%-94.3%) and the decision tree model had a sensitivity of 93.7% (95% CI, 92.4%-94.9%) and specificity of 93.3% (92.5%-94.1%). Of the tympanic membrane features outputted, bulging of the TM most closely aligned with the predicted diagnosis; bulging was present in 230 of 230 cases (100%) in which the diagnosis was predicted to be AOM in the test set. Conclusions and Relevance These findings suggest that given its high accuracy, the algorithm and medical-grade application that facilitates image acquisition and quality filtering could reasonably be used in primary care or acute care settings to aid with automated diagnosis of AOM and decisions regarding treatment.
Collapse
Affiliation(s)
- Nader Shaikh
- Department of Pediatrics, Division of General Academic Pediatrics, University of Pittsburgh School of Medicine, University of Pittsburgh Medical Center Children’s Hospital of Pittsburgh, Pennsylvania
| | - Shannon J. Conway
- Department of Pediatrics, Division of General Academic Pediatrics, University of Pittsburgh School of Medicine, University of Pittsburgh Medical Center Children’s Hospital of Pittsburgh, Pennsylvania
| | - Jelena Kovačević
- Tandon School of Engineering, New York University, New York, New York
| | - Filipe Condessa
- Bosch Center for Artificial Intelligence, Pittsburgh, Pennsylvania
| | - Timothy R. Shope
- Department of Pediatrics, Division of General Academic Pediatrics, University of Pittsburgh School of Medicine, University of Pittsburgh Medical Center Children’s Hospital of Pittsburgh, Pennsylvania
| | - Mary Ann Haralam
- Department of Pediatrics, Division of General Academic Pediatrics, University of Pittsburgh School of Medicine, University of Pittsburgh Medical Center Children’s Hospital of Pittsburgh, Pennsylvania
| | - Catherine Campese
- Department of Pediatrics, Division of General Academic Pediatrics, University of Pittsburgh School of Medicine, University of Pittsburgh Medical Center Children’s Hospital of Pittsburgh, Pennsylvania
| | - Matthew C. Lee
- Department of Pediatrics, Division of General Academic Pediatrics, University of Pittsburgh School of Medicine, University of Pittsburgh Medical Center Children’s Hospital of Pittsburgh, Pennsylvania
| | | | | | - Alejandro Hoberman
- Department of Pediatrics, Division of General Academic Pediatrics, University of Pittsburgh School of Medicine, University of Pittsburgh Medical Center Children’s Hospital of Pittsburgh, Pennsylvania
| |
Collapse
|
4
|
Ding X, Huang Y, Tian X, Zhao Y, Feng G, Gao Z. Diagnosis, Treatment, and Management of Otitis Media with Artificial Intelligence. Diagnostics (Basel) 2023; 13:2309. [PMID: 37443702 DOI: 10.3390/diagnostics13132309] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2023] [Revised: 06/04/2023] [Accepted: 06/14/2023] [Indexed: 07/15/2023] Open
Abstract
A common infectious disease, otitis media (OM) has a low rate of early diagnosis, which significantly increases the difficulty of treating the disease and the likelihood of serious complications developing including hearing loss, speech impairment, and even intracranial infection. Several areas of healthcare have shown great promise in the application of artificial intelligence (AI) systems, such as the accurate detection of diseases, the automated interpretation of images, and the prediction of patient outcomes. Several articles have reported some machine learning (ML) algorithms such as ResNet, InceptionV3 and Unet, were applied to the diagnosis of OM successfully. The use of these techniques in the OM is still in its infancy, but their potential is enormous. We present in this review important concepts related to ML and AI, describe how these technologies are currently being applied to diagnosing, treating, and managing OM, and discuss the challenges associated with developing AI-assisted OM technologies in the future.
Collapse
Affiliation(s)
- Xin Ding
- Department of Otorhinolaryngology Head and Neck Surgery, The Peaking Union Medical College Hospital, No. 1, Shuaifuyuan, Dongcheng District, Beijing 100010, China
| | - Yu Huang
- Department of Otorhinolaryngology Head and Neck Surgery, The Peaking Union Medical College Hospital, No. 1, Shuaifuyuan, Dongcheng District, Beijing 100010, China
| | - Xu Tian
- Department of Otorhinolaryngology Head and Neck Surgery, The Peaking Union Medical College Hospital, No. 1, Shuaifuyuan, Dongcheng District, Beijing 100010, China
| | - Yang Zhao
- Department of Otorhinolaryngology Head and Neck Surgery, The Peaking Union Medical College Hospital, No. 1, Shuaifuyuan, Dongcheng District, Beijing 100010, China
| | - Guodong Feng
- Department of Otorhinolaryngology Head and Neck Surgery, The Peaking Union Medical College Hospital, No. 1, Shuaifuyuan, Dongcheng District, Beijing 100010, China
| | - Zhiqiang Gao
- Department of Otorhinolaryngology Head and Neck Surgery, The Peaking Union Medical College Hospital, No. 1, Shuaifuyuan, Dongcheng District, Beijing 100010, China
| |
Collapse
|
5
|
Cao Z, Chen F, Grais EM, Yue F, Cai Y, Swanepoel DW, Zhao F. Machine Learning in Diagnosing Middle Ear Disorders Using Tympanic Membrane Images: A Meta-Analysis. Laryngoscope 2023; 133:732-741. [PMID: 35848851 DOI: 10.1002/lary.30291] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2022] [Revised: 06/18/2022] [Accepted: 06/21/2022] [Indexed: 11/12/2022]
Abstract
OBJECTIVE To systematically evaluate the development of Machine Learning (ML) models and compare their diagnostic accuracy for the classification of Middle Ear Disorders (MED) using Tympanic Membrane (TM) images. METHODS PubMed, EMBASE, CINAHL, and CENTRAL were searched up until November 30, 2021. Studies on the development of ML approaches for diagnosing MED using TM images were selected according to the inclusion criteria. PRISMA guidelines were followed with study design, analysis method, and outcomes extracted. Sensitivity, specificity, and area under the curve (AUC) were used to summarize the performance metrics of the meta-analysis. Risk of Bias was assessed using the Quality Assessment of Diagnostic Accuracy Studies-2 tool in combination with the Prediction Model Risk of Bias Assessment Tool. RESULTS Sixteen studies were included, encompassing 20254 TM images (7025 normal TM and 13229 MED). The sample size ranged from 45 to 6066 per study. The accuracy of the 25 included ML approaches ranged from 76.00% to 98.26%. Eleven studies (68.8%) were rated as having a low risk of bias, with the reference standard as the major domain of high risk of bias (37.5%). Sensitivity and specificity were 93% (95% CI, 90%-95%) and 85% (95% CI, 82%-88%), respectively. The AUC of total TM images was 94% (95% CI, 91%-96%). The greater AUC was found using otoendoscopic images than otoscopic images. CONCLUSIONS ML approaches perform robustly in distinguishing between normal ears and MED, however, it is proposed that a standardized TM image acquisition and annotation protocol should be developed. LEVEL OF EVIDENCE NA Laryngoscope, 133:732-741, 2023.
Collapse
Affiliation(s)
- Zuwei Cao
- Center for Rehabilitative Auditory Research, Guizhou Provincial People's Hospital, Guiyang City, China
| | - Feifan Chen
- Centre for Speech and Language Therapy and Hearing Science, Cardiff School of Sport and Health Sciences, Cardiff Metropolitan University, Cardiff, UK
| | - Emad M Grais
- Centre for Speech and Language Therapy and Hearing Science, Cardiff School of Sport and Health Sciences, Cardiff Metropolitan University, Cardiff, UK
| | - Fengjuan Yue
- Medical Examination Center, Guizhou Provincial People's Hospital, Guiyang City, China
| | - Yuexin Cai
- Department of Otolaryngology, Sun Yat-sen Memorial Hospital, Sun Yat-sen University, Guangzhou City, China
| | - De Wet Swanepoel
- Department of Speech-Language Pathology and Audiology, University of Pretoria, Pretoria, South Africa
| | - Fei Zhao
- Centre for Speech and Language Therapy and Hearing Science, Cardiff School of Sport and Health Sciences, Cardiff Metropolitan University, Cardiff, UK
| |
Collapse
|
6
|
Habib AR, Xu Y, Bock K, Mohanty S, Sederholm T, Weeks WB, Dodhia R, Ferres JL, Perry C, Sacks R, Singh N. Evaluating the generalizability of deep learning image classification algorithms to detect middle ear disease using otoscopy. Sci Rep 2023; 13:5368. [PMID: 37005441 PMCID: PMC10067817 DOI: 10.1038/s41598-023-31921-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2022] [Accepted: 03/20/2023] [Indexed: 04/04/2023] Open
Abstract
To evaluate the generalizability of artificial intelligence (AI) algorithms that use deep learning methods to identify middle ear disease from otoscopic images, between internal to external performance. 1842 otoscopic images were collected from three independent sources: (a) Van, Turkey, (b) Santiago, Chile, and (c) Ohio, USA. Diagnostic categories consisted of (i) normal or (ii) abnormal. Deep learning methods were used to develop models to evaluate internal and external performance, using area under the curve (AUC) estimates. A pooled assessment was performed by combining all cohorts together with fivefold cross validation. AI-otoscopy algorithms achieved high internal performance (mean AUC: 0.95, 95%CI: 0.80-1.00). However, performance was reduced when tested on external otoscopic images not used for training (mean AUC: 0.76, 95%CI: 0.61-0.91). Overall, external performance was significantly lower than internal performance (mean difference in AUC: -0.19, p ≤ 0.04). Combining cohorts achieved a substantial pooled performance (AUC: 0.96, standard error: 0.01). Internally applied algorithms for otoscopy performed well to identify middle ear disease from otoscopy images. However, external performance was reduced when applied to new test cohorts. Further efforts are required to explore data augmentation and pre-processing techniques that might improve external performance and develop a robust, generalizable algorithm for real-world clinical applications.
Collapse
Affiliation(s)
- Al-Rahim Habib
- Faculty of Medicine and Health, University of Sydney, Sydney, NSW, Australia.
- Department of Otolaryngology, Head and Neck Surgery, Westmead Hospital, Sydney, NSW, Australia.
| | - Yixi Xu
- AI for Good Lab, Microsoft, Redmond, WA, USA
| | - Kris Bock
- Azure FastTrack Engineering, Brisbane, QLD, Australia
| | | | | | | | | | | | - Chris Perry
- University of Queensland Medical School, Brisbane, QLD, Australia
| | - Raymond Sacks
- Faculty of Medicine and Health, University of Sydney, Sydney, NSW, Australia
| | - Narinder Singh
- Faculty of Medicine and Health, University of Sydney, Sydney, NSW, Australia
- Department of Otolaryngology, Head and Neck Surgery, Westmead Hospital, Sydney, NSW, Australia
| |
Collapse
|
7
|
Mao S, Wu X, Hou M, Mei L, Feng Y, Song J. Research and application progress in deep learning in otology. ZHONG NAN DA XUE XUE BAO. YI XUE BAN = JOURNAL OF CENTRAL SOUTH UNIVERSITY. MEDICAL SCIENCES 2023; 48:463-471. [PMID: 37164930 PMCID: PMC10930069 DOI: 10.11817/j.issn.1672-7347.2023.210588] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Subscribe] [Scholar Register] [Received: 09/26/2021] [Indexed: 05/12/2023]
Abstract
With the optimization of deep learning algorithms and the accumulation of medical big data, deep learning technology has been widely applied in research in various fields of otology in recent years. At present, research on deep learning in otology is combined with a variety of data such as endoscopy, temporal bone images, audiograms, and intraoperative images, which involves diagnosis of otologic diseases (including auricular malformations, external auditory canal diseases, middle ear diseases, and inner ear diseases), treatment (guiding medication and surgical planning), and prognosis prediction (involving hearing regression and speech learning). According to the type of data and the purpose of the study (disease diagnosis, treatment and prognosis), the different neural network models can be used to take advantage of their algorithms, and the deep learning can be a good aid in treating otologic diseases. The deep learning has a good applicable prospect in the clinical diagnosis and treatment of otologic diseases, which can play a certain role in promoting the development of deep learning combined with intelligent medicine.
Collapse
Affiliation(s)
- Shuang Mao
- Department of Otorhinolaryngology Head and Neck Surgery, Xiangya Hospital, Central South University, Changsha 410008.
- Hunan Provincial Key Laboratory of Major Otorhinolaryngology Diseases, Changsha 410008.
- National Clinical Research Center for Geriatric Diseases (Xiangya Hospital), Changsha 410008.
| | - Xuewen Wu
- Department of Otorhinolaryngology Head and Neck Surgery, Xiangya Hospital, Central South University, Changsha 410008
- Hunan Provincial Key Laboratory of Major Otorhinolaryngology Diseases, Changsha 410008
- National Clinical Research Center for Geriatric Diseases (Xiangya Hospital), Changsha 410008
| | - Muzhou Hou
- School of Mathematics and Statistics, Central South University, Changsha 410083
| | - Lingyun Mei
- Department of Otorhinolaryngology Head and Neck Surgery, Xiangya Hospital, Central South University, Changsha 410008
- Hunan Provincial Key Laboratory of Major Otorhinolaryngology Diseases, Changsha 410008
- National Clinical Research Center for Geriatric Diseases (Xiangya Hospital), Changsha 410008
| | - Yong Feng
- Department of Otorhinolaryngology Head and Neck Surgery, Xiangya Hospital, Central South University, Changsha 410008
- Hunan Provincial Key Laboratory of Major Otorhinolaryngology Diseases, Changsha 410008
- Department of Otorhinolaryngology Head and Neck Surgery, Changsha Central Hospital Affiliated to South China University, Changsha 410018, China
| | - Jian Song
- Department of Otorhinolaryngology Head and Neck Surgery, Xiangya Hospital, Central South University, Changsha 410008.
- Hunan Provincial Key Laboratory of Major Otorhinolaryngology Diseases, Changsha 410008.
- National Clinical Research Center for Geriatric Diseases (Xiangya Hospital), Changsha 410008.
| |
Collapse
|
8
|
Byun H, Lee SH, Kim TH, Oh J, Chung JH. Feasibility of the Machine Learning Network to Diagnose Tympanic Membrane Lesions without Coding Experience. J Pers Med 2022; 12:jpm12111855. [PMID: 36579584 PMCID: PMC9697619 DOI: 10.3390/jpm12111855] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2022] [Revised: 09/29/2022] [Accepted: 10/31/2022] [Indexed: 11/10/2022] Open
Abstract
A machine learning platform operated without coding knowledge (Teachable machine®) has been introduced. The aims of the present study were to assess the performance of the Teachable machine® for diagnosing tympanic membrane lesions. A total of 3024 tympanic membrane images were used to train and validate the diagnostic performance of the network. Tympanic membrane images were labeled as normal, otitis media with effusion (OME), chronic otitis media (COM), and cholesteatoma. According to the complexity of the categorization, Level I refers to normal versus abnormal tympanic membrane; Level II was defined as normal, OME, or COM + cholesteatoma; and Level III distinguishes between all four pathologies. In addition, eighty representative test images were used to assess the performance. Teachable machine® automatically creates a classification network and presents diagnostic performance when images are uploaded. The mean accuracy of the Teachable machine® for classifying tympanic membranes as normal or abnormal (Level I) was 90.1%. For Level II, the mean accuracy was 89.0% and for Level III it was 86.2%. The overall accuracy of the classification of the 80 representative tympanic membrane images was 78.75%, and the hit rates for normal, OME, COM, and cholesteatoma were 95.0%, 70.0%, 90.0%, and 60.0%, respectively. Teachable machine® could successfully generate the diagnostic network for classifying tympanic membrane.
Collapse
Affiliation(s)
- Hayoung Byun
- Department of Otolaryngology & Head and Neck Surgery, College of Medicine, Hanyang University, Seoul 04763, Korea
- Machine Learning Research Center for Medical Data, Hanyang University, Seoul 04763, Korea
| | - Seung Hwan Lee
- Department of Otolaryngology & Head and Neck Surgery, College of Medicine, Hanyang University, Seoul 04763, Korea
| | - Tae Hyun Kim
- Machine Learning Research Center for Medical Data, Hanyang University, Seoul 04763, Korea
- Department of Computer Science, Hanyang University, Seoul 04763, Korea
| | - Jaehoon Oh
- Machine Learning Research Center for Medical Data, Hanyang University, Seoul 04763, Korea
- Department of Emergency Medicine, College of Medicine, Hanyang University, Seoul 04763, Korea
| | - Jae Ho Chung
- Department of Otolaryngology & Head and Neck Surgery, College of Medicine, Hanyang University, Seoul 04763, Korea
- Machine Learning Research Center for Medical Data, Hanyang University, Seoul 04763, Korea
- Department of HY-KIST Bio-Convergence, College of Medicine, Hanyang University, Seoul 04763, Korea
- Correspondence:
| |
Collapse
|
9
|
Choi Y, Chae J, Park K, Hur J, Kweon J, Ahn JH. Automated multi-class classification for prediction of tympanic membrane changes with deep learning models. PLoS One 2022; 17:e0275846. [PMID: 36215265 PMCID: PMC9550050 DOI: 10.1371/journal.pone.0275846] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2022] [Accepted: 09/25/2022] [Indexed: 11/18/2022] Open
Abstract
BACKGROUNDS AND OBJECTIVE Evaluating the tympanic membrane (TM) using an otoendoscope is the first and most important step in various clinical fields. Unfortunately, most lesions of TM have more than one diagnostic name. Therefore, we built a database of otoendoscopic images with multiple diseases and investigated the impact of concurrent diseases on the classification performance of deep learning networks. STUDY DESIGN This retrospective study investigated the impact of concurrent diseases in the tympanic membrane on diagnostic performance using multi-class classification. A customized architecture of EfficientNet-B4 was introduced to predict the primary class (otitis media with effusion (OME), chronic otitis media (COM), and 'None' without OME and COM) and secondary classes (attic cholesteatoma, myringitis, otomycosis, and ventilating tube). RESULTS Deep-learning classifications accurately predicted the primary class with dice similarity coefficient (DSC) of 95.19%, while misidentification between COM and OME rarely occurred. Among the secondary classes, the diagnosis of attic cholesteatoma and myringitis achieved a DSC of 88.37% and 88.28%, respectively. Although concurrent diseases hampered the prediction performance, there was only a 0.44% probability of inaccurately predicting two or more secondary classes (29/6,630). The inference time per image was 2.594 ms on average. CONCLUSION Deep-learning classification can be used to support clinical decision-making by accurately and reproducibly predicting tympanic membrane changes in real time, even in the presence of multiple concurrent diseases.
Collapse
Affiliation(s)
- Yeonjoo Choi
- Department of Otorhinolaryngology-Head and Neck Surgery, University of Ulsan College of Medicine, Asan Medical Center, Seoul, Korea
| | - Jihye Chae
- Departments of Convergence Medicine, University of Ulsan College of Medicine, Asan Medical Center, Seoul, Korea
| | - Keunwoo Park
- Departments of Convergence Medicine, University of Ulsan College of Medicine, Asan Medical Center, Seoul, Korea
| | - Jaehee Hur
- Departments of Convergence Medicine, University of Ulsan College of Medicine, Asan Medical Center, Seoul, Korea
| | - Jihoon Kweon
- Departments of Convergence Medicine, University of Ulsan College of Medicine, Asan Medical Center, Seoul, Korea
- * E-mail: (JHA); (JK)
| | - Joong Ho Ahn
- Department of Otorhinolaryngology-Head and Neck Surgery, University of Ulsan College of Medicine, Asan Medical Center, Seoul, Korea
- * E-mail: (JHA); (JK)
| |
Collapse
|
10
|
Intelligent Control Techniques for the Detection of Biomedical Ear Infections. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022; 2022:9653513. [PMID: 36105634 PMCID: PMC9467762 DOI: 10.1155/2022/9653513] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/23/2022] [Revised: 07/20/2022] [Accepted: 08/10/2022] [Indexed: 12/02/2022]
Abstract
The capacity to carry out one's regular tasks is affected to varying degrees by hearing difficulties. Poorer understanding, slower learning, and an overall reduction in efficiency in academic endeavours are just a few of the negative impacts of hearing impairments on children's performance, which may range from mild to severe. A significant factor in determining whether or not there will be a decrease in performance is the kind and source of impairment. Research has shown that the Artificial Neural Network technique is capable of modelling both linear and nonlinear solution surfaces in a trustworthy way, as demonstrated in previous studies. To improve the precision with which hearing impairment challenges are diagnosed, a neural network backpropagation approach has been developed with the purpose of fine-tuning the diagnostic process. In particular, it highlights the vital role performed by medical informatics in supporting doctors in the identification of diseases as well as the formulation of suitable choices via the use of data management and knowledge discovery. As part of the intelligent control method, it is proposed in this research to construct a Histogram Equalization (HE)-based Adaptive Center-Weighted Median (ACWM) filter, which is then used to segment/detect the OM in tympanic membrane images using different segmentation methods in order to minimise noise and improve the image quality. A tympanic membrane dataset, which is freely accessible, was used in all experiments.
Collapse
|
11
|
Habib AR, Crossland G, Patel H, Wong E, Kong K, Gunasekera H, Richards B, Caffery L, Perry C, Sacks R, Kumar A, Singh N. An Artificial Intelligence Computer-vision Algorithm to Triage Otoscopic Images From Australian Aboriginal and Torres Strait Islander Children. Otol Neurotol 2022; 43:481-488. [PMID: 35239622 DOI: 10.1097/mao.0000000000003484] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
OBJECTIVE To develop an artificial intelligence image classification algorithm to triage otoscopic images from rural and remote Australian Aboriginal and Torres Strait Islander children. STUDY DESIGN Retrospective observational study. SETTING Tertiary referral center. PATIENTS Rural and remote Aboriginal and Torres Strait Islander children who underwent tele-otology ear health screening in the Northern Territory, Australia between 2010 and 2018. INTERVENTIONS Otoscopic images were labeled by otolaryngologists to classify the ground truth. Deep and transfer learning methods were used to develop an image classification algorithm. MAIN OUTCOME MEASURES Accuracy, sensitivity, specificity, positive predictive value, negative predictive value, area under the curve (AUC) of the resultant algorithm compared with the ground truth. RESULTS Six thousand five hundred twenty seven images were used (5927 images for training and 600 for testing). The algorithm achieved an accuracy of 99.3% for acute otitis media, 96.3% for chronic otitis media, 77.8% for otitis media with effusion (OME), and 98.2% to classify wax/obstructed canal. To differentiate between multiple diagnoses, the algorithm achieved 74.4 to 92.8% accuracy and an AUC of 0.963 to 0.997. The most common incorrect classification pattern was OME misclassified as normal tympanic membranes. CONCLUSIONS The paucity of access to tertiary otolaryngology care for rural and remote Aboriginal and Torres Strait Islander communities may contribute to an under-identification of ear disease. Computer vision image classification algorithms can accurately classify ear disease from otoscopic images of Indigenous Australian children. In the future, a validated algorithm may integrate with existing telemedicine initiatives to support effective triage and facilitate early treatment and referral.
Collapse
Affiliation(s)
- Al-Rahim Habib
- Sydney Medical School, Faculty of Medicine and Health, University of Sydney, Camperdown, New South Wales, Australia
- Department of Otolaryngology-Head and Neck Surgery, Princess Alexandra Hospital, Brisbane, Queensland, Australia
- Department of Otolaryngology - Head and Neck Surgery, Westmead Hospital, Sydney, New South Wales, Australia
| | - Graeme Crossland
- Department of Otolaryngology - Head and Neck Surgery, Royal Darwin Hospital, Darwin, Northern Territory, Australia
| | - Hemi Patel
- Department of Otolaryngology - Head and Neck Surgery, Royal Darwin Hospital, Darwin, Northern Territory, Australia
| | - Eugene Wong
- Sydney Medical School, Faculty of Medicine and Health, University of Sydney, Camperdown, New South Wales, Australia
- Department of Otolaryngology - Head and Neck Surgery, Westmead Hospital, Sydney, New South Wales, Australia
| | - Kelvin Kong
- School of Medicine and Public Health, University of Newcastle, Newcastle, New South Wales, Australia
- Department of Linguistics, Faculty of Medicine, Macquarie University, Sydney, New South Wales, Australia
- School of Population Health, Faculty of Medicine, University of New South Wales, Sydney, Australia
| | - Hasantha Gunasekera
- Sydney Medical School, Faculty of Medicine and Health, University of Sydney, Camperdown, New South Wales, Australia
- The Children's Hospital at Westmead, Sydney, New South Wales, Australia
| | - Brent Richards
- Division of Medical Services, Gold Coast University Hospital, Gold Coast, Queensland, Australia
- Griffith Health, Griffith University Queensland, Australia
| | - Liam Caffery
- Centre for Online Health, University of Queensland, Australia
| | - Chris Perry
- Centre for Online Health, University of Queensland, Australia
| | - Raymond Sacks
- Sydney Medical School, Faculty of Medicine and Health, University of Sydney, Camperdown, New South Wales, Australia
| | - Ashnil Kumar
- School of Biomedical Engineering, Faculty of Engineering, University of Sydney, Camperdown, New South Wales, Australia
| | - Narinder Singh
- Sydney Medical School, Faculty of Medicine and Health, University of Sydney, Camperdown, New South Wales, Australia
- Department of Otolaryngology - Head and Neck Surgery, Westmead Hospital, Sydney, New South Wales, Australia
| |
Collapse
|