1
|
Kim J, Wang SG, Lee JC, Cheon YI, Shin SC, Lim DW, Jang DI, Bhattacharjee S, Hwang YB, Choi HK, Kwon I, Kim SJ, Kwon SB. Evaluation of Vertical Level Differences Between Left and Right Vocal Folds Using Artificial Intelligence System in Excised Canine Larynx. J Voice 2024:S0892-1997(23)00385-5. [PMID: 38216386 DOI: 10.1016/j.jvoice.2023.11.025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2023] [Revised: 11/28/2023] [Accepted: 11/28/2023] [Indexed: 01/14/2024]
Abstract
OBJECTIVES This study aimed to establish an artificial intelligence (AI) system to classify vertical level differences between vocal folds during vocalization and to evaluate the accuracy of the classification. METHODS We designed models with different depths between the right and left vocal folds using an excised canine larynx. Video files for the data set were obtained using a high-speed camera system and a color complementary metal oxide semiconductor camera with global shutter. The data sets were divided into training, validation, and testing. We used 20,000 images for building the model and 8000 images for testing. To perform deep learning multiclass classification and to estimate the vertical level difference, we introduced DenseNet121-ConvLSTM. RESULTS The model was trained several times using different numbers of epochs. We achieved the most optimal results at 100 epochs, and the batch size used during training was 16. The proposed DenseNet121-ConvLSTM model achieved classification accuracies of 99.5% and 88.0% for training and testing, respectively. After verification using an external data set, the overall accuracy, precision, recall, and f1-score were 90.8%, 91.6%, 90.9%, and 91.2%, respectively. CONCLUSIONS The newly developed AI system may be an easy and accurate method for classifying superior and inferior vertical level differences between vocal folds. Thus, this AI system can be applied and may help in the assessment of vertical level differences in patients with unilateral vocal fold paralysis.
Collapse
Affiliation(s)
- Jaewon Kim
- Department of Cognitive Science, Pusan National University, Doctor's Course, Busan, South Korea; Department of Otorhinolaryngology, Head and Neck Surgery, Pusan National University Yangsan Hospital, Yangsan, Gyeongsangnam-do, South Korea
| | - Soo-Geun Wang
- Department of Otorhinolaryngology, Head and Neck Surgery, College of Medicine, Pusan National University and Medical Research Institute, Pusan National University Hospital, Busan, South Korea
| | - Jin-Choon Lee
- Department of Otorhinolaryngology, Head and Neck Surgery, Pusan National University School of Medicine, Pusan National University Yangsan Hospital, Yangsan, Gyeongsangnam-do, South Korea
| | - Yong-Il Cheon
- Department of Otorhinolaryngology, Head and Neck Surgery, Biomedical Research Institute, Pusan National University School of Medicine, Pusan National University Hospital, Busan, South Korea
| | - Sung-Chan Shin
- Department of Otorhinolaryngology, Head and Neck Surgery, Biomedical Research Institute, Pusan National University School of Medicine, Pusan National University Hospital, Busan, South Korea
| | - Dong-Won Lim
- Department of Otorhinolaryngology, Head and Neck Surgery, Pusan National University Hospital, Busan, South Korea
| | - Dae-Ik Jang
- Department of Otorhinolaryngology, Head and Neck Surgery, Kosin University Gospel Hospital, Kosin University College of Medicine, Busan, South Korea
| | | | - Yeong-Byn Hwang
- Department of Digital Anti-Aging Healthcare, u-AHRC, Inje University, Gimhae, South Korea
| | - Heung-Kook Choi
- Department of Digital Anti-Aging Healthcare, u-AHRC, Inje University, Gimhae, South Korea; Artificial Intelligence Research Center, JLK Inc., Seoul, South Korea
| | - Ickhwan Kwon
- Platform Development Headquarters, Autonomous A2Z, Daegu, South Korea
| | - Seon-Jong Kim
- Department of Applied IT and Engineering, Pusan National University, Miryang, Gyeongsangnam-do, South Korea
| | - Soon-Bok Kwon
- Department of Humanities, Language and Information, Pusan National University, Busan, South Korea.
| |
Collapse
|
2
|
Wang Z, Müller M, Caffier F, Caffier PP. Harnessing Machine Learning in Vocal Arts Medicine: A Random Forest Application for "Fach" Classification in Opera. Diagnostics (Basel) 2023; 13:2870. [PMID: 37761237 PMCID: PMC10528521 DOI: 10.3390/diagnostics13182870] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2023] [Revised: 08/28/2023] [Accepted: 09/04/2023] [Indexed: 09/29/2023] Open
Abstract
Vocal arts medicine provides care and prevention strategies for professional voice disorders in performing artists. The issue of correct "Fach" determination depending on the presence of a lyric or dramatic voice structure is of crucial importance for opera singers, as chronic overuse often leads to vocal fold damage. To avoid phonomicrosurgery or prevent a premature career end, our aim is to offer singers an improved, objective fach counseling using digital sound analyses and machine learning procedures. For this purpose, a large database of 2004 sound samples from professional opera singers was compiled. Building on this dataset, we employed a classic ensemble learning method, namely the Random Forest algorithm, to construct an efficient fach classifier. This model was trained to learn from features embedded within the sound samples, subsequently enabling voice classification as either lyric or dramatic. As a result, the developed system can decide with an accuracy of about 80% in most examined voice types whether a sound sample has a lyric or dramatic character. To advance diagnostic tools and health in vocal arts medicine and singing voice pedagogy, further machine learning methods will be applied to find the best and most efficient classification method based on artificial intelligence approaches.
Collapse
Affiliation(s)
- Zehui Wang
- Institute for Digital Transformation, University of Applied Sciences Ravensburg-Weingarten, Doggenriedstraße, 88250 Weingarten, Germany;
| | - Matthias Müller
- Occupational College of Music BFSM Krumbach, Mindelheimer Str. 47, 86381 Krumbach, Germany;
| | - Felix Caffier
- School of Computing, Communication and Business, HTW Berlin University of Applied Sciences, Treskowallee 8, 10318 Berlin, Germany;
| | - Philipp P. Caffier
- Department of Audiology and Phoniatrics, Charité—Universitätsmedizin Berlin, Campus Charité Mitte, Charitéplatz 1, 10117 Berlin, Germany
| |
Collapse
|
3
|
Swaminathan R, Rosa Opatha S, Mughal Z, Prasad S. A systematic review on efficacy, safety and cost-effectiveness of office-based laryngeal biopsy. J Laryngol Otol 2023; 137:946-952. [PMID: 35766586 DOI: 10.1017/s0022215122001566] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
OBJECTIVE Laryngeal cancer is the second most prevalent head and neck malignancy in the USA. With recent advances in technology, this procedure is increasingly performed under local anaesthesia. This study aimed to identify the efficacy, safety and cost-effectiveness of laryngeal biopsy in out-patients by conducting a systematic review. METHOD A literature search was conducted using PubMed, Medline, Google Scholar and Embase over a 20-year period. Inclusion criteria were: studies performed on out-patient diagnostic biopsy procedures of the larynx. Exclusion criteria included all therapeutic procedures. The outcome measures were sensitivity and specificity, complication rate and cost-savings. RESULTS Thirty-five studies were included in the analysis. The sensitivity and specificity varied from 60 to 100 per cent with a low complication rate and cost savings. CONCLUSION Office-based laryngeal biopsies are increasingly used in the diagnosis of laryngeal cancers, resulting in earlier diagnosis and commencement of treatment. The barrier to undertaking this procedure is low sensitivity.
Collapse
Affiliation(s)
- R Swaminathan
- Department of Otorhinolaryngology and Head and Neck Surgery, Warwick Hospital, UK
| | - S Rosa Opatha
- Department of Otorhinolaryngology and Head and Neck Surgery, Warwick Hospital, UK
| | - Z Mughal
- Department of Otorhinolaryngology and Head and Neck Surgery, Queen Elizabeth Hospital, Birmingham, UK
| | - S Prasad
- Department of Otorhinolaryngology and Head and Neck Surgery, University Hospitals Coventry & Warwickshire NHS Trust, UK
| |
Collapse
|
4
|
Wasserzug O, Fishman G, Carmel-Neiderman N, Oestreicher-Kedem Y, Saada M, Dadia S, Golden E, Berman P, Handzel O, DeRowe A. Three dimensional printed models of the airway for preoperative planning of open Laryngotracheal surgery in children: Surgeon's perception of utility. J Otolaryngol Head Neck Surg 2021; 50:47. [PMID: 34256870 PMCID: PMC8278656 DOI: 10.1186/s40463-021-00524-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2021] [Accepted: 06/13/2021] [Indexed: 11/20/2022] Open
Abstract
BACKGROUND Preoperative planning of open laryngotracheal surgery is important for achieving good results. This study examines the surgeon's perception of the importance of using life size 3D printed models of the pediatric airway on surgical decision making. METHODS Life-size three-dimensional models of the upper airway were created based on CT images of children scheduled for laryngotracheal-reconstruction and cricotracheal resection with anastomosis. Five pediatric airway surgeons evaluated the three-dimensional models for determining the surgical approach, incision location and length, graft length, and need for single or double-stage surgery of seven children (median age 4.4 years, M:F ratio 4:3). They rated the importance of the three-dimensional model findings compared to the direct laryngoscopy videos and CT findings for each domain on a validated Likert scale of 1-5. RESULTS The mean rating for all domains was 3.6 ± 0.63 ("moderately important" to "very important"), and the median rating was 4 ("very important"). There was full agreement between raters for length of incision and length of graft. The between-rater agreement was 0.608 ("good") for surgical approach, 0.585 ("moderate") for incision location, and 0.429 ("moderate") for need for single- or two-stage surgery. CONCLUSION Patient-specific three-dimensional printed models of children's upper airways were scored by pediatric airway surgeons as being moderately to very important for preoperative planning of open laryngotracheal surgery. Large-scale, objective outcome studies are warranted to establish the reliability and efficiency of these models.
Collapse
Affiliation(s)
- Oshri Wasserzug
- Pediatric Otolaryngology Unit, Dana-Dwek Children's Hospital, Tel Aviv Sourasky Medical Center, 6 Weizman Street, 6423906, Tel Aviv, Israel
- Department of Otolaryngology, Head & Neck and Maxillofacial Surgery, Tel Aviv Sourasky Medical Center, Tel Aviv, Israel
- Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
| | - Gadi Fishman
- Pediatric Otolaryngology Unit, Dana-Dwek Children's Hospital, Tel Aviv Sourasky Medical Center, 6 Weizman Street, 6423906, Tel Aviv, Israel
- Department of Otolaryngology, Head & Neck and Maxillofacial Surgery, Tel Aviv Sourasky Medical Center, Tel Aviv, Israel
- Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
| | - Narin Carmel-Neiderman
- Department of Otolaryngology, Head & Neck and Maxillofacial Surgery, Tel Aviv Sourasky Medical Center, Tel Aviv, Israel
| | - Yael Oestreicher-Kedem
- Department of Otolaryngology, Head & Neck and Maxillofacial Surgery, Tel Aviv Sourasky Medical Center, Tel Aviv, Israel
- Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
| | - Maher Saada
- Pediatric Otolaryngology Unit, Dana-Dwek Children's Hospital, Tel Aviv Sourasky Medical Center, 6 Weizman Street, 6423906, Tel Aviv, Israel
| | - Solomon Dadia
- The Surgical 3D Printing Lab, Tel Aviv Sourasky Medical Center, Tel Aviv, Israel
- Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
| | - Eran Golden
- The Surgical 3D Printing Lab, Tel Aviv Sourasky Medical Center, Tel Aviv, Israel
| | - Philip Berman
- The Surgical 3D Printing Lab, Tel Aviv Sourasky Medical Center, Tel Aviv, Israel
| | - Ophir Handzel
- Department of Otolaryngology, Head & Neck and Maxillofacial Surgery, Tel Aviv Sourasky Medical Center, Tel Aviv, Israel
- Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
| | - Ari DeRowe
- Pediatric Otolaryngology Unit, Dana-Dwek Children's Hospital, Tel Aviv Sourasky Medical Center, 6 Weizman Street, 6423906, Tel Aviv, Israel.
- Department of Otolaryngology, Head & Neck and Maxillofacial Surgery, Tel Aviv Sourasky Medical Center, Tel Aviv, Israel.
- Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel.
| |
Collapse
|
5
|
T1a Glottic Cancer: Advances in Vocal Outcome Assessment after Transoral CO 2-Laser Microsurgery Using the VEM. J Clin Med 2021; 10:jcm10061250. [PMID: 33802971 PMCID: PMC8002749 DOI: 10.3390/jcm10061250] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2021] [Revised: 03/10/2021] [Accepted: 03/15/2021] [Indexed: 12/18/2022] Open
Abstract
Patients with unilateral vocal fold cancer (T1a) have a favorable prognosis. In addition to the oncological results of CO2 transoral laser microsurgery (TOLMS), voice function is among the outcome measures. Previous early glottic cancer studies have reported voice function in patients grouped into combined T stages (Tis, T1, T2) and merged cordectomy types (lesser- vs. larger-extent cordectomies). Some authors have questioned the value of objective vocal parameters. Therefore, the purpose of this exploratory prospective study was to investigate TOLMS-associated oncological and vocal outcomes in 60 T1a patients, applying the ELS protocols for cordectomy classification and voice assessment. Pre- and postoperative voice function analysis included: Vocal Extent Measure (VEM), Dysphonia Severity Index (DSI), auditory-perceptual assessment (GRB), and 9-item Voice Handicap Index (VHI-9i). Altogether, 51 subjects (43 male, eight female, mean age 65 years) completed the study. The 5-year recurrence-free, overall, and disease-specific survival rates (Kaplan–Meier method) were 71.4%, 94.4%, and 100.0%. Voice function was preserved; the objective parameter VEM (64 ± 33 vs. 83 ± 31; mean ± SD) and subjective vocal measures (G: 1.9 ± 0.7 vs. 1.3 ± 0.7; VHI-9i: 18 ± 8 vs. 9 ± 9) even improved significantly (p < 0.001). The VEM best reflected self-perceived voice impairment. It represents a sensitive measure of voice function for quantification of vocal performance.
Collapse
|