Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Lohscheller J, Svec JG, Döllinger M. Vocal fold vibration amplitude, open quotient, speed quotient and their variability along glottal length: kymographic data from normal subjects. LOGOP PHONIATR VOCO 2012;38:182-92. [PMID: 23173880 DOI: 10.3109/14015439.2012.731083] [Citation(s) in RCA: 62] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]

For:	Lohscheller J, Svec JG, Döllinger M. Vocal fold vibration amplitude, open quotient, speed quotient and their variability along glottal length: kymographic data from normal subjects. LOGOP PHONIATR VOCO 2012;38:182-92. [PMID: 23173880 DOI: 10.3109/14015439.2012.731083] [Citation(s) in RCA: 62] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]

Number

Cited by Other Article(s)

Echternach M, Burk F, Köberlein M, Döllinger M, Burdumy M, Richter B, Titze IR, Elemans CPH, Herbst CT. Biomechanics of sound production in high-pitched classical singing. Sci Rep 2024;14:13132. [PMID: 38849382 PMCID: PMC11161605 DOI: 10.1038/s41598-024-62598-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2023] [Accepted: 05/20/2024] [Indexed: 06/09/2024] Open

Donhauser J, Tur B, Döllinger M. Neural network-based estimation of biomechanical vocal fold parameters. Front Physiol 2024;15:1282574. [PMID: 38449783 PMCID: PMC10916882 DOI: 10.3389/fphys.2024.1282574] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2023] [Accepted: 01/09/2024] [Indexed: 03/08/2024] Open

Abstract

Vocal fold (VF) vibrations are the primary source of human phonation. High-speed video (HSV) endoscopy enables the computation of descriptive VF parameters for assessment of physiological properties of laryngeal dynamics, i.e., the vibration of the VFs. However, underlying biomechanical factors responsible for physiological and disordered VF vibrations cannot be accessed. In contrast, physically based numerical VF models reveal insights into the organ's oscillations, which remain inaccessible through endoscopy. To estimate biomechanical properties, previous research has fitted subglottal pressure-driven mass-spring-damper systems, as inverse problem to the HSV-recorded VF trajectories, by global optimization of the numerical model. A neural network trained on the numerical model may be used as a substitute for computationally expensive optimization, yielding a fast evaluating surrogate of the biomechanical inverse problem. This paper proposes a convolutional recurrent neural network (CRNN)-based architecture trained on regression of a physiological-based biomechanical six-mass model (6 MM). To compare with previous research, the underlying biomechanical factor "subglottal pressure" prediction was tested against 288 HSV ex vivo porcine recordings. The contributions of this work are two-fold: first, the presented CRNN with the 6 MM handles multiple trajectories along the VFs, which allows for investigations on local changes in VF characteristics. Second, the network was trained to reproduce further important biomechanical model parameters like VF mass and stiffness on synthetic data. Unlike in a previous work, the network in this study is therefore an entire surrogate of the inverse problem, which allowed for explicit computation of the fitted model using our approach. The presented approach achieves a best-case mean absolute error (MAE) of 133 Pa (13.9%) in subglottal pressure prediction with 76.6% correlation on experimental data and a re-estimated fundamental frequency MAE of 15.9 Hz (9.9%). In-detail training analysis revealed subglottal pressure as the most learnable parameter. With the physiological-based model design and advances in fast parameter prediction, this work is a next step in biomechanical VF model fitting and the estimation of laryngeal kinematics.

Collapse

Nogueira do Nascimento U, Santos MAR, Gama ACC. Digital Videokymography: Analysis of Glottal Closure in Adults. J Voice 2024;38:18-24. [PMID: 34417083 DOI: 10.1016/j.jvoice.2021.07.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2021] [Revised: 07/01/2021] [Accepted: 07/06/2021] [Indexed: 11/25/2022]

Abstract

INTRODUCTION

High-speed videolaryngoscopy and quantitative analysis of laryngeal images are relevant in accurately diagnosing vocal fold closure patterns.

OBJECTIVE

To analyze the parameters of digital videokymography obtained through high-speed videolaryngoscopy in women and men with complete and incomplete glottal closure, and posterior glottal chink.

METHODS

We conducted an observational, analytical, cross-sectional study with data from 65 adults, which we divided into groups according to sex and glottal closure. Digital videokymography parameters were analyzed using an image-processing program. The Anderson-Darling and Mann-Whitney U tests were used to verify sample normality and compare videokymography parameters between groups, respectively. The significance level was set at 5%.

RESULTS

Among 65 laryngeal images, 20 each were from women with complete and incomplete glottal closure, and 20 and 5 were from men with complete and incomplete glottal closure, respectively. Considering the clinical relevance of the evaluated data, groups of 11 women and 4 men with posterior glottal chink were compared with sex-similar groups with complete glottal closure. Digital videokymography showed a lower maximum and mean vocal fold opening in women with incomplete glottal closure, and a lower dominant left vocal fold-opening amplitude and higher dominant frequency of bilateral vocal fold opening in men with incomplete glottal closure. It also showed a lower closed phase percentage in the posterior region for women and men, with higher closed phase percentage in the anterior and middle regions in women. Both groups with posterior glottal chink showed similar results.

CONCLUSION

Incomplete glottal closure may interfere with the results of the digital videokymography parameters, with higher impact on the posterior vocal fold region in males and the middle and anterior vocal fold regions in females.

Collapse

Yamauchi A, Imagawa H, Yokonishi H, Sakakibara KI, Tayama N. Multivariate Analysis of Vocal Fold Vibrations in Normal Speakers Using High-Speed Digital Imaging. J Voice 2024;38:10-17. [PMID: 34470706 DOI: 10.1016/j.jvoice.2021.08.002] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2021] [Revised: 07/30/2021] [Accepted: 08/02/2021] [Indexed: 11/18/2022]

Malinowski J, Pietruszewska W, Stawiski K, Kowalczyk M, Barańska M, Rycerz A, Niebudek-Bogusz E. High-Speed Videoendoscopy Enhances the Objective Assessment of Glottic Organic Lesions: A Case-Control Study with Multivariable Data-Mining Model Development. Cancers (Basel) 2023;15:3716. [PMID: 37509377 PMCID: PMC10378075 DOI: 10.3390/cancers15143716] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2023] [Revised: 07/13/2023] [Accepted: 07/19/2023] [Indexed: 07/30/2023] Open

Fujiki RB, Croegaert-Koch CK, Thibeault SL. Videostroboscopy Versus High-Speed Videoendoscopy: Factors Influencing Ratings of Laryngeal Oscillation. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2023;66:1496-1510. [PMID: 37040690 PMCID: PMC10457078 DOI: 10.1044/2023_jslhr-22-00649] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/14/2022] [Revised: 01/16/2023] [Accepted: 01/23/2023] [Indexed: 05/11/2023]

Abstract

PURPOSE

The purpose of this study was to determine whether patient voice-related diagnosis, severity of dysphonia, and rater's experience influence the relationship between laryngeal oscillation ratings made from videostroboscopic and high-speed videoendoscopic (HSV) exams.

METHOD

Stroboscopy and HSV exams from 15 patients with adductor spasmodic dysphonia (ADSD) and 15 with benign vocal fold lesions were rated for laryngeal oscillation and closure by 10 licensed speech-language pathologists (SLPs). Raters were divided into low- (< 5 years) and high-experience (> 5 years) groups. Ratings of vocal fold amplitude, mucosal wave, periodicity, phase symmetry, nonvibrating portion of the vocal fold, and glottal closure were examined using an online form adapted from the Voice Vibratory Assessment of Laryngeal Imaging (VALI).

RESULTS

Stroboscopy and HSV ratings were more strongly positively correlated for patients with benign vocal fold lesions (r between .43 and .75) than for those with ADSD (r between .40 and .68). Differences between stroboscopy and HSV exams were significantly greater for ratings of amplitude, mucosal wave, and periodicity in patients with ADSD than for patients with benign vocal fold lesions. Raters with < 5 years of experience showed significantly greater differences between stroboscopy and HSV ratings of amplitude and nonvibrating portion of the vocal fold for patients with ADSD only. Significantly greater differences between ratings of periodicity and phase symmetry were observed in patients with more severe dysphonia.

CONCLUSIONS

Differences in laryngeal ratings made between HSV and stroboscopy exams may be influenced by patient diagnosis, severity of dysphonia, and rater experience. Future study is warranted to determine how the differences observed influence clinical diagnosis and outcomes.

Collapse

Motie-Shirazi M, Zañartu M, Peterson SD, Mehta DD, Hillman RE, Erath BD. Effect of nodule size and stiffness on phonation threshold and collision pressures in a synthetic hemilaryngeal vocal fold model. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2023;153:654. [PMID: 36732229 PMCID: PMC9884154 DOI: 10.1121/10.0016997] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/27/2022] [Revised: 12/19/2022] [Accepted: 01/06/2023] [Indexed: 06/18/2023]

Hao Z, Peng J, Dang X, Yan H, Wang R. mmSafe: A Voice Security Verification System Based on Millimeter-Wave Radar. SENSORS (BASEL, SWITZERLAND) 2022;22:9309. [PMID: 36502011 PMCID: PMC9739021 DOI: 10.3390/s22239309] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/12/2022] [Revised: 11/15/2022] [Accepted: 11/25/2022] [Indexed: 06/17/2023]

Motie-Shirazi M, Zañartu M, Peterson SD, Mehta DD, Hillman RE, Erath BD. Collision Pressure and Dissipated Power Dose in a Self-Oscillating Silicone Vocal Fold Model With a Posterior Glottal Opening. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2022;65:2829-2845. [PMID: 35914018 PMCID: PMC9911124 DOI: 10.1044/2022_jslhr-21-00471] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/02/2021] [Revised: 01/24/2022] [Accepted: 05/04/2022] [Indexed: 06/15/2023]

Taylor CJ, Thomson SL. Optimization of Synthetic Vocal Fold Models for Glottal Closure. JOURNAL OF ENGINEERING AND SCIENCE IN MEDICAL DIAGNOSTICS AND THERAPY 2022;5:031106. [PMID: 35832120 PMCID: PMC9132011 DOI: 10.1115/1.4054194] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/15/2021] [Revised: 03/23/2022] [Indexed: 06/15/2023]

Quantitative Analysis of Vocal Fold Vibration using High-Speed Videoendoscopy in Children with and without Bilateral Lesions. J Voice 2022;36:176-182. [PMID: 32712076 PMCID: PMC7854946 DOI: 10.1016/j.jvoice.2020.05.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2020] [Revised: 05/04/2020] [Accepted: 05/07/2020] [Indexed: 11/22/2022]

Kopczynski B, Niebudek-Bogusz E, Pietruszewska W, Strumillo P. Segmentation of Glottal Images from High-Speed Videoendoscopy Optimized by Synchronous Acoustic Recordings. SENSORS (BASEL, SWITZERLAND) 2022;22:s22051751. [PMID: 35270897 PMCID: PMC8915112 DOI: 10.3390/s22051751] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/30/2021] [Revised: 02/12/2022] [Accepted: 02/15/2022] [Indexed: 05/17/2023]

Movahhedi M, Geng B, Xue Q, Zheng X. A computational framework for patient-specific surgical planning of type 1 thyroplasty. JASA EXPRESS LETTERS 2021;1:125203. [PMID: 36154377 DOI: 10.1121/10.0009084] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]

Comparative analysis of high-speed videolaryngoscopy images and sound data simultaneously acquired from rigid and flexible laryngoscope: a pilot study. Sci Rep 2021;11:20480. [PMID: 34650174 PMCID: PMC8516923 DOI: 10.1038/s41598-021-99948-9] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2020] [Accepted: 10/04/2021] [Indexed: 12/03/2022] Open

Malinowski J, Niebudek-Bogusz E, Just M, Morawska J, Racino A, Hoffman J, Barańska M, Kowalczyk MM, Pietruszewska W. Laryngeal High-Speed Videoendoscopy with Laser Illumination: A Preliminary Report. Otolaryngol Pol 2021;75:1-10. [PMID: 35175220 DOI: 10.5604/01.3001.0015.2575] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]

Abstract

Introduction: Advances in computer image analysis have enabled the use of new functional imaging methods in the diagnosis of laryngeal diseases. Particularly interesting techniques of dynamic laryngeal imaging involve High Speed Videoendoscopy (HSV). This still-developed technique allows to overcome the limitations of laryngovideostroboscopy (LVS) and a more detailed analysis of the glottal function based on the image of the actual vibrations of the vocal folds. It also enables the determination of objective coefficients parameterizing phonatory vibrations of the vocal folds. Aim: The aim of this pilot study was to evaluate the use of a high-speed videoendoscopy set with laser illumination for the diagnosis of glottic pathology in ENT practice. Material and methods: The study included 40 patients who underwent LVS followed by HSV. The modern HSV examination kit - Advanced Larynx Imager System (ALIS), used for the first time in a clinical setting in Poland, is characterized by significantly improved, compared to the previously used high-speed cameras, operational parameters - a light head, the possibility of continuous lighting operation without excessive heating of the head tip, registration of the image in full color scale. Thanks to such modernization, the safety and course of the examination do not differ from laryngoscopy conducted with commonly used recorders. The device owes some of these improvements to a laser illuminator which was used for the first time as the main light source in a high-speed camera. In the study, two cases were selected to present the results of HSV and the analysis of the generated kymograms - a woman with no glottic pathology and a man with a polyp of the right vocal fold. In the first case, the HSV examination compared with the LVS revealed a discrete glottis functional disorder in the form of a tendency to hyperphonation. The patient with an organic lesion had a clearly visible irregularity of vocal fold vibrations, which also allowed to trace mucosal wave disturbances related to its reflection from the pathological structure of the glottis and the formation of a return wave, both on the fold affected by the lesion and, to a lesser extent, contralaterally. The glottic dysfunctions observed in the studied patients were confirmed in the generated kymograms and the graphs of the glottal width waveform (GWW), as well as in the parameters calculated on their basis, assessing the frequency and amplitude of phonatory vibrations. Conclusions: The use of high-speed videoendoscopy allows for a much more accurate assessment of the phonatory function of the glottis than in laryngovideostroboscopy. The presented HSV system allows for obtaining high quality kinematic images of the larynx, color fidelity, and contrast. The use of this technology in laryngological practice enables precise structural and functional assessment of the glottis and detection of discrete phonation disorders that elude the techniques used so far.</br&gt.

Collapse

Motie-Shirazi M, Zañartu M, Peterson SD, Erath BD. Vocal fold dynamics in a synthetic self-oscillating model: Intraglottal aerodynamic pressure and energy. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2021;150:1332. [PMID: 34470335 PMCID: PMC8387087 DOI: 10.1121/10.0005882] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/22/2020] [Revised: 07/21/2021] [Accepted: 07/26/2021] [Indexed: 06/13/2023]

Motie-Shirazi M, Zañartu M, Peterson SD, Erath BD. Vocal fold dynamics in a synthetic self-oscillating model: Contact pressure and dissipated-energy dose. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2021;150:478. [PMID: 34340498 PMCID: PMC8298101 DOI: 10.1121/10.0005596] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/02/2020] [Revised: 06/18/2021] [Accepted: 06/21/2021] [Indexed: 06/13/2023]

Stewart ME, Erath BD. Investigating blunt force trauma to the larynx: The role of inferior-superior vocal fold displacement on phonation. J Biomech 2021;121:110377. [PMID: 33819698 DOI: 10.1016/j.jbiomech.2021.110377] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2020] [Revised: 02/24/2021] [Accepted: 03/01/2021] [Indexed: 11/26/2022]

Fitting synthetic to clinical kymographic images for deriving kinematic vocal fold parameters: Application to left-right vibratory phase differences. Biomed Signal Process Control 2021. [DOI: 10.1016/j.bspc.2020.102253] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]

Gómez P, Kist AM, Schlegel P, Berry DA, Chhetri DK, Dürr S, Echternach M, Johnson AM, Kniesburges S, Kunduk M, Maryn Y, Schützenberger A, Verguts M, Döllinger M. BAGLS, a multihospital Benchmark for Automatic Glottis Segmentation. Sci Data 2020;7:186. [PMID: 32561845 PMCID: PMC7305104 DOI: 10.1038/s41597-020-0526-3] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2019] [Accepted: 05/15/2020] [Indexed: 02/06/2023] Open

Affiliation(s)

Pablo Gómez Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Erlangen, Friedrich-Alexander University Erlangen-Nürnberg, Waldstraße 1, 91054, Erlangen, Germany.
Andreas M Kist Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Erlangen, Friedrich-Alexander University Erlangen-Nürnberg, Waldstraße 1, 91054, Erlangen, Germany.
Patrick Schlegel Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Erlangen, Friedrich-Alexander University Erlangen-Nürnberg, Waldstraße 1, 91054, Erlangen, Germany
David A Berry Department of Head and Neck Surgery, David Geffen School of Medicine at the University of California, Los Angeles, Los Angeles, California, USA
Dinesh K Chhetri Department of Head and Neck Surgery, David Geffen School of Medicine at the University of California, Los Angeles, Los Angeles, California, USA
Stephan Dürr Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Erlangen, Friedrich-Alexander University Erlangen-Nürnberg, Waldstraße 1, 91054, Erlangen, Germany
Matthias Echternach Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Munich University Hospital (LMU), Munich, Germany
Aaron M Johnson NYU Voice Center, Department of Otolaryngology - Head and Neck Surgery, New York University School of Medicine, New York, New York, USA
Stefan Kniesburges Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Erlangen, Friedrich-Alexander University Erlangen-Nürnberg, Waldstraße 1, 91054, Erlangen, Germany
Melda Kunduk Department of Communication Sciences and Disorders, Louisiana State University, Baton Rouge, Louisiana, USA
Youri Maryn European Institute for ORL-HNS, Department of Otorhinolaryngology and Head & Neck Surgery, Sint-Augustinus GZA, Wilrijk, Belgium Department of Speech, Language and Hearing sciences, University of Ghent, Ghent, Belgium Faculty of Education, Health and Social Work, University College Ghent, Ghent, Belgium Faculty of Psychology and Educational Sciences, School of Logopedics, Université Catholique de Louvain, Louvain-la-Neuve, Belgium Faculty of Medicine and Health Sciences, University of Antwerp, Antwerp, Belgium
Anne Schützenberger Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Erlangen, Friedrich-Alexander University Erlangen-Nürnberg, Waldstraße 1, 91054, Erlangen, Germany
Monique Verguts European Institute for ORL-HNS, Department of Otorhinolaryngology and Head & Neck Surgery, Sint-Augustinus GZA, Wilrijk, Belgium Department of Otorhinolaryngology and Voice Disorders, Diest General Hospital, Diest, Belgium
Michael Döllinger Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Erlangen, Friedrich-Alexander University Erlangen-Nürnberg, Waldstraße 1, 91054, Erlangen, Germany

Collapse

Mohd Khairuddin KA, Ahmad K, Ibrahim HM, Yan Y. Effects of Using Laryngeal High-Speed Videoendoscopy Images Visualizing Partial Views of The Glottis on Measurement Outcomes. J Voice 2020;36:106-112. [PMID: 32456835 DOI: 10.1016/j.jvoice.2020.04.027] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2020] [Revised: 04/21/2020] [Accepted: 04/22/2020] [Indexed: 11/29/2022]

Abstract

Ideally, an analysis method for laryngeal high-speed videoendoscopy (LHSV) based on the glottal area waveforms (GAW) requires images of a complete view of the glottis to ensure findings that are representatives of the vibratory behaviors of the whole vocal folds. However, in practice, the preferred images may not be obtained at all times. Often, the only available images that a clinician has to work with consist of a partial view of the glottis. This study aims to examine the effects of using images of a partial view of the glottis (ie, posterior-middle, anterior-middle, or middle) on the LHSV-based measures (ie, fundamental frequency (F0_GAW), frequency perturbation (jitter_GAW), amplitude perturbation (shimmer_GAW), open quotient (OQ_GAW), and Nyquist plot). The participants consisted of 9 young normophonic females. The procedures involved LHSV recording of the vibration of the vocal folds. The images of the complete view of the glottis were analyzed to obtain the LHSV-based measures. The same images were used to simulate the images of partial views of the glottis by changing the outline of the region of interest to include only either the posterior-middle, anterior-middle, or middle parts of the glottis. The LHSV-based measures from the images of the partial views were then compared to those with the complete view . The results showed that all LHSV-based measures from the images of the posterior-middle view were similar to those of the complete view. However, only the F0_GAW, jitter_GAW, and shimmer_GAW from the images of the anterior-middle and middle views were similar to those of the complete view. Lower OQ_GAW and different Nyquist plots than those of the complete view were generated by the images of the anterior-middle and middle views. In conclusion, all LHSV-based measures from the images of the posterior-middle view of the glottis, and only the F0_GAW, jitter_GAW, and shimmer_GAW from the images of the anterior-middle and middle views of the glottis reflect the vibratory behaviors of the whole vocal folds. The same conclusion could not be applied to the OQ_GAW and Nyquist plots of the images of the anterior-middle and middle views of the glottis. A possible effect of the presence or absence of a posterior glottal gap on the findings warrants further confirmation.

Collapse

Laryngeal Image Processing of Vocal Folds Motion. APPLIED SCIENCES-BASEL 2020. [DOI: 10.3390/app10051556] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]

Fehling MK, Grosch F, Schuster ME, Schick B, Lohscheller J. Fully automatic segmentation of glottis and vocal folds in endoscopic laryngeal high-speed videos using a deep Convolutional LSTM Network. PLoS One 2020;15:e0227791. [PMID: 32040514 PMCID: PMC7010264 DOI: 10.1371/journal.pone.0227791] [Citation(s) in RCA: 38] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2019] [Accepted: 12/25/2019] [Indexed: 01/22/2023] Open

Abstract

The objective investigation of the dynamic properties of vocal fold vibrations demands the recording and further quantitative analysis of laryngeal high-speed video (HSV). Quantification of the vocal fold vibration patterns requires as a first step the segmentation of the glottal area within each video frame from which the vibrating edges of the vocal folds are usually derived. Consequently, the outcome of any further vibration analysis depends on the quality of this initial segmentation process. In this work we propose for the first time a procedure to fully automatically segment not only the time-varying glottal area but also the vocal fold tissue directly from laryngeal high-speed video (HSV) using a deep Convolutional Neural Network (CNN) approach. Eighteen different Convolutional Neural Network (CNN) network configurations were trained and evaluated on totally 13,000 high-speed video (HSV) frames obtained from 56 healthy and 74 pathologic subjects. The segmentation quality of the best performing Convolutional Neural Network (CNN) model, which uses Long Short-Term Memory (LSTM) cells to take also the temporal context into account, was intensely investigated on 15 test video sequences comprising 100 consecutive images each. As performance measures the Dice Coefficient (DC) as well as the precisions of four anatomical landmark positions were used. Over all test data a mean Dice Coefficient (DC) of 0.85 was obtained for the glottis and 0.91 and 0.90 for the right and left vocal fold (VF) respectively. The grand average precision of the identified landmarks amounts 2.2 pixels and is in the same range as comparable manual expert segmentations which can be regarded as Gold Standard. The method proposed here requires no user interaction and overcomes the limitations of current semiautomatic or computational expensive approaches. Thus, it allows also for the analysis of long high-speed video (HSV)-sequences and holds the promise to facilitate the objective analysis of vocal fold vibrations in clinical routine. The here used dataset including the ground truth will be provided freely for all scientific groups to allow a quantitative benchmarking of segmentation approaches in future.

Collapse

Kim GH, Lee YW, Bae IH, Park HJ, Wang SG, Kwon SB. Usefulness of Two-Dimensional Digital Kymography in Patients With Vocal Fold Scarring. J Voice 2019;33:906-914. [DOI: 10.1016/j.jvoice.2018.06.003] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2018] [Revised: 06/04/2018] [Accepted: 06/06/2018] [Indexed: 11/29/2022]

Motie-Shirazi M, Zañartu M, Peterson SD, Mehta DD, Kobler JB, Hillman RE, Erath BD. Toward Development of a Vocal Fold Contact Pressure Probe: Sensor Characterization and Validation Using Synthetic Vocal Fold Models. APPLIED SCIENCES-BASEL 2019;9. [PMID: 32377408 PMCID: PMC7202565 DOI: 10.3390/app9153002] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Zhang Z. Vocal fold contact pressure in a three-dimensional body-cover phonation model. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2019;146:256. [PMID: 31370600 PMCID: PMC6642050 DOI: 10.1121/1.5116138] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/07/2019] [Revised: 06/18/2019] [Accepted: 06/20/2019] [Indexed: 05/18/2023]

Lee JC, Wang SG, Sung ES, Bae IH, Kim ST, Lee YW. Clinical Practicability of a Newly Developed Real-time Digital Kymographic System. J Voice 2019;33:346-351. [DOI: 10.1016/j.jvoice.2017.10.024] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2017] [Revised: 10/28/2017] [Accepted: 10/31/2017] [Indexed: 10/18/2022]

Sadeghi H, Döllinger M, Kaltenbacher M, Kniesburges S. Aerodynamic impact of the ventricular folds in computational larynx models. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2019;145:2376. [PMID: 31046372 DOI: 10.1121/1.5098775] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/21/2018] [Accepted: 04/01/2019] [Indexed: 06/09/2023]

Sielska-Badurek EM, Jędra K, Sobol M, Niemczyk K, Osuch-Wójcikiewicz E. Laryngeal stroboscopy-Normative values for amplitude, open quotient, asymmetry and phase difference in young adults. Clin Otolaryngol 2018;44:158-165. [PMID: 30353981 DOI: 10.1111/coa.13247] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2017] [Revised: 05/10/2018] [Accepted: 10/18/2018] [Indexed: 11/26/2022]

Kumar SP, Phadke KV, Vydrová J, Novozámský A, Zita A, Zitová B, Švec JG. Visual and Automatic Evaluation of Vocal Fold Mucosal Waves Through Sharpness of Lateral Peaks in High-Speed Videokymographic Images. J Voice 2018;34:170-178. [PMID: 30314931 DOI: 10.1016/j.jvoice.2018.08.022] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2018] [Revised: 07/12/2018] [Accepted: 08/30/2018] [Indexed: 01/14/2023]

Pathological Voice Source Analysis System Using a Flow Waveform-Matched Biomechanical Model. Appl Bionics Biomech 2018;2018:3158439. [PMID: 30057647 PMCID: PMC6051280 DOI: 10.1155/2018/3158439] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2018] [Accepted: 05/24/2018] [Indexed: 11/24/2022] Open

Semmler M, Döllinger M, Patel RR, Ziethe A, Schützenberger A. Clinical relevance of endoscopic three-dimensional imaging for quantitative assessment of phonation. Laryngoscope 2018. [DOI: 10.1002/lary.27165] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Krasnodębska P, Szkiełkowska A, Miaśkiewicz B, Włodarczyk E, Domeracka-Kołodziej A, Skarżyński H. Objective measurement of mucosal wave parameters in diagnosing benign lesions of the vocal folds. LOGOP PHONIATR VOCO 2018;44:73-78. [PMID: 29318925 DOI: 10.1080/14015439.2017.1402950] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]

Döllinger M, Gómez P, Patel RR, Alexiou C, Bohr C, Schützenberger A. Biomechanical simulation of vocal fold dynamics in adults based on laryngeal high-speed videoendoscopy. PLoS One 2017;12:e0187486. [PMID: 29121085 PMCID: PMC5679561 DOI: 10.1371/journal.pone.0187486] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2016] [Accepted: 10/18/2017] [Indexed: 12/18/2022] Open

Abstract

MOTIVATION

Human voice is generated in the larynx by the two oscillating vocal folds. Owing to the limited space and accessibility of the larynx, endoscopic investigation of the actual phonatory process in detail is challenging. Hence the biomechanics of the human phonatory process are still not yet fully understood. Therefore, we adapt a mathematical model of the vocal folds towards vocal fold oscillations to quantify gender and age related differences expressed by computed biomechanical model parameters.

METHODS

The vocal fold dynamics are visualized by laryngeal high-speed videoendoscopy (4000 fps). A total of 33 healthy young subjects (16 females, 17 males) and 11 elderly subjects (5 females, 6 males) were recorded. A numerical two-mass model is adapted to the recorded vocal fold oscillations by varying model masses, stiffness and subglottal pressure. For adapting the model towards the recorded vocal fold dynamics, three different optimization algorithms (Nelder-Mead, Particle Swarm Optimization and Simulated Bee Colony) in combination with three cost functions were considered for applicability. Gender differences and age-related kinematic differences reflected by the model parameters were analyzed.

RESULTS AND CONCLUSION

The biomechanical model in combination with numerical optimization techniques allowed phonatory behavior to be simulated and laryngeal parameters involved to be quantified. All three optimization algorithms showed promising results. However, only one cost function seems to be suitable for this optimization task. The gained model parameters reflect the phonatory biomechanics for men and women well and show quantitative age- and gender-specific differences. The model parameters for younger females and males showed lower subglottal pressures, lower stiffness and higher masses than the corresponding elderly groups. Females exhibited higher subglottal pressures, smaller oscillation masses and larger stiffness than the corresponding similar aged male groups. Optimizing numerical models towards vocal fold oscillations is useful to identify underlying laryngeal components controlling the phonatory process.

Collapse

Evaluation of clinical value of videokymography for diagnosis and treatment of voice disorders. Eur Arch Otorhinolaryngol 2017;274:3941-3949. [PMID: 28856469 DOI: 10.1007/s00405-017-4726-1] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2017] [Accepted: 08/21/2017] [Indexed: 10/19/2022]

Herbst CT, Schutte HK, Bowling DL, Svec JG. Comparing Chalk With Cheese—The EGG Contact Quotient Is Only a Limited Surrogate of the Closed Quotient. J Voice 2017;31:401-409. [DOI: 10.1016/j.jvoice.2016.11.007] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2016] [Revised: 11/06/2016] [Accepted: 11/08/2016] [Indexed: 10/20/2022]

High-speed Videolaryngoscopy: Quantitative Parameters of Glottal Area Waveforms and High-speed Kymography in Healthy Individuals. J Voice 2017;31:282-290. [DOI: 10.1016/j.jvoice.2016.09.026] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2016] [Revised: 09/22/2016] [Accepted: 09/23/2016] [Indexed: 11/21/2022]

Andrade-Miranda G, Henrich Bernardoni N, Godino-Llorente JI. Synthesizing the motion of the vocal folds using optical flow based techniques. Biomed Signal Process Control 2017. [DOI: 10.1016/j.bspc.2017.01.002] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]

Volgger V, Felicio A, Lohscheller J, Englhard AS, Al-Muzaini H, Betz CS, Schuster ME. Evaluation of the combined use of narrow band imaging and high-speed imaging to discriminate laryngeal lesions. Lasers Surg Med 2017;49:609-618. [PMID: 28231400 DOI: 10.1002/lsm.22652] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/04/2017] [Indexed: 02/05/2023]

Granados A, Misztal MK, Brunskog J, Visseq V, Erleben K. A numerical strategy for finite element modeling of frictionless asymmetric vocal fold collision. INTERNATIONAL JOURNAL FOR NUMERICAL METHODS IN BIOMEDICAL ENGINEERING 2017;33. [PMID: 27058999 DOI: 10.1002/cnm.2793] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/09/2015] [Revised: 02/23/2016] [Accepted: 03/28/2016] [Indexed: 05/08/2023]

Yamauchi A, Yokonishi H, Imagawa H, Sakakibara KI, Nito T, Tayama N, Yamasoba T. Characterization of Vocal Fold Vibration in Sulcus Vocalis Using High-Speed Digital Imaging. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2017;60:24-37. [PMID: 28114611 DOI: 10.1044/2016_jslhr-s-14-0285] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/10/2014] [Accepted: 07/07/2016] [Indexed: 06/06/2023]

Ikuma T, Kunduk M, Fink D, McWhorter AJ. Synthetic multi-line kymographic analysis: A spatiotemporal data reduction technique for high-speed videoendoscopy. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2016;140:2703. [PMID: 27794340 DOI: 10.1121/1.4964400] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]

Semmler M, Kniesburges S, Birk V, Ziethe A, Patel R, Dollinger M. 3D Reconstruction of Human Laryngeal Dynamics Based on Endoscopic High-Speed Recordings. IEEE TRANSACTIONS ON MEDICAL IMAGING 2016;35:1615-1624. [PMID: 26829782 DOI: 10.1109/tmi.2016.2521419] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]

Niebudek-Bogusz E, Kopczynski B, Strumillo P, Morawska J, Wiktorowicz J, Sliwinska-Kowalska M. Quantitative assessment of videolaryngostroboscopic images in patients with glottic pathologies. LOGOP PHONIATR VOCO 2016;42:73-83. [DOI: 10.3109/14015439.2016.1174293] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]

Döllinger M, Berry DA, Kniesburges S. Dynamic vocal fold parameters with changing adduction in ex-vivo hemilarynx experiments. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2016;139:2372. [PMID: 27250133 PMCID: PMC4859834 DOI: 10.1121/1.4947044] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/18/2014] [Revised: 03/22/2016] [Accepted: 04/05/2016] [Indexed: 05/25/2023]

Patel RR, Unnikrishnan H, Donohue KD. Effects of Vocal Fold Nodules on Glottal Cycle Measurements Derived from High-Speed Videoendoscopy in Children. PLoS One 2016;11:e0154586. [PMID: 27124157 PMCID: PMC4849744 DOI: 10.1371/journal.pone.0154586] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2015] [Accepted: 04/17/2016] [Indexed: 11/18/2022] Open

Abstract

The goal of this study is to quantify the effects of vocal fold nodules on vibratory motion in children using high-speed videoendoscopy. Differences in vibratory motion were evaluated in 20 children with vocal fold nodules (5–11 years) and 20 age and gender matched typically developing children (5–11 years) during sustained phonation at typical pitch and loudness. Normalized kinematic features of vocal fold displacements from the mid-membranous vocal fold point were extracted from the steady-state high-speed video. A total of 12 kinematic features representing spatial and temporal characteristics of vibratory motion were calculated. Average values and standard deviations (cycle-to-cycle variability) of the following kinematic features were computed: normalized peak displacement, normalized average opening velocity, normalized average closing velocity, normalized peak closing velocity, speed quotient, and open quotient. Group differences between children with and without vocal fold nodules were statistically investigated. While a moderate effect size was observed for the spatial feature of speed quotient, and the temporal feature of normalized average closing velocity in children with nodules compared to vocally normal children, none of the features were statistically significant between the groups after Bonferroni correction. The kinematic analysis of the mid-membranous vocal fold displacement revealed that children with nodules primarily differ from typically developing children in closing phase kinematics of the glottal cycle, whereas the opening phase kinematics are similar. Higher speed quotients and similar opening phase velocities suggest greater relative forces are acting on vocal fold in the closing phase. These findings suggest that future large-scale studies should focus on spatial and temporal features related to the closing phase of the glottal cycle for differentiating the kinematics of children with and without vocal fold nodules.

Collapse

Relationship of Various Open Quotients With Acoustic Property, Phonation Types, Fundamental Frequency, and Intensity. J Voice 2016;30:145-57. [DOI: 10.1016/j.jvoice.2015.01.009] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2014] [Accepted: 01/30/2015] [Indexed: 10/23/2022]

Unger J, Schuster M, Hecker DJ, Schick B, Lohscheller J. A generalized procedure for analyzing sustained and dynamic vocal fold vibrations from laryngeal high-speed videos using phonovibrograms. Artif Intell Med 2015;66:15-28. [PMID: 26597002 DOI: 10.1016/j.artmed.2015.10.002] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2015] [Revised: 09/28/2015] [Accepted: 10/20/2015] [Indexed: 12/01/2022]

Abstract

OBJECTIVE

This work presents a computer-based approach to analyze the two-dimensional vocal fold dynamics of endoscopic high-speed videos, and constitutes an extension and generalization of a previously proposed wavelet-based procedure. While most approaches aim for analyzing sustained phonation conditions, the proposed method allows for a clinically adequate analysis of both dynamic as well as sustained phonation paradigms.

MATERIALS AND METHODS

The analysis procedure is based on a spatio-temporal visualization technique, the phonovibrogram, that facilitates the documentation of the visible laryngeal dynamics. From the phonovibrogram, a low-dimensional set of features is computed using a principle component analysis strategy that quantifies the type of vibration patterns, irregularity, lateral symmetry and synchronicity, as a function of time. Two different test bench data sets are used to validate the approach: (I) 150 healthy and pathologic subjects examined during sustained phonation. (II) 20 healthy and pathologic subjects that were examined twice: during sustained phonation and a glissando from a low to a higher fundamental frequency. In order to assess the discriminative power of the extracted features, a Support Vector Machine is trained to distinguish between physiologic and pathologic vibrations. The results for sustained phonation sequences are compared to the previous approach. Finally, the classification performance of the stationary analyzing procedure is compared to the transient analysis of the glissando maneuver.

RESULTS

For the first test bench the proposed procedure outperformed the previous approach (proposed feature set: accuracy: 91.3%, sensitivity: 80%, specificity: 97%, previous approach: accuracy: 89.3%, sensitivity: 76%, specificity: 96%). Comparing the classification performance of the second test bench further corroborates that analyzing transient paradigms provides clear additional diagnostic value (glissando maneuver: accuracy: 90%, sensitivity: 100%, specificity: 80%, sustained phonation: accuracy: 75%, sensitivity: 80%, specificity: 70%).

CONCLUSIONS

The incorporation of parameters describing the temporal evolvement of vocal fold vibration clearly improves the automatic identification of pathologic vibration patterns. Furthermore, incorporating a dynamic phonation paradigm provides additional valuable information about the underlying laryngeal dynamics that cannot be derived from sustained conditions. The proposed generalized approach provides a better overall classification performance than the previous approach, and hence constitutes a new advantageous tool for an improved clinical diagnosis of voice disorders.

Collapse

Andrade-Miranda G, Godino-Llorente JI, Moro-Velázquez L, Gómez-García JA. An automatic method to detect and track the glottal gap from high speed videoendoscopic images. Biomed Eng Online 2015;14:100. [PMID: 26510707 PMCID: PMC4625946 DOI: 10.1186/s12938-015-0096-3] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2015] [Accepted: 10/20/2015] [Indexed: 11/17/2022] Open

A Preliminary Quantitative Comparison of Vibratory Amplitude Using Rigid and Flexible Stroboscopic Assessment. J Voice 2015;30:485-92. [PMID: 26149662 DOI: 10.1016/j.jvoice.2015.05.018] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2015] [Accepted: 05/29/2015] [Indexed: 11/22/2022]