Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Schlegel P, Kunduk M, Stingl M, Semmler M, Döllinger M, Bohr C, Schützenberger A. Influence of spatial camera resolution in high-speed videoendoscopy on laryngeal parameters. PLoS One 2019;14:e0215168. [PMID: 31009488 DOI: 10.1371/journal.pone.0215168] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2018] [Accepted: 03/27/2019] [Indexed: 11/19/2022] Open

For:	Schlegel P, Kunduk M, Stingl M, Semmler M, Döllinger M, Bohr C, Schützenberger A. Influence of spatial camera resolution in high-speed videoendoscopy on laryngeal parameters. PLoS One 2019;14:e0215168. [PMID: 31009488 DOI: 10.1371/journal.pone.0215168] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2018] [Accepted: 03/27/2019] [Indexed: 11/19/2022] Open

Number

Cited by Other Article(s)

Santuray R, Schlegel P, Zhang Z, Reddy N, Alhiyari Y, Long JL. Cell-Based Outer Vocal Fold Replacement Both Treats and Prevents Vocal Fold Scarring in Rabbits. Laryngoscope 2024;134:764-772. [PMID: 37597170 PMCID: PMC10842642 DOI: 10.1002/lary.30952] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2023] [Revised: 07/10/2023] [Accepted: 07/25/2023] [Indexed: 08/21/2023]

Malinowski J, Pietruszewska W, Kowalczyk M, Niebudek-Bogusz E. Value of high-speed videoendoscopy as an auxiliary tool in differentiation of benign and malignant unilateral vocal lesions. J Cancer Res Clin Oncol 2024;150:10. [PMID: 38216796 PMCID: PMC10786956 DOI: 10.1007/s00432-023-05543-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2023] [Accepted: 12/13/2023] [Indexed: 01/14/2024]

Abstract

PURPOSE

The study aimed to assess the relevance of objective vibratory parameters derived from high-speed videolaryngoscopy (HSV) as a supporting tool, to assist clinicians in establishing the initial diagnosis of benign and malignant glottal organic lesions.

METHODS

The HSV examinations were conducted in 175 subjects: 50 normophonic, 85 subjects with benign vocal fold lesions, and 40 with early glottic cancer; organic lesions were confirmed by histopathologic examination. The parameters, derived from HSV kymography: amplitude, symmetry, and glottal dynamic characteristics, were compared statistically between the groups with the following ROC analysis.

RESULTS

Among 14 calculated parameters, 10 differed significantly between the groups. Four of them, the average resultant amplitude of the involved vocal fold (AmpInvolvedAvg), average amplitude asymmetry for the whole glottis and its middle third part (AmplAsymAvg; AmplAsymAvg_2/3), and absolute average phase difference (AbsPhaseDiffAvg), showed significant differences between benign and malignant lesions. Amplitude values were decreasing, while asymmetry and phase difference values were increasing with the risk of malignancy. In ROC analysis, the highest AUC was observed for AmpAsymAvg (0.719; p < 0.0001), and next in order was AmpInvolvedAvg (0.70; p = 0.0002).

CONCLUSION

The golden standard in the diagnosis of organic lesions of glottis remains clinical examination with videolaryngoscopy, confirmed by histopathological examination. Our results showed that measurements of amplitude, asymmetry, and phase of vibrations in malignant vocal fold masses deteriorate significantly in comparison to benign vocal lesions. High-speed videolaryngoscopy could aid their preliminary differentiation noninvasively before histopathological examination; however, further research on larger groups is needed.

Collapse

Semmler M, Kniesburges S, Pelka F, Ensthaler M, Wendler O, Schützenberger A. Influence of Reduced Saliva Production on Phonation in Patients With Ectodermal Dysplasia. J Voice 2023;37:913-923. [PMID: 34353685 DOI: 10.1016/j.jvoice.2021.06.016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2021] [Revised: 05/28/2021] [Accepted: 06/02/2021] [Indexed: 10/20/2022]

Abstract

OBJECTIVE

Patients with ectodermal dysplasia (ED) suffer from an inherited disorder in the development of the ectodermal structures. Besides the main symptoms, i.e. significantly reduced formation/expression of teeth, hair and sweat glands, a decreased saliva production is objectively accounted. In addition to difficulties with chewing/swallowing, ED patients frequently report on the subjective impression of rough and hoarse voices. A correlation between the reduced production of saliva and an affliction of the voice has not yet been investigated objectively for this rare disease.

METHODS

Following an established measurement protocol, a study has been conducted on 31 patients with ED and 47 controls (no ED, healthy voice). Additionally, the vocal fold oscillations were recorded by high-speed videoendoscopy (HSV@4 kHz). The glottal area waveform was determined by segmentation and objective glottal dynamic parameters were calculated. The generated acoustic signal was evaluated by objective and subjective measures. The individual impairment was documented by a standardized questionnaire (VHI). Additionally, the amount of generated saliva was measured for a defined period of time.

RESULTS

ED patients displayed a significantly reduced saliva production compared to the control group. Furthermore, the auditory-perceptual evaluation yielded significantly higher ratings for breathiness and hoarseness in the voices of male ED patients compared to male controls. The majority of male ED patients (67%) indicated at least minor impairment in the self-evaluation. Objective acoustic measures like Jitter and Shimmer confirmed the decreased acoustic quality in male ED patients, whereas none of the investigated HSV parameters showed significant differences between the test groups. Statistical analysis did not confirm a statistically significant correlation between reduced voice quality and amount of saliva.

CONCLUSIONS

An objective impairment of the acoustic outcome was demonstrated for male ED patients. However, the vocal folds dynamics in HSV recordings seem unaffected.

Collapse

Tur B, Gühring L, Wendler O, Schlicht S, Drummer D, Kniesburges S. Effect of Ligament Fibers on Dynamics of Synthetic, Self-Oscillating Vocal Folds in a Biomimetic Larynx Model. Bioengineering (Basel) 2023;10:1130. [PMID: 37892860 PMCID: PMC10604794 DOI: 10.3390/bioengineering10101130] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2023] [Revised: 09/13/2023] [Accepted: 09/25/2023] [Indexed: 10/29/2023] Open

Veltrup R, Kniesburges S, Semmler M. Influence of Perspective Distortion in Laryngoscopy. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2023;66:3276-3289. [PMID: 37652062 DOI: 10.1044/2023_jslhr-23-00027] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/02/2023]

Abstract

OBJECTIVE

An experiment with controllable boundaries was designed to assess the influence of the recording angle and distance on two-dimensional (2D) imaging in laryngoscopy and resulting 2D parameter calculation derived from the glottal area waveform (GAW).

METHOD

Two high-speed camera setups were used to synchronously record an oscillating synthetic vocal fold (VF) model, simulating a high-speed videoendoscopy. One camera recorded at variable lateral recording angles and a reference camera in superior perspective. This was performed at different physiological recording distances and for two oscillation modes (with/without contacting VFs). The GAW was derived from the segmented glottis, and two parameters each for the categories of symmetry, periodicity, and closure were calculated, as well as two derivative measures. The percentage difference between the variable and reference camera value pairs was calculated, and the angle and height dependencies were quantified using linear regression.

RESULTS

The visual perception of a laryngoscopy was found to be influenced by the lateral recording angle, which may lead to misinterpretation of VF symmetry among inexperienced observers. The strongest influence of recording angle was observed for symmetry parameters, the strongest being the Amplitude Symmetry Index with up to 2.6%/° (p < .05). A dependence on the recording distance was only found for the Maximum Area Declination Rate.

CONCLUSIONS

The recording angle in 2D laryngoscopy should be carefully considered during visual inspection of the VF dynamics. Most of the investigated objective parameters were unaffected by the examined perspective distortion. However, especially left-right symmetry measures should only be used under controlled boundary conditions to avoid misdiagnosis and misinterpretation.

SUPPLEMENTAL MATERIAL

https://doi.org/10.23641/asha.23961183.

Collapse

Zhang Z. Voice Feature Selection to Improve Performance of Machine Learning Models for Voice Production Inversion. J Voice 2023;37:479-485. [PMID: 33849760 PMCID: PMC8502179 DOI: 10.1016/j.jvoice.2021.03.004] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2020] [Revised: 02/24/2021] [Accepted: 03/01/2021] [Indexed: 11/19/2022]

Abstract

OBJECTIVE

Estimation of physiological control parameters of the vocal system from the produced voice outcome has important applications in clinical management of voice disorders . Previously we developed a simulation-based neural network for estimation of vocal fold geometry, mechanical properties, and subglottal pressure from voice outcome features that characterize the acoustics of the produced voice. The goals of this study are to (1) explore the possibility of improving the estimation accuracy of physiological control parameters by including voice outcome features characterizing vocal fold vibration; and (2) identify voice feature sets that optimize both estimation accuracy and robustness to measurement noise.

METHODS

Feedforward neural networks are trained to solve the inversion problem of estimating the physiological control parameters of a three-dimensional body-cover vocal fold model from different sets of voice outcome features that characterize the simulated voice acoustics, glottal flow, and vocal fold vibration. A sensitivity analysis is then performed to evaluate the contribution of individual voice features to the overall performance of the neural networks in estimating the physiologic control parameters.

RESULTS AND CONCLUSIONS

While including voice outcome features characterizing vocal fold vibration increases estimation accuracy, it also reduces the network's robustness to measurement noise, due to high sensitivity of network performance to voice outcome features measuring the absolute amplitudes of the glottal flow and area waveforms, which are also difficult to measure accurately in practical applications. By excluding such glottal flow-based features and replacing glottal area-based features by their normalized counterparts, we are able to significantly improve both estimation accuracy and robustness to noise. We further show that similar estimation accuracy and robustness can be achieved with an even smaller set of voice outcome features by excluding features of small sensitivity.

Collapse

Pelka F, Ensthaler M, Wendler O, Kniesburges S, Schützenberger A, Semmler M. Mechanical Parameters Based on High-Speed Videoendoscopy of the Vocal Folds in Patients With Ectodermal Dysplasia. J Voice 2023:S0892-1997(23)00084-X. [PMID: 36973131 DOI: 10.1016/j.jvoice.2023.02.027] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2023] [Revised: 02/21/2023] [Accepted: 02/21/2023] [Indexed: 03/29/2023]

Abstract

OBJECTIVE

Patients suffering from ectodermal dysplasia (ED), which is an inherited disorder in the development of the ectodermal structures, have a significantly reduced expression of teeth, hair, sweat glands, and salivary glands in the respiratory tract including the larynx. Previous studies within the framework of the present project showed a significantly reduced saliva production and an impairment of the acoustic outcome in ED patients compared to the control group. However, until now, no statistically significant difference between EDs and controls could be found regarding vocal fold dynamics in the high-speed videoendoscopy (HSV) recordings using representative parameters on closure, symmetry, and periodicity. The aim of this study is to examine the role of tissue characteristics by means of objective mechanical parameters derived from HSV recordings.

METHODS

This study includes 28 ED patients and 42 controls (no ED, healthy voice). The vocal fold oscillations were recorded by high-speed videoendoscopy (HSV@4kHz). Based on the dynamical measures of the glottal area waveform (GAW), objective glottal dynamic parameters associated with tissue properties like flexibility and stiffness were computed.

RESULTS

The present evaluation displays a significant difference between male ED patients and male controls concerning the HSV-based mechanical parameters indicating reduced stiffness and increased deformability for the vocal folds of male ED patients. In contrast to strongly amplitude-dependent parameters, the primarily velocity-based parameters showed no statistically significant deviation.

CONCLUSIONS

The presented data provides the first promising indication toward the underlying causes on the laryngeal level leading to the voice conspicuities in ED patients. The significant difference concerning the mechanical parameters suggests a different composition of the extracellular matrix of the tissue of the vocal folds of ED patients compared to controls.

Collapse

Arias-Vergara T, Döllinger M, Schraut T, Mohd Khairuddin KA, Schützenberger A. Nyquist Plot Parametrization for Quantitative Analysis of Vibration of the Vocal Folds. J Voice 2023:S0892-1997(23)00014-0. [PMID: 36774264 DOI: 10.1016/j.jvoice.2023.01.014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2022] [Revised: 01/12/2023] [Accepted: 01/12/2023] [Indexed: 02/11/2023]

Kaluza J, Niebudek-Bogusz E, Malinowski J, Strumillo P, Pietruszewska W. Assessment of Vocal Fold Stiffness by Means of High-Speed Videolaryngoscopy with Laryngotopography in Prediction of Early Glottic Malignancy: Preliminary Report. Cancers (Basel) 2022;14:cancers14194697. [PMID: 36230618 PMCID: PMC9563419 DOI: 10.3390/cancers14194697] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2022] [Revised: 09/07/2022] [Accepted: 09/19/2022] [Indexed: 11/16/2022] Open

Abstract

Simple Summary

The method described in our manuscript can help to objectively assess the vibration of each vocal fold using larygotopographic analysis of high-speed videoendoscopy (HSV) recordings. We have developed image processing and analysis procedures to detect vocal fold regions in HSV films and quantitatively analyze their shape and kinematics. We proposed the term Stiffness Asymmetry Index which can provide valuable information on the texture and kinematic properties of individual vocal fold tissues, which can be important in the diagnosis of early glottis cancer. Our study showed that a low value of SAI indicated large, non-vibrating vocal fold areas, characteristic of infiltrative lesions such as invasive carcinoma. This important clinical information can help to assess the depth of vocal fold invasion before direct histologic examination and discriminate benign from malignant lesions.

Abstract

One of the most important challenges in laryngological practice is the early diagnosis of laryngeal cancer. Detection of non-vibrating areas affected by neoplastic lesions of the vocal folds can be crucial in the recognition of early cancerogenous infiltration. Glottal pathologies associated with abnormal vibration patterns of the vocal folds can be detected and quantified using High-speed Videolaryngoscopy (HSV), also in subjects with severe voice disorders, and analyzed with the aid of computer image processing procedures. We present a method that enables the assessment of vocal fold pathologies with the use of HSV. The calculated laryngotopographic (LTG) maps of the vocal folds based on HSV allowed for a detailed characterization of vibration patterns and abnormalities in different regions of the vocal folds. We verified our methods with HSV recordings from 31 subjects with a normophonic voice and benign and malignant vocal fold lesions. We proposed the novel Stiffness Asymmetry Index (SAI) to differentiate between early glottis cancer (SAI = 0.65 ± 0.18) and benign vocal fold masses (SAI = 0.16 ± 0.13). Our results showed that these glottal pathologies might be noninvasively distinguished prior to histopathological examination. However, this needs to be confirmed by further research on larger groups of benign and malignant laryngeal lesions.

Collapse

Isolated Severe Dysphonia as a Presentation of Post-COVID-19 Syndrome. Diagnostics (Basel) 2022;12:diagnostics12081839. [PMID: 36010188 PMCID: PMC9406942 DOI: 10.3390/diagnostics12081839] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2022] [Revised: 07/26/2022] [Accepted: 07/26/2022] [Indexed: 11/18/2022] Open

Comparative analysis of high-speed videolaryngoscopy images and sound data simultaneously acquired from rigid and flexible laryngoscope: a pilot study. Sci Rep 2021;11:20480. [PMID: 34650174 PMCID: PMC8516923 DOI: 10.1038/s41598-021-99948-9] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2020] [Accepted: 10/04/2021] [Indexed: 12/03/2022] Open

Malinowski J, Niebudek-Bogusz E, Just M, Morawska J, Racino A, Hoffman J, Barańska M, Kowalczyk MM, Pietruszewska W. Laryngeal High-Speed Videoendoscopy with Laser Illumination: A Preliminary Report. Otolaryngol Pol 2021;75:1-10. [PMID: 35175220 DOI: 10.5604/01.3001.0015.2575] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]

Abstract

Introduction: Advances in computer image analysis have enabled the use of new functional imaging methods in the diagnosis of laryngeal diseases. Particularly interesting techniques of dynamic laryngeal imaging involve High Speed Videoendoscopy (HSV). This still-developed technique allows to overcome the limitations of laryngovideostroboscopy (LVS) and a more detailed analysis of the glottal function based on the image of the actual vibrations of the vocal folds. It also enables the determination of objective coefficients parameterizing phonatory vibrations of the vocal folds. Aim: The aim of this pilot study was to evaluate the use of a high-speed videoendoscopy set with laser illumination for the diagnosis of glottic pathology in ENT practice. Material and methods: The study included 40 patients who underwent LVS followed by HSV. The modern HSV examination kit - Advanced Larynx Imager System (ALIS), used for the first time in a clinical setting in Poland, is characterized by significantly improved, compared to the previously used high-speed cameras, operational parameters - a light head, the possibility of continuous lighting operation without excessive heating of the head tip, registration of the image in full color scale. Thanks to such modernization, the safety and course of the examination do not differ from laryngoscopy conducted with commonly used recorders. The device owes some of these improvements to a laser illuminator which was used for the first time as the main light source in a high-speed camera. In the study, two cases were selected to present the results of HSV and the analysis of the generated kymograms - a woman with no glottic pathology and a man with a polyp of the right vocal fold. In the first case, the HSV examination compared with the LVS revealed a discrete glottis functional disorder in the form of a tendency to hyperphonation. The patient with an organic lesion had a clearly visible irregularity of vocal fold vibrations, which also allowed to trace mucosal wave disturbances related to its reflection from the pathological structure of the glottis and the formation of a return wave, both on the fold affected by the lesion and, to a lesser extent, contralaterally. The glottic dysfunctions observed in the studied patients were confirmed in the generated kymograms and the graphs of the glottal width waveform (GWW), as well as in the parameters calculated on their basis, assessing the frequency and amplitude of phonatory vibrations. Conclusions: The use of high-speed videoendoscopy allows for a much more accurate assessment of the phonatory function of the glottis than in laryngovideostroboscopy. The presented HSV system allows for obtaining high quality kinematic images of the larynx, color fidelity, and contrast. The use of this technology in laryngological practice enables precise structural and functional assessment of the glottis and detection of discrete phonation disorders that elude the techniques used so far.</br&gt.

Collapse

Kist AM, Dürr S, Schützenberger A, Döllinger M. OpenHSV: an open platform for laryngeal high-speed videoendoscopy. Sci Rep 2021;11:13760. [PMID: 34215788 PMCID: PMC8253769 DOI: 10.1038/s41598-021-93149-0] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2020] [Accepted: 06/03/2021] [Indexed: 11/22/2022] Open

Kim Y, Oh J, Choi SH, Jung A, Lee JG, Lee YS, Kim JK. A Portable Smartphone-Based Laryngoscope System for High-Speed Vocal Cord Imaging of Patients With Throat Disorders: Instrument Validation Study. JMIR Mhealth Uhealth 2021;9:e25816. [PMID: 34142978 PMCID: PMC8277344 DOI: 10.2196/25816] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2020] [Revised: 02/17/2021] [Accepted: 05/13/2021] [Indexed: 11/13/2022] Open

Abstract

Background

Currently, high-speed digital imaging (HSDI), especially endoscopic HSDI, is routinely used for the diagnosis of vocal cord disorders. However, endoscopic HSDI devices are usually large and costly, which limits access to patients in underdeveloped countries and in regions with inadequate medical infrastructure. Modern smartphones have sufficient functionality to process the complex calculations that are required for processing high-resolution images and videos with a high frame rate. Recently, several attempts have been made to integrate medical endoscopes with smartphones to make them more accessible to people in underdeveloped countries.

Objective

This study aims to develop a smartphone adaptor for endoscopes, which enables smartphone-based vocal cord imaging, to demonstrate the feasibility of performing high-speed vocal cord imaging via the high-speed imaging functions of a high-performance smartphone camera, and to determine the acceptability of the smartphone-based high-speed vocal cord imaging system for clinical applications in developing countries.

Methods

A customized smartphone adaptor optical relay was designed for clinical endoscopy using selective laser melting–based 3D printing. A standard laryngoscope was attached to the smartphone adaptor to acquire high-speed vocal cord endoscopic images. Only existing basic functions of the smartphone camera were used for HSDI of the vocal cords. Extracted still frames were observed for qualitative glottal volume and shape. For image processing, segmented glottal and vocal cord areas were calculated from whole HSDI frames to characterize the amplitude of the vibrations on each side of the glottis, including the frequency, edge length, glottal areas, base cord, and lateral phase differences over the acquisition time. The device was incorporated into a preclinical videokymography diagnosis routine to compare functionality.

Results

Smartphone-based HSDI with the smartphone-endoscope adaptor could achieve 940 frames per second and a resolution of 1280 by 720 frames, which corresponds to the detection of 3 to 8 frames per vocal cycle at double the spatial resolution of existing devices. The device was used to image the vocal cords of 4 volunteers: 1 healthy individual and 3 patients with vocal cord paralysis, chronic laryngitis, or vocal cord polyps. The resultant image stacks were sufficient for most diagnostic purposes. The cost of the device including the smartphone was lower than that of existing HSDI devices. The image processing and analytics demonstrated the successful calculation of relevant diagnostic variables from the acquired images. Patients with vocal pathologies were easily differentiable in the quantitative data.

Conclusions

A smartphone-based HSDI endoscope system can function as a point-of-care clinical diagnostic device. The resulting analysis is of higher quality than that accessible by videostroboscopy and promises comparable quality and greater accessibility than HSDI. In particular, this system is suitable for use as an accessible diagnostic tool in underdeveloped areas with inadequate medical service infrastructure.

Collapse

Kist AM, Gómez P, Dubrovskiy D, Schlegel P, Kunduk M, Echternach M, Patel R, Semmler M, Bohr C, Dürr S, Schützenberger A, Döllinger M. A Deep Learning Enhanced Novel Software Tool for Laryngeal Dynamics Analysis. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2021;64:1889-1903. [PMID: 34000199 DOI: 10.1044/2021_jslhr-20-00498] [Citation(s) in RCA: 38] [Impact Index Per Article: 12.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]

Abstract

Purpose High-speed videoendoscopy (HSV) is an emerging, but barely used, endoscopy technique in the clinic to assess and diagnose voice disorders because of the lack of dedicated software to analyze the data. HSV allows to quantify the vocal fold oscillations by segmenting the glottal area. This challenging task has been tackled by various studies; however, the proposed approaches are mostly limited and not suitable for daily clinical routine. Method We developed a user-friendly software in C# that allows the editing, motion correction, segmentation, and quantitative analysis of HSV data. We further provide pretrained deep neural networks for fully automatic glottis segmentation. Results We freely provide our software Glottis Analysis Tools (GAT). Using GAT, we provide a general threshold-based region growing platform that enables the user to analyze data from various sources, such as in vivo recordings, ex vivo recordings, and high-speed footage of artificial vocal folds. Additionally, especially for in vivo recordings, we provide three robust neural networks at various speed and quality settings to allow a fully automatic glottis segmentation needed for application by untrained personnel. GAT further evaluates video and audio data in parallel and is able to extract various features from the video data, among others the glottal area waveform, that is, the changing glottal area over time. In total, GAT provides 79 unique quantitative analysis parameters for video- and audio-based signals. Many of these parameters have already been shown to reflect voice disorders, highlighting the clinical importance and usefulness of the GAT software. Conclusion GAT is a unique tool to process HSV and audio data to determine quantitative, clinically relevant parameters for research, diagnosis, and treatment of laryngeal disorders. Supplemental Material https://doi.org/10.23641/asha.14575533.

Collapse

Semmler M, Berry DA, Schützenberger A, Döllinger M. Fluid-structure-acoustic interactions in an ex vivo porcine phonation model. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2021;149:1657. [PMID: 33765793 PMCID: PMC7952141 DOI: 10.1121/10.0003602] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/28/2020] [Revised: 01/29/2021] [Accepted: 02/07/2021] [Indexed: 05/02/2023]

Schlegel P, Kist AM, Kunduk M, Dürr S, Döllinger M, Schützenberger A. Interdependencies between acoustic and high-speed videoendoscopy parameters. PLoS One 2021;16:e0246136. [PMID: 33529244 PMCID: PMC7853476 DOI: 10.1371/journal.pone.0246136] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2020] [Accepted: 01/13/2021] [Indexed: 02/06/2023] Open

Schlegel P, Kniesburges S, Dürr S, Schützenberger A, Döllinger M. Machine learning based identification of relevant parameters for functional voice disorders derived from endoscopic high-speed recordings. Sci Rep 2020;10:10517. [PMID: 32601277 PMCID: PMC7324600 DOI: 10.1038/s41598-020-66405-y] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2020] [Accepted: 05/20/2020] [Indexed: 11/13/2022] Open

Chalich Y, Mallick A, Gupta B, Deen MJ. Development of a low-cost, user-customizable, high-speed camera. PLoS One 2020;15:e0232788. [PMID: 32384109 PMCID: PMC7209243 DOI: 10.1371/journal.pone.0232788] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2020] [Accepted: 04/21/2020] [Indexed: 01/13/2023] Open

Kniesburges S, Lodermeyer A, Semmler M, Schulz YK, Schützenberger A, Becker S. Analysis of the tonal sound generation during phonation with and without glottis closure. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2020;147:3285. [PMID: 32486803 DOI: 10.1121/10.0001184] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/11/2023]

Maryn Y, Verguts M, Demarsin H, van Dinther J, Gomez P, Schlegel P, Döllinger M. Intersegmenter Variability in High-Speed Laryngoscopy-Based Glottal Area Waveform Measures. Laryngoscope 2019;130:E654-E661. [PMID: 31840827 DOI: 10.1002/lary.28475] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2019] [Accepted: 11/26/2019] [Indexed: 12/31/2022]

Abstract

OBJECTIVES/HYPOTHESIS

High-speed videoendoscopy (HSV) has potential to objectively quantify vibratory vocal fold characteristics during phonation. Glottal Analysis Tools (GAT) version 2018, developed in Erlangen, Germany, is software for determining various glottal area waveform (GAW) quantities. Before having GAT analyze HSV videos, segmenters have to define glottis manually across videos in a semiautomatic segmentation protocol. Such interventions are hypothesized to induce variability of subsequent GAW measure computation across segmenters and may attenuate GAT measures' reliability to a certain point. This study explored intersegmenter variability in GAT's GAW measures based on semiautomatic image processing.

STUDY DESIGN

Cohort study of rater reliability.

METHODS

In total, 20 HSV videos from normophonic and dysphonic subjects with various laryngeal disorders were selected for this study and segmented by three trained segmenters. They separately segmented glottis areas in the same frame sets of the videos. Upon analysis of GAW, GAT offers 46 measures related to topologic GAW dynamic characteristics, GAW periodicity and perturbation characteristics, and GAW harmonic components. To address GAT's reliability, intersegmenter-based variability in these measures was examined with intraclass correlation coefficient (ICC).

RESULTS

In general, ICC behavior of the 46 GAW measures across three raters was highly acceptable. ICC of one parameter was moderate (0.5 < ICC < 0.75), good for seven parameters (0.75 < ICC < 0.9), and excellent for 38 parameters (0.9 < ICC).

CONCLUSIONS

Overall, high ICC values confirm clinical applicability of GAT for objective and quantitative assessment of HSV. Small intersegmenter differences with actual small parameter differences suggest that manual or semiautomatic segmentation in GAT does not noticeably influence clinical assessment outcome. To guarantee the software's performance, we suggest segmentation training before clinical application.

LEVEL OF EVIDENCE

2b Laryngoscope, 130:E654-E661, 2020.

Collapse

Affiliation(s)

Youri Maryn Department of Otorhinolaryngology-Head and Neck Surgery, European Institute for Otorhinolaryngology-Head and Neck Surgery, GasthuisZusters Antwerpen Sint-Augustinus, Wilrijk/Antwerp, Belgium.,Department of Speech, Language, and Hearing Sciences, University of Ghent, Ghent, Belgium.,Faculty of Education, Health, and Social Work, University College of Ghent, Ghent, Belgium.,Faculty of Psychology and Educational Sciences, School of Logopedics, Université Catholique de Louvain, Louvain-la-Neuve, Belgium.,Faculty of Medicine and Health Sciences, University of Antwerp, Antwerp, Belgium.,Phonanium, Lokeren, Belgium
Monique Verguts Department of Otorhinolaryngology-Head and Neck Surgery, European Institute for Otorhinolaryngology-Head and Neck Surgery, GasthuisZusters Antwerpen Sint-Augustinus, Wilrijk/Antwerp, Belgium.,Department of Otorhinolaryngology and Voice Disorders, Diest General Hospital, Diest, Belgium
Hannelore Demarsin Department of Otorhinolaryngology-Head and Neck Surgery, European Institute for Otorhinolaryngology-Head and Neck Surgery, GasthuisZusters Antwerpen Sint-Augustinus, Wilrijk/Antwerp, Belgium
Joost van Dinther Department of Otorhinolaryngology-Head and Neck Surgery, European Institute for Otorhinolaryngology-Head and Neck Surgery, GasthuisZusters Antwerpen Sint-Augustinus, Wilrijk/Antwerp, Belgium
Pablo Gomez Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology-Head and Neck Surgery, University Hospital Erlangen, Friedrich-Alexander University Erlangen-Nürnberg, Erlangen, Germany
Patrick Schlegel Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology-Head and Neck Surgery, University Hospital Erlangen, Friedrich-Alexander University Erlangen-Nürnberg, Erlangen, Germany
Michael Döllinger Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology-Head and Neck Surgery, University Hospital Erlangen, Friedrich-Alexander University Erlangen-Nürnberg, Erlangen, Germany

Collapse