Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Eysholdt U, Rosanowski F, Hoppe U. Vocal fold vibration irregularities caused by different types of laryngeal asymmetry. Eur Arch Otorhinolaryngol 2003;260:412-7. [PMID: 12690514 DOI: 10.1007/s00405-003-0606-y] [Citation(s) in RCA: 86] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2002] [Accepted: 03/12/2003] [Indexed: 11/29/2022]

For:	Eysholdt U, Rosanowski F, Hoppe U. Vocal fold vibration irregularities caused by different types of laryngeal asymmetry. Eur Arch Otorhinolaryngol 2003;260:412-7. [PMID: 12690514 DOI: 10.1007/s00405-003-0606-y] [Citation(s) in RCA: 86] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2002] [Accepted: 03/12/2003] [Indexed: 11/29/2022]

Number

Cited by Other Article(s)

Nobel SMN, Swapno SMMR, Islam MR, Safran M, Alfarhood S, Mridha MF. A machine learning approach for vocal fold segmentation and disorder classification based on ensemble method. Sci Rep 2024;14:14435. [PMID: 38910146 DOI: 10.1038/s41598-024-64987-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2024] [Accepted: 06/14/2024] [Indexed: 06/25/2024] Open

Abstract

In the healthcare domain, the essential task is to understand and classify diseases affecting the vocal folds (VFs). The accurate identification of VF disease is the key issue in this domain. Integrating VF segmentation and disease classification into a single system is challenging but important for precise diagnostics. Our study addresses this challenge by combining VF illness categorization and VF segmentation into a single integrated system. We utilized two effective ensemble machine learning methods: ensemble EfficientNetV2L-LGBM and ensemble UNet-BiGRU. We utilized the EfficientNetV2L-LGBM model for classification, achieving a training accuracy of 98.88%, validation accuracy of 97.73%, and test accuracy of 97.88%. These exceptional outcomes highlight the system's ability to classify different VF illnesses precisely. In addition, we utilized the UNet-BiGRU model for segmentation, which attained a training accuracy of 92.55%, a validation accuracy of 89.87%, and a significant test accuracy of 91.47%. In the segmentation task, we examined some methods to improve our ability to divide data into segments, resulting in a testing accuracy score of 91.99% and an Intersection over Union (IOU) of 87.46%. These measures demonstrate skill of the model in accurately defining and separating VF. Our system's classification and segmentation results confirm its capacity to effectively identify and segment VF disorders, representing a significant advancement in enhancing diagnostic accuracy and healthcare in this specialized field. This study emphasizes the potential of machine learning to transform the medical field's capacity to categorize VF and segment VF, providing clinicians with a vital instrument to mitigate the profound impact of the condition. Implementing this innovative approach is expected to enhance medical procedures and provide a sense of optimism to those globally affected by VF disease.

Collapse

Bottasso-Arias N, Burra K, Sinner D, Riede T. Disruption of BMP4 signaling is associated with laryngeal birth defects in a mouse model. Dev Biol 2023;500:10-21. [PMID: 37230380 PMCID: PMC10330877 DOI: 10.1016/j.ydbio.2023.04.007] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2023] [Revised: 04/18/2023] [Accepted: 04/24/2023] [Indexed: 05/27/2023]

Schlegel P, Döllinger M, Reddy NK, Zhang Z, Chhetri DK. Validation and enhancement of a vocal fold medial surface 3D reconstruction approach for in-vivo application. Sci Rep 2023;13:10705. [PMID: 37400470 DOI: 10.1038/s41598-023-36022-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2022] [Accepted: 05/27/2023] [Indexed: 07/05/2023] Open

Pedersen M, Larsen CF, Madsen B, Eeg M. Localization and quantification of glottal gaps on deep learning segmentation of vocal folds. Sci Rep 2023;13:878. [PMID: 36650265 PMCID: PMC9845318 DOI: 10.1038/s41598-023-27980-y] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2022] [Accepted: 01/11/2023] [Indexed: 01/19/2023] Open

Ikuma T, McWhorter AJ, Adkins L, Kunduk M. Investigation of Vocal Bifurcations and Voice Patterns Induced by Asymmetry of Pathological Vocal Folds. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2023;66:48-60. [PMID: 36472934 DOI: 10.1044/2022_jslhr-21-00499] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/17/2023]

Stewart ME, Erath BD. Investigating blunt force trauma to the larynx: The role of inferior-superior vocal fold displacement on phonation. J Biomech 2021;121:110377. [PMID: 33819698 DOI: 10.1016/j.jbiomech.2021.110377] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2020] [Revised: 02/24/2021] [Accepted: 03/01/2021] [Indexed: 11/26/2022]

Falk S, Kniesburges S, Schoder S, Jakubaß B, Maurerlehner P, Echternach M, Kaltenbacher M, Döllinger M. 3D-FV-FE Aeroacoustic Larynx Model for Investigation of Functional Based Voice Disorders. Front Physiol 2021;12:616985. [PMID: 33762964 PMCID: PMC7982522 DOI: 10.3389/fphys.2021.616985] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2020] [Accepted: 02/09/2021] [Indexed: 12/02/2022] Open

Abstract

For the clinical analysis of underlying mechanisms of voice disorders, we developed a numerical aeroacoustic larynx model, called simVoice, that mimics commonly observed functional laryngeal disorders as glottal insufficiency and vibrational left-right asymmetries. The model is a combination of the Finite Volume (FV) CFD solver Star-CCM+ and the Finite Element (FE) aeroacoustic solver CFS++. simVoice models turbulence using Large Eddy Simulations (LES) and the acoustic wave propagation with the perturbed convective wave equation (PCWE). Its geometry corresponds to a simplified larynx and a vocal tract model representing the vowel /a/. The oscillations of the vocal folds are externally driven. In total, 10 configurations with different degrees of functional-based disorders were simulated and analyzed. The energy transfer between the glottal airflow and the vocal folds decreases with an increasing glottal insufficiency and potentially reflects the higher effort during speech for patients being concerned. This loss of energy transfer may also have an essential influence on the quality of the sound signal as expressed by decreasing sound pressure level (SPL), Cepstral Peak Prominence (CPP), and Vocal Efficiency (VE). Asymmetry in the vocal fold oscillations also reduces the quality of the sound signal. However, simVoice confirmed previous clinical and experimental observations that a high level of glottal insufficiency worsens the acoustic signal quality more than oscillatory left-right asymmetry. Both symptoms in combination will further reduce the quality of the sound signal. In summary, simVoice allows for detailed analysis of the origins of disordered voice production and hence fosters the further understanding of laryngeal physiology, including occurring dependencies. A current walltime of 10 h/cycle is, with a prospective increase in computing power, auspicious for a future clinical use of simVoice.

Collapse

Semmler M, Berry DA, Schützenberger A, Döllinger M. Fluid-structure-acoustic interactions in an ex vivo porcine phonation model. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2021;149:1657. [PMID: 33765793 PMCID: PMC7952141 DOI: 10.1121/10.0003602] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/28/2020] [Revised: 01/29/2021] [Accepted: 02/07/2021] [Indexed: 05/02/2023]

Zäske R, Skuk VG, Schweinberger SR. Attractiveness and distinctiveness between speakers' voices in naturalistic speech and their faces are uncorrelated. ROYAL SOCIETY OPEN SCIENCE 2020;7:201244. [PMID: 33489273 PMCID: PMC7813223 DOI: 10.1098/rsos.201244] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/13/2020] [Accepted: 11/20/2020] [Indexed: 05/28/2023]

Phonation threshold pressure at large asymmetries of the vocal folds. Biomed Signal Process Control 2020. [DOI: 10.1016/j.bspc.2020.102105] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]

Schlegel P, Kniesburges S, Dürr S, Schützenberger A, Döllinger M. Machine learning based identification of relevant parameters for functional voice disorders derived from endoscopic high-speed recordings. Sci Rep 2020;10:10517. [PMID: 32601277 PMCID: PMC7324600 DOI: 10.1038/s41598-020-66405-y] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2020] [Accepted: 05/20/2020] [Indexed: 11/13/2022] Open

Mohd Khairuddin KA, Ahmad K, Ibrahim HM, Yan Y. Effects of Using Laryngeal High-Speed Videoendoscopy Images Visualizing Partial Views of The Glottis on Measurement Outcomes. J Voice 2020;36:106-112. [PMID: 32456835 DOI: 10.1016/j.jvoice.2020.04.027] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2020] [Revised: 04/21/2020] [Accepted: 04/22/2020] [Indexed: 11/29/2022]

Abstract

Ideally, an analysis method for laryngeal high-speed videoendoscopy (LHSV) based on the glottal area waveforms (GAW) requires images of a complete view of the glottis to ensure findings that are representatives of the vibratory behaviors of the whole vocal folds. However, in practice, the preferred images may not be obtained at all times. Often, the only available images that a clinician has to work with consist of a partial view of the glottis. This study aims to examine the effects of using images of a partial view of the glottis (ie, posterior-middle, anterior-middle, or middle) on the LHSV-based measures (ie, fundamental frequency (F0_GAW), frequency perturbation (jitter_GAW), amplitude perturbation (shimmer_GAW), open quotient (OQ_GAW), and Nyquist plot). The participants consisted of 9 young normophonic females. The procedures involved LHSV recording of the vibration of the vocal folds. The images of the complete view of the glottis were analyzed to obtain the LHSV-based measures. The same images were used to simulate the images of partial views of the glottis by changing the outline of the region of interest to include only either the posterior-middle, anterior-middle, or middle parts of the glottis. The LHSV-based measures from the images of the partial views were then compared to those with the complete view . The results showed that all LHSV-based measures from the images of the posterior-middle view were similar to those of the complete view. However, only the F0_GAW, jitter_GAW, and shimmer_GAW from the images of the anterior-middle and middle views were similar to those of the complete view. Lower OQ_GAW and different Nyquist plots than those of the complete view were generated by the images of the anterior-middle and middle views. In conclusion, all LHSV-based measures from the images of the posterior-middle view of the glottis, and only the F0_GAW, jitter_GAW, and shimmer_GAW from the images of the anterior-middle and middle views of the glottis reflect the vibratory behaviors of the whole vocal folds. The same conclusion could not be applied to the OQ_GAW and Nyquist plots of the images of the anterior-middle and middle views of the glottis. A possible effect of the presence or absence of a posterior glottal gap on the findings warrants further confirmation.

Collapse

Schlegel P, Kist AM, Semmler M, Döllinger M, Kunduk M, Dürr S, Schützenberger A. Determination of Clinical Parameters Sensitive to Functional Voice Disorders Applying Boosted Decision Stumps. IEEE JOURNAL OF TRANSLATIONAL ENGINEERING IN HEALTH AND MEDICINE 2020;8:2100511. [PMID: 32518739 PMCID: PMC7274815 DOI: 10.1109/jtehm.2020.2985026] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/03/2019] [Revised: 02/21/2020] [Accepted: 03/28/2020] [Indexed: 12/30/2022]

Fehling MK, Grosch F, Schuster ME, Schick B, Lohscheller J. Fully automatic segmentation of glottis and vocal folds in endoscopic laryngeal high-speed videos using a deep Convolutional LSTM Network. PLoS One 2020;15:e0227791. [PMID: 32040514 PMCID: PMC7010264 DOI: 10.1371/journal.pone.0227791] [Citation(s) in RCA: 38] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2019] [Accepted: 12/25/2019] [Indexed: 01/22/2023] Open

Abstract

The objective investigation of the dynamic properties of vocal fold vibrations demands the recording and further quantitative analysis of laryngeal high-speed video (HSV). Quantification of the vocal fold vibration patterns requires as a first step the segmentation of the glottal area within each video frame from which the vibrating edges of the vocal folds are usually derived. Consequently, the outcome of any further vibration analysis depends on the quality of this initial segmentation process. In this work we propose for the first time a procedure to fully automatically segment not only the time-varying glottal area but also the vocal fold tissue directly from laryngeal high-speed video (HSV) using a deep Convolutional Neural Network (CNN) approach. Eighteen different Convolutional Neural Network (CNN) network configurations were trained and evaluated on totally 13,000 high-speed video (HSV) frames obtained from 56 healthy and 74 pathologic subjects. The segmentation quality of the best performing Convolutional Neural Network (CNN) model, which uses Long Short-Term Memory (LSTM) cells to take also the temporal context into account, was intensely investigated on 15 test video sequences comprising 100 consecutive images each. As performance measures the Dice Coefficient (DC) as well as the precisions of four anatomical landmark positions were used. Over all test data a mean Dice Coefficient (DC) of 0.85 was obtained for the glottis and 0.91 and 0.90 for the right and left vocal fold (VF) respectively. The grand average precision of the identified landmarks amounts 2.2 pixels and is in the same range as comparable manual expert segmentations which can be regarded as Gold Standard. The method proposed here requires no user interaction and overcomes the limitations of current semiautomatic or computational expensive approaches. Thus, it allows also for the analysis of long high-speed video (HSV)-sequences and holds the promise to facilitate the objective analysis of vocal fold vibrations in clinical routine. The here used dataset including the ground truth will be provided freely for all scientific groups to allow a quantitative benchmarking of segmentation approaches in future.

Collapse

Influence of spatial camera resolution in high-speed videoendoscopy on laryngeal parameters. PLoS One 2019;14:e0215168. [PMID: 31009488 PMCID: PMC6476512 DOI: 10.1371/journal.pone.0215168] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2018] [Accepted: 03/27/2019] [Indexed: 11/19/2022] Open

Abstract

In laryngeal high-speed videoendoscopy (HSV) the area between the vibrating vocal folds during phonation is of interest, being referred to as glottal area waveform (GAW). Varying camera resolution may influence parameters computed on the GAW and hence hinder the comparability between examinations. This study investigates the influence of spatial camera resolution on quantitative vocal fold vibratory function parameters obtained from the GAW. In total 40 HSV recordings during sustained phonation (20 healthy males and 20 healthy females) were investigated. A clinically used Photron Fastcam MC2 camera with a frame rate of 4000 fps and a spatial resolution of 512×256 pixels was applied. This initial resolution was reduced by pixel averaging to (1) a resolution of 256×128 and (2) to a resolution of 128×64 pixels, yielding three sets of recordings. The GAW was extracted and in total 50 vocal fold vibratory parameters representing different features of the GAW were computed. Statistical analyses using SPSS Statistics, version 21, was performed. 15 Parameters showing strong mathematical dependencies with other parameters were excluded from the main analysis but are given in the Supporting Information. Data analysis revealed clear influence of spatial resolution on GAW parameters. Fundamental period measures and period perturbation measures were the least affected. Amplitude perturbation measures and mechanical measures were most strongly influenced. Most glottal dynamic characteristics and symmetry measures deviated significantly. Most energy perturbation measures changed significantly in males but were mostly unaffected in females. In females 18 of 35 remaining parameters (51%) and in males 22 parameters (63%) changed significantly between spatial resolutions. This work represents the first step in studying the impact of video resolution on quantitative HSV parameters. Clear influences of spatial camera resolution on computed parameters were found. The study results suggest avoiding the use of the most strongly affected parameters. Further, the use of cameras with high resolution is recommended to analyze GAW measures in HSV data.

Collapse

Powell ME, Deliyski DD, Zeitels SM, Burns JA, Hillman RE, Gerlach TT, Mehta DD. Efficacy of Videostroboscopy and High-Speed Videoendoscopy to Obtain Functional Outcomes From Perioperative Ratings in Patients With Vocal Fold Mass Lesions. J Voice 2019;34:769-782. [PMID: 31005449 DOI: 10.1016/j.jvoice.2019.03.012] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2018] [Revised: 03/20/2019] [Accepted: 03/21/2019] [Indexed: 11/30/2022]

Abstract

OBJECTIVES

A major limitation of comparing the efficacy of videostroboscopy (VS) and high-speed videoendoscopy (HSV) is the lack of an objective reference by which to compare the functional assessment ratings of the two techniques. For patients with vocal fold mass lesions, intraoperative measures of lesion size and depth may serve as this objective reference. This study compared the relationships between the pre- to postoperative change in VS and HSV visual-perceptual ratings to intraoperative measures of lesion size and depth.

DESIGN

Prospective visual-perceptual study with intraoperative measures of lesion size and depth.

METHODS

VS and HSV samples were obtained preoperatively and postoperatively from 28 patients with vocal fold lesions and from 17 vocally healthy controls. Two experienced clinicians rated amplitude, mucosal wave, vertical phase difference, left-right phase asymmetry, and vocal fold edge on a visual-analog scale using both imaging techniques. The change in perioperative ratings from VS and HSV was compared between groups and correlated to intraoperative measures of lesion size and depth.

RESULTS

HSV was as reliable as VS for ratings of amplitude and edge, and substantially more reliable for ratings of mucosal wave and left-right phase asymmetry. Both VS and HSV had mild-moderate correlations between change in perioperative ratings and intraoperative measures of lesion area. Change in function could be obtained in more patients and for more parameters using HSV than VS. Group differences were noted for postoperative ratings of amplitude and edge; however, these differences were within one level of the visual-perceptual rating scale. The presence of asynchronicity in VS recordings renders vibratory features either uninterpretable or potentially distorted and thus should not be rated.

CONCLUSIONS

Amplitude and edge are robust vibratory measures for perioperative functional assessment, regardless of imaging modality. HSV is indicated for evaluation of subepithelial lesions or if asynchronicity is present in the VS image sequence.

Collapse

Caffier PP, Nawka T, Ibrahim-Nasr A, Thomas B, Müller H, Ko SR, Song W, Gross M, Weikert S. Development of three-dimensional laryngostroboscopy for office-based laryngeal diagnostics and phonosurgical therapy. Laryngoscope 2018;128:2823-2831. [PMID: 30328614 DOI: 10.1002/lary.27260] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2018] [Revised: 03/19/2018] [Accepted: 04/06/2018] [Indexed: 11/10/2022]

Abstract

OBJECTIVE

To develop a three-dimensional (3D) laryngostroboscopic examination unit, compare the optic playback quality in relation to established 2D procedures, and report the first case series using 3D rigid laryngostroboscopy for diagnosis and management of laryngotracheal diseases.

STUDY DESIGN

Laboratory study, prospective case series.

METHODS

The optical efficacy of newly developed rigid 3D endoscopes was examined in a laboratory setting. Diagnostic suitability was investigated in 100 subjects (50 male, 50 female) receiving 2D high-definition (HD) and 3D laryngostroboscopy. Two of the subjects subsequently underwent 3D-assisted office-based transoral phonosurgery under local anesthesia. Main outcome measures were comparative visualization of laryngotracheal pathologies, influence on preoperative planning, and evaluation of prognostic factors for the outcome of phonosurgical interventions.

RESULTS

Three-dimensional endostroboscopic procedures were effectively optimized to establish an examination protocol for all-day clinical use. Office-based 3D laryngostroboscopy was successfully applied in subjects with normal anatomy (n = 10) and various laryngotracheal findings (n = 90). In comparison to 2D HD videolaryngostroboscopy, the 3D view offered enhanced visualization of laryngotracheal anatomy, with qualitatively improved depth perception and spatial representation. In organic pathologies, this resulted in a more precise indication of phonosurgical procedures, increased accuracy in surgical planning, facilitated office-based endoscopic surgery, and better evaluation of prognostic factors for the outcome of phonosurgical interventions.

CONCLUSION

Three-dimensional laryngostroboscopy proved to increase the understanding of functional and surgical anatomy. Its application has enormous potential for improving the diagnostic value of laryngoscopy, surgical precision in laryngotracheal interventions, tissue preservation, and methods of teaching.

LEVEL OF EVIDENCE

NA Laryngoscope, 128:2823-2831, 2018.

Collapse

Semmler M, Döllinger M, Patel RR, Ziethe A, Schützenberger A. Clinical relevance of endoscopic three-dimensional imaging for quantitative assessment of phonation. Laryngoscope 2018. [DOI: 10.1002/lary.27165] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Birk V, Kniesburges S, Semmler M, Berry DA, Bohr C, Döllinger M, Schützenberger A. Influence of glottal closure on the phonatory process in ex vivo porcine larynges. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2017;142:2197. [PMID: 29092569 PMCID: PMC6909995 DOI: 10.1121/1.5007952] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/11/2023]

Tokuda IT, Shimamura R. Effect of level difference between left and right vocal folds on phonation: Physical experiment and theoretical study. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2017;142:482. [PMID: 28863607 DOI: 10.1121/1.4996105] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]

Samlan RA, Story BH. Influence of Left-Right Asymmetries on Voice Quality in Simulated Paramedian Vocal Fold Paralysis. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2017;60:306-321. [PMID: 28199505 DOI: 10.1044/2016_jslhr-s-16-0076] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/24/2016] [Accepted: 05/31/2016] [Indexed: 05/25/2023]

Investigation of the Immediate Effects of Humming on Vocal Fold Vibration Irregularity Using Electroglottography and High-speed Laryngoscopy in Patients With Organic Voice Disorders. J Voice 2017;31:48-56. [DOI: 10.1016/j.jvoice.2016.03.010] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2016] [Accepted: 03/17/2016] [Indexed: 11/22/2022]

Evaluation of an asymmetric anterior glottic web in an excised canine larynx model. Eur Arch Otorhinolaryngol 2016;274:1609-1615. [PMID: 27826648 DOI: 10.1007/s00405-016-4364-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2016] [Accepted: 10/26/2016] [Indexed: 10/20/2022]

Hill AK, Cárdenas RA, Wheatley JR, Welling LLM, Burriss RP, Claes P, Apicella CL, McDaniel MA, Little AC, Shriver MD, Puts DA. Are there vocal cues to human developmental stability? Relationships between facial fluctuating asymmetry and voice attractiveness. EVOL HUM BEHAV 2016;38:249-258. [PMID: 34629843 DOI: 10.1016/j.evolhumbehav.2016.10.008] [Citation(s) in RCA: 49] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]

Quantitative Analysis of Vocal Fold Vibration in Vocal Fold Paralysis With the Use of High-speed Digital Imaging. J Voice 2016;30:766.e13-766.e22. [DOI: 10.1016/j.jvoice.2015.10.015] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2015] [Accepted: 10/22/2015] [Indexed: 11/21/2022]

Yamauchi A, Yokonishi H, Imagawa H, Sakakibara KI, Nito T, Tayama N, Yamasoba T. Visualization and Estimation of Vibratory Disturbance in Vocal Fold Scar Using High-Speed Digital Imaging. J Voice 2016;30:493-500. [DOI: 10.1016/j.jvoice.2015.07.003] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2015] [Accepted: 07/08/2015] [Indexed: 11/17/2022]

Semmler M, Kniesburges S, Birk V, Ziethe A, Patel R, Dollinger M. 3D Reconstruction of Human Laryngeal Dynamics Based on Endoscopic High-Speed Recordings. IEEE TRANSACTIONS ON MEDICAL IMAGING 2016;35:1615-1624. [PMID: 26829782 DOI: 10.1109/tmi.2016.2521419] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]

Svec JG, Sram F, Schutte HK. Videokymography in Voice Disorders: What to Look For? Ann Otol Rhinol Laryngol 2016;116:172-80. [PMID: 17419520 DOI: 10.1177/000348940711600303] [Citation(s) in RCA: 107] [Impact Index Per Article: 13.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Efremova KO, Frey R, Volodin IA, Fritsch G, Soldatova NV, Volodina EV. The postnatal ontogeny of the sexually dimorphic vocal apparatus in goitred gazelles (Gazella subgutturosa). J Morphol 2016;277:826-44. [PMID: 26997608 DOI: 10.1002/jmor.20538] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2015] [Revised: 02/24/2016] [Accepted: 02/28/2016] [Indexed: 11/11/2022]

Abstract

This study quantitatively documents the progressive development of sexual dimorphism of the vocal organs along the ontogeny of the goitred gazelle (Gazella subgutturosa). The major, male-specific secondary sexual features, of vocal anatomy in goitred gazelle are an enlarged larynx and a marked laryngeal descent. These features appear to have evolved by sexual selection and may serve as a model for similar events in male humans. Sexual dimorphism of larynx size and larynx position in adult goitred gazelles is more pronounced than in humans, whereas the vocal anatomy of neonate goitred gazelles does not differ between sexes. This study examines the vocal anatomy of 19 (11 male, 8 female) goitred gazelle specimens across three age-classes, that is, neonates, subadults and mature adults. The postnatal ontogenetic development of the vocal organs up to their respective end states takes considerably longer in males than in females. Both sexes share the same features of vocal morphology but differences emerge in the course of ontogeny, ultimately resulting in the pronounced sexual dimorphism of the vocal apparatus in adults. The main differences comprise larynx size, vocal fold length, vocal tract length, and mobility of the larynx. The resilience of the thyrohyoid ligament and the pharynx, including the soft palate, and the length changes during contraction and relaxation of the extrinsic laryngeal muscles play a decisive role in the mobility of the larynx in both sexes but to substantially different degrees in adult females and males. Goitred gazelles are born with an undescended larynx and, therefore, larynx descent has to develop in the course of ontogeny. This might result from a trade-off between natural selection and sexual selection requiring a temporal separation of different laryngeal functions at birth and shortly after from those later in life. J. Morphol. 277:826-844, 2016. © 2016 Wiley Periodicals, Inc.

Collapse

Unger J, Schuster M, Hecker DJ, Schick B, Lohscheller J. A generalized procedure for analyzing sustained and dynamic vocal fold vibrations from laryngeal high-speed videos using phonovibrograms. Artif Intell Med 2015;66:15-28. [PMID: 26597002 DOI: 10.1016/j.artmed.2015.10.002] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2015] [Revised: 09/28/2015] [Accepted: 10/20/2015] [Indexed: 12/01/2022]

Abstract

OBJECTIVE

This work presents a computer-based approach to analyze the two-dimensional vocal fold dynamics of endoscopic high-speed videos, and constitutes an extension and generalization of a previously proposed wavelet-based procedure. While most approaches aim for analyzing sustained phonation conditions, the proposed method allows for a clinically adequate analysis of both dynamic as well as sustained phonation paradigms.

MATERIALS AND METHODS

The analysis procedure is based on a spatio-temporal visualization technique, the phonovibrogram, that facilitates the documentation of the visible laryngeal dynamics. From the phonovibrogram, a low-dimensional set of features is computed using a principle component analysis strategy that quantifies the type of vibration patterns, irregularity, lateral symmetry and synchronicity, as a function of time. Two different test bench data sets are used to validate the approach: (I) 150 healthy and pathologic subjects examined during sustained phonation. (II) 20 healthy and pathologic subjects that were examined twice: during sustained phonation and a glissando from a low to a higher fundamental frequency. In order to assess the discriminative power of the extracted features, a Support Vector Machine is trained to distinguish between physiologic and pathologic vibrations. The results for sustained phonation sequences are compared to the previous approach. Finally, the classification performance of the stationary analyzing procedure is compared to the transient analysis of the glissando maneuver.

RESULTS

For the first test bench the proposed procedure outperformed the previous approach (proposed feature set: accuracy: 91.3%, sensitivity: 80%, specificity: 97%, previous approach: accuracy: 89.3%, sensitivity: 76%, specificity: 96%). Comparing the classification performance of the second test bench further corroborates that analyzing transient paradigms provides clear additional diagnostic value (glissando maneuver: accuracy: 90%, sensitivity: 100%, specificity: 80%, sustained phonation: accuracy: 75%, sensitivity: 80%, specificity: 70%).

CONCLUSIONS

The incorporation of parameters describing the temporal evolvement of vocal fold vibration clearly improves the automatic identification of pathologic vibration patterns. Furthermore, incorporating a dynamic phonation paradigm provides additional valuable information about the underlying laryngeal dynamics that cannot be derived from sustained conditions. The proposed generalized approach provides a better overall classification performance than the previous approach, and hence constitutes a new advantageous tool for an improved clinical diagnosis of voice disorders.

Collapse

Lucero JC, Schoentgen J, Haas J, Luizard P, Pelorson X. Self-entrainment of the right and left vocal fold oscillators. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2015;137:2036-46. [PMID: 25920854 DOI: 10.1121/1.4916601] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/09/2023]

Unger J, Lohscheller J, Reiter M, Eder K, Betz CS, Schuster M. A Noninvasive Procedure for Early-Stage Discrimination of Malignant and Precancerous Vocal Fold Lesions Based on Laryngeal Dynamics Analysis. Cancer Res 2014;75:31-9. [DOI: 10.1158/0008-5472.can-14-1458] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Automatic recognizing of vocal fold disorders from glottis images. Proc Inst Mech Eng H 2014;228:952-61. [DOI: 10.1177/0954411914551851] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]

Chhetri DK, Neubauer J, Sofer E. Influence of asymmetric recurrent laryngeal nerve stimulation on vibration, acoustics, and aerodynamics. Laryngoscope 2014;124:2544-50. [PMID: 24913182 DOI: 10.1002/lary.24774] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2013] [Revised: 04/22/2014] [Accepted: 05/20/2014] [Indexed: 11/06/2022]

Unger J, Hecker DJ, Kunduk M, Schuster M, Schick B, Lohscheller J. Quantifying spatiotemporal properties of vocal fold dynamics based on a multiscale analysis of phonovibrograms. IEEE Trans Biomed Eng 2014;61:2422-33. [PMID: 24771562 DOI: 10.1109/tbme.2014.2318774] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]

Moisik SR, Esling JH. Modeling the biomechanical influence of epilaryngeal stricture on the vocal folds: a low-dimensional model of vocal-ventricular fold coupling. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2014;57:S687-S704. [PMID: 24687007 DOI: 10.1044/2014_jslhr-s-12-0279] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]

Kuo CFJ, Wang HW, Hsiao SW, Peng KC, Chou YL, Lai CY, Hsu CTM. Development of laryngeal video stroboscope with laser marking module for dynamic glottis measurement. Comput Med Imaging Graph 2014;38:34-41. [DOI: 10.1016/j.compmedimag.2013.10.004] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2013] [Revised: 09/05/2013] [Accepted: 10/16/2013] [Indexed: 10/26/2022]

Kuo CFJ, Chu YH, Wang PC, Lai CY, Chu WL, Leu YS, Wang HW. Using image processing technology and mathematical algorithm in the automatic selection of vocal cord opening and closing images from the larynx endoscopy video. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2013;112:455-465. [PMID: 24070546 DOI: 10.1016/j.cmpb.2013.08.005] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/23/2012] [Revised: 08/06/2013] [Accepted: 08/08/2013] [Indexed: 06/02/2023]

Chhetri DK, Neubauer J, Bergeron JL, Sofer E, Peng KA, Jamal N. Effects of asymmetric superior laryngeal nerve stimulation on glottic posture, acoustics, vibration. Laryngoscope 2013;123:3110-6. [PMID: 23712542 DOI: 10.1002/lary.24209] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2013] [Revised: 04/26/2013] [Accepted: 04/26/2013] [Indexed: 11/12/2022]

Zorrilla AM, Zapirain BG, Izquierdo AP. Computer aided tool for diagnosis of ENT pathologies using digital signal processing of speech and stroboscopic images. SPRINGERPLUS 2013;1:64. [PMID: 23483585 PMCID: PMC3586405 DOI: 10.1186/2193-1801-1-64] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/01/2012] [Accepted: 12/08/2012] [Indexed: 11/19/2022]

Lohscheller J, Svec JG, Döllinger M. Vocal fold vibration amplitude, open quotient, speed quotient and their variability along glottal length: kymographic data from normal subjects. LOGOP PHONIATR VOCO 2012;38:182-92. [PMID: 23173880 DOI: 10.3109/14015439.2012.731083] [Citation(s) in RCA: 62] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]

Elidan G, Elidan J. Vocal Folds Analysis Using Global Energy Tracking. J Voice 2012;26:760-8. [DOI: 10.1016/j.jvoice.2011.07.010] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2011] [Accepted: 07/18/2011] [Indexed: 10/14/2022]

Analysis of longitudinal phase differences in vocal-fold vibration using synchronous high-speed videoendoscopy and electroglottography. J Voice 2012;26:816.e13-20. [PMID: 23059188 DOI: 10.1016/j.jvoice.2012.04.009] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2011] [Accepted: 04/26/2012] [Indexed: 11/23/2022]

High-speed digital imaging of the larynx: recent advances. Curr Opin Otolaryngol Head Neck Surg 2012;20:466-71. [PMID: 23000735 DOI: 10.1097/moo.0b013e328359840d] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]

Bonilha HS, Deliyski DD, Whiteside JP, Gerlach TT. Vocal fold phase asymmetries in patients with voice disorders: a study across visualization techniques. AMERICAN JOURNAL OF SPEECH-LANGUAGE PATHOLOGY 2012;21:3-15. [PMID: 22049403 PMCID: PMC7587608 DOI: 10.1044/1058-0360(2011/09-0086)] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]

Physical simulation of laryngeal disorders using a multiple-mass vocal fold model. Biomed Signal Process Control 2012. [DOI: 10.1016/j.bspc.2011.04.002] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]

Chodara AM, Krausert CR, Jiang JJ. Kymographic characterization of vibration in human vocal folds with nodules and polyps. Laryngoscope 2011;122:58-65. [PMID: 21898450 DOI: 10.1002/lary.22324] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2011] [Accepted: 07/22/2011] [Indexed: 11/10/2022]

Kniesburges S, Thomson SL, Barney A, Triep M, Sidlof P, Horáčcek J, Brücker C, Becker S. In vitro experimental investigation of voice production. Curr Bioinform 2011. [PMID: 23181007 DOI: 10.2174/157489311796904637] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]

Krausert CR, Ying D, Zhang Y, Jiang JJ. Quantitative study of vibrational symmetry of injured vocal folds via digital kymography in excised canine larynges. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2011;54:1022-1038. [PMID: 21173386 PMCID: PMC3187921 DOI: 10.1044/1092-4388(2010/10-0105)] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/30/2023]

Dollinger M, Berry DA, Huttner B, Bohr C. Assessment of local vocal fold deformation characteristics in an in vitro static tensile test. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2011;130:977-985. [PMID: 21877810 PMCID: PMC3190661 DOI: 10.1121/1.3605671] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/12/2010] [Revised: 06/07/2011] [Accepted: 06/08/2011] [Indexed: 05/31/2023]