1
|
Donhauser J, Tur B, Döllinger M. Neural network-based estimation of biomechanical vocal fold parameters. Front Physiol 2024; 15:1282574. [PMID: 38449783 PMCID: PMC10916882 DOI: 10.3389/fphys.2024.1282574] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2023] [Accepted: 01/09/2024] [Indexed: 03/08/2024] Open
Abstract
Vocal fold (VF) vibrations are the primary source of human phonation. High-speed video (HSV) endoscopy enables the computation of descriptive VF parameters for assessment of physiological properties of laryngeal dynamics, i.e., the vibration of the VFs. However, underlying biomechanical factors responsible for physiological and disordered VF vibrations cannot be accessed. In contrast, physically based numerical VF models reveal insights into the organ's oscillations, which remain inaccessible through endoscopy. To estimate biomechanical properties, previous research has fitted subglottal pressure-driven mass-spring-damper systems, as inverse problem to the HSV-recorded VF trajectories, by global optimization of the numerical model. A neural network trained on the numerical model may be used as a substitute for computationally expensive optimization, yielding a fast evaluating surrogate of the biomechanical inverse problem. This paper proposes a convolutional recurrent neural network (CRNN)-based architecture trained on regression of a physiological-based biomechanical six-mass model (6 MM). To compare with previous research, the underlying biomechanical factor "subglottal pressure" prediction was tested against 288 HSV ex vivo porcine recordings. The contributions of this work are two-fold: first, the presented CRNN with the 6 MM handles multiple trajectories along the VFs, which allows for investigations on local changes in VF characteristics. Second, the network was trained to reproduce further important biomechanical model parameters like VF mass and stiffness on synthetic data. Unlike in a previous work, the network in this study is therefore an entire surrogate of the inverse problem, which allowed for explicit computation of the fitted model using our approach. The presented approach achieves a best-case mean absolute error (MAE) of 133 Pa (13.9%) in subglottal pressure prediction with 76.6% correlation on experimental data and a re-estimated fundamental frequency MAE of 15.9 Hz (9.9%). In-detail training analysis revealed subglottal pressure as the most learnable parameter. With the physiological-based model design and advances in fast parameter prediction, this work is a next step in biomechanical VF model fitting and the estimation of laryngeal kinematics.
Collapse
Affiliation(s)
- Jonas Donhauser
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Erlangen, Friedrich-Alexander University Erlangen-Nürnberg, Erlangen, Germany
| | | | | |
Collapse
|
2
|
Kirsch A, Gerstenberger C, Jakubaß B, Tschernitz M, Perkins JD, Groselj‐Strele A, Lanmüller H, Jarvis JC, Kniesburges S, Döllinger M, Gugatschka M. Bilateral Functional Electrical Stimulation for the Treatment of Presbyphonia in a Sheep Model. Laryngoscope 2024; 134:848-854. [PMID: 37597167 PMCID: PMC10952233 DOI: 10.1002/lary.30984] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2023] [Revised: 07/31/2023] [Accepted: 08/03/2023] [Indexed: 08/21/2023]
Abstract
OBJECTIVES The aim of the study was to increase muscle volume and improve phonation characteristics of the aged ovine larynx by functional electrical stimulation (FES) using a minimally invasive surgical procedure. METHODS Stimulation electrodes were placed bilaterally near the terminal adduction branch of the recurrent laryngeal nerves (RLN). The electrodes were connected to battery powered pulse generators implanted subcutaneously at the neck region. Training patterns were programmed by an external programmer using a bidirectional radio frequency link. Training sessions were repeated automatically by the implant every other day for 1 week followed by every day for 8 weeks in the awake animal. Another group of animals were used as sham, with electrodes positioned but not connected to an implant. Outcome parameters included gene expression analysis, histological assessment of muscle fiber size, functional analysis, and volumetric measurements based on three-dimensional reconstructions of the entire thyroarytenoid muscle (TAM). RESULTS Increase in minimal muscle fiber diameter and an improvement in vocal efficiency were observed following FES, compared with sham animals. CONCLUSION This is the first study to demonstrate beneficial effects in the TAM of FES at molecular, histological, and functional levels. FES of the terminal branches of the RLN reversed the effects of age-related changes and improved vocal efficiency. LEVEL OF EVIDENCE NA Laryngoscope, 134:848-854, 2024.
Collapse
Affiliation(s)
- Andrijana Kirsch
- Division of Phoniatrics, ENT University HospitalMedical University of GrazGrazAustria
| | - Claus Gerstenberger
- Division of Phoniatrics, ENT University HospitalMedical University of GrazGrazAustria
| | - Bernhard Jakubaß
- Division of Phoniatrics and Pediatric Audiology at the Department of Otorhinolaryngology, Head and Neck SurgeryUniversity Hospital Erlangen, Friedrich‐Alexander‐Universität Erlangen‐NürnbergErlangenGermany
| | - Magdalena Tschernitz
- Division of Phoniatrics, ENT University HospitalMedical University of GrazGrazAustria
| | | | - Andrea Groselj‐Strele
- Core Facility Computational Bioanalytics, Center for Medical ResearchMedical University of GrazGrazAustria
| | - Hermann Lanmüller
- Center of Medical Physics and Biomedical EngineeringMedical University of ViennaViennaAustria
| | - Jonathan C. Jarvis
- School of Sport and Exercise SciencesLiverpool John Moores UniversityLiverpoolUK
| | - Stefan Kniesburges
- Division of Phoniatrics and Pediatric Audiology at the Department of Otorhinolaryngology, Head and Neck SurgeryUniversity Hospital Erlangen, Friedrich‐Alexander‐Universität Erlangen‐NürnbergErlangenGermany
| | - Michael Döllinger
- Division of Phoniatrics and Pediatric Audiology at the Department of Otorhinolaryngology, Head and Neck SurgeryUniversity Hospital Erlangen, Friedrich‐Alexander‐Universität Erlangen‐NürnbergErlangenGermany
| | - Markus Gugatschka
- Division of Phoniatrics, ENT University HospitalMedical University of GrazGrazAustria
| |
Collapse
|
3
|
Bouhabel S, Park S, Kolosova K, Latifi N, Kost K, Li-Jessen NYK, Mongeau L. Functional Analysis of Injectable Substance Treatment on Surgically Injured Rabbit Vocal Folds. J Voice 2023; 37:829-839. [PMID: 34353684 PMCID: PMC8807745 DOI: 10.1016/j.jvoice.2021.06.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2019] [Revised: 05/28/2021] [Accepted: 06/02/2021] [Indexed: 02/04/2023]
Abstract
OBJECTIVES The objective of this study was to evaluate the efficacy of immediate injection treatments of dexamethasone, hyaluronic acid (HA)/gelatin (Ge) hydrogel and glycol-chitosan solution on the phonatory function of rabbit larynges at 42 days after surgical injury of the vocal folds, piloting a novel ex vivo phonatory functional analysis protocol. METHODS A modified microflap procedure was performed on the left vocal fold of 12 rabbits to induce an acute injury. Animals were randomized into one of four treatment groups with 0.1 mL injections of dexamethasone, HA/Ge hydrogel, glycol-chitosan or saline as control. The left mid vocal fold lamina propria was injected immediately following injury. The right vocal fold served as an uninjured control. Larynges were harvested at Day 42 after injection, then were subjected to airflow-bench evaluation. Acoustic, aerodynamic and laryngeal high-speed videoendoscopy (HSV) analyses were performed. HSV segments of the vibrating vocal folds were rated by three expert laryngologists. Six parameters related to vocal fold vibratory characteristics were evaluated on a Likert scale. RESULTS The fundamental frequency, one possible surrogate of vocal fold stiffness and scarring, was lower in the dexamethasone and HA/Ge hydrogel treatment groups compared to that of the saline control (411.52±11.63 Hz). The lowest fundamental frequency value was observed in the dexamethasone group (348.79±14.99 Hz). Expert visual ratings of the HSV segments indicated an overall positive outcome in the dexamethasone treatment group, though the impacts were below statistical significance. CONCLUSION Dexamethasone injections might be used as an adjunctive option for iatrogenic vocal fold scarring. An increased sample size, histological correlate, and experimental method improvements will be needed to confirm this finding. Results suggested a promising use of HSV and acoustic analysis techniques to identify and monitor post-surgical vocal fold repair and scarring, providing a useful tool for future studies of vocal fold scar treatments.
Collapse
Affiliation(s)
- Sarah Bouhabel
- Department of Otolaryngology - Head and Neck Surgery, McGill University, Montreal, Quebec, Canada.
| | - Scott Park
- Department of Mechanical Engineering, McGill University, Montreal, Quebec, Canada
| | - Ksenia Kolosova
- Department of Physics, McGill University, Montreal, Quebec, Canada
| | - Neda Latifi
- Department of Mechanical Engineering, McGill University, Montreal, Quebec, Canada
| | - Karen Kost
- Department of Otolaryngology - Head and Neck Surgery, McGill University, Montreal, Quebec, Canada
| | - Nicole Y K Li-Jessen
- Department of Otolaryngology - Head and Neck Surgery, McGill University, Montreal, Quebec, Canada; School of Communication Sciences and Disorders, McGill University, Montreal, Quebec, Canada
| | - Luc Mongeau
- Department of Otolaryngology - Head and Neck Surgery, McGill University, Montreal, Quebec, Canada; Department of Mechanical Engineering, McGill University, Montreal, Quebec, Canada
| |
Collapse
|
4
|
Tur B, Gühring L, Wendler O, Schlicht S, Drummer D, Kniesburges S. Effect of Ligament Fibers on Dynamics of Synthetic, Self-Oscillating Vocal Folds in a Biomimetic Larynx Model. Bioengineering (Basel) 2023; 10:1130. [PMID: 37892860 PMCID: PMC10604794 DOI: 10.3390/bioengineering10101130] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2023] [Revised: 09/13/2023] [Accepted: 09/25/2023] [Indexed: 10/29/2023] Open
Abstract
Synthetic silicone larynx models are essential for understanding the biomechanics of physiological and pathological vocal fold vibrations. The aim of this study is to investigate the effects of artificial ligament fibers on vocal fold vibrations in a synthetic larynx model, which is capable of replicating physiological laryngeal functions such as elongation, abduction, and adduction. A multi-layer silicone model with different mechanical properties for the musculus vocalis and the lamina propria consisting of ligament and mucosa was used. Ligament fibers of various diameters and break resistances were cast into the vocal folds and tested at different tension levels. An electromechanical setup was developed to mimic laryngeal physiology. The measurements included high-speed video recordings of vocal fold vibrations, subglottal pressure and acoustic. For the evaluation of the vibration characteristics, all measured values were evaluated and compared with parameters from ex and in vivo studies. The fundamental frequency of the synthetic larynx model was found to be approximately 200-520 Hz depending on integrated fiber types and tension levels. This range of the fundamental frequency corresponds to the reproduction of a female normal and singing voice range. The investigated voice parameters from vocal fold vibration, acoustics, and subglottal pressure were within normal value ranges from ex and in vivo studies. The integration of ligament fibers leads to an increase in the fundamental frequency with increasing airflow, while the tensioning of the ligament fibers remains constant. In addition, a tension increase in the fibers also generates a rise in the fundamental frequency delivering the physiological expectation of the dynamic behavior of vocal folds.
Collapse
Affiliation(s)
- Bogac Tur
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Erlangen, Medical School, Friedrich-Alexander-Universität Erlangen-Nürnberg, Waldstrasse 1, 91054 Erlangen, Germany
| | - Lucia Gühring
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Erlangen, Medical School, Friedrich-Alexander-Universität Erlangen-Nürnberg, Waldstrasse 1, 91054 Erlangen, Germany
| | - Olaf Wendler
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Erlangen, Medical School, Friedrich-Alexander-Universität Erlangen-Nürnberg, Waldstrasse 1, 91054 Erlangen, Germany
| | - Samuel Schlicht
- Institute of Polymer Technology, Friedrich-Alexander-Universität Erlangen-Nürnberg, Am Weichselgarten 10, 91058 Erlangen, Germany
| | - Dietmar Drummer
- Institute of Polymer Technology, Friedrich-Alexander-Universität Erlangen-Nürnberg, Am Weichselgarten 10, 91058 Erlangen, Germany
| | - Stefan Kniesburges
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Erlangen, Medical School, Friedrich-Alexander-Universität Erlangen-Nürnberg, Waldstrasse 1, 91054 Erlangen, Germany
| |
Collapse
|
5
|
Veltrup R, Kniesburges S, Semmler M. Influence of Perspective Distortion in Laryngoscopy. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2023; 66:3276-3289. [PMID: 37652062 DOI: 10.1044/2023_jslhr-23-00027] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/02/2023]
Abstract
OBJECTIVE An experiment with controllable boundaries was designed to assess the influence of the recording angle and distance on two-dimensional (2D) imaging in laryngoscopy and resulting 2D parameter calculation derived from the glottal area waveform (GAW). METHOD Two high-speed camera setups were used to synchronously record an oscillating synthetic vocal fold (VF) model, simulating a high-speed videoendoscopy. One camera recorded at variable lateral recording angles and a reference camera in superior perspective. This was performed at different physiological recording distances and for two oscillation modes (with/without contacting VFs). The GAW was derived from the segmented glottis, and two parameters each for the categories of symmetry, periodicity, and closure were calculated, as well as two derivative measures. The percentage difference between the variable and reference camera value pairs was calculated, and the angle and height dependencies were quantified using linear regression. RESULTS The visual perception of a laryngoscopy was found to be influenced by the lateral recording angle, which may lead to misinterpretation of VF symmetry among inexperienced observers. The strongest influence of recording angle was observed for symmetry parameters, the strongest being the Amplitude Symmetry Index with up to 2.6%/° (p < .05). A dependence on the recording distance was only found for the Maximum Area Declination Rate. CONCLUSIONS The recording angle in 2D laryngoscopy should be carefully considered during visual inspection of the VF dynamics. Most of the investigated objective parameters were unaffected by the examined perspective distortion. However, especially left-right symmetry measures should only be used under controlled boundary conditions to avoid misdiagnosis and misinterpretation. SUPPLEMENTAL MATERIAL https://doi.org/10.23641/asha.23961183.
Collapse
Affiliation(s)
- Reinhard Veltrup
- University Hospital Erlangen, Medical School, Division of Phoniatrics and Pediatric Audiology at the Department of Otorhinolaryngology Head and Neck Surgery, Friedrich-Alexander-University Erlangen-Nürnberg, Germany
| | - Stefan Kniesburges
- University Hospital Erlangen, Medical School, Division of Phoniatrics and Pediatric Audiology at the Department of Otorhinolaryngology Head and Neck Surgery, Friedrich-Alexander-University Erlangen-Nürnberg, Germany
| | - Marion Semmler
- University Hospital Erlangen, Medical School, Division of Phoniatrics and Pediatric Audiology at the Department of Otorhinolaryngology Head and Neck Surgery, Friedrich-Alexander-University Erlangen-Nürnberg, Germany
| |
Collapse
|
6
|
Jakubaß B, Peters G, Kniesburges S, Semmler M, Kirsch A, Gerstenberger C, Gugatschka M, Döllinger M. Effect of functional electric stimulation on phonation in an ex vivo aged ovine model. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2023; 153:2803. [PMID: 37154554 DOI: 10.1121/10.0017923] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/26/2022] [Accepted: 04/07/2023] [Indexed: 05/10/2023]
Abstract
With age, the atrophy of the thyroarytenoid muscle (TAM), and thus atrophy of the vocal folds, leads to decreased glottal closure, increased breathiness, and a loss in voice quality, which results in a reduced quality of life. A method to counteract the atrophy of the TAM is to induce hypertrophy in the muscle by functional electric stimulation (FES). In this study, phonation experiments were performed with ex vivo larynges of six stimulated and six unstimulated ten-year-old sheep to investigate the impact of FES on phonation. Electrodes were implanted bilaterally near the cricothyroid joint. FES treatment was provided for nine weeks before harvesting. The multimodal measurement setup simultaneously recorded high-speed video of the vocal fold oscillation, the supraglottal acoustic signal, and the subglottal pressure signal. Results of 683 measurements show a 65.6% lower glottal gap index, a 22.7% higher tissue flexibility (measured by the amplitude to length ratio), and a 473.7% higher coefficient of determination (R2) of the regression of subglottal and supraglottal cepstral peak prominence during phonation for the stimulated group. These results suggest that FES improves the phonatory process for aged larynges or presbyphonia.
Collapse
Affiliation(s)
- Bernhard Jakubaß
- Division of Phoniatrics and Pediatric Audiology at the Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Erlangen, Friedrich-Alexander-Universität Erlangen-Nürnberg, Waldstrasse 1, 91054 Erlangen, Germany
| | - Gregor Peters
- Division of Phoniatrics and Pediatric Audiology at the Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Erlangen, Friedrich-Alexander-Universität Erlangen-Nürnberg, Waldstrasse 1, 91054 Erlangen, Germany
| | - Stefan Kniesburges
- Division of Phoniatrics and Pediatric Audiology at the Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Erlangen, Friedrich-Alexander-Universität Erlangen-Nürnberg, Waldstrasse 1, 91054 Erlangen, Germany
| | - Marion Semmler
- Division of Phoniatrics and Pediatric Audiology at the Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Erlangen, Friedrich-Alexander-Universität Erlangen-Nürnberg, Waldstrasse 1, 91054 Erlangen, Germany
| | - Andrijana Kirsch
- Division of Phoniatrics, ENT University Hospital Graz, Medical University of Graz, Auenbruggerplatz 26, Graz 8036, Austria
| | - Claus Gerstenberger
- Division of Phoniatrics, ENT University Hospital Graz, Medical University of Graz, Auenbruggerplatz 26, Graz 8036, Austria
| | - Markus Gugatschka
- Division of Phoniatrics, ENT University Hospital Graz, Medical University of Graz, Auenbruggerplatz 26, Graz 8036, Austria
| | - Michael Döllinger
- Division of Phoniatrics and Pediatric Audiology at the Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Erlangen, Friedrich-Alexander-Universität Erlangen-Nürnberg, Waldstrasse 1, 91054 Erlangen, Germany
| |
Collapse
|
7
|
Scheible F, Lamprecht R, Schaan C, Veltrup R, Henningson JO, Semmler M, Sutor A. Behind the Complex Interplay of Phonation: Investigating Elasticity of Vocal Folds With Pipette Aspiration Technique During Ex Vivo Phonation Experiments. J Voice 2023:S0892-1997(23)00096-6. [PMID: 37005126 DOI: 10.1016/j.jvoice.2023.03.001] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2023] [Revised: 03/02/2023] [Accepted: 03/02/2023] [Indexed: 04/03/2023]
Abstract
OBJECTIVES The vibration of the vocal folds produces the primary sound for the human speech. The vibration depends mainly on the pressure, airflow of the lungs, and the material properties of the vocal folds. In order to change them, muscles in the larynx stretch the vocal folds. This interplay is rarely investigated, but can give insight in the complex process of speech production. Most material properties studies are damaging the tissue; therefore, a nondestructive one is desired. METHODS An ex vivo phonation experiment combined with the dynamic Pipette Aspiration Technique is used to investigate 10 porcine larynges, under manipulations of different adduction and elongation levels. For each manipulation, the near surface material properties of the vocal folds are measured as well as different phonation parameters like the subglottal pressure, glottal resistance, frequency, and stiffness. Thereby, a high-speed camera was used to record the vocal fold movement. RESULTS On most of the measured parameters, the manipulations do show an effect. Both manipulations lead to a higher phonation frequency and an increase of the stiffness of the tissue. Comparing both manipulations, the elongation results in higher elasticity values than the adduction. Different measurement parameters have been compared with each other and correlations could be found. Where the strongest correlation are found among the elasticity values of different frequencies. But it can also be seen that the elasticity values correlate with phonation parameters. CONCLUSION It was possible to produce a data set of 560 measurements in total. To our knowledge, this is the first time Pipette Aspiration Technique was combined with ex vivo phonation measurements for combined measurements. The amount of measurement data made it possible to carry out statistic investigations. The effect of the manipulations on material properties as well as on phonation parameters could be measured and different correlations could be found. The results lead to the hypothesis that the stretch does not have a huge effect on the material properties of the lamina propria, but more on the underlying muscle.
Collapse
|
8
|
Peters G, Jakubaß B, Weidenfeller K, Kniesburges S, Böhringer D, Wendler O, Mueller SK, Gostian AO, Berry DA, Döllinger M, Semmler M. Synthetic mucus for an ex vivo phonation setup: Creation, application, and effect on excised porcine larynges. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2022; 152:3245. [PMID: 36586828 PMCID: PMC9729017 DOI: 10.1121/10.0015364] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/07/2022] [Revised: 09/23/2022] [Accepted: 11/06/2022] [Indexed: 06/17/2023]
Abstract
Laryngeal mucus hydrates and lubricates the deformable tissue of the vocal folds and acts as a boundary layer with the airflow from the lungs. However, the effects of the mucus' viscoelasticity on phonation remain widely unknown and mucus has not yet been established in experimental procedures of voice research. In this study, four synthetic mucus samples were created on the basis of xanthan with focus on physiological frequency-dependent viscoelastic properties, which cover viscosities and elasticities over 2 orders of magnitude. An established ex vivo experimental setup was expanded by a reproducible and controllable application method of synthetic mucus. The application method and the suitability of the synthetic mucus samples were successfully verified by fluorescence evidence on the vocal folds even after oscillation experiments. Subsequently, the impact of mucus viscoelasticity on the oscillatory dynamics of the vocal folds, the subglottal pressure, and acoustic signal was investigated with 24 porcine larynges (2304 datasets). Despite the large differences of viscoelasticity, the phonatory characteristics remained stable with only minor statistically significant differences. Overall, this study increased the level of realism in the experimental setup for replication of the phonatory process enabling further research on pathological mucus and exploration of therapeutic options.
Collapse
Affiliation(s)
- Gregor Peters
- Division of Phoniatrics and Pediatric Audiology at the Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Erlangen, Medical School at Friedrich-Alexander-Universität Erlangen-Nürnberg, 91054 Erlangen, Germany
| | - Bernhard Jakubaß
- Division of Phoniatrics and Pediatric Audiology at the Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Erlangen, Medical School at Friedrich-Alexander-Universität Erlangen-Nürnberg, 91054 Erlangen, Germany
| | - Katrin Weidenfeller
- Division of Phoniatrics and Pediatric Audiology at the Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Erlangen, Medical School at Friedrich-Alexander-Universität Erlangen-Nürnberg, 91054 Erlangen, Germany
| | - Stefan Kniesburges
- Division of Phoniatrics and Pediatric Audiology at the Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Erlangen, Medical School at Friedrich-Alexander-Universität Erlangen-Nürnberg, 91054 Erlangen, Germany
| | - David Böhringer
- Biophysics Group, Department of Physics, Friedrich-Alexander-Universität Erlangen-Nürnberg, 91052 Erlangen, Germany
| | - Olaf Wendler
- Division of Phoniatrics and Pediatric Audiology at the Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Erlangen, Medical School at Friedrich-Alexander-Universität Erlangen-Nürnberg, 91054 Erlangen, Germany
| | - Sarina K Mueller
- Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Erlangen, Medical School at Friedrich-Alexander-Universität Erlangen-Nürnberg, 91054 Erlangen, Germany
| | - Antoniu-Oreste Gostian
- Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Erlangen, Medical School at Friedrich-Alexander-Universität Erlangen-Nürnberg, 91054 Erlangen, Germany
| | - David A Berry
- Department of Head and Neck Surgery, David Geffen School of Medicine at University of California Los Angeles, Los Angeles, California 90024, USA
| | - Michael Döllinger
- Division of Phoniatrics and Pediatric Audiology at the Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Erlangen, Medical School at Friedrich-Alexander-Universität Erlangen-Nürnberg, 91054 Erlangen, Germany
| | - Marion Semmler
- Division of Phoniatrics and Pediatric Audiology at the Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Erlangen, Medical School at Friedrich-Alexander-Universität Erlangen-Nürnberg, 91054 Erlangen, Germany
| |
Collapse
|
9
|
Sakthivel S, Prabhu V. Optimal Deep Learning-Based Vocal Fold Disorder Detection and Classification Model on High-Speed Video Endoscopy. JOURNAL OF HEALTHCARE ENGINEERING 2022; 2022:4248938. [PMID: 36353680 PMCID: PMC9640237 DOI: 10.1155/2022/4248938] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/23/2022] [Revised: 09/04/2022] [Accepted: 09/21/2022] [Indexed: 08/08/2023]
Abstract
The use of high-speed video-endoscopy (HSV) in the study of phonatory processes linked to speech needs the precise identification of vocal fold boundaries at the time of vibration. The HSV is a unique laryngeal imaging technology that captures intracycle vocal fold vibrations at a higher frame rate without the need for auditory inputs. The HSV is also effective in identifying the vibrational characteristics of the vocal folds with an increased temporal resolution during retained phonation and flowing speech. Clinically significant vocal fold vibratory characteristics in running speech can be retrieved by creating automated algorithms for extracting HSV-based vocal fold vibration data. The best deep learning-based diagnosis and categorization of vocal fold abnormalities is due to the usage of HSV (ODL-VFDDC). The suggested ODL-VFDDC technique starts with temporal segmentation and motion correction to identify vocalized regions from the HSV recording and gathers the position of movable vocal folds across frames. The attributes gathered are fed into the deep belief network (DBN) model. Furthermore, the agricultural fertility algorithm (AFA) is used to optimize the hyperparameter tuning of the DBN model, which improves classification results. In terms of vocal fold disorder classification, the testing results demonstrated that the ODL-VFDDC technique beats the other existing methodologies. The farmland fertility algorithm (FFA) is then used to accurately determine the glottal limits of vibrating vocal folds. The suggested method has successfully tracked the speech fold boundaries across frames with minimum processing cost and high resilience to picture noise. This method gives a way to look at how the vocal folds move during a connected speech that is completely done by itself.
Collapse
Affiliation(s)
- S. Sakthivel
- Department of Computer Science and Engineering, Vel Tech High Tech Dr. Rangarajan Dr. Sakunthala Engineering College, Avadi, Chennai, India
| | - V. Prabhu
- Department of Electronics and Communication Engineering, Vel Tech Multi Tech Dr. Rangarajan Dr. Sakunthala Engineering College, Chennai, India
| |
Collapse
|
10
|
Gracioso Martins AM, Biehl A, Sze D, Freytes DO. Bioreactors for Vocal Fold Tissue Engineering. TISSUE ENGINEERING. PART B, REVIEWS 2022; 28:182-205. [PMID: 33446061 PMCID: PMC8892964 DOI: 10.1089/ten.teb.2020.0285] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/03/2023]
Abstract
It is estimated that almost one-third of the United States population will be affected by a vocal fold (VF) disorder during their lifespan. Promising therapies to treat VF injury and scarring are mostly centered on VF tissue engineering strategies such as the injection of engineered biomaterials and cell therapy. VF tissue engineering, however, is a challenging field as the biomechanical properties, structure, and composition of the VF tissue change upon exposure to mechanical stimulation. As a result, the development of long-term VF treatment strategies relies on the characterization of engineered tissues under a controlled mechanical environment. In this review, we highlight the importance of bioreactors as a powerful tool for VF tissue engineering with a focus on the current state of the art of bioreactors designed to mimic phonation in vitro. We discuss the influence of the phonatory environment on the development, function, injury, and healing of the VF tissue and its importance for the development of efficient therapeutic strategies. A concise and comprehensive overview of bioreactor designs, principles, operating parameters, and scalability are presented. An in-depth analysis of VF bioreactor data to date reveals that mechanical stimulation significantly influences cell viability and the expression of proinflammatory and profibrotic genes in vitro. Although the precision and accuracy of bioreactors contribute to generating reliable results, diverse gene expression profiles across the literature suggest that future efforts should focus on the standardization of bioreactor parameters to enable direct comparisons between studies. Impact statement We present a comprehensive review of bioreactors for vocal fold (VF) tissue engineering with a focus on the influence of the phonatory environment on the development, function, injury, and healing of the VFs and the importance of mimicking phonation on engineered VF tissues in vitro. Furthermore, we put forward a strong argument for the continued development of bioreactors in this area with an emphasis on the standardization of bioreactor designs, principles, operating parameters, and oscillatory regimes to enable comparisons between studies.
Collapse
Affiliation(s)
- Ana M Gracioso Martins
- Joint Department of Biomedical Engineering, College of Engineering, North Carolina State University/University of North Carolina-Chapel Hill, Raleigh, North Carolina, USA.,Comparative Medicine Institute, North Carolina State University, Raleigh, North Carolina, USA
| | - Andreea Biehl
- Joint Department of Biomedical Engineering, College of Engineering, North Carolina State University/University of North Carolina-Chapel Hill, Raleigh, North Carolina, USA.,Comparative Medicine Institute, North Carolina State University, Raleigh, North Carolina, USA
| | - Daphne Sze
- Joint Department of Biomedical Engineering, College of Engineering, North Carolina State University/University of North Carolina-Chapel Hill, Raleigh, North Carolina, USA.,Comparative Medicine Institute, North Carolina State University, Raleigh, North Carolina, USA
| | - Donald O Freytes
- Joint Department of Biomedical Engineering, College of Engineering, North Carolina State University/University of North Carolina-Chapel Hill, Raleigh, North Carolina, USA.,Comparative Medicine Institute, North Carolina State University, Raleigh, North Carolina, USA
| |
Collapse
|
11
|
Kist AM, Gómez P, Dubrovskiy D, Schlegel P, Kunduk M, Echternach M, Patel R, Semmler M, Bohr C, Dürr S, Schützenberger A, Döllinger M. A Deep Learning Enhanced Novel Software Tool for Laryngeal Dynamics Analysis. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2021; 64:1889-1903. [PMID: 34000199 DOI: 10.1044/2021_jslhr-20-00498] [Citation(s) in RCA: 38] [Impact Index Per Article: 12.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Purpose High-speed videoendoscopy (HSV) is an emerging, but barely used, endoscopy technique in the clinic to assess and diagnose voice disorders because of the lack of dedicated software to analyze the data. HSV allows to quantify the vocal fold oscillations by segmenting the glottal area. This challenging task has been tackled by various studies; however, the proposed approaches are mostly limited and not suitable for daily clinical routine. Method We developed a user-friendly software in C# that allows the editing, motion correction, segmentation, and quantitative analysis of HSV data. We further provide pretrained deep neural networks for fully automatic glottis segmentation. Results We freely provide our software Glottis Analysis Tools (GAT). Using GAT, we provide a general threshold-based region growing platform that enables the user to analyze data from various sources, such as in vivo recordings, ex vivo recordings, and high-speed footage of artificial vocal folds. Additionally, especially for in vivo recordings, we provide three robust neural networks at various speed and quality settings to allow a fully automatic glottis segmentation needed for application by untrained personnel. GAT further evaluates video and audio data in parallel and is able to extract various features from the video data, among others the glottal area waveform, that is, the changing glottal area over time. In total, GAT provides 79 unique quantitative analysis parameters for video- and audio-based signals. Many of these parameters have already been shown to reflect voice disorders, highlighting the clinical importance and usefulness of the GAT software. Conclusion GAT is a unique tool to process HSV and audio data to determine quantitative, clinically relevant parameters for research, diagnosis, and treatment of laryngeal disorders. Supplemental Material https://doi.org/10.23641/asha.14575533.
Collapse
Affiliation(s)
- Andreas M Kist
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology-Head & Neck Surgery, University Hospital Erlangen, Germany
| | - Pablo Gómez
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology-Head & Neck Surgery, University Hospital Erlangen, Germany
| | - Denis Dubrovskiy
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology-Head & Neck Surgery, University Hospital Erlangen, Germany
| | - Patrick Schlegel
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology-Head & Neck Surgery, University Hospital Erlangen, Germany
| | - Melda Kunduk
- Department of Communication Sciences and Disorders, Louisiana State University, Baton Rouge
| | - Matthias Echternach
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Munich University Hospital (LMU), Germany
| | - Rita Patel
- Department of Speech, Language and Hearing Sciences, College of Arts and Sciences, Indiana University, Bloomington
| | - Marion Semmler
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology-Head & Neck Surgery, University Hospital Erlangen, Germany
| | - Christopher Bohr
- Klinik und Poliklinik für Hals-Nasen-Ohren-Heilkunde Universitätsklinikum Regensburg, Germany
| | - Stephan Dürr
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology-Head & Neck Surgery, University Hospital Erlangen, Germany
| | - Anne Schützenberger
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology-Head & Neck Surgery, University Hospital Erlangen, Germany
| | - Michael Döllinger
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology-Head & Neck Surgery, University Hospital Erlangen, Germany
| |
Collapse
|
12
|
Falk S, Kniesburges S, Schoder S, Jakubaß B, Maurerlehner P, Echternach M, Kaltenbacher M, Döllinger M. 3D-FV-FE Aeroacoustic Larynx Model for Investigation of Functional Based Voice Disorders. Front Physiol 2021; 12:616985. [PMID: 33762964 PMCID: PMC7982522 DOI: 10.3389/fphys.2021.616985] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2020] [Accepted: 02/09/2021] [Indexed: 12/02/2022] Open
Abstract
For the clinical analysis of underlying mechanisms of voice disorders, we developed a numerical aeroacoustic larynx model, called simVoice, that mimics commonly observed functional laryngeal disorders as glottal insufficiency and vibrational left-right asymmetries. The model is a combination of the Finite Volume (FV) CFD solver Star-CCM+ and the Finite Element (FE) aeroacoustic solver CFS++. simVoice models turbulence using Large Eddy Simulations (LES) and the acoustic wave propagation with the perturbed convective wave equation (PCWE). Its geometry corresponds to a simplified larynx and a vocal tract model representing the vowel /a/. The oscillations of the vocal folds are externally driven. In total, 10 configurations with different degrees of functional-based disorders were simulated and analyzed. The energy transfer between the glottal airflow and the vocal folds decreases with an increasing glottal insufficiency and potentially reflects the higher effort during speech for patients being concerned. This loss of energy transfer may also have an essential influence on the quality of the sound signal as expressed by decreasing sound pressure level (SPL), Cepstral Peak Prominence (CPP), and Vocal Efficiency (VE). Asymmetry in the vocal fold oscillations also reduces the quality of the sound signal. However, simVoice confirmed previous clinical and experimental observations that a high level of glottal insufficiency worsens the acoustic signal quality more than oscillatory left-right asymmetry. Both symptoms in combination will further reduce the quality of the sound signal. In summary, simVoice allows for detailed analysis of the origins of disordered voice production and hence fosters the further understanding of laryngeal physiology, including occurring dependencies. A current walltime of 10 h/cycle is, with a prospective increase in computing power, auspicious for a future clinical use of simVoice.
Collapse
Affiliation(s)
- Sebastian Falk
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Head & Neck Surgery, Friedrich-Alexander-University Erlangen-Nürnberg, Erlangen, Germany
| | - Stefan Kniesburges
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Head & Neck Surgery, Friedrich-Alexander-University Erlangen-Nürnberg, Erlangen, Germany
| | - Stefan Schoder
- Institute of Fundamentals and Theory in Electrical Engineering, Division Vibro- and Aeroacoustics, Graz University of Technology, Graz, Austria
| | - Bernhard Jakubaß
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Head & Neck Surgery, Friedrich-Alexander-University Erlangen-Nürnberg, Erlangen, Germany
| | - Paul Maurerlehner
- Institute of Fundamentals and Theory in Electrical Engineering, Division Vibro- and Aeroacoustics, Graz University of Technology, Graz, Austria
| | - Matthias Echternach
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Munich University Hospital (LMU), Munich, Germany
| | - Manfred Kaltenbacher
- Institute of Fundamentals and Theory in Electrical Engineering, Division Vibro- and Aeroacoustics, Graz University of Technology, Graz, Austria
| | - Michael Döllinger
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Head & Neck Surgery, Friedrich-Alexander-University Erlangen-Nürnberg, Erlangen, Germany
| |
Collapse
|
13
|
Semmler M, Berry DA, Schützenberger A, Döllinger M. Fluid-structure-acoustic interactions in an ex vivo porcine phonation model. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2021; 149:1657. [PMID: 33765793 PMCID: PMC7952141 DOI: 10.1121/10.0003602] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/28/2020] [Revised: 01/29/2021] [Accepted: 02/07/2021] [Indexed: 05/02/2023]
Abstract
In the clinic, many diagnostic and therapeutic procedures focus on the oscillation patterns of the vocal folds (VF). Dynamic characteristics of the VFs, such as symmetry, periodicity, and full glottal closure, are considered essential features for healthy phonation. However, the relevance of these individual factors in the complex interaction between the airflow, laryngeal structures, and the resulting acoustics has not yet been quantified. Sustained phonation was induced in nine excised porcine larynges without vocal tract (supraglottal structures had been removed above the ventricular folds). The multimodal setup was designed to simultaneously control and monitor key aspects of phonation in the three essential parts of the larynx. More specifically, measurements will comprise (1) the subglottal pressure signal, (2) high-speed recordings in the glottal plane, and (3) the acoustic signal in the supraglottal region. The automated setup regulates glottal airflow, asymmetric arytenoid adduction, and the pre-phonatory glottal gap. Statistical analysis revealed a beneficial influence of VF periodicity and glottal closure on the signal quality of the subglottal pressure and the supraglottal acoustics, whereas VF symmetry only had a negligible influence. Strong correlations were found between the subglottal and supraglottal signal quality, with significant improvement of the acoustic quality for high levels of periodicity and glottal closure.
Collapse
Affiliation(s)
- Marion Semmler
- Division of Phoniatrics and Pediatric Audiology at the Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Erlangen, Medical School at Friedrich-Alexander-Universität Erlangen-Nürnberg, Waldstrasse 1, 91054 Erlangen, Germany
| | - David A Berry
- Laryngeal Dynamics Laboratory, Department of Head and Neck Surgery, David Geffen School of Medicine, UCLA, Los Angeles, California 90024, USA
| | - Anne Schützenberger
- Division of Phoniatrics and Pediatric Audiology at the Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Erlangen, Medical School at Friedrich-Alexander-Universität Erlangen-Nürnberg, Waldstrasse 1, 91054 Erlangen, Germany
| | - Michael Döllinger
- Division of Phoniatrics and Pediatric Audiology at the Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Erlangen, Medical School at Friedrich-Alexander-Universität Erlangen-Nürnberg, Waldstrasse 1, 91054 Erlangen, Germany
| |
Collapse
|
14
|
Schlegel P, Kniesburges S, Dürr S, Schützenberger A, Döllinger M. Machine learning based identification of relevant parameters for functional voice disorders derived from endoscopic high-speed recordings. Sci Rep 2020; 10:10517. [PMID: 32601277 PMCID: PMC7324600 DOI: 10.1038/s41598-020-66405-y] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2020] [Accepted: 05/20/2020] [Indexed: 11/13/2022] Open
Abstract
In voice research and clinical assessment, many objective parameters are in use. However, there is no commonly used set of parameters that reflect certain voice disorders, such as functional dysphonia (FD); i.e. disorders with no visible anatomical changes. Hence, 358 high-speed videoendoscopy (HSV) recordings (159 normal females (NF), 101 FD females (FDF), 66 normal males (NM), 32 FD males (FDM)) were analyzed. We investigated 91 quantitative HSV parameters towards their significance. First, 25 highly correlated parameters were discarded. Second, further 54 parameters were discarded by using a LogitBoost decision stumps approach. This yielded a subset of 12 parameters sufficient to reflect functional dysphonia. These parameters separated groups NF vs. FDF and NM vs. FDM with fair accuracy of 0.745 or 0.768, respectively. Parameters solely computed from the changing glottal area waveform (1D-function called GAW) between the vocal folds were less important than parameters describing the oscillation characteristics along the vocal folds (2D-function called Phonovibrogram). Regularity of GAW phases and peak shape, harmonic structure and Phonovibrogram-based vocal fold open and closing angles were mainly important. This study showed the high degree of redundancy of HSV-voice-parameters but also affirms the need of multidimensional based assessment of clinical data.
Collapse
Affiliation(s)
- Patrick Schlegel
- Department of Otorhinolaryngology, Division of Phoniatrics and Pediatric Audiology, University Hospital Erlangen, Friedrich-Alexander-University Erlangen-Nürnberg, Erlangen, Germany.
| | - Stefan Kniesburges
- Department of Otorhinolaryngology, Division of Phoniatrics and Pediatric Audiology, University Hospital Erlangen, Friedrich-Alexander-University Erlangen-Nürnberg, Erlangen, Germany
| | - Stephan Dürr
- Department of Otorhinolaryngology, Division of Phoniatrics and Pediatric Audiology, University Hospital Erlangen, Friedrich-Alexander-University Erlangen-Nürnberg, Erlangen, Germany
| | - Anne Schützenberger
- Department of Otorhinolaryngology, Division of Phoniatrics and Pediatric Audiology, University Hospital Erlangen, Friedrich-Alexander-University Erlangen-Nürnberg, Erlangen, Germany
| | - Michael Döllinger
- Department of Otorhinolaryngology, Division of Phoniatrics and Pediatric Audiology, University Hospital Erlangen, Friedrich-Alexander-University Erlangen-Nürnberg, Erlangen, Germany
| |
Collapse
|
15
|
Kniesburges S, Lodermeyer A, Semmler M, Schulz YK, Schützenberger A, Becker S. Analysis of the tonal sound generation during phonation with and without glottis closure. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2020; 147:3285. [PMID: 32486803 DOI: 10.1121/10.0001184] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/11/2023]
Abstract
The human phonation is characterized by periodical oscillations of the vocal folds with a complete glottis closure. In contrast, a glottal insufficiency (GI) represents an oscillation without glottis closure resulting in a breathy and weak voice. In this study, flow-induced oscillations of silicone vocal folds were modeled with and without glottis closure. The measurements comprised the flow pressure in the model, the generated sound, and the high-speed footage of the vocal fold motion. The analysis revealed that the sound signal for vocal fold oscillations without closure exhibits a lower number of harmonic tones with smaller amplitudes compared to the case with complete closure. The time series of the pressure signals showed small and periodical oscillations occurring less frequently and with smaller amplitude for the GI case. Accordingly, the pressure spectra include fewer harmonics similar to the sound. The analysis of the high-speed videos indicates that the strength of the pressure oscillations correlates with the divergence angle of the glottal duct during the closing motion. Physiologically, large divergence angles typically occur for a pronounced mucosal wave motion with glottis closure. Thus, the results indicate a correlation between the intensity of the mucosal wave and the development of harmonic tones.
Collapse
Affiliation(s)
- Stefan Kniesburges
- Division of Phoniatrics and Pediatric Audiology at the Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Erlangen, Medical School at Friedrich-Alexander University Erlangen-Nürnberg, Waldstrasse 1, 91054 Erlangen, Germany
| | - Alexander Lodermeyer
- Department of Process Machinery and Systems Engineering, Friedrich-Alexander University Erlangen-Nürnberg, Cauerstrasse 7, 91058 Erlangen, Germany
| | - Marion Semmler
- Division of Phoniatrics and Pediatric Audiology at the Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Erlangen, Medical School at Friedrich-Alexander University Erlangen-Nürnberg, Waldstrasse 1, 91054 Erlangen, Germany
| | - Yvonne Katrin Schulz
- Division of Phoniatrics and Pediatric Audiology at the Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Erlangen, Medical School at Friedrich-Alexander University Erlangen-Nürnberg, Waldstrasse 1, 91054 Erlangen, Germany
| | - Anne Schützenberger
- Division of Phoniatrics and Pediatric Audiology at the Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Erlangen, Medical School at Friedrich-Alexander University Erlangen-Nürnberg, Waldstrasse 1, 91054 Erlangen, Germany
| | - Stefan Becker
- Department of Process Machinery and Systems Engineering, Friedrich-Alexander University Erlangen-Nürnberg, Cauerstrasse 7, 91058 Erlangen, Germany
| |
Collapse
|
16
|
Impact of Subharmonic and Aperiodic Laryngeal Dynamics on the Phonatory Process Analyzed in Ex Vivo Rabbit Models. APPLIED SCIENCES-BASEL 2019; 9. [PMID: 33815832 PMCID: PMC8018220 DOI: 10.3390/app9091963] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
Normal voice is characterized by periodic oscillations of the vocal folds. On the other hand, disordered voice dynamics (e.g., subharmonic and aperiodic oscillations) are often associated with voice pathologies and dysphonia. Unfortunately, not all investigations may be conducted on human subjects; hence animal laryngeal studies have been performed for many years to better understand human phonation. The rabbit larynx has been shown to be a potential model of the human larynx. Despite this fact, only a few studies regarding the phonatory parameters of rabbit larynges have been performed. Further, to the best of our knowledge, no ex vivo study has systematically investigated phonatory parameters from high-speed, audio and subglottal pressure data with irregular oscillations. To remedy this, the present study analyzes experiments with sustained phonation in 11 ex vivo rabbit larynges for 51 conditions of disordered vocal fold dynamics. (1) The results of this study support previous findings on non-disordered data, that the stronger the glottal closure insufficiency is during phonation, the worse the phonatory characteristics are; (2) aperiodic oscillations showed worse phonatory results than subharmonic oscillations; (3) in the presence of both types of irregular vibrations, the voice quality (i.e., cepstral peak prominence) of the audio and subglottal signal greatly deteriorated compared to normal/periodic vibrations. In summary, our results suggest that the presence of both types of irregular vibration have a major impact on voice quality and should be considered along with glottal closure measures in medical diagnosis and treatment.
Collapse
|
17
|
Juvenile Ovine Ex Vivo Larynges: Phonatory, Histologic, and Micro CT Based Anatomic Analyses. BIOMED RESEARCH INTERNATIONAL 2019; 2019:6932047. [PMID: 30949506 PMCID: PMC6425324 DOI: 10.1155/2019/6932047] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/08/2018] [Accepted: 02/11/2019] [Indexed: 11/24/2022]
Abstract
It is well known that the phonatory process changes during the life span. However, detailed investigations on potential factors concerned are rare. To deal with this issue, we performed extended biomechanical, macro anatomical, and histological analyses of the contributing laryngeal structures in ex vivo juvenile sheep models. Altogether twelve juvenile sheep larynges were analyzed within the phonatory experiments. Three different elongation levels and 16 different flow levels were applied to achieve a large variety of phonatory conditions. Vocal fold dynamics and acoustical and subglottal signals could be analyzed for 431 experimental runs. Subsequently, for six juvenile larynges microcomputed tomography following virtual 3D reconstruction was performed. The remaining six juvenile larynges as well as six ex vivo larynges from old sheep were histologically and immunohistologically analyzed. Results for juveniles showed more consistent dynamical behavior compared to old sheep larynges due to vocal fold tissue alterations during the life span. The phonatory process in juvenile sheep seems to be more effective going along with a greater dynamic range. These findings are supported by the histologically detected higher amounts of elastin and hyaluronic acid in the lamina propria of the juvenile sheep. The 3D reconstructions of the thyro-arytenoid muscles (TAM) showed a symmetrical shape. Intraindividual volume and surface differences of the TAM were small and comparable to those of aged sheep. However, TAM dimensions were statistically significant smaller for juvenile larynges. Finally, topographical landmarks were introduced for later comparison with other individuals and species. This work resulted in detailed functional, immunohistological, and anatomical information that was not yet reported. This data will also provide reference information for therapeutic strategies regarding aging effects, e.g. laryngeal muscle treatment by functional electrical stimulation.
Collapse
|
18
|
Gómez P, Schützenberger A, Semmler M, Döllinger M. Laryngeal Pressure Estimation With a Recurrent Neural Network. IEEE JOURNAL OF TRANSLATIONAL ENGINEERING IN HEALTH AND MEDICINE 2018; 7:2000111. [PMID: 30680252 PMCID: PMC6331197 DOI: 10.1109/jtehm.2018.2886021] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/09/2018] [Revised: 10/24/2018] [Accepted: 11/30/2018] [Indexed: 11/24/2022]
Abstract
Quantifying the physical parameters of voice production is essential for understanding the process of phonation and can aid in voice research and diagnosis. As an alternative to invasive measurements, they can be estimated by formulating an inverse problem using a numerical forward model. However, high-fidelity numerical models are often computationally too expensive for this. This paper presents a novel approach to train a long short-term memory network to estimate the subglottal pressure in the larynx at massively reduced computational cost using solely synthetic training data. We train the network on synthetic data from a numerical two-mass model and validate it on experimental data from 288 high-speed ex vivo video recordings of porcine vocal folds from a previous study. The training requires significantly fewer model evaluations compared with the previous optimization approach. On the test set, we maintain a comparable performance of 21.2% versus previous 17.7% mean absolute percentage error in estimating the subglottal pressure. The evaluation of one sample requires a vanishingly small amount of computation time. The presented approach is able to maintain estimation accuracy of the subglottal pressure at significantly reduced computational cost. The methodology is likely transferable to estimate other parameters and training with other numerical models. This improvement should allow the adoption of more sophisticated, high-fidelity numerical models of the larynx. The vast speedup is a critical step to enable a future clinical application and knowledge of parameters such as the subglottal pressure will aid in diagnosis and treatment selection.
Collapse
Affiliation(s)
- Pablo Gómez
- Division of Phoniatrics and Pediatric AudiologyDepartment of Otorhinolaryngology, Head and Neck SurgeryUniversity Hospital Erlangen, Friedrich-Alexander University Erlangen-Nürnberg91054ErlangenGermany
| | - Anne Schützenberger
- Division of Phoniatrics and Pediatric AudiologyDepartment of Otorhinolaryngology, Head and Neck SurgeryUniversity Hospital Erlangen, Friedrich-Alexander University Erlangen-Nürnberg91054ErlangenGermany
| | - Marion Semmler
- Division of Phoniatrics and Pediatric AudiologyDepartment of Otorhinolaryngology, Head and Neck SurgeryUniversity Hospital Erlangen, Friedrich-Alexander University Erlangen-Nürnberg91054ErlangenGermany
| | - Michael Döllinger
- Division of Phoniatrics and Pediatric AudiologyDepartment of Otorhinolaryngology, Head and Neck SurgeryUniversity Hospital Erlangen, Friedrich-Alexander University Erlangen-Nürnberg91054ErlangenGermany
| |
Collapse
|
19
|
Döllinger M, Kniesburges S, Berry DA, Birk V, Wendler O, Dürr S, Alexiou C, Schützenberger A. Investigation of phonatory characteristics using ex vivo rabbit larynges. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2018; 144:142. [PMID: 30075689 PMCID: PMC6037535 DOI: 10.1121/1.5043384] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/11/2023]
Abstract
Quantitative analysis of phonatory characteristics of rabbits has been widely neglected. However, preliminary studies established the rabbit larynx as a potential model of human phonation. This study reports quantitative data on phonation using ex vivo rabbit larynx models to achieve more insight into dependencies of three main components of the phonation process, including airflow, vocal fold dynamics, and the acoustic output. Sustained phonation was induced in 11 ex vivo rabbit larynges. For 414 phonatory conditions, vocal fold vibrations, acoustic, and aerodynamic parameters were analyzed as functions of longitudinal vocal fold pre-stress, applied air flow, and glottal closure insufficiency. Dimensions of the vocal folds were measured and histological data were analyzed. Glottal closure characteristics improved for increasing longitudinal pre-stress and applied airflow. For the subglottal pressure signal only the cepstral peak prominence showed dependency on glottal closure. In contrast, vibrational, acoustic, and aerodynamic parameters were found to be highly dependent on the degree of glottal closure: The more complete the glottal closure during phonation, the better the aerodynamic and acoustic characteristics. Hence, complete or at least partial glottal closure appears to enhance acoustic signal quality. Finally, results validate the ex vivo rabbit larynx as an effective model for analyzing the phonatory process.
Collapse
Affiliation(s)
- Michael Döllinger
- Division for Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Head and Neck Surgery, Medical School, FAU Erlangen-Nürnberg, Waldstrasse 1, Erlangen, 91054, Germany
| | - Stefan Kniesburges
- Division for Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Head and Neck Surgery, Medical School, FAU Erlangen-Nürnberg, Waldstrasse 1, Erlangen, 91054, Germany
| | - David A Berry
- Laryngeal Dynamics Laboratory, Division of Head and Neck Surgery, David Geffen School of Medicine at UCLA, 1000 Veteran Avenue, 31-24 Rehab Center, Los Angeles, California 90095-1794, USA
| | - Veronika Birk
- Division for Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Head and Neck Surgery, Medical School, FAU Erlangen-Nürnberg, Waldstrasse 1, Erlangen, 91054, Germany
| | - Olaf Wendler
- Laboratory for Molecular Biology, Department of Otorhinolaryngology, Head and Neck Surgery, Medical School, FAU Erlangen-Nürnberg, Waldstrasse 1, Erlangen, 91054, Germany
| | - Stephan Dürr
- Division for Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Head and Neck Surgery, Medical School, FAU Erlangen-Nürnberg, Waldstrasse 1, Erlangen, 91054, Germany
| | - Christoph Alexiou
- Section of Experimental Oncology and Nanomedicine (SEON), Department of Otorhinolaryngology, Head and Neck Surgery, Medical School, Else Kröner-Fresenius-Stiftung-Professorship, FAU Erlangen-Nürnberg, Glückstrasse 10a, Erlangen, 91054, Germany
| | - Anne Schützenberger
- Division for Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Head and Neck Surgery, Medical School, FAU Erlangen-Nürnberg, Waldstrasse 1, Erlangen, 91054, Germany
| |
Collapse
|
20
|
Gómez P, Schützenberger A, Kniesburges S, Bohr C, Döllinger M. Physical parameter estimation from porcine ex vivo vocal fold dynamics in an inverse problem framework. Biomech Model Mechanobiol 2017; 17:777-792. [DOI: 10.1007/s10237-017-0992-5] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2017] [Accepted: 11/30/2017] [Indexed: 11/28/2022]
|
21
|
Birk V, Kniesburges S, Semmler M, Berry DA, Bohr C, Döllinger M, Schützenberger A. Influence of glottal closure on the phonatory process in ex vivo porcine larynges. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2017; 142:2197. [PMID: 29092569 PMCID: PMC6909995 DOI: 10.1121/1.5007952] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/11/2023]
Abstract
Many cases of disturbed voice signals can be attributed to incomplete glottal closure, vocal fold oscillation asymmetries, and aperiodicity. Often these phenomena occur simultaneously and interact with each other, making a systematic, isolated investigation challenging. Therefore, ex vivo porcine experiments were performed which enable direct control of glottal configurations. Different pre-phonatory glottal gap sizes, adduction levels, and flow rates were adjusted. The resulting glottal closure types were identified in a post-processing step. Finally, the acoustic quality, aerodynamic parameters, and the characteristics of vocal fold oscillation were analyzed in reference to the glottal closure types. Results show that complete glottal closure stabilizes the phonation process indicated through a reduced left-right phase asymmetry, increased amplitude and time periodicity, and an increase in the acoustic quality. Although asymmetry and periodicity parameter variation covers only a small range of absolute values, these small variations have a remarkable influence on the acoustic quality. Due to the fact that these parameters cannot be influenced directly, the authors suggest that the (surgical) reduction of the glottal gap seems to be a promising method to stabilize the phonatory process, which has to be confirmed in future studies.
Collapse
Affiliation(s)
- Veronika Birk
- Division of Phoniatrics and Pediatric Audiology at the Department of Otorhinolaryngology Head and Neck Surgery, University Hospital Erlangen, Medical School at Friedrich-Alexander-Universität Erlangen-Nürnberg, Waldstr. 1, 91054 Erlangen, Germany
| | - Stefan Kniesburges
- Division of Phoniatrics and Pediatric Audiology at the Department of Otorhinolaryngology Head and Neck Surgery, University Hospital Erlangen, Medical School at Friedrich-Alexander-Universität Erlangen-Nürnberg, Waldstr. 1, 91054 Erlangen, Germany
| | - Marion Semmler
- Division of Phoniatrics and Pediatric Audiology at the Department of Otorhinolaryngology Head and Neck Surgery, University Hospital Erlangen, Medical School at Friedrich-Alexander-Universität Erlangen-Nürnberg, Waldstr. 1, 91054 Erlangen, Germany
| | - David A Berry
- Laryngeal Dynamics Laboratory, Division of Head and Neck Surgery, David Geffen School of Medicine at UCLA, 10833 Le Conte Avenue, Los Angeles, California 90095-1624, USA
| | - Christopher Bohr
- Division of Phoniatrics and Pediatric Audiology at the Department of Otorhinolaryngology Head and Neck Surgery, University Hospital Erlangen, Medical School at Friedrich-Alexander-Universität Erlangen-Nürnberg, Waldstr. 1, 91054 Erlangen, Germany
| | - Michael Döllinger
- Division of Phoniatrics and Pediatric Audiology at the Department of Otorhinolaryngology Head and Neck Surgery, University Hospital Erlangen, Medical School at Friedrich-Alexander-Universität Erlangen-Nürnberg, Waldstr. 1, 91054 Erlangen, Germany
| | - Anne Schützenberger
- Division of Phoniatrics and Pediatric Audiology at the Department of Otorhinolaryngology Head and Neck Surgery, University Hospital Erlangen, Medical School at Friedrich-Alexander-Universität Erlangen-Nürnberg, Waldstr. 1, 91054 Erlangen, Germany
| |
Collapse
|
22
|
Gerstenberger C, Döllinger M, Kniesburges S, Bubalo V, Karbiener M, Schlager H, Sadeghi H, Wendler O, Gugatschka M. Phonation Analysis Combined with 3D Reconstruction of the Thyroarytenoid Muscle in Aged Ovine Ex Vivo Larynx Models. J Voice 2017; 32:517-524. [PMID: 28964638 DOI: 10.1016/j.jvoice.2017.08.016] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2017] [Revised: 08/11/2017] [Accepted: 08/16/2017] [Indexed: 11/25/2022]
Abstract
OBJECTIVE The aim of the study was to establish a basic data set of combined functional and anatomical measures of aged sheep larynges using ex vivo models. Combining these two approaches in one and the same larynx is an unmet goal so far yet is important as newer treatment strategies aim to preserve the organ structure and new assessment tools are required. Ovine larynges were used as their dimensions, and muscle fiber type distribution highly resemble the human larynx. STUDY DESIGN Ex vivo animal study. METHODS Larynges of six sheep (~9 years of age) were subjected to ex vivo functional phonatory experiments. Phonatory characteristics were analyzed as a function of longitudinal vocal fold (VF) prestress. Anatomical measurements of the same larynges comprised micro-computed tomography scans followed by three-dimensional (3D) reconstructions. Using specially adapted radiological scan protocols with subsequent 3D reconstruction, muscle volumes, surface areas, and anatomical measurements were computed. RESULTS Increasing longitudinal prestress yielded higher subglottal pressure (PS) for the same airflow. Quantitative differences to previous studies-such as the increased PS and increased phonation threshold pressure-were detected. We achieved excellent visualization of the laryngeal muscles and framework, resulting in accurate 3D reconstructions for quantitative analysis. We found no significant intraindividual volume differences of the thyroarytenoid muscles. CONCLUSION The established protocol allows precise functional and anatomical measures. The data created provide a reference data set for upcoming therapeutic strategies (eg, growth factor therapy, functional electrical stimulation) that target essential structures of the VFs such as the laryngeal muscles and/or the VF mucosa.
Collapse
Affiliation(s)
- Claus Gerstenberger
- Department of Phoniatrics, ENT University Hospital, Medical University of Graz, Graz, Austria.
| | - Michael Döllinger
- Division for Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Head and Neck Surgery, Medical School, FAU-Erlangen-Nürnberg, Erlangen, Germany
| | - Stefan Kniesburges
- Division for Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Head and Neck Surgery, Medical School, FAU-Erlangen-Nürnberg, Erlangen, Germany
| | - Vladimir Bubalo
- Center of Biomedical Research, Medical University Graz, Graz, Austria
| | - Michael Karbiener
- Department of Phoniatrics, ENT University Hospital, Medical University of Graz, Graz, Austria
| | - Hansjörg Schlager
- Department of Phoniatrics, ENT University Hospital, Medical University of Graz, Graz, Austria
| | - Hossein Sadeghi
- Division for Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Head and Neck Surgery, Medical School, FAU-Erlangen-Nürnberg, Erlangen, Germany
| | - Olaf Wendler
- Laboratory of Molecular Biology, Department of Otorhinolaryngology, Head and Neck Surgery, Medical School, FAU-Erlangen-Nürnberg, Erlangen, Germany
| | - Markus Gugatschka
- Department of Phoniatrics, ENT University Hospital, Medical University of Graz, Graz, Austria
| |
Collapse
|