1
|
Deng JJ, Peterson SD. Sensitivity of Phonation Onset Pressure to Vocal Fold Stiffness Distribution. J Biomech Eng 2024; 146:081003. [PMID: 38345603 DOI: 10.1115/1.4064718] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2023] [Indexed: 03/22/2024]
Abstract
Phonation onset is characterized by the unstable growth of vocal fold (VF) vibrations that ultimately results in self-sustained oscillation and the production of modal voice. Motivated by histological studies, much research has focused on the role of the layered structure of the vocal folds in influencing phonation onset, wherein the outer "cover" layer is relatively soft and the inner "body" layer is relatively stiff. Recent research, however, suggests that the body-cover (BC) structure over-simplifies actual stiffness distributions by neglecting important spatial variations, such as inferior-superior (IS) and anterior-posterior gradients and smooth transitions in stiffness from one histological layer to another. Herein, we explore sensitivity of phonation onset to stiffness gradients and smoothness. By assuming no a priori stiffness distribution and considering a second-order Taylor series sensitivity analysis of phonation onset pressure with respect to stiffness, we find two general smooth stiffness distributions most strongly influence onset pressure: a smooth stiffness containing aspects of BC differences and IS gradients in the cover, which plays a role in minimizing onset pressure, and uniform increases in stiffness, which raise onset pressure and frequency. While the smooth stiffness change contains aspects qualitatively similar to layered BC distributions used in computational studies, smooth transitions in stiffness result in higher sensitivity of onset pressure than discrete layering. These two general stiffness distributions also provide a simple, low-dimensional, interpretation of how complex variations in VF stiffness affect onset pressure, enabling refined exploration of the effects of stiffness distributions on phonation onset.
Collapse
Affiliation(s)
- Jonathan J Deng
- Department of Mechanical and Mechatronics Engineering, University of Waterloo, Waterloo, ON, N2L 3G1, Canada
| | - Sean D Peterson
- Department of Mechanical and Mechatronics Engineering, University of Waterloo, Waterloo, ON, N2L 3G1, Canada
| |
Collapse
|
2
|
Thomson SL. Synthetic, self-oscillating vocal fold models for voice production researcha). THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2024; 156:1283-1308. [PMID: 39172710 PMCID: PMC11348498 DOI: 10.1121/10.0028267] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/31/2023] [Revised: 07/26/2024] [Accepted: 07/30/2024] [Indexed: 08/24/2024]
Abstract
Sound for the human voice is produced by vocal fold flow-induced vibration and involves a complex coupling between flow dynamics, tissue motion, and acoustics. Over the past three decades, synthetic, self-oscillating vocal fold models have played an increasingly important role in the study of these complex physical interactions. In particular, two types of models have been established: "membranous" vocal fold models, such as a water-filled latex tube, and "elastic solid" models, such as ultrasoft silicone formed into a vocal fold-like shape and in some cases with multiple layers of differing stiffness to mimic the human vocal fold tissue structure. In this review, the designs, capabilities, and limitations of these two types of models are presented. Considerations unique to the implementation of elastic solid models, including fabrication processes and materials, are discussed. Applications in which these models have been used to study the underlying mechanical principles that govern phonation are surveyed, and experimental techniques and configurations are reviewed. Finally, recommendations for continued development of these models for even more lifelike response and clinical relevance are summarized.
Collapse
Affiliation(s)
- Scott L Thomson
- Department of Mechanical and Civil Engineering, Brigham Young University-Idaho, Rexburg, Idaho 83460, USA
| |
Collapse
|
3
|
Deng JJ, Erath BD, Zañartu M, Peterson SD. The effect of swelling on vocal fold kinematics and dynamics. Biomech Model Mechanobiol 2023; 22:1873-1889. [PMID: 37428270 DOI: 10.1007/s10237-023-01740-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2023] [Accepted: 06/19/2023] [Indexed: 07/11/2023]
Abstract
Swelling in the vocal folds is caused by the local accumulation of fluid, and has been implicated as a phase in the development of phonotraumatic vocal hyperfunction and related structural pathologies, such as vocal fold nodules. It has been posited that small degrees of swelling may be protective, but large amounts may lead to a vicious cycle wherein the engorged folds lead to conditions that promote further swelling, leading to pathologies. As a first effort to explore the mechanics of vocal fold swelling and its potential role in the etiology of voice disorders, this study employs a finite-element model with swelling confined to the superficial lamina propria, which changes the volume, mass, and stiffness of the cover layer. The impacts of swelling on a number of vocal fold kinematic and damage measures, including von Mises stress, internal viscous dissipation, and collision pressure, are presented. Swelling has small but consistent effects on voice outputs, including a reduction in fundamental frequency with increasing swelling (10 Hz at 30 % swelling). Average von Mises stress decreases slightly for small degrees of swelling but increases at large magnitudes, consistent with expectations for a vicious cycle. Both viscous dissipation and collision pressure consistently increase with the magnitude of swelling. This first effort at modeling the impact of swelling on vocal fold kinematics, kinetics, and damage measures highlights the complexity with which phonotrauma can influence performance metrics. Further identification and exploration of salient candidate measures of damage and refined studies coupling swelling with local phonotrauma are expected to shed further light on the etiological pathways of phonotraumatic vocal hyperfunction.
Collapse
Affiliation(s)
- Jonathan J Deng
- Department of Mechanical and Mechatronics Engineering, University of Waterloo, Waterloo, ON, N2L 3G1, Canada
| | - Byron D Erath
- Department of Mechanical and Aerospace Engineering, Clarkson University, Potsdam, NY, 13699, USA
| | - Matías Zañartu
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso, Chile
| | - Sean D Peterson
- Department of Mechanical and Mechatronics Engineering, University of Waterloo, Waterloo, ON, N2L 3G1, Canada.
| |
Collapse
|
4
|
Zhang Z. Voice Feature Selection to Improve Performance of Machine Learning Models for Voice Production Inversion. J Voice 2023; 37:479-485. [PMID: 33849760 PMCID: PMC8502179 DOI: 10.1016/j.jvoice.2021.03.004] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2020] [Revised: 02/24/2021] [Accepted: 03/01/2021] [Indexed: 11/19/2022]
Abstract
OBJECTIVE Estimation of physiological control parameters of the vocal system from the produced voice outcome has important applications in clinical management of voice disorders . Previously we developed a simulation-based neural network for estimation of vocal fold geometry, mechanical properties, and subglottal pressure from voice outcome features that characterize the acoustics of the produced voice. The goals of this study are to (1) explore the possibility of improving the estimation accuracy of physiological control parameters by including voice outcome features characterizing vocal fold vibration; and (2) identify voice feature sets that optimize both estimation accuracy and robustness to measurement noise. METHODS Feedforward neural networks are trained to solve the inversion problem of estimating the physiological control parameters of a three-dimensional body-cover vocal fold model from different sets of voice outcome features that characterize the simulated voice acoustics, glottal flow, and vocal fold vibration. A sensitivity analysis is then performed to evaluate the contribution of individual voice features to the overall performance of the neural networks in estimating the physiologic control parameters. RESULTS AND CONCLUSIONS While including voice outcome features characterizing vocal fold vibration increases estimation accuracy, it also reduces the network's robustness to measurement noise, due to high sensitivity of network performance to voice outcome features measuring the absolute amplitudes of the glottal flow and area waveforms, which are also difficult to measure accurately in practical applications. By excluding such glottal flow-based features and replacing glottal area-based features by their normalized counterparts, we are able to significantly improve both estimation accuracy and robustness to noise. We further show that similar estimation accuracy and robustness can be achieved with an even smaller set of voice outcome features by excluding features of small sensitivity.
Collapse
Affiliation(s)
- Zhaoyan Zhang
- Department of Head and Neck Surgery, University of California, Los Angeles, 31-24 Rehabilitation Center, Los Angeles, California.
| |
Collapse
|
5
|
Movahhedi M, Liu XY, Geng B, Elemans C, Xue Q, Wang JX, Zheng X. Predicting 3D soft tissue dynamics from 2D imaging using physics informed neural networks. Commun Biol 2023; 6:541. [PMID: 37208428 DOI: 10.1038/s42003-023-04914-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2022] [Accepted: 05/04/2023] [Indexed: 05/21/2023] Open
Abstract
Tissue dynamics play critical roles in many physiological functions and provide important metrics for clinical diagnosis. Capturing real-time high-resolution 3D images of tissue dynamics, however, remains a challenge. This study presents a hybrid physics-informed neural network algorithm that infers 3D flow-induced tissue dynamics and other physical quantities from sparse 2D images. The algorithm combines a recurrent neural network model of soft tissue with a differentiable fluid solver, leveraging prior knowledge in solid mechanics to project the governing equation on a discrete eigen space. The algorithm uses a Long-short-term memory-based recurrent encoder-decoder connected with a fully connected neural network to capture the temporal dependence of flow-structure-interaction. The effectiveness and merit of the proposed algorithm is demonstrated on synthetic data from a canine vocal fold model and experimental data from excised pigeon syringes. The results showed that the algorithm accurately reconstructs 3D vocal dynamics, aerodynamics, and acoustics from sparse 2D vibration profiles.
Collapse
Affiliation(s)
| | - Xin-Yang Liu
- Aerospace and Mechanical Engineering Department, University of Notre Dame, Notre Dame, IN, 46556, USA
| | - Biao Geng
- Mechanical Engineering Department, University of Maine, Orono, ME, 04469, USA
- Mechanical Engineering Department, Rochester Institute of Technology, Rochester, NY, 14623, USA
| | - Coen Elemans
- Department of Biology, University of Southern Denmark, Odense M, 5230, Denmark
| | - Qian Xue
- Mechanical Engineering Department, University of Maine, Orono, ME, 04469, USA
- Mechanical Engineering Department, Rochester Institute of Technology, Rochester, NY, 14623, USA
| | - Jian-Xun Wang
- Aerospace and Mechanical Engineering Department, University of Notre Dame, Notre Dame, IN, 46556, USA.
| | - Xudong Zheng
- Mechanical Engineering Department, University of Maine, Orono, ME, 04469, USA.
- Mechanical Engineering Department, Rochester Institute of Technology, Rochester, NY, 14623, USA.
| |
Collapse
|
6
|
Ibarra EJ, Parra JA, Alzamendi GA, Cortés JP, Espinoza VM, Mehta DD, Hillman RE, Zañartu M. Estimation of Subglottal Pressure, Vocal Fold Collision Pressure, and Intrinsic Laryngeal Muscle Activation From Neck-Surface Vibration Using a Neural Network Framework and a Voice Production Model. Front Physiol 2021; 12:732244. [PMID: 34539451 PMCID: PMC8440844 DOI: 10.3389/fphys.2021.732244] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2021] [Accepted: 08/09/2021] [Indexed: 11/23/2022] Open
Abstract
The ambulatory assessment of vocal function can be significantly enhanced by having access to physiologically based features that describe underlying pathophysiological mechanisms in individuals with voice disorders. This type of enhancement can improve methods for the prevention, diagnosis, and treatment of behaviorally based voice disorders. Unfortunately, the direct measurement of important vocal features such as subglottal pressure, vocal fold collision pressure, and laryngeal muscle activation is impractical in laboratory and ambulatory settings. In this study, we introduce a method to estimate these features during phonation from a neck-surface vibration signal through a framework that integrates a physiologically relevant model of voice production and machine learning tools. The signal from a neck-surface accelerometer is first processed using subglottal impedance-based inverse filtering to yield an estimate of the unsteady glottal airflow. Seven aerodynamic and acoustic features are extracted from the neck surface accelerometer and an optional microphone signal. A neural network architecture is selected to provide a mapping between the seven input features and subglottal pressure, vocal fold collision pressure, and cricothyroid and thyroarytenoid muscle activation. This non-linear mapping is trained solely with 13,000 Monte Carlo simulations of a voice production model that utilizes a symmetric triangular body-cover model of the vocal folds. The performance of the method was compared against laboratory data from synchronous recordings of oral airflow, intraoral pressure, microphone, and neck-surface vibration in 79 vocally healthy female participants uttering consecutive /pæ/ syllable strings at comfortable, loud, and soft levels. The mean absolute error and root-mean-square error for estimating the mean subglottal pressure were 191 Pa (1.95 cm H2O) and 243 Pa (2.48 cm H2O), respectively, which are comparable with previous studies but with the key advantage of not requiring subject-specific training and yielding more output measures. The validation of vocal fold collision pressure and laryngeal muscle activation was performed with synthetic values as reference. These initial results provide valuable insight for further vocal fold model refinement and constitute a proof of concept that the proposed machine learning method is a feasible option for providing physiologically relevant measures for laboratory and ambulatory assessment of vocal function.
Collapse
Affiliation(s)
- Emiro J. Ibarra
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso, Chile
- School of Electrical Engineering, University of the Andes, Mérida, Venezuela
| | - Jesús A. Parra
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso, Chile
| | - Gabriel A. Alzamendi
- Institute for Research and Development on Bioengineering and Bioinformatics, Consejo Nacional de Investigaciones Científicas y Técnicas - Universidad Nacional de Entre Ríos, Oro Verde, Argentina
| | - Juan P. Cortés
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso, Chile
- Center for Laryngeal Surgery and Voice Rehabilitation Laboratory, Massachusetts General Hospital–Harvard Medical School, Boston, MA, United States
| | - Víctor M. Espinoza
- Department of Sound, Faculty of Arts, University of Chile, Santiago, Chile
| | - Daryush D. Mehta
- Center for Laryngeal Surgery and Voice Rehabilitation Laboratory, Massachusetts General Hospital–Harvard Medical School, Boston, MA, United States
| | - Robert E. Hillman
- Center for Laryngeal Surgery and Voice Rehabilitation Laboratory, Massachusetts General Hospital–Harvard Medical School, Boston, MA, United States
| | - Matías Zañartu
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso, Chile
| |
Collapse
|
7
|
Motie-Shirazi M, Zañartu M, Peterson SD, Erath BD. Vocal fold dynamics in a synthetic self-oscillating model: Contact pressure and dissipated-energy dose. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2021; 150:478. [PMID: 34340498 PMCID: PMC8298101 DOI: 10.1121/10.0005596] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/02/2020] [Revised: 06/18/2021] [Accepted: 06/21/2021] [Indexed: 06/13/2023]
Abstract
The energy dissipated during vocal fold (VF) contact is a predictor of phonotrauma. Difficulty measuring contact pressure has forced prior energy dissipation estimates to rely upon generalized approximations of the contact dynamics. To address this shortcoming, contact pressure was measured in a self-oscillating synthetic VF model with high spatiotemporal resolution using a hemilaryngeal configuration. The approach yields a temporal resolution of less than 0.26 ms and a spatial resolution of 0.254 mm in the inferior-superior direction. The average contact pressure was found to be 32% of the peak contact pressure, 60% higher than the ratio estimated in prior studies. It was found that 52% of the total power was dissipated due to collision. The power dissipated during contact was an order of magnitude higher than the power dissipated due to internal friction during the non-contact phase of oscillation. Both the contact pressure magnitude and dissipated power were found to be maximums at the mid anterior-posterior position, supporting the idea that collision is responsible for the formation of benign lesions, which normally appear at the middle third of the VF.
Collapse
Affiliation(s)
- Mohsen Motie-Shirazi
- Department of Mechanical and Aeronautical Engineering, Clarkson University, Potsdam, New York 13699, USA
| | - Matías Zañartu
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso, Chile
| | - Sean D Peterson
- Department of Mechanical and Mechatronics Engineering, University of Waterloo, Waterloo, Ontario, Canada
| | - Byron D Erath
- Department of Mechanical and Aeronautical Engineering, Clarkson University, Potsdam, New York 13699, USA
| |
Collapse
|
8
|
Hadwin PJ, Erath BD, Peterson SD. The influence of flow model selection on finite element model parameter estimation using Bayesian inference. JASA EXPRESS LETTERS 2021; 1:045204. [PMID: 34136884 PMCID: PMC8182970 DOI: 10.1121/10.0004260] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/21/2021] [Accepted: 03/18/2021] [Indexed: 06/12/2023]
Abstract
Recently, Bayesian estimation coupled with finite element modeling has been demonstrated as a viable tool for estimating vocal fold material properties from kinematic information obtained via high-speed video recordings. In this article, the sensitivity of the parameter estimations to the employed fluid model is explored by considering Bernoulli and one-dimensional viscous fluid flow models. Simulation results indicate that prescribing an ad hoc separation location for the Bernoulli flow model can lead to large estimate biases, whereas including the separation location as an estimated parameter leads to results comparable to that of the viscous fluid flow model.
Collapse
Affiliation(s)
- Paul J Hadwin
- Department of Mechanical and Mechatronics Engineering, University of Waterloo, Waterloo, Ontario N2L 3G1, Canada
| | - Byron D Erath
- Department of Mechanical and Aeronautical Engineering, Clarkson University, Potsdam, New York 13699, USA , ,
| | - Sean D Peterson
- Department of Mechanical and Mechatronics Engineering, University of Waterloo, Waterloo, Ontario N2L 3G1, Canada
| |
Collapse
|
9
|
Li Z, Chen Y, Chang S, Rousseau B, Luo H. A one-dimensional flow model enhanced by machine learning for simulation of vocal fold vibration. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2021; 149:1712. [PMID: 33765799 PMCID: PMC7954577 DOI: 10.1121/10.0003561] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/02/2020] [Revised: 01/25/2021] [Accepted: 02/01/2021] [Indexed: 06/02/2023]
Abstract
A one-dimensional (1D) unsteady and viscous flow model that is derived from the momentum and mass conservation equations is described, and to enhance this physics-based model, a machine learning approach is used to determine the unknown modeling parameters. Specifically, an idealized larynx model is constructed and ten cases of three-dimensional (3D) fluid-structure interaction (FSI) simulations are performed. The flow data are then extracted to train the 1D flow model using a sparse identification approach for nonlinear dynamical systems. As a result of training, we obtain the analytical expressions for the entrance effect and pressure loss in the glottis, which are then incorporated in the flow model to conveniently handle different glottal shapes due to vocal fold vibration. We apply the enhanced 1D flow model in the FSI simulation of both idealized vocal fold geometries and subject-specific anatomical geometries reconstructed from the magnetic resonance imaging images of rabbits' larynges. The 1D flow model is evaluated in both of these setups and shown to have robust performance. Therefore, it provides a fast simulation tool that is superior to the previous 1D models.
Collapse
Affiliation(s)
- Zheng Li
- Department of Mechanical Engineering, Vanderbilt University, 2301 Vanderbilt Place, Nashville, Tennessee 37235-1592, USA
| | - Ye Chen
- Department of Mechanical Engineering, Vanderbilt University, 2301 Vanderbilt Place, Nashville, Tennessee 37235-1592, USA
| | - Siyuan Chang
- Department of Mechanical Engineering, Vanderbilt University, 2301 Vanderbilt Place, Nashville, Tennessee 37235-1592, USA
| | - Bernard Rousseau
- Department of Communication Science and Disorders, University of Pittsburgh, Pittsburgh, Pennsylvania 15213, USA
| | - Haoxiang Luo
- Department of Mechanical Engineering, Vanderbilt University, 2301 Vanderbilt Place, Nashville, Tennessee 37235-1592, USA
| |
Collapse
|
10
|
Li Z, Wilson A, Sayce L, Avhad A, Rousseau B, Luo H. Numerical and experimental investigations on vocal fold approximation in healthy and simulated unilateral vocal fold paralysis. APPLIED SCIENCES-BASEL 2021; 11. [PMID: 34671486 DOI: 10.3390/app11041817] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
We have developed a novel surgical/computational model for the investigation of unilateral vocal fold paralysis (UVFP) which will be used to inform future in silico approaches to improve surgical outcomes in type I thyroplasty. Healthy phonation (HP) was achieved using cricothyroid suture approximation on both sides of the larynx to generate symmetrical vocal fold closure. Following high-speed videoendoscopy (HSV) capture, sutures on the right side of the larynx were removed, partially releasing tension unilaterally and generating asymmetric vocal fold closure characteristic of UVFP (sUVFP condition). HSV revealed symmetric vibration in HP, while in sUVFP the sutured side demonstrated a higher frequency (10 - 11%). For the computational model, ex vivo magnetic resonance imaging (MRI) scans were captured at three configurations: non-approximated (NA), HP, and sUVFP. A finite-element method (FEM) model was built, in which cartilage displacements from the MRI images were used to prescribe the adduction and the vocal fold deformation was simulated before the eigenmode calculation. The results showed that the frequency comparison between the two sides were consistent with observations from HSV. This alignment between the surgical and computational models supports the future application of these methods for the investigation of treatment for UVFP.
Collapse
Affiliation(s)
- Zheng Li
- Department of Mechanical Engineering, Vanderbilt University, 2301 Vanderbilt Place PMB 401592, Nashville, TN, 37240, USA
| | - Azure Wilson
- Department of Communication Science and Disorders, University of Pittsburgh, 4200 Fifth Avenue, Pittsburgh, PA, 15260, USA
| | - Lea Sayce
- Department of Communication Science and Disorders, University of Pittsburgh, 4200 Fifth Avenue, Pittsburgh, PA, 15260, USA
| | - Amit Avhad
- Department of Mechanical Engineering, Vanderbilt University, 2301 Vanderbilt Place PMB 401592, Nashville, TN, 37240, USA
| | - Bernard Rousseau
- Department of Communication Science and Disorders, University of Pittsburgh, 4200 Fifth Avenue, Pittsburgh, PA, 15260, USA
| | - Haoxiang Luo
- Department of Mechanical Engineering, Vanderbilt University, 2301 Vanderbilt Place PMB 401592, Nashville, TN, 37240, USA
| |
Collapse
|
11
|
Alzamendi GA, Manríquez R, Hadwin PJ, Deng JJ, Peterson SD, Erath BD, Mehta DD, Hillman RE, Zañartu M. Bayesian estimation of vocal function measures using laryngeal high-speed videoendoscopy and glottal airflow estimates: An in vivo case study. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2020; 147:EL434. [PMID: 32486812 PMCID: PMC7480079 DOI: 10.1121/10.0001276] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/10/2020] [Revised: 05/01/2020] [Accepted: 05/03/2020] [Indexed: 05/30/2023]
Abstract
This study introduces the in vivo application of a Bayesian framework to estimate subglottal pressure, laryngeal muscle activation, and vocal fold contact pressure from calibrated transnasal high-speed videoendoscopy and oral airflow data. A subject-specific, lumped-element vocal fold model is estimated using an extended Kalman filter and two observation models involving glottal area and glottal airflow. Model-based inferences using data from a vocally healthy male individual are compared with empirical estimates of subglottal pressure and reference values for muscle activation and contact pressure in the literature, thus providing baseline error metrics for future clinical investigations.
Collapse
Affiliation(s)
- Gabriel A Alzamendi
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso 2390123, Chile
| | - Rodrigo Manríquez
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso 2390123, Chile
| | - Paul J Hadwin
- Department of Mechanical and Mechatronics Engineering, University of Waterloo, Waterloo, Ontario N2L 3G1, Canada
| | - Jonathan J Deng
- Department of Mechanical and Mechatronics Engineering, University of Waterloo, Waterloo, Ontario N2L 3G1, Canada
| | - Sean D Peterson
- Department of Mechanical and Mechatronics Engineering, University of Waterloo, Waterloo, Ontario N2L 3G1, Canada
| | - Byron D Erath
- Department of Mechanical and Aeronautical Engineering, Clarkson University, Potsdam, New York 13699, USA
| | - Daryush D Mehta
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, Massachusetts 02114, , , , , , , , ,
| | - Robert E Hillman
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, Massachusetts 02114, , , , , , , , ,
| | - Matías Zañartu
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso 2390123, Chile
| |
Collapse
|
12
|
Zhang Z. Estimation of vocal fold physiology from voice acoustics using machine learning. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2020; 147:EL264. [PMID: 32237804 PMCID: PMC7075716 DOI: 10.1121/10.0000927] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/15/2019] [Revised: 03/01/2020] [Accepted: 03/03/2020] [Indexed: 05/27/2023]
Abstract
The goal of this study is to estimate vocal fold geometry, stiffness, position, and subglottal pressure from voice acoustics, toward clinical and other voice technology applications. Unlike previous voice inversion research that often uses lumped-element models of phonation, this study explores the feasibility of voice inversion using data generated from a three-dimensional voice production model. Neural networks are trained to estimate vocal fold properties and subglottal pressure from voice features extracted from the simulation data. Results show reasonably good estimation accuracy, particularly for vocal fold properties with a consistent global effect on voice production, and reasonable agreement with excised human larynx experiment.
Collapse
Affiliation(s)
- Zhaoyan Zhang
- Department of Head and Neck Surgery, University of California, Los Angeles, 31-24 Rehab Center, 1000 Veteran Avenue, Los Angeles, California 90095-1794,
| |
Collapse
|
13
|
Drioli C, Foresti GL. Fitting a biomechanical model of the folds to high-speed video data through bayesian estimation. INFORMATICS IN MEDICINE UNLOCKED 2020. [DOI: 10.1016/j.imu.2020.100373] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022] Open
|
14
|
Risk Quantification and Analysis of Coupled Factors Based on the DEMATEL Model and a Bayesian Network. APPLIED SCIENCES-BASEL 2019. [DOI: 10.3390/app10010317] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
With the developing of high integrations in large scale systems, such as aircraft and other industrial systems, there are new challenges in safety analysis due to the complexity of the mission process and the more complicated coupling characteristic of multi-factors. Aiming at the evaluation of coupled factors as well as the risk of the mission, this paper proposes a combined technology based on the Decision Making Trial and Evaluation Laboratory (DEMATEL) model and the Bayesian network (BN). After identifying and classifying the risk factors from the perspectives of humans, machines, the environment, and management, the DEMATEL technique is adopted to assess their direct and/or indirect coupling relationships to determine the importance and causality of each factor; moreover, the relationship matrix in the DEMATEL model is used to generate the BN model, including its parameterization. The inverse reasoning theory is then implemented to derive the probability, and the risk of the coupled factors is evaluated by an assessment model integrating the probability and severity. Furthermore, the key risk factors are identified based on the risk radar diagram and the Pareto rule to support the preventive measurements. Finally, an application of the take-off process of aircraft is provided to demonstrate the proposed method.
Collapse
|
15
|
Mehta DD, Kobler JB, Zeitels SM, Zañartu M, Erath BD, Motie-Shirazi M, Peterson SD, Petrillo RH, Hillman RE. Toward Development of a Vocal Fold Contact Pressure Probe: Bench-Top Validation of a Dual-Sensor Probe Using Excised Human Larynx Models. APPLIED SCIENCES (BASEL, SWITZERLAND) 2019; 9:4360. [PMID: 34084559 PMCID: PMC8171492 DOI: 10.3390/app9204360] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
A critical element in understanding voice production mechanisms is the characterization of vocal fold collision, which is widely considered a primary etiological factor in the development of common phonotraumatic lesions such as nodules and polyps. This paper describes the development of a transoral, dual-sensor intraglottal/subglottal pressure probe for the simultaneous measurement of vocal fold collision and subglottal pressures during phonation using two miniature sensors positioned 7.6 mm apart at the distal end of a rigid cannula. Proof-of-concept testing was performed using excised whole-mount and hemilarynx human tissue aerodynamically driven into self-sustained oscillation, with systematic variation of the superior-inferior positioning of the vocal fold collision sensor. In the hemilarynx experiment, signals from the pressure sensors were synchronized with an acoustic microphone, a tracheal-surface accelerometer, and two high-speed video cameras recording at 4000 frames per second for top-down and en face imaging of the superior and medial vocal fold surfaces, respectively. As expected, the intraglottal pressure signal exhibited an impulse-like peak when vocal fold contact occurred, followed by a broader peak associated with intraglottal pressure build-up during the de-contacting phase. As subglottal pressure was increased, the peak amplitude of the collision pressure increased and typically reached a value below that of the average subglottal pressure. Results provide important baseline vocal fold collision pressure data with which computational models of voice production can be developed and in vivo measurements can be referenced.
Collapse
Affiliation(s)
- Daryush D. Mehta
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA 02114, USA
- Department of Surgery, Massachusetts General Hospital–Harvard Medical School, Boston, MA 02114, USA
- MGH Institute of Health Professions, Boston, MA 02129, USA
- Speech and Hearing Bioscience and Technology, Division of Medical Sciences, Harvard Medical School, Boston, MA 02115, USA
| | - James B. Kobler
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA 02114, USA
- Department of Surgery, Massachusetts General Hospital–Harvard Medical School, Boston, MA 02114, USA
- MGH Institute of Health Professions, Boston, MA 02129, USA
| | - Steven M. Zeitels
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA 02114, USA
- Department of Surgery, Massachusetts General Hospital–Harvard Medical School, Boston, MA 02114, USA
- MGH Institute of Health Professions, Boston, MA 02129, USA
| | - Matías Zañartu
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso 2390123, Chile
| | - Byron D. Erath
- Department of Mechanical & Aeronautical Engineering, Clarkson University, Potsdam, NY 13699, USA
| | - Mohsen Motie-Shirazi
- Department of Mechanical & Aeronautical Engineering, Clarkson University, Potsdam, NY 13699, USA
| | - Sean D. Peterson
- Department of Mechanical and Mechatronics Engineering, University of Waterloo, Waterloo, ON N2L 3G1, Canada
| | - Robert H. Petrillo
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA 02114, USA
- Department of Surgery, Massachusetts General Hospital–Harvard Medical School, Boston, MA 02114, USA
| | - Robert E. Hillman
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA 02114, USA
- Department of Surgery, Massachusetts General Hospital–Harvard Medical School, Boston, MA 02114, USA
- MGH Institute of Health Professions, Boston, MA 02129, USA
- Speech and Hearing Bioscience and Technology, Division of Medical Sciences, Harvard Medical School, Boston, MA 02115, USA
| |
Collapse
|