1
|
Galindo GE, Peterson SD, Erath BD, Castro C, Hillman RE, Zañartu M. Modeling the Pathophysiology of Phonotraumatic Vocal Hyperfunction With a Triangular Glottal Model of the Vocal Folds. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2017; 60:2452-2471. [PMID: 28837719 PMCID: PMC5831616 DOI: 10.1044/2017_jslhr-s-16-0412] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/29/2016] [Accepted: 04/19/2017] [Indexed: 05/08/2023]
Abstract
PURPOSE Our goal was to test prevailing assumptions about the underlying biomechanical and aeroacoustic mechanisms associated with phonotraumatic lesions of the vocal folds using a numerical lumped-element model of voice production. METHOD A numerical model with a triangular glottis, posterior glottal opening, and arytenoid posturing is proposed. Normal voice is altered by introducing various prephonatory configurations. Potential compensatory mechanisms (increased subglottal pressure, muscle activation, and supraglottal constriction) are adjusted to restore an acoustic target output through a control loop that mimics a simplified version of auditory feedback. RESULTS The degree of incomplete glottal closure in both the membranous and posterior portions of the folds consistently leads to a reduction in sound pressure level, fundamental frequency, harmonic richness, and harmonics-to-noise ratio. The compensatory mechanisms lead to significantly increased vocal-fold collision forces, maximum flow-declination rate, and amplitude of unsteady flow, without significantly altering the acoustic output. CONCLUSION Modeling provided potentially important insights into the pathophysiology of phonotraumatic vocal hyperfunction by demonstrating that compensatory mechanisms can counteract deterioration in the voice acoustic signal due to incomplete glottal closure, but this also leads to high vocal-fold collision forces (reflected in aerodynamic measures), which significantly increases the risk of developing phonotrauma.
Collapse
Affiliation(s)
- Gabriel E. Galindo
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso, Chile
| | - Sean D. Peterson
- Mechanical and Mechatronics Engineering, University of Waterloo, Ontario, Canada
| | - Byron D. Erath
- Department of Mechanical & Aeronautical Engineering, Clarkson University, Potsdam, NY
| | - Christian Castro
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso, Chile
- School of Speech and Hearing Sciences, Universidad de Valparaíso, Chile
| | - Robert E. Hillman
- Center for Laryngeal Surgery & Voice Rehabilitation, Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| | - Matías Zañartu
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso, Chile
| |
Collapse
|
2
|
Hadwin PJ, Peterson SD. An extended Kalman filter approach to non-stationary Bayesian estimation of reduced-order vocal fold model parameters. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2017; 141:2909. [PMID: 28464670 DOI: 10.1121/1.4981240] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/09/2023]
Abstract
The Bayesian framework for parameter inference provides a basis from which subject-specific reduced-order vocal fold models can be generated. Previously, it has been shown that a particle filter technique is capable of producing estimates and associated credibility intervals of time-varying reduced-order vocal fold model parameters. However, the particle filter approach is difficult to implement and has a high computational cost, which can be barriers to clinical adoption. This work presents an alternative estimation strategy based upon Kalman filtering aimed at reducing the computational cost of subject-specific model development. The robustness of this approach to Gaussian and non-Gaussian noise is discussed. The extended Kalman filter (EKF) approach is found to perform very well in comparison with the particle filter technique at dramatically lower computational cost. Based upon the test cases explored, the EKF is comparable in terms of accuracy to the particle filter technique when greater than 6000 particles are employed; if less particles are employed, the EKF actually performs better. For comparable levels of accuracy, the solution time is reduced by 2 orders of magnitude when employing the EKF. By virtue of the approximations used in the EKF, however, the credibility intervals tend to be slightly underpredicted.
Collapse
Affiliation(s)
- Paul J Hadwin
- Department of Mechanical and Mechatronics Engineering, University of Waterloo, Waterloo, Ontario N2L 3G1 Canada
| | - Sean D Peterson
- Department of Mechanical and Mechatronics Engineering, University of Waterloo, Waterloo, Ontario N2L 3G1 Canada
| |
Collapse
|
3
|
Hadwin PJ, Galindo GE, Daun KJ, Zañartu M, Erath BD, Cataldo E, Peterson SD. Non-stationary Bayesian estimation of parameters from a body cover model of the vocal folds. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2016; 139:2683. [PMID: 27250162 PMCID: PMC10423076 DOI: 10.1121/1.4948755] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/10/2015] [Revised: 04/15/2016] [Accepted: 04/22/2016] [Indexed: 05/09/2023]
Abstract
The evolution of reduced-order vocal fold models into clinically useful tools for subject-specific diagnosis and treatment hinges upon successfully and accurately representing an individual patient in the modeling framework. This, in turn, requires inference of model parameters from clinical measurements in order to tune a model to the given individual. Bayesian analysis is a powerful tool for estimating model parameter probabilities based upon a set of observed data. In this work, a Bayesian particle filter sampling technique capable of estimating time-varying model parameters, as occur in complex vocal gestures, is introduced. The technique is compared with time-invariant Bayesian estimation and least squares methods for determining both stationary and non-stationary parameters. The current technique accurately estimates the time-varying unknown model parameter and maintains tight credibility bounds. The credibility bounds are particularly relevant from a clinical perspective, as they provide insight into the confidence a clinician should have in the model predictions.
Collapse
Affiliation(s)
- Paul J Hadwin
- Department of Mechanical and Mechatronics Engineering, University of Waterloo, Waterloo, Ontario N2L 3G1, Canada
| | - Gabriel E Galindo
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso, Chile
| | - Kyle J Daun
- Department of Mechanical and Mechatronics Engineering, University of Waterloo, Waterloo, Ontario N2L 3G1, Canada
| | - Matías Zañartu
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso, Chile
| | - Byron D Erath
- Department of Mechanical and Aeronautical Engineering, Clarkson University, Potsdam, New York 13699, USA
| | - Edson Cataldo
- Applied Mathematics Department, Graduate Program in Electrical and Telecommunications Engineering (PPGEET), Universidade Federal Fluminense, Niteroi, Rio de Janeiro, CEP24020-140, Brazil
| | - Sean D Peterson
- Department of Mechanical and Mechatronics Engineering, University of Waterloo, Waterloo, Ontario N2L 3G1, Canada
| |
Collapse
|
4
|
Devine EE, Bulleit EE, Hoffman MR, McCulloch TM, Jiang JJ. Aerodynamic and nonlinear dynamic acoustic analysis of tension asymmetry in excised canine larynges. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2012; 55:1850-61. [PMID: 22562826 PMCID: PMC3593670 DOI: 10.1044/1092-4388(2012/11-0240)] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/25/2023]
Abstract
PURPOSE To model tension asymmetry caused by superior laryngeal nerve paralysis (SLNP) in excised larynges and apply perturbation, nonlinear dynamic, and aerodynamic analyses. METHOD SLNP was modeled in 8 excised larynges using sutures and weights to mimic cricothyroid (CT) muscle function. Weights were removed from one side to create tension asymmetry, mimicking unilateral SLNP. Two sets of weights were used, 1 light and 1 heavy. Five conditions were evaluated: (a) no tension, (b) symmetrical light tension, (c) asymmetrical light tension, (d) symmetrical heavy tension, and (e) asymmetrical heavy tension. RESULTS Perturbation parameters were not significantly different across conditions: percent jitter, χ(2)(4) = 3.70, p = .451; percent shimmer, F(4) = 0.95, p = .321. In addition, many measurements were invalid (error values >10). Second-order entropy was significantly different across conditions, F(4) = 5.432, p = .002, whereas correlation dimension was not, F(4) = 0.99, p = .428. Validity of these nonlinear dynamic parameters was demonstrated by low standard deviations. Phonation threshold pressure, χ (2)(4) = 22.50, p < .001, and power, χ (2)(4) = 9.50, p = .05, differed significantly across conditions, whereas phonation threshold flow did not, χ (2)(4) = 4.08, p = .396. CONCLUSIONS Nonlinear dynamic analysis differentiated between symmetrical and asymmetrical tension conditions, whereas traditional perturbation analysis was less useful in characterizing type 2 or 3 vocal signals. Supplementing acoustic with aerodynamic parameters may help distinguish among laryngeal disorders of neuromuscular origin.
Collapse
Affiliation(s)
- Erin E Devine
- University of Wisconsin-Madison School of Medicine and Public Health, Wisconsin, USA
| | | | | | | | | |
Collapse
|
5
|
Mehta DD, Zaéartu M, Quatieri TF, Deliyski DD, Hillman RE. Investigating acoustic correlates of human vocal fold vibratory phase asymmetry through modeling and laryngeal high-speed videoendoscopy. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2011; 130:3999-4009. [PMID: 22225054 PMCID: PMC3253599 DOI: 10.1121/1.3658441] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/16/2010] [Revised: 09/29/2011] [Accepted: 10/06/2011] [Indexed: 05/09/2023]
Abstract
Vocal fold vibratory asymmetry is often associated with inefficient sound production through its impact on source spectral tilt. This association is investigated in both a computational voice production model and a group of 47 human subjects. The model provides indirect control over the degree of left-right phase asymmetry within a nonlinear source-filter framework, and high-speed videoendoscopy provides in vivo measures of vocal fold vibratory asymmetry. Source spectral tilt measures are estimated from the inverse-filtered spectrum of the simulated and recorded radiated acoustic pressure. As expected, model simulations indicate that increasing left-right phase asymmetry induces steeper spectral tilt. Subject data, however, reveal that none of the vibratory asymmetry measures correlates with spectral tilt measures. Probing further into physiological correlates of spectral tilt that might be affected by asymmetry, the glottal area waveform is parameterized to obtain measures of the open phase (open/plateau quotient) and closing phase (speed/closing quotient). Subjects' left-right phase asymmetry exhibits low, but statistically significant, correlations with speed quotient (r=0.45) and closing quotient (r=-0.39). Results call for future studies into the effect of asymmetric vocal fold vibration on glottal airflow and the associated impact on voice source spectral properties and vocal efficiency.
Collapse
Affiliation(s)
- Daryush D Mehta
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, Massachusetts 02114, USA.
| | | | | | | | | |
Collapse
|
6
|
Erath BD, Zañartu M, Peterson SD, Plesniak MW. Nonlinear vocal fold dynamics resulting from asymmetric fluid loading on a two-mass model of speech. CHAOS (WOODBURY, N.Y.) 2011; 21:033113. [PMID: 21974648 DOI: 10.1063/1.3615726] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
Nonlinear vocal fold dynamics arising from asymmetric flow formations within the glottis are investigated using a two-mass model of speech with asymmetric vocal fold tensioning, representative of unilateral vocal fold paralysis. A refined theoretical boundary-layer flow solver is implemented to compute the intraglottal pressures, providing a more realistic description of the flow than the standard one-dimensional, inviscid Bernoulli flow solution. Vocal fold dynamics are investigated for subglottal pressures of 0.6 < p(s) < 1.5 kPa and tension asymmetries of 0.5 < Q < 0.8. As tension asymmetries become pronounced the asymmetric flow incites nonlinear behavior in the vocal fold dynamics at subglottal pressures that are associated with normal speech, behavior that is not captured with standard Bernoulli flow solvers. Regions of bifurcation, coexistence of solutions, and chaos are identified.
Collapse
Affiliation(s)
- Byron D Erath
- Department of Mechanical and Aerospace Engineering, The George Washington University, Washington DC 20052, USA.
| | | | | | | |
Collapse
|
7
|
Voigt D, Döllinger M, Yang A, Eysholdt U, Lohscheller J. Automatic diagnosis of vocal fold paresis by employing phonovibrogram features and machine learning methods. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2010; 99:275-288. [PMID: 20138386 DOI: 10.1016/j.cmpb.2010.01.004] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/13/2009] [Revised: 11/02/2009] [Accepted: 01/09/2010] [Indexed: 05/28/2023]
Abstract
The clinical diagnosis of voice disorders is based on examination of the rapidly moving vocal folds during phonation (f0: 80-300Hz) with state-of-the-art endoscopic high-speed cameras. Commonly, analysis is performed in a subjective and time-consuming manner via slow-motion video playback and exhibits low inter- and intra-rater reliability. In this study an objective method to overcome this drawback is presented being based on Phonovibrography, a novel image analysis technique. For a collective of 45 normophonic and paralytic voices the laryngeal dynamics were captured by specialized Phonovibrogram features and analyzed with different machine learning algorithms. Classification accuracies reached 93% for 2-class and 73% for 3-class discrimination. The results were validated by subjective expert ratings given the same diagnostic criteria. The automatic Phonovibrogram analysis approach exceeded the experienced raters' classifications by 9%. The presented method holds a lot of potential for providing reliable vocal fold diagnosis support in the future.
Collapse
Affiliation(s)
- Daniel Voigt
- Department of Phoniatrics and Pediatric Audiology, University Hospital Erlangen, Bohlenplatz 21, D-91054 Erlangen, Germany.
| | | | | | | | | |
Collapse
|
8
|
Classification of functional voice disorders based on phonovibrograms. Artif Intell Med 2010; 49:51-9. [DOI: 10.1016/j.artmed.2010.01.001] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2008] [Revised: 08/20/2009] [Accepted: 01/10/2010] [Indexed: 11/17/2022]
|
9
|
Huang D, Xing G, Wheeler DW. Multiparameter estimation using only a chaotic time series and its applications. CHAOS (WOODBURY, N.Y.) 2007; 17:023118. [PMID: 17614672 DOI: 10.1063/1.2732495] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/16/2023]
Abstract
An important extension to the techniques of synchronization-based parameter estimation is presented. Based on adaptive chaos synchronization, several methods are proposed to dynamically estimate multiple parameters using only a scalar chaotic time series. In comparison with previous schemes, the presented methods decrease the cost of parameter estimation and are more applicable in practice. Numerical examples are used to demonstrate the effectiveness and robustness of the presented methods. As an example application, an implementation of multichannel digital communication is proposed, where multiparameter modulation is used to simultaneously transmit more than one digital message. From a theoretical perspective, such an encoding increases the difficulty to directly read out the message from the transmitted signal and decreases the implementation cost.
Collapse
Affiliation(s)
- Debin Huang
- Department of Mathematics, Shanghai University, Shanghai 200444, People's Republic of China.
| | | | | |
Collapse
|
10
|
Tao C, Zhang Y, Jiang JJ. Extracting Physiologically Relevant Parameters of Vocal Folds From High-Speed Video Image Series. IEEE Trans Biomed Eng 2007; 54:794-801. [PMID: 17518275 DOI: 10.1109/tbme.2006.889182] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
In this paper, a new method is proposed to extract the physiologically relevant parameters of the vocal fold mathematic model including masses, spring constants and damper constants from high-speed video (HSV) image series. This method uses a genetic algorithm to optimize the model parameters until the model and the realistic vocal folds have similar dynamic behavior. Numerical experiments theoretically test the validity of the proposed parameter estimation method. Then the validated method is applied to extract the physiologically relevant parameters from the glottal area series measured by HSV in an excised larynx model. With the estimated parameters, the vocal fold model accurately describes the vibration of the observed vocal folds. Further studies show that the proposed parameter estimation method can successfully detect the increase of longitudinal tension due to the vocal fold elongation from the glottal area signal. These results imply the potential clinical application of this method in inspecting the tissue properties of vocal fold.
Collapse
Affiliation(s)
- Chao Tao
- Department of Surgery, Division of Otolaryngology Head and Neck Surgery, University of Wisconsin Medical School, Madison, WI 53792-7375, USA.
| | | | | |
Collapse
|
11
|
Zhang Y, Tao C, Jiang JJ. Theoretical and experimental studies of parameter estimation based on chaos feedback synchronization. CHAOS (WOODBURY, N.Y.) 2006; 16:043122. [PMID: 17199400 DOI: 10.1063/1.2404356] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/13/2023]
Abstract
In this paper, we perform theoretical and experimental studies of parameter estimation based on chaos feedback synchronization. The Chua chaotic system is applied as the original system, and its time series of a single variable is applied as a feedback variable to drive a simulative system. The simplex method is employed to minimize the synchronization error in the simulative system to iteratively approach the original system parameters. Furthermore, the effects of feedback function, feedback weight, and noise are investigated. The parameter estimation method based on feedback synchronization shows its robustness to noise perturbation. Finally, the circuit experiment of parameter estimation is implemented in order to examine its practical applicability. The original Chua circuit parameters can be well approached from its recorded time series of a single variable. The results show the parameter estimation method based on feedback synchronization may be capable of estimating real system parameters from a chaotic time series.
Collapse
Affiliation(s)
- Yu Zhang
- Department of Surgery, Division of Otolaryngology Head and Neck Surgery, University of Wisconsin Medical School, Madison, Wisconsin 53792-7375, USA.
| | | | | |
Collapse
|