1
|
Tanchip C, Guarin DL, McKinlay S, Barnett C, Kalra S, Genge A, Korngut L, Green JR, Berry J, Zinman L, Yadollahi A, Abrahao A, Yunusova Y. Validating Automatic Diadochokinesis Analysis Methods Across Dysarthria Severity and Syllable Task in Amyotrophic Lateral Sclerosis. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2022; 65:940-953. [PMID: 35171700 PMCID: PMC9150739 DOI: 10.1044/2021_jslhr-21-00503] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/21/2021] [Revised: 11/19/2021] [Accepted: 11/20/2021] [Indexed: 06/14/2023]
Abstract
PURPOSE Oral diadochokinesis (DDK) is a standard dysarthria assessment task. To extract automatic and semi-automatic DDK measurements, numerous DDK analysis algorithms based on acoustic signal processing are available, including amplitude based, spectral based, and hybrid. However, these algorithms have been predominantly validated in individuals with no perceptible to mild dysarthria. The behavior of these algorithms across dysarthria severity is largely unknown. Likewise, these algorithms have not been tested equally for various syllable types. The goal of this study was to evaluate the performance of five common DDK algorithms as a function of dysarthria severity, considering syllable types. METHOD We analyzed 282 DDK recordings of /ba/, /pa/, and /ta/ from 145 participants with amyotrophic lateral sclerosis. Recordings were stratified into mild, moderate, or severe dysarthria groups based on individual performance on the Speech Intelligibility Test. Analysis included manual and automatic estimation of the number of syllables, DDK rate, and cycle-to-cycle temporal variability (cTV). Validation metrics included Bland-Altman mixed-effects limits of agreement between manual and automatic syllable counts, recall and precision between manual and automatic syllable boundary detection, and Kendall's tau-b correlations between manual and algorithm-detected DDK rate and cTV. RESULTS The amplitude-based algorithm (absolute energy) yielded the strongest correlations with manual analysis across all severity groups for DDK rate (τ b = 0.7-0.84) and cTV (τ b = 0.7-0.84) and the narrowest limits of agreement (-5.92 to 7.12 syllable difference). Moreover, this algorithm also provided the highest mean recall and precision across severity groups for /ba/ and /pa/, but with significantly more variation for/ta/. CONCLUSIONS Algorithms based on signal energy analysis appeared to be the most robust for DDK analysis across dysarthria severity and syllable types; however, it remains prone to error against severe dysarthria and alveolar syllable context. Further development is needed to address this important issue.
Collapse
Affiliation(s)
- Chelsea Tanchip
- Department of Speech-Language Pathology, Rehabilitation Sciences Institute, University of Toronto, Ontario, Canada
| | - Diego L. Guarin
- Department of Biomedical Engineering, Florida Institute of Technology, Melbourne
| | - Scotia McKinlay
- Department of Speech-Language Pathology, Rehabilitation Sciences Institute, University of Toronto, Ontario, Canada
| | - Carolina Barnett
- Division of Neurology, Department of Medicine, University of Toronto and University Health Network, Ontario, Canada
| | - Sanjay Kalra
- Neuroscience and Mental Health Institute, University of Alberta, Edmonton, Canada
- Division of Neurology, University of Alberta, Edmonton, Canada
| | - Angela Genge
- Clinical Research Unit, Montreal Neurological Institute & Hospital, and Department of Neurology and Neurosurgery, McGill University, Québec, Canada
| | - Lawrence Korngut
- Department of Clinical Neurosciences, Hotchkiss Brain Institute, University of Calgary, Alberta, Canada
| | - Jordan R. Green
- Department of Communication Sciences and Disorders, MGH Institute of Health Professions, Boston, MA
| | - James Berry
- Department of Neurology, Massachusetts General Hospital, Boston
| | - Lorne Zinman
- Division of Neurology, Department of Medicine, Sunnybrook Health Sciences Centre, University of Toronto, Ontario, Canada
- Hurvitz Brain Sciences Program, Sunnybrook Research Institute, Toronto, Ontario, Canada
| | - Azadeh Yadollahi
- KITE, Toronto Rehabilitation Institute, University Health Network, Ontario, Canada
- Institute of Biomedical Engineering, University of Toronto, Ontario, Canada
| | - Agessandro Abrahao
- Division of Neurology, Department of Medicine, Sunnybrook Health Sciences Centre, University of Toronto, Ontario, Canada
- Hurvitz Brain Sciences Program, Sunnybrook Research Institute, Toronto, Ontario, Canada
| | - Yana Yunusova
- Department of Speech-Language Pathology, Rehabilitation Sciences Institute, University of Toronto, Ontario, Canada
- Hurvitz Brain Sciences Program, Sunnybrook Research Institute, Toronto, Ontario, Canada
- KITE, Toronto Rehabilitation Institute, University Health Network, Ontario, Canada
| |
Collapse
|
2
|
Calvache C, Solaque L, Velasco A, Peñuela L. Biomechanical Models to Represent Vocal Physiology: A Systematic Review. J Voice 2021; 37:465.e1-465.e18. [PMID: 33678534 DOI: 10.1016/j.jvoice.2021.02.014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2020] [Revised: 01/26/2021] [Accepted: 02/02/2021] [Indexed: 11/24/2022]
Abstract
Biomechanical modeling allows obtaining information on physical phenomena that cannot be directly observed. This study aims to review models that represent voice production. A systematic review of the literature was conducted using PubMed/Medline, SCOPUS, and IEEE Xplore databases. To select the papers, we used the protocol PRISMA Statement. A total of 53 publications were included in this review. This article considers a taxonomic classification of models found in the literature. We propose four categories in the taxonomy: (1) Models representing the Source (Vocal folds); (2) Models representing the Filter (Vocal Tract); (3) Models representing the Source - Filter Interaction; and (4) Models representing the Airflow - Source Interaction. We include a bibliographic analysis with the evolution of the publications per category. We provide an analysis of the number as well of publications in journals per year. Moreover, we present an analysis of the term occurrence and its frequency of usage, as found in the literature. In each category, different types of vocal production models are mentioned and analyzed. The models account for the analysis of evidence about aerodynamic, biomechanical, and acoustic phenomena and their correlation with the physiological processes involved in the production of the human voice. This review gives an insight into the state of the art related to the mathematical modeling of voice production, analyzed from the viewpoint of vocal physiology.
Collapse
Affiliation(s)
- Carlos Calvache
- Vocology Center, Bogotá, Colombia; Department of Mechatronics Engineering, Universidad Militar Nueva Granada, Bogotá, Colombia.
| | - Leonardo Solaque
- Department of Mechatronics Engineering, Universidad Militar Nueva Granada, Bogotá, Colombia
| | - Alexandra Velasco
- Department of Mechatronics Engineering, Universidad Militar Nueva Granada, Bogotá, Colombia
| | - Lina Peñuela
- Department of Mechatronics Engineering, Universidad Militar Nueva Granada, Bogotá, Colombia
| |
Collapse
|
3
|
Yang S, Wang F, Yang L, Xu F, Luo M, Chen X, Feng X, Zou X. The physical significance of acoustic parameters and its clinical significance of dysarthria in Parkinson's disease. Sci Rep 2020; 10:11776. [PMID: 32678256 PMCID: PMC7366911 DOI: 10.1038/s41598-020-68754-0] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2020] [Accepted: 06/23/2020] [Indexed: 11/09/2022] Open
Abstract
Dysarthria is universal in Parkinson’s disease (PD) during disease progression; however, the quality of vocalization changes is often ignored. Furthermore, the role of changes in the acoustic parameters of phonation in PD patients remains unclear. We recruited 35 PD patients and 26 healthy controls to perform single, double, and multiple syllable tests. A logistic regression was performed to differentiate between protective and risk factors among the acoustic parameters. The results indicated that the mean f0, max f0, min f0, jitter, duration of speech and median intensity of speaking for the PD patients were significantly different from those of the healthy controls. These results reveal some promising indicators of dysarthric symptoms consisting of acoustic parameters, and they strengthen our understanding about the significance of changes in phonation by PD patients, which may accelerate the discovery of novel PD biomarkers.
Collapse
Affiliation(s)
- Shu Yang
- College of Medical Information Engineering, Chengdu University of Traditional Chinese Medicine, Chengdu, 610072, Sichuan, China.,Department of Public Health, Chengdu Medical College, Chengdu, 610500, Sichuan, China
| | - Fengbo Wang
- Department of Rehabilitation, First Affiliated Hospital of Chengdu Medical College, Chengdu, 610500, Sichuan, China
| | - Liqiong Yang
- Department of Pharmacy, Chengdu Medical College, Chengdu, 610500, Sichuan, China
| | - Fan Xu
- Department of Public Health, Chengdu Medical College, Chengdu, 610500, Sichuan, China
| | - Man Luo
- Department of Neurology, First Affiliated Hospital of Chengdu Medical College, Chengdu, 610500, Sichuan, China
| | - Xiaqing Chen
- Department of Neurology, First Affiliated Hospital of Chengdu Medical College, Chengdu, 610500, Sichuan, China
| | - Xixi Feng
- Department of Public Health, Chengdu Medical College, Chengdu, 610500, Sichuan, China.
| | - Xianwei Zou
- Department of Neurology, First Affiliated Hospital of Chengdu Medical College, Chengdu, 610500, Sichuan, China.
| |
Collapse
|
4
|
Abstract
This review provides a comprehensive compilation, from a digital image processing point of view of the most important techniques currently developed to characterize and quantify the vibration behaviour of the vocal folds, along with a detailed description of the laryngeal image modalities currently used in the clinic. The review presents an overview of the most significant glottal-gap segmentation and facilitative playbacks techniques used in the literature for the mentioned purpose, and shows the drawbacks and challenges that still remain unsolved to develop robust vocal folds vibration function analysis tools based on digital image processing.
Collapse
|
5
|
Lechien J, Blecic S, Huet K, Delvaux V, Piccaluga M, Roland V, Harmegnies B, Saussez S. Voice quality outcomes of idiopathic Parkinson's disease medical treatment: A systematic review. Clin Otolaryngol 2018; 43:882-903. [DOI: 10.1111/coa.13082] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/28/2018] [Indexed: 11/30/2022]
Affiliation(s)
- J.R. Lechien
- Laboratory of Anatomy and Cell Biology; Faculty of Medicine; UMONS Research Institute for Health Sciences and Technology; University of Mons; Mons Belgium
- Laboratory of Phonetics; Faculty of Psychology; Research Institute for Language sciences and Technology; University of Mons; Mons Belgium
- Department of Otorhinolaryngology and Head and Neck Surgery; RHMS Baudour, EpiCURA Hospital; Baudour Belgium
| | - S. Blecic
- Department of Neurology; EpiCURA Hospital; Baudour Belgium
| | - K. Huet
- Laboratory of Phonetics; Faculty of Psychology; Research Institute for Language sciences and Technology; University of Mons; Mons Belgium
| | - V. Delvaux
- Laboratory of Phonetics; Faculty of Psychology; Research Institute for Language sciences and Technology; University of Mons; Mons Belgium
| | - M. Piccaluga
- Laboratory of Phonetics; Faculty of Psychology; Research Institute for Language sciences and Technology; University of Mons; Mons Belgium
| | - V. Roland
- Laboratory of Phonetics; Faculty of Psychology; Research Institute for Language sciences and Technology; University of Mons; Mons Belgium
| | - B. Harmegnies
- Laboratory of Phonetics; Faculty of Psychology; Research Institute for Language sciences and Technology; University of Mons; Mons Belgium
| | - S. Saussez
- Laboratory of Anatomy and Cell Biology; Faculty of Medicine; UMONS Research Institute for Health Sciences and Technology; University of Mons; Mons Belgium
- Department of Otorhinolaryngology and Head and Neck Surgery; RHMS Baudour, EpiCURA Hospital; Baudour Belgium
| |
Collapse
|
6
|
Acoustic analysis of voice signal: Comparison of four applications software. Biomed Signal Process Control 2018. [DOI: 10.1016/j.bspc.2017.09.031] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
|
7
|
Li S, Scherer RC, Fulcher LP, Wang X, Qiu L, Wan M, Wang S. Effects of Vertical Glottal Duct Length on Intraglottal Pressures and Phonation Threshold Pressure in the Uniform Glottis. J Voice 2017; 32:8-22. [PMID: 28599995 DOI: 10.1016/j.jvoice.2017.04.002] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2017] [Revised: 04/01/2017] [Accepted: 04/05/2017] [Indexed: 10/19/2022]
Abstract
According to Titze's 1988 derivations for phonation threshold pressure (PTP), there are a number of important variables that PTP depends upon. A primary one is the vertical glottal duct length T: PTP decreases if T is increased. Changing the length of T, however, may have a significant effect on other variables that PTP depends upon. This study examined the effects of lengthening the vertical uniform glottal duct on the transglottal and intraglottal pressures, and the derived transglottal pressure coefficients, for five glottal diameters (0.01, 0.02, 0.04, 0.08, and 0.16 cm) and three transglottal pressures (500, 1000, and 1500 Pa) for the uniform glottis by using the computational fluid dynamics code Fluent (for laminar, incompressible, two-dimensional flow). The results suggest the following: A longer vertical glottal duct length increases the intraglottal and transglottal pressures, and more so for smaller glottal diameters, and increases the transglottal pressure coefficient. In addition, unlike the transglottal pressure coefficient, the glottal entrance pressure coefficient is highly dependent on the vertical glottal duct length only for lower flows and Reynolds numbers, but is relatively independent of duct length, glottal diameter, and transglottal pressure above a flow value of approximately 50 cm3/s, suggesting that the entrance pressure coefficient is a relatively local phenomenon. These results suggest that the vertical glottal duct length and its effects need to be taken into consideration for vocal fold modeling, aeroacoustics purposes, and the estimation of PTP.
Collapse
Affiliation(s)
- Sheng Li
- Department of Automation, College of Control Engineering, Xijing University, Xi'an, China
| | - Ronald C Scherer
- Department of Communication Sciences and Disorders, Bowling Green State University, Bowling Green, Ohio
| | - Lewis P Fulcher
- Department of Physics and Astronomy, Bowling Green State University, Bowling Green, Ohio
| | - Xianbo Wang
- Department of Automation, College of Control Engineering, Xijing University, Xi'an, China
| | - Lijun Qiu
- Department of Automation, College of Control Engineering, Xijing University, Xi'an, China
| | - Mingxi Wan
- The Key Laboratory of Biomedical Information Engineering of Ministry of Education, Department of Biomedical Engineering, School of Life Science and Technology, Xi'an Jiaotong University, Xi'an, China.
| | - Supin Wang
- The Key Laboratory of Biomedical Information Engineering of Ministry of Education, Department of Biomedical Engineering, School of Life Science and Technology, Xi'an Jiaotong University, Xi'an, China
| |
Collapse
|
8
|
Orozco-Arroyave JR, Belalcazar-Bolanos EA, Arias-Londono JD, Vargas-Bonilla JF, Skodda S, Rusz J, Daqrouq K, Honig F, Noth E. Characterization Methods for the Detection of Multiple Voice Disorders: Neurological, Functional, and Laryngeal Diseases. IEEE J Biomed Health Inform 2015; 19:1820-8. [DOI: 10.1109/jbhi.2015.2467375] [Citation(s) in RCA: 79] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
|
9
|
Aerodynamic measures of glottal function: what extra can they tell us and how do they guide management? Curr Opin Otolaryngol Head Neck Surg 2015; 22:450-4. [PMID: 25254405 DOI: 10.1097/moo.0000000000000107] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
PURPOSE OF REVIEW This article will define the major advances in laryngeal aerodynamics research from recent evidence-based literature. RECENT FINDINGS Recently published research focuses on new applications of aerodynamic parameters to improve patient diagnosis and outcomes, as well as further elucidating the mechanisms of phonation using computational modeling and excised larynges. SUMMARY Although there is an extensive amount of research on improving the diagnosis and treatment of voice disorders using aerodynamics, the majority of recent literature lacks any conclusive evidence on new methods for use in the clinic; further research in these is needed. The best practices for resonance tube phonation in water and semi-occluded voice therapy are being investigated, as is the exact mechanism by which glottal airflow interacts with vocal folds to produce phonation. It is recommended that clinicians evaluate patients with Parkinson's disease on the basis of airflow declination and lung volume expended per syllable to avoid dependence on an acoustic signal. In addition, advances in modeling laryngeal disorders and structures will contribute to future research into treatments and diagnosis. Now that the groundwork has been laid, it is crucial to begin evaluating such techniques in patient populations.
Collapse
|
10
|
Abstract
The aim of the study was to discuss physiology and pathology of speech and review of the literature on speech disorders in Parkinson disease. Additionally, the most effective methods to diagnose the speech disorders in Parkinson disease were also stressed. Afterward, articulatory, respiratory, acoustic and pragmatic factors contributing to the exacerbation of the speech disorders were discussed. Furthermore, the study dealt with the most important types of speech treatment techniques available (pharmacological and behavioral) and a significance of Lee Silverman Voice Treatment was highlighted.
Collapse
|
11
|
Lucero JC, Lourenço KG, Hermant N, Van Hirtum A, Pelorson X. Effect of source-tract acoustical coupling on the oscillation onset of the vocal folds. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2012; 132:403-11. [PMID: 22779487 DOI: 10.1121/1.4728170] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/09/2023]
Abstract
This paper analyzes the interaction between the vocal folds and vocal tract at phonation onset due to the acoustical coupling between both systems. Data collected from a mechanical replica of the vocal folds show that changes in vocal tract length induce fluctuations in the oscillation threshold values of both subglottal pressure and frequency. Frequency jumps and maxima of the threshold pressure occur when the oscillation frequency is slightly above a vocal tract resonance. Both the downstream and upstream vocal tracts may produce those same effects. A simple mathematical model is next proposed, based on a lumped description of tissue mechanics, quasi-steady flow and one-dimensional acoustics. The model shows that the frequency jumps are produced by saddle-node bifurcations between limit cycles forming a classical pattern of a cusp catastrophe. The transition from a low frequency oscillation to a high frequency one may be achieved through two different paths: in case of a large acoustical coupling (narrow vocal tract) or high subglottal pressure, the bifurcations are crossed, which causes a frequency jump with a hysteresis loop. By reducing the acoustical coupling (wide vocal tract) or the subglottal pressure, a path around the bifurcations may be followed with a smooth frequency variation.
Collapse
Affiliation(s)
- Jorge C Lucero
- Department of Computer Science, University of Brasilia, Brasilia, DF 70910-900, Brazil.
| | | | | | | | | |
Collapse
|
12
|
Choi SH, Zhang Y, Jiang JJ, Bless DM, Welham NV. Nonlinear dynamic-based analysis of severe dysphonia in patients with vocal fold scar and sulcus vocalis. J Voice 2012; 26:566-76. [PMID: 22516315 DOI: 10.1016/j.jvoice.2011.09.006] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2011] [Accepted: 09/15/2011] [Indexed: 11/24/2022]
Abstract
OBJECTIVE The primary goal of this study was to evaluate a nonlinear dynamic approach to the acoustic analysis of dysphonia associated with vocal fold scar and sulcus vocalis. STUDY DESIGN Case-control study. METHODS Acoustic voice samples from scar/sulcus patients and age-/sex-matched controls were analyzed using correlation dimension (D2) and phase plots, time-domain based perturbation indices (jitter, shimmer, signal-to-noise ratio [SNR]), and an auditory-perceptual rating scheme. Signal typing was performed to identify samples with bifurcations and aperiodicity. RESULTS Type 2 and 3 acoustic signals were highly represented in the scar/sulcus patient group. When data were analyzed irrespective of signal type, all perceptual and acoustic indices successfully distinguished scar/sulcus patients from controls. Removal of type 2 and 3 signals eliminated the previously identified differences between experimental groups for all acoustic indices except D2. The strongest perceptual-acoustic correlation in our data set was observed for SNR and the weakest correlation was observed for D2. CONCLUSIONS These findings suggest that D2 is inferior to time-domain based perturbation measures for the analysis of dysphonia associated with scar/sulcus; however, time-domain based algorithms are inherently susceptible to inflation under highly aperiodic (ie, type 2 and 3) signal conditions. Auditory-perceptual analysis, unhindered by signal aperiodicity, is therefore a robust strategy for distinguishing scar/sulcus patient voices from normal voices. Future acoustic analysis research in this area should consider alternative (e.g., frequency- and quefrency-domain based) measures alongside additional nonlinear approaches.
Collapse
Affiliation(s)
- Seong Hee Choi
- Department of Audiology and Speech-Language Pathology, Catholic University of Daegu, Gyeongsan, Korea
| | | | | | | | | |
Collapse
|
13
|
Erath BD, Zañartu M, Peterson SD, Plesniak MW. Nonlinear vocal fold dynamics resulting from asymmetric fluid loading on a two-mass model of speech. CHAOS (WOODBURY, N.Y.) 2011; 21:033113. [PMID: 21974648 DOI: 10.1063/1.3615726] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
Nonlinear vocal fold dynamics arising from asymmetric flow formations within the glottis are investigated using a two-mass model of speech with asymmetric vocal fold tensioning, representative of unilateral vocal fold paralysis. A refined theoretical boundary-layer flow solver is implemented to compute the intraglottal pressures, providing a more realistic description of the flow than the standard one-dimensional, inviscid Bernoulli flow solution. Vocal fold dynamics are investigated for subglottal pressures of 0.6 < p(s) < 1.5 kPa and tension asymmetries of 0.5 < Q < 0.8. As tension asymmetries become pronounced the asymmetric flow incites nonlinear behavior in the vocal fold dynamics at subglottal pressures that are associated with normal speech, behavior that is not captured with standard Bernoulli flow solvers. Regions of bifurcation, coexistence of solutions, and chaos are identified.
Collapse
Affiliation(s)
- Byron D Erath
- Department of Mechanical and Aerospace Engineering, The George Washington University, Washington DC 20052, USA.
| | | | | | | |
Collapse
|
14
|
Zhang Y, Regner MF, Jiang JJ. Theoretical modeling and experimental high-speed imaging of elongated vocal folds. IEEE Trans Biomed Eng 2010; 58:2725-31. [PMID: 21118763 DOI: 10.1109/tbme.2010.2095012] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
In this paper, the role of vocal fold elongation in governing glottal movement dynamics was theoretically and experimentally investigated. A theoretical model was first proposed to incorporate vocal fold elongation into the two-mass model. This model predicted the direct and nondirect components of the glottal time series as a function of vocal fold elongation. Furthermore, high-speed digital imaging was applied in excised larynx experiments to visualize vocal fold vibrations with variable vocal fold elongation from -10% to 50% and subglottal pressures of 18- and 24-cm H(2)O. Comparison between theoretical model simulations and experimental observations showed good agreement. A relative maximum was seen in the nondirect component of glottal area, suggesting that an optimal elongation could maximize the vocal fold vibratory power. However, sufficiently large vocal fold elongations caused the nondirect component to approach zero and the direct component to approach a constant. These results showed that vocal fold elongation plays an important role in governing the dynamics of glottal area movement and validated the applicability of the proposed theoretical model and high-speed imaging to investigate laryngeal activity.
Collapse
Affiliation(s)
- Yu Zhang
- Laboratory of Underwater Acoustic Communication and Marine Information Technology of the Ministry of Education, College of Oceanography and Environmental Science, Xiamen University, Xiamen 361005, China.
| | | | | |
Collapse
|
15
|
Murugappan S, Boyce S, Khosla S, Kelchner L, Gutmark E. Acoustic characteristics of phonation in "wet voice" conditions. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2010; 127:2578-89. [PMID: 20370039 PMCID: PMC2865707 DOI: 10.1121/1.3308478] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/11/2009] [Revised: 01/11/2010] [Accepted: 01/13/2010] [Indexed: 05/29/2023]
Abstract
A perceptible change in phonation characteristics after a swallow has long been considered evidence that food and/or drink material has entered the laryngeal vestibule and is on the surface of the vocal folds as they vibrate. The current paper investigates the acoustic characteristics of phonation when liquid material is present on the vocal folds, using ex vivo porcine larynges as a model. Consistent with instrumental examinations of swallowing disorders or dysphagia in humans, three liquids of different Varibar viscosity ("thin liquid," "nectar," and "honey") were studied at constant volume. The presence of materials on the folds during phonation was generally found to suppress the higher frequency harmonics and generate intermittent additional frequencies in the low and high end of the acoustic spectrum. Perturbation measures showed a higher percentage of jitter and shimmer when liquid material was present on the folds during phonation, but they were unable to differentiate statistically between the three fluid conditions. The finite correlation dimension and positive Lyapunov exponent measures indicated that the presence of materials on the vocal folds excited a chaotic system. Further, these measures were able to reliably differentiate between the baseline and different types of liquid on the vocal folds.
Collapse
Affiliation(s)
- Shanmugam Murugappan
- Department of Otolaryngology, Head and Neck Surgery, University of Cincinnati Medical Center, 231 Albert B. Sabin Way, Cincinnati, OH 45267-0528, USA.
| | | | | | | | | |
Collapse
|
16
|
Rosa JDC, Cielo CA, Cechella C. Função fonatória em pacientes com doença de Parkinson: uso de instrumento de sopro. REVISTA CEFAC 2009. [DOI: 10.1590/s1516-18462009000200016] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
OBJETIVO: verificar o efeito da terapia com instrumento de sopro na função fonatória de pacientes com doença de Parkinson, com enfoque nas avaliações laríngeas, acústicas, respiratórias e de intensidade vocal. MÉTODOS: esta pesquisa é um estudo de caso coletivo e prospectivo. Dois sujeitos foram submetidos a exame videolaringoscópico, avaliação da função respiratória, gravação de amostras da voz, medição da intensidade vocal e terapia. Para a análise acústica foi utilizado o programa Multidimensional Voice Program advanced. A análise dos dados foi ponderada, uma vez que a pesquisa é um estudo de caso coletivo. RESULTADOS: os resultados mostraram que as modificações respiratórias, acústicas e de intensidade vocal foram similares e positivas nos dois sujeitos estudados. CONCLUSÃO: a terapia realizada com o instrumento de sopro pode proporcionar maior eficiência na adução glótica, e melhor movimentação e controle respiratório, propiciando o uso mais efetivo do ar para fonação, com melhora da qualidade vocal, expressa pela redução das medidas de ruído, maior estabilidade fonatória e aumento da intensidade vocal.
Collapse
|
17
|
Tao C, Jiang JJ. Effects of mucosal loading on vocal fold vibration. CHAOS (WOODBURY, N.Y.) 2009; 19:023113. [PMID: 19566248 PMCID: PMC2832046 DOI: 10.1063/1.3120293] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/15/2008] [Accepted: 03/24/2009] [Indexed: 05/28/2023]
Abstract
A chain model was proposed in this study to examine the effects of mucosal loading on vocal fold vibration. Mucosal loading was defined as the loading caused by the interaction between the vocal folds and the surrounding tissue. In the proposed model, the vocal folds and the surrounding tissue were represented by a series of oscillators connected by a coupling spring. The lumped masses, springs, and dampers of the oscillators modeled the tissue properties of mass, stiffness, and viscosity, respectively. The coupling spring exemplified the tissue interactions. By numerically solving this chain model, the effects of mucosal loading on the phonation threshold pressure, phonation instability pressure, and energy distribution in a voice production system were studied. It was found that when mucosal loading is small, phonation threshold pressure increases with the damping constant R(r), the mass constant R(m), and the coupling constant R(mu) of mucosal loading but decreases with the stiffness constant R(k). Phonation instability pressure is also related to mucosal loading. It was found that phonation instability pressure increases with the coupling constant R(mu) but decreases with the stiffness constant R(k) of mucosal loading. Therefore, it was concluded that mucosal loading directly affects voice production.
Collapse
Affiliation(s)
- Chao Tao
- Department of Surgery, Division of Otolaryngology Head and Neck Surgery, University of Wisconsin Medical School, Madison, Wisconsin 53792-7375, USA.
| | | |
Collapse
|
18
|
Lee VS, Zhou XP, Rahn DA, Wang EQ, Jiang JJ. Perturbation and nonlinear dynamic analysis of acoustic phonatory signal in Parkinsonian patients receiving deep brain stimulation. JOURNAL OF COMMUNICATION DISORDERS 2008; 41:485-500. [PMID: 18433765 PMCID: PMC3313602 DOI: 10.1016/j.jcomdis.2008.02.001] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/10/2006] [Revised: 01/22/2008] [Accepted: 02/23/2008] [Indexed: 05/26/2023]
Abstract
UNLABELLED Nineteen PD patients who received deep brain stimulation (DBS), 10 non-surgical (control) PD patients, and 11 non-pathologic age- and gender-matched subjects performed sustained vowel phonations. The following acoustic measures were obtained on the sustained vowel phonations: correlation dimension (D2), percent jitter, percent shimmer, SNR, F0, vF0, and vAm. The results indicated the following: The mean D2 of control PD patients was significantly higher than the mean D2 of non-pathologic subjects and patients who received deep brain stimulation. These results suggest an improvement in PD voice in treated patients. Many PD vocal samples in this study have type 2 signals containing subharmonics that may not be suitable for perturbation analysis but are suitable for nonlinear dynamic analysis, making the D2 results more reliable. These findings show that DBS may provide measurable improvement in patients with severe vocal impairment. LEARNING OUTCOMES Readers will be able to: (1) identify the advantages of nonlinear dynamic analysis as a clinical tool to evaluate the aperiodic voice commonly found in patients with Parkinson's disease, (2) describe in general the method of obtaining a correlation dimension measure from a voice sample and the significance of this measure in terms of specific voice signal properties, (3) consider the preliminary implications from nonlinear dynamic analysis of a positive DBS effect on Parkinsonian voice and the potential for further investigations using nonlinear dynamic analysis on the influence of gender, severity of disease, and combined treatments on Parkinsonian voice improvement.
Collapse
Affiliation(s)
- Victoria S. Lee
- Department of Surgery, Division of Otolaryngology, University of Wisconsin-Madison, United States
| | - Xiao Ping Zhou
- Department of Neurosurgery, Shanghai Second Military Medical University, PR China
| | - Douglas A. Rahn
- Department of Surgery, Division of Otolaryngology, University of Wisconsin-Madison, United States
| | - Emily Q. Wang
- Department of Communication Disorders and Sciences & Department of Otolaryngology and Bronchoeso-phagology, Rush University Medical Center, United States
| | - Jack J. Jiang
- Department of Surgery, Division of Otolaryngology, University of Wisconsin-Madison, United States
| |
Collapse
|
19
|
Zhang Y, Jiang JJ. Nonlinear dynamic mechanism of vocal tremor from voice analysis and model simulations. JOURNAL OF SOUND AND VIBRATION 2008; 316:248-262. [PMID: 22505778 PMCID: PMC3325097 DOI: 10.1016/j.jsv.2008.02.026] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
Nonlinear dynamic analysis and model simulations are used to study the nonlinear dynamic characteristics of vocal folds with vocal tremor, which can typically be characterized by low frequency modulation and aperiodicity. Tremor voices from patients with disorders such as paresis, Parkinson's disease, hyperfunction, and adductor spasmodic dysphonia show low-dimensional characteristics, differing from random noise. Correlation dimension analysis statistically distinguishes tremor voices from normal voices. Furthermore, a nonlinear tremor model is proposed to study the vibrations of the vocal folds with vocal tremor. Fractal dimensions and positive Lyapunov exponents demonstrate the evidence of chaos in the tremor model, where amplitude and frequency play important roles in governing vocal fold dynamics. Nonlinear dynamic voice analysis and vocal fold modeling may provide a useful set of tools for understanding the dynamic mechanism of vocal tremor in patients with laryngeal diseases.
Collapse
Affiliation(s)
- Yu Zhang
- Department of Surgery, Division of Otolaryngology Head and Neck Surgery, University of Wisconsin Medical School, Madison, WI 53792-7375
| | | |
Collapse
|
20
|
Zhang Y, Jiang JJ, Tao C, Bieging E, MacCallum JK. Quantifying the complexity of excised larynx vibrations from high-speed imaging using spatiotemporal and nonlinear dynamic analyses. CHAOS (WOODBURY, N.Y.) 2007; 17:043114. [PMID: 18163778 DOI: 10.1063/1.2784384] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/25/2023]
Abstract
In this paper, we investigate the biomechanical applications of spatiotemporal analysis and nonlinear dynamic analysis to quantitatively describe regular and irregular vibrations of twelve excised larynges from high-speed image recordings. Regular vibrations show simple spatial symmetry, temporal periodicity, and discrete frequency spectra, while irregular vibrations show complex spatiotemporal plots, aperiodic time series, and broadband spectra. Furthermore, the global entropy and correlation length from spatiotemporal analysis and the correlation dimension from nonlinear dynamic analysis reveal a statistical difference between regular and irregular vibrations. In comparison with regular vibrations, the global entropy and correlation dimension of irregular vibrations are statistically higher, while the correlation length is significantly lower. These findings show that spatiotemporal analysis and nonlinear dynamic analysis are capable of describing the complex dynamics of vocal fold vibrations from high-speed imaging and may potentially be helpful for understanding disordered behaviors in biomedical laryngeal systems.
Collapse
Affiliation(s)
- Yu Zhang
- Department of Surgery, Division of Otolaryngology Head and Neck Surgery, University of Wisconsin Medical School, Madison, Wisconsin 53792-7375, USA.
| | | | | | | | | |
Collapse
|
21
|
Rahn DA, Chou M, Jiang JJ, Zhang Y. Phonatory Impairment in Parkinson's Disease: Evidence from Nonlinear Dynamic Analysis and Perturbation Analysis. J Voice 2007; 21:64-71. [PMID: 16377130 DOI: 10.1016/j.jvoice.2005.08.011] [Citation(s) in RCA: 53] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2005] [Accepted: 08/11/2005] [Indexed: 11/30/2022]
Abstract
Many persons with Parkinson's disease (PD) will eventually experience vocal impairment as their condition advances. Using standard perturbation analyses (parameters like jitter and shimmer) to measure fluctuations in phonatory signal may inhibit researchers from recognizing severely disordered patterns that seem to be present in the voices of some PD patients. Nonlinear dynamic analysis can quantify these aperiodic patterns, which indicate severe pathology that is usually characterized perceptually by hoarseness. Here, sustained vowel phonations of a heterogeneous group of PD subjects (20 women and 21 men) were compared with those of a control group (22 women and 18 men) based on results of nonlinear dynamic analyses (D(2)) and perturbation analyses. Results showed PD subjects as a whole to have significantly higher D(2) values than control subjects (P = 0.016), which indicates increased signal complexity in PD vocal pathology. Differences in the comparison of these two groups were significant in jitter (P = 0.014) but nonsignificant in shimmer (P = 0.695). Furthermore, the performance on these three measures was affected by subject sex. Nonlinear dynamic analysis showed significantly higher D(2) in the female PD group than in the female control group (P = 0.001), but jitter and shimmer did not show such a difference. The male PD group had statistically higher jitter than the male control group (P = 0.036), but these groups did not differ in D(2) or shimmer. Overall, nonlinear dynamic analysis may be a valuable method for the diagnosis of Parkinsonian laryngeal pathology.
Collapse
Affiliation(s)
- Douglas A Rahn
- Department of Surgery, Division of Otolaryngology--HNS, University of Wisconsin Medical School, Madison, Wisconsin 53706, USA
| | | | | | | |
Collapse
|
22
|
Zhang Y, Tao C, Jiang JJ. Parameter estimation of an asymmetric vocal-fold system from glottal area time series using chaos synchronization. CHAOS (WOODBURY, N.Y.) 2006; 16:023118. [PMID: 16822021 DOI: 10.1063/1.2203092] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/10/2023]
Abstract
In this paper, we apply an iterative parameter adaption scheme based on chaos synchronization to estimate system parameters of the asymmetric vocal folds from glottal area time series. The original asymmetric vocal-fold system associated with recurrent laryngeal paralysis shows chaotic vibrations with positive Lyapunov exponents. Aperiodic glottal area time series from the original system will be applied as the feedback variable coupling the simulative and the original vocal-fold systems. The parameter adaption technique based on chaos synchronization is employed to manipulate the simulative system parameters. The chaotic vibrations, system parameters, and the bifurcation diagram of the original vocal-fold system can be exactly reproduced in the simulative system, and the two chaotic systems can be synchronized. Furthermore, the effects of noise, sampling rate, and equation difference due to nonlinear spring terms on vocal-fold parameter estimations are investigated. Despite large noise perturbations, large equation differences, and low sampling rate, the parameter adaption scheme can effectively estimate the original vocal-fold system parameters. This study provides a theoretical base to apply chaos synchronization to estimate the vocal-fold system parameters from the glottal area data and show its potential application in laryngeal physiology.
Collapse
Affiliation(s)
- Yu Zhang
- Department of Surgery, Division of Otolaryngology Head and Neck Surgery, University of Wisconsin Medical School, Madison, Wisconsin 53792-7375, USA
| | | | | |
Collapse
|