1
|
Wu L, Zhang Z. Computational Study of the Impact of Dehydration-Induced Vocal Fold Stiffness Changes on Voice Production. J Voice 2024; 38:836-843. [PMID: 35260287 DOI: 10.1016/j.jvoice.2022.02.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2021] [Revised: 01/27/2022] [Accepted: 02/02/2022] [Indexed: 10/18/2022]
Abstract
OBJECTIVE Systemic vocal fold dehydration is known to increase vocal fold stiffness, which has been hypothesized to have important effect on voice production. However, it remains unclear whether the dehydration-induced vocal fold stiffness changes can have a noticeable impact on phonation, particularly in normal phonation conditions. The goal of this study was to investigate the impact of vocal fold stiffness changes due to vocal fold systemic dehydration and its significance in daily communication. METHODS Parametric computational simulation using a three-dimensional vocal fold model, in which the vocal fold stiffness was varied as a function of systemic dehydration levels based on previously-reported experimental data. RESULTS The results showed that systemic dehydration had significant effects on voice production only at high levels of dehydration, at which dehydration increased the phonation threshold pressure and fundamental frequency, and decreased glottal opening area, vocal intensity and glottal efficiency. The effect depended mainly on the overall dehydration level but was also slightly affected by the dehydration distribution and muscular control. However, for dehydration levels typical of normal phonation conditions, the effect was negligible. CONCLUSIONS The results indicated that dehydration-induced vocal fold stiffness change likely is not an important mechanism through which vocal fold systemic dehydration affects voice production. Nevertheless, a large decrease in glottal efficiency implies a possible perceived increase of vocal effort under a realistic dehydration condition.
Collapse
Affiliation(s)
- Liang Wu
- Key Laboratory of Biomedical Information Engineering of Ministry of Education, School of Life Science and Technology, Xi'an Jiaotong University, Xi'an, PR China.
| | - Zhaoyan Zhang
- Department of Head and Neck Surgery, University of California, Los Angeles, California
| |
Collapse
|
2
|
Chung HR, Reddy NK, Manzoor D, Schlegel P, Zhang Z, Chhetri DK. Histologic Examination of Vocal Fold Mucosal Wave and Vibration. Laryngoscope 2024; 134:264-271. [PMID: 37522475 PMCID: PMC10828106 DOI: 10.1002/lary.30928] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2023] [Revised: 05/29/2023] [Accepted: 07/18/2023] [Indexed: 08/01/2023]
Abstract
OBJECTIVES Despite gross anatomic and histologic differences between human and canine vocal folds, similar wave patterns have been described yet not fully characterized. We reconstructed vocal fold (VF) vibration in a canine hemilarynx and performed histologic examination of the same vocal fold. We demonstrate comparable wave patterns while exploring the importance of certain anatomic architectures. METHODS An in vivo canine hemilarynx was phonated against a glass prism at low and high muscle activation conditions. Vibration was captured using high-speed video, and trajectories of VF medial surface tattooed landmarks were 3D-reconstructed. The method of empirical eigenfunctions was used to capture the essential dynamics of vibratory movement. Histologic examination of the hemilarynx was performed. RESULTS Oscillation patterns were highly similar between the in vivo canine and previous reports of ex vivo human models. The two most dominant eigenfunctions comprised over 90% of total variance of movement, representing opening/closing and convergent/divergent movement patterns, respectively. We demonstrate a vertical phase difference during the glottal cycle. The time delay between the inferior and superior VF was greater during opening than closing for both activation conditions. Histological examination of canine VF showed not only a thicker lamina propria layer superiorly but also a distinct pattern of thyroarytenoid muscle fibers and fascicles as described in human studies. CONCLUSIONS Histologic and vibratory examination of the canine vocal fold demonstrated human vocal fold vibratory patterns despite certain microstructural differences. This study suggests that the multilayered lamina propria may not be fundamental to vibratory patterns necessary for human-like voice production. LEVEL OF EVIDENCE NA (Basic science study) Laryngoscope, 134:264-271, 2024.
Collapse
Affiliation(s)
- Hye Rhyn Chung
- University of California, Los Angeles, David Geffen School of Medicine
| | - Neha K. Reddy
- University of California, Los Angeles, David Geffen School of Medicine
| | - Daniel Manzoor
- Department of Pathology, University of California, Los Angeles
| | - Patrick Schlegel
- Department of Head and Neck Surgery, University of California, Los Angeles
| | - Zhaoyan Zhang
- Department of Head and Neck Surgery, University of California, Los Angeles
| | - Dinesh K. Chhetri
- Department of Head and Neck Surgery, University of California, Los Angeles
| |
Collapse
|
3
|
Luizard P, Bailly L, Yousefi-Mashouf H, Girault R, Orgéas L, Henrich Bernardoni N. Flow-induced oscillations of vocal-fold replicas with tuned extensibility and material properties. Sci Rep 2023; 13:22658. [PMID: 38114547 PMCID: PMC10730560 DOI: 10.1038/s41598-023-48080-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2023] [Accepted: 11/22/2023] [Indexed: 12/21/2023] Open
Abstract
Human vocal folds are highly deformable non-linear oscillators. During phonation, they stretch up to 50% under the complex action of laryngeal muscles. Exploring the fluid/structure/acoustic interactions on a human-scale replica to study the role of the laryngeal muscles remains a challenge. For that purpose, we designed a novel in vitro testbed to control vocal-folds pre-phonatory deformation. The testbed was used to study the vibration and the sound production of vocal-fold replicas made of (i) silicone elastomers commonly used in voice research and (ii) a gelatin-based hydrogel we recently optimized to approximate the mechanics of vocal folds during finite strains under tension, compression and shear loadings. The geometrical and mechanical parameters measured during the experiments emphasized the effect of the vocal-fold material and pre-stretch on the vibration patterns and sounds. In particular, increasing the material stiffness increases glottal flow resistance, subglottal pressure required to sustain oscillations and vibratory fundamental frequency. In addition, although the hydrogel vocal folds only oscillate at low frequencies (close to 60 Hz), the subglottal pressure they require for that purpose is realistic (within the range 0.5-2 kPa), as well as their glottal opening and contact during a vibration cycle. The results also evidence the effect of adhesion forces on vibration and sound production.
Collapse
Affiliation(s)
- Paul Luizard
- Univ. Grenoble Alpes, CNRS, Grenoble INP, GIPSA-lab, Grenoble, 38000, France
- CNRS, Centrale Marseille, Aix Marseille Univ, LMA UMR 7031, Marseille, France
- Audio Communication Group, Technische Universität Berlin, Einsteinufer 17c, Berlin, 10587, Germany
| | - Lucie Bailly
- Univ. Grenoble Alpes, CNRS, Grenoble INP, 3SR, Grenoble, 38000, France
| | - Hamid Yousefi-Mashouf
- Univ. Grenoble Alpes, CNRS, Grenoble INP, GIPSA-lab, Grenoble, 38000, France
- Univ. Grenoble Alpes, CNRS, Grenoble INP, 3SR, Grenoble, 38000, France
| | - Raphaël Girault
- Univ. Grenoble Alpes, CNRS, Grenoble INP, GIPSA-lab, Grenoble, 38000, France
| | - Laurent Orgéas
- Univ. Grenoble Alpes, CNRS, Grenoble INP, 3SR, Grenoble, 38000, France
| | | |
Collapse
|
4
|
Zhang Z. The influence of sensor size on experimental measurement accuracy of vocal fold contact pressure. PROCEEDINGS OF MEETINGS ON ACOUSTICS. ACOUSTICAL SOCIETY OF AMERICA 2023; 52:060002. [PMID: 38872712 PMCID: PMC11173356 DOI: 10.1121/2.0001894] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2024]
Abstract
The vocal folds experience repeated collision during phonation. The resulting contact pressure is often considered to play an important role in vocal fold injury, and has been the focus of many experimental studies. In this study, vocal fold contact pattern and contact pressure during phonation were numerically investigated. The results show that vocal fold contact in general occurs within a horizontal strip on the medial surface, first appearing at the inferior medial surface and propagating upward. Because of the localized and travelling nature of vocal fold contact, sensors of a finite size may significantly underestimate the peak vocal fold contact pressure, particularly for vocal folds of low transverse stiffness. This underestimation also makes it difficult to identify the contact pressure peak in the intraglottal pressure waveform. These results showed that the vocal fold contact pressure reported in previous experimental studies may have significantly underestimated the actual values. It is recommended that contact pressure sensors with a diameter no greater than 0.4 mm are used in future experiments to ensure adequate accuracy in measuring the peak vocal fold contact pressure during phonation.
Collapse
Affiliation(s)
- Zhaoyan Zhang
- Department of Head and Neck Surgery, University of California, Los Angeles, 31-24 Rehabilitation Center, 1000 Veteran Ave., Los Angeles, CA 90095-1794
| |
Collapse
|
5
|
Paiva GM, Silva POC, Silva LJAD, Nascimento KA, Silva ABDVE, Abreu SRD, Almeida AAFD, Lopes LW. Spectral and cepstral measurements in women with behavioral dysphonia. Codas 2023; 36:e20220327. [PMID: 37970895 DOI: 10.1590/2317-1782/20232022327pt] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2022] [Accepted: 03/20/2023] [Indexed: 11/19/2023] Open
Abstract
PURPOSE To investigate whether there are differences in cepstral and spectral acoustic measures between women with behavioral dysphonia with and without laryngeal lesions and verify whether there is a correlation between such measures and the auditory-perceptual evaluation of voice quality. METHODS The sample comprised 78 women with behavioral dysphonia without laryngeal lesions (BDWOL) and 68 with behavioral dysphonia with laryngeal lesions (vocal nodules) (BDWL). Cepstral peak prominence (CPP), cepstral peak prominence-smoothed (CPPS), spectral decrease, and H1-H2 (difference between the amplitude of the first and second harmonics) were extracted. They were submitted to the auditory-perceptual evaluation (APE) of the grade of hoarseness (GH), roughness (RO), breathiness (BR), and strain (ST). RESULTS BDWL women had higher H1-H2 values and lower CPP and CPPS values than BDWOL women. More deviant voices had lower CPP and CPPS values. Breathy voices had lower CPP and CPPS values and higher H1-H2 values than rough ones. There was a weak negative correlation between CPP and RO, a moderate negative correlation with GH, and a strong negative correlation with BR. CPPS had a moderate negative correlation with GH, RO, and BR. H1-H2 had a weak positive correlation with BR. There was a weak positive correlation between spectral decrease and ST. CONCLUSION H1-H2, CPP, and CPPS were different between BDWOL and BDWL women. Furthermore, cepstral and spectral measures were correlated with the different APE parameters.
Collapse
|
6
|
Zhang Z. The influence of source-filter interaction on the voice source in a three-dimensional computational model of voice production. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2023; 154:2462-2475. [PMID: 37855666 PMCID: PMC10589054 DOI: 10.1121/10.0021879] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/12/2023] [Revised: 09/28/2023] [Accepted: 09/30/2023] [Indexed: 10/20/2023]
Abstract
The goal of this computational study is to quantify global effects of vocal tract constriction at various locations (false vocal folds, aryepiglottic folds, pharynx, oral cavity, and lips) on the voice source across a large range of vocal fold conditions. The results showed that while inclusion of a uniform vocal tract had notable effects on the voice source, further constricting the vocal tract only had small effects except for conditions of extreme constriction, at which constrictions at any location along the vocal tract decreased the mean and peak-to-peak amplitude of the glottal flow waveform. Although narrowing in the epilarynx increased the normalized maximum flow declination rate, vocal tract constriction in general slightly reduced the source strength and high-frequency harmonic production at the glottis, except for a limited set of vocal fold conditions (e.g., soft, long vocal folds subject to relatively high pressure). This suggests that simultaneous laryngeal and vocal tract adjustments are required to maximize source-filter interaction. While vocal tract adjustments are often assumed to improve voice production, our results indicate that such improvements are mainly due to changes in vocal tract acoustic response rather than improved voice production at the glottis.
Collapse
Affiliation(s)
- Zhaoyan Zhang
- UCLA School of Medicine, 31-24 Rehabilitation Center, 1000 Veteran Avenue, Los Angeles, California 90095-1794, USA
| |
Collapse
|
7
|
Perrine BL, Scherer RC. Using a vertical three-mass computational model of the vocal folds to match human phonation of three adult males. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2023; 154:1505-1525. [PMID: 37695295 PMCID: PMC10497319 DOI: 10.1121/10.0020847] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/30/2022] [Revised: 08/10/2023] [Accepted: 08/19/2023] [Indexed: 09/12/2023]
Abstract
Computer models of phonation are used to study various parameters that are difficult to control, measure, and observe in human subjects. Imitating human phonation by varying the prephonatory conditions of computer models offers insight into the variations that occur across human phonatory production. In the present study, a vertical three-mass computer model of phonation [Perrine, Scherer, Fulcher, and Zhai (2020). J. Acoust. Soc. Am. 147, 1727-1737], driven by empirical pressures from a physical model of the vocal folds (model M5), with a vocal tract following the design of Ishizaka and Flanagan [(1972). Bell Sys. Tech. J. 51, 1233-1268] was used to match prolonged vowels produced by three male subjects using various pitch and loudness levels. The prephonatory conditions of tissue mass and tension, subglottal pressure, glottal diameter and angle, posterior glottal gap, false vocal fold gap, and vocal tract cross-sectional areas were varied in the model to match the model output with the fundamental frequency, alternating current airflow, direct current airflow, skewing quotient, open quotient, maximum flow negative derivative, and the first three formant frequencies from the human production. Parameters were matched between the model and human subjects with an average overall percent mismatch of 4.40% (standard deviation = 6.75%), suggesting a reasonable ability of the simple low dimensional model to mimic these variables.
Collapse
Affiliation(s)
- Brittany L Perrine
- Department of Communication Sciences and Disorders, Baylor University, One Bear Place #97332, Waco, Texas 76798, USA
| | - Ronald C Scherer
- Department of Communication Sciences and Disorders, Bowling Green State University, Ridge Street, Bowling Green, Ohio 43403, USA
| |
Collapse
|
8
|
Wu L, Zhang Z. Impact of the Paraglottic Space on Voice Production in an MRI-Based Vocal Fold Model. J Voice 2023; 37:633.e15-633.e23. [PMID: 33752927 PMCID: PMC8449798 DOI: 10.1016/j.jvoice.2021.02.021] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2020] [Revised: 02/09/2021] [Accepted: 02/11/2021] [Indexed: 11/28/2022]
Abstract
OBJECTIVE While the vocal fold is in direct contact anteriorly with the thyroid cartilage, posteriorly the vocal fold connects to the thyroid cartilage through a soft tissue layer in the paraglottic space. Currently the paraglottic space is often neglected in computational models of phonation, in which a fixed boundary condition is often imposed on the lateral surface of the vocal fold. The goal of this study was to investigate the effect of the paraglottic space on voice production in an MRI-based vocal fold model, and how this effect may be counteracted by vocal fold stiffening due to laryngeal muscle activation. METHODS Parametric simulation study using an MRI-based computational vocal fold model. RESULTS The results showed that the presence of the paraglottic space increased the mean and amplitude of the glottal area waveform, decreased the phonation frequency and closed quotient. For the particular vocal fold geometry used in this study, the presence of the paraglottic space also reduced the occurrence of irregular vocal fold vibration. These effects of the paraglottic space became smaller with increasing paraglottic space stiffness and to a lesser degree with vocal fold stiffening. CONCLUSIONS The results suggest that the paraglottic space may be neglected in qualitative evaluations of normal phonation, but needs to be included in simulations of pathological phonation or vocal fold posturing.
Collapse
Affiliation(s)
- Liang Wu
- Department of Head and Neck Surgery, University of California, Los Angeles, California
| | - Zhaoyan Zhang
- Department of Head and Neck Surgery, University of California, Los Angeles, California.
| |
Collapse
|
9
|
Zhang Z. Voice Feature Selection to Improve Performance of Machine Learning Models for Voice Production Inversion. J Voice 2023; 37:479-485. [PMID: 33849760 PMCID: PMC8502179 DOI: 10.1016/j.jvoice.2021.03.004] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2020] [Revised: 02/24/2021] [Accepted: 03/01/2021] [Indexed: 11/19/2022]
Abstract
OBJECTIVE Estimation of physiological control parameters of the vocal system from the produced voice outcome has important applications in clinical management of voice disorders . Previously we developed a simulation-based neural network for estimation of vocal fold geometry, mechanical properties, and subglottal pressure from voice outcome features that characterize the acoustics of the produced voice. The goals of this study are to (1) explore the possibility of improving the estimation accuracy of physiological control parameters by including voice outcome features characterizing vocal fold vibration; and (2) identify voice feature sets that optimize both estimation accuracy and robustness to measurement noise. METHODS Feedforward neural networks are trained to solve the inversion problem of estimating the physiological control parameters of a three-dimensional body-cover vocal fold model from different sets of voice outcome features that characterize the simulated voice acoustics, glottal flow, and vocal fold vibration. A sensitivity analysis is then performed to evaluate the contribution of individual voice features to the overall performance of the neural networks in estimating the physiologic control parameters. RESULTS AND CONCLUSIONS While including voice outcome features characterizing vocal fold vibration increases estimation accuracy, it also reduces the network's robustness to measurement noise, due to high sensitivity of network performance to voice outcome features measuring the absolute amplitudes of the glottal flow and area waveforms, which are also difficult to measure accurately in practical applications. By excluding such glottal flow-based features and replacing glottal area-based features by their normalized counterparts, we are able to significantly improve both estimation accuracy and robustness to noise. We further show that similar estimation accuracy and robustness can be achieved with an even smaller set of voice outcome features by excluding features of small sensitivity.
Collapse
Affiliation(s)
- Zhaoyan Zhang
- Department of Head and Neck Surgery, University of California, Los Angeles, 31-24 Rehabilitation Center, Los Angeles, California.
| |
Collapse
|
10
|
Fujiki RB, Croegaert-Koch CK, Thibeault SL. Videostroboscopy Versus High-Speed Videoendoscopy: Factors Influencing Ratings of Laryngeal Oscillation. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2023; 66:1496-1510. [PMID: 37040690 PMCID: PMC10457078 DOI: 10.1044/2023_jslhr-22-00649] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/14/2022] [Revised: 01/16/2023] [Accepted: 01/23/2023] [Indexed: 05/11/2023]
Abstract
PURPOSE The purpose of this study was to determine whether patient voice-related diagnosis, severity of dysphonia, and rater's experience influence the relationship between laryngeal oscillation ratings made from videostroboscopic and high-speed videoendoscopic (HSV) exams. METHOD Stroboscopy and HSV exams from 15 patients with adductor spasmodic dysphonia (ADSD) and 15 with benign vocal fold lesions were rated for laryngeal oscillation and closure by 10 licensed speech-language pathologists (SLPs). Raters were divided into low- (< 5 years) and high-experience (> 5 years) groups. Ratings of vocal fold amplitude, mucosal wave, periodicity, phase symmetry, nonvibrating portion of the vocal fold, and glottal closure were examined using an online form adapted from the Voice Vibratory Assessment of Laryngeal Imaging (VALI). RESULTS Stroboscopy and HSV ratings were more strongly positively correlated for patients with benign vocal fold lesions (r between .43 and .75) than for those with ADSD (r between .40 and .68). Differences between stroboscopy and HSV exams were significantly greater for ratings of amplitude, mucosal wave, and periodicity in patients with ADSD than for patients with benign vocal fold lesions. Raters with < 5 years of experience showed significantly greater differences between stroboscopy and HSV ratings of amplitude and nonvibrating portion of the vocal fold for patients with ADSD only. Significantly greater differences between ratings of periodicity and phase symmetry were observed in patients with more severe dysphonia. CONCLUSIONS Differences in laryngeal ratings made between HSV and stroboscopy exams may be influenced by patient diagnosis, severity of dysphonia, and rater experience. Future study is warranted to determine how the differences observed influence clinical diagnosis and outcomes.
Collapse
|
11
|
Duan C, Jimenez JM, Goergen C, Cox A, Sivasankar PM, Calve S. Hydration State and Hyaluronidase Treatment Significantly Affect Porcine Vocal Fold Biomechanics. J Voice 2023; 37:348-354. [PMID: 33541766 PMCID: PMC8325720 DOI: 10.1016/j.jvoice.2021.01.014] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2020] [Revised: 12/30/2020] [Accepted: 01/07/2021] [Indexed: 10/22/2022]
Abstract
OBJECTIVES The understanding of vocal fold hydration state, including dehydrated, euhydrated, rehydrated tissue, and how hydration affects vocal fold biomechanical properties is still evolving. Although clinical observations support the benefits of increasing vocal fold hydration after dehydrating events, more mechanistic information on the effects of vocal fold dehydration and the beneficial effects of rehydration are needed. Alterations to hyaluronic acid (HA), an important component of the vocal fold extracellular matrix, are likely to influence the biomechanical properties of vocal folds. In this study, we investigated the influence of hydration state and HA on vocal fold tissue stiffness via biomechanical testing. STUDY DESIGN Prospective, ex vivo study design. METHODS Fresh porcine vocal folds (N = 18) were examined following sequential immersion in hypertonic (dehydration) and isotonic solutions (rehydration). In a separate experiment, vocal folds were incubated in hyaluronidase (Hyal) to remove HA. Control tissues were not exposed to any challenges. A custom micromechanical system with a microforce sensing probe was used to measure the force-displacement response. Optical strain was calculated, and ultrasound imaging was used to measure tissue cross-sectional area to obtain stress-strain curves. RESULTS Significant increases (P ≤ 0.05) were found in tangent moduli between dehydrated and rehydrated vocal folds at strains of ε = 0.15. The tangent moduli of Hyal-digested tissues significantly increased at both ε = 0.15 and 0.3 (P ≤ 0.05). CONCLUSION Vocal fold dehydration increased tissue stiffness and rehydration reduced the stiffness. Loss of HA increased vocal fold stiffness, suggesting a potential mechanical role for HA in euhydrated vocal folds.
Collapse
Affiliation(s)
- Chenwei Duan
- Weldon School of Biomedical Engineering, Purdue University, West Lafayette, Indiana; Speech, Language, and Hearing Sciences, Purdue University, West Lafayette, Indiana
| | - Julian M Jimenez
- Weldon School of Biomedical Engineering, Purdue University, West Lafayette, Indiana; Department of Mechanical Engineering, University of Colorado-Boulder, Boulder, Colorado
| | - Craig Goergen
- Weldon School of Biomedical Engineering, Purdue University, West Lafayette, Indiana
| | - Abigail Cox
- Department of Comparative Pathobiology, Purdue University, West Lafayette, Indiana
| | - Preeti M Sivasankar
- Weldon School of Biomedical Engineering, Purdue University, West Lafayette, Indiana; Speech, Language, and Hearing Sciences, Purdue University, West Lafayette, Indiana
| | - Sarah Calve
- Weldon School of Biomedical Engineering, Purdue University, West Lafayette, Indiana; Department of Mechanical Engineering, University of Colorado-Boulder, Boulder, Colorado.
| |
Collapse
|
12
|
Deng JJ, Peterson SD. Examining the influence of epithelium layer modeling approaches on vocal fold kinematics and kinetics. Biomech Model Mechanobiol 2023; 22:479-493. [PMID: 36536195 PMCID: PMC10787511 DOI: 10.1007/s10237-022-01658-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2022] [Accepted: 11/19/2022] [Indexed: 12/23/2022]
Abstract
Grouping the thin epithelium and thicker superficial lamina propria layers into a single cover layer has been widely adopted in finite element vocal fold models. Recent silicone vocal fold studies have suggested, however, that inclusion of a distinct epithelial layer leads to more physiologically representative motion. This study systematically explores the ramifications of incorporating an epithelial layer into a cover grouping for finite element vocal fold modeling. A membrane model for the epithelium is introduced to facilitate parametric investigation by reducing the mesh density requirement of the epithelium into a single infinitesimally thin layer. Excluding the epithelium entirely leads to increased energy in higher order modes and larger inferior-superior excursion of the folds. Integrating the epithelium into a cover layer with volume-weighted average stiffness results in similar kinematics to that of a model treating the epithelium as a distinct layer. However, the internal stress/strain and contact pressure during collision are higher, and viscous dissipation is lower, when the epithelium is integrated into the cover. Thus, careful treatment of the epithelium is recommended for finite element studies, particularly when employing the models for estimating measures dependent upon internal stress/strain and/or collision pressure, such as vocal dose.
Collapse
Affiliation(s)
- Jonathan J Deng
- Department of Mechanical and Mechatronics Engineering, University of Waterloo, Waterloo, ON, N2L 3G1, Canada
| | - Sean D Peterson
- Department of Mechanical and Mechatronics Engineering, University of Waterloo, Waterloo, ON, N2L 3G1, Canada.
| |
Collapse
|
13
|
Zhang Z. Vocal Fold Vertical Thickness in Human Voice Production and Control: A Review. J Voice 2023:S0892-1997(23)00078-4. [PMID: 36964073 PMCID: PMC10514229 DOI: 10.1016/j.jvoice.2023.02.021] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2023] [Revised: 02/15/2023] [Accepted: 02/16/2023] [Indexed: 03/26/2023]
Abstract
While current voice research often focuses on laryngeal adjustments in a two-dimensional plane from a superior endoscopic view, recent computational simulations showed that vocal control is three-dimensional and the medial surface vertical thickness plays an important role in regulating the glottal closure pattern and the spectral shape of the produced voice. In contrast, while a small glottal gap is required to initiate and sustain phonation, further changes in the glottal gap within this small range have only small effects on glottal closure and spectral shape. Vocal fold stiffness, particularly along the anterior-posterior direction, plays an important role in pitch control but has only a small effect on glottal closure and spectral shape. These results suggest that voice research should pay more attention to medial surface shape in the vertical dimension. Future studies in a large population of both normal speakers and patients are needed to better characterize the three-dimensional medial surface shape, its variability between speakers, changes throughout the life span, and how it is impacted by voice disorders and clinical interventions. The implications for voice pedagogy and clinical intervention are discussed.
Collapse
|
14
|
Wu L, Zhang Z. Effects of implant and vocal fold stiffness on voice production after medialization laryngoplasty in an MRI-based vocal fold model. J Biomech 2023; 149:111483. [PMID: 36787673 PMCID: PMC10368372 DOI: 10.1016/j.jbiomech.2023.111483] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2022] [Revised: 01/05/2023] [Accepted: 02/01/2023] [Indexed: 02/07/2023]
Abstract
Medialization laryngoplasty is one of the primary surgical interventions in the treatment of glottal insufficiency due to vocal fold paralysis, paresis, or atrophy. During the surgery, an implant is laterally inserted into the larynx to medialize the affected vocal fold toward glottal midline, with the goal of improving glottal closure during phonation and voice production efficiency. While implants of different materials and geometry designs have been used, the effect of implant design on the voice outcome remains unclear. In this simulation study, the effect of implant stiffness was investigated in an MRI-based model of the vocal folds after medialization laryngoplasty. The results showed that implant stiffness had a significant impact on the phonation threshold pressure, glottal area waveform, and fundamental frequency, but only small effect on the closed quotient and other acoustic measures of the produced voice. The effect of implant stiffness also exhibited variability, depending on the stiffness conditions of the vocal fold and paraglottic tissues, indicating that individual differences need to be considered during the planning of medialization laryngoplasty.
Collapse
Affiliation(s)
- Liang Wu
- Key Laboratory of Biomedical Information Engineering of Ministry of Education, School of Life Science and Technology, Xi'an Jiaotong University, Xi'an 710049, PR China; Department of Head and Neck Surgery, University of California, Los Angeles, 31-24 Rehabilitation Center, 1000 Veteran Avenue, Los Angeles, CA 90095-1794, USA
| | - Zhaoyan Zhang
- Department of Head and Neck Surgery, University of California, Los Angeles, 31-24 Rehabilitation Center, 1000 Veteran Avenue, Los Angeles, CA 90095-1794, USA.
| |
Collapse
|
15
|
Impact of Instructed Laryngeal Manipulation on Acoustic Measures of Voice-Preliminary Results. J Voice 2023; 37:143.e1-143.e11. [PMID: 33288382 DOI: 10.1016/j.jvoice.2020.11.004] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2020] [Revised: 11/10/2020] [Accepted: 11/11/2020] [Indexed: 01/11/2023]
Abstract
BACKGROUND Control of laryngeal muscles is required to manipulate pitch, volume, and voice quality. False vocal fold activity (FVFA) refers to the constriction and release of constriction of the false vocal folds. True vocal fold mass (TVFM) represents the cross-sectional thickness of the vocal folds. Larynx height (LH) refers to the vertical position of the larynx in the neck. To date, studies of voice control have examined the effects of these parameters separately. No study has investigated the impact of instructed systematic manipulation of these parameters on acoustic voice measures in vocally healthy trained subjects. AIMS This study examined the effects of systematically manipulating FVFA, TVFM, and LH on several acoustic voice measures. METHOD Twelve vocally trained speakers were instructed to use specific techniques to achieve experimental conditions of constriction and release of constriction of FVFA, thicker and thinner TVFM, and normal and low LH. Each condition was implemented in combination with manipulating the other parameters. Voice recordings of sustained vowel /a/ and Rainbow Passage were obtained for all laryngeal manipulation conditions and underwent acoustic analyses for fundamental frequency (F0), signal typing, harmonics-to-noise ratio (HNR), cepstral peak prominence (CPP), and vocal relative intensity. RESULTS Constricted FVFA caused more aperiodicity in the signals, lower CPP, and lower vocal relative intensity than release of constriction. Thicker TVFM resulted in significantly higher CPP and vocal relative intensity than thinner TVFM. Modifying TVFM did not affect F0 and HNR. Low LH had significantly lower F0 but did not impact on HNR, CPP, and intensity. CONCLUSIONS The effects of systematic manipulation of each laryngeal parameter resulted in independent acoustic effects without measurable interaction. Release of constriction of FVFA, thicker TVFM, and low LH were configurations that resulted in more optimal acoustic signals.
Collapse
|
16
|
Lamprecht R, Scheible F, Veltrup R, Schaan C, Semmler M, Henningson JO, Sutor A. Quasi-static ultrasound elastography of ex-vivo porcine vocal folds during passive elongation and adduction. J Voice 2022:S0892-1997(22)00386-1. [PMID: 36529564 DOI: 10.1016/j.jvoice.2022.11.033] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2022] [Revised: 11/24/2022] [Accepted: 11/28/2022] [Indexed: 12/23/2022]
Abstract
OBJECTIVES The elastic properties of the vocal folds have great influence on the primary sound and thus on the entire subsequent phonation process. Muscle contractions in the larynx can alter the elastic properties of the vocal fold tissue. Quasi-static ultrasound elastography is a non-destructive examination method that can be applied to ex-vivo vocal folds. In this work, porcine vocal folds were passively elongated and adducted and the changes of the elastic properties due to that manipulations were measured. METHODS Manipulations were performed by applying force to sewn-in sutures. Elongation was achieved by a suture attached to the thyroid cartilage, which was pulled forward by defined weights. Adduction was effected by two sutures exerting torque on the arytenoid cartilage. A series of ten specimens was examined and evaluated using a quasi-static elastography algorithm. In addition, the surface stretch was measured optically using tattooed reference points. RESULTS This study showed that the expected stiffening of the tissue during the manipulations can be measured using quasi-static ultrasound elastography. The measured effect of elongation and adduction, both of which result in stretching of the tissue, is stiffening. However, the relative change of specific manipulations is not the same for the same load on different larynges, but is rather related to stretch caused and other uninvestigated factors. CONCLUSION The passive elongation and adduction of vocal folds stiffen the tissue of the vocal folds and can be measured using ultrasound elastography.
Collapse
Affiliation(s)
- Raphael Lamprecht
- Institute of Measurement and Sensor Technology, UMIT TIROL - Private University for Health Sciences and Health Technology, Hall in Tirol, Austria.
| | - Florian Scheible
- Institute of Measurement and Sensor Technology, UMIT TIROL - Private University for Health Sciences and Health Technology, Hall in Tirol, Austria.
| | - Reinhard Veltrup
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Head- and Neck surgery, University Hospital Erlangen, Friedrich-Alexander-University Erlangen-Nürnberg, Erlangen, Germany.
| | - Casey Schaan
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Head- and Neck surgery, University Hospital Erlangen, Friedrich-Alexander-University Erlangen-Nürnberg, Erlangen, Germany.
| | - Marion Semmler
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Head- and Neck surgery, University Hospital Erlangen, Friedrich-Alexander-University Erlangen-Nürnberg, Erlangen, Germany.
| | - Jann-Ole Henningson
- Chair of Visual Computing, Friedrich-Alexander-University Erlangen-Nürnberg, Erlangen, Germany.
| | - Alexander Sutor
- Institute of Measurement and Sensor Technology, UMIT TIROL - Private University for Health Sciences and Health Technology, Hall in Tirol, Austria.
| |
Collapse
|
17
|
Zhang Z. Oral vibratory sensations during voice production at different laryngeal and semi-occluded vocal tract configurations. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2022; 152:302. [PMID: 35931496 PMCID: PMC9270996 DOI: 10.1121/10.0012365] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/06/2022] [Revised: 06/16/2022] [Accepted: 06/20/2022] [Indexed: 06/15/2023]
Abstract
Voice therapy often emphasizes vibratory sensations in the front part of the vocal tract during phonation to improve vocal efficiency. It remains unclear what laryngeal and vocal tract adjustments are elicited in speakers by this emphasis on oral vibratory sensations. Using a three-dimensional phonation model, this study aims to identify laryngeal and epilaryngeal adjustments that might produce maximal oral vibratory sensations during phonation, as quantified by the oral sound pressure level (SPL), and thus are likely to be elicited in voice therapy at different semi-occluded vocal tract configurations. Results show that maximum oral SPL occurs at intermediate vocal fold adduction configurations characterized by a trade-off between glottal gap and vocal fold vertical thickness. Epilaryngeal tube narrowing further increases the oral SPL in an open vocal tract, but has little effect on oral SPL in semi-occluded vocal tracts. Laryngeal and epilaryngeal configurations producing the maximum oral SPL generally have lower peak vocal fold contact pressure when producing a target output SPL. These favorable configurations are more easily identified in open vocal tracts than semi-occluded vocal tracts. However, semi-occlusion increases both the mean and dynamic oral pressure, which may familiarize speakers with oral vibratory sensations and facilitate adoption of favorable laryngeal configurations.
Collapse
Affiliation(s)
- Zhaoyan Zhang
- Department of Head and Neck Surgery, University of California, Los Angeles, 31-24 Rehabilitation Center, 1000 Veteran Ave., Los Angeles, California 90095-1794, USA
| |
Collapse
|
18
|
Zhang Z. Contribution of Undesired Medial Surface Shape to Suboptimal Voice Outcome After Medialization Laryngoplasty. J Voice 2022:S0892-1997(22)00079-0. [PMID: 35410779 DOI: 10.1016/j.jvoice.2022.03.010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2022] [Revised: 03/09/2022] [Accepted: 03/10/2022] [Indexed: 10/18/2022]
Abstract
OBJECTIVES Voice production in pathological conditions or after surgical intervention often involves undesired medial surface shape such as reduced vertical thickness and/or left-right asymmetry in medial surface shape. The effect of such undesired medial surface on voice production remains unclear, and is often not taken into consideration during planning of surgical intervention, due to difficulty of imaging the medial surface in patients. This study aims to better understand how voice outcomes are impacted by undesired medial surface shape. METHODS Computational simulations were conducted to parametrically manipulate medial surface shape and stiffness and observe its consequence on voice production. RESULTS The results showed that undesired medial surface shape can result in incomplete glottal closure, weak voice production, increased phonation threshold, and significantly reduced vocal efficiency, particularly in the presence of left-right stiffness asymmetry. CONCLUSIONS In addition to approximating the vocal folds, medialization laryngoplasty should additionally aim to sufficiently increase medial surface thickness, which may improve voice outcomes in patients whose voices remain unsatisfactory or suboptimal after initial intervention. While a divergent implant may increase medial surface thickness, precise implant placement in anticipation of tissue and implant deformation during the insertion process is equally important in order to achieve desired medial surface shape and optimal voice outcomes.
Collapse
Affiliation(s)
- Zhaoyan Zhang
- Department of Head and Neck Surgery, University of California, Los Angeles, California.
| |
Collapse
|
19
|
Zhang Z. Contribution of laryngeal size to differences between male and female voice production. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2021; 150:4511. [PMID: 34972311 PMCID: PMC8716178 DOI: 10.1121/10.0009033] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/15/2023]
Abstract
In this study we investigated the effect of sex- and age-related differences in vocal fold length, thickness, and depth on voice production in a three-dimensional vocal fold model. The results showed that the cause-effect relationships between vocal fold physiology and voice production previously identified in an adult male-like vocal fold geometry remained qualitatively the same in vocal folds with geometry representative of adult females and children. We further showed that the often-observed differences in voice production between adult males, adult females, and children can be explained by differences in length and thickness. The lower F0, higher flow rate, larger vocal fold vibration amplitude, and higher sound pressure level (SPL) in adult males as compared to adult females and children can be explained by differences in vocal fold length. In contrast, the thickness effect dominated and contributed to the larger closed quotient of vocal fold vibration, larger normalized maximum flow declination rate, and lower H1-H2 in adult males as compared to adult females and children. The effect of differences in vocal fold depth was generally small. When targeting a specific SPL, adult males experienced a lower peak vocal fold contact pressure during phonation than adult females and children.
Collapse
Affiliation(s)
- Zhaoyan Zhang
- Department of Head and Neck Surgery, University of California, Los Angeles, 31-24 Rehabilitation Center, 1000 Veteran Avenue, Los Angeles, California 90095-1794, USA
| |
Collapse
|
20
|
Vatanparvar K, Nathan V, Nemati E, Rahman MM, McCaffrey D, Kuang J, Gao JA. SpeechSpiro: Lung Function Assessment from Speech Pattern as an Alternative to Spirometry for Mobile Health Tracking. ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL INTERNATIONAL CONFERENCE 2021; 2021:7237-7243. [PMID: 34892769 DOI: 10.1109/embc46164.2021.9630077] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Respiratory illnesses are common in the United States and globally; people deal with these illnesses in various forms, such as asthma, chronic obstructive pulmonary diseases, or infectious respiratory diseases (e.g., coronavirus). The lung function of subjects affected by these illnesses degrades due to infection or inflammation in their respiratory airways. Typically, lung function is assessed using in-clinic medical equipment, and quite recently, via portable spirometry devices. Research has shown that the obstruction and restriction in the respiratory airways affect individuals' voice characteristics. Hence, audio features could play a role in predicting the lung function and severity of the obstruction. In this paper, we go beyond well-known voice audio features and create a hybrid deep learning model using CNN-LSTM to discover spatiotemporal patterns in speech and predict the lung function parameters with accuracy comparable to conventional devices. We validate the performance and generalizability of our method using the data collected from 201 subjects enrolled in two studies internally and in collaboration with a pulmonary hospital. SpeechSpiro measures lung function parameters (e.g., forced vital capacity) with a mean normalized RMSE of 12% and R2 score of up to 76% using 60-second phone audio recordings of individuals reading a passage.Clinical relevance - Speech-based spirometry has the potential to eliminate the need for an additional device to carry out the lung function assessment outside clinical settings; hence, it can enable continuous and mobile track of the individual's condition, healthy or with a respiratory illness, using a smartphone.
Collapse
|
21
|
Zhang Z. Vocal tract adjustments to minimize vocal fold contact pressure during phonation. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2021; 150:1609. [PMID: 34598628 PMCID: PMC8425986 DOI: 10.1121/10.0006047] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/21/2021] [Revised: 08/09/2021] [Accepted: 08/16/2021] [Indexed: 06/13/2023]
Abstract
This computational study aims to identify vocal tract adjustments that minimize the peak vocal fold contact pressure during phonation and thus should be targeted in voice therapy treating phonotraumatic vocal hyperfunction. The results showed that for a given subglottal pressure, the effect of vocal tract adjustments on the peak vocal fold contact pressure was generally small except when such adjustments caused noticeable changes in the glottal flow amplitude. In this study, this occurred mainly when the lip opening was reduced and at conditions of large initial glottal angles or high subglottal pressures, which decreased the peak contact pressure but also significantly reduced the output sound pressure level (SPL). On the other hand, increasing lip opening significantly increased sound radiation efficiency from the mouth and reduced the subglottal pressure required to produce a target SPL. Because of the large effect of the subglottal pressure on the peak contact pressure, increasing lip opening thus was able to significantly reduce the peak contact pressure in voice tasks targeting a specific SPL. In contrast, the effect of pharyngeal expansion alone had only a small effect on the peak contact pressure, whether controlling for the subglottal pressure or targeting a specific SPL.
Collapse
Affiliation(s)
- Zhaoyan Zhang
- Department of Head and Neck Surgery, Los Angeles 31-24 Rehabilitation Center, University of California, 1000 Veteran Avenue, Los Angeles, California 90095-1794, USA
| |
Collapse
|
22
|
Calvache C, Solaque L, Velasco A, Peñuela L. Biomechanical Models to Represent Vocal Physiology: A Systematic Review. J Voice 2021; 37:465.e1-465.e18. [PMID: 33678534 DOI: 10.1016/j.jvoice.2021.02.014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2020] [Revised: 01/26/2021] [Accepted: 02/02/2021] [Indexed: 11/24/2022]
Abstract
Biomechanical modeling allows obtaining information on physical phenomena that cannot be directly observed. This study aims to review models that represent voice production. A systematic review of the literature was conducted using PubMed/Medline, SCOPUS, and IEEE Xplore databases. To select the papers, we used the protocol PRISMA Statement. A total of 53 publications were included in this review. This article considers a taxonomic classification of models found in the literature. We propose four categories in the taxonomy: (1) Models representing the Source (Vocal folds); (2) Models representing the Filter (Vocal Tract); (3) Models representing the Source - Filter Interaction; and (4) Models representing the Airflow - Source Interaction. We include a bibliographic analysis with the evolution of the publications per category. We provide an analysis of the number as well of publications in journals per year. Moreover, we present an analysis of the term occurrence and its frequency of usage, as found in the literature. In each category, different types of vocal production models are mentioned and analyzed. The models account for the analysis of evidence about aerodynamic, biomechanical, and acoustic phenomena and their correlation with the physiological processes involved in the production of the human voice. This review gives an insight into the state of the art related to the mathematical modeling of voice production, analyzed from the viewpoint of vocal physiology.
Collapse
Affiliation(s)
- Carlos Calvache
- Vocology Center, Bogotá, Colombia; Department of Mechatronics Engineering, Universidad Militar Nueva Granada, Bogotá, Colombia.
| | - Leonardo Solaque
- Department of Mechatronics Engineering, Universidad Militar Nueva Granada, Bogotá, Colombia
| | - Alexandra Velasco
- Department of Mechatronics Engineering, Universidad Militar Nueva Granada, Bogotá, Colombia
| | - Lina Peñuela
- Department of Mechatronics Engineering, Universidad Militar Nueva Granada, Bogotá, Colombia
| |
Collapse
|
23
|
Li Z, Chen Y, Chang S, Rousseau B, Luo H. A one-dimensional flow model enhanced by machine learning for simulation of vocal fold vibration. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2021; 149:1712. [PMID: 33765799 PMCID: PMC7954577 DOI: 10.1121/10.0003561] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/02/2020] [Revised: 01/25/2021] [Accepted: 02/01/2021] [Indexed: 06/02/2023]
Abstract
A one-dimensional (1D) unsteady and viscous flow model that is derived from the momentum and mass conservation equations is described, and to enhance this physics-based model, a machine learning approach is used to determine the unknown modeling parameters. Specifically, an idealized larynx model is constructed and ten cases of three-dimensional (3D) fluid-structure interaction (FSI) simulations are performed. The flow data are then extracted to train the 1D flow model using a sparse identification approach for nonlinear dynamical systems. As a result of training, we obtain the analytical expressions for the entrance effect and pressure loss in the glottis, which are then incorporated in the flow model to conveniently handle different glottal shapes due to vocal fold vibration. We apply the enhanced 1D flow model in the FSI simulation of both idealized vocal fold geometries and subject-specific anatomical geometries reconstructed from the magnetic resonance imaging images of rabbits' larynges. The 1D flow model is evaluated in both of these setups and shown to have robust performance. Therefore, it provides a fast simulation tool that is superior to the previous 1D models.
Collapse
Affiliation(s)
- Zheng Li
- Department of Mechanical Engineering, Vanderbilt University, 2301 Vanderbilt Place, Nashville, Tennessee 37235-1592, USA
| | - Ye Chen
- Department of Mechanical Engineering, Vanderbilt University, 2301 Vanderbilt Place, Nashville, Tennessee 37235-1592, USA
| | - Siyuan Chang
- Department of Mechanical Engineering, Vanderbilt University, 2301 Vanderbilt Place, Nashville, Tennessee 37235-1592, USA
| | - Bernard Rousseau
- Department of Communication Science and Disorders, University of Pittsburgh, Pittsburgh, Pennsylvania 15213, USA
| | - Haoxiang Luo
- Department of Mechanical Engineering, Vanderbilt University, 2301 Vanderbilt Place, Nashville, Tennessee 37235-1592, USA
| |
Collapse
|
24
|
Zhang Z. Interaction between epilaryngeal and laryngeal adjustments in regulating vocal fold contact pressure. JASA EXPRESS LETTERS 2021; 1:025201. [PMID: 33615313 PMCID: PMC7869442 DOI: 10.1121/10.0003393] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/27/2020] [Accepted: 01/05/2021] [Indexed: 06/12/2023]
Abstract
This study investigates the peak vocal fold contact pressure at different conditions of epilaryngeal narrowing and laryngeal adjustments. The results show that for a given subglottal pressure, the peak vocal fold contact pressure may increase or decrease with epilaryngeal narrowing, depending on a complex interaction between vocal fold vertical thickness, initial glottal angle, and subglottal pressure. However, epilaryngeal narrowing also significantly increases vocal efficiency so that for a target sound pressure level, the peak vocal fold contact pressure decreases with epilaryngeal narrowing. Overall, the peak vocal fold contact pressure and respiratory effort can be minimized by epilaryngeal narrowing, adopting a small initial glottal angle, and an intermediate vocal fold thickness.
Collapse
Affiliation(s)
- Zhaoyan Zhang
- Department of Head and Neck Surgery, University of California, Los Angeles, 31-24 Rehabilitation Center, 1000 Veteran Avenue, Los Angeles, California 90095-1794, USA
| |
Collapse
|
25
|
Zhang Z. Laryngeal strategies to minimize vocal fold contact pressure and their effect on voice production. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2020; 148:1039. [PMID: 32873018 PMCID: PMC7455307 DOI: 10.1121/10.0001796] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/02/2023]
Abstract
The goal of this study is to identify laryngeal strategies that minimize vocal fold contact pressure while producing a target sound pressure level (SPL) using a three-dimensional voice production model. The results show that while the subglottal pressure and transverse stiffness can be manipulated to reduce the peak contact pressure, such manipulations also reduce the SPL, and are thus less effective in reducing contact pressure in voice tasks targeting a specific SPL level. In contrast, changes in the initial glottal angle and vocal fold vertical thickness that reduce the contact pressure also increase the SPL. Thus, in voice tasks targeting a specific SPL, such changes in the initial glottal angle and vertical thickness also lower the subglottal pressure, which further reduces the peak contact pressure. Overall the results show that for voice tasks with a target SPL level, vocal fold contact pressure can be significantly reduced by adopting a barely abducted glottal configuration or reducing the vocal fold vertical thickness. Aerodynamic measures are effective in identifying voice production with large initial glottal angles, but by themselves alone are not useful in differentiating hyperadducted vocal folds from barely abducted vocal folds, which may be better differentiated by closed quotient and voice type measures.
Collapse
Affiliation(s)
- Zhaoyan Zhang
- Department of Head and Neck surgery, University of California, Los Angeles, 31-24 Rehabilitation Center, 1000 Veteran Avenue, Los Angeles, California 90095-1794, USA
| |
Collapse
|
26
|
Geng B, Pham N, Xue Q, Zheng X. A three-dimensional vocal fold posturing model based on muscle mechanics and magnetic resonance imaging of a canine larynx. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2020; 147:2597. [PMID: 32359330 DOI: 10.1121/10.0001093] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/16/2019] [Accepted: 03/27/2020] [Indexed: 06/11/2023]
Abstract
In this work, a high-fidelity three-dimensional continuum model of the canine laryngeal framework was developed for simulating laryngeal posturing. By building each muscle and cartilage from magnetic resonance imaging (MRI), the model is highly realistic in anatomy. The muscle mechanics is modeled using the finite-element method. The model was tested by simulating vocal fold postures under systematic activations of individual as well as groups of laryngeal muscles, and it accurately predicted vocal fold posturing parameters reported from in vivo canine larynges. As a demonstration of its application, the model was then used to investigate muscle controls of arytenoid movements, medial surface morphology, and vocal fold abduction. The results show that the traditionally categorized adductor and abductor muscles can have opposite effects on vocal fold posturing, making highly complex laryngeal adjustments in speech and singing possible. These results demonstrate that a realistic comprehensive larynx model is feasible, which is a critical step toward a causal physics-based model of voice production.
Collapse
Affiliation(s)
- Biao Geng
- Department of Mechanical Engineering, University of Maine, Orono, Maine 04473, USA
| | - Ngoc Pham
- Department of Mechanical Engineering, University of Maine, Orono, Maine 04473, USA
| | - Qian Xue
- Department of Mechanical Engineering, University of Maine, Orono, Maine 04473, USA
| | - Xudong Zheng
- Department of Mechanical Engineering, University of Maine, Orono, Maine 04473, USA
| |
Collapse
|
27
|
Chen Y, Li Z, Chang S, Rousseau B, Luo H. A reduced-order flow model for vocal fold vibration: from idealized to subject-specific models. JOURNAL OF FLUIDS AND STRUCTURES 2020; 94:102940. [PMID: 32210520 PMCID: PMC7093056 DOI: 10.1016/j.jfluidstructs.2020.102940] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/02/2023]
Abstract
We present a reduced-order model for fluid-structure interaction (FSI) simulation of vocal fold vibration during phonation. This model couples the three-dimensional (3D) tissue mechanics and a one-dimensional (1D) flow model that is derived from the momentum and mass conservation equations for the glottal airflow. The effects of glottal entrance and pressure loss in the glottis are incorporated in the flow model. We consider both idealized vocal fold geometries and subject-specific anatomical geometries segmented from the MRI images of rabbits. For the idealized vocal fold geometries, we compare the simulation results from the 1D/3D hybrid FSI model with those from the full 3D FSI simulation based on an immersed-boundary method. For the subject-specific geometries, we incorporate previously estimated tissue properties for individual samples and compare the results with those from the high-speed imaging experiment of in vivo phonation. In both setups, the comparison shows good agreement in the vibration frequency, amplitude, phase delay, and deformation pattern of the vocal fold, which suggests potential application of the present approach for future patient-specific modeling.
Collapse
Affiliation(s)
- Ye Chen
- Department of Mechanical Engineering, Vanderbilt University, 2301 Vanderbilt Place, Nashville, TN 37235-1592
| | - Zheng Li
- Department of Mechanical Engineering, Vanderbilt University, 2301 Vanderbilt Place, Nashville, TN 37235-1592
| | - Siyuan Chang
- Department of Mechanical Engineering, Vanderbilt University, 2301 Vanderbilt Place, Nashville, TN 37235-1592
| | - Bernard Rousseau
- Department of Communication Science and Disorders, University of Pittsburgh
| | - Haoxiang Luo
- Department of Mechanical Engineering, Vanderbilt University, 2301 Vanderbilt Place, Nashville, TN 37235-1592
| |
Collapse
|
28
|
Zhang Z. Estimation of vocal fold physiology from voice acoustics using machine learning. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2020; 147:EL264. [PMID: 32237804 PMCID: PMC7075716 DOI: 10.1121/10.0000927] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/15/2019] [Revised: 03/01/2020] [Accepted: 03/03/2020] [Indexed: 05/27/2023]
Abstract
The goal of this study is to estimate vocal fold geometry, stiffness, position, and subglottal pressure from voice acoustics, toward clinical and other voice technology applications. Unlike previous voice inversion research that often uses lumped-element models of phonation, this study explores the feasibility of voice inversion using data generated from a three-dimensional voice production model. Neural networks are trained to estimate vocal fold properties and subglottal pressure from voice features extracted from the simulation data. Results show reasonably good estimation accuracy, particularly for vocal fold properties with a consistent global effect on voice production, and reasonable agreement with excised human larynx experiment.
Collapse
Affiliation(s)
- Zhaoyan Zhang
- Department of Head and Neck Surgery, University of California, Los Angeles, 31-24 Rehab Center, 1000 Veteran Avenue, Los Angeles, California 90095-1794,
| |
Collapse
|
29
|
Zhang Z, Wu L, Gray R, Chhetri DK. Three-dimensional vocal fold structural change due to implant insertion in medialization laryngoplasty. PLoS One 2020; 15:e0228464. [PMID: 31999758 PMCID: PMC6991949 DOI: 10.1371/journal.pone.0228464] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2019] [Accepted: 01/16/2020] [Indexed: 11/22/2022] Open
Abstract
Glottal insufficiency due to vocal fold paralysis, paresis, or atrophy often leads to degraded voice quality. One of the primary surgical intervention procedures to treat glottal insufficiency is medialization laryngoplasty, in which an implant is inserted through a lateral window on the thyroid cartilage to medialize the vocal folds. While the goal of medialization is to modify the vocal fold structure to restore normal phonation, few studies have attempted to quantify such structural changes of the vocal folds. The goal of this study is to quantify the three-dimensional structural changes of the vocal folds due to implant insertion in medialization laryngoplasty, and evaluate its potential effect on voice production. Medialization laryngoplasty were performed in excised human larynges using implants of different stiffness. Magnetic resonance images of the larynges were obtained with and without implant insertion. The results showed that implant insertion significantly changed the original body-cover structure of the vocal folds, with the implant taking over the large space used to be occupied by the original body layer and the vocal fold being stretched into a thin layer wrapped around the implant. The medial-lateral dimension of the vocal fold was significantly reduced from about 4 mm to 1 mm, and the vocal fold was stretched in the coronal plane by about 70%. It is hypothesized that use of implants with stiffness comparable to that of the vocal folds is beneficial because the degree of medialization can be adjusted without much negative effects on phonation frequency, phonation threshold pressure, or vibration amplitude.
Collapse
Affiliation(s)
- Zhaoyan Zhang
- Department of Head and Neck Surgery, University of California, Los Angeles, California, United States of America
| | - Liang Wu
- Department of Head and Neck Surgery, University of California, Los Angeles, California, United States of America
| | - Raluca Gray
- Department of Head and Neck Surgery, University of California, Los Angeles, California, United States of America
| | - Dinesh K Chhetri
- Department of Head and Neck Surgery, University of California, Los Angeles, California, United States of America
| |
Collapse
|
30
|
Wu L, Zhang Z. Voice production in a MRI-based subject-specific vocal fold model with parametrically controlled medial surface shape. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2019; 146:4190. [PMID: 31893687 PMCID: PMC6901357 DOI: 10.1121/1.5134784] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/05/2019] [Revised: 10/24/2019] [Accepted: 10/30/2019] [Indexed: 05/18/2023]
Abstract
The goal of this study was to investigate how realistic changes in medial surface shape, as occur in human phonation, affect voice production. In a parametric magnetic resonance imaging-based three-dimensional vocal fold model, the superior and inferior portions of the medial surface were systematically manipulated to produce different medial surface contours similar to those observed in previous excised larynx and in vivo canine larynx experiments. Voice simulations were performed to investigate the differences in the resulting voice production. The results showed that both superior-medial bulging and inferior-medial bulging of the medial surface, which led to an increased vertical thickness and a more rectangular glottal configuration, increased the closed quotient of vocal fold vibration. Changes in medial surface shape also had significant effects on the phonation threshold pressure. The degree of these effects of changes in medial surface shape was larynx specific, and varied significantly depending on the vocal fold cross-sectional geometry and its variation along the anterior-posterior direction. The results suggest that, in addition to vocal fold approximation, surgical interventions of voice disorders should also aim at restoring a rectangular and sufficiently thick medial surface.
Collapse
Affiliation(s)
- Liang Wu
- Department of Head and Neck Surgery, University of California, Los Angeles (UCLA), 31-24 Rehabilitation Center, 1000 Veteran Avenue, Los Angeles, California 90095-1794, USA
| | - Zhaoyan Zhang
- Department of Head and Neck Surgery, University of California, Los Angeles (UCLA), 31-24 Rehabilitation Center, 1000 Veteran Avenue, Los Angeles, California 90095-1794, USA
| |
Collapse
|
31
|
Wang X, Jiang W, Zheng X, Xue Q. A computational study of the effects of vocal fold stiffness parameters on voice production. J Voice 2019; 35:327.e1-327.e11. [PMID: 31628047 DOI: 10.1016/j.jvoice.2019.09.004] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2019] [Revised: 09/03/2019] [Accepted: 09/04/2019] [Indexed: 10/25/2022]
Abstract
A three-dimensional flow-structure interaction model of voice production is used to investigate the effect of the stiffness parameters of vocal fold layers on voice production. The vocal fold is modeled as a three-layer structure consisting of the cover, ligament, and body layers. All the three layers are modeled as transversely isotropic materials for which the stiffness parameters include the transverse elastic modulus and longitudinal elastic modulus. The results show that, in addition to the obvious monotonic effects on the fundamental frequency, flow rate and glottis opening, the stiffness parameters also have significant and nonmonotonic effects on the divergent angle, open quotient, and closing velocity. It is further found that the longitudinal stiffness parameters generally have more significant impacts on glottal flows and vocal fold vibrations than the transverse stiffness parameters. The sensitivity analysis shows that, among all the stiffness parameters, the transverse and longitudinal stiffness of the ligament layer have the most dominant effect on most output measures.
Collapse
Affiliation(s)
- Xiaojian Wang
- Department of Mechanical Engineering, University of Maine, Orono, Maine
| | - Weili Jiang
- Department of Mechanical Engineering, University of Maine, Orono, Maine
| | - Xudong Zheng
- Department of Mechanical Engineering, University of Maine, Orono, Maine
| | - Qian Xue
- Department of Mechanical Engineering, University of Maine, Orono, Maine.
| |
Collapse
|
32
|
Li Z, Chen Y, Chang S, Luo H. A Reduced-Order Flow Model for Fluid-Structure Interaction Simulation of Vocal Fold Vibration. J Biomech Eng 2019; 142:2736602. [PMID: 31201740 DOI: 10.1115/1.4044033] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2018] [Indexed: 01/04/2023]
Abstract
We present a novel reduced-order glottal airflow model that can be coupled with the three-dimensional (3D) solid mechanics model of the vocal fold tissue to simulate the fluid-structure interaction (FSI) during voice production. This type of hybrid FSI models have potential applications in the estimation of the tissue properties that are unknown due to patient variations and/or neuromuscular activities. In this work, the flow is simplified to a one-dimensional (1D) momentum equation-based model incorporating the entrance effect and energy loss in the glottis. The performance of the flow model is assessed using a simplified yet 3D vocal fold configuration. We use the immersed-boundary method-based 3D FSI simulation as a benchmark to evaluate the momentum-based model as well as the Bernoulli-based 1D flow models. The results show that the new model has significantly better performance than the Bernoulli models in terms of prediction about the vocal fold vibration frequency, amplitude, and phase delay. Furthermore, the comparison results are consistent for different medial thicknesses of the vocal fold, subglottal pressures, and tissue material behaviors, indicating that the new model has better robustness than previous reduced-order models.
Collapse
Affiliation(s)
- Zheng Li
- Department of Mechanical Engineering, Vanderbilt University, 2301 Vanderbilt Place, Nashville, TN 37235-1592
| | - Ye Chen
- Department of Mechanical Engineering, Vanderbilt University, 2301 Vanderbilt Place, Nashville, TN 37235-1592
| | - Siyuan Chang
- Department of Mechanical Engineering, Vanderbilt University, 2301 Vanderbilt Place, Nashville, TN 37235-1592
| | - Haoxiang Luo
- Department of Mechanical Engineering, Vanderbilt University, 2301 Vanderbilt Place, Nashville, TN 37235-1592
| |
Collapse
|
33
|
Zhang Z. Vocal fold contact pressure in a three-dimensional body-cover phonation model. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2019; 146:256. [PMID: 31370600 PMCID: PMC6642050 DOI: 10.1121/1.5116138] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/07/2019] [Revised: 06/18/2019] [Accepted: 06/20/2019] [Indexed: 05/18/2023]
Abstract
The goal of this study is to identify vocal fold geometric and mechanical conditions that are likely to produce large contact pressure and thus high risk of vocal fold injury. Using a three-dimensional computational model of phonation, parametric simulations are performed with co-variations in vocal fold geometry and stiffness, with and without a vocal tract. For each simulation, the peak contact pressure is calculated. The results show that the subglottal pressure and the transverse stiffness of the vocal folds in the coronal plane have the largest and most consistent effect on the peak contact pressure, indicating the importance of maintaining a balance between the subglottal pressure and transverse stiffness to avoiding vocal fold injury. The presence of a vocal tract generally increases the peak contact pressure, particularly for an open-mouth vocal tract configuration. While a low degree of vocal fold approximation significantly reduces vocal fold contact pressure, for conditions of moderate and tight vocal fold approximation changes in vocal fold approximation may increase or decrease the peak contact pressure. The effects of the medial surface thickness and vocal fold stiffness along the anterior-posterior direction are similarly inconsistent and vary depending on other control parameters and the vocal tract configuration.
Collapse
Affiliation(s)
- Zhaoyan Zhang
- Department of Head and Neck Surgery, University of California, Los Angeles, 31-24 Rehabilitation Center, 1000 Veteran Avenue, Los Angeles, California 90095-1794, USA
| |
Collapse
|
34
|
Jiang W, Zheng X, Xue Q. Influence of vocal fold cover layer thickness on its vibratory dynamics during voice production. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2019; 146:369. [PMID: 31370576 PMCID: PMC6656577 DOI: 10.1121/1.5116567] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/30/2018] [Revised: 06/17/2019] [Accepted: 06/24/2019] [Indexed: 05/31/2023]
Abstract
The influence of vocal fold cover layer thickness on the flow-induced vibration and voice production was studied by using a continuum-mechanics based computational model. The cover-body thickness ratio of a three-layer vocal fold was systematically varied. The effect on the vocal fold stiffness, eigenfrequencies and eigenmodes, fundamental frequencies, glottal flow rate, vocal fold vibratory dynamics, and synchronization of the eigenmodes were analyzed by using the structure eigen analysis and flow-structure interaction simulations. It was found that the cover-body layer thickness ratio significantly affected the strength and synchronization of the eigenmodes during flow-structure interactions, and ultimately affected the fundamental frequency and vibration pattern. With the increasing cover-body thickness ratio, the strength of the wave-type higher-eigenfrequency modes increased, and that resulted in a nonlinear bifurcation of the system in which the system evolved from a regular periodic vibration to a periodic doubling vibration and then back to a regular periodic vibration with increased fundamental frequencies. During the transition, the system vibrated chaotically. Because of the increased strength of the wave-type modes, the maximum divergent angle of the glottis was also increased with the increasing cover-body thickness ratio.
Collapse
Affiliation(s)
- Weili Jiang
- Department of Mechanical Engineering, University of Maine, Orono, Maine 04469, USA
| | - Xudong Zheng
- Department of Mechanical Engineering, University of Maine, Orono, Maine 04469, USA
| | - Qian Xue
- Department of Mechanical Engineering, University of Maine, Orono, Maine 04469, USA
| |
Collapse
|
35
|
Roessig S, Mücke D, Grice M. The dynamics of intonation: Categorical and continuous variation in an attractor-based model. PLoS One 2019; 14:e0216859. [PMID: 31120891 PMCID: PMC6532892 DOI: 10.1371/journal.pone.0216859] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2018] [Accepted: 04/30/2019] [Indexed: 11/18/2022] Open
Abstract
The framework of dynamical systems offers powerful tools to understand the relation between stability and variability in human cognition in general and in speech in particular. In the current paper, we propose a dynamical systems approach to the description of German nuclear pitch accents in focus marking to account for both the categorical as well as the continuous variation found in intonational data. We report on results from 27 native speakers and employ an attractor landscape to represent pitch accent types in terms of f0 measures in a continuous dimension. We demonstrate how the same system can account for both the categorical variation (relative stability of one prosodic category) as well as the continuous variation (detailed modifications within one prosodic category). The model is able to capture the qualitative aspects of focus marking such as falling vs. rising pitch accent types as well as the quantitative aspects such as less rising vs. more rising accents in one system by means of scaling a single parameter. Furthermore, speaker group specific strategies are analysed and modelled as differences in the scaling of this parameter. Thus, the model contributes to the ongoing debate about the relation between phonetics and phonology and the importance of variation in language and speech.
Collapse
Affiliation(s)
- Simon Roessig
- IfL–Phonetics, University of Cologne, Cologne, Germany
| | - Doris Mücke
- IfL–Phonetics, University of Cologne, Cologne, Germany
| | - Martine Grice
- IfL–Phonetics, University of Cologne, Cologne, Germany
| |
Collapse
|
36
|
Zhang Z. Vocal instabilities in a three-dimensional body-cover phonation model. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2018; 144:1216. [PMID: 30424612 PMCID: PMC6128715 DOI: 10.1121/1.5053116] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/04/2018] [Revised: 08/17/2018] [Accepted: 08/20/2018] [Indexed: 05/08/2023]
Abstract
The goal of this study is to identify vocal fold conditions that produce irregular vocal fold vibration and the underlying physical mechanisms. Using a three-dimensional computational model of phonation, parametric simulations are performed with co-variations in vocal fold geometry, stiffness, and vocal tract shape. For each simulation, the cycle-to-cycle variations in the amplitude and period of the glottal area function are calculated, based on which the voice is classified into three types corresponding to regular, quasi-steady or subharmonic, and chaotic phonation. The results show that vocal folds with a large medial surface vertical thickness and low transverse stiffness are more likely to exhibit irregular vocal fold vibration when tightly approximated and subject to high subglottal pressure. Transition from regular vocal fold vibration to vocal instabilities is often accompanied by energy redistribution among the first few vocal fold eigenmodes, presumably due to nonlinear interaction between eigenmodes during vocal fold contact. The presence of a vocal tract may suppress such contact-related vocal instabilities, but also induce new instabilities, particularly for less constricted vocal fold conditions, almost doubling the number of vocal fold conditions producing irregular vibration.
Collapse
Affiliation(s)
- Zhaoyan Zhang
- Department of Head and Neck Surgery, University of California, Los Angeles, 31-24 Rehabilitation Center, 1000 Veteran Avenue, Los Angeles, California 90095-1794, USA
| |
Collapse
|