1
|
Jiang W, Geng B, Zheng X, Xue Q. A computational study of the influence of thyroarytenoid and cricothyroid muscle interaction on vocal fold dynamics in an MRI-based human laryngeal model. Biomech Model Mechanobiol 2024; 23:1801-1813. [PMID: 38981946 DOI: 10.1007/s10237-024-01869-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2024] [Accepted: 06/21/2024] [Indexed: 07/11/2024]
Abstract
A human laryngeal model, incorporating all the cartilages and the intrinsic muscles, was reconstructed based on MRI data. The vocal fold was represented as a multilayer structure with detailed inner components. The activation levels of the thyroarytenoid (TA) and cricothyroid (CT) muscles were systematically varied from zero to full activation allowing for the analysis of their interaction and influence on vocal fold dynamics and glottal flow. The finite element method was employed to calculate the vocal fold dynamics, while the one-dimensional Bernoulli equation was utilized to calculate the glottal flow. The analysis was focused on the muscle influence on the fundamental frequency (fo). We found that while CT and TA activation increased the fo in most of the conditions, TA activation resulted in a frequency drop when it was moderately activated. We show that this frequency drop was associated with the sudden increase of the vertical motion when the vibration transited from involving the whole tissue to mainly in the cover layer. The transition of the vibration pattern was caused by the increased body-cover stiffness ratio that resulted from TA activation.
Collapse
Affiliation(s)
- Weili Jiang
- Department of Mechanical Engineering, Kate Gleason College of Engineering, Rochester Institute of Technology, Rochester, NY, USA
| | - Biao Geng
- Department of Mechanical Engineering, Kate Gleason College of Engineering, Rochester Institute of Technology, Rochester, NY, USA
| | - Xudong Zheng
- Department of Mechanical Engineering, Kate Gleason College of Engineering, Rochester Institute of Technology, Rochester, NY, USA
| | - Qian Xue
- Department of Mechanical Engineering, Kate Gleason College of Engineering, Rochester Institute of Technology, Rochester, NY, USA.
| |
Collapse
|
2
|
Schlegel P, Berry DA, Moffatt C, Zhang Z, Chhetri DK. Register transitions in an in vivo canine model as a function of intrinsic laryngeal muscle stimulation, fundamental frequency, and sound pressure level. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2024; 155:2139-2150. [PMID: 38498507 PMCID: PMC10954347 DOI: 10.1121/10.0025135] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/12/2023] [Revised: 01/09/2024] [Accepted: 02/16/2024] [Indexed: 03/20/2024]
Abstract
Phonatory instabilities and involuntary register transitions can occur during singing. However, little is known regarding the mechanisms which govern such transitions. To investigate this phenomenon, we systematically varied laryngeal muscle activation and airflow in an in vivo canine larynx model during phonation. We calculated voice range profiles showing average nerve activations for all combinations of fundamental frequency (F0) and sound pressure level (SPL). Further, we determined closed-quotient (CQ) and minimum-posterior-area (MPA) based on high-speed video recordings. While different combinations of muscle activation favored different combinations of F0 and SPL, in the investigated larynx there was a consistent region of instability at about 400 Hz which essentially precluded phonation. An explanation for this region may be a larynx specific coupling between sound source and subglottal tract or an effect based purely on larynx morphology. Register transitions crossed this region, with different combinations of cricothyroid and thyroarytenoid muscle (TA) activation stabilizing higher or lower neighboring frequencies. Observed patterns in CQ and MPA dependent on TA activation reproduced patterns found in singers in previous work. Lack of control of TA stimulation may result in phonation instabilities, and enhanced control of TA stimulation may help to avoid involuntary register transitions, especially in the singing voice.
Collapse
Affiliation(s)
- Patrick Schlegel
- Department of Head and Neck Surgery, David Geffen School of Medicine at the University of California-Los Angeles, Los Angeles, California 90095, USA
| | - David A Berry
- Department of Head and Neck Surgery, David Geffen School of Medicine at the University of California-Los Angeles, Los Angeles, California 90095, USA
| | - Clare Moffatt
- Department of Head and Neck Surgery, David Geffen School of Medicine at the University of California-Los Angeles, Los Angeles, California 90095, USA
| | - Zhaoyan Zhang
- Department of Head and Neck Surgery, David Geffen School of Medicine at the University of California-Los Angeles, Los Angeles, California 90095, USA
| | - Dinesh K Chhetri
- Department of Head and Neck Surgery, David Geffen School of Medicine at the University of California-Los Angeles, Los Angeles, California 90095, USA
| |
Collapse
|
3
|
Movahhedi M, Liu XY, Geng B, Elemans C, Xue Q, Wang JX, Zheng X. Predicting 3D soft tissue dynamics from 2D imaging using physics informed neural networks. Commun Biol 2023; 6:541. [PMID: 37208428 DOI: 10.1038/s42003-023-04914-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2022] [Accepted: 05/04/2023] [Indexed: 05/21/2023] Open
Abstract
Tissue dynamics play critical roles in many physiological functions and provide important metrics for clinical diagnosis. Capturing real-time high-resolution 3D images of tissue dynamics, however, remains a challenge. This study presents a hybrid physics-informed neural network algorithm that infers 3D flow-induced tissue dynamics and other physical quantities from sparse 2D images. The algorithm combines a recurrent neural network model of soft tissue with a differentiable fluid solver, leveraging prior knowledge in solid mechanics to project the governing equation on a discrete eigen space. The algorithm uses a Long-short-term memory-based recurrent encoder-decoder connected with a fully connected neural network to capture the temporal dependence of flow-structure-interaction. The effectiveness and merit of the proposed algorithm is demonstrated on synthetic data from a canine vocal fold model and experimental data from excised pigeon syringes. The results showed that the algorithm accurately reconstructs 3D vocal dynamics, aerodynamics, and acoustics from sparse 2D vibration profiles.
Collapse
Affiliation(s)
| | - Xin-Yang Liu
- Aerospace and Mechanical Engineering Department, University of Notre Dame, Notre Dame, IN, 46556, USA
| | - Biao Geng
- Mechanical Engineering Department, University of Maine, Orono, ME, 04469, USA
- Mechanical Engineering Department, Rochester Institute of Technology, Rochester, NY, 14623, USA
| | - Coen Elemans
- Department of Biology, University of Southern Denmark, Odense M, 5230, Denmark
| | - Qian Xue
- Mechanical Engineering Department, University of Maine, Orono, ME, 04469, USA
- Mechanical Engineering Department, Rochester Institute of Technology, Rochester, NY, 14623, USA
| | - Jian-Xun Wang
- Aerospace and Mechanical Engineering Department, University of Notre Dame, Notre Dame, IN, 46556, USA.
| | - Xudong Zheng
- Mechanical Engineering Department, University of Maine, Orono, ME, 04469, USA.
- Mechanical Engineering Department, Rochester Institute of Technology, Rochester, NY, 14623, USA.
| |
Collapse
|
4
|
Serry MA, Stepp CE, Peterson SD. Exploring the mechanics of fundamental frequency variation during phonation onset. Biomech Model Mechanobiol 2023; 22:339-356. [PMID: 36370231 PMCID: PMC10369356 DOI: 10.1007/s10237-022-01652-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2022] [Accepted: 10/20/2022] [Indexed: 11/15/2022]
Abstract
Fundamental frequency patterns during phonation onset have received renewed interest due to their promising application in objective classification of normal and pathological voices. However, the associated underlying mechanisms producing the wide array of patterns observed in different phonetic contexts are not yet fully understood. Herein, we employ theoretical and numerical analyses in an effort to elucidate the potential mechanisms driving opposing frequency patterns for initial/isolated vowels versus vowels preceded by voiceless consonants. Utilizing deterministic lumped-mass oscillator models of the vocal folds, we systematically explore the roles of collision and muscle activation in the dynamics of phonation onset. We find that an increasing trend in fundamental frequency, as observed for initial/isolated vowels, arises naturally through a progressive increase in system stiffness as collision intensifies as onset progresses, without the need for time-varying vocal fold tension or changes in aerodynamic loading. In contrast, reduction in cricothyroid muscle activation during onset is required to generate the decrease in fundamental frequency observed for vowels preceded by voiceless consonants. For such phonetic contexts, our analysis shows that the magnitude of reduction in the cricothyroid muscle activation and the activation level of the thyroarytenoid muscle are potential factors underlying observed differences in (relative) fundamental frequency between speakers with healthy and hyperfunctional voices. This work highlights the roles of sometimes competing laryngeal factors in producing the complex array of observed fundamental frequency patterns during phonation onset.
Collapse
Affiliation(s)
- Mohamed A Serry
- Department of Mechanical and Mechatronics Engineering, University of Waterloo, Waterloo, ON, N2L 3G1, Canada
| | - Cara E Stepp
- Department of Speech, Language and Hearing Sciences, Boston University, Boston, MA, 02215, USA
| | - Sean D Peterson
- Department of Mechanical and Mechatronics Engineering, University of Waterloo, Waterloo, ON, N2L 3G1, Canada.
| |
Collapse
|
5
|
Herbst CT, Elemans CPH, Tokuda IT, Chatziioannou V, Švec JG. Dynamic System Coupling in Voice Production. J Voice 2023:S0892-1997(22)00310-1. [PMID: 36737267 DOI: 10.1016/j.jvoice.2022.10.004] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2022] [Revised: 10/07/2022] [Accepted: 10/07/2022] [Indexed: 02/04/2023]
Abstract
Voice is a major means of communication for humans, non-human mammals and many other vertebrates like birds and anurans. The physical and physiological principles of voice production are described by two theories: the MyoElastic-AeroDynamic (MEAD) theory and the Source-Filter Theory (SFT). While MEAD employs a multiphysics approach to understand the motor control and dynamics of self-sustained vibration of vocal folds or analogous tissues, SFT predominantly uses acoustics to understand spectral changes of the source via linear propagation through the vocal tract. Because the two theories focus on different aspects of voice production, they are often applied distinctly in specific areas of science and engineering. Here, we argue that the MEAD and the SFT are linked integral aspects of a holistic theory of voice production, describing a dynamically coupled system. The aim of this manuscript is to provide a comprehensive review of both the MEAD and the source-filter theory with its nonlinear extension, the latter of which suggests a number of conceptual similarities to sound production in brass instruments. We discuss the application of both theories to voice production of humans as well as of animals. An appraisal of voice production in the light of non-linear dynamics supports the notion that voice production can best be described with a systems view, considering coupled systems rather than isolated contributions of individual sub-systems.
Collapse
Affiliation(s)
- Christian T Herbst
- Department of Vocal Studies, Mozarteum University, Salzburg, Austria; Janette Ogg Voice Research Center, Shenandoah Conservatory, Winchester, Virginia. http://www.christian-herbst.org
| | - Coen P H Elemans
- Vocal Neuromechanics Lab, Department of Biology, University of Southern Denmark, Odense M, Denmark
| | - Isao T Tokuda
- Department of Mechanical Engineering, Ritsumeikan University, Kusatsu, Shiga, Japan
| | | | - Jan G Švec
- Voice Research Laboratory, Department of Experimental Physics, Faculty of Science, Palacky University Olomouc, Olomouc, Czech Republic
| |
Collapse
|
6
|
Weerathunge HR, Alzamendi GA, Cler GJ, Guenther FH, Stepp CE, Zañartu M. LaDIVA: A neurocomputational model providing laryngeal motor control for speech acquisition and production. PLoS Comput Biol 2022; 18:e1010159. [PMID: 35737706 PMCID: PMC9258861 DOI: 10.1371/journal.pcbi.1010159] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2021] [Revised: 07/06/2022] [Accepted: 05/02/2022] [Indexed: 11/18/2022] Open
Abstract
Many voice disorders are the result of intricate neural and/or biomechanical impairments that are poorly understood. The limited knowledge of their etiological and pathophysiological mechanisms hampers effective clinical management. Behavioral studies have been used concurrently with computational models to better understand typical and pathological laryngeal motor control. Thus far, however, a unified computational framework that quantitatively integrates physiologically relevant models of phonation with the neural control of speech has not been developed. Here, we introduce LaDIVA, a novel neurocomputational model with physiologically based laryngeal motor control. We combined the DIVA model (an established neural network model of speech motor control) with the extended body-cover model (a physics-based vocal fold model). The resulting integrated model, LaDIVA, was validated by comparing its model simulations with behavioral responses to perturbations of auditory vocal fundamental frequency (fo) feedback in adults with typical speech. LaDIVA demonstrated capability to simulate different modes of laryngeal motor control, ranging from short-term (i.e., reflexive) and long-term (i.e., adaptive) auditory feedback paradigms, to generating prosodic contours in speech. Simulations showed that LaDIVA's laryngeal motor control displays properties of motor equivalence, i.e., LaDIVA could robustly generate compensatory responses to reflexive vocal fo perturbations with varying initial laryngeal muscle activation levels leading to the same output. The model can also generate prosodic contours for studying laryngeal motor control in running speech. LaDIVA can expand the understanding of the physiology of human phonation to enable, for the first time, the investigation of causal effects of neural motor control in the fine structure of the vocal signal.
Collapse
Affiliation(s)
- Hasini R. Weerathunge
- Department of Biomedical Engineering, Boston University, Boston, Massachusetts, United States of America
- Department of Speech, Language, and Hearing Sciences, Boston University, Boston, Massachusetts, United States of America
| | - Gabriel A. Alzamendi
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso, Chile
- Institute for Research and Development on Bioengineering and Bioinformatics (IBB), CONICET-UNER, Oro Verde, Argentina
| | - Gabriel J. Cler
- Department of Speech & Hearing Sciences, University of Washington, Seattle, Washington, United States of America
| | - Frank H. Guenther
- Department of Biomedical Engineering, Boston University, Boston, Massachusetts, United States of America
- Department of Speech, Language, and Hearing Sciences, Boston University, Boston, Massachusetts, United States of America
| | - Cara E. Stepp
- Department of Biomedical Engineering, Boston University, Boston, Massachusetts, United States of America
- Department of Speech, Language, and Hearing Sciences, Boston University, Boston, Massachusetts, United States of America
- Department of Otolaryngology-Head and Neck Surgery, Boston University School of Medicine, Boston, Massachusetts, United States of America
| | - Matías Zañartu
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso, Chile
| |
Collapse
|