1
|
Maskeliūnas R, Damaševičius R, Kulikajevas A, Pribuišis K, Uloza V. Alaryngeal Speech Enhancement for Noisy Environments Using a Pareto Denoising Gated LSTM. J Voice 2024:S0892-1997(24)00228-5. [PMID: 39107213 DOI: 10.1016/j.jvoice.2024.07.016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2024] [Revised: 07/13/2024] [Accepted: 07/15/2024] [Indexed: 08/09/2024]
Abstract
Loss of the larynx significantly alters natural voice production, requiring alternative communication modalities and rehabilitation methods to restore speech intelligibility and improve the quality of life of affected individuals. This paper explores advances in alaryngeal speech enhancement to improve signal quality and reduce background noise, focusing on individuals who have undergone laryngectomy. In this study, speech samples were obtained from 23 Lithuanian males who had undergone laryngectomy with secondary implantation of the tracheoesophageal prosthesis (TEP). Pareto-optimized gated long short-term memory was trained on tracheoesophageal speech data to recognize complex temporal connections and contextual information in speech signals. The system was able to distinguish between actual speech and various forms of noise and artifacts, resulting in a 25% drop in the mean signal-to-noise ratio compared to other approaches. According to acoustic analysis, the system significantly decreased the number of unvoiced frames (proportion of voiced frames) from 40% to 10% while maintaining stable proportions of voiced frames (proportion of voiced speech frames) and average voicing evidence (average voice evidence in voiced frames), indicating the accuracy of the approach in selectively attenuating noise and undesired speech artifacts while preserving important speech information.
Collapse
Affiliation(s)
- Rytis Maskeliūnas
- Centre of Real Time Computer Systems, Kaunas University of Technology, Kaunas, Lithuania.
| | - Robertas Damaševičius
- Centre of Real Time Computer Systems, Kaunas University of Technology, Kaunas, Lithuania
| | - Audrius Kulikajevas
- Centre of Real Time Computer Systems, Kaunas University of Technology, Kaunas, Lithuania
| | - Kipras Pribuišis
- Department of Otolaryngology, Lithuanian University of Health Sciences, Kaunas, Lithuania
| | - Virgilijus Uloza
- Department of Otolaryngology, Lithuanian University of Health Sciences, Kaunas, Lithuania
| |
Collapse
|
2
|
Coyne M, Dellafaille J, Riede T. Postnatal changes in thyroid cartilage shape and cartilage matrix composition are not synchronized in Mus musculus. J Anat 2024; 244:739-748. [PMID: 38303104 PMCID: PMC11021632 DOI: 10.1111/joa.14006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2023] [Revised: 11/29/2023] [Accepted: 12/29/2023] [Indexed: 02/03/2024] Open
Abstract
The study was conducted to quantify laryngeal cartilage matrix composition and to investigate its relationship with cartilage shape in a mouse model. A sample of 30 mice (CD-1 mouse, Mus musculus) from five age groups (postnatal Days 2, 21, 90, 365, and 720) were used. Three-dimensional mouse laryngeal thyroid cartilage reconstructions were generated from contrast-enhanced micro-computed tomography (CT) image stacks. Cartilage matrix composition was estimated as Hounsfield units (HU). HU were determined by overlaying 3D reconstructions as masks on micro-CT image stacks and then measuring the attenuation. Cartilage shape was quantified with landmarks placed on the surface of the thyroid cartilage. Shape differences between the five age groups were analyzed using geometric morphometrics and multiparametric analysis of landmarks. The relationship between HU and shape was investigated with correlational analyses. Among five age groups, HU became higher in older animals. The shape of the thyroid cartilage changes with age throughout the entire life of a mouse. The changes in shape were not synchronized with changes in cartilage matrix composition. The thyroid cartilage of young and old M. musculus larynx showed a homogenous mineralization pattern. High-resolution contrast-enhanced micro-CT imaging makes the mouse larynx accessible for analysis of genetic and environmental factors affecting shape and matrix composition.
Collapse
Affiliation(s)
- Megan Coyne
- College of Veterinary Medicine, Midwestern University, Glendale, Arizona, USA
| | | | - Tobias Riede
- College of Veterinary Medicine, Midwestern University, Glendale, Arizona, USA
- College of Graduate Studies, Department of Physiology, Midwestern University, Glendale, Arizona, USA
| |
Collapse
|
3
|
Gocht A, Lüers G, Schumacher U. Amount and spatial arrangement of muscle fibers in the human laryngeal Musculus ventricularis. Clin Anat 2023; 36:1138-1146. [PMID: 37092576 DOI: 10.1002/ca.24050] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2022] [Revised: 03/18/2023] [Accepted: 04/10/2023] [Indexed: 04/25/2023]
Abstract
Textbooks and atlases of human macroscopic and microscopic anatomy of the larynx generally provide, if at all, only sparse information on the laryngeal Musculus ventricularis. However, several studies indicate that this muscle takes over the function of vestibular (ventricular) fold phonation after denervation of the Musculus vocalis. In the present study, 29 laryngeal specimens were coronally dissected at different levels, i.e. the anterior (L1), middle (L2), and posterior third of the vestibular fold (L3), and they underwent histological analysis. In all specimens the vestibular folds of both hemi-larynxes contained striated muscle bundles in variable amounts, representing a ventricularis muscle. These muscle bundles obviously originated from the lateral (external) and thyroepiglottic part of the thyroarytenoid muscle and the aryepiglottic part of the oblique arytenoid muscle, as has been described by other authors. The areas of vestibular folds and their amounts of ventricularis muscle bundles were measured using image analysis software (imageJ) by manual tracing. The mean area of the vestibular folds of both hemi-larynxes was 27.9 mm2 (SD [standard deviation] ± 9.17), and the area occupied by fibers of the ventricularis muscle was 1.5 mm2 (SD ± 1.78). Statistical analysis comparing the areas of both hemi-larynxes and levels resulted in no significant differences, except for the levels 2 and 3. In level 2, significantly more muscle fibers (2.0 mm2 ; SD ± 2.21) were detectable within the vestibular fold than in level 3 (0.9 mm2 ; SD ± 1.43). Level 1 also contained more muscle fibers (1.1 mm2 ; SD ± 1.06) than level 3, however, without significance. In conclusion, the laryngeal ventricularis muscle is present in the majority of reported cases. Since the muscle is of clinical relevance, it should be included in anatomical textbooks by default.
Collapse
Affiliation(s)
- Andreas Gocht
- Institut für Pathologie, Universitätsklinikum Schleswig-Holstein, Lübeck, Germany
- Institut für Anatomie und Experimentelle Morphologie, Universitätsklinikum Hamburg-Eppendorf, Hamburg, Germany
| | - Georg Lüers
- Department Humanmedizin, Medical School Hamburg, Hamburg, Germany
| | - Udo Schumacher
- Institut für Anatomie und Experimentelle Morphologie, Universitätsklinikum Hamburg-Eppendorf, Hamburg, Germany
- Department Humanmedizin, Medical School Berlin, Berlin, Germany
| |
Collapse
|
4
|
Serry MA, Alzamendi GA, Zañartu M, Peterson SD. Modeling the influence of the extrinsic musculature on phonation. Biomech Model Mechanobiol 2023; 22:1365-1378. [PMID: 37169957 PMCID: PMC10529543 DOI: 10.1007/s10237-023-01724-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2022] [Accepted: 04/11/2023] [Indexed: 05/13/2023]
Abstract
Neck muscles play important roles in various physiological tasks, including swallowing, head stabilization, and phonation. The mechanisms by which neck muscles influence phonation are not well understood, with conflicting reports on the change in fundamental frequency for ostensibly the same neck muscle activation scenarios. In this work, we introduce a reduced-order muscle-controlled vocal fold model, comprising both intrinsic muscle control and extrinsic muscle effects. The model predicts that when the neck muscles pull the thyroid cartilage in the superior-anterior direction (with a sufficiently large anterior component), inferior direction, or inferior-anterior direction, tension in the vocal folds increases, leading to fundamental frequency rise during sustained phonation. On the other hand, pulling in the superior direction, superior-posterior direction, or inferior-posterior direction (with a sufficiently large posterior component) tends to decrease vocal fold tension and phonation fundamental frequency. Varying the pulling force location alters the posture and phonation biomechanics, depending on the force direction. These findings suggest potential roles of particular neck muscles in modulating phonation fundamental frequency, with implications for vocal hyperfunction.
Collapse
Affiliation(s)
- Mohamed A Serry
- Mechanical and Mechatronics Engineering, University of Waterloo, Waterloo, Ontario, N2L 3G1, Canada
| | - Gabriel A Alzamendi
- Institute for Research and Development on Bioengineering and Bioinformatics (IBB), CONICET-UNER, Oro Verde, Entre Ríos, 3100, Argentina
| | - Matías Zañartu
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso, Chile
| | - Sean D Peterson
- Mechanical and Mechatronics Engineering, University of Waterloo, Waterloo, Ontario, N2L 3G1, Canada.
| |
Collapse
|
5
|
Howson PJ, Moisik S, Żygis M. Lateral vocalization in Brazilian Portuguese. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2022; 152:281. [PMID: 35931518 DOI: 10.1121/10.0012186] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/03/2021] [Accepted: 06/11/2022] [Indexed: 06/15/2023]
Abstract
Lateral vocalization is a cross-linguistically common phenomenon where a lateral is realized as a glide, such as [w, j], or a vowel [u, i]. In this paper, we focus on the articulatory triggers that could cause lateral vocalization. We examined Brazilian Portuguese, a language known for the process of lateral vocalization in coda position. We examined the lateral in onset and coda position in four vocalic environments and compared the dynamic tongue contours and contours at the point of maximum constriction in each environment. We also performed biomechanical simulations of lateral articulation and the vocalized lateral. The results indicate increased tongue body retraction in coda position, which is accompanied by tongue body raising. Simulations further revealed that vocalized laterals mainly recruit intrinsic lingual muscles along with the styloglossus. Taken together, the data suggest that vocalization is a result of positional phonetic effects including lenition and additional retraction in the coda position.
Collapse
Affiliation(s)
- Phil J Howson
- Leibniz-Zentrum Allgemeine Sprachwissenscaft, Schützenstraße 18, D-10117 Berlin, Germany
| | - Scott Moisik
- Division of Linguistics and Multilingual Studies, Nanyang Technological University, 50 Nanyang Ave, Singapore 639798
| | - Marzena Żygis
- Leibniz-Zentrum Allgemeine Sprachwissenscaft and Humboldt Universität, Berlin Schützenstraße 18, D-10117 Berlin, Germany
| |
Collapse
|
6
|
Alzamendi GA, Peterson SD, Erath BD, Hillman RE, Zañartu M. Triangular body-cover model of the vocal folds with coordinated activation of the five intrinsic laryngeal muscles. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2022; 151:17. [PMID: 35105008 PMCID: PMC8727069 DOI: 10.1121/10.0009169] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/14/2021] [Revised: 11/24/2021] [Accepted: 12/06/2021] [Indexed: 05/26/2023]
Abstract
Poor laryngeal muscle coordination that results in abnormal glottal posturing is believed to be a primary etiologic factor in common voice disorders such as non-phonotraumatic vocal hyperfunction. Abnormal activity of antagonistic laryngeal muscles is hypothesized to play a key role in the alteration of normal vocal fold biomechanics that results in the dysphonia associated with such disorders. Current low-order models of the vocal folds are unsatisfactory to test this hypothesis since they do not capture the co-contraction of antagonist laryngeal muscle pairs. To address this limitation, a self-sustained triangular body-cover model with full intrinsic muscle control is introduced. The proposed scheme shows good agreement with prior studies using finite element models, excised larynges, and clinical studies in sustained and time-varying vocal gestures. Simulations of vocal fold posturing obtained with distinct antagonistic muscle activation yield clear differences in kinematic, aerodynamic, and acoustic measures. The proposed tool is deemed sufficiently accurate and flexible for future comprehensive investigations of non-phonotraumatic vocal hyperfunction and other laryngeal motor control disorders.
Collapse
Affiliation(s)
- Gabriel A Alzamendi
- Institute for Research and Development on Bioengineering and Bioinformatics (IBB), CONICET-UNER, Oro Verde, Entre Ríos 3100, Argentina
| | - Sean D Peterson
- Mechanical and Mechatronics Engineering, University of Waterloo, Waterloo, Ontario, N2L 3G1, Canada
| | - Byron D Erath
- Department of Mechanical and Aerospace Engineering, Clarkson University, Potsdam, New York 13699, USA
| | - Robert E Hillman
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, Massachusetts 02114, USA
| | - Matías Zañartu
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso 2390123, Chile
| |
Collapse
|
7
|
Chiu C, Lu YA. Articulatory Evidence for the Syllable-final Nasal Merging in Taiwan Mandarin. LANGUAGE AND SPEECH 2021; 64:771-789. [PMID: 33300459 DOI: 10.1177/0023830920948084] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Syllable-final nasals /n/ and /ŋ/ in Taiwan Mandarin have been reported to be undergoing merging. Perceptual studies have reported that the alleged merging is context-sensitive and the merging directions are vowel-dependent. These findings have been mostly attributed to dialectal and social factors. The current study uses ultrasonography to capture postures of the entire tongue during the production of syllable-final nasals. The results, though confirming previous findings that the merging directions of syllable-final nasals are vowel-dependent, are best accounted for by the biomechanics of the tongue, as supported by computational 3D model simulations. Furthermore, for some speakers, although nasals were merged in terms of tongue posture, the degrees of nasalization of the preceding vowel were contrastive, suggesting that the merging process may be incomplete.
Collapse
Affiliation(s)
- Chenhao Chiu
- Graduate Institute of Linguistics, Neurobiology and Cognitive Science Center, National Taiwan University, Taiwan
| | - Yu-An Lu
- Department of Foreign Languages and Literatures, National Chiao Tung University, Taiwan
| |
Collapse
|
8
|
Gick B, Mayer C, Chiu C, Widing E, Roewer-Després F, Fels S, Stavness I. Quantal biomechanical effects in speech postures of the lips. J Neurophysiol 2020; 124:833-843. [PMID: 32727259 DOI: 10.1152/jn.00676.2019] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
The unique biomechanical and functional constraints on human speech make it a promising area for research investigating modular control of movement. The present article illustrates how a modular control approach to speech can provide insights relevant to understanding both motor control and observed variation across languages. We specifically explore the robust typological finding that languages produce different degrees of labial constriction using distinct muscle groupings and concomitantly distinct lip postures. Research has suggested that these lip postures exploit biomechanical regions of nonlinearity between neural activation and movement, also known as quantal regions, to allow movement goals to be realized despite variable activation signals. We present two sets of computer simulations showing that these labial postures can be generated under the assumption of modular control and that the corresponding modules are biomechanically robust: first to variation in the activation levels of participating muscles, and second to interference from surrounding muscles. These results provide support for the hypothesis that biomechanical robustness is an important factor in selecting the muscle groupings used for speech movements and provide insight into the neurological control of speech movements and how biomechanical and functional constraints govern the emergence of speech motor modules. We anticipate that future experimental work guided by biomechanical simulation results will provide new insights into the neural organization of speech movements.NEW & NOTEWORTHY This article provides additional evidence that speech motor control is organized in a modular fashion and that biomechanics constrain the kinds of motor modules that may emerge. It also suggests that speech can be a fruitful domain for the study of modularity and that a better understanding of speech motor modules will be useful for speech research. Finally, it suggests that biomechanical modeling can serve as a useful complement to experimental work when studying modularity.
Collapse
Affiliation(s)
- Bryan Gick
- Department of Linguistics, University of British Columbia, Vancouver, British Columbia, Canada
| | - Connor Mayer
- Department of Linguistics, University of California, Los Angeles, Los Angeles, California
| | - Chenhao Chiu
- Graduate Institute of Linguistics, National Taiwan University, Taipei, Taiwan
| | - Erik Widing
- Department of Computer Science, University of Saskatchewan, Saskatoon, Saskatchewan, Canada
| | | | - Sidney Fels
- Department of Electrical and Computer Engineering, University of British Columbia, Vancouver, British Columbia, Canada
| | - Ian Stavness
- Department of Computer Science, University of Saskatchewan, Saskatoon, Saskatchewan, Canada
| |
Collapse
|
9
|
|