1
|
Mainka A, Platzek I, Klimova A, Mattheus W, Fleischer M, Mürbe D. Relationship between epilarynx tube shape and the radiated sound pressure level during phonation is gender specific. LOGOP PHONIATR VOCO 2023; 48:44-56. [PMID: 34644212 DOI: 10.1080/14015439.2021.1988143] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
Abstract
OBJECTIVE/HYPOTHESIS The aim of the study was to measure the morphology of the epilaryngeal tube during sustained phonation as a function of loudness variation and to compare subjects of different genders. STUDY DESIGN This is a prospective study. METHODS Five female and five male classically trained singers were recorded by magnetic resonance imaging with simultaneous audio recordings while sustaining phonation at three different loudness conditions. Three-dimensional subsections of the vocal tract were segmented on multi-image-based cross-sections. Different volume and area measures were determined and their relation to sound pressure level and loudness condition was analyzed. RESULTS Male singers tended to narrow the epilaryngeal tube when increasing sound pressure level whereas female singers did not. CONCLUSION Strategies of vocal tract adjustments during loudness variation in classical singing appear to be gender specific.
Collapse
Affiliation(s)
- Alexander Mainka
- Clinic for Audiology and Phoniatry, Charité Universitätsmedizin Berlin, Berlin, Germany
- Voice Research Laboratory, Hochschule für Musik Carl Maria von Weber, Dresden, Germany
| | - Ivan Platzek
- Department of Radiology, Universitätsklinikum Carl Gustav Carus, Technische Universität Dresden, Dresden, Germany
| | - Anna Klimova
- National Center for Tumor Diseases, Partner Site Dresden, Institute for Medical Informatics and Biometry, Dresden, Germany
| | - Willy Mattheus
- Department of Otorhinolaryngology, Universitätsklinikum Carl Gustav Carus, Technische Universität Dresden, Dresden, Germany
| | - Mario Fleischer
- Clinic for Audiology and Phoniatry, Charité Universitätsmedizin Berlin, Berlin, Germany
- Voice Research Laboratory, Hochschule für Musik Carl Maria von Weber, Dresden, Germany
| | - Dirk Mürbe
- Clinic for Audiology and Phoniatry, Charité Universitätsmedizin Berlin, Berlin, Germany
- Voice Research Laboratory, Hochschule für Musik Carl Maria von Weber, Dresden, Germany
| |
Collapse
|
2
|
Kist AM, Gómez P, Dubrovskiy D, Schlegel P, Kunduk M, Echternach M, Patel R, Semmler M, Bohr C, Dürr S, Schützenberger A, Döllinger M. A Deep Learning Enhanced Novel Software Tool for Laryngeal Dynamics Analysis. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2021; 64:1889-1903. [PMID: 34000199 DOI: 10.1044/2021_jslhr-20-00498] [Citation(s) in RCA: 42] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Purpose High-speed videoendoscopy (HSV) is an emerging, but barely used, endoscopy technique in the clinic to assess and diagnose voice disorders because of the lack of dedicated software to analyze the data. HSV allows to quantify the vocal fold oscillations by segmenting the glottal area. This challenging task has been tackled by various studies; however, the proposed approaches are mostly limited and not suitable for daily clinical routine. Method We developed a user-friendly software in C# that allows the editing, motion correction, segmentation, and quantitative analysis of HSV data. We further provide pretrained deep neural networks for fully automatic glottis segmentation. Results We freely provide our software Glottis Analysis Tools (GAT). Using GAT, we provide a general threshold-based region growing platform that enables the user to analyze data from various sources, such as in vivo recordings, ex vivo recordings, and high-speed footage of artificial vocal folds. Additionally, especially for in vivo recordings, we provide three robust neural networks at various speed and quality settings to allow a fully automatic glottis segmentation needed for application by untrained personnel. GAT further evaluates video and audio data in parallel and is able to extract various features from the video data, among others the glottal area waveform, that is, the changing glottal area over time. In total, GAT provides 79 unique quantitative analysis parameters for video- and audio-based signals. Many of these parameters have already been shown to reflect voice disorders, highlighting the clinical importance and usefulness of the GAT software. Conclusion GAT is a unique tool to process HSV and audio data to determine quantitative, clinically relevant parameters for research, diagnosis, and treatment of laryngeal disorders. Supplemental Material https://doi.org/10.23641/asha.14575533.
Collapse
Affiliation(s)
- Andreas M Kist
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology-Head & Neck Surgery, University Hospital Erlangen, Germany
| | - Pablo Gómez
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology-Head & Neck Surgery, University Hospital Erlangen, Germany
| | - Denis Dubrovskiy
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology-Head & Neck Surgery, University Hospital Erlangen, Germany
| | - Patrick Schlegel
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology-Head & Neck Surgery, University Hospital Erlangen, Germany
| | - Melda Kunduk
- Department of Communication Sciences and Disorders, Louisiana State University, Baton Rouge
| | - Matthias Echternach
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Munich University Hospital (LMU), Germany
| | - Rita Patel
- Department of Speech, Language and Hearing Sciences, College of Arts and Sciences, Indiana University, Bloomington
| | - Marion Semmler
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology-Head & Neck Surgery, University Hospital Erlangen, Germany
| | - Christopher Bohr
- Klinik und Poliklinik für Hals-Nasen-Ohren-Heilkunde Universitätsklinikum Regensburg, Germany
| | - Stephan Dürr
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology-Head & Neck Surgery, University Hospital Erlangen, Germany
| | - Anne Schützenberger
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology-Head & Neck Surgery, University Hospital Erlangen, Germany
| | - Michael Döllinger
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology-Head & Neck Surgery, University Hospital Erlangen, Germany
| |
Collapse
|
3
|
Fleischer M, Mainka A, Kürbis S, Birkholz P. How to precisely measure the volume velocity transfer function of physical vocal tract models by external excitation. PLoS One 2018; 13:e0193708. [PMID: 29543829 PMCID: PMC5854283 DOI: 10.1371/journal.pone.0193708] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2017] [Accepted: 02/18/2018] [Indexed: 11/18/2022] Open
Abstract
Recently, 3D printing has been increasingly used to create physical models of the vocal tract with geometries obtained from magnetic resonance imaging. These printed models allow measuring the vocal tract transfer function, which is not reliably possible in vivo for the vocal tract of living humans. The transfer functions enable the detailed examination of the acoustic effects of specific articulatory strategies in speaking and singing, and the validation of acoustic plane-wave models for realistic vocal tract geometries in articulatory speech synthesis. To measure the acoustic transfer function of 3D-printed models, two techniques have been described: (1) excitation of the models with a broadband sound source at the glottis and measurement of the sound pressure radiated from the lips, and (2) excitation of the models with an external source in front of the lips and measurement of the sound pressure inside the models at the glottal end. The former method is more frequently used and more intuitive due to its similarity to speech production. However, the latter method avoids the intricate problem of constructing a suitable broadband glottal source and is therefore more effective. It has been shown to yield a transfer function similar, but not exactly equal to the volume velocity transfer function between the glottis and the lips, which is usually used to characterize vocal tract acoustics. Here, we revisit this method and show both, theoretically and experimentally, how it can be extended to yield the precise volume velocity transfer function of the vocal tract.
Collapse
Affiliation(s)
- Mario Fleischer
- Division of Phoniatrics and Audiology, Department of Otorhinolaryngology, Faculty of Medicine Carl Gustav Carus, Technische Universität Dresden, Fetscherstrasse 74, 01307 Dresden, Germany
- * E-mail:
| | - Alexander Mainka
- Division of Phoniatrics and Audiology, Department of Otorhinolaryngology, Faculty of Medicine Carl Gustav Carus, Technische Universität Dresden, Fetscherstrasse 74, 01307 Dresden, Germany
- Voice Research Laboratory, Hochschule für Musik Carl Maria von Weber Dresden, Wettiner Platz 13, 01067 Dresden, Germany
| | - Steffen Kürbis
- Institute of Acoustics and Speech Communication, Faculty of Electrical and Computer Engineering, Technische Universität Dresden, Helmholtzstrasse 18, 01062 Dresden, Germany
| | - Peter Birkholz
- Institute of Acoustics and Speech Communication, Faculty of Electrical and Computer Engineering, Technische Universität Dresden, Helmholtzstrasse 18, 01062 Dresden, Germany
| |
Collapse
|
4
|
Three-dimensional Vocal Tract Morphology Based on Multiple Magnetic Resonance Images Is Highly Reproducible During Sustained Phonation. J Voice 2017; 31:504.e11-504.e20. [DOI: 10.1016/j.jvoice.2016.11.009] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2016] [Revised: 11/09/2016] [Accepted: 11/10/2016] [Indexed: 11/21/2022]
|