1
|
Echternach M, Burk F, Köberlein M, Döllinger M, Burdumy M, Richter B, Titze IR, Elemans CPH, Herbst CT. Biomechanics of sound production in high-pitched classical singing. Sci Rep 2024; 14:13132. [PMID: 38849382 PMCID: PMC11161605 DOI: 10.1038/s41598-024-62598-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2023] [Accepted: 05/20/2024] [Indexed: 06/09/2024] Open
Abstract
Voice production of humans and most mammals is governed by the MyoElastic-AeroDynamic (MEAD) principle, where an air stream is modulated by self-sustained vocal fold oscillation to generate audible air pressure fluctuations. An alternative mechanism is found in ultrasonic vocalizations of rodents, which are established by an aeroacoustic (AA) phenomenon without vibration of laryngeal tissue. Previously, some authors argued that high-pitched human vocalization is also produced by the AA principle. Here, we investigate the so-called "whistle register" voice production in nine professional female operatic sopranos singing a scale from C6 (≈ 1047 Hz) to G6 (≈ 1568 Hz). Super-high-speed videolaryngoscopy revealed vocal fold collision in all participants, with closed quotients from 30 to 73%. Computational modeling showed that the biomechanical requirements to produce such high-pitched voice would be an increased contraction of the cricothyroid muscle, vocal fold strain of about 50%, and high subglottal pressure. Our data suggest that high-pitched operatic soprano singing uses the MEAD mechanism. Consequently, the commonly used term "whistle register" does not reflect the physical principle of a whistle with regard to voice generation in high pitched classical singing.
Collapse
Affiliation(s)
- Matthias Echternach
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, LMU University Hospital, Marchioninistr. 15, 81377, Munich, Germany.
| | - Fabian Burk
- Department of Otorhinolaryngology and Plastic Surgery, SRH Wald-Klinikum Gera, Strasse des Friedens 122, Gera, Germany
| | - Marie Köberlein
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, LMU University Hospital, Marchioninistr. 15, 81377, Munich, Germany
| | - Michael Döllinger
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology Head and Neck Surgery, University Hospital Erlangen, Waldstr. 1, 91054, Erlangen, Germany
| | - Michael Burdumy
- Department of Medical Physics, Department of Radiology, Faculty of Medicine, Medical Center-University of Freiburg, Breisacher Str. 60, 79106, Freiburg, Germany
| | - Bernhard Richter
- Institute of Musicians' Medicine, Freiburg University Medical Center and Faculty of Medicine Freiburg University, Elsässer Str. 2m, 79110, Freiburg, Germany
| | - Ingo R Titze
- Utah Center for Vocology, 240 S 1500 E, Room 206, Salt Lake City, UT, 84112, USA
| | - Coen P H Elemans
- Vocal Neuromechanics Lab, Sound Communication and Behavior Group, Department of Biology, University of Southern Denmark, Campusvej 55, DK-5230, Odense M, Denmark
| | - Christian T Herbst
- Department of Behavioural and Cognitive Biology, University of Vienna, Djerassiplatz 1, 1030, Vienna, Austria.
- Janette Ogg Voice Research Center, Shenandoah Conservatory, Winchester, VA, USA.
| |
Collapse
|
2
|
Echternach M, Köberlein M, Döllinger M, Kirsch J, Pilsl T. Does forced whisper have an impact on voice parameters? Eur Arch Otorhinolaryngol 2024:10.1007/s00405-024-08698-7. [PMID: 38709324 DOI: 10.1007/s00405-024-08698-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2024] [Accepted: 04/18/2024] [Indexed: 05/07/2024]
Abstract
OBJECTIVES There has been the assumption that whispering may impact vocal function, leading to the widespread recommendation against its practice after phonosurgery. However, the extent to which whispering affects vocal function and vocal fold oscillation patterns remains unclear. METHODS 10 vocally healthy subjects (5 male, 5 female) were instructed to forcefully whisper a standardized text for 10 min at a sound level of 70 dB(A), measured at a microphone distance of 30 cm to the mouth. Prior to and following the whisper loading, the dysphonia severity index was assessed. Simultaneously, recordings of high speed videolaryngoscopy (HSV), electroglottography, and audio signals during sustained phonation on the vowel /i/ (250 Hz for females and 125 Hz for males) were analyzed after segmentation of the HSV material. RESULTS The pre-post analysis revealed only minor changes after the intervention. These changes included a rise in minimum intensity, an increase in the glottal area waveform-derived open quotient, and the glottal gap index. However, no statistically significant changes were observed in the harmonic-to-noise-ratio, the glottal- to-noise-excitation-ratio, and the electroglottographic open quotient. CONCLUSION Overall, the study suggests that there are only small effects on vocal function in consequence of a forced whisper loading.
Collapse
Affiliation(s)
- Matthias Echternach
- Division Phoniatrics and Pediatric Audiology, Department of Otolaryngology, Munich University Hospital and Faculty of Medicine, Munich University (LMU), Campus Großhadern, Marchioninistraße 15, 81377, Munich, Germany.
| | - Marie Köberlein
- Division Phoniatrics and Pediatric Audiology, Department of Otolaryngology, Munich University Hospital and Faculty of Medicine, Munich University (LMU), Campus Großhadern, Marchioninistraße 15, 81377, Munich, Germany
| | - Michael Döllinger
- Division of Phoniatrics and Pediatric Audiology, Department of Otolaryngology Head & Neck Surgery, University Hospital Erlangen, FAU Erlangen-Nuremberg, Erlangen, Germany
| | - Jonas Kirsch
- Division Phoniatrics and Pediatric Audiology, Department of Otolaryngology, Munich University Hospital and Faculty of Medicine, Munich University (LMU), Campus Großhadern, Marchioninistraße 15, 81377, Munich, Germany
| | - Theresa Pilsl
- Division Phoniatrics and Pediatric Audiology, Department of Otolaryngology, Munich University Hospital and Faculty of Medicine, Munich University (LMU), Campus Großhadern, Marchioninistraße 15, 81377, Munich, Germany
| |
Collapse
|
3
|
Echternach M, Burk F, Kirsch J, Traser L, Birkholz P, Burdumy M, Richter B. Articulatory and acoustic differences between lyric and dramatic singing in Western classical music. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2024; 155:2659-2669. [PMID: 38634661 DOI: 10.1121/10.0025751] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/29/2023] [Accepted: 03/27/2024] [Indexed: 04/19/2024]
Abstract
Within the realm of voice classification, singers could be sub-categorized by the weight of their repertoire, the so-called "singer's Fach." However, the opposite pole terms "lyric" and "dramatic" singing are not yet well defined by their acoustic and articulatory characteristics. Nine professional singers of different singers' Fach were asked to sing a diatonic scale on the vowel /a/, first in what the singers considered as lyric and second in what they considered as dramatic. Image recording was performed using real time magnetic resonance imaging (MRI) with 25 frames/s, and the audio signal was recorded via an optical microphone system. Analysis was performed with regard to sound pressure level (SPL), vibrato amplitude, and frequency and resonance frequencies as well as articulatory settings of the vocal tract. The analysis revealed three primary differences between dramatic and lyric singing: Dramatic singing was associated with greater SPL and greater vibrato amplitude and frequency as well as lower resonance frequencies. The higher SPL is an indication of voice source changes, and the lower resonance frequencies are probably caused by the lower larynx position. However, all these strategies showed a considerable individual variability. The singers' Fach might contribute to perceptual differences even for the same singer with regard to the respective repertoire.
Collapse
Affiliation(s)
- Matthias Echternach
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Munich University Hospital (LMU), Marchioninistr. 15, 81377 Munich, Germany
| | - Fabian Burk
- Department of Otorhinolaryngology and Plastic Surgery, SRH Wald-Klinikum Gera, Str. des Friedens 122, 07548 Gera, Germany
| | - Jonas Kirsch
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Munich University Hospital (LMU), Marchioninistr. 15, 81377 Munich, Germany
| | - Louisa Traser
- Institute of Musicians' Medicine, Faculty of Medicine, Freiburg University and Freiburg University Medical Center, Breisacher Str. 60, 79106 Freiburg, Germany
| | - Peter Birkholz
- Institute of Acoustics and Speech Communication, Technische Universität Dresden, 01062 Dresden, Germany
| | - Michael Burdumy
- Institute of Musicians' Medicine, Faculty of Medicine, Freiburg University and Freiburg University Medical Center, Breisacher Str. 60, 79106 Freiburg, Germany
- Department of Medical Physics, Radiology, Faculty of Medicine, Freiburg University and Freiburg University Medical Center Breisacher Str. 60, 79106 Freiburg, Germany
| | - Bernhard Richter
- Institute of Musicians' Medicine, Faculty of Medicine, Freiburg University and Freiburg University Medical Center, Breisacher Str. 60, 79106 Freiburg, Germany
| |
Collapse
|
4
|
Echternach M, Nusseck M, Strasding M, Richter B. Differences of Electroglottographical Contact Quotients between Connected Speech and Sustained Phonation in Clinical Measurement of Voice. J Voice 2023:S0892-1997(23)00077-2. [PMID: 36941166 DOI: 10.1016/j.jvoice.2023.02.020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2023] [Revised: 02/15/2023] [Accepted: 02/15/2023] [Indexed: 03/23/2023]
Abstract
INTRODUCTION In clinical practice, sustained phonation is mostly used for acoustic voice measurements, while perceptual evaluation is based on connected speech. Since sustained phonation could be associated with the use of the singing voice, and since vocal registers are more relevant for singing rather than speech, it is unclear if vocal registers contribute to observable vocal fold contact differences between sustained phonation and speech. MATERIAL AND METHODS Sustained phonation (vowel [a] on comfortable pitch and loudness) and connected speech (German text: Der Nordwind und die Sonne) were analyzed for 1216 subjects (426 with and 790 without dysphonia) using the Laryngograph system (combining electroglottography and audio recordings). From these samples, fundamental frequency (ƒo), contact quotient (CQ), sound pressure level (SPL) and frequency perturbation (jitter first for sustained and cFx for connected speech) were evaluated. RESULTS Compared to connected speech, the values of ƒo and SPL were higher for sustained phonation. For female voices, ƒo difference was greater than for male voices. At the same time, and only for the females, CQ was lower for the sustained phonation, indicating a register difference. CONCLUSION In order to achieve a better comparability, sustained phonation should be standardized regarding the ƒo and SPL values in correspondence to the ƒo and SPL range of reading a text. This should also reduce the risk of using a different register for different types of phonation.
Collapse
Affiliation(s)
- Matthias Echternach
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Munich University Hospital (LMU), Munich, Germany.
| | - Manfred Nusseck
- Institute of Musicians' Medicine, University of Freiburg Medical Center, Faculty of Medicine, University of Freiburg, Freiburg, Germany
| | - Malin Strasding
- Division of Fixed Prosthodontics and Biomaterials, Université de Genève, Geneve, Switzerland
| | - Bernhard Richter
- Institute of Musicians' Medicine, University of Freiburg Medical Center, Faculty of Medicine, University of Freiburg, Freiburg, Germany
| |
Collapse
|
5
|
Influence of Loudness on Vocal Stability in the Male Passaggio. J Voice 2023; 37:296.e1-296.e8. [PMID: 33455852 DOI: 10.1016/j.jvoice.2020.12.044] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2020] [Revised: 12/11/2020] [Accepted: 12/15/2020] [Indexed: 11/20/2022]
Abstract
INTRODUCTION Vocal registers and the frequency region where registration events occur, the passaggio, have been in focus of scientific research for almost 200 years. In professional tenors, it has been shown before that singing across the passaggio avoiding a register shift and therefore using their stage voice above the passaggio (SVaP) is associated with greater vocal stability than a register change to the falsetto. However, it is unclarified how much different loudness conditions contribute to this vocal stability. MATERIAL AND METHODS Six professional tenors were asked to perform four pitch glides from A3 to A4 (220-440 Hz) on the vowel [i:]. These glides included (1) the passaggio from modal register to falsetto. The following glides into SVaP were performed under different loudness conditions, (2) mezzoforte (average loudness), (3) pianissimo (as quietly as possible), and (4) fortissimo (the loudest possible). During phonation, high speed videoendoscopy (HSV), electroglottography, and audio signals were recorded simultaneously. The glottal area waveform was derived based on the HSV material. RESULTS Modal to falsetto transitions were associated with relatively low sound pressure level and rise of open quotients (OQ) for the falsetto. Transitions to SVaP showed a clear dependence on the intended loudness. The OQs were lower the louder the task was. There was no clear evidence that transitions with softer voice showed greater stability of vocal fold oscillation patterns than louder tasks. CONCLUSIONS The vocal fold oscillation pattern show- differences among various loudness conditions within the tenors' passaggio but no clear differences with regard to oscillatory stability.
Collapse
|
6
|
Lehoux S, Popeil L, Švec JG. Laryngeal and Acoustic Analysis of Chest and Head Registers Extended Across a Three-Octave Range: A Case Study. J Voice 2022:S0892-1997(22)00053-4. [PMID: 35504793 DOI: 10.1016/j.jvoice.2022.02.014] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2021] [Revised: 02/09/2022] [Accepted: 02/11/2022] [Indexed: 10/18/2022]
Abstract
Voice registers are assumed to be related to different laryngeal adjustments, but objective evidence has been insufficient. While chest register is usually associated with the lower pitch range, and head register with the higher pitch range, here we investigated a professional singer who claimed an ability to produce both these registers at every pitch, throughout her entire singing range. The singer performed separated phonations alternating between the two registers (further called chest-like and head-like) at all pitches from C3 (131 Hz) to C6 (1047 Hz). We monitored the vocal fold vibrations using high-speed video endoscopy and electroglottography. The microphone sound was recorded and used for blind listening tests performed by the three authors (insiders) and by six "naive" participants (outsiders). The outsiders correctly identified the registers in 64% of the cases, and the insiders in 89% of the cases. Objective analysis revealed larger closed quotient and vertical phase differences for the chest-like register within the lower range below G4 (<392 Hz), and also a larger closed quotient at the membranous glottis within the higher range above Bb4 (>466 Hz), but not between Ab4-A4 (415-440 Hz). The normalized amplitude quotient was consistently lower in the chest-like register throughout the entire range. The results indicate that that the singer employed subtle laryngeal control mechanisms for the chest-like and head-like phonations on top of the traditionally recognized low-pitched chest and high-pitched head register phenomena. Across all pitches, the chest-like register was produced with more rapid glottal closure that was usually, but not necessarily, accompanied also by stronger adduction of membranous glottis. These register changes were not always easily perceivable by listeners, however.
Collapse
Affiliation(s)
- Sarah Lehoux
- Voice Research Lab, Department of Experimental Physics, Faculty of Science, Palacký University, Olomouc, Czech Republic
| | | | - Jan G Švec
- Voice Research Lab, Department of Experimental Physics, Faculty of Science, Palacký University, Olomouc, Czech Republic.
| |
Collapse
|
7
|
Echternach M, Herbst CT, Köberlein M, Story B, Döllinger M, Gellrich D. Are source-filter interactions detectable in classical singing during vowel glides? THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2021; 149:4565. [PMID: 34241428 DOI: 10.1121/10.0005432] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/04/2020] [Accepted: 06/03/2021] [Indexed: 06/13/2023]
Abstract
In recent studies, it has been assumed that vocal tract formants (Fn) and the voice source could interact. However, there are only few studies analyzing this assumption in vivo. Here, the vowel transition /i/-/a/-/u/-/i/ of 12 professional classical singers (6 females, 6 males) when phonating on the pitch D4 [fundamental frequency (ƒo) ca. 294 Hz] were analyzed using transnasal high speed videoendoscopy (20.000 fps), electroglottography (EGG), and audio recordings. Fn data were calculated using a cepstral method. Source-filter interaction candidates (SFICs) were determined by (a) algorithmic detection of major intersections of Fn/nƒo and (b) perceptual assessment of the EGG signal. Although the open quotient showed some increase for the /i-a/ and /u-i/ transitions, there were no clear effects at the expected Fn/nƒo intersections. In contrast, ƒo adjustments and changes in the phonovibrogram occurred at perceptually derived SFICs, suggesting level-two interactions. In some cases, these were constituted by intersections between higher nƒo and Fn. The presented data partially corroborates that vowel transitions may result in level-two interactions also in professional singers. However, the lack of systematically detectable effects suggests either the absence of a strong interaction or existence of confounding factors, which may potentially counterbalance the level-two-interactions.
Collapse
Affiliation(s)
- Matthias Echternach
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Munich University Hospital (LMU), Marchioninistrasse 15, Munich, 81377, Germany
| | - Christian T Herbst
- Antonio Salieri Department of Vocal Studies and Vocal Research in Music Education, University of Music and Performing Arts Vienna, Vienna, Austria
| | - Marie Köberlein
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Munich University Hospital (LMU), Marchioninistrasse 15, Munich, 81377, Germany
| | - Brad Story
- Department of Speech, Language, and Hearing Sciences, University of Arizona, Tucson, Arizona 85718, USA
| | - Michael Döllinger
- Division of Phoniatrics and Pediatric Audiology at the Department of Otorhinolaryngology Head and Neck Surgery, University Hospital Erlangen, Medical School Waldstrasse 1, Erlangen, 91054, Germany
| | - Donata Gellrich
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Munich University Hospital (LMU), Marchioninistrasse 15, Munich, 81377, Germany
| |
Collapse
|
8
|
Echternach M, Döllinger M, Köberlein M, Kuranova L, Gellrich D, Kainz MA. Vocal fold oscillation pattern changes related to loudness in patients with vocal fold mass lesions. J Otolaryngol Head Neck Surg 2020; 49:80. [PMID: 33228812 PMCID: PMC7686765 DOI: 10.1186/s40463-020-00481-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2020] [Accepted: 11/17/2020] [Indexed: 11/10/2022] Open
Abstract
Introduction Vocal fold mass lesions can affect vocal fold oscillation patterns and therefore voice production. It has been previously observed that perturbation values from audio signals were lower with increased loudness. However, how much the oscillation patterns change with gradual alteration of loudness is not yet fully understood. Material and methods Eight patients with vocal fold mass lesions were asked to perform a glide from minimum to maximum loudness on the vowel /i/, ƒo of 125 Hz for male or 250 Hz for female voices. During phonation the subjects were simultaneously recorded with transnasal high speed videoendoscopy (HSV, 20,000 fps), electroglottography (EGG), and an audio recording. Based on the HSV material the Glottal Area Waveform (GAW) was segmented and GAW parameters were computed. Results The greatest vocal fold irregularities were observed at different values between minimum and maximum sound pressure level. There was a relevant discrepancy between the HSV and EGG derived open quotients. Furthermore, the EGG derived sample entropy and GAW values also evidenced different behavior. Conclusions The amount of vocal fold irregularity changes with varying loudness. Therefore, any evaluation of the voice should be performed under different loudness conditions. The discrepancy between EGG and GAW values appears to be much stronger in patients with vocal fold mass lesions than those with normal physiological conditions. Level of evidence 4.
Collapse
Affiliation(s)
- Matthias Echternach
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology Head & Neck Surgery, Munich University Hospital (LMU), Marchioninistr. 15, 81377, Munich, Germany.
| | - Michael Döllinger
- Division of Phoniatrics and Pediatric Audiology at the Department of Otorhinolaryngology Head & Neck Surgery, University Hospital Erlangen, Medical School, Bohlenplatz 21, 91054, Erlangen, Germany
| | - Marie Köberlein
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology Head & Neck Surgery, Munich University Hospital (LMU), Marchioninistr. 15, 81377, Munich, Germany.,Institute of Musicians' Medicine, Freiburg University Hospital and Faculty of Medicine Freiburg University, Elsässerstr 2m, Freiburg, Germany
| | - Liudmila Kuranova
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology Head & Neck Surgery, Munich University Hospital (LMU), Marchioninistr. 15, 81377, Munich, Germany
| | - Donata Gellrich
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology Head & Neck Surgery, Munich University Hospital (LMU), Marchioninistr. 15, 81377, Munich, Germany
| | - Marie-Anne Kainz
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology Head & Neck Surgery, Munich University Hospital (LMU), Marchioninistr. 15, 81377, Munich, Germany
| |
Collapse
|
9
|
Echternach M, Köberlein M, Gellrich D, Guzman M, Laukkanen AM, Burk F, Döllinger M, Richter B, Kainz MA. Duration of biodynamic changes associated with water resistance therapy. LOGOP PHONIATR VOCO 2020; 46:126-133. [PMID: 32583700 DOI: 10.1080/14015439.2020.1785000] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
Abstract
INTRODUCTION In current voice research, there is a growing interest in semi-occluded vocal tract exercises and their contribution to an increased vocal efficiency. However, there are only few studies on the lasting effects. MATERIAL AND METHODS Eight vocally healthy subjects were asked to sustain a phonation at a comfortable loudness on the vowel /i/, with a Fundamental Frequency of 250 Hz (females) or 125 Hz (males). During phonation the subjects were simultaneously recorded with transnasal High Speed Videoendoscopy (HSV, 20,000fps), electroglottography, and audio signals. The subjects then performed a water resistance therapy (WRT) for 10 min (tube: 30 cm length, 5 cm below water surface, diameter 9 mm). Repeated measurements of sustained phonation were performed 0, 5, 10, 20, and 30 min after exercising. From the HSV material the Glottal Area Waveform (GAW) was segmented and GAW parameters were computed. RESULTS There were strong inter-individual differences concerning the changes of different measures over time after WRT. In general, directly after WRT there was a lowering of the GAW derived Period Perturbation Quotient, a lowering of the Closing Quotient and an increase of the Sound Pressure Level (SPL) in comparison to the pre-intervention measurement. However, only 5 min post WRT there was no longer a clear difference compared to baseline results before WRT. Other values such as Open Quotients exhibited no evident changes by WRT at any timepoint. CONCLUSIONS WRT showed strong inter-individual effects concerning the changes in the evaluated measures post-intervention. General tendencies of some measures directly after the intervention showed a brief effect of only a few minutes.
Collapse
Affiliation(s)
- Matthias Echternach
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Munich University Hospital (LMU), Munich, Germany
| | - Marie Köberlein
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Munich University Hospital (LMU), Munich, Germany.,Institute of Musicians' Medicine, Freiburg University Medical Center, Freiburg, Germany
| | - Donata Gellrich
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Munich University Hospital (LMU), Munich, Germany
| | - Marco Guzman
- Department of Communication Sciences and Disorders, Universidad de los Andes, Chile, Santiago, Chile
| | - Anne Maria Laukkanen
- Faculty of Social Sciences, Speech and Voice Research Laboratory, Tampere University, Tampere, Finland
| | - Fabian Burk
- Department of Otorhinolaryngology, Head- and Neck Surgery, University Hospital of Schleswig-Holstein, Kiel, Germany
| | - Michael Döllinger
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology Head & Neck Surgery, University Hospital Erlangen, Medical School, Erlangen, Germany
| | - Bernhard Richter
- Institute of Musicians' Medicine, Freiburg University Medical Center, Freiburg, Germany
| | - Marie-Anne Kainz
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Munich University Hospital (LMU), Munich, Germany
| |
Collapse
|
10
|
Gómez P, Kist AM, Schlegel P, Berry DA, Chhetri DK, Dürr S, Echternach M, Johnson AM, Kniesburges S, Kunduk M, Maryn Y, Schützenberger A, Verguts M, Döllinger M. BAGLS, a multihospital Benchmark for Automatic Glottis Segmentation. Sci Data 2020; 7:186. [PMID: 32561845 PMCID: PMC7305104 DOI: 10.1038/s41597-020-0526-3] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2019] [Accepted: 05/15/2020] [Indexed: 02/06/2023] Open
Abstract
Laryngeal videoendoscopy is one of the main tools in clinical examinations for voice disorders and voice research. Using high-speed videoendoscopy, it is possible to fully capture the vocal fold oscillations, however, processing the recordings typically involves a time-consuming segmentation of the glottal area by trained experts. Even though automatic methods have been proposed and the task is particularly suited for deep learning methods, there are no public datasets and benchmarks available to compare methods and to allow training of generalizing deep learning models. In an international collaboration of researchers from seven institutions from the EU and USA, we have created BAGLS, a large, multihospital dataset of 59,250 high-speed videoendoscopy frames with individually annotated segmentation masks. The frames are based on 640 recordings of healthy and disordered subjects that were recorded with varying technical equipment by numerous clinicians. The BAGLS dataset will allow an objective comparison of glottis segmentation methods and will enable interested researchers to train their own models and compare their methods.
Collapse
Affiliation(s)
- Pablo Gómez
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Erlangen, Friedrich-Alexander University Erlangen-Nürnberg, Waldstraße 1, 91054, Erlangen, Germany.
| | - Andreas M Kist
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Erlangen, Friedrich-Alexander University Erlangen-Nürnberg, Waldstraße 1, 91054, Erlangen, Germany.
| | - Patrick Schlegel
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Erlangen, Friedrich-Alexander University Erlangen-Nürnberg, Waldstraße 1, 91054, Erlangen, Germany
| | - David A Berry
- Department of Head and Neck Surgery, David Geffen School of Medicine at the University of California, Los Angeles, Los Angeles, California, USA
| | - Dinesh K Chhetri
- Department of Head and Neck Surgery, David Geffen School of Medicine at the University of California, Los Angeles, Los Angeles, California, USA
| | - Stephan Dürr
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Erlangen, Friedrich-Alexander University Erlangen-Nürnberg, Waldstraße 1, 91054, Erlangen, Germany
| | - Matthias Echternach
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Munich University Hospital (LMU), Munich, Germany
| | - Aaron M Johnson
- NYU Voice Center, Department of Otolaryngology - Head and Neck Surgery, New York University School of Medicine, New York, New York, USA
| | - Stefan Kniesburges
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Erlangen, Friedrich-Alexander University Erlangen-Nürnberg, Waldstraße 1, 91054, Erlangen, Germany
| | - Melda Kunduk
- Department of Communication Sciences and Disorders, Louisiana State University, Baton Rouge, Louisiana, USA
| | - Youri Maryn
- European Institute for ORL-HNS, Department of Otorhinolaryngology and Head & Neck Surgery, Sint-Augustinus GZA, Wilrijk, Belgium
- Department of Speech, Language and Hearing sciences, University of Ghent, Ghent, Belgium
- Faculty of Education, Health and Social Work, University College Ghent, Ghent, Belgium
- Faculty of Psychology and Educational Sciences, School of Logopedics, Université Catholique de Louvain, Louvain-la-Neuve, Belgium
- Faculty of Medicine and Health Sciences, University of Antwerp, Antwerp, Belgium
| | - Anne Schützenberger
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Erlangen, Friedrich-Alexander University Erlangen-Nürnberg, Waldstraße 1, 91054, Erlangen, Germany
| | - Monique Verguts
- European Institute for ORL-HNS, Department of Otorhinolaryngology and Head & Neck Surgery, Sint-Augustinus GZA, Wilrijk, Belgium
- Department of Otorhinolaryngology and Voice Disorders, Diest General Hospital, Diest, Belgium
| | - Michael Döllinger
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Erlangen, Friedrich-Alexander University Erlangen-Nürnberg, Waldstraße 1, 91054, Erlangen, Germany
| |
Collapse
|
11
|
Immediate effects of water resistance therapy on patients with vocal fold mass lesions. Eur Arch Otorhinolaryngol 2020; 277:1995-2003. [PMID: 32172385 PMCID: PMC7286853 DOI: 10.1007/s00405-020-05887-y] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2020] [Accepted: 02/24/2020] [Indexed: 11/29/2022]
Abstract
Introduction Semi-occluded vocal tract exercises, such as water resistance therapy (WRT), are widely used in voice therapy. However, the potential positive effects of such a therapy on vocal fold oscillation patterns in patients indicating a need for phonomicrosurgery have not yet been explored. The presented study aims to analyze the effect of WRT in patients suffering from vocal fold mass lesions. Materials and methods Eight participants with vocal fold mass lesions were asked to sustain a phonation on the vowel /i/ at a comfortable loudness and a fundamental frequency of 250 Hz (females) or 125 Hz (males). During phonation the subjects were simultaneously recorded with transnasal high-speed videoendoscopy (HSV, 20.000 fps), electroglottography, and audio signals. These subjects then performed a WRT (phonation in a silicone tube of 30 cm length, 5 cm below the water surface) for 10 min. Repeated measurements of sustained phonation were performed 0, 10, and 30 min after exercising. From the HSV data the glottal area waveform (GAW) was segmented and GAW parameters were computed. Results During WRT there was an increase of the GAW related open quotient and closing quotient. Immediately after WRT, there was a drop of both values followed by a rise of these parameters up to 30 min after the intervention. Furthermore, there was no correlation between GAW and electroglottographical open quotients. Conclusions The effects observed after a single session of WRT on participants with vocal fold mass lesions showed a similar pattern to vocal fatigue.
Collapse
|
12
|
Echternach M, Huseynov J, Döllinger M, Nusseck M, Richter B. The impact of a standardized vocal loading test on vocal fold oscillations. Eur Arch Otorhinolaryngol 2020; 277:1699-1705. [PMID: 32107614 PMCID: PMC7198646 DOI: 10.1007/s00405-020-05791-5] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2019] [Accepted: 01/07/2020] [Indexed: 12/04/2022]
Abstract
Introduction Vocal loading capacity is an important aspect of vocal health and is measured using standardized vocal loading tests. However, it remains unclear how vocal fold oscillation patterns are influenced by a standardized vocal loading task. Methods 21 (10 male, 11 female) vocally healthy subjects were analyzed concerning the dysphonia severity index (DSI) and high speed videolaryngoscopy (HSV) on the vowel /i/ at a comfortable pitch and loudness before and after a standardized vocal loading test (10 min standardized text reading, at a level higher than 80 dB (A) measured at 30 cm from the mouth). Results Changes in DSI were statistically significant, diminishing by 1.2 points after the vocal loading test, which was mainly caused by an increase of the minimum intensity. However, the pre-post comparison of HSV derived measures failed to show any statistically significant changes. Conclusion It seems necessary to analyze the effects of a standardized vocal loading test on vocal fold oscillation patterns with respect to softest phonation and phonation threshold pressure rather than comfortable pitch and loudness. Level of evidence 2c
Collapse
Affiliation(s)
- Matthias Echternach
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Munich University Hospital (LMU), Campus Großhadern, Marchioninistraße 15, 81377, Munich, Germany.
| | - Jamal Huseynov
- Freiburg Institute of Musicians' Medicine, Freiburg University, Freiburg, Germany
- Department of Otolaryngology/Head and Neck Surgery, Clinics of Villingen-Schwenningen, Villingen Schwenningen, Germany
| | - Michael Döllinger
- Division of Phoniatrics and Pediatric Audiology, Department of Otolaryngology Head and Neck Surgery, University Hospital Erlangen, Friedrich-Alexander-University Erlangen-Nürnberg, Erlangen, Germany
| | - Manfred Nusseck
- Freiburg Institute of Musicians' Medicine, Freiburg University, Freiburg, Germany
| | - Bernhard Richter
- Freiburg Institute of Musicians' Medicine, Freiburg University, Freiburg, Germany
- Medical Faculty, Freiburg University, Freiburg, Germany
| |
Collapse
|
13
|
Echternach M, Högerle C, Köberlein M, Schlegel P, Döllinger M, Richter B, Kainz MA. The Effect of Nasalance on Vocal Fold Oscillation Patterns During the Male Passaggio. J Voice 2019; 35:500.e9-500.e16. [PMID: 31668917 DOI: 10.1016/j.jvoice.2019.09.013] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2019] [Revised: 09/23/2019] [Accepted: 09/23/2019] [Indexed: 10/25/2022]
Abstract
INTRODUCTION It is generally assumed that when singing across the region where registration events for untrained voices occur (the passaggio), singers modify the voice production system in order to avoid changes of voice quality. In this context, it has been postulated that nasalance could be used to stabilize vocal function throughout the passaggio. However, whether nasalance is frequently used by professional singers and if so, if it has a stabilizing effect on vocal fold oscillation patterns, is not yet fully understood. MATERIAL AND METHODS Eight western classically trained professional male singers (6 tenors and 2 baritones) were asked to perform transitions (1) from modal to falsetto register and (2) from modal to stage voice above the passaggio (SVaP) during ascending pitch glides from A3 (ƒo approx. 220 Hz) to A4 (ƒo approx. 440 Hz) on the vowel /i/. Transnasal high-speed videoendoscopy at 20.000 fps was captured simultaneously with electroglottographic, nasal and oral flow, and audio signals, recorded using the same frame rate. The nasalance was calculated from both oral and nasal DC-flow signals. RESULTS Transitions to SVaP showed greater periodicity and regularity than transitions to falsetto. For 5 subjects, nasalance was increased during the passaggio for the transition to SVaP. For 4 subjects the increase of nasalance for the SVaP was associated with a stabilization of the open quotient and occurred at a comparable fundamental frequency as the increase of the open quotient for the transition to falsetto. CONCLUSIONS Nasalance can be used in order to stabilize oscillatory regularity and open quotient in male singers for singing across the passaggio.
Collapse
Affiliation(s)
- Matthias Echternach
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Munich University Hospital (LMU), Munich, Germany
| | - Catalina Högerle
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Munich University Hospital (LMU), Munich, Germany
| | - Marie Köberlein
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Munich University Hospital (LMU), Munich, Germany; Institute of Musicians' Medicine, Freiburg University Medical Center and Medical Faculty, Freiburg University, Freiburg, Germany
| | - Patrick Schlegel
- Division of Phoniatrics and Pediatric Audiology at the Department of Otorhinolaryngology Head & Neck Surgery, University Hospital Erlangen, Friedrich-Alexander University Erlangen-Nürnberg, Erlangen, Germany
| | - Michael Döllinger
- Division of Phoniatrics and Pediatric Audiology at the Department of Otorhinolaryngology Head & Neck Surgery, University Hospital Erlangen, Friedrich-Alexander University Erlangen-Nürnberg, Erlangen, Germany
| | - Bernhard Richter
- Institute of Musicians' Medicine, Freiburg University Medical Center and Medical Faculty, Freiburg University, Freiburg, Germany
| | - Marie-Anne Kainz
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Munich University Hospital (LMU), Munich, Germany
| |
Collapse
|
14
|
Pizzorni N, Schindler A, Sozzi M, Corbo M, Gilardone M. The Vocal Score Profile in Verdi's Characters. J Voice 2019; 33:805.e13-805.e20. [DOI: 10.1016/j.jvoice.2018.03.013] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2017] [Accepted: 03/16/2018] [Indexed: 10/28/2022]
|
15
|
Selamtzis A, Ternström S, Richter B, Burk F, Köberlein M, Echternach M. A comparison of electroglottographic and glottal area waveforms for phonation type differentiation in male professional singers. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2018; 144:3275. [PMID: 30599695 DOI: 10.1121/1.5066456] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/01/2017] [Accepted: 10/14/2018] [Indexed: 06/09/2023]
Abstract
This study compares the use of electroglottograms (EGGs) and glottal area waveforms (GAWs) to study phonation in different vibratory states as produced by professionally trained singers. Six western classical tenors were asked to phonate pitch glides from modal to falsetto phonation, or from modal to their stage voice above the passaggio (SVaP). For each pitch glide the sample entropy (SampEn) of the EGG signal was calculated to detect the occurrence of phonatory instabilities and establish a "ground truth" for the performed phonation type. The cycles before the maximum SampEn were labeled as modal, and the cycles after the peak were labeled as either falsetto, or SVaP. Three automatic categorizations of vibratory state were performed using clustering: one based only on the EGG, one based on the GAW, and one based on their combination. The error rate (clustering vs ground truth) was, on average, lower than 10% for all of the three settings, revealing no special advantage of the GAW over EGG, and vice versa. Modal voice cycles exhibited a larger contact quotient, larger normalized derivative peak ratio, and lower rise time, compared to SVaP and falsetto. The GAW-based normalized maximum area declination rate was larger in SVaP compared to modal voice.
Collapse
Affiliation(s)
- Andreas Selamtzis
- Department of Speech, Music and Hearing, School of Electrical Engineering and Computer Science, KTH Royal Institute of Technology, Lindstedtsvägen 24, Breisacherstraße 60, Stockholm, SE-100 44, Sweden
| | - Sten Ternström
- Department of Speech, Music and Hearing, School of Electrical Engineering and Computer Science, KTH Royal Institute of Technology, Lindstedtsvägen 24, Breisacherstraße 60, Stockholm, SE-100 44, Sweden
| | - Bernard Richter
- Institute of Musicians' Medicine, Freiburg University Medical Center, Breisacher Strausse 60, Freiburg, 79106, Germany
| | - Fabian Burk
- Department of Otorhinolaryngology, University Medical Center Schleswig-Holstein, Arnold-Heller-Straße 3, 24105 Kiel, Germany
| | - Marie Köberlein
- Institute of Musicians' Medicine, Freiburg University Medical Center, Breisacher Strausse 60, Freiburg, 79106, Germany
| | - Matthias Echternach
- Department of Otorhinolaryngology, Division of Phoniatrics and Pediatric Audiology, Munich University, Campus Großhadern, Marchioninistraße 15, Munich, 81377, Germany
| |
Collapse
|