1
|
Iob NA, He L, Ternström S, Cai H, Brockmann-Bauser M. Effects of Speech Characteristics on Electroglottographic and Instrumental Acoustic Voice Analysis Metrics in Women With Structural Dysphonia Before and After Treatment. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2024; 67:1660-1681. [PMID: 38758676 DOI: 10.1044/2024_jslhr-23-00253] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/19/2024]
Abstract
PURPOSE Literature suggests a dependency of the acoustic metrics, smoothed cepstral peak prominence (CPPS) and harmonics-to-noise ratio (HNR), on human voice loudness and fundamental frequency (F0). Even though this has been explained with different oscillatory patterns of the vocal folds, so far, it has not been specifically investigated. In the present work, the influence of three elicitation levels, calibrated sound pressure level (SPL), F0 and vowel on the electroglottographic (EGG) and time-differentiated EGG (dEGG) metrics hybrid open quotient (OQ), dEGG OQ and peak dEGG, as well as on the acoustic metrics CPPS and HNR, was examined, and their suitability for voice assessment was evaluated. METHOD In a retrospective study, 29 women with a mean age of 25 years (± 8.9, range: 18-53) diagnosed with structural vocal fold pathologies were examined before and after voice therapy or phonosurgery. Both acoustic and EGG signals were recorded simultaneously during the phonation of the sustained vowels /ɑ/, /i/, and /u/ at three elicited levels of loudness (soft/comfortable/loud) and unconstrained F0 conditions. RESULTS A linear mixed-model analysis showed a significant effect of elicitation effort levels on peak dEGG, HNR, and CPPS (all p < .01). Calibrated SPL significantly influenced HNR and CPPS (both p < .01). Furthermore, F0 had a significant effect on peak dEGG and CPPS (p < .0001). All metrics showed significant changes with regard to vowel (all p < .05). However, the treatment had no effect on the examined metrics, regardless of the treatment type (surgery vs. voice therapy). CONCLUSIONS The value of the investigated metrics for voice assessment purposes when sampled without sufficient control of SPL and F0 is limited, in that they are significantly influenced by the phonatory context, be it speech or elicited sustained vowels. Future studies should explore the diagnostic value of new data collation approaches such as voice mapping, which take SPL and F0 effects into account.
Collapse
Affiliation(s)
- Naomi Anna Iob
- Division of Phoniatrics and Speech Pathology, Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Zurich, University of Zurich, Switzerland
| | - Lei He
- Division of Phoniatrics and Speech Pathology, Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Zurich, University of Zurich, Switzerland
- Department of Computational Linguistics, University of Zurich, Switzerland
| | - Sten Ternström
- Division of Speech, Music and Hearing, School of Electrical Engineering and Computer Science, KTH Royal Institute of Technology, Stockholm, Sweden
| | - Huanchen Cai
- Division of Speech, Music and Hearing, School of Electrical Engineering and Computer Science, KTH Royal Institute of Technology, Stockholm, Sweden
| | - Meike Brockmann-Bauser
- Division of Phoniatrics and Speech Pathology, Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Zurich, University of Zurich, Switzerland
| |
Collapse
|
2
|
Cai H, Ternström S, Chaffanjon P, Henrich Bernardoni N. Effects on Voice Quality of Thyroidectomy: A Qualitative and Quantitative Study Using Voice Maps. J Voice 2024:S0892-1997(24)00082-1. [PMID: 38714436 DOI: 10.1016/j.jvoice.2024.03.012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2023] [Revised: 03/11/2024] [Accepted: 03/12/2024] [Indexed: 05/09/2024]
Abstract
OBJECTIVES This study aims to explore the effects of thyroidectomy-a surgical intervention involving the removal of the thyroid gland-on voice quality, as represented by acoustic and electroglottographic measures. Given the thyroid gland's proximity to the inferior and superior laryngeal nerves, thyroidectomy carries a potential risk of affecting vocal function. While earlier studies have documented effects on the voice range, few studies have looked at voice quality after thyroidectomy. Since voice quality effects could manifest in many ways, that a priori are unknown, we wish to apply an exploratory approach that collects many data points from several metrics. METHODS A voice-mapping analysis paradigm was applied retrospectively on a corpus of spoken and sung sentences produced by patients who had thyroid surgery. Voice quality changes were assessed objectively for 57 patients prior to surgery and 2months after surgery, by making comparative voice maps, pre- and post-intervention, of six acoustic and electroglottographic (EGG) metrics. RESULTS After thyroidectomy, statistically significant changes consistent with a worsening of voice quality were observed in most metrics. For all individual metrics, however, the effect sizes were too small to be clinically relevant. Statistical clustering of the metrics helped to clarify the nature of these changes. While partial thyroidectomy demonstrated greater uniformity than did total thyroidectomy, the type of perioperative damage had no discernible impact on voice quality. CONCLUSIONS Changes in voice quality after thyroidectomy were related mostly to increased phonatory instability in both the acoustic and EGG metrics. Clustered voice metrics exhibited a higher correlation to voice complaints than did individual voice metrics.
Collapse
Affiliation(s)
- Huanchen Cai
- Division of Speech, Music and Hearing, KTH Royal Institute of Technology, Stockholm, Sweden.
| | - Sten Ternström
- Division of Speech, Music and Hearing, KTH Royal Institute of Technology, Stockholm, Sweden
| | - Philippe Chaffanjon
- University of Grenoble Alpes, CNRS, Grenoble INP, GIPSA-lab, Grenoble, France; Medical School, Université Grenoble Alpes, Grenoble, France
| | | |
Collapse
|
3
|
Traser L, Schwab C, Burk F, Özen AC, Bock M, Richter B, Echternach M. Differences of respiratory kinematics in female and male singers - A comparative study using dynamic magnetic resonance imaging. Front Psychol 2022; 13:844032. [PMID: 36544443 PMCID: PMC9760878 DOI: 10.3389/fpsyg.2022.844032] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2021] [Accepted: 11/16/2022] [Indexed: 12/12/2022] Open
Abstract
Breath control is an important factor for singing voice production, but pedagogic descriptions of how a beneficial movement pattern should be performed vary widely and the underlying physiological processes are not understood in detail. Differences in respiratory movements during singing might be related to the sex of the singer. To study sex-related differences in respiratory kinematics during phonation, 12 singers (six male and six female) trained in the Western classical singing tradition were imaged with dynamic magnetic resonance imaging. Singers were asked to sustain phonation at five different pitches and loudness conditions, and cross-sectional images of the lung were acquired. In each dynamic image frame the distances between anatomical landmarks were measured to quantify the movements of the respiratory apparatus. No major difference between male and female singers was found for the general respiratory kinematics of the thorax and the diaphragm during sustained phonation. However when compared to sole breathing, male singers significantly increased their thoracic movements for singing. This behavior could not be observed in female singers. The presented data support the hypothesis that professional singers follow sex-specific breathing strategies. This finding may be important in a pedagogical context where the biological sex of singer and student differ and should be further investigated in a larger cohort.
Collapse
Affiliation(s)
- Louisa Traser
- Institute of Musicians’ Medicine, Faculty of Medicine, Medical Center – University of Freiburg, Freiburg, Germany,Faculty of Medicine, University of Freiburg, Freiburg, Germany,*Correspondence: Louisa Traser,
| | - Carmen Schwab
- Faculty of Medicine, University of Freiburg, Freiburg, Germany,Department of Prosthetic Dentistry, Center for Dental Medicine, Faculty of Medicine, Medical Center – University of Freiburg, Freiburg, Germany
| | - Fabian Burk
- Institute of Musicians’ Medicine, Faculty of Medicine, Medical Center – University of Freiburg, Freiburg, Germany,Department of Phoniatrics and Pediatric Audiology, University Medical Center Münster, Münster, Germany
| | - Ali Caglar Özen
- Faculty of Medicine, University of Freiburg, Freiburg, Germany,Department of Radiology, Medical Physics, Medical Center – University of Freiburg, Faculty of Medicine, University of Freiburg, Freiburg, Germany
| | - Michael Bock
- Faculty of Medicine, University of Freiburg, Freiburg, Germany,Department of Radiology, Medical Physics, Medical Center – University of Freiburg, Faculty of Medicine, University of Freiburg, Freiburg, Germany
| | - Bernhard Richter
- Institute of Musicians’ Medicine, Faculty of Medicine, Medical Center – University of Freiburg, Freiburg, Germany,Faculty of Medicine, University of Freiburg, Freiburg, Germany
| | - Matthias Echternach
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Munich University Hospital (LMU), Munich, Germany
| |
Collapse
|
4
|
Cunsolo F, Ottaviani V, Capobianco S, Calcinoni O, Dellacà RL. Simultaneous monitoring of vocal doses and breathing patterns in professional singers. Comput Biol Med 2022; 144:105352. [DOI: 10.1016/j.compbiomed.2022.105352] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2021] [Revised: 02/22/2022] [Accepted: 02/23/2022] [Indexed: 11/28/2022]
|
5
|
Angelakis E, Kotsani N, Georgaki A. Towards a Singing Voice Multi-Sensor Analysis Tool: System Design, and Assessment Based on Vocal Breathiness. SENSORS 2021; 21:s21238006. [PMID: 34884019 PMCID: PMC8659512 DOI: 10.3390/s21238006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/01/2021] [Revised: 11/14/2021] [Accepted: 11/19/2021] [Indexed: 11/16/2022]
Abstract
Singing voice is a human quality that requires the precise coordination of numerous kinetic functions and results in a perceptually variable auditory outcome. The use of multi-sensor systems can facilitate the study of correlations between the vocal mechanism kinetic functions and the voice output. This is directly relevant to vocal education, rehabilitation, and prevention of vocal health issues in educators; professionals; and students of singing, music, and acting. In this work, we present the initial design of a modular multi-sensor system for singing voice analysis, and describe its first assessment experiment on the ‘vocal breathiness’ qualitative characteristic. A system case study with two professional singers was conducted, utilizing signals from four sensors. Participants sung a protocol of vocal trials in various degrees of intended vocal breathiness. Their (i) vocal output, (ii) phonatory function, and (iii) respiratory behavior-per-condition were recorded through a condenser microphone (CM), an Electroglottograph (EGG), and thoracic and abdominal respiratory effort transducers (RET), respectively. Participants’ individual respiratory management strategies were studied through qualitative analysis of RET data. Microphone audio samples breathiness degree was rated perceptually, and correlation analysis was performed between sample ratings and parameters extracted from CM and EGG data. Smoothed Cepstral Peak Prominence (CPPS) and vocal folds’ Open Quotient (OQ), as computed with the Howard method (HOQ), demonstrated the higher correlation coefficients, when analyzed individually. DECOM method-computed OQ (DOQ) was also examined. Interestingly, the correlation coefficient of pitch difference between estimates from CM and EGG signals appeared to be (based on the Pearson correlation coefficient) statistically insignificant (a result that warrants investigation in larger populations). The study of multi-variate models revealed even higher correlation coefficients. Models studied were the Acoustic Breathiness Index (ABI) and the proposed multiple regression model CDH (CPPS, DOQ, and HOQ), which was attempted in order to combine analysis results from microphone and EGG signals. The model combination of ABI and the proposed CDH appeared to yield the highest correlation with perceptual breathiness ratings. Study results suggest potential for the use of a completed system version in vocal pedagogy and research, as the case study indicated system practicality, a number of pertinent correlations, and introduced topics with further research possibilities.
Collapse
|
6
|
The influence of gravity on respiratory kinematics during phonation measured by dynamic magnetic resonance imaging. Sci Rep 2021; 11:22965. [PMID: 34824315 PMCID: PMC8617256 DOI: 10.1038/s41598-021-02152-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2021] [Accepted: 11/03/2021] [Indexed: 11/08/2022] Open
Abstract
Respiratory kinematics are important for the regulation of voice production. Dynamic MRI is an excellent tool to study respiratory motion providing high-resolution cross-sectional images. Unfortunately, in clinical MRI systems images can only be acquired in a horizontal subject position, which does not take into account gravitational effects on the respiratory apparatus. To study the effect of body posture on respiratory kinematics during phonation, 8 singers were examined both in an open-configuration MRI with a rotatable gantry and a conventional horizontal MRI system. During dynamic MRI the subjects sang sustained tones at different pitches in both supine and upright body positions. Sagittal images of the respiratory system were obtained at 1-3 images per second, from which 6 anatomically defined distances were extracted to characterize its movements in the anterior, medium and posterior section of the diaphragm as well as the rip cage (diameter at the height of the 3rd and 5th rip) and the anterior-posterior position of the diaphragm cupola. Regardless of body position, singers maintained their general principles of respiratory kinematics with combined diaphragm and thorax muscle activation for breath support. This was achieved by expanding their chest an additional 20% during inspiration when singing in the supine position but not for sole breathing. The diaphragm was cranially displaced in supine position for both singing and breathing and its motion range increased. These results facilitate a more realistic extrapolation of research data obtained in a supine position.
Collapse
|
7
|
Patel RR, Ternström S. Quantitative and Qualitative Electroglottographic Wave Shape Differences in Children and Adults Using Voice Map-Based Analysis. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2021; 64:2977-2995. [PMID: 34319772 DOI: 10.1044/2021_jslhr-20-00717] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/27/2023]
Abstract
Purpose The purpose of this study is to identify the extent to which various measurements of contacting parameters differ between children and adults during habitual range and overlap vocal frequency/intensity, using voice map-based assessment of noninvasive electroglottography (EGG). Method EGG voice maps were analyzed from 26 adults (22-45 years) and 22 children (4-8 years) during connected speech and vowel /a/ over the habitual range and the overlap vocal frequency/intensity from the voice range profile task on the vowel /a/. Mean and standard deviations of contact quotient by integration, normalized contacting speed, quotient of speed by integration, and cycle-rate sample entropy were obtained. Group differences were evaluated using the linear mixed model analysis for the habitual range connected speech and the vowel, whereas analysis of covariance was conducted for the overlap vocal frequency/intensity from the voice range profile task. Presence of a "knee" on the EGG wave shape was determined by visual inspection of the presence of convexity along the decontacting slope of the EGG pulse and the presence of the second derivative zero-crossing. Results The contact quotient by integration, normalized contacting speed, quotient of speed by integration, and cycle-rate sample entropy were significantly different in children compared to (a) adult males for habitual range and (b) adult males and adult females for the overlap vocal frequency/intensity. None of the children had a "knee" on the decontacting slope of the EGG slope. Conclusion EGG parameters of contact quotient by integration, normalized contacting speed, quotient of speed by integration, cycle-rate sample entropy, and absence of a "knee" on the decontacting slope characterize the wave shape differences between children and adults, whereas the normalized contacting speed, quotient of speed by integration, cycle-rate sample entropy, and presence of a "knee" on the downward pulse slope characterize the wave shape differences between adult males and adult females. Supplemental Material https://doi.org/10.23641/asha.15057345.
Collapse
Affiliation(s)
- Rita R Patel
- Department of Speech, Language and Hearing Sciences, Indiana University Bloomington
| | - Sten Ternström
- Division of Speech, Music, and Hearing, School of Electrical Engineering and Computer Science, KTH Royal Institute of Technology, Stockholm, Sweden
| |
Collapse
|
8
|
Lã FM, Ternström S. Flow ball-assisted voice training: Immediate effects on vocal fold contacting. Biomed Signal Process Control 2020. [DOI: 10.1016/j.bspc.2020.102064] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
9
|
Ternström S. Normalized time-domain parameters for electroglottographic waveforms. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2019; 146:EL65. [PMID: 31370590 DOI: 10.1121/1.5117174] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/16/2019] [Accepted: 06/28/2019] [Indexed: 06/10/2023]
Abstract
The electroglottographic waveform is of interest for characterizing phonation non-invasively. Existing parameterizations tend to give disparate results because they rely on somewhat arbitrary thresholds and/or contacting events. It is shown that neither are needed for formulating a normalized contact quotient and a normalized peak derivative. A heuristic combination of the two resolves also the ambiguity of a moderate contact quotient, with regard to vocal fold contacting being firm versus weak or absent. As preliminaries, schemes for electroglottography signal preconditioning and time-domain period detection are described that improve somewhat on similar methods. The algorithms are simple and compute quickly.
Collapse
Affiliation(s)
- Sten Ternström
- Department of Speech, Music and Hearing, School of Electrical Engineering and Computer Science, KTH Royal Institute of Technology, Stockholm,
| |
Collapse
|