1
|
Tomaszewska JZ, Georgakis A. Electroglottography in Medical Diagnostics of Vocal Tract Pathologies: A Systematic Review. J Voice 2023:S0892-1997(23)00388-0. [PMID: 38143204 DOI: 10.1016/j.jvoice.2023.12.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2023] [Revised: 12/02/2023] [Accepted: 12/04/2023] [Indexed: 12/26/2023]
Abstract
Electroglottography (EGG) is a technology developed for measuring the vocal fold contact area during human voice production. Although considered subjective and unreliable as a sole diagnostic method, with the correct application of relevant computational methods, it can constitute a most promising non-invasive voice disorder diagnostic tools in a form of a digital vocal tract pathology classifier. The aim of the following study is to gather and evaluate currently existing digital voice quality assessment systems and vocal tract abnormality classification systems that rely on the use of electroglottographic bio-impedance signals. To fully comprehend the findings of this review, first the subject of EGG is introduced. For that, we summarise most relevant existing research on EGG with a particular focus on its application in diagnostics. Then, we move on to the focal point of this work, which is describing and comparing the existing EGG-based digital voice pathology classification systems. With the application of PRISMA model, 13 articles were chosen and analysed in detail. Direct comparison between chosen studies brought us to pivotal conclusions, which have been described in Section 5 of this report. Meanwhile, certain limitations arising from the literature were identified, such as questionable understanding of the nature of EGG bio-impedance signals. The appropriate recommendations for future work were made, including the application of different methods for EGG feature extraction, as well as the need for continuous EGG datasets development containing signals gathered in various conditions and with different equipments.
Collapse
|
2
|
Codino J, Jackson-Menaldi MC, Rubin A, Torres ME. Automated Quantification of Inflection Events in The Electroglottographic Signal. J Voice 2023; 37:640-647. [PMID: 34162494 DOI: 10.1016/j.jvoice.2021.05.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2021] [Revised: 05/01/2021] [Accepted: 05/10/2021] [Indexed: 11/26/2022]
Affiliation(s)
- Juliana Codino
- Lakeshore Professional Voice Center, Lakeshore Ear, Nose and Throat Center, MI, USA
| | - María Cristina Jackson-Menaldi
- Laboratorio de Señales y Dinámicas no Lineales, Facultad de Ingeniería, Universidad Nacional de Entre Ríos, Argentina, National Council for Scientific and Technical Research (CONICET), Argentina
| | - Adam Rubin
- Laboratorio de Señales y Dinámicas no Lineales, Facultad de Ingeniería, Universidad Nacional de Entre Ríos, Argentina, National Council for Scientific and Technical Research (CONICET), Argentina
| | - María Eugenia Torres
- Laboratorio de Señales y Dinámicas no Lineales, Facultad de Ingeniería, Universidad Nacional de Entre Ríos, Argentina, National Council for Scientific and Technical Research (CONICET), Argentina
| |
Collapse
|
3
|
Yi S, Yang H. Update on the pediatric adverse vocal behavior voice disorders: a clinical practice review. Eur J Pediatr 2023:10.1007/s00431-023-04879-4. [PMID: 36973568 DOI: 10.1007/s00431-023-04879-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/26/2022] [Revised: 01/16/2023] [Accepted: 02/14/2023] [Indexed: 03/29/2023]
Abstract
The main causes of voice disorders in children with adverse vocal behavior include benign lesions of the vocal folds caused by voice abuse or misuses, such as vocal fold nodules, vocal fold polyps, and laryngitis. Long-term voice disorders can affect the physical and mental health of children. Reviewing the literature of the last two decades on "Vocal Fold Nodules," "Vocal Fold Polyp," "Voice disorder," "Voice Abuse," "Voice Misuse," "Pediatrics," and "Children" with the appropriate Boolean operators. Conclusion: A total of 315 results were returned on an initial PubMed search. All articles from 2000 to 2022 written in English or Chinese were screened. Duplicate articles, those relating to adults only or concerned with the malignant lesion of the vocal cord, were excluded, resulting in 196 articles of interest. Relevant references and books have also been consulted, and we provide a review of the pathogenesis, diagnosis, and treatment of these maladaptive vocal behavioral voice disorders. What is Known: • Hoarseness is the most common voice symptom in children, and there are various causes of hoarseness in children. However, there is a lack of reviews on voice disorders caused by adverse vocal habits in children. • Voice training is a conservative treatment method for children with voice disorders , and it is important to clarify the factors that influence the effectiveness of voice training for children. What is New: • This review of the personality and family characteristics of children with adverse vocal behavioural voice disorders provides a valuable guide to the clinical planning of subsequent treatment. • This article discusses and summarises some of the factors that may influence the effectiveness of voice training in children and collates some of the scales and questionnaires currently used in children that are important in predicting the effectiveness of voice training.
Collapse
Affiliation(s)
- Sixi Yi
- Department of Otolaryngology Head and Neck Surgery, West China Hospital, Sichuan University, Chengdu, 610044, China
| | - Hui Yang
- Department of Otolaryngology Head and Neck Surgery, West China Hospital, Sichuan University, Chengdu, 610044, China.
| |
Collapse
|
4
|
Objective Measures of Two Musical Interpretations of an Excerpt From Berlioz's "La mort d'Ophélie". J Voice 2023; 37:301.e9-301.e25. [PMID: 33589372 DOI: 10.1016/j.jvoice.2020.12.045] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2020] [Revised: 12/15/2020] [Accepted: 12/21/2020] [Indexed: 11/21/2022]
Abstract
OBJECTIVE/HYPOTHESIS This study aimed to determine objective production differences relative to two emotional interpretations in performing an excerpt from a classical art song. The null hypothesis was proposed. METHODS The first author recorded an excerpt from an art song. The excerpt was sung with two contrasting musical interpretations: an "empathetic legato" approach, and a "sarcastic" approach characterized by emphatic attacks. Microphone, airflow, and electroglottography signals were digitized. The vowels were analyzed in terms of intensity, long term average spectra, fundamental frequency (fo), airflow vibrato rate and extent, vowel onset slope, intensity comparison of harmonic frequencies, and glottal measures based on electroglottograph waveforms. Four consonant tokens were analyzed relative to airflow, voice onset time, and production duration. RESULTS & CONCLUSIONS The emphatic performance had faster vowel onset, increased glottal adduction, increased intensity of harmonics in 2-3 kHz, increased intensity in the fourth and fifth formants, inferred subglottal pressure increase, increased airflow for /f/, and greater aspiration airflow for /p, t/. Vibrato extents for intensity, fo, and airflow were wider in the emphatic approach. Findings revealed larger EGGW25 and peak-to-peak amplitude values of the electroglottography waveform, suggesting greater vocal fold contact area and longer glottal closure for the emphatic approach. Long-term average spectrum analyses of the entire production displayed minor variation across all formant frequencies, suggesting an insignificant change in vocal tract shaping between the two approaches. This single-case objective study emphasizes the reality of physiological, aerodynamic, and acoustic production differences in the interpretive and pedagogical aspects of art song performance.
Collapse
|
5
|
Fleischer M, Rummel S, Stritt F, Fischer J, Bock M, Echternach M, Richter B, Traser L. Voice efficiency for different voice qualities combining experimentally derived sound signals and numerical modeling of the vocal tract. Front Physiol 2022; 13:1081622. [PMID: 36620215 PMCID: PMC9822708 DOI: 10.3389/fphys.2022.1081622] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2022] [Accepted: 12/01/2022] [Indexed: 12/24/2022] Open
Abstract
Purpose: Concerning voice efficiency considerations of different singing styles, from western classical singing to contemporary commercial music, only limited data is available to date. This single-subject study attempts to quantify the acoustic sound intensity within the human glottis depending on different vocal tract configurations and vocal fold vibration. Methods: Combining Finite-Element-Models derived from 3D-MRI data, audio recordings, and electroglottography (EGG) we analyzed vocal tract transfer functions, particle velocity and acoustic pressure at the glottis, and EGG-related quantities to evaluate voice efficiency at the glottal level and resonance characteristics of different voice qualities according to Estill Voice Training®. Results: Voice qualities Opera and Belting represent highly efficient strategies but apply different vowel strategies and should thus be capable of predominate orchestral sounds. Twang and Belting use similar vowels, but the twang vocal tract configuration enabled the occurrence of anti-resonances and was associated with reduced vocal fold contact but still partially comparable energy transfer from the glottis to the vocal tract. Speech was associated with highly efficient glottal to vocal tract energy transfer, but with the absence of psychoactive strategies makes it more susceptible to noise interference. Falsetto and Sobbing apply less efficiently. Falsetto mainly due to its voice source characteristics, Sobbing due to energy loss in the vocal tract. Thus technical amplification might be appropriate here. Conclusion: Differences exist between voice qualities regarding the sound intensity, caused by different vocal tract morphologies and oscillation characteristics of the vocal folds. The combination of numerical analysis of geometries inside the human body and experimentally determined data outside sheds light on acoustical quantities at the glottal level.
Collapse
Affiliation(s)
- Mario Fleischer
- Department of Audiology and Phoniatrics, Charité—Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Berlin, Germany,*Correspondence: Mario Fleischer,
| | | | - Fiona Stritt
- Medical Center, Institute of Musicians’ Medicine, University of Freiburg, Freiburg, Germany,Faculty of Medicine, University of Freiburg, Freiburg, Germany
| | - Johannes Fischer
- Faculty of Medicine, University of Freiburg, Freiburg, Germany,Medical Center, Department of Radiology, Medical Physics, University of Freiburg, Freiburg, Germany
| | - Michael Bock
- Faculty of Medicine, University of Freiburg, Freiburg, Germany,Medical Center, Department of Radiology, Medical Physics, University of Freiburg, Freiburg, Germany
| | - Matthias Echternach
- Department of Otorhinolaryngology, Ludwig-Maximilians-Universität München, Division of Phoniatrics and Pediatric Audiology, LMU Klinikum, Munich, Germany
| | - Bernhard Richter
- Medical Center, Institute of Musicians’ Medicine, University of Freiburg, Freiburg, Germany,Faculty of Medicine, University of Freiburg, Freiburg, Germany
| | - Louisa Traser
- Medical Center, Institute of Musicians’ Medicine, University of Freiburg, Freiburg, Germany,Faculty of Medicine, University of Freiburg, Freiburg, Germany
| |
Collapse
|
6
|
Fischer J, Özen AC, Ilbey S, Traser L, Echternach M, Richter B, Bock M. Sub-millisecond 2D MRI of the vocal fold oscillation using single-point imaging with rapid encoding. MAGNETIC RESONANCE MATERIALS IN PHYSICS BIOLOGY AND MEDICINE 2021; 35:301-310. [PMID: 34542771 PMCID: PMC8995286 DOI: 10.1007/s10334-021-00959-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/05/2021] [Revised: 08/06/2021] [Accepted: 09/06/2021] [Indexed: 10/24/2022]
Abstract
OBJECTIVE The slow spatial encoding of MRI has precluded its application to rapid physiologic motion in the past. The purpose of this study is to introduce a new fast acquisition method and to demonstrate feasibility of encoding rapid two-dimensional motion of human vocal folds with sub-millisecond resolution. METHOD In our previous work, we achieved high temporal resolution by applying a rapidly switched phase encoding gradient along the direction of motion. In this work, we extend phase encoding to the second image direction by using single-point imaging with rapid encoding (SPIRE) to image the two-dimensional vocal fold oscillation in the coronal view. Image data were gated using electroglottography (EGG) and motion corrected. An iterative reconstruction with a total variation (TV) constraint was used and the sequence was also simulated using a motion phantom. RESULTS Dynamic images of the vocal folds during phonation at pitches of 150 and 165 Hz were acquired in two volunteers and the periodic motion of the vocal folds at a temporal resolution of about 600 µs was shown. The simulations emphasize the necessity of SPIRE for two-dimensional motion encoding. DISCUSSION SPIRE is a new MRI method to image rapidly oscillating structures and for the first time provides dynamic images of the vocal folds oscillations in the coronal plane.
Collapse
Affiliation(s)
- Johannes Fischer
- Department of Radiology, Medical Physics, University Medical Center Freiburg, Faculty of Medicine, University of Freiburg, Freiburg, Germany.
| | - Ali Caglar Özen
- Department of Radiology, Medical Physics, University Medical Center Freiburg, Faculty of Medicine, University of Freiburg, Freiburg, Germany.,German Consortium for Translational Cancer Research Partner Site Freiburg, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Serhat Ilbey
- Department of Radiology, Medical Physics, University Medical Center Freiburg, Faculty of Medicine, University of Freiburg, Freiburg, Germany
| | - Louisa Traser
- Freiburg Institute for Musicians' Medicine, Freiburg University Medical Center, Faculty of Medicine, University of Freiburg, Freiburg, Germany
| | - Matthias Echternach
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Head and Neck Surgery, Ludwig-Maximilians-University, Munich, Germany
| | - Bernhard Richter
- Freiburg Institute for Musicians' Medicine, Freiburg University Medical Center, Faculty of Medicine, University of Freiburg, Freiburg, Germany
| | - Michael Bock
- Department of Radiology, Medical Physics, University Medical Center Freiburg, Faculty of Medicine, University of Freiburg, Freiburg, Germany
| |
Collapse
|
7
|
Patel RR, Ternström S. Quantitative and Qualitative Electroglottographic Wave Shape Differences in Children and Adults Using Voice Map-Based Analysis. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2021; 64:2977-2995. [PMID: 34319772 DOI: 10.1044/2021_jslhr-20-00717] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/27/2023]
Abstract
Purpose The purpose of this study is to identify the extent to which various measurements of contacting parameters differ between children and adults during habitual range and overlap vocal frequency/intensity, using voice map-based assessment of noninvasive electroglottography (EGG). Method EGG voice maps were analyzed from 26 adults (22-45 years) and 22 children (4-8 years) during connected speech and vowel /a/ over the habitual range and the overlap vocal frequency/intensity from the voice range profile task on the vowel /a/. Mean and standard deviations of contact quotient by integration, normalized contacting speed, quotient of speed by integration, and cycle-rate sample entropy were obtained. Group differences were evaluated using the linear mixed model analysis for the habitual range connected speech and the vowel, whereas analysis of covariance was conducted for the overlap vocal frequency/intensity from the voice range profile task. Presence of a "knee" on the EGG wave shape was determined by visual inspection of the presence of convexity along the decontacting slope of the EGG pulse and the presence of the second derivative zero-crossing. Results The contact quotient by integration, normalized contacting speed, quotient of speed by integration, and cycle-rate sample entropy were significantly different in children compared to (a) adult males for habitual range and (b) adult males and adult females for the overlap vocal frequency/intensity. None of the children had a "knee" on the decontacting slope of the EGG slope. Conclusion EGG parameters of contact quotient by integration, normalized contacting speed, quotient of speed by integration, cycle-rate sample entropy, and absence of a "knee" on the decontacting slope characterize the wave shape differences between children and adults, whereas the normalized contacting speed, quotient of speed by integration, cycle-rate sample entropy, and presence of a "knee" on the downward pulse slope characterize the wave shape differences between adult males and adult females. Supplemental Material https://doi.org/10.23641/asha.15057345.
Collapse
Affiliation(s)
- Rita R Patel
- Department of Speech, Language and Hearing Sciences, Indiana University Bloomington
| | - Sten Ternström
- Division of Speech, Music, and Hearing, School of Electrical Engineering and Computer Science, KTH Royal Institute of Technology, Stockholm, Sweden
| |
Collapse
|
8
|
Analysis of localized bioimpedance from healthy young adults during activities of the vocal folds using Cole-impedance model representation. Biomed Signal Process Control 2021. [DOI: 10.1016/j.bspc.2021.102665] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
9
|
Sai PV, Rajalakshmi T, Snekhalatha U. Non-invasive thyroid detection based on electroglottogram signal using machine learning classifiers. Proc Inst Mech Eng H 2021; 235:1128-1145. [PMID: 34176352 DOI: 10.1177/09544119211028070] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Thyroid is a butterfly shaped gland located in the neck region. Hormones are secreted by the thyroid gland that is responsible for various functions that maintain metabolism of the body. The variance in secretion of the hormones causes disorders such as Hyperthyroidism or Hypothyroidism. Electroglottography signal is a bio signal which represents the impedance that exist between the glottis regions. The study aims at design and development of an hardware circuit for the acquisition of Electroglottogram signal from normal and thyroid subjects is proposed followed by feature extraction from the acquired bio signal is performed. Further, machine learning classifiers were used to classify the normal and thyroid individuals. This modality of acquisition is non-invasive. Performance evaluation is done by testing various classifiers to study the accuracy. The classifiers tested were Random Forest, Random Tree, Bayes Net, Multilayer Perceptron, Simple Logistic classifier, and One-R classifier. Classifiers such as Random Forest, Random Tree, and Multilayer Perceptron showed high accuracy. The accuracy estimated by these classifiers was tested and its ROC curves with AUC scores were derived. The highest accuracy was reported for Simple Logistic classifier which was about 95.1%. Random Forest and Random Tree reported 93.5% and 91.9% respectively. Similarly, Multilayer Perceptron and Bayes Net gave 93.5% and 91.9%. The One-R classifier algorithm reported the lowest accuracy of 90.3% among the studied classifier algorithms. The ROC-AUC score for the classifiers were also reported to be more than 0.9 which is considered more promising and supports the acquisition and processing methodology. Hence the proposed technique can be efficiently used to diagnose thyroid non-invasively.
Collapse
Affiliation(s)
- P Vijay Sai
- Department of Biomedical Engineering, college of Engineering and Technology, SRM Institute of Science and Technology, Kattankulathur, Tamil Nadu, India
| | - T Rajalakshmi
- Department of Electronics and Communication Engineering, College of Engineering and Technology, SRM Institute of Science and Technology, Kattankulathur, Tamil Nadu, India
| | - U Snekhalatha
- Department of Biomedical Engineering, college of Engineering and Technology, SRM Institute of Science and Technology, Kattankulathur, Tamil Nadu, India
| |
Collapse
|
10
|
Herbst CT. Performance Evaluation of Subharmonic-to-Harmonic Ratio (SHR) Computation. J Voice 2021; 35:365-375. [DOI: 10.1016/j.jvoice.2019.11.005] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2019] [Revised: 11/09/2019] [Accepted: 11/11/2019] [Indexed: 10/24/2022]
|
11
|
Hirosaki M, Kanazawa T, Komazawa D, Konomi U, Sakaguchi Y, Katori Y, Watanabe Y. Predominant Vertical Location of Benign Vocal Fold Lesions by Sex and Music Genre: Implication for Pathogenesis. Laryngoscope 2021; 131:E2284-E2291. [PMID: 33421134 DOI: 10.1002/lary.29378] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2020] [Revised: 12/04/2020] [Accepted: 12/24/2020] [Indexed: 11/09/2022]
Abstract
OBJECTIVES/HYPOTHESIS Vertical locations of vocal fold mucosal lesions (VFMLs) vary along the free edge. As the vertical contact area of vocal folds (VFs) depends on the vocal register, lesions may occur in the contact area of more frequently used vocal registers. This study investigated the cause of location variations by comparing the vertical sites of VFMLs in singers of both sexes with different music genres. STUDY DESIGN Retrospective review. METHODS Sixty professional classical and rock singers (11 male classical [M-classical], 22 male rock [M-rock], 13 female classical [F-classical], and 14 female rock [F-rock] singers) who underwent microlaryngeal surgery for VF polyps and nodules and their 108 lesions were enrolled. The VF free edge was vertically divided into three equal parts and classified into the following four lesion sites: upper, middle, lower, and multiple sites. RESULTS Upper lesions were most common among F-classical singers (73.9%), whereas lower lesions were most common among M-classical (90.0%) and M-rock (60.6%) singers. Among lesions localized to a single site, lower lesions were most common among F-rock singers (37.0%). F-classical singers had significantly more upper lesions than the other groups (P < .001). M-classical singers had significantly more lower lesions than female singers of any genre (P < .001). CONCLUSION Upper lesions were most common among F-classical singers who mostly used the head voice. Lower lesions were most common among singers who mainly used the modal voice. This study suggests that sex, the dominant vocal register used for singing, and mechanical stress on VFs influence the vertical site of VFMLs. LEVEL OF EVIDENCE 4 Laryngoscope, 131:E2284-E2291, 2021.
Collapse
Affiliation(s)
- Mayu Hirosaki
- Tokyo Voice Center, International University of Health and Welfare, Tokyo, Japan.,Department of Otolaryngology-Head and Neck Surgery, Tohoku University Graduate School of Medicine, Miyagi, Japan
| | - Takeharu Kanazawa
- Tokyo Voice Center, International University of Health and Welfare, Tokyo, Japan.,Department of Otolaryngology-Head and Neck Surgery, Jichi Medical University, Tochigi, Japan
| | - Daigo Komazawa
- Tokyo Voice Center, International University of Health and Welfare, Tokyo, Japan.,AKASAKA Voice Health Center, Tokyo, Japan
| | - Ujimoto Konomi
- Tokyo Voice Center, International University of Health and Welfare, Tokyo, Japan.,Voice and Dizziness Clinic Futakotamagawa Otolaryngology, Tokyo, Japan
| | - Yu Sakaguchi
- Tokyo Voice Center, International University of Health and Welfare, Tokyo, Japan
| | - Yukio Katori
- Department of Otolaryngology-Head and Neck Surgery, Tohoku University Graduate School of Medicine, Miyagi, Japan
| | - Yusuke Watanabe
- Tokyo Voice Center, International University of Health and Welfare, Tokyo, Japan
| |
Collapse
|
12
|
Lehoux S, Hampala V, Švec JG. Subglottal pressure oscillations in anechoic and resonant conditions and their influence on excised larynx phonations. Sci Rep 2021; 11:28. [PMID: 33420107 PMCID: PMC7794390 DOI: 10.1038/s41598-020-79265-3] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2020] [Accepted: 11/30/2020] [Indexed: 11/16/2022] Open
Abstract
Excised larynges serve as natural models for studying behavior of the voice source. Acoustic resonances inside the air-supplying tubes below the larynx (i.e., subglottal space), however, interact with the vibratory behavior of the larynges and obscure their inherent vibration properties. Here, we explore a newly designed anechoic subglottal space which allows removing its acoustic resonances. We performed excised larynx experiments using both anechoic and resonant subglottal spaces in order to analyze and compare, for the very first time, the corresponding subglottal pressures, electroglottographic and radiated acoustic waveforms. In contrast to the resonant conditions, the anechoic subglottal pressure waveforms showed negligible oscillations during the vocal fold contact phase, as expected. When inverted, these waveforms closely matched the inverse filtered radiated sound waveforms. Subglottal resonances modified also the radiated sound pressures (Level 1 interactions). Furthermore, they changed the fundamental frequency (fo) of the vocal fold oscillations and offset phonation threshold pressures (Level 2 interactions), even for subglottal resonance frequencies 4-10 times higher than fo. The obtained data offer the basis for better understanding the inherent vibratory properties of the vocal folds, for studying the impact of structure-acoustic interactions on voice, and for validation of computational models of voice production.
Collapse
Affiliation(s)
- Sarah Lehoux
- Voice Research Lab, Department of Biophysics, Faculty of Science, Palacký University, 17. Listopadu 12, 771 46, Olomouc, Czechia
| | - Vít Hampala
- Voice Research Lab, Department of Biophysics, Faculty of Science, Palacký University, 17. Listopadu 12, 771 46, Olomouc, Czechia
| | - Jan G Švec
- Voice Research Lab, Department of Biophysics, Faculty of Science, Palacký University, 17. Listopadu 12, 771 46, Olomouc, Czechia.
| |
Collapse
|
13
|
Evaluation of the Electroglottographic Signal Variability in Organic and Functional Dysphonia. J Voice 2020; 36:881.e5-881.e16. [DOI: 10.1016/j.jvoice.2020.09.005] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2020] [Revised: 09/07/2020] [Accepted: 09/10/2020] [Indexed: 11/18/2022]
|
14
|
Electroglottography – An Update. J Voice 2020; 34:503-526. [DOI: 10.1016/j.jvoice.2018.12.014] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2018] [Revised: 12/27/2018] [Accepted: 12/28/2018] [Indexed: 11/21/2022]
|
15
|
Airflow Vibrato: Dependence on Pitch and Loudness. J Voice 2019; 33:815-830. [DOI: 10.1016/j.jvoice.2018.05.007] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2018] [Accepted: 05/18/2018] [Indexed: 11/22/2022]
|
16
|
Herbst CT, Dunn JC. Fundamental Frequency Estimation of Low-quality Electroglottographic Signals. J Voice 2019; 33:401-411. [DOI: 10.1016/j.jvoice.2018.01.003] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2017] [Accepted: 01/04/2018] [Indexed: 11/16/2022]
|
17
|
Szklanny K, Gubrynowicz R, Ratyńska J, Chojnacka-Wądołowska D. Electroglottographic and acoustic analysis of voice in children with vocal nodules. Int J Pediatr Otorhinolaryngol 2019; 122:82-88. [PMID: 30981945 DOI: 10.1016/j.ijporl.2019.03.030] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/07/2019] [Revised: 03/25/2019] [Accepted: 03/26/2019] [Indexed: 11/30/2022]
Abstract
PURPOSE Vocal fold nodules are usually caused by voice overuse or vocal hyperfunction, and their symptoms include persistent hoarseness - a disturbance in the vocal fold vibrations which results in a turbulent passage of air in the glottis, manifested as a raspy, rough voice. The aim of the study was to present data concerning voice quality in patients with vocal nodules and to compare electroglottographic analysis (EGG) with acoustic analysis. METHODS The study examined 57 children with vocal fold nodules (Group 1). Each patient underwent a phoniatric evaluation of the vocal tract, a videolaryngoscopic examination, and a voice quality assessment, employing electroglottographic and acoustic analyses. The control group consisted of 37 healthy children (Group 2). The following parameters were analyzed: Closed Quotient (EGG signal), Peak Slope, Normalized Amplitude Quotient and Cepstral Peak Prominence (acoustic signal - waveform). RESULTS Changes in the EGG signal could be detected in 95% of the patients with vocal nodules, indicating the occurrence of vocal nodules and glottal insufficiency. The acoustic analysis confirmed breathy phonation in 63% of the patients. The Closed Quotient parameter proved to be more effective than Peak Slope. Closed Quotient, Peak Slope and Normalized Amplitude Quotient allowed for the differentiation of the EGG signal and the acoustic signal in groups 1 and 2 in a statistically significant way. CONCLUSIONS The results of electroglottographic and acoustic analysis show incorrect voice parameters in patients with vocal nodules with reference to the control group. At the same time, the EGG analysis proved to be more effective than the analysis of the acoustic signal.
Collapse
Affiliation(s)
- K Szklanny
- Multimedia Department, Polish-Japanese Academy of Information Technology, Warsaw, Poland.
| | - R Gubrynowicz
- Multimedia Department, Polish-Japanese Academy of Information Technology, Warsaw, Poland
| | - J Ratyńska
- Phoniatrics and Audiology Department, The Children's Memorial Health Institute, Warsaw, Poland
| | | |
Collapse
|
18
|
Ning LH. The effects of age and pitch level on electroglottographic measures during sustained phonation. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2019; 146:640. [PMID: 31370629 DOI: 10.1121/1.5119127] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/05/2019] [Accepted: 07/03/2019] [Indexed: 06/10/2023]
Abstract
The aim of the present study was to use electroglottography (EGG) to explore the effects of age and pitch level on sustained vowel phonation. Thirty female individuals (10 young, 10 middle-aged, and 10 older speakers) without voice disorders or training in singing participated in this study. Eight EGG parameters were measured during sustained vowel production with a high, mid, or low pitch: fundamental frequency, contact quotient, contacting-time quotient, decontacting-time quotient, speed quotient with a midslope criterion (SQ-mid), jitter, shimmer, and the harmonics-to-noise ratio. Age was found to be a significant factor in fundamental frequency, contact quotient, contacting-time quotient, decontacting-time quotient, and SQ-mid. With increasing age, the mean fundamental frequency decreased while the contact quotient increased. The middle-aged and older speakers had more asymmetrical vocal fold vibratory patterns than the young speakers. As for pitch level, the high pitch had a significantly less decontacting-time quotient and greater SQ-mid than low and mid pitches. The lack of significant interaction between age and pitch level indicates that the effects of age and pitch level could be additive. Finally, the discriminant analyses show that contact quotient is an important factor in predicting the age of a voice.
Collapse
Affiliation(s)
- Li-Hsin Ning
- Department of English, National Taiwan Normal University, 162 Heping East Road, Daan District, Taipei City 106, Taiwan
| |
Collapse
|
19
|
Nacci A, Romeo SO, Cavaliere MD, Macerata A, Bastiani L, Paludetti G, Galli J, Marchese MR, Barillari MR, Barillari U, Berrettini S, Laschi C, Cianchetti M, Manti M, Ursino F, Fattori B. Comparison of electroglottographic variability index in euphonic and pathological voice. ACTA ACUST UNITED AC 2019; 39:381-388. [PMID: 30745592 PMCID: PMC6966776 DOI: 10.14639/0392-100x-2127] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2018] [Accepted: 03/26/2018] [Indexed: 11/23/2022]
Abstract
In a recent study we introduced a new approach for analysis of the electroglottographic (ECG) signal. This method is based on the evaluation of variation of the EGG signal and its first derivative, through new software developed by the Pisan phoniatric school. This software is designed to extract quantitative indices related to the contacting and decontacting phases of the vocal folds during phonation. The software allows us to study the combined variability of vibration amplitude and velocity (i.e. the first derivative of the EGG signal). Pathological voices show a much more variable EGG signal compared to normal voices, since cordal vibration is made irregular due to the presence of glottis plane pathologies. With the aim of demonstrating the differences between normal and pathological voices relevant to combined vibration amplitude and velocity variability, we have introduced a new quantitative parameter named “variability index, VI”. We studied 95 subjects (35 normal and 60 with pathological voice); among pathologic subjects, 15 showed functional dysphonia and 45 showed organic dysphonia. Subjects affected by organic dysphonia presented: 15 bilateral vocal nodules, 15 unilateral polyps and 15 unilateral cysts. All subjects were studied with videolaryngostroboscopy; electro-acoustic parameters of the voice were analysed with the KayPENTAX CSL (Model 4500) system. The EGG signal was recorded using KAY Model 6103 connected to the CSL system. The new software for the analysis of the EGG signal allows us to obtain not only a VI total value relevant to variability during all the recording, but also partial VI values relevant to the different glottis cycle phases. In fact, plotting the amplitude variation and its first derivative on a Lissajous graph, it is possible to divide the whole glottis cycle into four phases (each represented by four quadrants on the graph): the initial vocal folds contacting activity (VI-Q1), the last phase of vocal folds contacting (VI-Q2), the first phase of vocal folds decontacting (VI-Q3) and the last phase, up to the complete decontacting of vocal folds (VI-Q4). For each quadrant, it is also possible to work out the percent variability index. By comparing the variability indices in the normal and pathological groups, we obtained the following results: the total VI was significantly higher in the pathological subjects (0.25 vs 0.18; p = 0.01); the absolute value of VI was higher in pathological subjects, although the difference was not significant (VI-Q2, 0.041 vs 0.029; VI-Q3, 0.065 vs 0.058; VI-Q4, 0.054 vs 0.052). The percent variability in the Q2 quadrant (VI-Q2%) was significantly higher in pathological subjects compared to normal subjects (0.22 vs 0.16) (p = 0.01). The results of this study confirm that our new software for analysis of EGG signal can distinguish normal voice from pathological voice based on the new quantitative parameter VI. Moreover, this study emphasises that the final contact phase of vocal folds is the most representative of the difference between the normal and pathological voice and shows a wider variability in terms of amplitude and vibration velocity. Further studies on larger groups of subjects will be required to confirm these results and assess differences in the EGG signal among the various vocal fold pathologies.
Collapse
Affiliation(s)
- A Nacci
- ENT, Audiology and Phoniatrics Unit, University Hospital of Pisa, Italy
| | - S O Romeo
- ENT, Audiology and Phoniatrics Unit, University Hospital of Pisa, Italy
| | - M D Cavaliere
- ENT, Audiology and Phoniatrics Unit, University Hospital of Pisa, Italy
| | - A Macerata
- Department of Clinical and Experimental Medicine, University of Pisa, Italy
| | - L Bastiani
- Institute of Clinical Physiology of the Italian National Research Council (IFC-CNR), Pisa, Italy
| | - G Paludetti
- Institute of Otorhinolaryngology, Department of Head and Neck Surgery, Fondazione Policlinico Universitario A. Gemelli IRCCS, Roma - Università Cattolica del Sacro Cuore, Rome, Italy
| | - J Galli
- Institute of Otorhinolaryngology, Department of Head and Neck Surgery, Fondazione Policlinico Universitario A. Gemelli IRCCS, Roma - Università Cattolica del Sacro Cuore, Rome, Italy
| | - M R Marchese
- Institute of Otorhinolaryngology, Department of Head and Neck Surgery, Fondazione Policlinico Universitario A. Gemelli IRCCS, Rome, Italy
| | - M R Barillari
- Division of Phoniatrics and Audiology, Department of Mental and Physical Health and Preventive Medicine, University of Campania "Luigi Vanvitelli", Naples, Italy
| | - U Barillari
- Division of Phoniatrics and Audiology, Department of Mental and Physical Health and Preventive Medicine, University of Campania "Luigi Vanvitelli", Naples, Italy
| | - S Berrettini
- ENT, Audiology and Phoniatrics Unit, University Hospital of Pisa, Italy.,Division of ENT Diseases, Karolinska Institutet, Stockholm, Sweden
| | - C Laschi
- The BioRobotics Institute, Scuola Superiore Sant'Anna, Pisa, Italy
| | - M Cianchetti
- The BioRobotics Institute, Scuola Superiore Sant'Anna, Pisa, Italy
| | - M Manti
- The BioRobotics Institute, Scuola Superiore Sant'Anna, Pisa, Italy
| | - F Ursino
- National Institute for Research in Phoniatrics, University of Pisa, Italy
| | - B Fattori
- ENT, Audiology and Phoniatrics Unit, University Hospital of Pisa, Italy
| |
Collapse
|
20
|
Selamtzis A, Ternström S, Richter B, Burk F, Köberlein M, Echternach M. A comparison of electroglottographic and glottal area waveforms for phonation type differentiation in male professional singers. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2018; 144:3275. [PMID: 30599695 DOI: 10.1121/1.5066456] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/01/2017] [Accepted: 10/14/2018] [Indexed: 06/09/2023]
Abstract
This study compares the use of electroglottograms (EGGs) and glottal area waveforms (GAWs) to study phonation in different vibratory states as produced by professionally trained singers. Six western classical tenors were asked to phonate pitch glides from modal to falsetto phonation, or from modal to their stage voice above the passaggio (SVaP). For each pitch glide the sample entropy (SampEn) of the EGG signal was calculated to detect the occurrence of phonatory instabilities and establish a "ground truth" for the performed phonation type. The cycles before the maximum SampEn were labeled as modal, and the cycles after the peak were labeled as either falsetto, or SVaP. Three automatic categorizations of vibratory state were performed using clustering: one based only on the EGG, one based on the GAW, and one based on their combination. The error rate (clustering vs ground truth) was, on average, lower than 10% for all of the three settings, revealing no special advantage of the GAW over EGG, and vice versa. Modal voice cycles exhibited a larger contact quotient, larger normalized derivative peak ratio, and lower rise time, compared to SVaP and falsetto. The GAW-based normalized maximum area declination rate was larger in SVaP compared to modal voice.
Collapse
Affiliation(s)
- Andreas Selamtzis
- Department of Speech, Music and Hearing, School of Electrical Engineering and Computer Science, KTH Royal Institute of Technology, Lindstedtsvägen 24, Breisacherstraße 60, Stockholm, SE-100 44, Sweden
| | - Sten Ternström
- Department of Speech, Music and Hearing, School of Electrical Engineering and Computer Science, KTH Royal Institute of Technology, Lindstedtsvägen 24, Breisacherstraße 60, Stockholm, SE-100 44, Sweden
| | - Bernard Richter
- Institute of Musicians' Medicine, Freiburg University Medical Center, Breisacher Strausse 60, Freiburg, 79106, Germany
| | - Fabian Burk
- Department of Otorhinolaryngology, University Medical Center Schleswig-Holstein, Arnold-Heller-Straße 3, 24105 Kiel, Germany
| | - Marie Köberlein
- Institute of Musicians' Medicine, Freiburg University Medical Center, Breisacher Strausse 60, Freiburg, 79106, Germany
| | - Matthias Echternach
- Department of Otorhinolaryngology, Division of Phoniatrics and Pediatric Audiology, Munich University, Campus Großhadern, Marchioninistraße 15, Munich, 81377, Germany
| |
Collapse
|
21
|
Ternström S, D'Amario S, Selamtzis A. Effects of the Lung Volume on the Electroglottographic Waveform in Trained Female Singers. J Voice 2018; 34:485.e1-485.e21. [PMID: 30337119 DOI: 10.1016/j.jvoice.2018.09.006] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2018] [Revised: 09/04/2018] [Accepted: 09/06/2018] [Indexed: 11/25/2022]
Abstract
OBJECTIVES To determine if in singing there is an effect of lung volume on the electroglottographic waveform, and if so, how it varies over the voice range. STUDY DESIGN Eight trained female singers sang the tune "Frère Jacques" in 18 conditions: three phonetic contexts, three dynamic levels, and high or low lung volume. Conditions were randomized and replicated. METHODS The audio and EGG signals were recorded in synchrony with signals tracking respiration and vertical larynx position. The first 10 Fourier descriptors of every EGG cycle were computed. These spectral data were clustered statistically, and the clusters were mapped by color into a voice range profile display, thus visualizing the EGG waveform changes under the influence of fo and SPL. The rank correlations and effect sizes of the relationships between relative lung volume and several adduction-related EGG wave shape metrics were similarly rendered on a color scale, in voice range profile-style 'voice maps.' RESULTS In most subjects, EGG waveforms varied considerably over the voice range. Within subjects, reproducibility was high, not only across the replications, but also across the phonetic contexts. The EGG waveforms were quite individual, as was the nature of the EGG shape variation across the range. EGG metrics were significantly correlated to changes in lung volume, in parts of the range of the song, and in most subjects. However, the effect sizes of the relative lung volume were generally much smaller than the effects of fo and SPL, and the relationships always varied, even changing polarity from one part of the range to another. CONCLUSIONS Most subjects exhibited small, reproducible effects of the relative lung volume on the EGG waveform. Some hypothesized influences of tracheal pull were seen, mostly at the lowest SPLs. The effects were however highly variable, both across the moderately wide fo-SPL range and across subjects. Different singers may be applying different techniques and compensatory behaviors with changing lung volume. The outcomes emphasize the importance of making observations over a substantial part of the voice range, and not only of phonations sustained at a few fundamental frequencies and sound levels.
Collapse
Affiliation(s)
- Sten Ternström
- Department of Speech, Music and Hearing, School of Electrical Engineering and Computer Science, KTH Royal Institute of Technology, Stockholm, Sweden.
| | - Sara D'Amario
- Department of Speech, Music and Hearing, School of Electrical Engineering and Computer Science, KTH Royal Institute of Technology, Stockholm, Sweden; Audio Lab, Department of Electronic Engineering, University of York, Heslington, United Kingdom
| | - Andreas Selamtzis
- Department of Speech, Music and Hearing, School of Electrical Engineering and Computer Science, KTH Royal Institute of Technology, Stockholm, Sweden
| |
Collapse
|
22
|
Rasmussen JH, Herbst CT, Elemans CPH. Quantifying syringeal dynamics in vitro using electroglottography. ACTA ACUST UNITED AC 2018; 221:jeb.172247. [PMID: 29880637 DOI: 10.1242/jeb.172247] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2017] [Accepted: 05/30/2018] [Indexed: 11/20/2022]
Abstract
The complex and elaborate vocalizations uttered by many of the 10,000 extant bird species are considered a major driver in their evolutionary success, warranting study of the underlying mechanisms of vocal production. Additionally, birdsong has developed into a highly productive model system for vocal imitation learning and motor control, where, in contrast to humans, we have experimental access to the entire neuromechanical control loop. In human voice production, complex laryngeal geometry, vocal fold tissue properties, airflow and laryngeal musculature all interact to ultimately control vocal fold kinematics. Quantifying vocal fold kinematics is thus critical to understanding neuromechanical control of voiced sound production, but in vivo imaging of vocal fold kinematics in birds is experimentally challenging. Here, we adapted and tested electroglottography (EGG) as a novel tool for examining vocal fold kinematics in the avian vocal organ, the syrinx. We furthermore imaged and quantified syringeal kinematics in the pigeon (Columba livia) syrinx with unprecedented detail. Our results show that EGG signals predict (1) the relative amount of contact between the avian equivalent of vocal folds and (2) essential parameters describing vibratory kinematics, such as fundamental frequency, and timing of syringeal opening and closing events. As such, EGG provides novel opportunities for measuring syringeal vibratory kinematic parameters in vivo Furthermore, the opportunity for imaging syringeal vibratory kinematics from multiple planar views (horizontal and coronal) simultaneously promotes birds as an excellent model system for studying kinematics and control of voiced sound production in general, including in humans and other mammals.
Collapse
Affiliation(s)
- Jeppe H Rasmussen
- Department of Biology, University of Southern Denmark, 5230 Odense, Denmark
| | - Christian T Herbst
- Department of Cognitive Biology, University of Vienna, 1090 Vienna, Austria
| | - Coen P H Elemans
- Department of Biology, University of Southern Denmark, 5230 Odense, Denmark
| |
Collapse
|
23
|
Herbst CT, Koda H, Kunieda T, Suzuki J, Garcia M, Fitch WT, Nishimura T. Japanese macaque phonatory physiology. ACTA ACUST UNITED AC 2018; 221:jeb.171801. [PMID: 29615529 DOI: 10.1242/jeb.171801] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2017] [Accepted: 03/26/2018] [Indexed: 11/20/2022]
Abstract
Although the call repertoire and its communicative function are relatively well explored in Japanese macaques (Macaca fuscata), little empirical data are available on the physics and the physiology of this species' vocal production mechanism. Here, a 6 year old female Japanese macaque was trained to phonate under an operant conditioning paradigm. The resulting 'coo' calls and spontaneously uttered 'growl' and 'chirp' calls were recorded with sound pressure level (SPL) calibrated microphones and electroglottography (EGG), a non-invasive method for assessing the dynamics of phonation. A total of 448 calls were recorded, complemented by ex vivo recordings on an excised Japanese macaque larynx. In this novel multidimensional investigative paradigm, in vivo and ex vivo data were matched via comparable EGG waveforms. Subsequent analysis suggests that the vocal range (range of fundamental frequency and SPL) of the macaque was comparable to that of a 7-10 year old human, with the exception of low intensity chirps, the production of which may be facilitated by the species' vocal membranes. In coo calls, redundant control of fundamental frequency in relation to SPL was also comparable to that in humans. EGG data revealed that growls, coos and chirps were produced by distinct laryngeal vibratory mechanisms. EGG further suggested changes in the degree of vocal fold adduction in vivo, resulting in spectral variation within the emitted coo calls, ranging from 'breathy' (including aerodynamic noise components) to 'non-breathy'. This is again analogous to humans, corroborating the notion that phonation in humans and non-human primates is based on universal physical and physiological principles.
Collapse
Affiliation(s)
- Christian T Herbst
- Bioacoustics Laboratory, Department of Cognitive Biology, University Vienna, Althanstrasse 14, 1090 Vienna, Austria
| | - Hiroki Koda
- Primate Research Institute, Kyoto University, Inuyama, Aichi 484-8506, Japan
| | - Takumi Kunieda
- Primate Research Institute, Kyoto University, Inuyama, Aichi 484-8506, Japan
| | - Juri Suzuki
- Primate Research Institute, Kyoto University, Inuyama, Aichi 484-8506, Japan
| | - Maxime Garcia
- Bioacoustics Laboratory, Department of Cognitive Biology, University Vienna, Althanstrasse 14, 1090 Vienna, Austria.,ENES Lab, Université Lyon/Saint-Etienne, NEURO-PSI, CNRS UMR 9197, 23 rue Paul Michelon, 42023 Saint-Etienne, France
| | - W Tecumseh Fitch
- Bioacoustics Laboratory, Department of Cognitive Biology, University Vienna, Althanstrasse 14, 1090 Vienna, Austria
| | - Takeshi Nishimura
- Primate Research Institute, Kyoto University, Inuyama, Aichi 484-8506, Japan
| |
Collapse
|
24
|
HERBST CHRISTIANT, DUNN JACOBC. Non-invasive documentation of primate voice production using electroglottography. ANTHROPOL SCI 2018. [DOI: 10.1537/ase.180201] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/01/2022]
Affiliation(s)
| | - JACOB C. DUNN
- Department of Animal and Environmental Biology, Faculty of Science & Technology, Anglia Ruskin University, Cambridge
- Division of Biological Anthropology, University of Cambridge, Cambridge
| |
Collapse
|
25
|
Herbst CT, Hampala V, Garcia M, Hofer R, Svec JG. Hemi-laryngeal Setup for Studying Vocal Fold Vibration in Three Dimensions. J Vis Exp 2017. [PMID: 29286438 DOI: 10.3791/55303] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/31/2022] Open
Abstract
The voice of humans and most non-human mammals is generated in the larynx through self-sustaining oscillation of the vocal folds. Direct visual documentation of vocal fold vibration is challenging, particularly in non-human mammals. As an alternative, excised larynx experiments provide the opportunity to investigate vocal fold vibration under controlled physiological and physical conditions. However, the use of a full larynx merely provides a top view of the vocal folds, excluding crucial portions of the oscillating structures from observation during their interaction with aerodynamic forces. This limitation can be overcome by utilizing a hemi-larynx setup where one half of the larynx is mid-sagittally removed, providing both a superior and a lateral view of the remaining vocal fold during self-sustained oscillation. Here, a step-by-step guide for the anatomical preparation of hemi-laryngeal structures and their mounting on the laboratory bench is given. Exemplary phonation of the hemi-larynx preparation is documented with high-speed video data captured by two synchronized cameras (superior and lateral views), showing three-dimensional vocal fold motion and corresponding time-varying contact area. The documentation of the hemi-larynx setup in this publication will facilitate application and reliable repeatability in experimental research, providing voice scientists with the potential to better understand the biomechanics of voice production.
Collapse
Affiliation(s)
- Christian T Herbst
- Voice Research Lab, Department of Biophysics, Faculty of Science, Palacky University Olomouc; Laboratory of Bio-Acoustics, Dept. of Cognitive Biology, University of Vienna;
| | - Vit Hampala
- Voice Research Lab, Department of Biophysics, Faculty of Science, Palacky University Olomouc
| | - Maxime Garcia
- Laboratory of Bio-Acoustics, Dept. of Cognitive Biology, University of Vienna; ENES Lab, NEURO-PSI,CNRS UMR 9197, Université Lyon/Saint-Etienne, France
| | - Riccardo Hofer
- Laboratory of Bio-Acoustics, Dept. of Cognitive Biology, University of Vienna
| | - Jan G Svec
- Voice Research Lab, Department of Biophysics, Faculty of Science, Palacky University Olomouc
| |
Collapse
|
26
|
Macerata A, Nacci A, Manti M, Cianchetti M, Matteucci J, Romeo SO, Fattori B, Berrettini S, Laschi C, Ursino F. Evaluation of the Electroglottographic signal variability by amplitude-speed combined analysis. Biomed Signal Process Control 2017. [DOI: 10.1016/j.bspc.2016.10.003] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
27
|
Syndergaard KL, Dushku S, Thomson SL. Electrically conductive synthetic vocal fold replicas for voice production research. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2017; 142:EL63. [PMID: 28764459 PMCID: PMC5724742 DOI: 10.1121/1.4990540] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/01/2017] [Accepted: 06/15/2017] [Indexed: 06/01/2023]
Abstract
A method of fabricating electrically conductive synthetic vocal fold replicas and monitoring their vibration via resistance measurement is presented. Normally non-conductive silicone replicas were coated with conductive graphite and subjected to long-term vibration tests. Synchronized resistance and imaging data using hemilarynx and full larynx configurations showed an inverse correlation between replica contact area and resistance during vibration, similar to clinical electroglottography (EGG) used to estimate vocal fold contact area. This method has potential for long-term replica vibration monitoring and studying basic physical relationships between resistance and contact area in vocal folds and vocal fold replicas.
Collapse
Affiliation(s)
- Kyle L Syndergaard
- Department of Mechanical Engineering, Brigham Young University-Idaho, Rexburg, Idaho 83460, USA ,
| | - Shelby Dushku
- Department of Mechanical Engineering, Brigham Young University-Idaho, Rexburg, Idaho 83460, USA ,
| | - Scott L Thomson
- Department of Mechanical Engineering, Brigham Young University, Provo, Utah 84602, USA
| |
Collapse
|
28
|
Herbst CT, Schutte HK, Bowling DL, Svec JG. Comparing Chalk With Cheese—The EGG Contact Quotient Is Only a Limited Surrogate of the Closed Quotient. J Voice 2017; 31:401-409. [DOI: 10.1016/j.jvoice.2016.11.007] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2016] [Revised: 11/06/2016] [Accepted: 11/08/2016] [Indexed: 10/20/2022]
|
29
|
Echternach M, Burk F, Köberlein M, Selamtzis A, Döllinger M, Burdumy M, Richter B, Herbst CT. Laryngeal evidence for the first and second passaggio in professionally trained sopranos. PLoS One 2017; 12:e0175865. [PMID: 28467509 PMCID: PMC5414960 DOI: 10.1371/journal.pone.0175865] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2016] [Accepted: 03/31/2017] [Indexed: 11/18/2022] Open
Abstract
Introduction Due to a lack of empirical data, the current understanding of the laryngeal mechanics in the passaggio regions (i.e., the fundamental frequency ranges where vocal registration events usually occur) of the female singing voice is still limited. Material and methods In this study the first and second passaggio regions of 10 professionally trained female classical soprano singers were analyzed. The sopranos performed pitch glides from A3 (ƒo = 220 Hz) to A4 (ƒo = 440 Hz) and from A4 (ƒo = 440 Hz) to A5 (ƒo = 880 Hz) on the vowel [iː]. Vocal fold vibration was assessed with trans-nasal high speed videoendoscopy at 20,000 fps, complemented by simultaneous electroglottographic (EGG) and acoustic recordings. Register breaks were perceptually rated by 12 voice experts. Voice stability was documented with the EGG-based sample entropy. Glottal opening and closing patterns during the passaggi were analyzed, supplemented with open quotient data extracted from the glottal area waveform. Results In both the first and the second passaggio, variations of vocal fold vibration patterns were found. Four distinct patterns emerged: smooth transitions with either increasing or decreasing durations of glottal closure, abrupt register transitions, and intermediate loss of vocal fold contact. Audible register transitions (in both the first and second passaggi) generally coincided with higher sample entropy values and higher open quotient variance through the respective passaggi. Conclusions Noteworthy vocal fold oscillatory registration events occur in both the first and the second passaggio even in professional sopranos. The respective transitions are hypothesized to be caused by either (a) a change of laryngeal biomechanical properties; or by (b) vocal tract resonance effects, constituting level 2 source-filter interactions.
Collapse
Affiliation(s)
- Matthias Echternach
- Institute of Musicians’ Medicine, University of Freiburg Medical Center, Faculty of Medicine, University of Freiburg, Freiburg, Germany
| | - Fabian Burk
- Institute of Musicians’ Medicine, University of Freiburg Medical Center, Faculty of Medicine, University of Freiburg, Freiburg, Germany
| | - Marie Köberlein
- Institute of Musicians’ Medicine, University of Freiburg Medical Center, Faculty of Medicine, University of Freiburg, Freiburg, Germany
| | - Andreas Selamtzis
- Royal Technical University, Music Acoustics. Lindstedtsvägen 24, Stockholm, Sweden
| | - Michael Döllinger
- Division of Phoniatrics and Pediatric Audiology at the Department of Otorhinolaryngology Head & Neck Surgery, University Hospital Erlangen, Medical School, Waldstrasse 1, Erlangen, Germany
| | - Michael Burdumy
- Department of Medical Physics, University of Freiburg Medical Center, Faculty of Medicine, University of Freiburg, Freiburg, Germany
| | - Bernhard Richter
- Institute of Musicians’ Medicine, University of Freiburg Medical Center, Faculty of Medicine, University of Freiburg, Freiburg, Germany
| | - Christian Thomas Herbst
- Laboratory of Bio-Acoustics, Department of Cognitive Biology, University of Vienna, Althanstraße 14, Vienna, Austria
- * E-mail:
| |
Collapse
|
30
|
Jing B, Chigan P, Ge Z, Wu L, Wang S, Wan M. Visualizing the movement of the contact between vocal folds during vibration by using array-based transmission ultrasonic glottography. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2017; 141:3312. [PMID: 28599522 PMCID: PMC5435516 DOI: 10.1121/1.4983472] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/14/2016] [Revised: 04/26/2017] [Accepted: 04/28/2017] [Indexed: 05/31/2023]
Abstract
For the purpose of noninvasively visualizing the dynamics of the contact between vibrating vocal fold medial surfaces, an ultrasonic imaging method which is referred to as array-based transmission ultrasonic glottography is proposed. An array of ultrasound transducers is used to detect the ultrasound wave transmitted from one side of the vocal folds to the other side through the small-sized contact between the vocal folds. A passive acoustic mapping method is employed to visualize and locate the contact. The results of the investigation using tissue-mimicking phantoms indicate that it is feasible to use the proposed method to visualize and locate the contact between soft tissues. Furthermore, the proposed method was used for investigating the movement of the contact between the vibrating vocal folds of excised canine larynges. The results indicate that the vertical movement of the contact can be visualized as a vertical movement of a high-intensity stripe in a series of images obtained by using the proposed method. Moreover, a visualization and analysis method, which is referred to as array-based ultrasonic kymography, is presented. The velocity of the vertical movement of the contact, which is estimated from the array-based ultrasonic kymogram, could reach 0.8 m/s during the vocal fold vibration.
Collapse
Affiliation(s)
- Bowen Jing
- The Key Laboratory of Biomedical Information Engineering of Ministry of Education, Department of Biomedical Engineering, School of Life Science and Technology, Xi'an Jiaotong University, No. 28, Xianning West Road, Xi'an, Shaanxi, 710049, People's Republic of China
| | - Pengju Chigan
- The Key Laboratory of Biomedical Information Engineering of Ministry of Education, Department of Biomedical Engineering, School of Life Science and Technology, Xi'an Jiaotong University, No. 28, Xianning West Road, Xi'an, Shaanxi, 710049, People's Republic of China
| | - Zhengtong Ge
- The Key Laboratory of Biomedical Information Engineering of Ministry of Education, Department of Biomedical Engineering, School of Life Science and Technology, Xi'an Jiaotong University, No. 28, Xianning West Road, Xi'an, Shaanxi, 710049, People's Republic of China
| | - Liang Wu
- The Key Laboratory of Biomedical Information Engineering of Ministry of Education, Department of Biomedical Engineering, School of Life Science and Technology, Xi'an Jiaotong University, No. 28, Xianning West Road, Xi'an, Shaanxi, 710049, People's Republic of China
| | - Supin Wang
- The Key Laboratory of Biomedical Information Engineering of Ministry of Education, Department of Biomedical Engineering, School of Life Science and Technology, Xi'an Jiaotong University, No. 28, Xianning West Road, Xi'an, Shaanxi, 710049, People's Republic of China
| | - Mingxi Wan
- The Key Laboratory of Biomedical Information Engineering of Ministry of Education, Department of Biomedical Engineering, School of Life Science and Technology, Xi'an Jiaotong University, No. 28, Xianning West Road, Xi'an, Shaanxi, 710049, People's Republic of China
| |
Collapse
|
31
|
Somanath K, Mau T. A Measure of the Auditory-perceptual Quality of Strain from Electroglottographic Analysis of Continuous Dysphonic Speech: Application to Adductor Spasmodic Dysphonia. J Voice 2016; 30:770.e9-770.e21. [PMID: 26739857 DOI: 10.1016/j.jvoice.2015.11.005] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2015] [Accepted: 11/05/2015] [Indexed: 11/18/2022]
Abstract
OBJECTIVES (1) To develop an automated algorithm to analyze electroglottographic (EGG) signal in continuous dysphonic speech, and (2) to identify EGG waveform parameters that correlate with the auditory-perceptual quality of strain in the speech of patients with adductor spasmodic dysphonia (ADSD). STUDY DESIGN Software development with application in a prospective controlled study. METHODS EGG was recorded from 12 normal speakers and 12 subjects with ADSD reading excerpts from the Rainbow Passage. Data were processed by a new algorithm developed with the specific goal of analyzing continuous dysphonic speech. The contact quotient, pulse width, a new parameter peak skew, and various contact closing slope quotient and contact opening slope quotient measures were extracted. EGG parameters were compared between normal and ADSD speech. Within the ADSD group, intra-subject comparison was also made between perceptually strained syllables and unstrained syllables. RESULTS The opening slope quotient SO7525 distinguished strained syllables from unstrained syllables in continuous speech within individual subjects with ADSD. The standard deviations, but not the means, of contact quotient, EGGW50, peak skew, and SO7525 were different between normal and ADSD speakers. The strain-stress pattern in continuous speech can be visualized as color gradients based on the variation of EGG parameter values. CONCLUSIONS EGG parameters may provide a within-subject measure of vocal strain and serve as a marker for treatment response. The addition of EGG to multidimensional assessment may lead to improved characterization of the voice disturbance in ADSD.
Collapse
Affiliation(s)
- Keerthan Somanath
- Clinical Center for Voice Care, Department of Otolaryngology-Head and Neck Surgery, University of Texas Southwestern Medical Center, Dallas, Texas
| | - Ted Mau
- Clinical Center for Voice Care, Department of Otolaryngology-Head and Neck Surgery, University of Texas Southwestern Medical Center, Dallas, Texas..
| |
Collapse
|