1
|
Ren Z, Shang F, Zheng Y, Wu N, Ma L, Zhou X. The Role of EGG in Identifying Prevocalic Glottal Stop. J Voice 2024:S0892-1997(24)00020-1. [PMID: 38402112 DOI: 10.1016/j.jvoice.2024.01.017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2023] [Revised: 01/26/2024] [Accepted: 01/26/2024] [Indexed: 02/26/2024]
Abstract
OBJECTIVE The aim of the study is to investigate the use of incidences and characteristics of Prevocalic Electroglottographic Signal (PVES) derived from electroglottography (EGG) in characterizing glottal stops (GS) in cleft palate speech. METHODS Mandarin nonaspirated monosyllabic first-tone words were used for the speech sampling procedure. A total of 1680 utterances (from 83 patients with repaired cleft palates) were divided into three categories based on the results of auditory-perceptual evaluation of recorded speech sounds by three independent reviewers: [Category A (absence of GS agreed by all three reviewers) (n = 1192 tokens), Category B (two out of three reviewers agreed on the presence of a GS) (n = 181 tokens) and Category C (all three reviewers agreed on the presence of a GS) (n = 307 tokens)]. The EGG signals of the 1680 utterances were analyzed using a MATLAB program to automatically mark the instances of PVES (amplitude and time-interval) in the GS utterances. RESULTS The result showed that the incidence of EGG PVES presented good positive correlation with auditory-perceptual evaluation (r = 0.703, P<0.000). Statistical analysis revealed a significant difference in mean PVES amplitude among different groups (P<0.05). There was a significant distinction in the time interval between groups A and B, as well as in groups A and C (P<0.05). CONCLUSIONS The study suggests PVES can be an objective means of identifying GS in cleft palate speech. It also indicates that proportion of amplitude and time interval of PVES tend to be positively correlate with subjective assessment.
Collapse
Affiliation(s)
- Zhen Ren
- Department of Oral & Maxillofacial Surgery, Peking University School and Hospital of Stomatology, Beijing, China
| | - Feifei Shang
- Department of Oral & Maxillofacial Surgery, Peking University School and Hospital of Stomatology, Beijing, China
| | - Yafeng Zheng
- Department of Oral & Maxillofacial Surgery, Peking University School and Hospital of Stomatology, Beijing, China
| | - Nankai Wu
- Department of Chinese Language and Literature, Jinan University, Guangzhou, China
| | - Lian Ma
- Department of Oral & Maxillofacial Surgery, Peking University School and Hospital of Stomatology, Beijing, China
| | - Xia Zhou
- Department of Oral & Maxillofacial Surgery, Peking University School and Hospital of Stomatology, Beijing, China.
| |
Collapse
|
2
|
Tomaszewska JZ, Georgakis A. Electroglottography in Medical Diagnostics of Vocal Tract Pathologies: A Systematic Review. J Voice 2023:S0892-1997(23)00388-0. [PMID: 38143204 DOI: 10.1016/j.jvoice.2023.12.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2023] [Revised: 12/02/2023] [Accepted: 12/04/2023] [Indexed: 12/26/2023]
Abstract
Electroglottography (EGG) is a technology developed for measuring the vocal fold contact area during human voice production. Although considered subjective and unreliable as a sole diagnostic method, with the correct application of relevant computational methods, it can constitute a most promising non-invasive voice disorder diagnostic tools in a form of a digital vocal tract pathology classifier. The aim of the following study is to gather and evaluate currently existing digital voice quality assessment systems and vocal tract abnormality classification systems that rely on the use of electroglottographic bio-impedance signals. To fully comprehend the findings of this review, first the subject of EGG is introduced. For that, we summarise most relevant existing research on EGG with a particular focus on its application in diagnostics. Then, we move on to the focal point of this work, which is describing and comparing the existing EGG-based digital voice pathology classification systems. With the application of PRISMA model, 13 articles were chosen and analysed in detail. Direct comparison between chosen studies brought us to pivotal conclusions, which have been described in Section 5 of this report. Meanwhile, certain limitations arising from the literature were identified, such as questionable understanding of the nature of EGG bio-impedance signals. The appropriate recommendations for future work were made, including the application of different methods for EGG feature extraction, as well as the need for continuous EGG datasets development containing signals gathered in various conditions and with different equipments.
Collapse
|
3
|
Codino J, Jackson-Menaldi MC, Rubin A, Torres ME. Automated Quantification of Inflection Events in The Electroglottographic Signal. J Voice 2023; 37:640-647. [PMID: 34162494 DOI: 10.1016/j.jvoice.2021.05.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2021] [Revised: 05/01/2021] [Accepted: 05/10/2021] [Indexed: 11/26/2022]
Affiliation(s)
- Juliana Codino
- Lakeshore Professional Voice Center, Lakeshore Ear, Nose and Throat Center, MI, USA
| | - María Cristina Jackson-Menaldi
- Laboratorio de Señales y Dinámicas no Lineales, Facultad de Ingeniería, Universidad Nacional de Entre Ríos, Argentina, National Council for Scientific and Technical Research (CONICET), Argentina
| | - Adam Rubin
- Laboratorio de Señales y Dinámicas no Lineales, Facultad de Ingeniería, Universidad Nacional de Entre Ríos, Argentina, National Council for Scientific and Technical Research (CONICET), Argentina
| | - María Eugenia Torres
- Laboratorio de Señales y Dinámicas no Lineales, Facultad de Ingeniería, Universidad Nacional de Entre Ríos, Argentina, National Council for Scientific and Technical Research (CONICET), Argentina
| |
Collapse
|
4
|
Patel RR, Ternström S. Quantitative and Qualitative Electroglottographic Wave Shape Differences in Children and Adults Using Voice Map-Based Analysis. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2021; 64:2977-2995. [PMID: 34319772 DOI: 10.1044/2021_jslhr-20-00717] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/27/2023]
Abstract
Purpose The purpose of this study is to identify the extent to which various measurements of contacting parameters differ between children and adults during habitual range and overlap vocal frequency/intensity, using voice map-based assessment of noninvasive electroglottography (EGG). Method EGG voice maps were analyzed from 26 adults (22-45 years) and 22 children (4-8 years) during connected speech and vowel /a/ over the habitual range and the overlap vocal frequency/intensity from the voice range profile task on the vowel /a/. Mean and standard deviations of contact quotient by integration, normalized contacting speed, quotient of speed by integration, and cycle-rate sample entropy were obtained. Group differences were evaluated using the linear mixed model analysis for the habitual range connected speech and the vowel, whereas analysis of covariance was conducted for the overlap vocal frequency/intensity from the voice range profile task. Presence of a "knee" on the EGG wave shape was determined by visual inspection of the presence of convexity along the decontacting slope of the EGG pulse and the presence of the second derivative zero-crossing. Results The contact quotient by integration, normalized contacting speed, quotient of speed by integration, and cycle-rate sample entropy were significantly different in children compared to (a) adult males for habitual range and (b) adult males and adult females for the overlap vocal frequency/intensity. None of the children had a "knee" on the decontacting slope of the EGG slope. Conclusion EGG parameters of contact quotient by integration, normalized contacting speed, quotient of speed by integration, cycle-rate sample entropy, and absence of a "knee" on the decontacting slope characterize the wave shape differences between children and adults, whereas the normalized contacting speed, quotient of speed by integration, cycle-rate sample entropy, and presence of a "knee" on the downward pulse slope characterize the wave shape differences between adult males and adult females. Supplemental Material https://doi.org/10.23641/asha.15057345.
Collapse
Affiliation(s)
- Rita R Patel
- Department of Speech, Language and Hearing Sciences, Indiana University Bloomington
| | - Sten Ternström
- Division of Speech, Music, and Hearing, School of Electrical Engineering and Computer Science, KTH Royal Institute of Technology, Stockholm, Sweden
| |
Collapse
|
5
|
Evaluation of the Electroglottographic Signal Variability in Organic and Functional Dysphonia. J Voice 2020; 36:881.e5-881.e16. [DOI: 10.1016/j.jvoice.2020.09.005] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2020] [Revised: 09/07/2020] [Accepted: 09/10/2020] [Indexed: 11/18/2022]
|
6
|
Electroglottography – An Update. J Voice 2020; 34:503-526. [DOI: 10.1016/j.jvoice.2018.12.014] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2018] [Revised: 12/27/2018] [Accepted: 12/28/2018] [Indexed: 11/21/2022]
|
7
|
Radhakrishnan NN. Nasal Resistance (NR) Technique: A Novel Approach to Improve Glottal Adduction. J Voice 2020; 36:91-97. [PMID: 32522383 DOI: 10.1016/j.jvoice.2020.04.020] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2020] [Revised: 04/23/2020] [Accepted: 04/28/2020] [Indexed: 11/26/2022]
Abstract
OBJECTIVE Several techniques have been practiced in the field of voice therapy to balance glottal pressures and improve glottal adduction. As such, Semioccluded Vocal Tract Exercises using narrow diameter straws have shown promise as a technique to increase glottal contact as reflected by Closed Quotients (CQ) measured from electroglottograph (EGG). This study explored the effect of nasal resistance (NR) technique, a manually induced NR approach, on glottal adduction in phononormal subjects as a possible new therapeutic technique to assist in glottal closure. SUBJECTS Thirty-four vocally healthy adult females qualified for this study. All subjects were reportedly and perceptually normal. METHODS Subjects were instructed to perform tasks including (a) steady modal phonation, (b) straw phonation, (c) humming, and (d) NR technique in a random order. EGG measures included calculation of the CQ and jitter during the mid-section of each stimuli for a minimum of 10 EGG cycles. One-way repeated measures ANOVAs and Bonferroni post-hoc comparisons were calculated to determine differences between each phonatory task for the outcome measures of CQ and jitter. RESULTS Analyses revealed that the mean CQ was significantly different in all the tasks performed with humming (33.99%) at the lowest and NR technique at highest (43.88%). Though the CQ of straw phonation (40.53%) was higher than modal steady phonation (36.65%), jitter was found to be significantly lower for NR technique (0.23%). Jitter during straw phonation (0.43%) did not differ significantly from modal phonation (0.44%). CONCLUSION These results provide preliminary evidence that NR technique can increase CQ with lower levels of jitter compared to straw phonation using a narrow diameter of 2 mm. Future studies should expand enrollment to men, larger age ranges, and patients with hypofunctional symptoms (eg, Parkinson disease) to evaluate the potential benefits of NR as a therapeutic technique.
Collapse
|
8
|
Ning LH. The effects of age and pitch level on electroglottographic measures during sustained phonation. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2019; 146:640. [PMID: 31370629 DOI: 10.1121/1.5119127] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/05/2019] [Accepted: 07/03/2019] [Indexed: 06/10/2023]
Abstract
The aim of the present study was to use electroglottography (EGG) to explore the effects of age and pitch level on sustained vowel phonation. Thirty female individuals (10 young, 10 middle-aged, and 10 older speakers) without voice disorders or training in singing participated in this study. Eight EGG parameters were measured during sustained vowel production with a high, mid, or low pitch: fundamental frequency, contact quotient, contacting-time quotient, decontacting-time quotient, speed quotient with a midslope criterion (SQ-mid), jitter, shimmer, and the harmonics-to-noise ratio. Age was found to be a significant factor in fundamental frequency, contact quotient, contacting-time quotient, decontacting-time quotient, and SQ-mid. With increasing age, the mean fundamental frequency decreased while the contact quotient increased. The middle-aged and older speakers had more asymmetrical vocal fold vibratory patterns than the young speakers. As for pitch level, the high pitch had a significantly less decontacting-time quotient and greater SQ-mid than low and mid pitches. The lack of significant interaction between age and pitch level indicates that the effects of age and pitch level could be additive. Finally, the discriminant analyses show that contact quotient is an important factor in predicting the age of a voice.
Collapse
Affiliation(s)
- Li-Hsin Ning
- Department of English, National Taiwan Normal University, 162 Heping East Road, Daan District, Taipei City 106, Taiwan
| |
Collapse
|
9
|
Nacci A, Romeo SO, Cavaliere MD, Macerata A, Bastiani L, Paludetti G, Galli J, Marchese MR, Barillari MR, Barillari U, Berrettini S, Laschi C, Cianchetti M, Manti M, Ursino F, Fattori B. Comparison of electroglottographic variability index in euphonic and pathological voice. ACTA ACUST UNITED AC 2019; 39:381-388. [PMID: 30745592 PMCID: PMC6966776 DOI: 10.14639/0392-100x-2127] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2018] [Accepted: 03/26/2018] [Indexed: 11/23/2022]
Abstract
In a recent study we introduced a new approach for analysis of the electroglottographic (ECG) signal. This method is based on the evaluation of variation of the EGG signal and its first derivative, through new software developed by the Pisan phoniatric school. This software is designed to extract quantitative indices related to the contacting and decontacting phases of the vocal folds during phonation. The software allows us to study the combined variability of vibration amplitude and velocity (i.e. the first derivative of the EGG signal). Pathological voices show a much more variable EGG signal compared to normal voices, since cordal vibration is made irregular due to the presence of glottis plane pathologies. With the aim of demonstrating the differences between normal and pathological voices relevant to combined vibration amplitude and velocity variability, we have introduced a new quantitative parameter named “variability index, VI”. We studied 95 subjects (35 normal and 60 with pathological voice); among pathologic subjects, 15 showed functional dysphonia and 45 showed organic dysphonia. Subjects affected by organic dysphonia presented: 15 bilateral vocal nodules, 15 unilateral polyps and 15 unilateral cysts. All subjects were studied with videolaryngostroboscopy; electro-acoustic parameters of the voice were analysed with the KayPENTAX CSL (Model 4500) system. The EGG signal was recorded using KAY Model 6103 connected to the CSL system. The new software for the analysis of the EGG signal allows us to obtain not only a VI total value relevant to variability during all the recording, but also partial VI values relevant to the different glottis cycle phases. In fact, plotting the amplitude variation and its first derivative on a Lissajous graph, it is possible to divide the whole glottis cycle into four phases (each represented by four quadrants on the graph): the initial vocal folds contacting activity (VI-Q1), the last phase of vocal folds contacting (VI-Q2), the first phase of vocal folds decontacting (VI-Q3) and the last phase, up to the complete decontacting of vocal folds (VI-Q4). For each quadrant, it is also possible to work out the percent variability index. By comparing the variability indices in the normal and pathological groups, we obtained the following results: the total VI was significantly higher in the pathological subjects (0.25 vs 0.18; p = 0.01); the absolute value of VI was higher in pathological subjects, although the difference was not significant (VI-Q2, 0.041 vs 0.029; VI-Q3, 0.065 vs 0.058; VI-Q4, 0.054 vs 0.052). The percent variability in the Q2 quadrant (VI-Q2%) was significantly higher in pathological subjects compared to normal subjects (0.22 vs 0.16) (p = 0.01). The results of this study confirm that our new software for analysis of EGG signal can distinguish normal voice from pathological voice based on the new quantitative parameter VI. Moreover, this study emphasises that the final contact phase of vocal folds is the most representative of the difference between the normal and pathological voice and shows a wider variability in terms of amplitude and vibration velocity. Further studies on larger groups of subjects will be required to confirm these results and assess differences in the EGG signal among the various vocal fold pathologies.
Collapse
Affiliation(s)
- A Nacci
- ENT, Audiology and Phoniatrics Unit, University Hospital of Pisa, Italy
| | - S O Romeo
- ENT, Audiology and Phoniatrics Unit, University Hospital of Pisa, Italy
| | - M D Cavaliere
- ENT, Audiology and Phoniatrics Unit, University Hospital of Pisa, Italy
| | - A Macerata
- Department of Clinical and Experimental Medicine, University of Pisa, Italy
| | - L Bastiani
- Institute of Clinical Physiology of the Italian National Research Council (IFC-CNR), Pisa, Italy
| | - G Paludetti
- Institute of Otorhinolaryngology, Department of Head and Neck Surgery, Fondazione Policlinico Universitario A. Gemelli IRCCS, Roma - Università Cattolica del Sacro Cuore, Rome, Italy
| | - J Galli
- Institute of Otorhinolaryngology, Department of Head and Neck Surgery, Fondazione Policlinico Universitario A. Gemelli IRCCS, Roma - Università Cattolica del Sacro Cuore, Rome, Italy
| | - M R Marchese
- Institute of Otorhinolaryngology, Department of Head and Neck Surgery, Fondazione Policlinico Universitario A. Gemelli IRCCS, Rome, Italy
| | - M R Barillari
- Division of Phoniatrics and Audiology, Department of Mental and Physical Health and Preventive Medicine, University of Campania "Luigi Vanvitelli", Naples, Italy
| | - U Barillari
- Division of Phoniatrics and Audiology, Department of Mental and Physical Health and Preventive Medicine, University of Campania "Luigi Vanvitelli", Naples, Italy
| | - S Berrettini
- ENT, Audiology and Phoniatrics Unit, University Hospital of Pisa, Italy.,Division of ENT Diseases, Karolinska Institutet, Stockholm, Sweden
| | - C Laschi
- The BioRobotics Institute, Scuola Superiore Sant'Anna, Pisa, Italy
| | - M Cianchetti
- The BioRobotics Institute, Scuola Superiore Sant'Anna, Pisa, Italy
| | - M Manti
- The BioRobotics Institute, Scuola Superiore Sant'Anna, Pisa, Italy
| | - F Ursino
- National Institute for Research in Phoniatrics, University of Pisa, Italy
| | - B Fattori
- ENT, Audiology and Phoniatrics Unit, University Hospital of Pisa, Italy
| |
Collapse
|
10
|
D'Amario S, Daffern H, Bailes F. A Longitudinal Study Investigating Synchronization in a Singing Quintet. J Voice 2018; 34:159.e1-159.e12. [PMID: 30104125 DOI: 10.1016/j.jvoice.2018.06.011] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2018] [Revised: 06/24/2018] [Accepted: 06/27/2018] [Indexed: 10/28/2022]
Abstract
Research suggests that synchronization between musicians during ensemble performances can be affected by the rhythmic or tonal complexity of the piece being performed and by group roles such as leader-follower relationships. Since previous studies have mostly been conducted within single performance sessions, developmental aspects of interpersonal synchronization in ensembles remain underinvestigated. This longitudinal study followed a newly formed singing ensemble from initial rehearsals to the performance stage in order to investigate the evolution of synchronization between advanced singing students during a university term of study in relation to the musical content of the piece and leader-follower relationships. An advanced postgraduate singing quintet was recorded using head-worn microphones and laryngograph electrodes to allow fundamental frequency evaluation of the individual voices. The quintet, formed to complete a 1-year Master's programme in ensemble singing, rehearsed two pieces composed for the study, during five rehearsals over 3 months. Singers practised the same pieces in a randomized order across rehearsals and performed three repetitions of the same pieces before and after each rehearsal, resulting in six recordings per piece/rehearsal. Audio and laryngograph data of the repeated performances were collected, and synchronization was measured by extracting note times from the fundamental frequency values. The asynchronies of the two pieces before and after rehearsals were calculated and compared both within rehearsals (pre and post) and between rehearsals (rehearsals 1-5). Results demonstrate an increase in the precision of synchronization over the course of study, depending on the piece being rehearsed, and a more variable synchronization for the more rhythmically complex piece. Results also show changes in the distribution of the tendency to precede all co-performers across rehearsals, which became equally distributed among the musicians during the last rehearsal. The results reported here could have important implications for the tailoring of rehearsal strategies that could improve interpersonal synchronization between musicians during ensemble performances.
Collapse
Affiliation(s)
- Sara D'Amario
- Department of Electronic Engineering, University of York, Heslington, York, United Kingdom.
| | - Helena Daffern
- Department of Electronic Engineering, University of York, Heslington, York, United Kingdom
| | - Freya Bailes
- School of Music, University of Leeds, Leeds, United Kingdom
| |
Collapse
|
11
|
D'Amario S, Daffern H, Bailes F. A new method of onset and offset detection in ensemble singing. LOGOP PHONIATR VOCO 2018; 44:143-158. [PMID: 29583026 DOI: 10.1080/14015439.2018.1452977] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
Abstract
This paper presents a novel method combining electrolaryngography and acoustic analysis to detect the onset and offset of phonation as well as the beginning and ending of notes within a sung legato phrase, through the application of a peak-picking algorithm, TIMEX. The evaluation of the method applied to a set of singing duo recordings shows an overall performance of 78% within a tolerance window of 50 ms compared with manual annotations performed by three experts. Results seem very promising in light of the state-of-the-art techniques presented at MIREX in 2016 yielding an overall performance of around 60%. The new method was applied to a pilot study with two duets to analyse synchronization between singers during ensemble performances. Results from this investigation demonstrate bidirectional temporal adaptations between performers, and suggest that the precision and consistency of synchronization, and the tendency to precede or lag a co-performer might be affected by visual contact between singers and leader-follower relationships. The outcomes of this paper promise to be beneficial for future investigations of synchronization in singing ensembles.
Collapse
Affiliation(s)
- Sara D'Amario
- Department of Electronic Engineering, University of York , York , UK
| | - Helena Daffern
- Department of Electronic Engineering, University of York , York , UK
| | - Freya Bailes
- School of Music, University of Leeds , Leeds , UK
| |
Collapse
|
12
|
Mizuta M, Kurita T, Dillon NP, Kimball EE, Garrett CG, Sivasankar MP, Webster RJ, Rousseau B. In vivo measurement of vocal fold surface resistance. Laryngoscope 2017; 127:E364-E370. [PMID: 28573762 PMCID: PMC5607073 DOI: 10.1002/lary.26715] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2017] [Revised: 04/23/2017] [Accepted: 05/03/2017] [Indexed: 11/07/2022]
Abstract
OBJECTIVES/HYPOTHESIS A custom-designed probe was developed to measure vocal fold surface resistance in vivo. The purpose of this study was to demonstrate proof of concept of using vocal fold surface resistance as a proxy of functional tissue integrity after acute phonotrauma using an animal model. STUDY DESIGN Prospective animal study. METHODS New Zealand White breeder rabbits received 120 minutes of airflow without vocal fold approximation (control) or 120 minutes of raised intensity phonation (experimental). The probe was inserted via laryngoscope and placed on the left vocal fold under endoscopic visualization. Vocal fold surface resistance of the middle one-third of the vocal fold was measured after 0 (baseline), 60, and 120 minutes of phonation. After the phonation procedure, the larynx was harvested and prepared for transmission electron microscopy. RESULTS In the control group, vocal fold surface resistance values remained stable across time points. In the experimental group, surface resistance (X% ± Y% relative to baseline) was significantly decreased after 120 minutes of raised intensity phonation. This was associated with structural changes using transmission electron microscopy, which revealed damage to the vocal fold epithelium after phonotrauma, including disruption of the epithelium and basement membrane, dilated paracellular spaces, and alterations to epithelial microprojections. In contrast, control vocal fold specimens showed well-preserved stratified squamous epithelia. CONCLUSIONS These data demonstrate the feasibility of measuring vocal fold surface resistance in vivo as a means of evaluating functional vocal fold epithelial barrier integrity. Device prototypes are in development for additional testing, validation, and for clinical applications in laryngology. LEVEL OF EVIDENCE NA Laryngoscope, 127:E364-E370, 2017.
Collapse
Affiliation(s)
- Masanobu Mizuta
- Department of Otolaryngology-Head and Neck Surgery, Graduate School of Medicine, Kyoto University, Kyoto, Japan
- Department of Otolaryngology, Vanderbilt University Medical Center, Nashville, TN†
| | - Takashi Kurita
- Department of Otolaryngology, Vanderbilt University Medical Center, Nashville, TN†
| | - Neal P. Dillon
- Department of Mechanical Engineering, Vanderbilt University, Nashville, TN
| | - Emily E. Kimball
- Department of Hearing and Speech Sciences, Vanderbilt University Medical Center, Nashville, TN
| | - C. Gaelyn Garrett
- Department of Otolaryngology, Vanderbilt University Medical Center, Nashville, TN†
| | - M. Preeti Sivasankar
- Department of Speech, Language, and Hearing Sciences, Purdue University, West Lafayette, IN
| | - Robert J. Webster
- Department of Otolaryngology, Vanderbilt University Medical Center, Nashville, TN†
- Department of Mechanical Engineering, Vanderbilt University, Nashville, TN
- Department of Electrical Engineering, Vanderbilt University, Nashville, TN
- Department of Neurological Surgery, Vanderbilt University Medical Center, Nashville, TN
- Department of Urologic Surgery, Vanderbilt University Medical Center, Nashville, TN
| | - Bernard Rousseau
- Department of Otolaryngology, Vanderbilt University Medical Center, Nashville, TN†
- Department of Mechanical Engineering, Vanderbilt University, Nashville, TN
- Department of Hearing and Speech Sciences, Vanderbilt University Medical Center, Nashville, TN
- Department of Speech, Language, and Hearing Sciences, Purdue University, West Lafayette, IN
| |
Collapse
|
13
|
Macerata A, Nacci A, Manti M, Cianchetti M, Matteucci J, Romeo SO, Fattori B, Berrettini S, Laschi C, Ursino F. Evaluation of the Electroglottographic signal variability by amplitude-speed combined analysis. Biomed Signal Process Control 2017. [DOI: 10.1016/j.bspc.2016.10.003] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
14
|
Mayr A. Investigating the Voce Faringea: Physiological and Acoustic Characteristics of the Bel Canto Tenor's Forgotten Singing Practice. J Voice 2017; 31:255.e13-255.e23. [PMID: 27430862 DOI: 10.1016/j.jvoice.2016.06.010] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2016] [Revised: 06/17/2016] [Accepted: 06/21/2016] [Indexed: 10/21/2022]
Abstract
Several historical sources from the first half of the 19th century mention a distinct third register mechanism particular to tenor voices of that period. This so-called voce faringea-often described as an "intermediate" register-is a virtually forgotten historical singing practice used to extend the upper range of the voice, where the singer modifies falsetto, typically a weak and often feminine sound, into a more powerful, tenor-like vocal quality. Based on an evaluation of historical voice register theories, training strategies, and the sound ideals of the historical period, an informed discussion of that technique is developed. For this study, acoustic and electroglottographic signals for tones produced on the vowel /a/ by a professional tenor/countertenor in different vocal register mechanisms-voce faringea, falsetto, chest register, and mezza voce-were recorded using the VoceVista system. Analysis of the electroglottography (EGG) and audio data revealed specific characteristics of the voce faringea with regard to both the laryngeal mechanism and the sound spectrum, including high EGG contact quotient and low speed quotient values. EGG pulses were skewed significantly to the left and displayed a distinct knee shape during the de-contacting phase of the vocal folds, which consequently indicates a vibration with a clear mucosal wave. The long-term average spectrum and power spectrum exposed a considerable amplification and dislocation of F2 in the direction of high frequencies, thus boosting the third harmonic and showing a strong concentration of acoustic energy in the area of the singer's formant cluster.
Collapse
Affiliation(s)
- Alexander Mayr
- University for Music and Performing Arts, Vienna, Austria.
| |
Collapse
|
15
|
Automated Electroglottographic Inflection Events Detection. A Pilot Study. J Voice 2016; 30:768.e1-768.e10. [DOI: 10.1016/j.jvoice.2015.10.020] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2015] [Accepted: 10/29/2015] [Indexed: 11/20/2022]
|
16
|
Relationship Between Acoustic Voice Onset and Offset and Selected Instances of Oscillatory Onset and Offset in Young Healthy Men and Women. J Voice 2016; 31:389.e9-389.e17. [PMID: 27769696 DOI: 10.1016/j.jvoice.2016.09.021] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2016] [Revised: 09/16/2016] [Accepted: 09/19/2016] [Indexed: 11/20/2022]
Abstract
OBJECTIVE This study aimed to investigate the relationship between (1) onset of the acoustic signal (X1a) and prephonatory phases associated with oscillatory onset and (2) offset of the acoustic signal (X2a) with the postphonatory events associated with oscillatory offset across vocally healthy adults. SUBJECTS AND METHODS High-speed videoendoscopy was captured simultaneously with the acoustic signal during repeated production of /hi.hi.hi/ at typical pitch and loudness from 56 vocally healthy adults (aged 20-42 years; 21 men, 35 women). The relationships between the acoustic sound pressure signal and oscillatory onset and offset events from the glottal area waveforms (GAWs) were statistically investigated using a multivariate linear regression analysis. RESULTS The X1a is a significant predictor of the onset of first oscillatory motion (X1g) and onset of sustained oscillations (X2g). X1a as well as gender are significant predictors of the first medial contact of the vocal folds (X1.5g). The X2a is a significant predictor of the first instance of oscillatory offset (X3g), first instance of incomplete glottal closure (X3.5g), and complete cessation of (vocal fold) oscillatory motion (X4g). CONCLUSIONS The acoustic signal onset is closely related to the X1.5g, but the latency between these events is longer for women compared to men. The X2a occurs immediately after incomplete glottal adduction. The emerging normative group latencies between the onset and offset of the acoustic and the GAW from this study appear promising for future investigations.
Collapse
|
17
|
Relationship of Various Open Quotients With Acoustic Property, Phonation Types, Fundamental Frequency, and Intensity. J Voice 2016; 30:145-57. [DOI: 10.1016/j.jvoice.2015.01.009] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2014] [Accepted: 01/30/2015] [Indexed: 10/23/2022]
|
18
|
Hampala V, Garcia M, Švec JG, Scherer RC, Herbst CT. Relationship Between the Electroglottographic Signal and Vocal Fold Contact Area. J Voice 2016; 30:161-71. [DOI: 10.1016/j.jvoice.2015.03.018] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2015] [Accepted: 03/30/2015] [Indexed: 11/24/2022]
|
19
|
Somanath K, Mau T. A Measure of the Auditory-perceptual Quality of Strain from Electroglottographic Analysis of Continuous Dysphonic Speech: Application to Adductor Spasmodic Dysphonia. J Voice 2016; 30:770.e9-770.e21. [PMID: 26739857 DOI: 10.1016/j.jvoice.2015.11.005] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2015] [Accepted: 11/05/2015] [Indexed: 11/18/2022]
Abstract
OBJECTIVES (1) To develop an automated algorithm to analyze electroglottographic (EGG) signal in continuous dysphonic speech, and (2) to identify EGG waveform parameters that correlate with the auditory-perceptual quality of strain in the speech of patients with adductor spasmodic dysphonia (ADSD). STUDY DESIGN Software development with application in a prospective controlled study. METHODS EGG was recorded from 12 normal speakers and 12 subjects with ADSD reading excerpts from the Rainbow Passage. Data were processed by a new algorithm developed with the specific goal of analyzing continuous dysphonic speech. The contact quotient, pulse width, a new parameter peak skew, and various contact closing slope quotient and contact opening slope quotient measures were extracted. EGG parameters were compared between normal and ADSD speech. Within the ADSD group, intra-subject comparison was also made between perceptually strained syllables and unstrained syllables. RESULTS The opening slope quotient SO7525 distinguished strained syllables from unstrained syllables in continuous speech within individual subjects with ADSD. The standard deviations, but not the means, of contact quotient, EGGW50, peak skew, and SO7525 were different between normal and ADSD speakers. The strain-stress pattern in continuous speech can be visualized as color gradients based on the variation of EGG parameter values. CONCLUSIONS EGG parameters may provide a within-subject measure of vocal strain and serve as a marker for treatment response. The addition of EGG to multidimensional assessment may lead to improved characterization of the voice disturbance in ADSD.
Collapse
Affiliation(s)
- Keerthan Somanath
- Clinical Center for Voice Care, Department of Otolaryngology-Head and Neck Surgery, University of Texas Southwestern Medical Center, Dallas, Texas
| | - Ted Mau
- Clinical Center for Voice Care, Department of Otolaryngology-Head and Neck Surgery, University of Texas Southwestern Medical Center, Dallas, Texas..
| |
Collapse
|
20
|
Hohm J, Döllinger M, Bohr C, Kniesburges S, Ziethe A. Influence of F0 and Sequence Length of Audio and Electroglottographic Signals on Perturbation Measures for Voice Assessment. J Voice 2015; 29:517.e11-21. [PMID: 25944290 DOI: 10.1016/j.jvoice.2014.10.001] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2014] [Accepted: 10/01/2014] [Indexed: 12/01/2022]
Abstract
OBJECTIVE Within the functional assessment of voice disorders, an objective analysis of measured parameters from audio, electroglottographic (EGG), or visual signals is desired. In a typical clinical situation, reliable objective analysis is not always possible due to missing standardization and unknown stability of the clinical parameters. The aim of this study was to investigate the robustness/stability of measured clinical parameters of the audio and EGG signals in a typical clinical setting to ensure a reliable objective analysis. In particular, the influence of F0 and of the sequence length on several definitions of jitter and shimmer will be analyzed. PATIENTS AND METHODS Seventy-four young healthy women produced a sustained vowel /a/ and an upward triad with abrupt changeovers. Different sequence lengths (100, 150, 500, and 1000 ms) of sustained phonation and triads (100 and 150 ms) were extracted from the audio and EGG signals. In total, six variations of jitter and four variations of shimmer parameters were analyzed. RESULTS Jitter%, Jitter11p, and JitterPPQ of the audio signal as well as Jittermean, Shimmer, and Shimmer11p of the EGG signal are unaffected by both sequence length and F0. CONCLUSIONS Influence of F0 and sequence length on several perturbation measures of the audio and EGG signals was identified. For an objective clinical voice assessment, unaffected definitions of jitter and shimmer should be preferred and applied to enable comparability between different recordings, examinations, and studies.
Collapse
Affiliation(s)
- Julian Hohm
- Department for Phoniatrics and Pediatric Audiology, University Hospital Erlangen, Erlangen, Germany
| | - Michael Döllinger
- Department for Phoniatrics and Pediatric Audiology, University Hospital Erlangen, Erlangen, Germany
| | - Christopher Bohr
- Department for Phoniatrics and Pediatric Audiology, University Hospital Erlangen, Erlangen, Germany
| | - Stefan Kniesburges
- Department for Phoniatrics and Pediatric Audiology, University Hospital Erlangen, Erlangen, Germany
| | - Anke Ziethe
- Department for Phoniatrics and Pediatric Audiology, University Hospital Erlangen, Erlangen, Germany.
| |
Collapse
|
21
|
Tang S, Zhang C, Wang S, Wan M. A Preliminary Study for a Slantwise-Placed Electroglottography. J Voice 2015; 29:129.e19-27. [DOI: 10.1016/j.jvoice.2014.07.002] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2014] [Accepted: 07/08/2014] [Indexed: 10/24/2022]
|
22
|
Huber S, Roebel A. On the use of voice descriptors for glottal source shape parameter estimation. COMPUT SPEECH LANG 2014. [DOI: 10.1016/j.csl.2013.09.006] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
23
|
Waaramaa T, Palo P, Kankare E. Emotions in freely varying and mono-pitched vowels, acoustic and EGG analyses. LOGOP PHONIATR VOCO 2014; 40:156-70. [PMID: 24998780 DOI: 10.3109/14015439.2014.934277] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
Vocal emotions are expressed either by speech or singing. The difference is that in singing the pitch is predetermined while in speech it may vary freely. It was of interest to study whether there were voice quality differences between freely varying and mono-pitched vowels expressed by professional actors. Given their profession, actors have to be able to express emotions both by speech and singing. Electroglottogram and acoustic analyses of emotional utterances embedded in expressions of freely varying vowels [a:], [i:], [u:] (96 samples) and mono-pitched protracted vowels (96 samples) were studied. Contact quotient (CQEGG) was calculated using 35%, 55%, and 80% threshold levels. Three different threshold levels were used in order to evaluate their effects on emotions. Genders were studied separately. The results suggested significant gender differences for CQEGG 80% threshold level. SPL, CQEGG, and F4 were used to convey emotions, but to a lesser degree, when F0 was predetermined. Moreover, females showed fewer significant variations than males. Both genders used more hypofunctional phonation type in mono-pitched utterances than in the expressions with freely varying pitch. The present material warrants further study of the interplay between CQEGG threshold levels and formant frequencies, and listening tests to investigate the perceptual value of the mono-pitched vowels in the communication of emotions.
Collapse
Affiliation(s)
- Teija Waaramaa
- a University of Tampere, School of Communication, Media and Theatre , Tampere , Finland
| | - Pertti Palo
- b Clinical Audiology, Speech and Language (CASL) Research Centre, Queen Margaret University , UK
| | - Elina Kankare
- c University of Tampere, Speech and Voice Research Laboratory, School of Education , Tampere , Finland.,d Ear and Oral Diseases, Department of Phoniatrics in Tampere University Hospital , Tampere , Finland
| |
Collapse
|
24
|
Lien YAS, Stepp CE. Comparison of voice relative fundamental frequency estimates derived from an accelerometer signal and low-pass filtered and unprocessed microphone signals. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2014; 135:2977-85. [PMID: 24815277 PMCID: PMC4032403 DOI: 10.1121/1.4870488] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/12/2023]
Abstract
The relative fundamental frequency (RFF) surrounding the production of a voiceless consonant has previously been estimated using unprocessed and low-pass filtered microphone signals, but it can also be estimated using a neck-placed accelerometer signal that is less affected by vocal tract formants. Determining the effects of signal type on RFF will allow for comparisons across studies and aid in establishing a standard protocol with minimal within-speaker variability. Here RFF was estimated in 12 speakers with healthy voices using unprocessed microphone, low-pass filtered microphone, and unprocessed accelerometer signals. Unprocessed microphone and accelerometer signals were recorded simultaneously using a microphone and neck-placed accelerometer. The unprocessed microphone signal was filtered at 350 Hz to construct the low-pass filtered microphone signal. Analyses of variance showed that signal type and the interaction of vocal cycle × signal type had significant effects on both RFF means and standard deviations, but with small effect sizes. The overall RFF trend was preserved regardless of signal type and the intra-speaker variability of RFF was similar among the signal types. Thus, RFF can be estimated using either a microphone or an accelerometer signal in individuals with healthy voices. Future work extending these findings to individuals with disordered voices is warranted.
Collapse
Affiliation(s)
- Yu-An S Lien
- Department of Biomedical Engineering, Boston University, Boston, Massachusetts 02215
| | - Cara E Stepp
- Departments of Speech, Language, and Hearing Sciences and Biomedical Engineering, Boston University, Boston, Massachusetts 02215
| |
Collapse
|
25
|
Tsanas A, Zañartu M, Little MA, Fox C, Ramig LO, Clifford GD. Robust fundamental frequency estimation in sustained vowels: detailed algorithmic comparisons and information fusion with adaptive Kalman filtering. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2014; 135:2885-901. [PMID: 24815269 PMCID: PMC4032429 DOI: 10.1121/1.4870484] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/15/2023]
Abstract
There has been consistent interest among speech signal processing researchers in the accurate estimation of the fundamental frequency (F(0)) of speech signals. This study examines ten F(0) estimation algorithms (some well-established and some proposed more recently) to determine which of these algorithms is, on average, better able to estimate F(0) in the sustained vowel /a/. Moreover, a robust method for adaptively weighting the estimates of individual F(0) estimation algorithms based on quality and performance measures is proposed, using an adaptive Kalman filter (KF) framework. The accuracy of the algorithms is validated using (a) a database of 117 synthetic realistic phonations obtained using a sophisticated physiological model of speech production and (b) a database of 65 recordings of human phonations where the glottal cycles are calculated from electroglottograph signals. On average, the sawtooth waveform inspired pitch estimator and the nearly defect-free algorithms provided the best individual F(0) estimates, and the proposed KF approach resulted in a ∼16% improvement in accuracy over the best single F(0) estimation algorithm. These findings may be useful in speech signal processing applications where sustained vowels are used to assess vocal quality, when very accurate F(0) estimation is required.
Collapse
Affiliation(s)
- Athanasios Tsanas
- Institute of Biomedical Engineering, Department of Engineering Science, Old Road Campus Research Building, University of Oxford, Headington, Oxford OX3 7DQ, United Kingdom
| | - Matías Zañartu
- Department of Electronic Engineering at Universidad Técnica Federico Santa María, Av. España 1680, Casilla 110-V, Valparaiso 2390123, Chile
| | - Max A Little
- MIT Media Lab, 77 Massachusetts Avenue, E14/E15, Cambridge, Massachusetts 02139-4307
| | - Cynthia Fox
- National Center for Voice and Speech, 136 South Main Street, Suite 320, Salt Lake City, Utah 84101-1623
| | - Lorraine O Ramig
- Speech, Language, and Hearing Sciences, 2501 Kittredge Loop Road, 409 UCB, University of Colorado, Boulder, Colorado 80309-0409
| | - Gari D Clifford
- Institute of Biomedical Engineering, Department of Engineering Science, Old Road Campus Research Building, University of Oxford, Headington, Oxford OX3 7DQ, United Kingdom
| |
Collapse
|
26
|
|
27
|
Gaskill CS, Quinney DM. The Effect of Resonance Tubes on Glottal Contact Quotient With and Without Task Instruction: A Comparison of Trained and Untrained Voices. J Voice 2012; 26:e79-93. [DOI: 10.1016/j.jvoice.2011.03.003] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2010] [Accepted: 03/10/2011] [Indexed: 10/18/2022]
|
28
|
Kankare E, Laukkanen AM, Ilomäki I, Miettinen A, Pylkkänen T. Electroglottographic contact quotient in different phonation types using different amplitude threshold levels. LOGOP PHONIATR VOCO 2012; 37:127-32. [DOI: 10.3109/14015439.2012.664656] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
|
29
|
Kankare E, Laukkanen AM. Quasi-output-cost-ratio, perceived voice quality, and subjective evaluation in female kindergarten teachers. LOGOP PHONIATR VOCO 2012; 37:62-8. [DOI: 10.3109/14015439.2012.660500] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
|
30
|
Bhandari A, Izdebski K, Huang C, Yan Y. Comparative analysis of normal voice characteristics using simultaneous electroglottography and high speed digital imaging. Biomed Signal Process Control 2012. [DOI: 10.1016/j.bspc.2011.06.006] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
31
|
Hosokawa K, Yoshida M, Yoshii T, Takenaka Y, Hashimoto M, Ogawa M, Inohara H. Effectiveness of the Computed Analysis of Electroglottographic Signals in Muscle Tension Dysphonia. Folia Phoniatr Logop 2012; 64:145-50. [PMID: 22965009 DOI: 10.1159/000342146] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Affiliation(s)
- K Hosokawa
- Department of Otorhinolaryngology, Head and Neck Surgery, Osaka University Graduate School of Medicine, Toyonaka-Heisei Hospital, Osaka, Japan.
| | | | | | | | | | | | | |
Collapse
|
32
|
Herbst CT, Fitch WTS, Svec JG. Electroglottographic wavegrams: a technique for visualizing vocal fold dynamics noninvasively. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2010; 128:3070-3078. [PMID: 21110602 DOI: 10.1121/1.3493423] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/30/2023]
Abstract
A method for analyzing and displaying electroglottographic (EGG) signals (and their first derivative, DEGG) is introduced: the electroglottographic wavegram ("wavegram" hereafter). To construct a wavegram, the time-varying fundamental frequency is measured and consecutive individual glottal cycles are identified. Each cycle is locally normalized in duration and amplitude, the signal values are encoded by color intensity and the cycles are concatenated to display the entire voice sample in a single image, similar as in sound spectrography. The wavegram provides an intuitive means for quickly assessing vocal fold contact phenomena and their variation over time. Variations in vocal fold contact appear here as a sequence of events rather than single phenomena, taking place over a certain period of time, and changing with pitch, loudness and register. Multiple DEGG peaks are revealed in wavegrams to behave systematically, indicating subtle changes of vocal fold oscillatory regime. As such, EGG wavegrams promise to reveal more information on vocal fold contacting and de-contacting events than previous methods.
Collapse
Affiliation(s)
- Christian T Herbst
- Laboratory of Biophysics, Department of Experimental Physics, Faculty of Science, Palacký University Olomouc, tř 17 listopadu 12, 771 46 Olomouc, Czech Republic.
| | | | | |
Collapse
|
33
|
Mooshammer C. Acoustic and laryngographic measures of the laryngeal reflexes of linguistic prominence and vocal effort in German. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2010; 127:1047-58. [PMID: 20136226 PMCID: PMC2830266 DOI: 10.1121/1.3277160] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/16/2023]
Abstract
This study uses acoustic and physiological measures to compare laryngeal reflexes of global changes in vocal effort to the effects of modulating such aspects of linguistic prominence as sentence accent, induced by focus variation, and word stress. Seven speakers were recorded by using a laryngograph. The laryngographic pulses were preprocessed to normalize time and amplitude. The laryngographic pulse shape was quantified using open and skewness quotients and also by applying a functional version of the principal component analysis. Acoustic measures included the acoustic open quotient and spectral balance in the vowel /e/ during the test syllable. The open quotient and the laryngographic pulse shape indicated a significantly shorter open phase for loud speech than for soft speech. Similar results were found for lexical stress, suggesting that lexical stress and loud speech are produced with a similar voice source mechanism. Stressed syllables were distinguished from unstressed syllables by their open phase and pulse shape, even in the absence of sentence accent. Evidence for laryngeal involvement in signaling focus, independent of fundamental frequency changes, was not as consistent across speakers. Acoustic results on various spectral balance measures were generally much less consistent compared to results from laryngographic data.
Collapse
Affiliation(s)
- Christine Mooshammer
- Haskins Laboratories, 300 George Street, Suite 900, New Haven, Connecticut 06511, USA.
| |
Collapse
|
34
|
The Effect of an Artificially Lengthened Vocal Tract on Estimated Glottal Contact Quotient in Untrained Male Voices. J Voice 2010; 24:57-71. [DOI: 10.1016/j.jvoice.2008.05.004] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2007] [Accepted: 05/09/2008] [Indexed: 11/18/2022]
|
35
|
Konstantopoulos K, Vikelis M, Seikel JA, Mitsikostas DD. The existence of phonatory instability in multiple sclerosis: an acoustic and electroglottographic study. Neurol Sci 2009; 31:259-68. [DOI: 10.1007/s10072-009-0170-3] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2008] [Accepted: 10/07/2009] [Indexed: 10/20/2022]
|
36
|
|
37
|
Herbst C, Ternström S. A comparison of different methods to measure the EGG contact quotient. LOGOP PHONIATR VOCO 2009; 31:126-38. [PMID: 16966155 DOI: 10.1080/14015430500376580] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
Abstract
The results from six published electroglottographic (EGG-based) methods for calculating the EGG contact quotient (CQEGG) were compared to closed quotients derived from simultaneous videokymographic imaging (CQKYM). Two trained male singers phonated in falsetto and in chest register, with two degrees of adduction in both registers. The maximum difference between methods in the CQEGG was 0.3 (out of 1.0). The CQEGG was generally lower than the CQKYM. Within subjects, the CQEGG co-varied with the CQkym, but with changing offsets depending on method. The CQEGG cannot be calculated for falsetto phonation with little adduction, since there is no complete glottal closure. Basic criterion-level methods with thresholds of 0.2 or 0.25 gave the best match to the CQKYM data. The results suggest that contacting and de-contacting in the EGG might not refer to the same physical events as do the beginning and cessation of airflow.
Collapse
|
38
|
Laukkanen AM, Pulakka H, Alku P, Vilkman E, Hertegård S, Lindestad PÅ, Larsson H, Granqvist S. High-speed registration of phonation-related glottal area variation during artificial lengthening of the vocal tract. LOGOP PHONIATR VOCO 2009; 32:157-64. [DOI: 10.1080/14015430701547013] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
|
39
|
Gaskill CS, Erickson ML. The Effect of a Voiced Lip Trill on Estimated Glottal Closed Quotient. J Voice 2008; 22:634-43. [PMID: 17574810 DOI: 10.1016/j.jvoice.2007.03.012] [Citation(s) in RCA: 47] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2005] [Accepted: 03/30/2007] [Indexed: 10/23/2022]
Abstract
The use of lip trills has been advocated for both vocal habilitation and rehabilitation. A voiced lip trill requires continuous vibration of the lips while simultaneously maintaining phonation. The mechanism of any effects of a lip trill on vocal fold vibration is still unknown. While other techniques that either constrict or artificially lengthen the vocal tract have been investigated, no studies thus far have systematically examined the effect of lip trills on vocal fold vibration. Classically trained singers and vocally untrained participants produced a lip trill for approximately 1 minute, and vocal fold closed quotient (CQ) was calculated both during the lip trill and on a sustained spoken vowel before and after the trill. Data are reported for both a group design and a single-subject design. Most participants showed a tendency for a reduction in CQ during the lip trill, with a more pronounced change in the untrained participants.
Collapse
Affiliation(s)
- Christopher S Gaskill
- Department of Communicative Disorders, The University of Alabama, Tuscaloosa, AL 35487, USA.
| | | |
Collapse
|
40
|
Zagólski O. Electroglottography in elderly patients with vocal-fold palsy. J Voice 2008; 23:567-71. [PMID: 18504110 DOI: 10.1016/j.jvoice.2008.01.014] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2007] [Accepted: 01/25/2008] [Indexed: 11/18/2022]
Abstract
The incidence of dysphonia in healthy elderly people is high. In individuals with iatrogenic vocal-fold paresis following thyroid surgery, serious aggravation of vocal skills contributes to impaired quality of life and requires proper management. Electroglottography is a common method for providing noninvasive measurements of glottal activity, yielding reliable indicators of glottal closing instants. The purpose of the study was to determine how electroglottography measures change with voice recovery in elderly speakers with vocal-fold palsy, compared with healthy elderly individuals, and which coefficient best represents dysphonia. An electroglottograph with Speech Studio 1.04 software was used to record and analyze the data. Electroglottography data were collected from 12 patients aged 65-78 years (mean=71.3, S.D.=3.8, median=71) and 10 healthy speakers aged 65-77 years (mean=70.9, S.D.=3.9, median=72). The findings show that the distribution of values of % irregularity differs between the groups of patients and controls. % Irregularity and closing quotient significantly correlate with the perceptual degree of dysphonia. Electroglottography can objectify dysphonia in elderly patients with vocal-fold palsy and is a suitable noninvasive tool for tracking the elderly patients' long-term progress. % Irregularity best represents the vocal-fold dysfunction in elderly patients with a vocal-fold palsy.
Collapse
Affiliation(s)
- Olaf Zagólski
- Diagnostic and Therapeutic Medical Centre Medicina, Kraków, Poland.
| |
Collapse
|
41
|
Lin E, Hwang TZ, Hornibrook J, Ormond T. Voice of Postradiotherapy Nasopharyngeal Carcinoma Patients: Evidence of Vocal Tract Effect. J Voice 2008; 22:351-64. [PMID: 17134872 DOI: 10.1016/j.jvoice.2006.09.011] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2006] [Indexed: 11/22/2022]
Abstract
This study was aimed at identifying acoustic and physiological measures useful for monitoring voice changes in postnasopharyngeal patients with nonlaryngeal malignancies, and providing evidences of vocal tract effect on voice through comparisons between individuals with and without intact vocal tract. Simultaneous acoustic-electroglottographic signals recorded during phonation of vowels /i/ and /a/ sustained at habitual, high, and low pitch levels were compared among 10 postradiotherapy patients with nasopharyngeal carcinoma (NPC), 10 voice patients (VPs) with intact vocal tract, and 10 healthy individuals with normal voice (NORM). Results from a series of discriminant analyses revealed that the NPC group generally exhibited lower signal-to-noise (SNR) and open quotient (OQ) and higher Formant 1 frequency (F(1)) and speed quotient (SQ) than the NORM group. Unlike both VP and NORM groups, the NPC group failed to show a pitch effect on all voice measures, including OQ, SQ, percent jitter, percent shimmer, and SNR, suggesting an effect of radiotherapy and/or vocal tract on laryngeal behaviors. For the vowel /i/, on the other hand, only the NPC and NORM groups showed a pattern of pitch-dependent F(1) raising, a reflection of increased pharyngeal narrowing. These findings suggested that the pitch effect on laryngeal behaviors differed not only between individuals with intact vocal tract and those without but also between those with structural and dynamic changes of vocal tract.
Collapse
Affiliation(s)
- Emily Lin
- University of Canterbury, Communication Disorders, Christchurch, New Zealand.
| | | | | | | |
Collapse
|
42
|
Michaud A. Final consonants and glottalization: new perspectives from Hanoi Vietnamese. PHONETICA 2004; 61:119-146. [PMID: 15662108 DOI: 10.1159/000082560] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
The evolution from final obstruents to final glottal stop and then to rhyme glottalization (i.e. from /at/ to /a?/, then to /a'/) is a well-established general trend in the history of the Sino-Tibetan language family and beyond. It has further been shown by laryngoscopy that in three languages which retain the nonreleased syllable-final obstruents /p/, /t/ and /k/ (Standard Thai, and two Chinese dialects), these obstruents are often accompanied by a glottal stop. The present research raises the issue whether there is another typological possibility: can nonreleased final obstruents be accompanied consistently by modal phonation, without glottal stop? Analysis of electroglottographic recordings of 126 syllables in two carrier sentences spoken by 4 speakers shows that, in Hanoi Vietnamese, the final obstruents /p/, /t/ and /k/ are not accompanied by glottalization, and that the open quotient increases in the course of the syllable rhyme. Obstruent-final rhymes (which may carry either of two tones: D1 or D2) are compared with nasal-final rhymes which, under one of the tones (tone B2), are confirmed to be glottalized. Our finding is that tones D1 and D2 (i.e. obstruent-final rhymes) are both produced in modal voice, which shows that the typological paradigm of observed realizations of syllable-final obstruents must be enlarged. The discussion puts forward the hypothesis that the unusual association of segments and voice quality found in Hanoi Vietnamese is a strategy to maintain the opposition between B2-tone and D2-tone rhymes.
Collapse
Affiliation(s)
- Alexis Michaud
- Laboratoire Phonétique et Phonologie (UMR 7018) CNRS/Sorbonne Nouvelle, Paris, France.
| |
Collapse
|
43
|
Henrich N, d'Alessandro C, Doval B, Castellengo M. On the use of the derivative of electroglottographic signals for characterization of nonpathological phonation. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2004; 115:1321-1332. [PMID: 15058354 DOI: 10.1121/1.1646401] [Citation(s) in RCA: 92] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
Electroglottography is a common method for providing noninvasive measurements of glottal activity. The derivative of the electroglottographic signal, however, has not attracted much attention, although it yields reliable indicators of glottal closing instants. The purpose of this paper is to provide a guide to the usefulness of this signal. The main features that are to be found in this signal are presented on the basis of an extensive analysis of a database of items sung by 18 trained singers. Glottal opening and closing instants are related to peaks in the signal; the latter can be used to measure glottal parameters such as fundamental frequency and open quotient. In some cases, peaks are doubled or imprecise, which points to special (but by no means uncommon) glottal configurations. A correlation-based algorithm for the automatic measurement of fundamental frequency and open quotient using the derivative of electroglottographic signals is proposed. It is compared to three other electroglottographic-based methods with regard to the measurement of open quotient in inverse-filtered derived glottal flow. It is shown that agreement with the glottal-flow measurements is much better than most threshold-based measurements in the case of sustained sounds.
Collapse
Affiliation(s)
- Nathalie Henrich
- LAM (UPMC, CNRS, Ministère de la culture), 11 rue de Lourmel, 75015 Paris, France.
| | | | | | | |
Collapse
|
44
|
Kochanski G, Shih C. A quasi-glottogram signal. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2003; 114:2206-2216. [PMID: 14587618 DOI: 10.1121/1.1608964] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
A novel, noninvasive experiment is proposed that reliably shows the strength of glottal oscillations. The quasi-glottogram (QGG) signal is generated from a microphone array that is trained to approximate the electroglottogram signal. The QGG may be useful to improve estimates of whether speech is voiced, to quantify partial voicing, and to reduce the phoneme effect when measuring the amplitude of speech signals. The technique is well adapted to the generation of text-to-speech systems, as it allows an estimate of the glottal flow during undisturbed, natural speech. For prosody studies, it can be used to provide an estimate of amplitude which is relatively unaffected by changes in phonemes, and is at least as reliable as standard estimators of amplitude.
Collapse
Affiliation(s)
- Greg Kochanski
- Oxford University, Phonetics Laboratory, Oxford OX1 2JF, United Kingdom.
| | | |
Collapse
|
45
|
Cahill LM, Murdoch BE, Theodoros DG. Perceptual and instrumental analysis of laryngeal function after traumatic brain injury in childhood. J Head Trauma Rehabil 2003; 18:268-83. [PMID: 12802169 DOI: 10.1097/00001199-200305000-00005] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
OBJECTIVE To investigate laryngeal function and phonatory disturbance in children with traumatic brain injury (TBI), using both perceptual and instrumental techniques. DESIGN AND PARTICIPANTS The performance of 16 individuals with moderate to severe TBI acquired in childhood and 16 nonneurologically impaired control subjects was compared on a battery of perceptual (Frenchay Dysarthria Assessment, speech sample analysis) and instrumental (Aerophone II, laryngograph) assessments. RESULTS AND CONCLUSIONS As a group, the children with TBI demonstrated normal, or only minimally impaired laryngeal function, when compared with the control group, which contrasts with the significant laryngeal impairment noted in adults after TBI. Several reasons for the different findings in relation to laryngeal function in adults and children after TBI are postulated: (1) differing types of injury usually incurred by adults and children may result in a relatively decreased degree of neurologic impairment in these children, (2) differences in recovery potential between adults and children, and (3) the pediatric larynx is still developing, hence it may be better able to compensate for any impairment incurred.
Collapse
Affiliation(s)
- Louise M Cahill
- Department of Speech Pathology and Audiology, The University of Queensland, Brisbane, Queensland, Australia
| | | | | |
Collapse
|
46
|
Zagólski O, Carlson E. Electroglottographic measurements of glottal function in vocal fold paralysis in women. CLINICAL OTOLARYNGOLOGY AND ALLIED SCIENCES 2002; 27:246-53. [PMID: 12169125 DOI: 10.1046/j.1365-2273.2002.00571.x] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
The purpose of the study was to determine how electroglottographic measures change with voice recovery in women with vocal fold palsy compared with healthy speakers, and which coefficient best represents dysphonia. An electroglottograph with speech studio 1.04 software was used to record and analyse the data. Electroglottographic data were collected from 22 female patients and 16 healthy female speakers. Our findings show that the mean values of open quotient (OpQ), closing quotient (ClQ), % Irregularity and contact quotient (Qx) differ between the groups of patients and controls. % Irregularity, Qx and OpQ significantly correlated with the perceptual degree of dysphonia. Electroglottographic recordings can objectify dysphonia in patients with vocal fold palsy. Electroglottography is a suitable non-invasive tool for tracking the patients' long-term progress. % Irregularity best represents the vocal fold dysfunction in patients with a vocal fold palsy.
Collapse
Affiliation(s)
- O Zagólski
- Department of Otolaryngology, Jagiellonian University, Kraków, Poland.
| | | |
Collapse
|
47
|
Higgins MB, Schulte L. Gender differences in vocal fold contact computed from electroglottographic signals: the influence of measurement criteria. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2002; 111:1865-1871. [PMID: 12002870 DOI: 10.1121/1.1456517] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
EGGW is a phonatory parameter that can be derived from electroglottographic (EGG) signals and used to infer the relative degree of vocal fold contact. Vocal fold models predict that men will exhibit medial bulging of their vocal folds during phonation but women will not. These models lead us to expect gender differences in the magnitude of EGGW. Nevertheless, significant gender differences in EGGW for adults with normal voices have not been documented in previous studies when EGGW was computed from criterion lines placed at 25%-40% of the amplitude of the uninverted EGG wave form. We hypothesized that EGGW would better reflect gender differences in vocal fold adductory patterns if EGGW was computed from portions of the wave form that were associated with more vocal fold contact. EGGW was measured for seven men and seven women with normal voices. When EGGW was computed from segments of the wave form that were associated with relatively greater vocal fold contact (i.e., using criterion levels of > or = 55%), findings were consistent with the gender-specific adductory patterns that have been proposed from vocal fold models. Guidelines for appropriate placement of criterion lines when computing EGGW are discussed.
Collapse
|
48
|
Vieira MN, McInnes FR, Jack MA. On the influence of laryngeal pathologies on acoustic and electroglottographic jitter measures. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2002; 111:1045-1055. [PMID: 11863161 DOI: 10.1121/1.1430686] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
This study compared acoustic and electroglottographic (EGG) jitter from [a] vowels of 103 dysphonic speakers. The EGG recordings were chosen according to their intensity, signal-to-noise ratio, and percentage of unvoiced intervals, while acoustic signals were selected based on voicing detection and the reliability of jitter extraction. The agreement between jitter measures was expressed numerically as a normalized difference. In 63.1% (65/103) of the cases the differences fell within +/-22.5%. Positive differences above +22.5% were associated with increased acoustic jitter and occurred in 12.6% (13/103) of the speakers. These were, typically, cases of small nodular lesions without problems in the posterior larynx. On the other hand, substantial rises in EGG jitter leading to differences below -22.5% took place in 24.3% (25/103) of the speakers and were related to hyperfunctional voices, creaky-like voices, small laryngeal asymmetries affecting the arytenoids, or small-to-moderate glottal chinks. A clinically relevant outcome of the study was the possibility of detecting gentle laryngeal asymmetries among cases of large unilateral increase in EGG jitter. These asymmetries can be linked with vocal problems that are often overlooked in endoscopic examinations.
Collapse
Affiliation(s)
- Maurílio N Vieira
- Departamento de Física/ICEx, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil.
| | | | | |
Collapse
|
49
|
Abstract
The purpose of the experiment was to explore relations between jaw and laryngeal functions. The general question was whether laryngeal adduction was affected by jaw opening or by jaw biting. Twelve untrained, vocally healthy male and female adults participated as subjects. Subjects produced repeated tokens of /uh/ in each of 12 experimental conditions involving combinations of 3 jaw openings (10 mm, 25 mm, 40 mm), 2 jaw biting pressures (10 kPa, 200 kPa), and 2 fundamental frequencies (conversational and high). For each token, laryngeal adduction was estimated from the electroglottographic closed quotient. The most straightforward results were that (1) laryngeal adduction increased as jaw opening increased at the conversational pitch, for all subjects, independent of biting pressure, and (2) laryngeal adduction increased as biting pressure increased, at the conversational pitch, for males, independent of jaw opening. Other relations between estimated laryngeal adduction and jaw manipulations were more complex, varying with fundamental frequency and gender. Speculations are made about possible biomechanical and neurological explanations for the findings.
Collapse
Affiliation(s)
- S Cookman
- The University of Iowa Department of Speech Pathology and Audiology, and National Center for Voice and Speech, Iowa City, USA
| | | |
Collapse
|
50
|
Higgins MB, Netsell R, Schulte L. Vowel-related differences in laryngeal articulatory and phonatory function. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 1998; 41:712-724. [PMID: 9712121 DOI: 10.1044/jslhr.4104.712] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/22/2023]
Abstract
The purpose of this investigation was to study the interaction between the supralaryngeal and laryngeal components of the speech mechanism by examining vowel-related effects for a variety of vocal fold articulatory and phonatory measures. Secondary issues were to determine if vowel-related differences were influenced by the nature of the speaking task or gender. Between-vowel differences in estimated subglottal air pressure, peak oral air flow, mean phonatory air flow, air flow near the termination of the vowel, electroglottograph cycle width (EGGW), fundamental frequency, and voice onset time were examined for men and women during syllable repetitions and sentence productions. Significant vowel-related differences were found for all of the measures except mean phonatory air flow, and generally were not influenced by speaking task or gender. Vowel-related effects for estimated subglottal air pressure, peak oral air flow, fundamental frequency, and VOT were consistent with some earlier studies. New findings included vowel-related differences in EGGW and air flow near the termination of the vowel. We propose a model that includes the contribution of mechanical forces, reflexive neural activity, and learned neural activity to explain vowel-related effects. When vowel height is varied, changes in laryngeal cartilage positioning and vocal fold and vocal tract tension appear to influence laryngeal articulatory and phonatory function.
Collapse
Affiliation(s)
- M B Higgins
- Boys Town National Research Hospital, Omaha, NE 68131, USA.
| | | | | |
Collapse
|