1
|
Yamauchi A, Imagawa H, Yokonishi H, Sakakibara KI, Tayama N. Multivariate Analysis of Vocal Fold Vibrations in Normal Speakers Using High-Speed Digital Imaging. J Voice 2024; 38:10-17. [PMID: 34470706 DOI: 10.1016/j.jvoice.2021.08.002] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2021] [Revised: 07/30/2021] [Accepted: 08/02/2021] [Indexed: 11/18/2022]
Abstract
INTRODUCTION Little is known about the normal variations in vocal fold vibrations. We conducted a prospective study on normal subjects using high-speed digital imaging (HSDI) to elucidate key parameters regarding age/gender-related normal variations. METHODS Forty-six healthy adult volunteers were divided into young (aged ≤35 years) male, young female, elderly (aged ≥65 years) male, and elderly female subgroups. HSDI data of sustained phonation of /i/ at a comfortable pitch and loudness were obtained, and vibratory parameters were calculated using the visual-perceptual rating, laryngotopography, digital kymography, and glottal area waveform. Multivariate analysis was then performed on these parameters to clarify the subgroup-specific key parameters. RESULTS Four key parameters were identified from a total of 83: one from visual perceptual rating and three from laryngotopography. Subgroup analyses showed that posterior-to-anterior longitudinal phase difference (PD) and high fundamental frequency (F0) were specific to young female participants. A low F0 was specific to young male participants. Large anterior-to-posterior longitudinal PD and its left-right difference were specific to elderly male participants. There were no key parameters for elderly female participants. CONCLUSIONS Methods that can assess F0 and longitudinal PD, such as visual-perceptual rating and laryngotopography, were effective in the evaluation of normal vocal fold vibrations and their variations.
Collapse
Affiliation(s)
- Akihito Yamauchi
- Department of Otolaryngology, The University of Tokyo Hospital, Bunkyo-Ku, Tokyo, Japan.
| | - Hiroshi Imagawa
- Department of Otolaryngology, The University of Tokyo Hospital, Bunkyo-Ku, Tokyo, Japan
| | - Hisayuki Yokonishi
- Department of Otolaryngology, Tokyo Metropolitan Bokutoh Hospital, Sumida-Ku, Tokyo, Japan
| | - Ken-Ichi Sakakibara
- Department of Communication Disorders, Health Sciences University of Hokkaido, Ishikari-Gun, Hokkaido, Japan
| | - Niro Tayama
- Department of Otolaryngology and Tracheo-esophagology, National Center for Global Health and Medicine, Shinjuku-Ku, Tokyo, Japan
| |
Collapse
|
2
|
Kuo CFJ, Lin CS, Chiang KY, Barman J, Liu SC. In Vivo Automatic and Quantitative Measurement of Adult Human Larynx and Vocal Fold Images. J Voice 2023; 37:764-771. [PMID: 34175171 DOI: 10.1016/j.jvoice.2021.04.003] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2020] [Revised: 03/16/2021] [Accepted: 04/08/2021] [Indexed: 12/26/2022]
Abstract
INTRODUCTION Qualitative laryngoscopy belongs to a diagnostic routine. Nevertheless, quantitative morphometric measurements of laryngeal structures remain challenging. This study aimed to introduce a special laser projection device that can facilitate computer-assisted digitalized analysis and provide important quantitative information for diagnostics and treatment planning. MATERIALS AND METHODS The laryngeal images were captured with our device, which contained two parallel laser beams in order to provide the scaling reference. The maximum length of the vocal fold during respiration and vibration (phonation), vocal width at midpoint, total fold area, maximum cross-sectional area of the glottic space, and maximum vocal fold angle were determined and calculated. These parameters were analyzed and compared on the basis of age, sex, body height, body weight and body mass index. RESULTS A total of 87 subjects were enrolled in this study, comprising 39 males and 48 females. The age range for all subjects was 21 to 80 years old. The maximum value of the glottic area and vocal angle showed no significant gender difference. Both the respiration and vibration vocal fold length was significantly longer in males than in females. The vocal width revealed no gender difference, but the fold area during both respiration and phonation was significantly larger in men than in women. As for the respiration-to-vibration ratio of the vocal length, there was a trend, but without statistical significance (P = 0.06), toward a higher length compression ratio in men than in women. Meanwhile, age was found to have a strong relationship with vocal width during phonation. The width of vibration vocal fold decreased with aging significantly. CONCLUSION Our innovative module can provide reference parameters, which makes it possible to directly estimate the objective absolute values of relevant laryngeal structures. Our non-invasive approach can be used during routine laryngoscopy and the findings easily documented. In future, we can extend its clinical application to measure subtle laryngeal or hypopharyngeal changes, which are difficult to objectively quantify.
Collapse
Affiliation(s)
- Chung-Feng Jeffrey Kuo
- Department of Material Science & Engineering, National Taiwan University of Science and Technology, Taipei, Taiwan, Republic of China
| | - Chun-Shu Lin
- Department of Radiation Oncology, Tri-Service General Hospital, National Defense Medical Center
| | - Kai-Yao Chiang
- Department of Material Science & Engineering, National Taiwan University of Science and Technology, Taipei, Taiwan, Republic of China
| | - Jagadish Barman
- Department of Material Science & Engineering, National Taiwan University of Science and Technology, Taipei, Taiwan, Republic of China
| | - Shao-Cheng Liu
- Department of Otolaryngology-Head and Neck Surgery Tri-Service General Hospital, National Defense Medical Center, Taipei, Taiwan, Republic of China.
| |
Collapse
|
3
|
Codino J, Jackson-Menaldi MC, Rubin A, Torres ME. Automated Quantification of Inflection Events in The Electroglottographic Signal. J Voice 2023; 37:640-647. [PMID: 34162494 DOI: 10.1016/j.jvoice.2021.05.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2021] [Revised: 05/01/2021] [Accepted: 05/10/2021] [Indexed: 11/26/2022]
Affiliation(s)
- Juliana Codino
- Lakeshore Professional Voice Center, Lakeshore Ear, Nose and Throat Center, MI, USA
| | - María Cristina Jackson-Menaldi
- Laboratorio de Señales y Dinámicas no Lineales, Facultad de Ingeniería, Universidad Nacional de Entre Ríos, Argentina, National Council for Scientific and Technical Research (CONICET), Argentina
| | - Adam Rubin
- Laboratorio de Señales y Dinámicas no Lineales, Facultad de Ingeniería, Universidad Nacional de Entre Ríos, Argentina, National Council for Scientific and Technical Research (CONICET), Argentina
| | - María Eugenia Torres
- Laboratorio de Señales y Dinámicas no Lineales, Facultad de Ingeniería, Universidad Nacional de Entre Ríos, Argentina, National Council for Scientific and Technical Research (CONICET), Argentina
| |
Collapse
|
4
|
Murtola T, Alku P. Indicators of anterior-posterior phase difference in glottal opening measured from natural production of vowels. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2020; 148:EL141. [PMID: 32873022 DOI: 10.1121/10.0001722] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/31/2020] [Accepted: 07/22/2020] [Indexed: 06/11/2023]
Abstract
Voiced speech is generated by the glottal flow interacting with vocal fold vibrations. However, the details of vibrations in the anterior-posterior direction (the so-called zipper-effect) and their correspondence with speech and other glottal signals are not fully understood due to challenges in direct measurements of vocal fold vibrations. In this proof-of-concept study, the potential of four parameters extracted from high-speed videoendoscopy (HSV), electroglottography, and speech signals to indicate the presence of a zipper-type glottal opening is investigated. Comparison with manual labeling of the HSV videos highlighted the importance of multiple parameter-signal pairs in indicating the presence of a zipper-type glottal opening.
Collapse
Affiliation(s)
- Tiina Murtola
- Department of Signal Processing and Acoustics, Aalto University, Espoo, ,
| | - Paavo Alku
- Department of Signal Processing and Acoustics, Aalto University, Espoo, ,
| |
Collapse
|
5
|
Electroglottography – An Update. J Voice 2020; 34:503-526. [DOI: 10.1016/j.jvoice.2018.12.014] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2018] [Revised: 12/27/2018] [Accepted: 12/28/2018] [Indexed: 11/21/2022]
|
6
|
Selamtzis A, Ternström S, Richter B, Burk F, Köberlein M, Echternach M. A comparison of electroglottographic and glottal area waveforms for phonation type differentiation in male professional singers. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2018; 144:3275. [PMID: 30599695 DOI: 10.1121/1.5066456] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/01/2017] [Accepted: 10/14/2018] [Indexed: 06/09/2023]
Abstract
This study compares the use of electroglottograms (EGGs) and glottal area waveforms (GAWs) to study phonation in different vibratory states as produced by professionally trained singers. Six western classical tenors were asked to phonate pitch glides from modal to falsetto phonation, or from modal to their stage voice above the passaggio (SVaP). For each pitch glide the sample entropy (SampEn) of the EGG signal was calculated to detect the occurrence of phonatory instabilities and establish a "ground truth" for the performed phonation type. The cycles before the maximum SampEn were labeled as modal, and the cycles after the peak were labeled as either falsetto, or SVaP. Three automatic categorizations of vibratory state were performed using clustering: one based only on the EGG, one based on the GAW, and one based on their combination. The error rate (clustering vs ground truth) was, on average, lower than 10% for all of the three settings, revealing no special advantage of the GAW over EGG, and vice versa. Modal voice cycles exhibited a larger contact quotient, larger normalized derivative peak ratio, and lower rise time, compared to SVaP and falsetto. The GAW-based normalized maximum area declination rate was larger in SVaP compared to modal voice.
Collapse
Affiliation(s)
- Andreas Selamtzis
- Department of Speech, Music and Hearing, School of Electrical Engineering and Computer Science, KTH Royal Institute of Technology, Lindstedtsvägen 24, Breisacherstraße 60, Stockholm, SE-100 44, Sweden
| | - Sten Ternström
- Department of Speech, Music and Hearing, School of Electrical Engineering and Computer Science, KTH Royal Institute of Technology, Lindstedtsvägen 24, Breisacherstraße 60, Stockholm, SE-100 44, Sweden
| | - Bernard Richter
- Institute of Musicians' Medicine, Freiburg University Medical Center, Breisacher Strausse 60, Freiburg, 79106, Germany
| | - Fabian Burk
- Department of Otorhinolaryngology, University Medical Center Schleswig-Holstein, Arnold-Heller-Straße 3, 24105 Kiel, Germany
| | - Marie Köberlein
- Institute of Musicians' Medicine, Freiburg University Medical Center, Breisacher Strausse 60, Freiburg, 79106, Germany
| | - Matthias Echternach
- Department of Otorhinolaryngology, Division of Phoniatrics and Pediatric Audiology, Munich University, Campus Großhadern, Marchioninistraße 15, Munich, 81377, Germany
| |
Collapse
|
7
|
Sielska-Badurek EM, Jędra K, Sobol M, Niemczyk K, Osuch-Wójcikiewicz E. Laryngeal stroboscopy-Normative values for amplitude, open quotient, asymmetry and phase difference in young adults. Clin Otolaryngol 2018; 44:158-165. [PMID: 30353981 DOI: 10.1111/coa.13247] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2017] [Revised: 05/10/2018] [Accepted: 10/18/2018] [Indexed: 11/26/2022]
Abstract
OBJECTIVE To provide the normative values for laryngeal stroboscopy (LS) concerning amplitude, open quotient, asymmetry and phase difference in healthy, young subjects. STUDY DESIGN Prospective case-control study. SETTING Patients treated at a single institute. METHODS A total of 68 healthy subjects were included in the study (35 women, 33 men), aged 18-35 years. After obtaining LS recordings, image processing was performed to attain parameters of vocal fold vibration. RESULTS In women, the location of the maximum vibration amplitude is approximately in the 1/3 posterior part of the glottis, while in men, the location is moved to the glottis centre. In males, the relative amplitude vibration of the vocal folds in the 1/3 anterior part of the glottis was significantly higher than in females (P = 0.029). Women showed significantly higher open quotients (OQ) at the posterior part of the glottis than the male subjects (P < 0.001) and men presented significantly higher OQ at the anterior part of the glottis than the females (P < 0.001). The average OQ values for both sexes were almost the same. Females showed significantly higher relative glottal gap area (P = 0.044). Women presented a significantly lower amplitude asymmetry than men (P = 0.002). The weighted absolute left-right phase difference reached up to 24° and remained insignificantly higher in the men than the women (P = 0.142). CONCLUSIONS The study provides normative values for LS in young adults for the measurement of therapy outcomes in patients with voice disorders and realisation of evidence-based medicine. The LS parametrisation is easy to perform in clinical practice.
Collapse
Affiliation(s)
| | - Katarzyna Jędra
- Department of Otolaryngology, Medical University of Warsaw, Warsaw, Poland
| | - Maria Sobol
- Department of Biophysics and Human Physiology, Medical University of Warsaw, Warsaw, Poland
| | - Kazimierz Niemczyk
- Department of Otolaryngology, Medical University of Warsaw, Warsaw, Poland
| | | |
Collapse
|
8
|
Rasmussen JH, Herbst CT, Elemans CPH. Quantifying syringeal dynamics in vitro using electroglottography. ACTA ACUST UNITED AC 2018; 221:jeb.172247. [PMID: 29880637 DOI: 10.1242/jeb.172247] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2017] [Accepted: 05/30/2018] [Indexed: 11/20/2022]
Abstract
The complex and elaborate vocalizations uttered by many of the 10,000 extant bird species are considered a major driver in their evolutionary success, warranting study of the underlying mechanisms of vocal production. Additionally, birdsong has developed into a highly productive model system for vocal imitation learning and motor control, where, in contrast to humans, we have experimental access to the entire neuromechanical control loop. In human voice production, complex laryngeal geometry, vocal fold tissue properties, airflow and laryngeal musculature all interact to ultimately control vocal fold kinematics. Quantifying vocal fold kinematics is thus critical to understanding neuromechanical control of voiced sound production, but in vivo imaging of vocal fold kinematics in birds is experimentally challenging. Here, we adapted and tested electroglottography (EGG) as a novel tool for examining vocal fold kinematics in the avian vocal organ, the syrinx. We furthermore imaged and quantified syringeal kinematics in the pigeon (Columba livia) syrinx with unprecedented detail. Our results show that EGG signals predict (1) the relative amount of contact between the avian equivalent of vocal folds and (2) essential parameters describing vibratory kinematics, such as fundamental frequency, and timing of syringeal opening and closing events. As such, EGG provides novel opportunities for measuring syringeal vibratory kinematic parameters in vivo Furthermore, the opportunity for imaging syringeal vibratory kinematics from multiple planar views (horizontal and coronal) simultaneously promotes birds as an excellent model system for studying kinematics and control of voiced sound production in general, including in humans and other mammals.
Collapse
Affiliation(s)
- Jeppe H Rasmussen
- Department of Biology, University of Southern Denmark, 5230 Odense, Denmark
| | - Christian T Herbst
- Department of Cognitive Biology, University of Vienna, 1090 Vienna, Austria
| | - Coen P H Elemans
- Department of Biology, University of Southern Denmark, 5230 Odense, Denmark
| |
Collapse
|
9
|
Herbst CT, Schutte HK, Bowling DL, Svec JG. Comparing Chalk With Cheese—The EGG Contact Quotient Is Only a Limited Surrogate of the Closed Quotient. J Voice 2017; 31:401-409. [DOI: 10.1016/j.jvoice.2016.11.007] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2016] [Revised: 11/06/2016] [Accepted: 11/08/2016] [Indexed: 10/20/2022]
|
10
|
Vahabzadeh-Hagh AM, Zhang Z, Chhetri DK. Quantitative Evaluation of the In Vivo Vocal Fold Medial Surface Shape. J Voice 2017; 31:513.e15-513.e23. [PMID: 28089390 DOI: 10.1016/j.jvoice.2016.12.004] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2016] [Accepted: 12/02/2016] [Indexed: 11/29/2022]
Abstract
OBJECTIVES/HYPOTHESIS Glottal insufficiency is a common clinical problem in otolaryngology and medialization laryngoplasty (ML) procedures remain the primary treatment modality. Although the goal of ML is to restore physiologic glottal posture and achieve optimal phonation, this posture has not been directly measured. In this study, we assessed glottal medial surface contour changes with selective activation of the intrinsic laryngeal muscles (ILMs). STUDY DESIGN Basic science study using an in vivo canine hemilarynx model. METHODS In an in vivo canine hemilarynx, India ink was used to mark fleshpoints in a grid-like fashion along the medial surface of the vocal fold and ILMs were activated in a graded manner. A right-angled prism provided two views of the medial surface, which were recorded using a high-speed camera and used to reconstruct the 3D posture deformations of the medial surface. RESULTS Thyroarytenoid (TA) muscle activation results in initial inferomedial bulging and increased glottal channel thickness and then glottal adduction with a final rectangular glottal channel shape. Lateral cricoarytenoid (LCA) activation closes the posterior glottis but final posture remains slightly convergent. Together, TA + LCA forms a rectangular glottis with an increased glottal vertical thickness. Posterior cricoarytenoid activation results in abduction and a slightly divergent glottis, whereas cricothyroid activation elongates the glottis and reduces the glottal channel vertical thickness. CONCLUSIONS A quantitative analysis of in vivo canine vocal fold medial surface upon activation of selective ILMs is provided. This may guide our therapeutic efforts during medialization laryngoplasty, as well as computational modeling of laryngeal physiology.
Collapse
Affiliation(s)
- Andrew M Vahabzadeh-Hagh
- Department of Head and Neck Surgery, David Geffen School of Medicine at UCLA, Los Angeles, California.
| | - Zhaoyan Zhang
- Department of Head and Neck Surgery, David Geffen School of Medicine at UCLA, Los Angeles, California
| | - Dinesh K Chhetri
- Department of Head and Neck Surgery, David Geffen School of Medicine at UCLA, Los Angeles, California
| |
Collapse
|
11
|
Automated Electroglottographic Inflection Events Detection. A Pilot Study. J Voice 2016; 30:768.e1-768.e10. [DOI: 10.1016/j.jvoice.2015.10.020] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2015] [Accepted: 10/29/2015] [Indexed: 11/20/2022]
|
12
|
Vahabzadeh-Hagh AM, Zhang Z, Chhetri DK. Three-dimensional posture changes of the vocal fold from paired intrinsic laryngeal muscles. Laryngoscope 2016; 127:656-664. [PMID: 27377032 DOI: 10.1002/lary.26145] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2016] [Accepted: 05/31/2016] [Indexed: 11/08/2022]
Abstract
OBJECTIVES/HYPOTHESIS Although the geometry of the vocal fold medial surface affects voice quality and is critical in the treatment of glottic insufficiency, the prephonatory shape of the vocal fold medial surface is not well understood. In this study, we activated intrinsic laryngeal muscles individually and in combinations, and recorded the temporal sequence and precise three-dimensional configurational changes of the vocal fold medial surface. STUDY DESIGN In vivo canine hemilarynx model. METHODS A hemilaryngectomy was performed in an in vivo canine model and ink was used to mark the medial surface of the in situ vocal fold in a grid-like fashion. The thyroarytenoid (TA), lateral cricoarytenoid (LCA), cricothyroid (CT), and posterior cricoarytenoid (PCA) muscles were stimulated individually and in combinations. A right-angle prism whose hypotenuse formed the glottal midline provided two distinct views of the medial surface for a high-speed digital camera. Image-processing package DaVis (LaVision Inc., Goettingen, Germany) allowed time series cross-correlation analysis for three-dimensional deformation calculations of the vocal fold medial surface. RESULTS Combined TA and LCA activation yields an evenly adducted rectangular glottal surface. Addition of thyroarytenoid to cricoarytenoid adducts the vocal fold from inferior to superior in a graded fashion allowing formation of a divergent glottis. Posterior cricoarytenoid has a bimodal relationship with thyroarytenoid favoring abduction. Cricothyroid and lateral cricoarytenoid yield unique glottal postures necessary but likely not conducive for efficient phonation. CONCLUSIONS Understanding the three-dimensional geometry of the vocal fold medial surface will help us better understand the cause-effect relationship between laryngeal physiology and phonation. LEVEL OF EVIDENCE NA Laryngoscope, 127:656-664, 2017.
Collapse
Affiliation(s)
- Andrew M Vahabzadeh-Hagh
- Department of Head and Neck Surgery, University of California Los Angeles, David Geffen School of Medicine, Los Angeles, California, U.S.A
| | - Zhaoyan Zhang
- Department of Head and Neck Surgery, University of California Los Angeles, David Geffen School of Medicine, Los Angeles, California, U.S.A
| | - Dinesh K Chhetri
- Department of Head and Neck Surgery, University of California Los Angeles, David Geffen School of Medicine, Los Angeles, California, U.S.A
| |
Collapse
|
13
|
Herbst CT, Hess M, Müller F, Švec JG, Sundberg J. Glottal Adduction and Subglottal Pressure in Singing. J Voice 2015; 29:391-402. [PMID: 25944295 DOI: 10.1016/j.jvoice.2014.08.009] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2014] [Accepted: 08/13/2014] [Indexed: 11/17/2022]
Abstract
Previous research suggests that independent variation of vocal loudness and glottal configuration (type and degree of vocal fold adduction) does not occur in untrained speech production. This study investigated whether these factors can be varied independently in trained singing and how subglottal pressure is related to average glottal airflow, voice source properties, and sound level under these conditions. A classically trained baritone produced sustained phonations on the endoscopic vowel [i:] at pitch D4 (approximately 294 Hz), exclusively varying either (a) vocal register; (b) phonation type (from "breathy" to "pressed" via cartilaginous adduction); or (c) vocal loudness, while keeping the others constant. Phonation was documented by simultaneous recording of videokymographic, electroglottographic, airflow and voice source data, and by percutaneous measurement of relative subglottal pressure. Register shifts were clearly marked in the electroglottographic wavegram display. Compared with chest register, falsetto was produced with greater pulse amplitude of the glottal flow, H1-H2, mean airflow, and with lower maximum flow declination rate (MFDR), subglottal pressure, and sound pressure. Shifts of phonation type (breathy/flow/neutral/pressed) induced comparable systematic changes. Increase of vocal loudness resulted in increased subglottal pressure, average flow, sound pressure, MFDR, glottal flow pulse amplitude, and H1-H2. When changing either vocal register or phonation type, subglottal pressure and mean airflow showed an inverse relationship, that is, variation of glottal flow resistance. The direct relation between subglottal pressure and airflow when varying only vocal loudness demonstrated independent control of vocal loudness and glottal configuration. Achieving such independent control of phonatory control parameters would be an important target in vocal pedagogy and in voice therapy.
Collapse
Affiliation(s)
- Christian T Herbst
- Voice Research Lab, Department of Biophysics, Faculty of Science, Palacký University Olomouc, Olomouc, Czech Republic; Laboratory of Bio-Acoustics, Department of Cognitive Biology, University of Vienna, Wien, Austria.
| | - Markus Hess
- Department of Voice, Speech and Hearing Disorders, University Medical Center Hamburg-Eppendorf, University of Hamburg, Hamburg, Germany
| | - Frank Müller
- Department of Voice, Speech and Hearing Disorders, University Medical Center Hamburg-Eppendorf, University of Hamburg, Hamburg, Germany
| | - Jan G Švec
- Voice Research Lab, Department of Biophysics, Faculty of Science, Palacký University Olomouc, Olomouc, Czech Republic
| | - Johan Sundberg
- Department of Speech, Music, and Hearing, School of Computer Science and Communication, KTH Voice Research Centre, Stockholm, Sweden; University College of Music Education Stockholm, Stockholm, Sweden
| |
Collapse
|
14
|
Abstract
The laryngeal video stroboscope is an important instrument to test glottal diseases and read vocal fold images and voice quality for physician clinical diagnosis. This study is aimed to develop a medical system with functionality of automatic intelligent recognition of dynamic images. The static images of glottis opening to the largest extent and closing to the smallest extent were screened automatically using color space transformation and image preprocessing. The glottal area was also quantized. As the tongue base movements affected the position of laryngoscope and saliva would result in unclear images, this study used the gray scale adaptive entropy value to set the threshold in order to establish an elimination system. The proposed system can improve the effect of automatically captured images of glottis and achieve an accuracy rate of 96%. In addition, the glottal area and area segmentation threshold were calculated effectively. The glottis area segmentation was corrected, and the glottal area waveform pattern was drawn automatically to assist in vocal fold diagnosis. When developing the intelligent recognition system for vocal fold disorders, this study analyzed the characteristic values of four vocal fold patterns, namely, normal vocal fold, vocal fold paralysis, vocal fold polyp, and vocal fold cyst. It also used the support vector machine classifier to identify vocal fold disorders and achieved an identification accuracy rate of 98.75%. The results can serve as a very valuable reference for diagnosis.
Collapse
|
15
|
Herbst CT, Svec JG, Lohscheller J, Frey R, Gumpenberger M, Stoeger AS, Fitch WT. Complex vibratory patterns in an elephant larynx. ACTA ACUST UNITED AC 2014; 216:4054-64. [PMID: 24133151 DOI: 10.1242/jeb.091009] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
Elephants' low-frequency vocalizations are produced by flow-induced self-sustaining oscillations of laryngeal tissue. To date, little is known in detail about the vibratory phenomena in the elephant larynx. Here, we provide a first descriptive report of the complex oscillatory features found in the excised larynx of a 25 year old female African elephant (Loxodonta africana), the largest animal sound generator ever studied experimentally. Sound production was documented with high-speed video, acoustic measurements, air flow and sound pressure level recordings. The anatomy of the larynx was studied with computed tomography (CT) and dissections. Elephant CT vocal anatomy data were further compared with the anatomy of an adult human male. We observed numerous unusual phenomena, not typically reported in human vocal fold vibrations. Phase delays along both the inferior-superior and anterior-posterior (A-P) dimension were commonly observed, as well as transverse travelling wave patterns along the A-P dimension, previously not documented in the literature. Acoustic energy was mainly created during the instant of glottal opening. The vestibular folds, when adducted, participated in tissue vibration, effectively increasing the generated sound pressure level by 12 dB. The complexity of the observed phenomena is partly attributed to the distinct laryngeal anatomy of the elephant larynx, which is not simply a large-scale version of its human counterpart. Travelling waves may be facilitated by low fundamental frequencies and increased vocal fold tension. A travelling wave model is proposed, to account for three types of phenomena: A-P travelling waves, 'conventional' standing wave patterns, and irregular vocal fold vibration.
Collapse
Affiliation(s)
- Christian T Herbst
- Laboratory of Bio-Acoustics, Department of Cognitive Biology, University of Vienna, Althanstraße 14, 1090 Wien, Austria
| | | | | | | | | | | | | |
Collapse
|
16
|
Yamauchi A, Yokonishi H, Imagawa H, Sakakibara KI, Nito T, Tayama N, Yamasoba T. Age- and gender-related difference of vocal fold vibration and glottal configuration in normal speakers: analysis with glottal area waveform. J Voice 2014; 28:525-31. [PMID: 24836359 DOI: 10.1016/j.jvoice.2014.01.016] [Citation(s) in RCA: 42] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2013] [Accepted: 01/28/2014] [Indexed: 11/17/2022]
Abstract
INTRODUCTION Glottal area waveform (GAW) analysis is widely used in the assessment of vocal fold vibration by high-speed digital imaging (HSDI). Because normative GAW data obtained from a large number of subjects have not been reported, we conducted a prospective study to obtain normative results for GAW analysis of HSDI findings and clarify normal variations associated with gender and age. METHODS Vocally healthy adults were divided into a young group (aged ≤ 35 years) and an elderly group (aged ≥ 65 years). The configuration and size of the glottal area were assessed at different phases of the glottal cycle, and gender- and age-related differences were evaluated. RESULTS A total of 26 young subjects (nine men and 17 women; mean age: 27 years) and 20 elderly subjects (eight men and 12 women; mean age: 73 years) were investigated. The glottal area at different points of the glottal cycle showed a negative correlation with frequency. Although the GAW parameters of young women appeared to be different from those of the other subgroups, the differences were not statistically significant. Young women predominantly had a triangular- or vase-shaped glottal configuration at all frequencies, whereas the other subgroups showed various glottal shapes. CONCLUSION The present study clarified gender- and age-related differences of GAW parameters obtained with HSDI. Young women were likely to show different glottal configurations and different responses to frequency changes from those of young men, elderly men, and elderly women. Phonosurgeons should pay attention to the normal variations detected in the present study.
Collapse
Affiliation(s)
- Akihito Yamauchi
- Department of Otolaryngology, The University of Tokyo Hospital, Bunkyo-Ku, Tokyo, Japan.
| | - Hisayuki Yokonishi
- Department of Otolaryngology, The Tokyo Metropolitan Police Hospital, Nakano-Ku, Tokyo, Japan
| | - Hiroshi Imagawa
- Department of Otolaryngology, The University of Tokyo Hospital, Bunkyo-Ku, Tokyo, Japan
| | - Ken-Ichi Sakakibara
- Department of Communication Disorders, The Health Sciences University of Hokkaido, Ishikari-Gun, Hokkaido, Japan
| | - Takaharu Nito
- Department of Otolaryngology, The University of Tokyo Hospital, Bunkyo-Ku, Tokyo, Japan
| | - Niro Tayama
- Department of Otolaryngology and Tracheo-esophagology, The National Center for Global Health and Medicine, Shinjuku-Ku, Tokyo, Japan
| | - Tatsuya Yamasoba
- Department of Otolaryngology, The University of Tokyo Hospital, Bunkyo-Ku, Tokyo, Japan
| |
Collapse
|
17
|
Yamauchi A, Imagawa H, Sakakibara KI, Yokonishi H, Nito T, Yamasoba T, Tayama N. Characteristics of vocal fold vibrations in vocally healthy subjects: analysis with multi-line kymography. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2014; 57:S648-S657. [PMID: 24686860 DOI: 10.1044/2014_jslhr-s-12-0269] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]
Abstract
PURPOSE In this study, the authors aimed to analyze longitudinal data from high-speed digital images in normative subjects using multi-line kymography. METHOD Vocally healthy subjects were divided into young (9 men and 17 women; Mage = 27 years) and older groups (8 men and 12 women; Mage = 73 years). From high-speed digital images of phonation at a conversational frequency kymograms were created at 5 different levels of the vocal fold and were analyzed to determine the opening/closing longitudinal phase difference, open quotient, and speed index. Then age- and gender-related differences of these parameters were analyzed statistically. RESULTS Young women frequently showed a pattern of posterior-to-anterior glottal opening and anterior-to-posterior glottal closure, and older women demonstrated various opening and closing patterns. Both young men and older men were similar to older women. The open quotient was maximal at the most posterior glottal level in young women, but it tended to be maximal at the anterior glottis in the other subgroups. The mean value of the 5 open quotients was largest in young women. The mean speed index had a large negative value in older subjects. CONCLUSION This study provides the first information about age-related differences of longitudinal oscillatory characteristics of the vocal folds obtained with high-speed digital imaging.
Collapse
|
18
|
Titze IR. Bi-stable vocal fold adduction: a mechanism of modal-falsetto register shifts and mixed registration. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2014; 135:2091-101. [PMID: 25235006 PMCID: PMC4167751 DOI: 10.1121/1.4868355] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/09/2023]
Abstract
The origin of vocal registers has generally been attributed to differential activation of cricothyroid and thyroarytenoid muscles in the larynx. Register shifts, however, have also been shown to be affected by glottal pressures exerted on vocal fold surfaces, which can change with loudness, pitch, and vowel. Here it is shown computationally and with empirical data that intraglottal pressures can change abruptly when glottal adductory geometry is changed relatively smoothly from convergent to divergent. An intermediate shape between large convergence and large divergence, namely, a nearly rectangular glottal shape with almost parallel vocal fold surfaces, is associated with mixed registration. It can be less stable than either of the highly angular shapes unless transglottal pressure is reduced and upper stiffness of vocal fold tissues is balanced with lower stiffness. This intermediate state of adduction is desirable because it leads to a low phonation threshold pressure with moderate vocal fold collision. Achieving mixed registration consistently across wide ranges of F0, lung pressure, and vocal tract shapes appears to be a balancing act of coordinating laryngeal muscle activation with vocal tract pressures. Surprisingly, a large transglottal pressure is not facilitative in this process, exacerbating the bi-stable condition and the associated register contrast.
Collapse
Affiliation(s)
- Ingo R Titze
- National Center for Voice and Speech, 136 South Main Street, Suite 320, Salt Lake City, Utah 84101
| |
Collapse
|
19
|
Herbst CT, Lohscheller J, Švec JG, Henrich N, Weissengruber G, Fitch WT. Glottal opening and closing events investigated by electroglottography and super-high-speed video recordings. J Exp Biol 2014; 217:955-63. [DOI: 10.1242/jeb.093203] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
Previous research has suggested that the peaks in the first derivative (dEGG) of the electroglottographic (EGG) signal are good approximate indicators of the events of glottal opening and closing. These findings were based on high-speed video (HSV) recordings with frame rates 10 times lower than the sampling frequencies of the corresponding EGG data. The present study attempts to corroborate these previous findings, utilizing super-HSV recordings. The HSV and EGG recordings (sampled at 27 and 44 kHz, respectively) of an excised canine larynx phonation were synchronized by an external TTL signal to within 0.037 ms. Data were analyzed by means of glottovibrograms, digital kymograms, the glottal area waveform and the vocal fold contact length (VFCL), a new parameter representing the time-varying degree of ‘zippering’ closure along the anterior–posterior (A–P) glottal axis. The temporal offsets between glottal events (depicted in the HSV recordings) and dEGG peaks in the opening and closing phase of glottal vibration ranged from 0.02 to 0.61 ms, amounting to 0.24–10.88% of the respective glottal cycle durations. All dEGG double peaks coincided with vibratory A–P phase differences. In two out of the three analyzed video sequences, peaks in the first derivative of the VFCL coincided with dEGG peaks, again co-occurring with A–P phase differences. The findings suggest that dEGG peaks do not always coincide with the events of glottal closure and initial opening. Vocal fold contacting and de-contacting do not occur at infinitesimally small instants of time, but extend over a certain interval, particularly under the influence of A–P phase differences.
Collapse
Affiliation(s)
- Christian T. Herbst
- Voice Research Laboratory, Department of Biophysics, Faculty of Science, Palacký University Olomouc, tr. 17. Listopadu 12, 771 46 Olomouc, Czech Republic
- Laboratory of Bio-Acoustics, Department of Cognitive Biology, University of Vienna, Althanstraße 14, 1090 Vienna, Austria
| | - Jörg Lohscheller
- University of Applied Sciences, Department of Computer Science, Schneidershof, 54293 Trier, Germany
| | - Jan G. Švec
- Voice Research Laboratory, Department of Biophysics, Faculty of Science, Palacký University Olomouc, tr. 17. Listopadu 12, 771 46 Olomouc, Czech Republic
| | - Nathalie Henrich
- GIPSA-lab, CNRS, Grenoble INP, Grenoble University, 11 rue des Mathématiques – BP 46, 38402 Saint Martin d'Hères cedex, France
| | - Gerald Weissengruber
- University of Veterinary Medicine Vienna, Institute for Anatomy, Histology and Embryology, Veterinärplatz 1, 1210 Vienna, Austria
| | - W. Tecumseh Fitch
- Laboratory of Bio-Acoustics, Department of Cognitive Biology, University of Vienna, Althanstraße 14, 1090 Vienna, Austria
| |
Collapse
|
20
|
Lohscheller J, Svec JG, Döllinger M. Vocal fold vibration amplitude, open quotient, speed quotient and their variability along glottal length: kymographic data from normal subjects. LOGOP PHONIATR VOCO 2012; 38:182-92. [PMID: 23173880 DOI: 10.3109/14015439.2012.731083] [Citation(s) in RCA: 62] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
Abstract Quantitative knowledge about healthy vocal fold vibration characteristics provides the basis for an objective assessment of vocal fold vibrations. In this study, using high-speed videolaryngoscopy the alterations of the relative vibration amplitudes, open quotients, and speed quotients were analyzed along the glottal length in 30 male and 30 female healthy subjects. The maximum vibration amplitude was identified at 41.1% ± 10.8% and 46.5% ± 18.0% of the visible glottal length in females and males, respectively. The average open quotients decreased in females and males from posterior to anterior, while the speed quotients did not change systematically. The reported normative values can be used to distinguish normal and abnormal vibrations in clinical practice when aiming at quantitative diagnosis of functional voice disorders.
Collapse
Affiliation(s)
- Jörg Lohscheller
- Department of Computer Science, University of Applied Sciences Trier , Germany
| | | | | |
Collapse
|
21
|
Analysis of longitudinal phase differences in vocal-fold vibration using synchronous high-speed videoendoscopy and electroglottography. J Voice 2012; 26:816.e13-20. [PMID: 23059188 DOI: 10.1016/j.jvoice.2012.04.009] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2011] [Accepted: 04/26/2012] [Indexed: 11/23/2022]
Abstract
OBJECTIVE This investigation used synchronous high-speed videoendoscopy and electroglottography (EGG) to systematically study contact and separation behavior along the length of the vocal folds. DESIGN Repeated measures. METHODS Facilitated by EGG and digital kymograms derived at 20%, 35%, 50%, 65%, and 80% of the posteroanterior length of the vocal folds, the pattern of vocal-fold contact and separation was determined for seven female and seven male vocally healthy subjects while producing "breathy," "comfortable," and "pressed" phonations. RESULTS The female subjects consistently used an anterior-to-posterior contact pattern and posterior-to-anterior separation pattern when producing a breathy or comfortable voice, with several using a simultaneous pattern of contact and/or separation for pressed phonation. The male subjects showed more variable "zipperlike" separation patterns, but consistently used a simultaneous contact pattern for pressed voice that was also commonly used when producing comfortable phonation. CONCLUSIONS Findings indicate longitudinal phase differences in vocal-fold vibration are both common and expected in vocally healthy speakers. The implications for vocal assessment, as well as for the use and interpretation of the EGG signal, are discussed.
Collapse
|
22
|
Kelley RT, Colton RH, Casper J, Paseman A, Brewer D. Evaluation of Stroboscopic Signs. J Voice 2011; 25:490-5. [DOI: 10.1016/j.jvoice.2010.03.004] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2010] [Accepted: 03/23/2010] [Indexed: 10/19/2022]
|
23
|
Herbst CT, Fitch WTS, Svec JG. Electroglottographic wavegrams: a technique for visualizing vocal fold dynamics noninvasively. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2010; 128:3070-3078. [PMID: 21110602 DOI: 10.1121/1.3493423] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/30/2023]
Abstract
A method for analyzing and displaying electroglottographic (EGG) signals (and their first derivative, DEGG) is introduced: the electroglottographic wavegram ("wavegram" hereafter). To construct a wavegram, the time-varying fundamental frequency is measured and consecutive individual glottal cycles are identified. Each cycle is locally normalized in duration and amplitude, the signal values are encoded by color intensity and the cycles are concatenated to display the entire voice sample in a single image, similar as in sound spectrography. The wavegram provides an intuitive means for quickly assessing vocal fold contact phenomena and their variation over time. Variations in vocal fold contact appear here as a sequence of events rather than single phenomena, taking place over a certain period of time, and changing with pitch, loudness and register. Multiple DEGG peaks are revealed in wavegrams to behave systematically, indicating subtle changes of vocal fold oscillatory regime. As such, EGG wavegrams promise to reveal more information on vocal fold contacting and de-contacting events than previous methods.
Collapse
Affiliation(s)
- Christian T Herbst
- Laboratory of Biophysics, Department of Experimental Physics, Faculty of Science, Palacký University Olomouc, tř 17 listopadu 12, 771 46 Olomouc, Czech Republic.
| | | | | |
Collapse
|
24
|
Avelino H. Acoustic and Electroglottographic Analyses of Nonpathological, Nonmodal Phonation. J Voice 2010; 24:270-80. [DOI: 10.1016/j.jvoice.2008.10.002] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2008] [Accepted: 10/02/2008] [Indexed: 10/20/2022]
|
25
|
Advances in laryngeal imaging. Eur Arch Otorhinolaryngol 2009; 266:1509-20. [PMID: 19618198 DOI: 10.1007/s00405-009-1050-4] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2008] [Accepted: 07/07/2009] [Indexed: 10/20/2022]
Abstract
Imaging and image analysis became an important issue in laryngeal diagnostics. Various techniques, such as videostroboscopy, videokymography, digital kymography, or ultrasonography are available and are used in research and clinical practice. This paper reviews recent advances in imaging for laryngeal diagnostics.
Collapse
|
26
|
Herbst C, Ternström S. A comparison of different methods to measure the EGG contact quotient. LOGOP PHONIATR VOCO 2009; 31:126-38. [PMID: 16966155 DOI: 10.1080/14015430500376580] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
Abstract
The results from six published electroglottographic (EGG-based) methods for calculating the EGG contact quotient (CQEGG) were compared to closed quotients derived from simultaneous videokymographic imaging (CQKYM). Two trained male singers phonated in falsetto and in chest register, with two degrees of adduction in both registers. The maximum difference between methods in the CQEGG was 0.3 (out of 1.0). The CQEGG was generally lower than the CQKYM. Within subjects, the CQEGG co-varied with the CQkym, but with changing offsets depending on method. The CQEGG cannot be calculated for falsetto phonation with little adduction, since there is no complete glottal closure. Basic criterion-level methods with thresholds of 0.2 or 0.25 gave the best match to the CQKYM data. The results suggest that contacting and de-contacting in the EGG might not refer to the same physical events as do the beginning and cessation of airflow.
Collapse
|
27
|
Kunduk M, Yan Y, McWhorter AJ, Bless D. Investigation of voice initiation and voice offset characteristics with high-speed digital imaging. LOGOP PHONIATR VOCO 2009; 31:139-44. [PMID: 16966156 DOI: 10.1080/14015430500364065] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
Abstract
This study involves preliminary investigation of the characteristics of the voice initiation period (VIP) and voice offset period (VOP) using high-speed digital imaging. The goals of the study were to develop a methodology to objectively analyze these periods of phonation and to explore the feasibility of studying the effects of aging on these phonation segments. Results of the analysis of the data from two female subjects, one younger and one older, with the developed methodology, demonstrated that the older subject's VIP was characterized by a slow and irregular increase in glottal area waveform (GAW) until reaching 90% of the maximum opening of the glottis at 244 frames or 122 ms. The younger subject demonstrated a sharp increase in GAW during VIP, taking only 155 frames or 77.5 ms to reach the 90% mark. Also, the older subject took a greater number of frames for the vocal fold vibration to come to a complete stop than the younger subject during the VOP; 275 frames and 150 frames respectively.
Collapse
Affiliation(s)
- Melda Kunduk
- Louisiana State University, Department of Communication Disorders, New Orleans, LA 70112, USA.
| | | | | | | |
Collapse
|
28
|
Dynamic MRI of larynx and vocal fold vibrations in normal phonation. J Voice 2007; 23:235-9. [PMID: 18082366 DOI: 10.1016/j.jvoice.2007.08.008] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2007] [Accepted: 08/28/2007] [Indexed: 11/21/2022]
Abstract
Dynamic magnetic resonance imaging (MRI) of the larynx and vocal folds during phonation was used for measuring the vertical laryngeal movements and the glottal angle of the vocal folds opening and closing in dynamic phase. The data used in this analysis were taken on 10 healthy volunteers during maximal inspiration and the prolonged phonation of the vowels [i] (as in key), [a] (as in car), [u] (as in loop), and the consonant [sh] (as in ship). The results of our MRI data have demonstrated the difference of the vocal folds movement in relation to the vowel and consonant sounds, with a large glottal opening for [sh] and a narrow opening for [i] and [u], and the difference of the laryngeal position in relation to the vowels, with [a] and [u] having a lower larynx position than [i]. Imaging the larynx's positions and the vocal folds' vibrations is possible using dynamic MRI. This technique permits measurements of laryngeal structures and glottal parameters in dynamic function with multiplanar high-resolution imaging. Analysis of laryngeal activity and vocal folds' vibration may be helpful for the evaluation of the phonation function and for the understanding of the physiology of vocal production and voice modulation.
Collapse
|
29
|
Michaud A. Final consonants and glottalization: new perspectives from Hanoi Vietnamese. PHONETICA 2004; 61:119-146. [PMID: 15662108 DOI: 10.1159/000082560] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
The evolution from final obstruents to final glottal stop and then to rhyme glottalization (i.e. from /at/ to /a?/, then to /a'/) is a well-established general trend in the history of the Sino-Tibetan language family and beyond. It has further been shown by laryngoscopy that in three languages which retain the nonreleased syllable-final obstruents /p/, /t/ and /k/ (Standard Thai, and two Chinese dialects), these obstruents are often accompanied by a glottal stop. The present research raises the issue whether there is another typological possibility: can nonreleased final obstruents be accompanied consistently by modal phonation, without glottal stop? Analysis of electroglottographic recordings of 126 syllables in two carrier sentences spoken by 4 speakers shows that, in Hanoi Vietnamese, the final obstruents /p/, /t/ and /k/ are not accompanied by glottalization, and that the open quotient increases in the course of the syllable rhyme. Obstruent-final rhymes (which may carry either of two tones: D1 or D2) are compared with nasal-final rhymes which, under one of the tones (tone B2), are confirmed to be glottalized. Our finding is that tones D1 and D2 (i.e. obstruent-final rhymes) are both produced in modal voice, which shows that the typological paradigm of observed realizations of syllable-final obstruents must be enlarged. The discussion puts forward the hypothesis that the unusual association of segments and voice quality found in Hanoi Vietnamese is a strategy to maintain the opposition between B2-tone and D2-tone rhymes.
Collapse
Affiliation(s)
- Alexis Michaud
- Laboratoire Phonétique et Phonologie (UMR 7018) CNRS/Sorbonne Nouvelle, Paris, France.
| |
Collapse
|
30
|
Henrich N, d'Alessandro C, Doval B, Castellengo M. On the use of the derivative of electroglottographic signals for characterization of nonpathological phonation. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2004; 115:1321-1332. [PMID: 15058354 DOI: 10.1121/1.1646401] [Citation(s) in RCA: 92] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
Electroglottography is a common method for providing noninvasive measurements of glottal activity. The derivative of the electroglottographic signal, however, has not attracted much attention, although it yields reliable indicators of glottal closing instants. The purpose of this paper is to provide a guide to the usefulness of this signal. The main features that are to be found in this signal are presented on the basis of an extensive analysis of a database of items sung by 18 trained singers. Glottal opening and closing instants are related to peaks in the signal; the latter can be used to measure glottal parameters such as fundamental frequency and open quotient. In some cases, peaks are doubled or imprecise, which points to special (but by no means uncommon) glottal configurations. A correlation-based algorithm for the automatic measurement of fundamental frequency and open quotient using the derivative of electroglottographic signals is proposed. It is compared to three other electroglottographic-based methods with regard to the measurement of open quotient in inverse-filtered derived glottal flow. It is shown that agreement with the glottal-flow measurements is much better than most threshold-based measurements in the case of sustained sounds.
Collapse
Affiliation(s)
- Nathalie Henrich
- LAM (UPMC, CNRS, Ministère de la culture), 11 rue de Lourmel, 75015 Paris, France.
| | | | | | | |
Collapse
|
31
|
Ilgner JFR, Palm C, Schütz AG, Spitzer K, Westhofen M, Lehmann TM. Colour texture analysis for quantitative laryngoscopy. Acta Otolaryngol 2003; 123:730-4. [PMID: 12953773 DOI: 10.1080/00016480310000412] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
Abstract
OBJECTIVE Whilst considerable progress has been made in enhancing the quality of indirect laryngoscopy and image processing, the evaluation of clinical findings is still based on the clinician's judgement. The aim of this paper was to examine the feasibility of an objective computer-based method for evaluating laryngeal disease. MATERIAL AND METHODS Digitally recorded images obtained by 90 degree- and 70 degree-angled indirect rod laryngoscopy using standardized white balance values were made of 16 patients and 19 healthy subjects. The digital images were evaluated manually by the clinician based on a standardized questionnaire, and suspect lesions were marked and classified on the image. Following colour separation, normal vocal cord areas as well as suspect lesions were analyzed automatically using co-occurrence matrices, which compare colour differences between neighbouring pixels over a predefined distance. RESULTS Whilst colour histograms did not provide sufficient information for distinguishing between healthy and diseased tissues, consideration of the blue content of neighbouring pixels enabled a correct classification in 81.4% of cases. If all colour channels (red, green and blue) were regarded simultaneously, the best classification correctness obtained was 77.1%. CONCLUSIONS Although only a very basic classification differentiating between healthy and diseased tissue was attempted, the results showed progress compared to grey-scale histograms, which have been evaluated before. The results document a first step towards an objective, machine-based classification of laryngeal images, which could provide the basis for further development of an expert system for use in indirect laryngoscopy.
Collapse
Affiliation(s)
- Justus F R Ilgner
- Department of Otorhinolaryngology, Plastic Head and Neck Surgery, University Hospital Aachen, Aachen, Germany.
| | | | | | | | | | | |
Collapse
|
32
|
Hess MM, Ludwigs M, Kobler JB, Schade G. Imaging of the larynx--extending the use of stroboscopy-related techniques. LOGOP PHONIATR VOCO 2003; 27:50-8. [PMID: 12487401 DOI: 10.1080/140154302760409257] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
Three methods (I-III) are presented to demonstrate how technical modifications in the use of stroboscopy-related techniques can be applied to further analyze vocal fold vibrations. (I) With double exposure videostroboscopy, dynamic properties of vocal folds can be visualized within one single image. This allows for estimations of horizontal vibratory velocity of the vocal fold margins. (II) Stroboscopic transillumination of the larynx makes it possible to localize initial vocal fold opening in the horizontal glottal plane and to visualize different vibratory opening patterns. Bidirectional color-coded endoscopic imaging of the larynx facilitates a semi-automated, digital measurement of the glottal area. (III) Stroboscopic illumination using four light emitting diodes (LEDs) clipped on a rigid endoscope and triggered by a portable 'pocket-sized', battery-powered electronic control device offers new prospects for performing stroboscopy outside of specialized clinical facilities.
Collapse
Affiliation(s)
- Markus M Hess
- Department of Phoniatrics and Pedaudiology, University Clinics Hamburg-Eppendorf, University of Hamburg, Martinistrasse 52, D-20246 Hamburg, Germany.
| | | | | | | |
Collapse
|