1
|
Bullock L, Toles LE, Hillman RE, Mehta DD. Acoustic-Aerodynamic Voice Outcome Ratios Identify Changes in Vocal Function Following Vocal Fold Medialization for Unilateral Vocal Fold Paralysis. J Voice 2023:S0892-1997(23)00104-2. [PMID: 37068982 PMCID: PMC10576834 DOI: 10.1016/j.jvoice.2023.03.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2023] [Revised: 03/09/2023] [Accepted: 03/10/2023] [Indexed: 04/19/2023]
Abstract
PURPOSE This study aimed to determine whether ratio-based measures that combine acoustic (output) and aerodynamic (input) parameters detect postoperative change in vocal function following vocal fold medialization for unilateral vocal fold paralysis. METHOD Pre- and postoperative acoustic and aerodynamic measures were analyzed retrospectively from 149 patients who underwent vocal fold medialization for unilateral vocal fold paralysis. A 2 × 2 repeated-measures analysis of variance was conducted for each of four acoustic-aerodynamic ratios-traditional vocal efficiency (VE), sound pressure level to aerodynamic power (SPL/AP), SPL to average airflow (SPL/AFLOW), and SPL to subglottal pressure (SPL/Ps)-to investigate the main effects and interaction of treatment stage and loudness level (comfortable and loud). RESULTS The patient group showed significant postoperative improvements in self reports of vocal function (voice-related quality of life) and clinical auditory-perceptual judgments of dysphonia (consensus auditory-perceptual evaluation of voice). Main effects for both treatment stage and loudness level were statistically significant for all measures except SPL/Ps. There were interaction effects for VE and SPL/AP, suggesting that magnitude of the treatment effect differs based on loudness. SPL/AFLOW had medium-to-large effect sizes in both loudness conditions. There were postoperative changes in SPL/Ps that were dependent on the magnitude of the reduction in AFLOW; as expected, SPL/Ps increased postoperatively in a subgroup that had large postoperative reductions in AFLOW at the comfortable loudness level. CONCLUSIONS Acoustic-aerodynamic ratios can aid in tracking changes in vocal function following vocal fold medialization. SPL/AFLOW exhibited the largest effect size, which is expected since a reduction in abnormally high AFLOW typically accompanies the increased modulation of glottal air flow associated with successful vocal fold medialization. Future study is needed to model physiological changes in acoustic-aerodynamic voice outcome ratios across different types of voice disorders.
Collapse
Affiliation(s)
- Latané Bullock
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, Massachusetts; Division of Medical Sciences, Speech and Hearing Bioscience and Technology, Harvard Medical School, Boston, Massachusetts
| | - Laura E Toles
- Department of Otolaryngology-Head and Neck Surgery, The University of Texas Southwestern Medical Center, Dallas, Texas
| | - Robert E Hillman
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, Massachusetts; Division of Medical Sciences, Speech and Hearing Bioscience and Technology, Harvard Medical School, Boston, Massachusetts; Department of Surgery, Harvard Medical School, Boston, Massachusetts; School of Health and Rehabilitation Sciences, MGH Institute of Health Professions, Boston, Massachusetts
| | - Daryush D Mehta
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, Massachusetts; Division of Medical Sciences, Speech and Hearing Bioscience and Technology, Harvard Medical School, Boston, Massachusetts; Department of Surgery, Harvard Medical School, Boston, Massachusetts; School of Health and Rehabilitation Sciences, MGH Institute of Health Professions, Boston, Massachusetts.
| |
Collapse
|
2
|
Cortés JP, Lin JZ, Marks KL, Espinoza VM, Ibarra EJ, Zañartu M, Hillman RE, Mehta DD. Ambulatory Monitoring of Subglottal Pressure Estimated from Neck-Surface Vibration in Individuals with and without Voice Disorders. APPLIED SCIENCES (BASEL, SWITZERLAND) 2022; 12:10692. [PMID: 36777332 PMCID: PMC9910342 DOI: 10.3390/app122110692] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/18/2023]
Abstract
The aerodynamic voice assessment of subglottal air pressure can discriminate between speakers with typical voices from patients with voice disorders, with further evidence validating subglottal pressure as a clinical outcome measure. Although estimating subglottal pressure during phonation is an important component of a standard voice assessment, current methods for estimating subglottal pressure rely on non-natural speech tasks in a clinical or laboratory setting. This study reports on the validation of a method for subglottal pressure estimation in individuals with and without voice disorders that can be translated to connected speech to enable the monitoring of vocal function and behavior in real-world settings. During a laboratory calibration session, a participant-specific multiple regression model was derived to estimate subglottal pressure from a neck-surface vibration signal that can be recorded during natural speech production. The model was derived for vocally typical individuals and patients diagnosed with phonotraumatic vocal fold lesions, primary muscle tension dysphonia, and unilateral vocal fold paralysis. Estimates of subglottal pressure using the developed method exhibited significantly lower error than alternative methods in the literature, with average errors ranging from 1.13 to 2.08 cm H2O for the participant groups. The model was then applied during activities of daily living, thus yielding ambulatory estimates of subglottal pressure for the first time in these populations. Results point to the feasibility and potential of real-time monitoring of subglottal pressure during an individual's daily life for the prevention, assessment, and treatment of voice disorders.
Collapse
Affiliation(s)
- Juan P. Cortés
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA 02114, USA
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso 2390123, Chile
| | - Jon Z. Lin
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA 02114, USA
| | - Katherine L. Marks
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA 02114, USA
- Communication Sciences and Disorders, MGH Institute of Health Professions, Boston, MA 02129, USA
- Speech, Language & Hearing Sciences Department, College of Health & Rehabilitation: Sargent College, Boston University, Boston, MA 02215, USA
| | | | - Emiro J. Ibarra
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso 2390123, Chile
| | - Matías Zañartu
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso 2390123, Chile
| | - Robert E. Hillman
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA 02114, USA
- Communication Sciences and Disorders, MGH Institute of Health Professions, Boston, MA 02129, USA
- Department of Surgery, Massachusetts General Hospital–Harvard Medical School, Boston, MA 02114, USA
- Speech and Hearing Bioscience and Technology, Division of Medical Sciences, Harvard Medical School, Boston, MA 02115, USA
| | - Daryush D. Mehta
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA 02114, USA
- Communication Sciences and Disorders, MGH Institute of Health Professions, Boston, MA 02129, USA
- Department of Surgery, Massachusetts General Hospital–Harvard Medical School, Boston, MA 02114, USA
- Speech and Hearing Bioscience and Technology, Division of Medical Sciences, Harvard Medical School, Boston, MA 02115, USA
| |
Collapse
|
3
|
Toles LE, Seidman AY, Hillman RE, Mehta DD. Clinical Utility of the Ratio of Sound Pressure Level to Subglottal Pressure in Patients Surgically Treated for Phonotraumatic Vocal Fold Lesions. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2022; 65:2778-2788. [PMID: 35914023 PMCID: PMC9911101 DOI: 10.1044/2022_jslhr-21-00658] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/13/2021] [Revised: 04/07/2022] [Accepted: 05/02/2022] [Indexed: 06/15/2023]
Abstract
PURPOSE This study aimed to determine whether a simplified, and potentially more stable, acoustic-aerodynamic voice outcome ratio (ratio of sound pressure level [SPL] to subglottal pressure) is comparable to a traditional vocal efficiency measure (ratio of acoustic power to the product of average subglottal pressure and average phonatory airflow) in terms of the ability to detect change in vocal function following surgical removal of bilateral phonotraumatic lesions. METHOD Pre- and postoperative acoustic and aerodynamic measures were analyzed retrospectively from 75 female patients who underwent surgical removal of bilateral phonotraumatic lesions. A 2 × 2 repeated-measures analysis of variance was conducted for each of three acoustic-aerodynamic voice outcome ratios-traditional vocal efficiency, an SPL-based ratio with both airflow and subglottal pressure, and a simplified SPL-based ratio with subglottal pressure only-to investigate the main effects of treatment stage (pre- and postsurgery), loudness condition (comfortable and loud), and their interaction. Post hoc paired samples t tests were conducted for statistically significant interactions. The within-subject variability of the measures was assessed using the coefficient of variation. RESULTS Although exhibiting an expected main effect of loudness (higher values in the loud condition), the traditional vocal efficiency ratio did not exhibit a main effect of treatment. For both SPL-based ratios, there were significant main effects of treatment stage (higher values postoperatively) and loudness condition (lower values in the loud condition). Within-subject, postoperative changes in the two SPL-based ratios moderately correlated with one another. The simplified ratio of SPL to subglottal pressure (without airflow) exhibited the least within-subject variability relative to the other two acoustic-aerodynamic ratios. CONCLUSIONS These findings indicate that SPL-based acoustic-aerodynamic voice outcome ratios increase significantly following the surgical removal of phonotraumatic vocal fold lesions. The simplified ratio of SPL to subglottal pressure exhibits the least variability and can be easily obtained without requiring the measurement of airflow.
Collapse
Affiliation(s)
- Laura E. Toles
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
- The University of Texas Southwestern Medical Center, Dallas
| | | | - Robert E. Hillman
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
- Harvard Medical School, Boston, MA
| | - Daryush D. Mehta
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
- Harvard Medical School, Boston, MA
| |
Collapse
|
4
|
Angelakis E, Kotsani N, Georgaki A. Towards a Singing Voice Multi-Sensor Analysis Tool: System Design, and Assessment Based on Vocal Breathiness. SENSORS 2021; 21:s21238006. [PMID: 34884019 PMCID: PMC8659512 DOI: 10.3390/s21238006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/01/2021] [Revised: 11/14/2021] [Accepted: 11/19/2021] [Indexed: 11/16/2022]
Abstract
Singing voice is a human quality that requires the precise coordination of numerous kinetic functions and results in a perceptually variable auditory outcome. The use of multi-sensor systems can facilitate the study of correlations between the vocal mechanism kinetic functions and the voice output. This is directly relevant to vocal education, rehabilitation, and prevention of vocal health issues in educators; professionals; and students of singing, music, and acting. In this work, we present the initial design of a modular multi-sensor system for singing voice analysis, and describe its first assessment experiment on the ‘vocal breathiness’ qualitative characteristic. A system case study with two professional singers was conducted, utilizing signals from four sensors. Participants sung a protocol of vocal trials in various degrees of intended vocal breathiness. Their (i) vocal output, (ii) phonatory function, and (iii) respiratory behavior-per-condition were recorded through a condenser microphone (CM), an Electroglottograph (EGG), and thoracic and abdominal respiratory effort transducers (RET), respectively. Participants’ individual respiratory management strategies were studied through qualitative analysis of RET data. Microphone audio samples breathiness degree was rated perceptually, and correlation analysis was performed between sample ratings and parameters extracted from CM and EGG data. Smoothed Cepstral Peak Prominence (CPPS) and vocal folds’ Open Quotient (OQ), as computed with the Howard method (HOQ), demonstrated the higher correlation coefficients, when analyzed individually. DECOM method-computed OQ (DOQ) was also examined. Interestingly, the correlation coefficient of pitch difference between estimates from CM and EGG signals appeared to be (based on the Pearson correlation coefficient) statistically insignificant (a result that warrants investigation in larger populations). The study of multi-variate models revealed even higher correlation coefficients. Models studied were the Acoustic Breathiness Index (ABI) and the proposed multiple regression model CDH (CPPS, DOQ, and HOQ), which was attempted in order to combine analysis results from microphone and EGG signals. The model combination of ABI and the proposed CDH appeared to yield the highest correlation with perceptual breathiness ratings. Study results suggest potential for the use of a completed system version in vocal pedagogy and research, as the case study indicated system practicality, a number of pertinent correlations, and introduced topics with further research possibilities.
Collapse
|
5
|
Van Stan JH, Ortiz AJ, Marks KL, Toles LE, Mehta DD, Burns JA, Hron T, Stadelman-Cohen T, Krusemark C, Muise J, Fox AB, Nudelman C, Zeitels S, Hillman RE. Changes in the Daily Phonotrauma Index Following the Use of Voice Therapy as the Sole Treatment for Phonotraumatic Vocal Hyperfunction in Females. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2021; 64:3446-3455. [PMID: 34463536 PMCID: PMC8642084 DOI: 10.1044/2021_jslhr-21-00082] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/12/2021] [Revised: 04/26/2021] [Accepted: 05/24/2021] [Indexed: 05/31/2023]
Abstract
Purpose The aim of this study was to use the Daily Phonotrauma Index (DPI) to quantify group-based changes in the daily voice use of patients with phonotraumatic vocal hyperfunction (PVH) after receiving voice therapy as the sole treatment. This is part of an ongoing effort to validate an updated theoretical framework for PVH. Method A custom-designed ambulatory voice monitor was used to collect 1 week of pre- and posttreatment data from 52 female patients with PVH. Normative weeklong data were also obtained from 52 matched controls. Each week was represented by the DPI, which is a combination of neck-surface acceleration magnitude skewness and the standard deviation of the difference between the first and second harmonic magnitudes. Results Compared to pretreatment, the DPI statistically decreased towards normal in the patient group after treatment (Cohen's d = -0.25). The posttreatment patient group's DPI was still significantly higher than the control group (d = 0.68). Conclusions The DPI showed the pattern of improved ambulatory voice use in a group of patients with PVH following voice therapy that was predicted by the updated theoretical framework. Per the prediction, voice therapy was associated with a decreased potential for phonotrauma in daily voice use, but the posttreatment patient group data were still significantly different from the normative control group data. This posttreatment difference is interpreted as reflecting the impact on voice use of the persistence of phonotrauma-induced structural changes to the vocal folds. Further validation of the DPI is needed to better understand its potential clinical use.
Collapse
Affiliation(s)
- Jarrad H. Van Stan
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| | | | - Katherine L. Marks
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Laura E. Toles
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Daryush D. Mehta
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| | - James A. Burns
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| | - Tiffiny Hron
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| | - Tara Stadelman-Cohen
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Carol Krusemark
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Jason Muise
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | | | - Charles Nudelman
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Steven Zeitels
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| | - Robert E. Hillman
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| |
Collapse
|
6
|
Ibarra EJ, Parra JA, Alzamendi GA, Cortés JP, Espinoza VM, Mehta DD, Hillman RE, Zañartu M. Estimation of Subglottal Pressure, Vocal Fold Collision Pressure, and Intrinsic Laryngeal Muscle Activation From Neck-Surface Vibration Using a Neural Network Framework and a Voice Production Model. Front Physiol 2021; 12:732244. [PMID: 34539451 PMCID: PMC8440844 DOI: 10.3389/fphys.2021.732244] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2021] [Accepted: 08/09/2021] [Indexed: 11/23/2022] Open
Abstract
The ambulatory assessment of vocal function can be significantly enhanced by having access to physiologically based features that describe underlying pathophysiological mechanisms in individuals with voice disorders. This type of enhancement can improve methods for the prevention, diagnosis, and treatment of behaviorally based voice disorders. Unfortunately, the direct measurement of important vocal features such as subglottal pressure, vocal fold collision pressure, and laryngeal muscle activation is impractical in laboratory and ambulatory settings. In this study, we introduce a method to estimate these features during phonation from a neck-surface vibration signal through a framework that integrates a physiologically relevant model of voice production and machine learning tools. The signal from a neck-surface accelerometer is first processed using subglottal impedance-based inverse filtering to yield an estimate of the unsteady glottal airflow. Seven aerodynamic and acoustic features are extracted from the neck surface accelerometer and an optional microphone signal. A neural network architecture is selected to provide a mapping between the seven input features and subglottal pressure, vocal fold collision pressure, and cricothyroid and thyroarytenoid muscle activation. This non-linear mapping is trained solely with 13,000 Monte Carlo simulations of a voice production model that utilizes a symmetric triangular body-cover model of the vocal folds. The performance of the method was compared against laboratory data from synchronous recordings of oral airflow, intraoral pressure, microphone, and neck-surface vibration in 79 vocally healthy female participants uttering consecutive /pæ/ syllable strings at comfortable, loud, and soft levels. The mean absolute error and root-mean-square error for estimating the mean subglottal pressure were 191 Pa (1.95 cm H2O) and 243 Pa (2.48 cm H2O), respectively, which are comparable with previous studies but with the key advantage of not requiring subject-specific training and yielding more output measures. The validation of vocal fold collision pressure and laryngeal muscle activation was performed with synthetic values as reference. These initial results provide valuable insight for further vocal fold model refinement and constitute a proof of concept that the proposed machine learning method is a feasible option for providing physiologically relevant measures for laboratory and ambulatory assessment of vocal function.
Collapse
Affiliation(s)
- Emiro J. Ibarra
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso, Chile
- School of Electrical Engineering, University of the Andes, Mérida, Venezuela
| | - Jesús A. Parra
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso, Chile
| | - Gabriel A. Alzamendi
- Institute for Research and Development on Bioengineering and Bioinformatics, Consejo Nacional de Investigaciones Científicas y Técnicas - Universidad Nacional de Entre Ríos, Oro Verde, Argentina
| | - Juan P. Cortés
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso, Chile
- Center for Laryngeal Surgery and Voice Rehabilitation Laboratory, Massachusetts General Hospital–Harvard Medical School, Boston, MA, United States
| | - Víctor M. Espinoza
- Department of Sound, Faculty of Arts, University of Chile, Santiago, Chile
| | - Daryush D. Mehta
- Center for Laryngeal Surgery and Voice Rehabilitation Laboratory, Massachusetts General Hospital–Harvard Medical School, Boston, MA, United States
| | - Robert E. Hillman
- Center for Laryngeal Surgery and Voice Rehabilitation Laboratory, Massachusetts General Hospital–Harvard Medical School, Boston, MA, United States
| | - Matías Zañartu
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso, Chile
| |
Collapse
|
7
|
Mehta DD, Kobler JB, Zeitels SM, Zañartu M, Ibarra EJ, Alzamendi GA, Manriquez R, Erath BD, Peterson SD, Petrillo RH, Hillman RE. Direct measurement and modeling of intraglottal, subglottal, and vocal fold collision pressures during phonation in an individual with a hemilaryngectomy. APPLIED SCIENCES (BASEL, SWITZERLAND) 2021; 11:7256. [PMID: 36210866 PMCID: PMC9541559 DOI: 10.3390/app11167256] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
The purpose of this paper is to report on the first in vivo application of a recently developed transoral, dual-sensor pressure probe that directly measures intraglottal, subglottal, and vocal fold collision pressures during phonation. Synchronous measurement of intraglottal and subglottal pressures was accomplished using two miniature pressure sensors mounted on the end of the probe and inserted transorally in a 78-year-old male who had previously undergone surgical removal of his right vocal fold for treatment of laryngeal cancer. The endoscopist used one hand to position the custom probe against the surgically medialized scar band that replaced the right vocal fold and used the other hand to position a transoral endoscope to record laryngeal high-speed videoendoscopy of the vibrating left vocal fold contacting the pressure probe. Visualization of the larynx during sustained phonation allowed the endoscopist to place the dual-sensor pressure probe such that the proximal sensor was positioned intraglottally and the distal sensor subglottally. The proximal pressure sensor was verified to be in the strike zone of vocal fold collision during phonation when the intraglottal pressure signal exhibited three characteristics: an impulsive peak at the start of the closed phase, rounded peak during the open phase, and minimum value around zero immediately preceding the impulsive peak of the subsequent phonatory cycle. Numerical voice production modeling was applied to validate model-based predictions of vocal fold collision pressure using kinematic vocal fold measures. The results successfully demonstrated feasibility of in vivo measurement of vocal fold collision pressure in an individual with a hemilaryngectomy, motivating ongoing data collection that is designed to aid in the development of vocal dose measures that incorporate vocal fold impact collision and stresses.
Collapse
Affiliation(s)
- Daryush D. Mehta
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA, USA
- Department of Surgery, Massachusetts General Hospital–Harvard Medical School, Boston, MA
- Speech and Hearing Bioscience and Technology, Division of Medical Sciences, Harvard Medical School, Boston, MA, USA
- MGH Institute of Health Professions, Boston, MA, USA
| | - James B. Kobler
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA, USA
- Department of Surgery, Massachusetts General Hospital–Harvard Medical School, Boston, MA
- Speech and Hearing Bioscience and Technology, Division of Medical Sciences, Harvard Medical School, Boston, MA, USA
| | - Steven M. Zeitels
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA, USA
- Department of Surgery, Massachusetts General Hospital–Harvard Medical School, Boston, MA
- Speech and Hearing Bioscience and Technology, Division of Medical Sciences, Harvard Medical School, Boston, MA, USA
| | - Matías Zañartu
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso, Chile
| | - Emiro J. Ibarra
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso, Chile
| | - Gabriel A. Alzamendi
- Institute for Research and Development on Bioengineering and Bioinformatics, National University of Entre Rios–CONICET, Entre Ríos, Argentina
| | - Rodrigo Manriquez
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso, Chile
| | - Byron D. Erath
- Department of Mechanical & Aeronautical Engineering, Clarkson University, Potsdam, NY, USA
| | - Sean D. Peterson
- Department of Mechanical and Mechatronics Engineering, University of Waterloo, Ontario, Canada
| | - Robert H. Petrillo
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA, USA
| | - Robert E. Hillman
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA, USA
- Department of Surgery, Massachusetts General Hospital–Harvard Medical School, Boston, MA
- Speech and Hearing Bioscience and Technology, Division of Medical Sciences, Harvard Medical School, Boston, MA, USA
- MGH Institute of Health Professions, Boston, MA, USA
| |
Collapse
|
8
|
Umatani M, Ogawa M, Hosokawa K, Kato C, Okajima E, Iwahashi T, Inohara H. Skin Acceleration Levels Estimated by a Neck-surface Accelerometer during Phonation Are Affected by The Mechanical Properties of The Anterior Cervical Skin. J Voice 2021:S0892-1997(21)00111-9. [PMID: 34011459 DOI: 10.1016/j.jvoice.2021.03.016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2021] [Revised: 03/25/2021] [Accepted: 03/30/2021] [Indexed: 10/21/2022]
Abstract
OBJECTIVES The aim of the present study was to assess whether skin acceleration levels (SAL) estimated by a neck-surface accelerometer (ACC) are affected by the anterior neck skin condition. METHODS Each of six healthy non-dysphonic participants wore a headset microphone (MIC), had an ACC sensor fitted on the skin over the cervical trachea, and were subsequently asked to gradually increase the vocal loudness during sustained phonation of the vowel /e:/ (crescendo task), while the sound pressure levels on a sound level meter (SPLSLM) and MIC/ACC signals were simultaneously recorded. Root mean squared values were calculated from the MIC and ACC signals as the sound pressure level (SPLMIC) and SAL, respectively, and the relationships between SPLSLM and SAL were compared between neck anteflexion and retroflexion or between outward and inward skin retraction on both sides of the sensor. RESULTS In the total samples for the successful crescendo performance in a natural head position, the SPLMIC and SPLSLM showed a strong linear correlation (r=0.980), whereas the correlation between the SAL and SPLSLM showed a distorted regression line (r=0.765) with individual differences. In all participants, the anteflexion and inward skin retraction decreased the SAL value at the same SPLSLM value, whereas the retroflexion increased the SAL value at the same SPLSLM value. CONCLUSION These results demonstrate that the signal intensity of a neck-surface ACC is affected by the condition of the anterior cervical skin, perhaps leading to inter-individual variability in SAL measurements.
Collapse
Affiliation(s)
- Masanori Umatani
- Department of Otorhinolaryngology - Head and Neck Surgery, Osaka University Graduate School of Medicine, Suita, Japan
| | - Makoto Ogawa
- Department of Otorhinolaryngology - Head and Neck Surgery, Osaka University Graduate School of Medicine, Suita, Japan; Department of Otorhinolaryngology, JCHO Osaka Hospital, Osaka, Japan.
| | - Kiyohito Hosokawa
- Department of Otorhinolaryngology - Head and Neck Surgery, Osaka University Graduate School of Medicine, Suita, Japan
| | - Chieri Kato
- Department of Otorhinolaryngology - Head and Neck Surgery, Osaka University Graduate School of Medicine, Suita, Japan
| | - Eri Okajima
- Department of Otorhinolaryngology - Head and Neck Surgery, Osaka University Graduate School of Medicine, Suita, Japan
| | - Toshihiko Iwahashi
- Department of Otorhinolaryngology - Head and Neck Surgery, Osaka University Graduate School of Medicine, Suita, Japan
| | - Hidenori Inohara
- Department of Otorhinolaryngology - Head and Neck Surgery, Osaka University Graduate School of Medicine, Suita, Japan
| |
Collapse
|
9
|
Van Stan JH, Ortiz AJ, Cortes JP, Marks KL, Toles LE, Mehta DD, Burns JA, Hron T, Stadelman-Cohen T, Krusemark C, Muise J, Fox-Galalis AB, Nudelman C, Zeitels S, Hillman RE. Differences in Daily Voice Use Measures Between Female Patients With Nonphonotraumatic Vocal Hyperfunction and Matched Controls. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2021; 64:1457-1470. [PMID: 33900807 PMCID: PMC8608188 DOI: 10.1044/2021_jslhr-20-00538] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/15/2023]
Abstract
Purpose The purpose of this study was to obtain a more comprehensive understanding of the pathophysiology and impact on daily voice use of nonphonotraumatic vocal hyperfunction (NPVH). Method An ambulatory voice monitor collected 1 week of data from 36 patients with NPVH and 36 vocally healthy matched controls. A subset of 11 patients with NPVH were monitored after voice therapy. Daily voice use measures included neck-skin acceleration magnitude, fundamental frequency (f o), cepstral peak prominence (CPP), and the difference between the first and second harmonic magnitudes (H1-H2). Additional comparisons included 118 patients with phonotraumatic vocal hyperfunction (PVH) and 89 additional vocally healthy controls. Results The NPVH group, compared to the matched control group, exhibited increased f o (Cohen's d = 0.6), reduced CPP (d = -0.9), and less positive H1-H2 skewness (d = -1.1). Classifiers used CPP mean and H1-H2 mode to maximally differentiate the NPVH and matched control groups (area under the receiver operating characteristic curve of 0.78). Classifiers performed well on unseen data: the logit decreased in patients with NPVH after therapy; ≥ 85% of the control and PVH groups were identified as "normal" or "not NPVH," respectively. Conclusions The NPVH group's daily voice use is less periodic (CPP), is higher pitched (f o), and has less abrupt vocal fold closure (H1-H2 skew) compared to the matched control group. The combination of CPP mean and H1-H2 mode appears to reflect a pathophysiological continuum in NPVH patients of inefficient phonation with minimal potential for phonotrauma. Further validation of the classification model is needed to better understand potential clinical uses. Supplemental Material https://doi.org/10.23641/asha.14390771.
Collapse
Affiliation(s)
- Jarrad H. Van Stan
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| | | | - Juan P. Cortes
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| | - Katherine L. Marks
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Laura E. Toles
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Daryush D. Mehta
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| | - James A. Burns
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| | - Tiffiny Hron
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| | - Tara Stadelman-Cohen
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Carol Krusemark
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Jason Muise
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | | | - Charles Nudelman
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Steven Zeitels
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| | - Robert E. Hillman
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| |
Collapse
|
10
|
Umatani M, Ogawa M, Hosokawa K, Kato C, Okajima E, Iwahashi T, Inohara H. The Feasibility of Gastroesophageal Manometry for Continuously Evaluating the Degree of Expiratory Effort During Successful Crescendo Phonation. J Voice 2021; 37:470.e7-470.e16. [PMID: 33707030 DOI: 10.1016/j.jvoice.2021.02.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2020] [Revised: 01/23/2021] [Accepted: 02/01/2021] [Indexed: 10/22/2022]
Abstract
OBJECTIVES This study aimed to assess the feasibility of gastroesophageal manometry for continuously evaluating the degree of expiratory effort by measuring the pressures in the digestive tract during crescendo phonation. METHODS Each of 18 healthy nondysphonic speakers had a probe with a four-channel gastroesophageal manometer inserted through the nasal cavity to place four pressure sensors in the hypopharynx, cervical-/thoracic esophagus, and stomach, and was asked to gradually increase the vocal loudness during sustained phonation of the vowel /e:/ (vowel-crescendo task), while the sound pressure level and the pressures were simultaneously recorded. RESULTS 50% of the successful vowel-crescendo task samples with a gradual and adequate sound pressure level increase showed a concomitant gradual increase in both the intra-thoracic-esophageal/intra-gastric pressure values from approximately -5 mmHg /6 mmHg to -10 mmHg/20 mmHg, respectively. The maximum pressure value was the highest in the intra-gastric pressure followed by the intra-thoracic-esophageal and intra-cervical-esophageal pressures in order. However, most of the samples showed less than one of atypical pressure changes, such as fluctuations in the intra-thoracic-esophageal and intra-gastric pressure changes and dispersion in the intra-cervical-esophageal and intra-hypopharyngeal pressure values (perhaps due to the peristaltic motions, and the contact of the sensors to the membranous wall). CONCLUSION These results show that, during successful crescendo phonation, gastroesophageal manometry reveals a gradual increase in the intra-thoracic and intra-abdominal pressures with increasing the vocal intensity, even though showing some systematic errors, suggesting the usefulness of gastroesophageal manometry for continuously evaluating the degree of expiratory effort without influence by the laryngeal condition.
Collapse
Affiliation(s)
- Masanori Umatani
- Department of Otorhinolaryngology - Head and Neck Surgery, Osaka University Graduate School of Medicine, Suita, Japan
| | - Makoto Ogawa
- Department of Otorhinolaryngology - Head and Neck Surgery, Osaka University Graduate School of Medicine, Suita, Japan; Department of Otorhinolaryngology, JCHO Osaka Hospital, Osaka, Japan.
| | - Kiyohito Hosokawa
- Department of Otorhinolaryngology - Head and Neck Surgery, Osaka University Graduate School of Medicine, Suita, Japan
| | - Chieri Kato
- Department of Otorhinolaryngology - Head and Neck Surgery, Osaka University Graduate School of Medicine, Suita, Japan
| | - Eri Okajima
- Department of Otorhinolaryngology - Head and Neck Surgery, Osaka University Graduate School of Medicine, Suita, Japan
| | - Toshihiko Iwahashi
- Department of Otorhinolaryngology - Head and Neck Surgery, Osaka University Graduate School of Medicine, Suita, Japan
| | - Hidenori Inohara
- Department of Otorhinolaryngology - Head and Neck Surgery, Osaka University Graduate School of Medicine, Suita, Japan
| |
Collapse
|
11
|
Toles LE, Ortiz AJ, Marks KL, Burns JA, Hron T, Van Stan JH, Mehta DD, Hillman RE. Differences Between Female Singers With Phonotrauma and Vocally Healthy Matched Controls in Singing and Speaking Voice Use During 1 Week of Ambulatory Monitoring. AMERICAN JOURNAL OF SPEECH-LANGUAGE PATHOLOGY 2021; 30:199-209. [PMID: 33472007 PMCID: PMC8740583 DOI: 10.1044/2020_ajslp-20-00227] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/15/2023]
Abstract
Purpose Previous ambulatory voice monitoring studies have included many singers and have combined speech and singing in the analyses. This study applied a singing classifier to the ambulatory recordings of singers with phonotrauma and healthy controls to determine if analyzing speech and singing separately would reveal voice use differences that could provide new insights into the etiology and pathophysiology of phonotrauma in this at-risk population. Method Forty-two female singers with phonotrauma (vocal fold nodules or polyps) and 42 healthy matched controls were monitored using an ambulatory voice monitor. Weeklong statistics (average, standard deviation, skewness, kurtosis) for sound pressure level (SPL), fundamental frequency, cepstral peak prominence, the magnitude ratio of the first two harmonics (H1-H2 ), and three vocal dose measures were computed from the neck surface acceleration signal and separated into singing and speech using a singing classifier. Results Mixed analysis of variance models found expected differences between singing and speech in each voice parameter, except SPL kurtosis. SPL skewness, SPL kurtosis, and all H1-H2 distributional parameters differentiated patients and controls when singing and speech were combined. Interaction effects were found in H1-H2 kurtosis and all vocal dose measures. Patients had significantly higher vocal doses in speech compared to controls. Conclusions Consistent with prior work, the pathophysiology of phonotrauma in singers is characterized by more abrupt/complete glottal closure (decreased mean and variation for H1-H2 ) and increased laryngeal forces (negatively skewed SPL distribution) during phonation. Application of a singing classifier to weeklong data revealed that singers with phonotrauma spent more time speaking on a weekly basis, but not more time singing, compared to controls. Results are used as a basis for hypothesizing about the role of speaking voice in the etiology of phonotraumatic vocal hyperfunction in singers.
Collapse
Affiliation(s)
- Laura E. Toles
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | | | - Katherine L. Marks
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - James A. Burns
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| | - Tiffiny Hron
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| | - Jarrad H. Van Stan
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
- Harvard Medical School, Boston, MA
| | - Daryush D. Mehta
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
- Harvard Medical School, Boston, MA
| | - Robert E. Hillman
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
- Harvard Medical School, Boston, MA
| |
Collapse
|
12
|
Van Stan JH, Mehta DD, Ortiz AJ, Burns JA, Marks KL, Toles LE, Stadelman-Cohen T, Krusemark C, Muise J, Hron T, Zeitels SM, Fox AB, Hillman RE. Changes in a Daily Phonotrauma Index After Laryngeal Surgery and Voice Therapy: Implications for the Role of Daily Voice Use in the Etiology and Pathophysiology of Phonotraumatic Vocal Hyperfunction. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2020; 63:3934-3944. [PMID: 33197360 PMCID: PMC8608140 DOI: 10.1044/2020_jslhr-20-00168] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/15/2023]
Abstract
Purpose This study attempts to gain insights into the role of daily voice use in the etiology and pathophysiology of phonotraumatic vocal hyperfunction (PVH) by applying a logistic regression-based daily phonotrauma index (DPI) to predict group-based improvements in patients with PVH after laryngeal surgery and/or postsurgical voice therapy. Method A custom-designed ambulatory voice monitor was used to collect 1 week of pre- and postsurgery data from 27 female patients with PVH; 13 of these patients were also monitored after postsurgical voice therapy. Normative weeklong data were obtained from 27 matched controls. Each week was represented by the DPI, standard deviation of the difference between the first and second harmonic amplitudes (H1-H2). Results Compared to pretreatment, the DPI significantly decreased in the patient group after surgery (Cohen's d effect size = -0.86) and voice therapy (d = -1.06). The patient group DPI only normalized after voice therapy. Conclusions The DPI produced the expected pattern of improved ambulatory voice use across laryngeal surgery and postsurgical voice therapy in a group of patients with PVH. The results were interpreted as providing new objective information about the role of daily voice use in the etiology and pathophysiology of PVH. The DPI is viewed as an estimate of potential vocal fold trauma that relies on combining the long-term distributional characteristics of two parameters representing the magnitude of phonatory forces (neck-surface acceleration magnitude) and vocal fold closure dynamics (H1-H2). Further validation of the DPI is needed to better understand its potential clinical use.
Collapse
Affiliation(s)
- Jarrad H. Van Stan
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| | - Daryush D. Mehta
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| | | | - James A. Burns
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| | - Katherine L. Marks
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Laura E. Toles
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Tara Stadelman-Cohen
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Carol Krusemark
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Jason Muise
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Tiffiny Hron
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| | - Steven M. Zeitels
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| | | | - Robert E. Hillman
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| |
Collapse
|
13
|
Espinoza VM, Mehta DD, Van Stan JH, Hillman RE, Zañartu M. Glottal Aerodynamics Estimated From Neck-Surface Vibration in Women With Phonotraumatic and Nonphonotraumatic Vocal Hyperfunction. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2020; 63:2861-2869. [PMID: 32755502 PMCID: PMC7890221 DOI: 10.1044/2020_jslhr-20-00189] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/20/2020] [Revised: 05/29/2020] [Accepted: 06/10/2020] [Indexed: 05/06/2023]
Abstract
Purpose The purpose of this study was to determine whether estimates of glottal aerodynamic measures based on neck-surface vibration are comparable to those previously obtained using oral airflow and air pressure signals (Espinoza et al., 2017) in terms of discriminating patients with phonotraumatic and nonphonotraumatic vocal hyperfunction (PVH and NPVH) from vocally healthy controls. Method Consecutive /pae/ syllables at comfortable and loud level were produced by 16 women with PVH (organic vocal fold lesions), 16 women with NPVH (primary muscle tension dysphonia), and 32 vocally healthy women who were each matched to a patient according to age and occupation. Subglottal impedance-based inverse filtering of the anterior neck-surface accelerometer (ACC) signal yielded estimates of peak-to-peak glottal airflow, open quotient, and maximum flow declination rate. Average subglottal pressure and microphone-based sound pressure level (SPL) were also estimated from the ACC signal using subject-specific linear regression models. The ACC-based measures of glottal aerodynamics were normalized for SPL and statistically compared between each patient and matched-control group. Results Patients with PVH and NPVH exhibited lower SPL-normalized glottal aerodynamics values than their respective control subjects (p values ranging from < .01 to .07) with very large effect sizes (1.04-2.16), regardless of loudness condition or measurement method (i.e., ACC-based values maintained discriminatory power). Conclusions The results of this study demonstrate that ACC-based estimates of most glottal aerodynamic measures are comparable to those previously obtained from oral airflow and air pressure (Espinoza et al., 2017) in terms of differentiating between hyperfunctional (PVH and NPVH) and normal vocal function. ACC-based estimates of glottal aerodynamic measures may be used to assess vocal function during continuous speech and enables this assessment of daily voice use during ambulatory monitoring to provide better insight into the pathophysiological mechanisms associated with vocal hyperfunction.
Collapse
Affiliation(s)
| | - Daryush D. Mehta
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston
- Speech and Hearing Bioscience and Technology Program, Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| | - Jarrad H. Van Stan
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston
- Speech and Hearing Bioscience and Technology Program, Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| | - Robert E. Hillman
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston
- Speech and Hearing Bioscience and Technology Program, Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| | - Matías Zañartu
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso, Chile
| |
Collapse
|
14
|
Marks KL, Lin JZ, Burns JA, Hron TA, Hillman RE, Mehta DD. Estimation of Subglottal Pressure From Neck Surface Vibration in Patients With Voice Disorders. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2020; 63:2202-2218. [PMID: 32610028 PMCID: PMC7838842 DOI: 10.1044/2020_jslhr-19-00409] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/20/2019] [Revised: 03/12/2020] [Accepted: 04/15/2020] [Indexed: 05/15/2023]
Abstract
Purpose Given the established linear relationship between neck surface vibration magnitude and mean subglottal pressure (Ps) in vocally healthy speakers, the purpose of this study was to better understand the impact of the presence of a voice disorder on this baseline relationship. Method Data were obtained from participants with voice disorders representing a variety of glottal conditions, including phonotraumatic vocal hyperfunction, nonphonotraumatic vocal hyperfunction, and unilateral vocal fold paralysis. Participants were asked to repeat /p/-vowel syllable strings from loud-to-soft loudness levels in multiple vowel contexts (/pa/, /pi/, /pu/) and pitch levels (comfortable, higher than comfortable, lower than comfortable). Three statistical metrics were computed to analyze the regression line between neck surface accelerometer (ACC) signal magnitude and Ps within and across pitch, vowel, and voice disorder category: coefficient of determination (r 2), slope, and intercept. Three linear mixed-effects models were used to evaluate the impact of voice disorder category, pitch level, and vowel context on the relationship between ACC signal magnitude and Ps. Results The relationship between ACC signal magnitude and Ps was statistically different in patients with voice disorders than in vocally healthy controls; patients exhibited higher levels of Ps given similar values of ACC signal magnitude. Negligible effects were found for pitch condition within each voice disorder category, and negligible-to-small effects were found for vowel context. The mean of patient-specific r 2 values was .63, ranging from .13 to .92. Conclusions The baseline, linear relationship between ACC signal magnitude and Ps is affected by the presence of a voice disorder, with the relationship being participant-specific. Further work is needed to improve ACC-based prediction of Ps, across treatment, and during naturalistic speech production.
Collapse
Affiliation(s)
- Katherine L. Marks
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston
- Rehabilitation Science, MGH Institute of Health Professions, Boston, MA
| | - Jonathan Z. Lin
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston
| | - James A. Burns
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston
- Department of Surgery, Harvard Medical School, Boston, MA
| | - Tiffiny A. Hron
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston
- Department of Surgery, Harvard Medical School, Boston, MA
| | - Robert E. Hillman
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston
- Rehabilitation Science, MGH Institute of Health Professions, Boston, MA
- Department of Surgery, Harvard Medical School, Boston, MA
- Speech and Hearing Bioscience and Technology Program, Division of Medical Sciences, Harvard Medical School, Boston, MA
| | - Daryush D. Mehta
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston
- Rehabilitation Science, MGH Institute of Health Professions, Boston, MA
- Department of Surgery, Harvard Medical School, Boston, MA
- Speech and Hearing Bioscience and Technology Program, Division of Medical Sciences, Harvard Medical School, Boston, MA
| |
Collapse
|
15
|
The Feasibility of a Neck-Surface Accelerometer for Estimating the Amount of Acoustic Output During Phonation Regardless of the Difference in the Mouth Configuration. J Voice 2020; 36:297-308. [DOI: 10.1016/j.jvoice.2020.06.002] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2020] [Revised: 06/10/2020] [Accepted: 06/10/2020] [Indexed: 11/23/2022]
|
16
|
Objective Characterization of Phonation Type Using Amplitude of Flow Glottogram Pulse and of Voice Source Fundamental. J Voice 2020; 36:4-14. [PMID: 32402665 DOI: 10.1016/j.jvoice.2020.03.018] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2019] [Revised: 03/24/2020] [Accepted: 03/24/2020] [Indexed: 12/18/2022]
Abstract
Phonation type, a phonatory dimension ranging from hypofunctional/breathy to hyperfunctional/pressed, is important both from a clinical and acoustical point of view; hyperfunctional voice can lead to voice disorders and hypofunctional voice reduces text intelligibility. Five male singers sang diminuendo sequences of the syllable /pae/ and three of them also produced speech or singing samples with different phonation types which were rated for phonatory pressedness by a panel of voice experts. The sequences were analyzed by inverse filtering and the associated subglottal pressures were estimated as the oral pressures during the /p/ occlusion. The results showed strong quasi-linear correlations between the peak-to-peak amplitude of the flow glottogram (henceforth pulse amplitude) and mean subglottal pressure, mean airflow, and the level difference between the first and second voice source partials L1-L2. These correlations were found to vary systematically with phonation type. Regardless of phonation type, the correlation between the pulse amplitude and the amplitude of the voice source fundamental frequency was very close to 1.0. The level difference between the first and second spectrum partials L1-L2 in narrow-band long-term-average spectra of speech and singing was found to vary systematically with phonation type in a manner related to voice experts' ratings of perceived degree of pressedness. The findings support the assumption that the combination of subglottal pressure and level of the voice source fundamental is useful for an objective measure of phonation type.
Collapse
|
17
|
Lin JZ, Espinoza VM, Marks KL, Zañartu M, Mehta DD. Improved subglottal pressure estimation from neck-surface vibration in healthy speakers producing non-modal phonation. IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING 2020; 14:449-460. [PMID: 34079612 PMCID: PMC8168553 DOI: 10.1109/jstsp.2019.2959267] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/15/2023]
Abstract
Subglottal air pressure plays a major role in voice production and is a primary factor in controlling voice onset, offset, sound pressure level, glottal airflow, vocal fold collision pressures, and variations in fundamental frequency. Previous work has shown promise for the estimation of subglottal pressure from an unobtrusive miniature accelerometer sensor attached to the anterior base of the neck during typical modal voice production across multiple pitch and vowel contexts. This study expands on that work to incorporate additional accelerometer-based measures of vocal function to compensate for non-modal phonation characteristics and achieve an improved estimation of subglottal pressure. Subjects with normal voices repeated /p/-vowel syllable strings from loud-to-soft levels in multiple vowel contexts (/ɑ/, /i/, and /u/), pitch conditions (comfortable, lower than comfortable, higher than comfortable), and voice quality types (modal, breathy, strained, and rough). Subject-specific, stepwise regression models were constructed using root-mean-square (RMS) values of the accelerometer signal alone (baseline condition) and in combination with cepstral peak prominence, fundamental frequency, and glottal airflow measures derived using subglottal impedance-based inverse filtering. Five-fold cross-validation assessed the robustness of model performance using the root-mean-square error metric for each regression model. Each cross-validation fold exhibited up to a 25% decrease in prediction error when the model incorporated multidimensional aspects of the accelerometer signal compared with RMS-only models. Improved estimation of subglottal pressure for non-modal phonation was thus achievable, lending to future studies of subglottal pressure estimation in patients with voice disorders and in ambulatory voice recordings.
Collapse
Affiliation(s)
- Jon Z Lin
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA 02114 USA
| | | | - Katherine L Marks
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA 02114 USA
| | - Matías Zañartu
- Department of Electronic Engineering, Universidad Técnica Federico Santa Maria, Valparaíso, Chile
| | - Daryush D Mehta
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital-Harvard Medical School, Boston, MA 02114 USA
| |
Collapse
|