1
|
Ghasemzadeh H, Hillman RE, Espinoza VM, Erath BD, Mehta DD. Vocal Fold Dissipated Power in Females with Hyperfunctional Voice Disorders. J Voice 2024:S0892-1997(24)00332-1. [PMID: 39426910 DOI: 10.1016/j.jvoice.2024.09.039] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2024] [Revised: 09/20/2024] [Accepted: 09/23/2024] [Indexed: 10/21/2024]
Abstract
OBJECTIVE Phonotrauma has been hypothesized to be associated with prolonged and/or accumulated biomechanical stress on vocal fold tissue. This hypothesis can be tested using ambulatory monitoring of vocal fold dissipated power, which requires a reliable method for its noninvasive estimation during the activity of daily living. The first aim of this study was to show that a laboratory-based estimate of vocal fold dissipated power computed from intraoral pressure (IOP) has significant discriminative power in individuals with phonotraumatic vocal hyperfunction (PVH). Considering that estimation of subglottal pressure from IOP is not practical for ambulatory applications, an alternative approach should be used. The second aim of this study was to test the impact of two alternative methods for the estimation of subglottal pressure on the discriminative power of vocal fold dissipated power in individuals with PVH and, hence, to provide an evidence-based recommendation for future ambulatory monitoring studies of vocal fold dissipated power. METHOD Four groups of adult females were included in this study: 16 individuals with PVH, 16 individuals with nonphonotraumatic vocal hyperfunction (NPVH), and two groups of vocally typical controls matched to the participants in each patient group in terms of age and occupation. Each participant produced strings of five consecutive /pae/ syllables while wearing a pneumotachograph mask with an IOP tube. Neck-surface accelerometer and acoustic signals were recorded simultaneously using an ambulatory voice monitor and a head-mounted microphone, respectively. IOP was used to estimate subglottal pressure and subject-specific calibration factors were determined for the estimation of subglottal pressure from the accelerometer signal. RESULTS (1) Individuals with PVH had significantly higher dissipated power than controls (P = 0.001, Cohen's D=1.31) when the intraoral estimate of subglottal pressure was used in the computation of dissipated power. (2) The difference between the dissipated power of individuals with NPVH and their matched controls was not significant. (3) When microphone-based sound pressure levels was used for the estimation of subglottal pressure, the difference between individuals with PVH and their matched controls vanished (P = 0.23). (4) When subject-specific estimation of subglottal pressure from the accelerometer was used, the discriminative power returned with a very large effect size (P = 0.001, D=1.38). CONCLUSION Increased dissipated power is sensitive and specific to individuals with PVH among individuals with hyperfunctional voice disorders. The results provide evidence that accelerometer-based estimate of energy dissipation dose (power integrated over time) during daily life could be clinically useful.
Collapse
Affiliation(s)
- Hamzeh Ghasemzadeh
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, Massachusetts; Department of Surgery, Harvard Medical School, Boston, Massachusetts; Department of Communicative Sciences and Disorders, Michigan State University, East Lansing, Michigan.
| | - Robert E Hillman
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, Massachusetts; Department of Surgery, Harvard Medical School, Boston, Massachusetts; Speech and Hearing Bioscience and Technology, Division of Medical Sciences, Harvard Medical School, Boston, Massachusetts; MGH Institute of Health Professions, Boston, Massachusetts
| | - Víctor M Espinoza
- Department of Sound, Faculty of Arts, University of Chile, Santiago, Chile
| | - Byron D Erath
- Department of Mechanical Engineering, Rochester Institute of Technology, Rochester, NY
| | - Daryush D Mehta
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, Massachusetts; Department of Surgery, Harvard Medical School, Boston, Massachusetts; Speech and Hearing Bioscience and Technology, Division of Medical Sciences, Harvard Medical School, Boston, Massachusetts; MGH Institute of Health Professions, Boston, Massachusetts
| |
Collapse
|
2
|
Van Stan JH, Hillman RE, Krusemark C, Muise J, Stadelman-Cohen T, Mehta DD, Sternad D. Floating Ball Voice Therapy: Preliminary Effects on Outcomes and Predicting Individual Patient Differences in Generalization. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2024; 67:3521-3535. [PMID: 39320344 PMCID: PMC11482575 DOI: 10.1044/2024_jslhr-23-00727] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/27/2023] [Revised: 05/14/2024] [Accepted: 07/23/2024] [Indexed: 09/26/2024]
Abstract
PURPOSE Floating ball voice therapy (FBVT) is a voice-controlled virtual environment based on a common treatment component across multiple evidence-based therapies: improved vocal efficiency (target) via practicing voicing with modified resonance and airflow (ingredient). This study preliminarily tested FBVT's effects on outcomes and the potential for its novel variability metrics to predict individual patient generalization. METHOD Ten patients with nonphonotraumatic vocal hyperfunction (NPVH) practiced FBVT for 10 days. Outcomes were assessed by a vocal efficiency ratio, a validated NPVH index, the patient-reported Voice-Related Quality of Life (V-RQOL), and forced-choice auditory judgments of overall severity. Exploration in early practice (Day 1) was estimated by how the patient's two-dimensional variability (mean airflow and intensity) related to error (difference between the patient-produced and normative vocal efficiency ratio). Generalization from the game to spontaneous speech was evaluated using the validated NPVH index. RESULTS Ten days of FBVT were associated with improved vocal efficiency (Cohen's d = 1.3), NPVH index (d = -1.1), V-RQOL total score (d = 0.9), and overall severity (odds ratio = 2.5). Patients who generalized on Day 10 exhibited airflow/intensity exploration that was more aligned with the error gradient on Day 1 (d = 0.6-1.2). CONCLUSIONS A relatively small dosage of FBVT (i.e., 10 practice sessions) was associated with multiple improved voice therapy outcomes. The FBVT variability metrics on Practice Day 1 demonstrated strong potential to predict which patients generalized to connected speech. Future work can more thoroughly evaluate effects on outcomes and characterizing the quality of vocal exploration with a larger patient population. SUPPLEMENTAL MATERIAL https://doi.org/10.23641/asha.27040873.
Collapse
Affiliation(s)
- Jarrad H. Van Stan
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| | - Robert E. Hillman
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| | - Carol Krusemark
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Jason Muise
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Tara Stadelman-Cohen
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Daryush D. Mehta
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| | | |
Collapse
|
3
|
Morales A, Yuz JI, Cortés JP, Fontanet JG, Zañartu M. Glottal Airflow Estimation using Neck Surface Acceleration and Low-Order Kalman Smoothing. IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING 2023; 31:2055-2066. [PMID: 38130818 PMCID: PMC10732542 DOI: 10.1109/taslp.2023.3277269] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/23/2023]
Abstract
The use of non-invasive skin accelerometers placed over the extrathoracic trachea has been proposed in the literature for measuring vocal function. Glottal airflow is estimated using inverse filtering or Bayesian techniques based on a subglottal impedance-based model when utilizing these sensors. However, deviations in glottal airflow estimates can arise due to sensor positioning and model mismatch, and addressing them requires a significant computational load. In this paper, we utilize system identification techniques to obtain a low order state-space representation of the subglottal impedance-based model. We then employ the resulting low order model in a Kalman smoother to estimate the glottal airflow. Our proposed approach reduces the model order by 94% and requires only 1.5% of the computing time compared to previous Bayesian methods in the literature, while achieving slightly better accuracy when correcting for glottal airflow deviations. Additionally, our Kalman smoother approach provides a measure of uncertainty in the airflow estimate, which is valuable when measurements are taken under different conditions. With its comparable accuracy in signal estimation and reduced computational load, the proposed approach has the potential for real-time estimation of glottal airflow and its associated uncertainty in wearable voice ambulatory monitors using neck-surface acceleration.
Collapse
Affiliation(s)
- Arturo Morales
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso 2390123, Chile
| | - Juan I Yuz
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso 2390123, Chile
| | - Juan Pablo Cortés
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso 2390123, Chile
| | - Javier G Fontanet
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso 2390123, Chile
| | - Matías Zañartu
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso 2390123, Chile
| |
Collapse
|
4
|
Bullock L, Toles LE, Hillman RE, Mehta DD. Acoustic-Aerodynamic Voice Outcome Ratios Identify Changes in Vocal Function Following Vocal Fold Medialization for Unilateral Vocal Fold Paralysis. J Voice 2023:S0892-1997(23)00104-2. [PMID: 37068982 PMCID: PMC10576834 DOI: 10.1016/j.jvoice.2023.03.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2023] [Revised: 03/09/2023] [Accepted: 03/10/2023] [Indexed: 04/19/2023]
Abstract
PURPOSE This study aimed to determine whether ratio-based measures that combine acoustic (output) and aerodynamic (input) parameters detect postoperative change in vocal function following vocal fold medialization for unilateral vocal fold paralysis. METHOD Pre- and postoperative acoustic and aerodynamic measures were analyzed retrospectively from 149 patients who underwent vocal fold medialization for unilateral vocal fold paralysis. A 2 × 2 repeated-measures analysis of variance was conducted for each of four acoustic-aerodynamic ratios-traditional vocal efficiency (VE), sound pressure level to aerodynamic power (SPL/AP), SPL to average airflow (SPL/AFLOW), and SPL to subglottal pressure (SPL/Ps)-to investigate the main effects and interaction of treatment stage and loudness level (comfortable and loud). RESULTS The patient group showed significant postoperative improvements in self reports of vocal function (voice-related quality of life) and clinical auditory-perceptual judgments of dysphonia (consensus auditory-perceptual evaluation of voice). Main effects for both treatment stage and loudness level were statistically significant for all measures except SPL/Ps. There were interaction effects for VE and SPL/AP, suggesting that magnitude of the treatment effect differs based on loudness. SPL/AFLOW had medium-to-large effect sizes in both loudness conditions. There were postoperative changes in SPL/Ps that were dependent on the magnitude of the reduction in AFLOW; as expected, SPL/Ps increased postoperatively in a subgroup that had large postoperative reductions in AFLOW at the comfortable loudness level. CONCLUSIONS Acoustic-aerodynamic ratios can aid in tracking changes in vocal function following vocal fold medialization. SPL/AFLOW exhibited the largest effect size, which is expected since a reduction in abnormally high AFLOW typically accompanies the increased modulation of glottal air flow associated with successful vocal fold medialization. Future study is needed to model physiological changes in acoustic-aerodynamic voice outcome ratios across different types of voice disorders.
Collapse
Affiliation(s)
- Latané Bullock
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, Massachusetts; Division of Medical Sciences, Speech and Hearing Bioscience and Technology, Harvard Medical School, Boston, Massachusetts
| | - Laura E Toles
- Department of Otolaryngology-Head and Neck Surgery, The University of Texas Southwestern Medical Center, Dallas, Texas
| | - Robert E Hillman
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, Massachusetts; Division of Medical Sciences, Speech and Hearing Bioscience and Technology, Harvard Medical School, Boston, Massachusetts; Department of Surgery, Harvard Medical School, Boston, Massachusetts; School of Health and Rehabilitation Sciences, MGH Institute of Health Professions, Boston, Massachusetts
| | - Daryush D Mehta
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, Massachusetts; Division of Medical Sciences, Speech and Hearing Bioscience and Technology, Harvard Medical School, Boston, Massachusetts; Department of Surgery, Harvard Medical School, Boston, Massachusetts; School of Health and Rehabilitation Sciences, MGH Institute of Health Professions, Boston, Massachusetts.
| |
Collapse
|
5
|
The Biomechanical Characteristics of Swallowing in Tracheostomized Patients with Aspiration following Acquired Brain Injury: A Cross-Sectional Study. Brain Sci 2023; 13:brainsci13010091. [PMID: 36672072 PMCID: PMC9856393 DOI: 10.3390/brainsci13010091] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2022] [Revised: 12/22/2022] [Accepted: 12/30/2022] [Indexed: 01/06/2023] Open
Abstract
Objectives: Investigate the biomechanical characteristics in tracheostomized patients with aspiration following acquired brain injury (ABI) and further explore the relationship between the biomechanical characteristics and aspiration. Methods: This is a single-center cross-sectional study. The tracheostomized patients with aspiration following ABI and age-matched healthy controls were recruited. The biomechanical characteristics, including velopharynx (VP) maximal pressure, tongue base (TB) maximal pressure, upper esophageal sphincter (UES) residual pressure, UES relaxation duration, and subglottic pressure, were examined by high-resolution manometry and computational fluid dynamics simulation analysis. The penetration−aspiration scale (PAS) score was evaluated by a videofluoroscopic swallowing study. Results: Fifteen healthy subjects and fifteen tracheostomized patients with aspiration following ABI were included. The decreased VP maximal pressure, increased UES residual pressure, and shortened UES relaxation duration were found in the patient group compared with the control group (p < 0.05). Furthermore, the subglottic pressure significantly decreased in patients (p < 0.05), while no significant difference was found in TB maximal pressure between groups (p > 0.05). In addition, in the patient group, VP maximal pressure (rs = −0.439; p = 0.015), UES relaxation duration (rs = −0.532; p = 0.002), and the subglottic pressure (rs = −0.775; p < 0.001) were negatively correlated with the PAS score, while UES residual pressure (rs = 0.807; p < 0.001) was positively correlated with the PAS score (p < 0.05), the correlation between TB maximal pressure and PAS score (rs = −0.315; p = 0.090) did not reach statistical significance. Conclusions: The biomechanical characteristics in tracheostomized patients with aspiration following ABI might manifest as decreased VP maximal pressure and subglottic pressure, increased UES residual pressure, and shortened UES relaxation duration, in which VP maximal pressure, UES relaxation duration, subglottic pressure, and UES residual pressure were correlated with aspiration.
Collapse
|
6
|
Cortés JP, Lin JZ, Marks KL, Espinoza VM, Ibarra EJ, Zañartu M, Hillman RE, Mehta DD. Ambulatory Monitoring of Subglottal Pressure Estimated from Neck-Surface Vibration in Individuals with and without Voice Disorders. APPLIED SCIENCES (BASEL, SWITZERLAND) 2022; 12:10692. [PMID: 36777332 PMCID: PMC9910342 DOI: 10.3390/app122110692] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/18/2023]
Abstract
The aerodynamic voice assessment of subglottal air pressure can discriminate between speakers with typical voices from patients with voice disorders, with further evidence validating subglottal pressure as a clinical outcome measure. Although estimating subglottal pressure during phonation is an important component of a standard voice assessment, current methods for estimating subglottal pressure rely on non-natural speech tasks in a clinical or laboratory setting. This study reports on the validation of a method for subglottal pressure estimation in individuals with and without voice disorders that can be translated to connected speech to enable the monitoring of vocal function and behavior in real-world settings. During a laboratory calibration session, a participant-specific multiple regression model was derived to estimate subglottal pressure from a neck-surface vibration signal that can be recorded during natural speech production. The model was derived for vocally typical individuals and patients diagnosed with phonotraumatic vocal fold lesions, primary muscle tension dysphonia, and unilateral vocal fold paralysis. Estimates of subglottal pressure using the developed method exhibited significantly lower error than alternative methods in the literature, with average errors ranging from 1.13 to 2.08 cm H2O for the participant groups. The model was then applied during activities of daily living, thus yielding ambulatory estimates of subglottal pressure for the first time in these populations. Results point to the feasibility and potential of real-time monitoring of subglottal pressure during an individual's daily life for the prevention, assessment, and treatment of voice disorders.
Collapse
Affiliation(s)
- Juan P. Cortés
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA 02114, USA
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso 2390123, Chile
| | - Jon Z. Lin
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA 02114, USA
| | - Katherine L. Marks
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA 02114, USA
- Communication Sciences and Disorders, MGH Institute of Health Professions, Boston, MA 02129, USA
- Speech, Language & Hearing Sciences Department, College of Health & Rehabilitation: Sargent College, Boston University, Boston, MA 02215, USA
| | | | - Emiro J. Ibarra
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso 2390123, Chile
| | - Matías Zañartu
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso 2390123, Chile
| | - Robert E. Hillman
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA 02114, USA
- Communication Sciences and Disorders, MGH Institute of Health Professions, Boston, MA 02129, USA
- Department of Surgery, Massachusetts General Hospital–Harvard Medical School, Boston, MA 02114, USA
- Speech and Hearing Bioscience and Technology, Division of Medical Sciences, Harvard Medical School, Boston, MA 02115, USA
| | - Daryush D. Mehta
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA 02114, USA
- Communication Sciences and Disorders, MGH Institute of Health Professions, Boston, MA 02129, USA
- Department of Surgery, Massachusetts General Hospital–Harvard Medical School, Boston, MA 02114, USA
- Speech and Hearing Bioscience and Technology, Division of Medical Sciences, Harvard Medical School, Boston, MA 02115, USA
| |
Collapse
|
7
|
Toles LE, Seidman AY, Hillman RE, Mehta DD. Clinical Utility of the Ratio of Sound Pressure Level to Subglottal Pressure in Patients Surgically Treated for Phonotraumatic Vocal Fold Lesions. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2022; 65:2778-2788. [PMID: 35914023 PMCID: PMC9911101 DOI: 10.1044/2022_jslhr-21-00658] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/13/2021] [Revised: 04/07/2022] [Accepted: 05/02/2022] [Indexed: 06/15/2023]
Abstract
PURPOSE This study aimed to determine whether a simplified, and potentially more stable, acoustic-aerodynamic voice outcome ratio (ratio of sound pressure level [SPL] to subglottal pressure) is comparable to a traditional vocal efficiency measure (ratio of acoustic power to the product of average subglottal pressure and average phonatory airflow) in terms of the ability to detect change in vocal function following surgical removal of bilateral phonotraumatic lesions. METHOD Pre- and postoperative acoustic and aerodynamic measures were analyzed retrospectively from 75 female patients who underwent surgical removal of bilateral phonotraumatic lesions. A 2 × 2 repeated-measures analysis of variance was conducted for each of three acoustic-aerodynamic voice outcome ratios-traditional vocal efficiency, an SPL-based ratio with both airflow and subglottal pressure, and a simplified SPL-based ratio with subglottal pressure only-to investigate the main effects of treatment stage (pre- and postsurgery), loudness condition (comfortable and loud), and their interaction. Post hoc paired samples t tests were conducted for statistically significant interactions. The within-subject variability of the measures was assessed using the coefficient of variation. RESULTS Although exhibiting an expected main effect of loudness (higher values in the loud condition), the traditional vocal efficiency ratio did not exhibit a main effect of treatment. For both SPL-based ratios, there were significant main effects of treatment stage (higher values postoperatively) and loudness condition (lower values in the loud condition). Within-subject, postoperative changes in the two SPL-based ratios moderately correlated with one another. The simplified ratio of SPL to subglottal pressure (without airflow) exhibited the least within-subject variability relative to the other two acoustic-aerodynamic ratios. CONCLUSIONS These findings indicate that SPL-based acoustic-aerodynamic voice outcome ratios increase significantly following the surgical removal of phonotraumatic vocal fold lesions. The simplified ratio of SPL to subglottal pressure exhibits the least variability and can be easily obtained without requiring the measurement of airflow.
Collapse
Affiliation(s)
- Laura E. Toles
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
- The University of Texas Southwestern Medical Center, Dallas
| | | | - Robert E. Hillman
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
- Harvard Medical School, Boston, MA
| | - Daryush D. Mehta
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
- Harvard Medical School, Boston, MA
| |
Collapse
|
8
|
Kopczynski B, Niebudek-Bogusz E, Pietruszewska W, Strumillo P. Segmentation of Glottal Images from High-Speed Videoendoscopy Optimized by Synchronous Acoustic Recordings. SENSORS (BASEL, SWITZERLAND) 2022; 22:1751. [PMID: 35270897 PMCID: PMC8915112 DOI: 10.3390/s22051751] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/30/2021] [Revised: 02/12/2022] [Accepted: 02/15/2022] [Indexed: 05/17/2023]
Abstract
Laryngeal high-speed videoendoscopy (LHSV) is an imaging technique offering novel visualization quality of the vibratory activity of the vocal folds. However, in most image analysis methods, the interaction of the medical personnel and access to ground truth annotations are required to achieve accurate detection of vocal folds edges. In our fully automatic method, we combine video and acoustic data that are synchronously recorded during the laryngeal endoscopy. We show that the image segmentation algorithm of the glottal area can be optimized by matching the Fourier spectra of the pre-processed video and the spectra of the acoustic recording during the phonation of sustained vowel /i:/. We verify our method on a set of LHSV recordings taken from subjects with normophonic voice and patients with voice disorders due to glottal insufficiency. We show that the computed geometric indices of the glottal area make it possible to discriminate between normal and pathologic voices. The median of the Open Quotient and Minimal Relative Glottal Area values for healthy subjects were 0.69 and 0.06, respectively, while for dysphonic subjects were 1 and 0.35, respectively. We also validate these results using independent phoniatrician experts.
Collapse
Affiliation(s)
- Bartosz Kopczynski
- Institute of Electronics, Lodz University of Technology, 90-924 Lodz, Poland;
| | - Ewa Niebudek-Bogusz
- Department of Otolaryngology, Head and Neck Oncology, Medical University of Lodz, 90-001 Lodz, Poland; (E.N.-B.); (W.P.)
| | - Wioletta Pietruszewska
- Department of Otolaryngology, Head and Neck Oncology, Medical University of Lodz, 90-001 Lodz, Poland; (E.N.-B.); (W.P.)
| | - Pawel Strumillo
- Institute of Electronics, Lodz University of Technology, 90-924 Lodz, Poland;
| |
Collapse
|
9
|
Marks KL, Verdi A, Toles LE, Stipancic KL, Ortiz AJ, Hillman RE, Mehta DD. Psychometric Analysis of an Ecological Vocal Effort Scale in Individuals With and Without Vocal Hyperfunction During Activities of Daily Living. AMERICAN JOURNAL OF SPEECH-LANGUAGE PATHOLOGY 2021; 30:2589-2604. [PMID: 34665647 PMCID: PMC9132024 DOI: 10.1044/2021_ajslp-21-00111] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/15/2021] [Revised: 06/11/2021] [Accepted: 07/07/2021] [Indexed: 05/29/2023]
Abstract
Objective The purpose of this study was to examine the psychometric properties of an ecological vocal effort scale linked to a voicing task. Method Thirty-eight patients with nodules, 18 patients with muscle tension dysphonia, and 45 vocally healthy control individuals participated in a week of ambulatory voice monitoring. A global vocal status question was asked hourly throughout the day. Participants produced a vowel-consonant-vowel syllable string and rated the vocal effort needed to produce the task on a visual analog scale. Test-retest reliability was calculated for a subset using the intraclass correlation coefficient, ICC(A, 1). Construct validity was assessed by (a) comparing the weeklong vocal effort ratings between the patient and control groups and (b) comparing weeklong vocal effort ratings before and after voice rehabilitation in a subset of 25 patients. Cohen's d, the standard error of measurement (SEM), and the minimal detectable change (MDC) assessed sensitivity. The minimal clinically important difference (MCID) assessed responsiveness. Results Test-retest reliability was excellent, ICC(A, 1) = .96. Weeklong mean effort was statistically higher in the patients than in controls (d = 1.62) and lower after voice rehabilitation (d = 1.75), supporting construct validity and sensitivity. SEM was 4.14, MDC was 11.47, and MCID was 9.74. Since the MCID was within the error of the measure, we must rely upon the MDC to detect real changes in ecological vocal effort. Conclusion The ecological vocal effort scale offers a reliable, valid, and sensitive method of monitoring vocal effort changes during the daily life of individuals with and without vocal hyperfunction.
Collapse
Affiliation(s)
- Katherine L. Marks
- MGH Institute of Health Professions, Boston, MA
- Massachusetts General Hospital, Boston
| | - Alessandra Verdi
- MGH Institute of Health Professions, Boston, MA
- Massachusetts General Hospital, Boston
| | - Laura E. Toles
- MGH Institute of Health Professions, Boston, MA
- Massachusetts General Hospital, Boston
| | - Kaila L. Stipancic
- MGH Institute of Health Professions, Boston, MA
- University at Buffalo, NY
| | - Andrew J. Ortiz
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| | - Robert E. Hillman
- MGH Institute of Health Professions, Boston, MA
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| | - Daryush D. Mehta
- MGH Institute of Health Professions, Boston, MA
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| |
Collapse
|
10
|
Kadiri SR, Alku P. Glottal features for classification of phonation type from speech and neck surface accelerometer signals. COMPUT SPEECH LANG 2021. [DOI: 10.1016/j.csl.2021.101232] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
11
|
Van Stan JH, Ortiz AJ, Marks KL, Toles LE, Mehta DD, Burns JA, Hron T, Stadelman-Cohen T, Krusemark C, Muise J, Fox AB, Nudelman C, Zeitels S, Hillman RE. Changes in the Daily Phonotrauma Index Following the Use of Voice Therapy as the Sole Treatment for Phonotraumatic Vocal Hyperfunction in Females. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2021; 64:3446-3455. [PMID: 34463536 PMCID: PMC8642084 DOI: 10.1044/2021_jslhr-21-00082] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/12/2021] [Revised: 04/26/2021] [Accepted: 05/24/2021] [Indexed: 05/31/2023]
Abstract
Purpose The aim of this study was to use the Daily Phonotrauma Index (DPI) to quantify group-based changes in the daily voice use of patients with phonotraumatic vocal hyperfunction (PVH) after receiving voice therapy as the sole treatment. This is part of an ongoing effort to validate an updated theoretical framework for PVH. Method A custom-designed ambulatory voice monitor was used to collect 1 week of pre- and posttreatment data from 52 female patients with PVH. Normative weeklong data were also obtained from 52 matched controls. Each week was represented by the DPI, which is a combination of neck-surface acceleration magnitude skewness and the standard deviation of the difference between the first and second harmonic magnitudes. Results Compared to pretreatment, the DPI statistically decreased towards normal in the patient group after treatment (Cohen's d = -0.25). The posttreatment patient group's DPI was still significantly higher than the control group (d = 0.68). Conclusions The DPI showed the pattern of improved ambulatory voice use in a group of patients with PVH following voice therapy that was predicted by the updated theoretical framework. Per the prediction, voice therapy was associated with a decreased potential for phonotrauma in daily voice use, but the posttreatment patient group data were still significantly different from the normative control group data. This posttreatment difference is interpreted as reflecting the impact on voice use of the persistence of phonotrauma-induced structural changes to the vocal folds. Further validation of the DPI is needed to better understand its potential clinical use.
Collapse
Affiliation(s)
- Jarrad H. Van Stan
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| | | | - Katherine L. Marks
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Laura E. Toles
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Daryush D. Mehta
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| | - James A. Burns
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| | - Tiffiny Hron
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| | - Tara Stadelman-Cohen
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Carol Krusemark
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Jason Muise
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | | | - Charles Nudelman
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Steven Zeitels
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| | - Robert E. Hillman
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| |
Collapse
|
12
|
Ibarra EJ, Parra JA, Alzamendi GA, Cortés JP, Espinoza VM, Mehta DD, Hillman RE, Zañartu M. Estimation of Subglottal Pressure, Vocal Fold Collision Pressure, and Intrinsic Laryngeal Muscle Activation From Neck-Surface Vibration Using a Neural Network Framework and a Voice Production Model. Front Physiol 2021; 12:732244. [PMID: 34539451 PMCID: PMC8440844 DOI: 10.3389/fphys.2021.732244] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2021] [Accepted: 08/09/2021] [Indexed: 11/23/2022] Open
Abstract
The ambulatory assessment of vocal function can be significantly enhanced by having access to physiologically based features that describe underlying pathophysiological mechanisms in individuals with voice disorders. This type of enhancement can improve methods for the prevention, diagnosis, and treatment of behaviorally based voice disorders. Unfortunately, the direct measurement of important vocal features such as subglottal pressure, vocal fold collision pressure, and laryngeal muscle activation is impractical in laboratory and ambulatory settings. In this study, we introduce a method to estimate these features during phonation from a neck-surface vibration signal through a framework that integrates a physiologically relevant model of voice production and machine learning tools. The signal from a neck-surface accelerometer is first processed using subglottal impedance-based inverse filtering to yield an estimate of the unsteady glottal airflow. Seven aerodynamic and acoustic features are extracted from the neck surface accelerometer and an optional microphone signal. A neural network architecture is selected to provide a mapping between the seven input features and subglottal pressure, vocal fold collision pressure, and cricothyroid and thyroarytenoid muscle activation. This non-linear mapping is trained solely with 13,000 Monte Carlo simulations of a voice production model that utilizes a symmetric triangular body-cover model of the vocal folds. The performance of the method was compared against laboratory data from synchronous recordings of oral airflow, intraoral pressure, microphone, and neck-surface vibration in 79 vocally healthy female participants uttering consecutive /pæ/ syllable strings at comfortable, loud, and soft levels. The mean absolute error and root-mean-square error for estimating the mean subglottal pressure were 191 Pa (1.95 cm H2O) and 243 Pa (2.48 cm H2O), respectively, which are comparable with previous studies but with the key advantage of not requiring subject-specific training and yielding more output measures. The validation of vocal fold collision pressure and laryngeal muscle activation was performed with synthetic values as reference. These initial results provide valuable insight for further vocal fold model refinement and constitute a proof of concept that the proposed machine learning method is a feasible option for providing physiologically relevant measures for laboratory and ambulatory assessment of vocal function.
Collapse
Affiliation(s)
- Emiro J. Ibarra
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso, Chile
- School of Electrical Engineering, University of the Andes, Mérida, Venezuela
| | - Jesús A. Parra
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso, Chile
| | - Gabriel A. Alzamendi
- Institute for Research and Development on Bioengineering and Bioinformatics, Consejo Nacional de Investigaciones Científicas y Técnicas - Universidad Nacional de Entre Ríos, Oro Verde, Argentina
| | - Juan P. Cortés
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso, Chile
- Center for Laryngeal Surgery and Voice Rehabilitation Laboratory, Massachusetts General Hospital–Harvard Medical School, Boston, MA, United States
| | - Víctor M. Espinoza
- Department of Sound, Faculty of Arts, University of Chile, Santiago, Chile
| | - Daryush D. Mehta
- Center for Laryngeal Surgery and Voice Rehabilitation Laboratory, Massachusetts General Hospital–Harvard Medical School, Boston, MA, United States
| | - Robert E. Hillman
- Center for Laryngeal Surgery and Voice Rehabilitation Laboratory, Massachusetts General Hospital–Harvard Medical School, Boston, MA, United States
| | - Matías Zañartu
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso, Chile
| |
Collapse
|
13
|
Mehta DD, Kobler JB, Zeitels SM, Zañartu M, Ibarra EJ, Alzamendi GA, Manriquez R, Erath BD, Peterson SD, Petrillo RH, Hillman RE. Direct measurement and modeling of intraglottal, subglottal, and vocal fold collision pressures during phonation in an individual with a hemilaryngectomy. APPLIED SCIENCES (BASEL, SWITZERLAND) 2021; 11:7256. [PMID: 36210866 PMCID: PMC9541559 DOI: 10.3390/app11167256] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
The purpose of this paper is to report on the first in vivo application of a recently developed transoral, dual-sensor pressure probe that directly measures intraglottal, subglottal, and vocal fold collision pressures during phonation. Synchronous measurement of intraglottal and subglottal pressures was accomplished using two miniature pressure sensors mounted on the end of the probe and inserted transorally in a 78-year-old male who had previously undergone surgical removal of his right vocal fold for treatment of laryngeal cancer. The endoscopist used one hand to position the custom probe against the surgically medialized scar band that replaced the right vocal fold and used the other hand to position a transoral endoscope to record laryngeal high-speed videoendoscopy of the vibrating left vocal fold contacting the pressure probe. Visualization of the larynx during sustained phonation allowed the endoscopist to place the dual-sensor pressure probe such that the proximal sensor was positioned intraglottally and the distal sensor subglottally. The proximal pressure sensor was verified to be in the strike zone of vocal fold collision during phonation when the intraglottal pressure signal exhibited three characteristics: an impulsive peak at the start of the closed phase, rounded peak during the open phase, and minimum value around zero immediately preceding the impulsive peak of the subsequent phonatory cycle. Numerical voice production modeling was applied to validate model-based predictions of vocal fold collision pressure using kinematic vocal fold measures. The results successfully demonstrated feasibility of in vivo measurement of vocal fold collision pressure in an individual with a hemilaryngectomy, motivating ongoing data collection that is designed to aid in the development of vocal dose measures that incorporate vocal fold impact collision and stresses.
Collapse
Affiliation(s)
- Daryush D. Mehta
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA, USA
- Department of Surgery, Massachusetts General Hospital–Harvard Medical School, Boston, MA
- Speech and Hearing Bioscience and Technology, Division of Medical Sciences, Harvard Medical School, Boston, MA, USA
- MGH Institute of Health Professions, Boston, MA, USA
| | - James B. Kobler
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA, USA
- Department of Surgery, Massachusetts General Hospital–Harvard Medical School, Boston, MA
- Speech and Hearing Bioscience and Technology, Division of Medical Sciences, Harvard Medical School, Boston, MA, USA
| | - Steven M. Zeitels
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA, USA
- Department of Surgery, Massachusetts General Hospital–Harvard Medical School, Boston, MA
- Speech and Hearing Bioscience and Technology, Division of Medical Sciences, Harvard Medical School, Boston, MA, USA
| | - Matías Zañartu
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso, Chile
| | - Emiro J. Ibarra
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso, Chile
| | - Gabriel A. Alzamendi
- Institute for Research and Development on Bioengineering and Bioinformatics, National University of Entre Rios–CONICET, Entre Ríos, Argentina
| | - Rodrigo Manriquez
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso, Chile
| | - Byron D. Erath
- Department of Mechanical & Aeronautical Engineering, Clarkson University, Potsdam, NY, USA
| | - Sean D. Peterson
- Department of Mechanical and Mechatronics Engineering, University of Waterloo, Ontario, Canada
| | - Robert H. Petrillo
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA, USA
| | - Robert E. Hillman
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA, USA
- Department of Surgery, Massachusetts General Hospital–Harvard Medical School, Boston, MA
- Speech and Hearing Bioscience and Technology, Division of Medical Sciences, Harvard Medical School, Boston, MA, USA
- MGH Institute of Health Professions, Boston, MA, USA
| |
Collapse
|
14
|
Calvache C, Solaque L, Velasco A, Peñuela L. Biomechanical Models to Represent Vocal Physiology: A Systematic Review. J Voice 2021; 37:465.e1-465.e18. [PMID: 33678534 DOI: 10.1016/j.jvoice.2021.02.014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2020] [Revised: 01/26/2021] [Accepted: 02/02/2021] [Indexed: 11/24/2022]
Abstract
Biomechanical modeling allows obtaining information on physical phenomena that cannot be directly observed. This study aims to review models that represent voice production. A systematic review of the literature was conducted using PubMed/Medline, SCOPUS, and IEEE Xplore databases. To select the papers, we used the protocol PRISMA Statement. A total of 53 publications were included in this review. This article considers a taxonomic classification of models found in the literature. We propose four categories in the taxonomy: (1) Models representing the Source (Vocal folds); (2) Models representing the Filter (Vocal Tract); (3) Models representing the Source - Filter Interaction; and (4) Models representing the Airflow - Source Interaction. We include a bibliographic analysis with the evolution of the publications per category. We provide an analysis of the number as well of publications in journals per year. Moreover, we present an analysis of the term occurrence and its frequency of usage, as found in the literature. In each category, different types of vocal production models are mentioned and analyzed. The models account for the analysis of evidence about aerodynamic, biomechanical, and acoustic phenomena and their correlation with the physiological processes involved in the production of the human voice. This review gives an insight into the state of the art related to the mathematical modeling of voice production, analyzed from the viewpoint of vocal physiology.
Collapse
Affiliation(s)
- Carlos Calvache
- Vocology Center, Bogotá, Colombia; Department of Mechatronics Engineering, Universidad Militar Nueva Granada, Bogotá, Colombia.
| | - Leonardo Solaque
- Department of Mechatronics Engineering, Universidad Militar Nueva Granada, Bogotá, Colombia
| | - Alexandra Velasco
- Department of Mechatronics Engineering, Universidad Militar Nueva Granada, Bogotá, Colombia
| | - Lina Peñuela
- Department of Mechatronics Engineering, Universidad Militar Nueva Granada, Bogotá, Colombia
| |
Collapse
|
15
|
Van Stan JH, Mehta DD, Ortiz AJ, Burns JA, Marks KL, Toles LE, Stadelman-Cohen T, Krusemark C, Muise J, Hron T, Zeitels SM, Fox AB, Hillman RE. Changes in a Daily Phonotrauma Index After Laryngeal Surgery and Voice Therapy: Implications for the Role of Daily Voice Use in the Etiology and Pathophysiology of Phonotraumatic Vocal Hyperfunction. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2020; 63:3934-3944. [PMID: 33197360 PMCID: PMC8608140 DOI: 10.1044/2020_jslhr-20-00168] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/15/2023]
Abstract
Purpose This study attempts to gain insights into the role of daily voice use in the etiology and pathophysiology of phonotraumatic vocal hyperfunction (PVH) by applying a logistic regression-based daily phonotrauma index (DPI) to predict group-based improvements in patients with PVH after laryngeal surgery and/or postsurgical voice therapy. Method A custom-designed ambulatory voice monitor was used to collect 1 week of pre- and postsurgery data from 27 female patients with PVH; 13 of these patients were also monitored after postsurgical voice therapy. Normative weeklong data were obtained from 27 matched controls. Each week was represented by the DPI, standard deviation of the difference between the first and second harmonic amplitudes (H1-H2). Results Compared to pretreatment, the DPI significantly decreased in the patient group after surgery (Cohen's d effect size = -0.86) and voice therapy (d = -1.06). The patient group DPI only normalized after voice therapy. Conclusions The DPI produced the expected pattern of improved ambulatory voice use across laryngeal surgery and postsurgical voice therapy in a group of patients with PVH. The results were interpreted as providing new objective information about the role of daily voice use in the etiology and pathophysiology of PVH. The DPI is viewed as an estimate of potential vocal fold trauma that relies on combining the long-term distributional characteristics of two parameters representing the magnitude of phonatory forces (neck-surface acceleration magnitude) and vocal fold closure dynamics (H1-H2). Further validation of the DPI is needed to better understand its potential clinical use.
Collapse
Affiliation(s)
- Jarrad H. Van Stan
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| | - Daryush D. Mehta
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| | | | - James A. Burns
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| | - Katherine L. Marks
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Laura E. Toles
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Tara Stadelman-Cohen
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Carol Krusemark
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Jason Muise
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Tiffiny Hron
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| | - Steven M. Zeitels
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| | | | - Robert E. Hillman
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| |
Collapse
|
16
|
Espinoza VM, Mehta DD, Van Stan JH, Hillman RE, Zañartu M. Glottal Aerodynamics Estimated From Neck-Surface Vibration in Women With Phonotraumatic and Nonphonotraumatic Vocal Hyperfunction. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2020; 63:2861-2869. [PMID: 32755502 PMCID: PMC7890221 DOI: 10.1044/2020_jslhr-20-00189] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/20/2020] [Revised: 05/29/2020] [Accepted: 06/10/2020] [Indexed: 05/06/2023]
Abstract
Purpose The purpose of this study was to determine whether estimates of glottal aerodynamic measures based on neck-surface vibration are comparable to those previously obtained using oral airflow and air pressure signals (Espinoza et al., 2017) in terms of discriminating patients with phonotraumatic and nonphonotraumatic vocal hyperfunction (PVH and NPVH) from vocally healthy controls. Method Consecutive /pae/ syllables at comfortable and loud level were produced by 16 women with PVH (organic vocal fold lesions), 16 women with NPVH (primary muscle tension dysphonia), and 32 vocally healthy women who were each matched to a patient according to age and occupation. Subglottal impedance-based inverse filtering of the anterior neck-surface accelerometer (ACC) signal yielded estimates of peak-to-peak glottal airflow, open quotient, and maximum flow declination rate. Average subglottal pressure and microphone-based sound pressure level (SPL) were also estimated from the ACC signal using subject-specific linear regression models. The ACC-based measures of glottal aerodynamics were normalized for SPL and statistically compared between each patient and matched-control group. Results Patients with PVH and NPVH exhibited lower SPL-normalized glottal aerodynamics values than their respective control subjects (p values ranging from < .01 to .07) with very large effect sizes (1.04-2.16), regardless of loudness condition or measurement method (i.e., ACC-based values maintained discriminatory power). Conclusions The results of this study demonstrate that ACC-based estimates of most glottal aerodynamic measures are comparable to those previously obtained from oral airflow and air pressure (Espinoza et al., 2017) in terms of differentiating between hyperfunctional (PVH and NPVH) and normal vocal function. ACC-based estimates of glottal aerodynamic measures may be used to assess vocal function during continuous speech and enables this assessment of daily voice use during ambulatory monitoring to provide better insight into the pathophysiological mechanisms associated with vocal hyperfunction.
Collapse
Affiliation(s)
| | - Daryush D. Mehta
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston
- Speech and Hearing Bioscience and Technology Program, Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| | - Jarrad H. Van Stan
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston
- Speech and Hearing Bioscience and Technology Program, Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| | - Robert E. Hillman
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston
- Speech and Hearing Bioscience and Technology Program, Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| | - Matías Zañartu
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso, Chile
| |
Collapse
|
17
|
Marks KL, Lin JZ, Burns JA, Hron TA, Hillman RE, Mehta DD. Estimation of Subglottal Pressure From Neck Surface Vibration in Patients With Voice Disorders. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2020; 63:2202-2218. [PMID: 32610028 PMCID: PMC7838842 DOI: 10.1044/2020_jslhr-19-00409] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/20/2019] [Revised: 03/12/2020] [Accepted: 04/15/2020] [Indexed: 05/15/2023]
Abstract
Purpose Given the established linear relationship between neck surface vibration magnitude and mean subglottal pressure (Ps) in vocally healthy speakers, the purpose of this study was to better understand the impact of the presence of a voice disorder on this baseline relationship. Method Data were obtained from participants with voice disorders representing a variety of glottal conditions, including phonotraumatic vocal hyperfunction, nonphonotraumatic vocal hyperfunction, and unilateral vocal fold paralysis. Participants were asked to repeat /p/-vowel syllable strings from loud-to-soft loudness levels in multiple vowel contexts (/pa/, /pi/, /pu/) and pitch levels (comfortable, higher than comfortable, lower than comfortable). Three statistical metrics were computed to analyze the regression line between neck surface accelerometer (ACC) signal magnitude and Ps within and across pitch, vowel, and voice disorder category: coefficient of determination (r 2), slope, and intercept. Three linear mixed-effects models were used to evaluate the impact of voice disorder category, pitch level, and vowel context on the relationship between ACC signal magnitude and Ps. Results The relationship between ACC signal magnitude and Ps was statistically different in patients with voice disorders than in vocally healthy controls; patients exhibited higher levels of Ps given similar values of ACC signal magnitude. Negligible effects were found for pitch condition within each voice disorder category, and negligible-to-small effects were found for vowel context. The mean of patient-specific r 2 values was .63, ranging from .13 to .92. Conclusions The baseline, linear relationship between ACC signal magnitude and Ps is affected by the presence of a voice disorder, with the relationship being participant-specific. Further work is needed to improve ACC-based prediction of Ps, across treatment, and during naturalistic speech production.
Collapse
Affiliation(s)
- Katherine L. Marks
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston
- Rehabilitation Science, MGH Institute of Health Professions, Boston, MA
| | - Jonathan Z. Lin
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston
| | - James A. Burns
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston
- Department of Surgery, Harvard Medical School, Boston, MA
| | - Tiffiny A. Hron
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston
- Department of Surgery, Harvard Medical School, Boston, MA
| | - Robert E. Hillman
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston
- Rehabilitation Science, MGH Institute of Health Professions, Boston, MA
- Department of Surgery, Harvard Medical School, Boston, MA
- Speech and Hearing Bioscience and Technology Program, Division of Medical Sciences, Harvard Medical School, Boston, MA
| | - Daryush D. Mehta
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston
- Rehabilitation Science, MGH Institute of Health Professions, Boston, MA
- Department of Surgery, Harvard Medical School, Boston, MA
- Speech and Hearing Bioscience and Technology Program, Division of Medical Sciences, Harvard Medical School, Boston, MA
| |
Collapse
|