1
|
Ghasemzadeh H, Hillman RE, Espinoza VM, Erath BD, Mehta DD. Vocal Fold Dissipated Power in Females with Hyperfunctional Voice Disorders. J Voice 2024:S0892-1997(24)00332-1. [PMID: 39426910 DOI: 10.1016/j.jvoice.2024.09.039] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2024] [Revised: 09/20/2024] [Accepted: 09/23/2024] [Indexed: 10/21/2024]
Abstract
OBJECTIVE Phonotrauma has been hypothesized to be associated with prolonged and/or accumulated biomechanical stress on vocal fold tissue. This hypothesis can be tested using ambulatory monitoring of vocal fold dissipated power, which requires a reliable method for its noninvasive estimation during the activity of daily living. The first aim of this study was to show that a laboratory-based estimate of vocal fold dissipated power computed from intraoral pressure (IOP) has significant discriminative power in individuals with phonotraumatic vocal hyperfunction (PVH). Considering that estimation of subglottal pressure from IOP is not practical for ambulatory applications, an alternative approach should be used. The second aim of this study was to test the impact of two alternative methods for the estimation of subglottal pressure on the discriminative power of vocal fold dissipated power in individuals with PVH and, hence, to provide an evidence-based recommendation for future ambulatory monitoring studies of vocal fold dissipated power. METHOD Four groups of adult females were included in this study: 16 individuals with PVH, 16 individuals with nonphonotraumatic vocal hyperfunction (NPVH), and two groups of vocally typical controls matched to the participants in each patient group in terms of age and occupation. Each participant produced strings of five consecutive /pae/ syllables while wearing a pneumotachograph mask with an IOP tube. Neck-surface accelerometer and acoustic signals were recorded simultaneously using an ambulatory voice monitor and a head-mounted microphone, respectively. IOP was used to estimate subglottal pressure and subject-specific calibration factors were determined for the estimation of subglottal pressure from the accelerometer signal. RESULTS (1) Individuals with PVH had significantly higher dissipated power than controls (P = 0.001, Cohen's D=1.31) when the intraoral estimate of subglottal pressure was used in the computation of dissipated power. (2) The difference between the dissipated power of individuals with NPVH and their matched controls was not significant. (3) When microphone-based sound pressure levels was used for the estimation of subglottal pressure, the difference between individuals with PVH and their matched controls vanished (P = 0.23). (4) When subject-specific estimation of subglottal pressure from the accelerometer was used, the discriminative power returned with a very large effect size (P = 0.001, D=1.38). CONCLUSION Increased dissipated power is sensitive and specific to individuals with PVH among individuals with hyperfunctional voice disorders. The results provide evidence that accelerometer-based estimate of energy dissipation dose (power integrated over time) during daily life could be clinically useful.
Collapse
Affiliation(s)
- Hamzeh Ghasemzadeh
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, Massachusetts; Department of Surgery, Harvard Medical School, Boston, Massachusetts; Department of Communicative Sciences and Disorders, Michigan State University, East Lansing, Michigan.
| | - Robert E Hillman
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, Massachusetts; Department of Surgery, Harvard Medical School, Boston, Massachusetts; Speech and Hearing Bioscience and Technology, Division of Medical Sciences, Harvard Medical School, Boston, Massachusetts; MGH Institute of Health Professions, Boston, Massachusetts
| | - Víctor M Espinoza
- Department of Sound, Faculty of Arts, University of Chile, Santiago, Chile
| | - Byron D Erath
- Department of Mechanical Engineering, Rochester Institute of Technology, Rochester, NY
| | - Daryush D Mehta
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, Massachusetts; Department of Surgery, Harvard Medical School, Boston, Massachusetts; Speech and Hearing Bioscience and Technology, Division of Medical Sciences, Harvard Medical School, Boston, Massachusetts; MGH Institute of Health Professions, Boston, Massachusetts
| |
Collapse
|
2
|
Van Stan JH, Hillman RE, Krusemark C, Muise J, Stadelman-Cohen T, Mehta DD, Sternad D. Floating Ball Voice Therapy: Preliminary Effects on Outcomes and Predicting Individual Patient Differences in Generalization. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2024; 67:3521-3535. [PMID: 39320344 PMCID: PMC11482575 DOI: 10.1044/2024_jslhr-23-00727] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/27/2023] [Revised: 05/14/2024] [Accepted: 07/23/2024] [Indexed: 09/26/2024]
Abstract
PURPOSE Floating ball voice therapy (FBVT) is a voice-controlled virtual environment based on a common treatment component across multiple evidence-based therapies: improved vocal efficiency (target) via practicing voicing with modified resonance and airflow (ingredient). This study preliminarily tested FBVT's effects on outcomes and the potential for its novel variability metrics to predict individual patient generalization. METHOD Ten patients with nonphonotraumatic vocal hyperfunction (NPVH) practiced FBVT for 10 days. Outcomes were assessed by a vocal efficiency ratio, a validated NPVH index, the patient-reported Voice-Related Quality of Life (V-RQOL), and forced-choice auditory judgments of overall severity. Exploration in early practice (Day 1) was estimated by how the patient's two-dimensional variability (mean airflow and intensity) related to error (difference between the patient-produced and normative vocal efficiency ratio). Generalization from the game to spontaneous speech was evaluated using the validated NPVH index. RESULTS Ten days of FBVT were associated with improved vocal efficiency (Cohen's d = 1.3), NPVH index (d = -1.1), V-RQOL total score (d = 0.9), and overall severity (odds ratio = 2.5). Patients who generalized on Day 10 exhibited airflow/intensity exploration that was more aligned with the error gradient on Day 1 (d = 0.6-1.2). CONCLUSIONS A relatively small dosage of FBVT (i.e., 10 practice sessions) was associated with multiple improved voice therapy outcomes. The FBVT variability metrics on Practice Day 1 demonstrated strong potential to predict which patients generalized to connected speech. Future work can more thoroughly evaluate effects on outcomes and characterizing the quality of vocal exploration with a larger patient population. SUPPLEMENTAL MATERIAL https://doi.org/10.23641/asha.27040873.
Collapse
Affiliation(s)
- Jarrad H. Van Stan
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| | - Robert E. Hillman
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| | - Carol Krusemark
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Jason Muise
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Tara Stadelman-Cohen
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Daryush D. Mehta
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| | | |
Collapse
|
3
|
Calvache Mora CA, Cantor-Cutiva LC, Hunter EJ, Guzmán M, Soláque L. Systematic Review of Literature on Vocal Demand Response: Understanding Physiology, Measurements, and Associated Factors. Folia Phoniatr Logop 2023; 76:1-21. [PMID: 37393892 DOI: 10.1159/000531678] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2022] [Accepted: 06/13/2023] [Indexed: 07/04/2023] Open
Abstract
PURPOSE Considering the conceptual migration from vocal load and vocal loading to vocal demand and vocal demand response, this review of literature aimed to identify physiological explanations, reported measurements, and associated factors (vocal demands) reported in the literature when considering the phonatory response to a vocal demand. METHODS A systematic review of literature, following the PRISMA Statement, was conducted using Web of Science, PubMed, Scopus, and ScienceDirect. Data were analyzed and presented in two parts. First, a bibliometric analysis, co-occurrence analysis, and content analysis were performed. Three criteria that got article inclusion were defined: (1) written in English, Spanish, and Portuguese; (2) published between 2009 and 2021; and (3) focused on vocal load and loading, vocal demand response, and voice assessment parameters. A total of 54 publications met the criteria and were included in this review. The second part included a conceptual framework based on the content analysis of three aspects of vocal demand response: (1) physiological explanations, (2) reported measurements, and (3) vocal demands. RESULTS AND CONCLUSION As would be expected since vocal demand response is a relatively new term and not yet commonly used in literature when discussing way that the speakers respond to communicative scenarios, most of the studies reviewed (both historical and recent) still use the term of vocal load and vocal loading. Although there is a broad variety of literature discussing a wide range of vocal demands and voice parameters used to characterize the vocal demand response, results show that there is consistency across the studies. While vocal demand response is unique and intrinsic to the talker, associated factors that contribute to this response include both internal talker and external talker factors. Internal factors include muscle stiffness, viscosity in the phonatory system, vocal fold tissue damage, elevated sound pressure levels during occupational voice demands, extended periods of voice use, suboptimal body posture, difficulties in breathing technique, and sleep disturbances. Associated external factors include the working environment (noise, acoustics, temperature, humidity). In conclusion, although vocal demand response is intrinsic to the speaker, the speaker's response is affected by external vocal demands. However, due to the wide methods to evaluate vocal demand response, it has been difficult to establish its contribution to voice disorders in the general population and, specifically, among occupational voice users. This literature review identified commonly reported parameters and factors that may help clinicians and researchers define vocal demand response.
Collapse
Affiliation(s)
- Carlos Alberto Calvache Mora
- Department of Communicative Sciences and Disorders, Corporación Universitaria Iberoamericana, Bogotá, Colombia
- Department of Mechatronics Engineering, Universidad Militar Nueva Granada, Bogotá, Colombia
- Vocology Center, Bogotá, Colombia
| | - Lady Catherine Cantor-Cutiva
- Vocology Center, Bogotá, Colombia
- Department of Communicative Sciences and Disorders, Michigan State University, East Lansing, Michigan, USA
| | - Eric J Hunter
- Department of Communicative Sciences and Disorders, Michigan State University, East Lansing, Michigan, USA
| | - Marco Guzmán
- Department of Communication Sciences and Disorders, Universidad de los Andes, Santiago, Chile
| | - Leonardo Soláque
- Department of Mechatronics Engineering, Universidad Militar Nueva Granada, Bogotá, Colombia
| |
Collapse
|
4
|
Bottalico P, Nudelman CJ. Do-It-Yourself Voice Dosimeter Device: A Tutorial and Performance Results. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2023:1-15. [PMID: 37263017 DOI: 10.1044/2023_jslhr-23-00060] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]
Abstract
PURPOSE Voice dosimeters gather voice production data in the daily lives of individuals with voice disorders. Additionally, voice dosimeters aid in understanding the pathophysiology of voice disorders. Previously, several voice dosimeters were commercially available. However, these devices have been discontinued and are not available to clinicians and researchers alike. In this tutorial, instructions for a low-cost, easy-to-assemble voice dosimeter are provided. This do-it-yourself (DIY) voice dosimeter is further validated based on performance results. METHOD Ten vocally healthy participants wore the DIY voice dosimeter. They produced a sustained /a/ vowel and read a text with three different vocal efforts. These tasks were recorded by the DIY voice dosimeter and a reference microphone simultaneously. The expanded uncertainty of the mean error in the estimation of four voice acoustic parameters as measured by the DIY dosimeter was performed by comparing the signals acquired through the reference microphone and the dosimeter. RESULTS For measures of sound pressure level, the DIY voice dosimeter had a mean error of -0.68 dB with an uncertainty of 0.56 dB. For fundamental frequency, the mean error was 1.56 Hz for female participants and 1.11 Hz for male participants, with an uncertainty of 0.62 Hz and 0.34 Hz for female and male participants, respectively. Cepstral peak prominence smoothed and L1 minus L2 had mean errors (uncertainty) of -0.06 dB (0.27 dB) and 2.20 dB (0.72 dB). CONCLUSION The mean error and uncertainties for the DIY voice dosimeter are comparable to those for the most accurate voice dosimeters that were previously on the market.
Collapse
Affiliation(s)
- Pasquale Bottalico
- Department of Speech and Hearing Science, University of Illinois at Urbana-Champaign
| | - Charles J Nudelman
- Department of Speech and Hearing Science, University of Illinois at Urbana-Champaign
| |
Collapse
|
5
|
Bullock L, Toles LE, Hillman RE, Mehta DD. Acoustic-Aerodynamic Voice Outcome Ratios Identify Changes in Vocal Function Following Vocal Fold Medialization for Unilateral Vocal Fold Paralysis. J Voice 2023:S0892-1997(23)00104-2. [PMID: 37068982 PMCID: PMC10576834 DOI: 10.1016/j.jvoice.2023.03.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2023] [Revised: 03/09/2023] [Accepted: 03/10/2023] [Indexed: 04/19/2023]
Abstract
PURPOSE This study aimed to determine whether ratio-based measures that combine acoustic (output) and aerodynamic (input) parameters detect postoperative change in vocal function following vocal fold medialization for unilateral vocal fold paralysis. METHOD Pre- and postoperative acoustic and aerodynamic measures were analyzed retrospectively from 149 patients who underwent vocal fold medialization for unilateral vocal fold paralysis. A 2 × 2 repeated-measures analysis of variance was conducted for each of four acoustic-aerodynamic ratios-traditional vocal efficiency (VE), sound pressure level to aerodynamic power (SPL/AP), SPL to average airflow (SPL/AFLOW), and SPL to subglottal pressure (SPL/Ps)-to investigate the main effects and interaction of treatment stage and loudness level (comfortable and loud). RESULTS The patient group showed significant postoperative improvements in self reports of vocal function (voice-related quality of life) and clinical auditory-perceptual judgments of dysphonia (consensus auditory-perceptual evaluation of voice). Main effects for both treatment stage and loudness level were statistically significant for all measures except SPL/Ps. There were interaction effects for VE and SPL/AP, suggesting that magnitude of the treatment effect differs based on loudness. SPL/AFLOW had medium-to-large effect sizes in both loudness conditions. There were postoperative changes in SPL/Ps that were dependent on the magnitude of the reduction in AFLOW; as expected, SPL/Ps increased postoperatively in a subgroup that had large postoperative reductions in AFLOW at the comfortable loudness level. CONCLUSIONS Acoustic-aerodynamic ratios can aid in tracking changes in vocal function following vocal fold medialization. SPL/AFLOW exhibited the largest effect size, which is expected since a reduction in abnormally high AFLOW typically accompanies the increased modulation of glottal air flow associated with successful vocal fold medialization. Future study is needed to model physiological changes in acoustic-aerodynamic voice outcome ratios across different types of voice disorders.
Collapse
Affiliation(s)
- Latané Bullock
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, Massachusetts; Division of Medical Sciences, Speech and Hearing Bioscience and Technology, Harvard Medical School, Boston, Massachusetts
| | - Laura E Toles
- Department of Otolaryngology-Head and Neck Surgery, The University of Texas Southwestern Medical Center, Dallas, Texas
| | - Robert E Hillman
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, Massachusetts; Division of Medical Sciences, Speech and Hearing Bioscience and Technology, Harvard Medical School, Boston, Massachusetts; Department of Surgery, Harvard Medical School, Boston, Massachusetts; School of Health and Rehabilitation Sciences, MGH Institute of Health Professions, Boston, Massachusetts
| | - Daryush D Mehta
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, Massachusetts; Division of Medical Sciences, Speech and Hearing Bioscience and Technology, Harvard Medical School, Boston, Massachusetts; Department of Surgery, Harvard Medical School, Boston, Massachusetts; School of Health and Rehabilitation Sciences, MGH Institute of Health Professions, Boston, Massachusetts.
| |
Collapse
|
6
|
Cortés JP, Lin JZ, Marks KL, Espinoza VM, Ibarra EJ, Zañartu M, Hillman RE, Mehta DD. Ambulatory Monitoring of Subglottal Pressure Estimated from Neck-Surface Vibration in Individuals with and without Voice Disorders. APPLIED SCIENCES (BASEL, SWITZERLAND) 2022; 12:10692. [PMID: 36777332 PMCID: PMC9910342 DOI: 10.3390/app122110692] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/18/2023]
Abstract
The aerodynamic voice assessment of subglottal air pressure can discriminate between speakers with typical voices from patients with voice disorders, with further evidence validating subglottal pressure as a clinical outcome measure. Although estimating subglottal pressure during phonation is an important component of a standard voice assessment, current methods for estimating subglottal pressure rely on non-natural speech tasks in a clinical or laboratory setting. This study reports on the validation of a method for subglottal pressure estimation in individuals with and without voice disorders that can be translated to connected speech to enable the monitoring of vocal function and behavior in real-world settings. During a laboratory calibration session, a participant-specific multiple regression model was derived to estimate subglottal pressure from a neck-surface vibration signal that can be recorded during natural speech production. The model was derived for vocally typical individuals and patients diagnosed with phonotraumatic vocal fold lesions, primary muscle tension dysphonia, and unilateral vocal fold paralysis. Estimates of subglottal pressure using the developed method exhibited significantly lower error than alternative methods in the literature, with average errors ranging from 1.13 to 2.08 cm H2O for the participant groups. The model was then applied during activities of daily living, thus yielding ambulatory estimates of subglottal pressure for the first time in these populations. Results point to the feasibility and potential of real-time monitoring of subglottal pressure during an individual's daily life for the prevention, assessment, and treatment of voice disorders.
Collapse
Affiliation(s)
- Juan P. Cortés
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA 02114, USA
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso 2390123, Chile
| | - Jon Z. Lin
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA 02114, USA
| | - Katherine L. Marks
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA 02114, USA
- Communication Sciences and Disorders, MGH Institute of Health Professions, Boston, MA 02129, USA
- Speech, Language & Hearing Sciences Department, College of Health & Rehabilitation: Sargent College, Boston University, Boston, MA 02215, USA
| | | | - Emiro J. Ibarra
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso 2390123, Chile
| | - Matías Zañartu
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso 2390123, Chile
| | - Robert E. Hillman
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA 02114, USA
- Communication Sciences and Disorders, MGH Institute of Health Professions, Boston, MA 02129, USA
- Department of Surgery, Massachusetts General Hospital–Harvard Medical School, Boston, MA 02114, USA
- Speech and Hearing Bioscience and Technology, Division of Medical Sciences, Harvard Medical School, Boston, MA 02115, USA
| | - Daryush D. Mehta
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA 02114, USA
- Communication Sciences and Disorders, MGH Institute of Health Professions, Boston, MA 02129, USA
- Department of Surgery, Massachusetts General Hospital–Harvard Medical School, Boston, MA 02114, USA
- Speech and Hearing Bioscience and Technology, Division of Medical Sciences, Harvard Medical School, Boston, MA 02115, USA
| |
Collapse
|
7
|
Llorente-Ortega M, Podhorski A, Fernandez S. Introducing a New Dosimeter for the Assessment and Monitoring of Vocal Risk Situations and Voice Disorders. J Voice 2022:S0892-1997(22)00239-9. [PMID: 36210222 DOI: 10.1016/j.jvoice.2022.08.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2022] [Revised: 08/01/2022] [Accepted: 08/03/2022] [Indexed: 11/06/2022]
Abstract
PURPOSE There are many physiological parameters recorded by devices that are becoming more affordable, precise and accurate. However, the lack of development in the recording of voice parameters from the physiological or medical point of view is striking, given that it is a fundamental tool for the work of many people and given the high incidence and prevalence of voice pathologies that affect people's communication. In this paper we perform a complete literature review on the dosimeters used in voice research and to present a prototype dosimeter with a pilot study to show its capabilities. METHOD We conducted a literature review using the keywords [MONITORING], [PHONATION], [ACCUMULATOR], [PORTABLE], [DOSIMETRY], [VOICE] searching in PubMed, Trip Database, HONcode, and SciELO search engines. From our review of dosimeter designs, we created our own prototype consisting of two main components: a Knowles Electronics BU-7135-0000 accelerometer mounted on a neck brace; and the ultra-low power MSP430FR5994 microcontroller. The selected sampling frequency was 2048 Hz. The device calculates the F0 every 250 ms and the amplitude and phonation activity every 31.25 ms. A pilot study was conducted using 2 subjects: one male during 11 days and one female during 14 days. RESULTS This work includes devices that have been created during the last 45 years as tools for the diagnosis and monitoring of the treatment of cases of vocal pathology and for the detection of phonatory patterns or risk situations for developing voice disorders or vocal pathologies. We also present recordings with our new device on the pattern of daily talk time, the fundamental frequency and the relative intensity of two subjects on different days. CONCLUSIONS Interesting work has been done in the development of voice dosimeters with different approaches. In our experience it is not possible to access them for research and they are not yet in clinical use. It is possible that a joint approach with voice and voice disorders professionals and engineers working closely together could take advantage of current technology to develop a fully portable, useful, and efficient system.
Collapse
Affiliation(s)
| | - Adam Podhorski
- Biomedical Engineering and Sciences Department, TECNUN, School of Engineering, University of Navarra, San Sebastian, Spain
| | - Secundino Fernandez
- Medical Engineering Laboratory, School of Medicine, University of Navarra, Spain; Voice Laboratory, Department of Otorhinolaryngology, School of Medicine. University of Navarra, Spain
| |
Collapse
|
8
|
Włodarczak M, Ludusan B, Sundberg J, Heldner M. Classification of voice quality using neck-surface acceleration: Comparison with glottal flow and radiated sound. J Voice 2022:S0892-1997(22)00198-9. [PMID: 36028369 DOI: 10.1016/j.jvoice.2022.06.034] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2022] [Revised: 06/30/2022] [Accepted: 06/30/2022] [Indexed: 10/15/2022]
Abstract
OBJECTIVES The aim of the present study is to investigate the usefulness of features extracted from miniature accelerometers attached to speaker's tracheal wall below the glottis for classification of phonation type. The performance of the accelerometer features is evaluated relative to features obtained from inverse filtered and radiated sound. While the former is a good proxy for the voice source, obtaining robust voice source features from the latter is considered difficult since it also contains information about the vocal tract filter. By contrast, the accelerometer signal is largely unaffected by the vocal tract and although it is shaped by subglottal resonances and the transfer properties of the neck tissue, these properties remain constant within a speaker. For this reason, we expect it to provide a better approximation of the voice source than the raw audio. We also investigate which aspects of the voice source are derivable from the accelerometer and microphone signals. METHODS Five trained singers (two females and three males) were recorded producing the syllable [pæ:] in three voice qualities (neutral, breathy and pressed) and at three pitch levels as determined by the participants' personal preference. Features extracted from the three signals were used for classification of phonation type using a random forest classifier. In addition, accelerometer and microphone features with highest correlation with the voice source features were identified. RESULTS The three signals showed comparable classification error rates, with considerable differences across speakers both with respect to the overall performance and the importance of individual features. The speaker-specific differences notwithstanding, variation of phonation type had consistent effects on the voice source, accelerometer and audio signals. With regard to the voice source, AQ, NAQ, L1L2 and CQ all showed a monotonic variation along the breathy - neutral - pressed continuum. Several features were also found to vary systematically in the accelerometer and audio signals: HRF, L1L2 and CPPS (both the accelerometer and the audio), as well as the sound level (for the audio). The random forest analysis revealed that all of these features were also among the most important for the classification of voice quality. CONCLUSION Both the accelerometer and the audio signals were found to discriminate between phonation types with an accuracy approaching that of the voice source. Thus, the accelerometer signal, which is largely uncontaminated by vocal tract resonances, offered no advantage over the signal collected with a normal microphone.
Collapse
Affiliation(s)
| | - Bogdan Ludusan
- Faculty of Linguistics and Literary Studies, Bielefeld University, Germany
| | - Johan Sundberg
- Department of Speech, Music and Hearing, KTH Royal Institute of Technology, Sweden
| | | |
Collapse
|
9
|
Lei Z, Martignetti L, Ridgway C, Peacock S, Sakata JT, Li-Jessen NYK. Wearable Neck Surface Accelerometers for Occupational Vocal Health Monitoring: Instrument and Analysis Validation Study. JMIR Form Res 2022; 6:e39789. [PMID: 35930317 PMCID: PMC9391979 DOI: 10.2196/39789] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2022] [Revised: 07/04/2022] [Accepted: 07/06/2022] [Indexed: 11/13/2022] Open
Abstract
BACKGROUND Neck surface accelerometer (NSA) wearable devices have been developed for voice and upper airway health monitoring. As opposed to acoustic sounds, NSA senses mechanical vibrations propagated from the vocal tract to neck skin, which are indicative of a person's voice and airway conditions. NSA signals do not carry identifiable speech information and a speaker's privacy is thus protected, which is important and necessary for continuous wearable monitoring. Our device was already tested for its durable endurance and signal processing algorithms in controlled laboratory conditions. OBJECTIVE This study aims to further evaluate both instrument and analysis validity in a group of occupational vocal users, namely, voice actors, who use their voices extensively at work in an ecologically valid setting. METHODS A total of 16 professional voice actors (age range 21-50 years; 11 females and 5 males) participated in this study. All participants were mounted with an NSA on their sternal notches during the voice acting and voice assessment sessions. The voice acting session was 4-hour long, directed by a voice director in a professional sound studio. Voice assessment sessions were conducted before, during, and 48 hours after the acting session. The assessment included phonation tasks of passage reading, sustained vowels, maximum vowel phonation, and pitch glides. Clinical acoustic metrics (eg, fundamental frequency, cepstral measures) and a vocal dose measure (ie, accumulated distance dose from acting) were computed from NSA signals. A commonly used online questionnaire (Self-Administered Voice Rating questionnaire) was also implemented to track participants' perception of vocal fatigue. RESULTS The NSA wearables stayed in place for all participants despite active body movements during the acting. The ensued body noise did not interfere with the NSA signal quality. All planned acoustic metrics were successfully derived from NSA signals and their numerical values were comparable with literature data. For a 4-hour long voice acting, the averaged distance dose was about 8354 m with no gender differences. Participants perceived vocal fatigue as early as 2 hours after the start of voice acting, with recovery 24-48 hours after the acting session. Among all acoustic metrics across phonation tasks, cepstral peak prominence and spectral tilt from the passage reading most closely mirrored trends in perceived fatigue. CONCLUSIONS The ecological validity of an in-house NSA wearable was vetted in a workplace setting. One key application of this wearable is to prompt occupational voice users when their vocal safety limits are reached for duly protection. Signal processing algorithms can thus be further developed for near real-time estimation of clinically relevant metrics, such as accumulated distance dose, cepstral peak prominence, and spectral tilt. This functionality will enable continuous self-awareness of vocal behavior and protection of vocal safety in occupational voice users.
Collapse
Affiliation(s)
- Zhengdong Lei
- School of Communication Sciences and Disorders, McGill University, Montreal, QC, Canada
| | - Lisa Martignetti
- School of Communication Sciences and Disorders, McGill University, Montreal, QC, Canada
| | - Chelsea Ridgway
- School of Medicine, University of Montreal, Quebec, QC, Canada
| | - Simon Peacock
- The Alliance of Canadian Cinema, Television and Radio Artists - Montreal, Montreal, QC, Canada
| | - Jon T Sakata
- Department of Biology, McGill University, Montreal, QC, Canada
- The Centre for Research on Brain, Language and Music, McGill University, Montreal, QC, Canada
| | - Nicole Y K Li-Jessen
- School of Communication Sciences and Disorders, McGill University, Montreal, QC, Canada
- The Centre for Research on Brain, Language and Music, McGill University, Montreal, QC, Canada
- Department of Biomedical Engineering, McGill University, Montreal, QC, Canada
- Department of Otolaryngology - Head and Neck Surgery, McGill University, Montreal, QC, Canada
- Research Institute of McGill University Health Center, Montreal, QC, Canada
| |
Collapse
|
10
|
Abur D, Perkell JS, Stepp CE. Impact of Vocal Effort on Respiratory and Articulatory Kinematics. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2022; 65:5-21. [PMID: 34843405 PMCID: PMC9150749 DOI: 10.1044/2021_jslhr-21-00323] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/10/2021] [Revised: 07/27/2021] [Accepted: 08/24/2021] [Indexed: 06/13/2023]
Abstract
PURPOSE The goal of this study was to examine the effects of increases in vocal effort, without changing speech intensity, on respiratory and articulatory kinematics in young adults with typical voices. METHOD A total of 10 participants completed a reading task under three speaking conditions: baseline, mild vocal effort, and maximum vocal effort. Respiratory inductance plethysmography bands around the chest and abdomen were used to estimate lung volumes during speech, and sensor coils for electromagnetic articulography were used to transduce articulatory movements, resulting in the following outcome measures: lung volume at speech initiation (LVSI) and at speech termination (LVST), articulatory kinematic vowel space (AKVS) of two points on the tongue dorsum (body and blade), and lip aperture. RESULTS With increases in vocal effort, and no statistical changes in speech intensity, speakers showed: (a) no statistically significant differences in LVST, (b) statistically significant increases in LVSI, (c) no statistically significant differences in AKVS measures, and (d) statistically significant reductions in lip aperture. CONCLUSIONS Speakers with typical voices exhibited larger lung volumes at speech initiation during increases in vocal effort, paired with reduced lip displacements. To our knowledge, this is the first study to demonstrate evidence that articulatory kinematics are impacted by modulations in vocal effort. However, the mechanisms underlying vocal effort may differ between speakers with and without voice disorders. Thus, future work should examine the relationship between articulatory kinematics, respiratory kinematics, and laryngeal-level changes during vocal effort in speakers with and without voice disorders. SUPPLEMENTAL MATERIAL https://doi.org/10.23641/asha.17065457.
Collapse
Affiliation(s)
- Defne Abur
- Department of Speech, Language and Hearing Sciences, Boston University, MA
| | - Joseph S. Perkell
- Department of Speech, Language and Hearing Sciences, Boston University, MA
- Research Laboratory of Electronics, Massachusetts Institute of Technology, Cambridge
| | - Cara E. Stepp
- Department of Speech, Language and Hearing Sciences, Boston University, MA
- Department of Biomedical Engineering, Boston University, MA
- Department of Otolaryngology-Head & Neck Surgery, Boston University School of Medicine, MA
| |
Collapse
|
11
|
Van Stan JH, Ortiz AJ, Marks KL, Toles LE, Mehta DD, Burns JA, Hron T, Stadelman-Cohen T, Krusemark C, Muise J, Fox AB, Nudelman C, Zeitels S, Hillman RE. Changes in the Daily Phonotrauma Index Following the Use of Voice Therapy as the Sole Treatment for Phonotraumatic Vocal Hyperfunction in Females. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2021; 64:3446-3455. [PMID: 34463536 PMCID: PMC8642084 DOI: 10.1044/2021_jslhr-21-00082] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/12/2021] [Revised: 04/26/2021] [Accepted: 05/24/2021] [Indexed: 05/31/2023]
Abstract
Purpose The aim of this study was to use the Daily Phonotrauma Index (DPI) to quantify group-based changes in the daily voice use of patients with phonotraumatic vocal hyperfunction (PVH) after receiving voice therapy as the sole treatment. This is part of an ongoing effort to validate an updated theoretical framework for PVH. Method A custom-designed ambulatory voice monitor was used to collect 1 week of pre- and posttreatment data from 52 female patients with PVH. Normative weeklong data were also obtained from 52 matched controls. Each week was represented by the DPI, which is a combination of neck-surface acceleration magnitude skewness and the standard deviation of the difference between the first and second harmonic magnitudes. Results Compared to pretreatment, the DPI statistically decreased towards normal in the patient group after treatment (Cohen's d = -0.25). The posttreatment patient group's DPI was still significantly higher than the control group (d = 0.68). Conclusions The DPI showed the pattern of improved ambulatory voice use in a group of patients with PVH following voice therapy that was predicted by the updated theoretical framework. Per the prediction, voice therapy was associated with a decreased potential for phonotrauma in daily voice use, but the posttreatment patient group data were still significantly different from the normative control group data. This posttreatment difference is interpreted as reflecting the impact on voice use of the persistence of phonotrauma-induced structural changes to the vocal folds. Further validation of the DPI is needed to better understand its potential clinical use.
Collapse
Affiliation(s)
- Jarrad H. Van Stan
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| | | | - Katherine L. Marks
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Laura E. Toles
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Daryush D. Mehta
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| | - James A. Burns
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| | - Tiffiny Hron
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| | - Tara Stadelman-Cohen
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Carol Krusemark
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Jason Muise
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | | | - Charles Nudelman
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Steven Zeitels
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| | - Robert E. Hillman
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| |
Collapse
|
12
|
Mehta DD, Kobler JB, Zeitels SM, Zañartu M, Ibarra EJ, Alzamendi GA, Manriquez R, Erath BD, Peterson SD, Petrillo RH, Hillman RE. Direct measurement and modeling of intraglottal, subglottal, and vocal fold collision pressures during phonation in an individual with a hemilaryngectomy. APPLIED SCIENCES (BASEL, SWITZERLAND) 2021; 11:7256. [PMID: 36210866 PMCID: PMC9541559 DOI: 10.3390/app11167256] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
The purpose of this paper is to report on the first in vivo application of a recently developed transoral, dual-sensor pressure probe that directly measures intraglottal, subglottal, and vocal fold collision pressures during phonation. Synchronous measurement of intraglottal and subglottal pressures was accomplished using two miniature pressure sensors mounted on the end of the probe and inserted transorally in a 78-year-old male who had previously undergone surgical removal of his right vocal fold for treatment of laryngeal cancer. The endoscopist used one hand to position the custom probe against the surgically medialized scar band that replaced the right vocal fold and used the other hand to position a transoral endoscope to record laryngeal high-speed videoendoscopy of the vibrating left vocal fold contacting the pressure probe. Visualization of the larynx during sustained phonation allowed the endoscopist to place the dual-sensor pressure probe such that the proximal sensor was positioned intraglottally and the distal sensor subglottally. The proximal pressure sensor was verified to be in the strike zone of vocal fold collision during phonation when the intraglottal pressure signal exhibited three characteristics: an impulsive peak at the start of the closed phase, rounded peak during the open phase, and minimum value around zero immediately preceding the impulsive peak of the subsequent phonatory cycle. Numerical voice production modeling was applied to validate model-based predictions of vocal fold collision pressure using kinematic vocal fold measures. The results successfully demonstrated feasibility of in vivo measurement of vocal fold collision pressure in an individual with a hemilaryngectomy, motivating ongoing data collection that is designed to aid in the development of vocal dose measures that incorporate vocal fold impact collision and stresses.
Collapse
Affiliation(s)
- Daryush D. Mehta
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA, USA
- Department of Surgery, Massachusetts General Hospital–Harvard Medical School, Boston, MA
- Speech and Hearing Bioscience and Technology, Division of Medical Sciences, Harvard Medical School, Boston, MA, USA
- MGH Institute of Health Professions, Boston, MA, USA
| | - James B. Kobler
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA, USA
- Department of Surgery, Massachusetts General Hospital–Harvard Medical School, Boston, MA
- Speech and Hearing Bioscience and Technology, Division of Medical Sciences, Harvard Medical School, Boston, MA, USA
| | - Steven M. Zeitels
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA, USA
- Department of Surgery, Massachusetts General Hospital–Harvard Medical School, Boston, MA
- Speech and Hearing Bioscience and Technology, Division of Medical Sciences, Harvard Medical School, Boston, MA, USA
| | - Matías Zañartu
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso, Chile
| | - Emiro J. Ibarra
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso, Chile
| | - Gabriel A. Alzamendi
- Institute for Research and Development on Bioengineering and Bioinformatics, National University of Entre Rios–CONICET, Entre Ríos, Argentina
| | - Rodrigo Manriquez
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso, Chile
| | - Byron D. Erath
- Department of Mechanical & Aeronautical Engineering, Clarkson University, Potsdam, NY, USA
| | - Sean D. Peterson
- Department of Mechanical and Mechatronics Engineering, University of Waterloo, Ontario, Canada
| | - Robert H. Petrillo
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA, USA
| | - Robert E. Hillman
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA, USA
- Department of Surgery, Massachusetts General Hospital–Harvard Medical School, Boston, MA
- Speech and Hearing Bioscience and Technology, Division of Medical Sciences, Harvard Medical School, Boston, MA, USA
- MGH Institute of Health Professions, Boston, MA, USA
| |
Collapse
|
13
|
Umatani M, Ogawa M, Hosokawa K, Kato C, Okajima E, Iwahashi T, Inohara H. Skin Acceleration Levels Estimated by a Neck-surface Accelerometer during Phonation Are Affected by The Mechanical Properties of The Anterior Cervical Skin. J Voice 2021:S0892-1997(21)00111-9. [PMID: 34011459 DOI: 10.1016/j.jvoice.2021.03.016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2021] [Revised: 03/25/2021] [Accepted: 03/30/2021] [Indexed: 10/21/2022]
Abstract
OBJECTIVES The aim of the present study was to assess whether skin acceleration levels (SAL) estimated by a neck-surface accelerometer (ACC) are affected by the anterior neck skin condition. METHODS Each of six healthy non-dysphonic participants wore a headset microphone (MIC), had an ACC sensor fitted on the skin over the cervical trachea, and were subsequently asked to gradually increase the vocal loudness during sustained phonation of the vowel /e:/ (crescendo task), while the sound pressure levels on a sound level meter (SPLSLM) and MIC/ACC signals were simultaneously recorded. Root mean squared values were calculated from the MIC and ACC signals as the sound pressure level (SPLMIC) and SAL, respectively, and the relationships between SPLSLM and SAL were compared between neck anteflexion and retroflexion or between outward and inward skin retraction on both sides of the sensor. RESULTS In the total samples for the successful crescendo performance in a natural head position, the SPLMIC and SPLSLM showed a strong linear correlation (r=0.980), whereas the correlation between the SAL and SPLSLM showed a distorted regression line (r=0.765) with individual differences. In all participants, the anteflexion and inward skin retraction decreased the SAL value at the same SPLSLM value, whereas the retroflexion increased the SAL value at the same SPLSLM value. CONCLUSION These results demonstrate that the signal intensity of a neck-surface ACC is affected by the condition of the anterior cervical skin, perhaps leading to inter-individual variability in SAL measurements.
Collapse
Affiliation(s)
- Masanori Umatani
- Department of Otorhinolaryngology - Head and Neck Surgery, Osaka University Graduate School of Medicine, Suita, Japan
| | - Makoto Ogawa
- Department of Otorhinolaryngology - Head and Neck Surgery, Osaka University Graduate School of Medicine, Suita, Japan; Department of Otorhinolaryngology, JCHO Osaka Hospital, Osaka, Japan.
| | - Kiyohito Hosokawa
- Department of Otorhinolaryngology - Head and Neck Surgery, Osaka University Graduate School of Medicine, Suita, Japan
| | - Chieri Kato
- Department of Otorhinolaryngology - Head and Neck Surgery, Osaka University Graduate School of Medicine, Suita, Japan
| | - Eri Okajima
- Department of Otorhinolaryngology - Head and Neck Surgery, Osaka University Graduate School of Medicine, Suita, Japan
| | - Toshihiko Iwahashi
- Department of Otorhinolaryngology - Head and Neck Surgery, Osaka University Graduate School of Medicine, Suita, Japan
| | - Hidenori Inohara
- Department of Otorhinolaryngology - Head and Neck Surgery, Osaka University Graduate School of Medicine, Suita, Japan
| |
Collapse
|
14
|
Van Stan JH, Ortiz AJ, Cortes JP, Marks KL, Toles LE, Mehta DD, Burns JA, Hron T, Stadelman-Cohen T, Krusemark C, Muise J, Fox-Galalis AB, Nudelman C, Zeitels S, Hillman RE. Differences in Daily Voice Use Measures Between Female Patients With Nonphonotraumatic Vocal Hyperfunction and Matched Controls. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2021; 64:1457-1470. [PMID: 33900807 PMCID: PMC8608188 DOI: 10.1044/2021_jslhr-20-00538] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/15/2023]
Abstract
Purpose The purpose of this study was to obtain a more comprehensive understanding of the pathophysiology and impact on daily voice use of nonphonotraumatic vocal hyperfunction (NPVH). Method An ambulatory voice monitor collected 1 week of data from 36 patients with NPVH and 36 vocally healthy matched controls. A subset of 11 patients with NPVH were monitored after voice therapy. Daily voice use measures included neck-skin acceleration magnitude, fundamental frequency (f o), cepstral peak prominence (CPP), and the difference between the first and second harmonic magnitudes (H1-H2). Additional comparisons included 118 patients with phonotraumatic vocal hyperfunction (PVH) and 89 additional vocally healthy controls. Results The NPVH group, compared to the matched control group, exhibited increased f o (Cohen's d = 0.6), reduced CPP (d = -0.9), and less positive H1-H2 skewness (d = -1.1). Classifiers used CPP mean and H1-H2 mode to maximally differentiate the NPVH and matched control groups (area under the receiver operating characteristic curve of 0.78). Classifiers performed well on unseen data: the logit decreased in patients with NPVH after therapy; ≥ 85% of the control and PVH groups were identified as "normal" or "not NPVH," respectively. Conclusions The NPVH group's daily voice use is less periodic (CPP), is higher pitched (f o), and has less abrupt vocal fold closure (H1-H2 skew) compared to the matched control group. The combination of CPP mean and H1-H2 mode appears to reflect a pathophysiological continuum in NPVH patients of inefficient phonation with minimal potential for phonotrauma. Further validation of the classification model is needed to better understand potential clinical uses. Supplemental Material https://doi.org/10.23641/asha.14390771.
Collapse
Affiliation(s)
- Jarrad H. Van Stan
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| | | | - Juan P. Cortes
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| | - Katherine L. Marks
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Laura E. Toles
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Daryush D. Mehta
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| | - James A. Burns
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| | - Tiffiny Hron
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| | - Tara Stadelman-Cohen
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Carol Krusemark
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Jason Muise
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | | | - Charles Nudelman
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Steven Zeitels
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| | - Robert E. Hillman
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| |
Collapse
|
15
|
Vojtech JM, Cilento DD, Luong AT, Noordzij JP, Diaz-Cadiz M, Groll MD, Buckley DP, McKenna VS, Noordzij JP, Stepp CE. Acoustic Identification of the Voicing Boundary during Intervocalic Offsets and Onsets based on Vocal Fold Vibratory Measures. APPLIED SCIENCES (BASEL, SWITZERLAND) 2021; 11:3816. [PMID: 36188437 PMCID: PMC9524108 DOI: 10.3390/app11093816] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/26/2023]
Abstract
Methods for automating relative fundamental frequency (RFF)-an acoustic estimate of laryngeal tension-rely on manual identification of voiced/unvoiced boundaries from acoustic signals. This study determined the effect of incorporating features derived from vocal fold vibratory transitions for acoustic boundary detection. Simultaneous microphone and flexible nasendoscope recordings were collected from adults with typical voices (N=69) and with voices characterized by excessive laryngeal tension (N=53) producing voiced-unvoiced-voiced utterances. Acoustic features that coincided with vocal fold vibratory transitions were identified and incorporated into an automated RFF algorithm ("aRFF-APH"). Voiced/unvoiced boundary detection accuracy was compared between the aRFF-APH algorithm, a recently published version of the automated RFF algorithm ("aRFF-AP"), and gold-standard, manual RFF estimation. Chi-square tests were performed to characterize differences in boundary cycle identification accuracy among the three RFF estimation methods. Voiced/unvoiced boundary detection accuracy significantly differed by RFF estimation method for voicing offsets and onsets. Of 7721 productions, 76.0% of boundaries were accurately identified via the aRFF-APH algorithm, compared to 70.3% with the aRFF-AP algorithm and 20.4% with manual estimation. Incorporating acoustic features that corresponded with voiced/unvoiced boundaries led to improvements in boundary detection accuracy that surpassed the gold-standard method for calculating RFF.
Collapse
Affiliation(s)
- Jennifer M. Vojtech
- Department of Biomedical Engineering, Boston University, Boston, MA 02215, USA
- Department of Speech, Language, and Hearing Sciences, Boston University, Boston, MA 02215, USA
- Delsys, Inc. and Altec, Inc., Natick, MA, 01760, USA
| | - Dante D. Cilento
- Department of Speech, Language, and Hearing Sciences, Boston University, Boston, MA 02215, USA
| | - Austin T. Luong
- Department of Speech, Language, and Hearing Sciences, Boston University, Boston, MA 02215, USA
| | - Jacob P. Noordzij
- Department of Speech, Language, and Hearing Sciences, Boston University, Boston, MA 02215, USA
| | - Manuel Diaz-Cadiz
- Department of Speech, Language, and Hearing Sciences, Boston University, Boston, MA 02215, USA
| | - Matti D. Groll
- Department of Biomedical Engineering, Boston University, Boston, MA 02215, USA
- Department of Speech, Language, and Hearing Sciences, Boston University, Boston, MA 02215, USA
| | - Daniel P. Buckley
- Department of Speech, Language, and Hearing Sciences, Boston University, Boston, MA 02215, USA
- Department of Otolaryngology – Head and Neck Surgery, Boston University School of Medicine, Boston, Massachusetts, USA
| | - Victoria S. McKenna
- Department of Speech, Language, and Hearing Sciences, Boston University, Boston, MA 02215, USA
| | - J. Pieter Noordzij
- Department of Otolaryngology – Head and Neck Surgery, Boston University School of Medicine, Boston, Massachusetts, USA
| | - Cara E. Stepp
- Department of Biomedical Engineering, Boston University, Boston, MA 02215, USA
- Department of Speech, Language, and Hearing Sciences, Boston University, Boston, MA 02215, USA
- Department of Otolaryngology – Head and Neck Surgery, Boston University School of Medicine, Boston, Massachusetts, USA
| |
Collapse
|
16
|
Umatani M, Ogawa M, Hosokawa K, Kato C, Okajima E, Iwahashi T, Inohara H. The Feasibility of Gastroesophageal Manometry for Continuously Evaluating the Degree of Expiratory Effort During Successful Crescendo Phonation. J Voice 2021; 37:470.e7-470.e16. [PMID: 33707030 DOI: 10.1016/j.jvoice.2021.02.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2020] [Revised: 01/23/2021] [Accepted: 02/01/2021] [Indexed: 10/22/2022]
Abstract
OBJECTIVES This study aimed to assess the feasibility of gastroesophageal manometry for continuously evaluating the degree of expiratory effort by measuring the pressures in the digestive tract during crescendo phonation. METHODS Each of 18 healthy nondysphonic speakers had a probe with a four-channel gastroesophageal manometer inserted through the nasal cavity to place four pressure sensors in the hypopharynx, cervical-/thoracic esophagus, and stomach, and was asked to gradually increase the vocal loudness during sustained phonation of the vowel /e:/ (vowel-crescendo task), while the sound pressure level and the pressures were simultaneously recorded. RESULTS 50% of the successful vowel-crescendo task samples with a gradual and adequate sound pressure level increase showed a concomitant gradual increase in both the intra-thoracic-esophageal/intra-gastric pressure values from approximately -5 mmHg /6 mmHg to -10 mmHg/20 mmHg, respectively. The maximum pressure value was the highest in the intra-gastric pressure followed by the intra-thoracic-esophageal and intra-cervical-esophageal pressures in order. However, most of the samples showed less than one of atypical pressure changes, such as fluctuations in the intra-thoracic-esophageal and intra-gastric pressure changes and dispersion in the intra-cervical-esophageal and intra-hypopharyngeal pressure values (perhaps due to the peristaltic motions, and the contact of the sensors to the membranous wall). CONCLUSION These results show that, during successful crescendo phonation, gastroesophageal manometry reveals a gradual increase in the intra-thoracic and intra-abdominal pressures with increasing the vocal intensity, even though showing some systematic errors, suggesting the usefulness of gastroesophageal manometry for continuously evaluating the degree of expiratory effort without influence by the laryngeal condition.
Collapse
Affiliation(s)
- Masanori Umatani
- Department of Otorhinolaryngology - Head and Neck Surgery, Osaka University Graduate School of Medicine, Suita, Japan
| | - Makoto Ogawa
- Department of Otorhinolaryngology - Head and Neck Surgery, Osaka University Graduate School of Medicine, Suita, Japan; Department of Otorhinolaryngology, JCHO Osaka Hospital, Osaka, Japan.
| | - Kiyohito Hosokawa
- Department of Otorhinolaryngology - Head and Neck Surgery, Osaka University Graduate School of Medicine, Suita, Japan
| | - Chieri Kato
- Department of Otorhinolaryngology - Head and Neck Surgery, Osaka University Graduate School of Medicine, Suita, Japan
| | - Eri Okajima
- Department of Otorhinolaryngology - Head and Neck Surgery, Osaka University Graduate School of Medicine, Suita, Japan
| | - Toshihiko Iwahashi
- Department of Otorhinolaryngology - Head and Neck Surgery, Osaka University Graduate School of Medicine, Suita, Japan
| | - Hidenori Inohara
- Department of Otorhinolaryngology - Head and Neck Surgery, Osaka University Graduate School of Medicine, Suita, Japan
| |
Collapse
|
17
|
Toles LE, Ortiz AJ, Marks KL, Burns JA, Hron T, Van Stan JH, Mehta DD, Hillman RE. Differences Between Female Singers With Phonotrauma and Vocally Healthy Matched Controls in Singing and Speaking Voice Use During 1 Week of Ambulatory Monitoring. AMERICAN JOURNAL OF SPEECH-LANGUAGE PATHOLOGY 2021; 30:199-209. [PMID: 33472007 PMCID: PMC8740583 DOI: 10.1044/2020_ajslp-20-00227] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/15/2023]
Abstract
Purpose Previous ambulatory voice monitoring studies have included many singers and have combined speech and singing in the analyses. This study applied a singing classifier to the ambulatory recordings of singers with phonotrauma and healthy controls to determine if analyzing speech and singing separately would reveal voice use differences that could provide new insights into the etiology and pathophysiology of phonotrauma in this at-risk population. Method Forty-two female singers with phonotrauma (vocal fold nodules or polyps) and 42 healthy matched controls were monitored using an ambulatory voice monitor. Weeklong statistics (average, standard deviation, skewness, kurtosis) for sound pressure level (SPL), fundamental frequency, cepstral peak prominence, the magnitude ratio of the first two harmonics (H1-H2 ), and three vocal dose measures were computed from the neck surface acceleration signal and separated into singing and speech using a singing classifier. Results Mixed analysis of variance models found expected differences between singing and speech in each voice parameter, except SPL kurtosis. SPL skewness, SPL kurtosis, and all H1-H2 distributional parameters differentiated patients and controls when singing and speech were combined. Interaction effects were found in H1-H2 kurtosis and all vocal dose measures. Patients had significantly higher vocal doses in speech compared to controls. Conclusions Consistent with prior work, the pathophysiology of phonotrauma in singers is characterized by more abrupt/complete glottal closure (decreased mean and variation for H1-H2 ) and increased laryngeal forces (negatively skewed SPL distribution) during phonation. Application of a singing classifier to weeklong data revealed that singers with phonotrauma spent more time speaking on a weekly basis, but not more time singing, compared to controls. Results are used as a basis for hypothesizing about the role of speaking voice in the etiology of phonotraumatic vocal hyperfunction in singers.
Collapse
Affiliation(s)
- Laura E. Toles
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | | | - Katherine L. Marks
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - James A. Burns
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| | - Tiffiny Hron
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| | - Jarrad H. Van Stan
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
- Harvard Medical School, Boston, MA
| | - Daryush D. Mehta
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
- Harvard Medical School, Boston, MA
| | - Robert E. Hillman
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
- Harvard Medical School, Boston, MA
| |
Collapse
|
18
|
Van Stan JH, Mehta DD, Ortiz AJ, Burns JA, Marks KL, Toles LE, Stadelman-Cohen T, Krusemark C, Muise J, Hron T, Zeitels SM, Fox AB, Hillman RE. Changes in a Daily Phonotrauma Index After Laryngeal Surgery and Voice Therapy: Implications for the Role of Daily Voice Use in the Etiology and Pathophysiology of Phonotraumatic Vocal Hyperfunction. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2020; 63:3934-3944. [PMID: 33197360 PMCID: PMC8608140 DOI: 10.1044/2020_jslhr-20-00168] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/15/2023]
Abstract
Purpose This study attempts to gain insights into the role of daily voice use in the etiology and pathophysiology of phonotraumatic vocal hyperfunction (PVH) by applying a logistic regression-based daily phonotrauma index (DPI) to predict group-based improvements in patients with PVH after laryngeal surgery and/or postsurgical voice therapy. Method A custom-designed ambulatory voice monitor was used to collect 1 week of pre- and postsurgery data from 27 female patients with PVH; 13 of these patients were also monitored after postsurgical voice therapy. Normative weeklong data were obtained from 27 matched controls. Each week was represented by the DPI, standard deviation of the difference between the first and second harmonic amplitudes (H1-H2). Results Compared to pretreatment, the DPI significantly decreased in the patient group after surgery (Cohen's d effect size = -0.86) and voice therapy (d = -1.06). The patient group DPI only normalized after voice therapy. Conclusions The DPI produced the expected pattern of improved ambulatory voice use across laryngeal surgery and postsurgical voice therapy in a group of patients with PVH. The results were interpreted as providing new objective information about the role of daily voice use in the etiology and pathophysiology of PVH. The DPI is viewed as an estimate of potential vocal fold trauma that relies on combining the long-term distributional characteristics of two parameters representing the magnitude of phonatory forces (neck-surface acceleration magnitude) and vocal fold closure dynamics (H1-H2). Further validation of the DPI is needed to better understand its potential clinical use.
Collapse
Affiliation(s)
- Jarrad H. Van Stan
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| | - Daryush D. Mehta
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| | | | - James A. Burns
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| | - Katherine L. Marks
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Laura E. Toles
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Tara Stadelman-Cohen
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Carol Krusemark
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Jason Muise
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Tiffiny Hron
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| | - Steven M. Zeitels
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| | | | - Robert E. Hillman
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| |
Collapse
|
19
|
Espinoza VM, Mehta DD, Van Stan JH, Hillman RE, Zañartu M. Glottal Aerodynamics Estimated From Neck-Surface Vibration in Women With Phonotraumatic and Nonphonotraumatic Vocal Hyperfunction. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2020; 63:2861-2869. [PMID: 32755502 PMCID: PMC7890221 DOI: 10.1044/2020_jslhr-20-00189] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/20/2020] [Revised: 05/29/2020] [Accepted: 06/10/2020] [Indexed: 05/06/2023]
Abstract
Purpose The purpose of this study was to determine whether estimates of glottal aerodynamic measures based on neck-surface vibration are comparable to those previously obtained using oral airflow and air pressure signals (Espinoza et al., 2017) in terms of discriminating patients with phonotraumatic and nonphonotraumatic vocal hyperfunction (PVH and NPVH) from vocally healthy controls. Method Consecutive /pae/ syllables at comfortable and loud level were produced by 16 women with PVH (organic vocal fold lesions), 16 women with NPVH (primary muscle tension dysphonia), and 32 vocally healthy women who were each matched to a patient according to age and occupation. Subglottal impedance-based inverse filtering of the anterior neck-surface accelerometer (ACC) signal yielded estimates of peak-to-peak glottal airflow, open quotient, and maximum flow declination rate. Average subglottal pressure and microphone-based sound pressure level (SPL) were also estimated from the ACC signal using subject-specific linear regression models. The ACC-based measures of glottal aerodynamics were normalized for SPL and statistically compared between each patient and matched-control group. Results Patients with PVH and NPVH exhibited lower SPL-normalized glottal aerodynamics values than their respective control subjects (p values ranging from < .01 to .07) with very large effect sizes (1.04-2.16), regardless of loudness condition or measurement method (i.e., ACC-based values maintained discriminatory power). Conclusions The results of this study demonstrate that ACC-based estimates of most glottal aerodynamic measures are comparable to those previously obtained from oral airflow and air pressure (Espinoza et al., 2017) in terms of differentiating between hyperfunctional (PVH and NPVH) and normal vocal function. ACC-based estimates of glottal aerodynamic measures may be used to assess vocal function during continuous speech and enables this assessment of daily voice use during ambulatory monitoring to provide better insight into the pathophysiological mechanisms associated with vocal hyperfunction.
Collapse
Affiliation(s)
| | - Daryush D. Mehta
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston
- Speech and Hearing Bioscience and Technology Program, Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| | - Jarrad H. Van Stan
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston
- Speech and Hearing Bioscience and Technology Program, Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| | - Robert E. Hillman
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston
- Speech and Hearing Bioscience and Technology Program, Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| | - Matías Zañartu
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso, Chile
| |
Collapse
|
20
|
Marks KL, Lin JZ, Burns JA, Hron TA, Hillman RE, Mehta DD. Estimation of Subglottal Pressure From Neck Surface Vibration in Patients With Voice Disorders. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2020; 63:2202-2218. [PMID: 32610028 PMCID: PMC7838842 DOI: 10.1044/2020_jslhr-19-00409] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/20/2019] [Revised: 03/12/2020] [Accepted: 04/15/2020] [Indexed: 05/15/2023]
Abstract
Purpose Given the established linear relationship between neck surface vibration magnitude and mean subglottal pressure (Ps) in vocally healthy speakers, the purpose of this study was to better understand the impact of the presence of a voice disorder on this baseline relationship. Method Data were obtained from participants with voice disorders representing a variety of glottal conditions, including phonotraumatic vocal hyperfunction, nonphonotraumatic vocal hyperfunction, and unilateral vocal fold paralysis. Participants were asked to repeat /p/-vowel syllable strings from loud-to-soft loudness levels in multiple vowel contexts (/pa/, /pi/, /pu/) and pitch levels (comfortable, higher than comfortable, lower than comfortable). Three statistical metrics were computed to analyze the regression line between neck surface accelerometer (ACC) signal magnitude and Ps within and across pitch, vowel, and voice disorder category: coefficient of determination (r 2), slope, and intercept. Three linear mixed-effects models were used to evaluate the impact of voice disorder category, pitch level, and vowel context on the relationship between ACC signal magnitude and Ps. Results The relationship between ACC signal magnitude and Ps was statistically different in patients with voice disorders than in vocally healthy controls; patients exhibited higher levels of Ps given similar values of ACC signal magnitude. Negligible effects were found for pitch condition within each voice disorder category, and negligible-to-small effects were found for vowel context. The mean of patient-specific r 2 values was .63, ranging from .13 to .92. Conclusions The baseline, linear relationship between ACC signal magnitude and Ps is affected by the presence of a voice disorder, with the relationship being participant-specific. Further work is needed to improve ACC-based prediction of Ps, across treatment, and during naturalistic speech production.
Collapse
Affiliation(s)
- Katherine L. Marks
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston
- Rehabilitation Science, MGH Institute of Health Professions, Boston, MA
| | - Jonathan Z. Lin
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston
| | - James A. Burns
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston
- Department of Surgery, Harvard Medical School, Boston, MA
| | - Tiffiny A. Hron
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston
- Department of Surgery, Harvard Medical School, Boston, MA
| | - Robert E. Hillman
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston
- Rehabilitation Science, MGH Institute of Health Professions, Boston, MA
- Department of Surgery, Harvard Medical School, Boston, MA
- Speech and Hearing Bioscience and Technology Program, Division of Medical Sciences, Harvard Medical School, Boston, MA
| | - Daryush D. Mehta
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston
- Rehabilitation Science, MGH Institute of Health Professions, Boston, MA
- Department of Surgery, Harvard Medical School, Boston, MA
- Speech and Hearing Bioscience and Technology Program, Division of Medical Sciences, Harvard Medical School, Boston, MA
| |
Collapse
|
21
|
The Feasibility of a Neck-Surface Accelerometer for Estimating the Amount of Acoustic Output During Phonation Regardless of the Difference in the Mouth Configuration. J Voice 2020; 36:297-308. [DOI: 10.1016/j.jvoice.2020.06.002] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2020] [Revised: 06/10/2020] [Accepted: 06/10/2020] [Indexed: 11/23/2022]
|
22
|
Van Stan JH, Mehta DD, Ortiz AJ, Burns JA, Toles LE, Marks KL, Vangel M, Hron T, Zeitels S, Hillman RE. Differences in Weeklong Ambulatory Vocal Behavior Between Female Patients With Phonotraumatic Lesions and Matched Controls. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2020; 63:372-384. [PMID: 31995428 PMCID: PMC7210443 DOI: 10.1044/2019_jslhr-19-00065] [Citation(s) in RCA: 37] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/09/2023]
Abstract
Purpose Previous work using ambulatory voice recordings has shown no differences in average vocal behavior between patients with phonotraumatic vocal hyperfunction and matched controls. This study used larger groups to replicate these results and expanded the analysis to include distributional characteristics of ambulatory voice use and measures indicative of glottal closure. Method Subjects included 180 adult women: 90 diagnosed with vocal fold nodules or polyps and 90 age-, sex-, and occupation-matched controls with no history of voice disorders. Weeklong summary statistics (average, variability, skewness, kurtosis) of voice use were computed from neck-surface acceleration recorded using an ambulatory voice monitor. Voice measures included estimates of sound pressure level (SPL), fundamental frequency (f o), cepstral peak prominence, and the difference between the first and second harmonic magnitudes (H1-H2). Results Statistical comparisons resulted in medium-large differences (Cohen's d ≥ 0.5) between groups for SPL skewness, f o variability, and H1-H2 variability. Two logistic regressions (theory-based and stepwise) found SPL skewness and H1-H2 variability to classify patients and controls based on their weekly voice data, with an area under the receiver operating characteristic curve of 0.85 and 0.82 on training and test sets, respectively. Conclusion Compared to controls, the weekly voice use of patients with phonotraumatic vocal hyperfunction reflected higher SPL tendencies (negatively skewed SPL) with more abrupt glottal closure (reduced H1-H2 variability, especially toward higher values). Further work could examine posttreatment data (e.g., after surgery and/or therapy) to determine the extent to which these differences are associated with the etiology and pathophysiology of phonotraumatic vocal fold lesions.
Collapse
Affiliation(s)
- Jarrad H. Van Stan
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| | - Daryush D. Mehta
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| | | | - James A. Burns
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| | - Laura E. Toles
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Katherine L. Marks
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Mark Vangel
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| | - Tiffiny Hron
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| | - Steven Zeitels
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| | - Robert E. Hillman
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| |
Collapse
|
23
|
Lin JZ, Espinoza VM, Marks KL, Zañartu M, Mehta DD. Improved subglottal pressure estimation from neck-surface vibration in healthy speakers producing non-modal phonation. IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING 2020; 14:449-460. [PMID: 34079612 PMCID: PMC8168553 DOI: 10.1109/jstsp.2019.2959267] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/15/2023]
Abstract
Subglottal air pressure plays a major role in voice production and is a primary factor in controlling voice onset, offset, sound pressure level, glottal airflow, vocal fold collision pressures, and variations in fundamental frequency. Previous work has shown promise for the estimation of subglottal pressure from an unobtrusive miniature accelerometer sensor attached to the anterior base of the neck during typical modal voice production across multiple pitch and vowel contexts. This study expands on that work to incorporate additional accelerometer-based measures of vocal function to compensate for non-modal phonation characteristics and achieve an improved estimation of subglottal pressure. Subjects with normal voices repeated /p/-vowel syllable strings from loud-to-soft levels in multiple vowel contexts (/ɑ/, /i/, and /u/), pitch conditions (comfortable, lower than comfortable, higher than comfortable), and voice quality types (modal, breathy, strained, and rough). Subject-specific, stepwise regression models were constructed using root-mean-square (RMS) values of the accelerometer signal alone (baseline condition) and in combination with cepstral peak prominence, fundamental frequency, and glottal airflow measures derived using subglottal impedance-based inverse filtering. Five-fold cross-validation assessed the robustness of model performance using the root-mean-square error metric for each regression model. Each cross-validation fold exhibited up to a 25% decrease in prediction error when the model incorporated multidimensional aspects of the accelerometer signal compared with RMS-only models. Improved estimation of subglottal pressure for non-modal phonation was thus achievable, lending to future studies of subglottal pressure estimation in patients with voice disorders and in ambulatory voice recordings.
Collapse
Affiliation(s)
- Jon Z Lin
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA 02114 USA
| | | | - Katherine L Marks
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA 02114 USA
| | - Matías Zañartu
- Department of Electronic Engineering, Universidad Técnica Federico Santa Maria, Valparaíso, Chile
| | - Daryush D Mehta
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital-Harvard Medical School, Boston, MA 02114 USA
| |
Collapse
|
24
|
Marks KL, Lin JZ, Fox AB, Toles LE, Mehta DD. Impact of Nonmodal Phonation on Estimates of Subglottal Pressure From Neck-Surface Acceleration in Healthy Speakers. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2019; 62:3339-3358. [PMID: 31518510 PMCID: PMC6808343 DOI: 10.1044/2019_jslhr-s-19-0067] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/15/2023]
Abstract
Purpose The purpose of this study was to evaluate the effects of nonmodal phonation on estimates of subglottal pressure (Ps) derived from the magnitude of a neck-surface accelerometer (ACC) signal and to confirm previous findings regarding the impact of vowel contexts and pitch levels in a larger cohort of participants. Method Twenty-six vocally healthy participants (18 women, 8 men) were asked to produce a series of p-vowel syllables with descending loudness in 3 vowel contexts (/a/, /i/, and /u/), 3 pitch levels (comfortable, high, and low), and 4 elicited phonatory conditions (modal, breathy, strained, and rough). Estimates of Ps for each vowel segment were obtained by averaging the intraoral air pressure plateau before and after each segment. The root-mean-square magnitude of the neck-surface ACC signal was computed for each vowel segment. Three linear mixed-effects models were used to statistically assess the effects of vowel, pitch, and phonatory condition on the linear relationship (slope and intercept) between Ps and ACC signal magnitude. Results Results demonstrated statistically significant linear relationships between ACC signal magnitude and Ps within participants but with increased intercepts for the nonmodal phonatory conditions; slopes were affected to a lesser extent. Vowel and pitch contexts did not significantly affect the linear relationship between ACC signal magnitude and Ps. Conclusion The classic linear relationship between ACC signal magnitude and Ps is significantly affected when nonmodal phonation is produced by a speaker. Future work is warranted to further characterize nonmodal phonatory characteristics to improve the ACC-based prediction of Ps during naturalistic speech production.
Collapse
Affiliation(s)
- Katherine L. Marks
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston
- PhD Program in Rehabilitation Sciences, MGH Institute of Health Professions, Boston, MA
| | - Jonathan Z. Lin
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston
| | - Annie B. Fox
- Center for Interprofessional Studies and Innovation, MGH Institute of Health Professions, Boston, MA
| | - Laura E. Toles
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston
- PhD Program in Rehabilitation Sciences, MGH Institute of Health Professions, Boston, MA
| | - Daryush D. Mehta
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston
- PhD Program in Rehabilitation Sciences, MGH Institute of Health Professions, Boston, MA
- Department of Surgery, Harvard Medical School, Boston, MA
- Speech and Hearing Bioscience and Technology Program, Division of Medical Sciences, Harvard Medical School, Boston, MA
| |
Collapse
|
25
|
Park Y, Stepp CE. Test-Retest Reliability of Relative Fundamental Frequency and Conventional Acoustic, Aerodynamic, and Perceptual Measures in Individuals With Healthy Voices. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2019; 62:1707-1718. [PMID: 31181173 PMCID: PMC6808369 DOI: 10.1044/2019_jslhr-s-18-0507] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/20/2018] [Revised: 02/14/2019] [Accepted: 02/18/2019] [Indexed: 05/31/2023]
Abstract
Purpose Recent studies have shown that an acoustic measure, relative fundamental frequency (RFF), has potential for the assessment of excessive laryngeal tension and vocal effort associated with functional and neurological voice disorders. This study presents an analysis of the test-retest reliability of RFF in individuals with healthy voices and a comparison of reliability between RFF and conventional measures of voice. Method Acoustic and aerodynamic measurements and Consensus Auditory-Perceptual Evaluation of Voice (CAPE-V) were performed on 28 individuals with healthy voices on 5 consecutive days. Participants produced RFF stimuli, a sustained /ɑ/, and a reading passage to allow for extraction of acoustic measures and CAPE-V ratings; /pa/ trains were produced to allow for extraction of aerodynamic measures. Results Moderate reliabilities (intraclass correlation coefficient [ICC] = .64-.71) were found for RFF values. Mean vocal fundamental frequency, smoothed cepstral peak prominence, shimmer, harmonics-to-noise ratio, and mean airflow rate exhibited good-to-excellent reliabilities (ICC = .76-.99). ICCs for jitter and phonation threshold pressure were moderately reliable (ICC = .67-.74). ICCs for subglottal pressure estimates and all CAPE-V parameters showed poor reliabilities (ICC = .31-.58). Conclusion RFF has comparable reliability to conventional measures of voice. This expands the potential for clinical application of RFF. Supplemental Material https://doi.org/10.23641/asha.8233376.
Collapse
Affiliation(s)
- Yeonggwang Park
- Department of Speech, Language, and Hearing Sciences, Boston University, MA
| | - Cara E. Stepp
- Department of Speech, Language, and Hearing Sciences, Boston University, MA
- Department of Biomedical Engineering, Boston University, MA
- Department of Otolaryngology–Head and Neck Surgery, Boston University School of Medicine, MA
| |
Collapse
|
26
|
McKenna VS, Diaz-Cadiz ME, Shembel AC, Enos NM, Stepp CE. The Relationship Between Physiological Mechanisms and the Self-Perception of Vocal Effort. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2019; 62:815-834. [PMID: 30969902 PMCID: PMC6802880 DOI: 10.1044/2018_jslhr-s-18-0205] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/21/2018] [Revised: 07/16/2018] [Accepted: 11/30/2018] [Indexed: 05/09/2023]
Abstract
Purpose This study aimed to examine the relationship between a large set of hypothesized physiological measures of vocal effort and self-ratings of vocal effort. Method Twenty-six healthy adults modulated speech rate and vocal effort during repetitions of the utterance /ifi/, followed by self-perceptual ratings of vocal effort on a visual analog scale. Physiological measures included (a) intrinsic laryngeal tension via kinematic stiffness ratios determined from high-speed laryngoscopy, (b) extrinsic suprahyoid and infrahyoid laryngeal tension via normalized percent activations and durations derived from surface electromyography, (c) supraglottal compression via expert visual-perceptual ratings, and (d) subglottal pressure via magnitude of neck surface vibrations from an accelerometer signal. Results Individual statistical models revealed that all of the physiological predictors, except for kinematic stiffness ratios, were significantly predictive of self-ratings of vocal effort. However, a combined regression model analysis yielded only 3 significant predictors: subglottal pressure, mediolateral supraglottal compression, and the normalized percent activation of the suprahyoid muscles (adjusted R 2 = .60). Conclusions Vocal effort manifests as increases in specific laryngeal physiological measures. Further work is needed to examine these measures in combination with other contributing factors, as well as in speakers with dysphonia.
Collapse
Affiliation(s)
| | | | - Adrianna C. Shembel
- Department of Otolaryngology–Head and Neck Surgery, New York University Langone Medical Center and Voice Center, New York
| | - Nicole M. Enos
- Department of Biomedical Engineering, Boston University, MA
- Department of Electrical and Computer Engineering, Boston University, MA
| | - Cara E. Stepp
- Department of Speech, Language, and Hearing Sciences, Boston University, MA
- Department of Biomedical Engineering, Boston University, MA
- Department of Otolaryngology–Head and Neck Surgery, Boston University School of Medicine, MA
| |
Collapse
|
27
|
McKenna VS, Llico AF, Mehta DD, Perkell JS, Stepp CE. Magnitude of Neck-Surface Vibration as an Estimate of Subglottal Pressure During Modulations of Vocal Effort and Intensity in Healthy Speakers. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2017; 60:3404-3416. [PMID: 29204608 PMCID: PMC6111520 DOI: 10.1044/2017_jslhr-s-17-0180] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/16/2017] [Accepted: 08/03/2017] [Indexed: 05/15/2023]
Abstract
PURPOSE This study examined the relationship between the magnitude of neck-surface vibration (NSVMag; transduced with an accelerometer) and intraoral estimates of subglottal pressure (P'sg) during variations in vocal effort at 3 intensity levels. METHOD Twelve vocally healthy adults produced strings of /pɑ/ syllables in 3 vocal intensity conditions, while increasing vocal effort during each condition. Measures were made of P'sg (estimated during stop-consonant closure), NSVMag (measured during the following vowel), sound pressure level, and respiratory kinematics. Mixed linear regression was used to analyze the relationship between NSVMag and P'sg with respect to total lung volume excursion, levels of lung volume initiation and termination, airflow, laryngeal resistance, and vocal efficiency across intensity conditions. RESULTS NSVMag was significantly related to P'sg (p < .001), and there was a significant, although small, interaction between NSVMag and intensity condition. Total lung excursion was the only additional variable contributing to predicting the NSVMag-P'sg relationship. CONCLUSIONS NSVMag closely reflects P'sg during variations of vocal effort; however, the relationship changes across different intensities in some individuals. Future research should explore additional NSV-based measures (e.g., glottal airflow features) to improve estimation accuracy during voice production.
Collapse
Affiliation(s)
| | - Andres F. Llico
- Department of Speech, Language, and Hearing Sciences, Boston University, MA
| | - Daryush D. Mehta
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston
- Department of Surgery, Harvard Medical School, Boston, MA
- Department of Communication Sciences and Disorders, MGH Institute of Health Professions, Charlestown, MA
| | - Joseph S. Perkell
- Department of Speech, Language, and Hearing Sciences, Boston University, MA
| | - Cara E. Stepp
- Department of Speech, Language, and Hearing Sciences, Boston University, MA
- Department of Biomedical Engineering, Boston University, MA
- Department of Otolaryngology–Head and Neck Surgery, Boston University School of Medicine, MA
| |
Collapse
|
28
|
Van Stan JH, Park SW, Jarvis M, Mehta DD, Hillman RE, Sternad D. Measuring vocal motor skill with a virtual voice-controlled slingshot. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2017; 142:1199. [PMID: 28964079 PMCID: PMC5648563 DOI: 10.1121/1.5000233] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/09/2017] [Revised: 08/08/2017] [Accepted: 08/09/2017] [Indexed: 05/30/2023]
Abstract
Successful voice training (e.g., singing lessons) and vocal rehabilitation (e.g., therapy for a voice disorder) involve learning complex, vocal behaviors. However, there are no metrics describing how humans learn new vocal skills or predicting how long the improved behavior will persist post-therapy. To develop measures capable of describing and predicting vocal motor learning, a theory-based paradigm from limb motor control inspired the development of a virtual task where subjects throw projectiles at a target via modifications in vocal pitch and loudness. Ten subjects with healthy voices practiced this complex vocal task for five days. The many-to-one mapping between the execution variables pitch and loudness and resulting target error was evaluated using an analysis that quantified distributional properties of variability: Tolerance, noise, covariation costs (TNC costs). Lag-1 autocorrelation (AC1) and detrended-fluctuation-analysis scaling index (SCI) analyzed temporal aspects of variability. Vocal data replicated limb-based findings: TNC costs were positively correlated with error; AC1 and SCI were modulated in relation to the task's solution manifold. The data suggests that vocal and limb motor learning are similar in how the learner navigates the solution space. Future work calls for investigating the game's potential to improve voice disorder diagnosis and treatment.
Collapse
Affiliation(s)
- Jarrad H Van Stan
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, Massachusetts 02114, USA
| | - Se-Woong Park
- Department of Biology, Northeastern University, Boston, Massachusetts 02115, USA
| | | | - Daryush D Mehta
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, Massachusetts 02114, USA
| | - Robert E Hillman
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, Massachusetts 02114, USA
| | - Dagmar Sternad
- Departments of Biology, Electrical and Computer Engineering, and Physics, Northeastern University, Boston, Massachusetts 02115, USA
| |
Collapse
|
29
|
Van Stan JH, Mehta DD, Hillman RE. Recent Innovations in Voice Assessment Expected to Impact the Clinical Management of Voice Disorders. ACTA ACUST UNITED AC 2017. [DOI: 10.1044/persp2.sig3.4] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
This article provides a summary of some recent innovations in voice assessment expected to have an impact in the next 5–10 years on how patients with voice disorders are clinically managed by speech-language pathologists. Specific innovations discussed are in the areas of laryngeal imaging, ambulatory voice monitoring, and “big data” analysis using machine learning to produce new metrics for vocal health. Also discussed is the potential for using voice analysis to detect and monitor other health conditions.
Collapse
Affiliation(s)
- Jarrad H. Van Stan
- Center for Laryngeal Surgery & Voice Rehabilitation, Massachusetts General Hospital Communication Sciences and Disorders, Massachusetts General Hospital
Boston, MA
- Institute of Health Professions
Charlestown, MA
| | - Daryush D. Mehta
- Center for Laryngeal Surgery & Voice Rehabilitation, Massachusetts General Hospital Communication Sciences and Disorders, Massachusetts General Hospital
Boston, MA
- Department of Surgery, Harvard Medical School
Boston, MA
- Communication Sciences and Disorders, MGH Institute of Health Professions
Charlestown, MA
| | - Robert E. Hillman
- Center for Laryngeal Surgery & Voice Rehabilitation, Massachusetts General Hospital Communication Sciences and Disorders, Massachusetts General Hospital
Boston, MA
- Department of Surgery, Harvard Medical School
Boston, MA
- Communication Sciences and Disorders, MGH Institute of Health Professions
Charlestown, MA
| |
Collapse
|