1
|
Apfelbach CS, Sandage M, Abbott KV. Vocal Fold Vibratory Dose and Perceived Exertion Responses to the Fluid Interval Test for Voice (FIT-V): A Novel Vocal Loading Task Platform. J Voice 2024:S0892-1997(24)00263-7. [PMID: 39244385 DOI: 10.1016/j.jvoice.2024.08.017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2024] [Revised: 08/13/2024] [Accepted: 08/14/2024] [Indexed: 09/09/2024]
Abstract
INTRODUCTION Vocal loading tasks are often used to study how specific variables influence downstream vocal effort or fatigue. The current study introduces a novel vocal loading task, the Fluid Interval Test for Voice (FIT-V) that combines laryngeal diadochokinesis, intervallic rest and exercise, and fluid back pressure. To assess the differences between this novel task and existing vocal loading tasks, we measured vocal fold vibratory dose and perceived phonatory exertion across three 30-minute task conditions: a control loud oral reading task (LOR), a back pressure-resisted Fluid Interval Test for Voice (FIT-V5), and an unresisted Fluid Interval Test for Voice (FIT-V0). METHODS Following a training session, 30 participants completed the three tasks (LOR, FIT-V0, and FIT-V5) in counterbalanced order on different days. Vocal fold cycle (Aim 1A) and distance doses (Aim 1B) were collected continuously using a dosimeter. The overall perceived phonatory exertion (Aim 2A) was rated once per minute, while anatomically localized cognitive, laryngeal, thoracic, abdominal, and articulatory exertion (Aim 2B) were rated once at the end of each task. Statistical analyses examined the effects of task and task× time interactions on the dependent variables. RESULTS The FIT-V0 and FIT-V5 tasks imposed smaller vocal fold cycle doses (P < 0.001; Aim 1A) and distance doses (P < 0.001; Aim 1B) than the LOR task while eliciting comparable or greater magnitudes of overall perceived phonatory exertion (P = 0.003; Aim 2A). However, anatomically localized perceived exertion was greater in the abdomen and thorax (P < 0.05) and lesser in the anterior neck (P < 0.05) in the two FIT-V tasks relative to the LOR task (Aim 2B). Absolute forced vital capacity was a significant predictor of most forms of anatomically localized perceived exertion. DISCUSSION The unresisted FIT-V0 and the back pressure-resisted FIT-V5 tasks yielded similar profiles of overall perceived phonatory exertion, despite vocal fold vibratory doses roughly half that of the LOR task. However, the anatomically localized perceived exertion data underscored subtle across-task differences not apparent in the overall ratings, suggesting potentially distinct physiological and perceptual niches for the three task protocols.
Collapse
Affiliation(s)
- Christopher S Apfelbach
- Department of Speech-Language-Hearing Sciences, University of Minnesota, Minneapolis, Minnesota.
| | - Mary Sandage
- Department of Speech, Language & Hearing Sciences, Auburn University, Auburn, Alabama
| | | |
Collapse
|
2
|
Echternach M, Köberlein M, Döllinger M, Kirsch J, Pilsl T. Does forced whisper have an impact on voice parameters? Eur Arch Otorhinolaryngol 2024; 281:4873-4880. [PMID: 38709324 PMCID: PMC11393054 DOI: 10.1007/s00405-024-08698-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2024] [Accepted: 04/18/2024] [Indexed: 05/07/2024]
Abstract
OBJECTIVES There has been the assumption that whispering may impact vocal function, leading to the widespread recommendation against its practice after phonosurgery. However, the extent to which whispering affects vocal function and vocal fold oscillation patterns remains unclear. METHODS 10 vocally healthy subjects (5 male, 5 female) were instructed to forcefully whisper a standardized text for 10 min at a sound level of 70 dB(A), measured at a microphone distance of 30 cm to the mouth. Prior to and following the whisper loading, the dysphonia severity index was assessed. Simultaneously, recordings of high speed videolaryngoscopy (HSV), electroglottography, and audio signals during sustained phonation on the vowel /i/ (250 Hz for females and 125 Hz for males) were analyzed after segmentation of the HSV material. RESULTS The pre-post analysis revealed only minor changes after the intervention. These changes included a rise in minimum intensity, an increase in the glottal area waveform-derived open quotient, and the glottal gap index. However, no statistically significant changes were observed in the harmonic-to-noise-ratio, the glottal- to-noise-excitation-ratio, and the electroglottographic open quotient. CONCLUSION Overall, the study suggests that there are only small effects on vocal function in consequence of a forced whisper loading.
Collapse
Affiliation(s)
- Matthias Echternach
- Division Phoniatrics and Pediatric Audiology, Department of Otolaryngology, Munich University Hospital and Faculty of Medicine, Munich University (LMU), Campus Großhadern, Marchioninistraße 15, 81377, Munich, Germany.
| | - Marie Köberlein
- Division Phoniatrics and Pediatric Audiology, Department of Otolaryngology, Munich University Hospital and Faculty of Medicine, Munich University (LMU), Campus Großhadern, Marchioninistraße 15, 81377, Munich, Germany
| | - Michael Döllinger
- Division of Phoniatrics and Pediatric Audiology, Department of Otolaryngology Head & Neck Surgery, University Hospital Erlangen, FAU Erlangen-Nuremberg, Erlangen, Germany
| | - Jonas Kirsch
- Division Phoniatrics and Pediatric Audiology, Department of Otolaryngology, Munich University Hospital and Faculty of Medicine, Munich University (LMU), Campus Großhadern, Marchioninistraße 15, 81377, Munich, Germany
| | - Theresa Pilsl
- Division Phoniatrics and Pediatric Audiology, Department of Otolaryngology, Munich University Hospital and Faculty of Medicine, Munich University (LMU), Campus Großhadern, Marchioninistraße 15, 81377, Munich, Germany
| |
Collapse
|
3
|
Hunter EJ, Berardi ML, Whitling S. A Semiautomated Protocol Towards Quantifying Vocal Effort in Relation to Vocal Performance During a Vocal Loading Task. J Voice 2024; 38:876-888. [PMID: 35168867 PMCID: PMC9372227 DOI: 10.1016/j.jvoice.2022.01.003] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2021] [Revised: 12/30/2021] [Accepted: 01/04/2022] [Indexed: 11/19/2022]
Abstract
To increase the reliability and comparability of vocal loading studies, this paper proposes the use of a standardized approach with experiments that are [1] grounded on consistent definitions of terms related to vocal fatigue (vocal effort, vocal demand, and vocal demand response), and [2] designed to reduce uncertainty and increase repeatability. In the approach, a semi-automated vocal loading task that also increases efficiencies in collecting and preparing vocal samples for analysis was used to answer the following research question: To what extent is vocal effort and vocal demand response sensitive to changes in vocal demands (ie, noise only, noise plus duration)? Results indicate that the proposed protocol design consistently induced change in both vocal effort and vocal demand response, indicating vocal fatigue. The efficacy of future vocal loading studies would be improved by adopting a more consistent methodology for quantifying vocal fatigue, thus increasing interstudy comparability of results and conclusions.
Collapse
Affiliation(s)
- Eric J Hunter
- Department of Communicative Sciences and Disorders, Michigan State University, East Lansing, Michigan.
| | | | - Susanna Whitling
- Department of Logopedics, Phoniatrics and Audiology, Lund University, Lund, Sweden
| |
Collapse
|
4
|
Zuim AF, Stewart CF, Titze IR. Vocal Dose and Vocal Demands in Contemporary Musical Theatre. J Voice 2024; 38:53-68. [PMID: 34620516 DOI: 10.1016/j.jvoice.2021.08.006] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2021] [Revised: 08/18/2021] [Accepted: 08/18/2021] [Indexed: 10/20/2022]
Abstract
OBJECTIVE To document and quantify vocal dose and student-singers' self-assessment during rehearsals for a contemporary musical theater production. METHODS/DESIGN Six student singers fastened the sensor from the KayPentax APM 3200 dosimeter to the lower neck to capture neck vibration data during their preparation for the musical Wonderland by Frank Wildhorn. Data were collected during 8-hour periods, at four different stages throughout the rehearsal process: beginning (music and choreography learning phase), middle (staging phase), and end (running the entire show/dress rehearsal phase), plus a post-production day once the production had concluded to establish a baseline vocal load. Students concurrently completed the EASE questionnaire1 after each data collection day. RESULTS The EASE score (Appendix 1) and demographics/perceptual questionnaire (Appendix 2) revealed that all subjects (three males and three females) found the singing role vocally and physically demanding but only two found the roles to be emotionally challenging. The musical score demanded a higher usage of chest register (judged perceptually) than mixed register from lead singers. All subjects' maximum fundamental frequency range exceeded the pitch range required by the score. The mean daily vibration dose (distance dose, as computed with Kay Pentax Software) of the three individual female singers' during rehearsals, 5,203 meters, was higher than the mean daily dose of the three male singers, 3,766 meters. The subjects' self-ratings on the EASE were not correlated with the distance dose. CONCLUSIONS A review of Wonderland's score and perceptual judgment of the singers' performances revealed extensive use of chest register, with belting and mix vocal strategies being the predominant stylistic choices. Students described the singing roles' vocal and physical requirements as more challenging than the character's emotional components. This pilot study provides information on the vocal dose for lead and ensemble singers in rehearsal for a Contemporary Musical Theatre production. Singers and voice professionals may find dosimetry a valuable tool for monitoring the vocal dose during rehearsals and performances.
Collapse
Affiliation(s)
- Ana Flavia Zuim
- Steinhardt School, Music and Performing Arts Professions, New York University, New York City, New York.
| | - Celia F Stewart
- Steinhardt School, Department of Communicative Sciences and Disorders, New York University, New York City, New York
| | - Ingo R Titze
- National Center for Voice and Speech, The University of Utah, Salt Lake City, Utah
| |
Collapse
|
5
|
Lester-Smith RA, Derrick E, Larson CR. Characterization of Source-Filter Interactions in Vocal Vibrato Using a Neck-Surface Vibration Sensor: A Pilot Study. J Voice 2024; 38:1-9. [PMID: 34649740 PMCID: PMC8995401 DOI: 10.1016/j.jvoice.2021.08.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2021] [Revised: 08/18/2021] [Accepted: 08/23/2021] [Indexed: 11/23/2022]
Abstract
PURPOSE Vocal vibrato is a singing technique that involves periodic modulation of fundamental frequency (fo) and intensity. The physiological sources of modulation within the speech mechanism and the interactions between the laryngeal source and vocal tract filter in vibrato are not fully understood. Therefore, the purpose of this study was to determine if differences in the rate and extent of fo and intensity modulation could be captured using simultaneously recorded signals from a neck-surface vibration sensor and a microphone, which represent features of the source before and after supraglottal vocal tract filtering. METHOD Nine classically-trained singers produced sustained vowels with vibrato while simultaneous signals were recorded using a vibration sensor and a microphone. Acoustical analyses were performed to measure the rate and extent of fo and intensity modulation for each trial. Paired-samples sign tests were used to analyze differences between the rate and extent of fo and intensity modulation in the vibration sensor and microphone signals. RESULTS The rate and extent of fo modulation and the extent of intensity modulation were equivalent in the vibration sensor and microphone signals, but the rate of intensity modulation was significantly higher in the microphone signal than in the vibration sensor signal. Larger differences in the rate of intensity modulation were seen with vowels that typically have smaller differences between the first and second formant frequencies. CONCLUSIONS This study demonstrated that the rate of intensity modulation at the source prior to supraglottal vocal tract filtering, as measured in neck-surface vibration sensor signals, was lower than the rate of intensity modulation after supraglottal vocal tract filtering, as measured in microphone signals. The difference in rate varied based on the vowel. These findings provide further support of the resonance-harmonics interaction in vocal vibrato. Further investigation is warranted to determine if differences in the physiological source(s) of vibrato account for inconsistent relationships between the extent of intensity modulation in neck-surface vibration sensor and microphone signals.
Collapse
Affiliation(s)
- Rosemary A Lester-Smith
- Department of Physical Medicine & Rehabilitation, Feinberg School of Medicine, Northwestern University, Chicago, Illinois.
| | - Elaina Derrick
- Department of Speech, Language and Hearing Sciences, Moody College of Communication, The University of Texas at Austin, Austin, Texas
| | - Charles R Larson
- Department of Communication Sciences and Disorders, Northwestern University, Evanston, Illinois
| |
Collapse
|
6
|
Schloneger M, Hunter EJ, Maxfield L. Quantifying Vocal Repertoire Tessituras Through Real-Time Measures. J Voice 2024; 38:247.e11-247.e25. [PMID: 34315650 PMCID: PMC8786980 DOI: 10.1016/j.jvoice.2021.06.019] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2021] [Revised: 06/10/2021] [Accepted: 06/15/2021] [Indexed: 01/27/2023]
Abstract
INTRODUCTION Voice teachers use anecdotal evidence and experience in determining the appropriateness of repertoire for each student's development. Tessitura is important in that determination, but until recently a straightforward, repeatable, and quantifiable method for determining tessitura has not existed. However, technology exists to provide the means to estimate the tessituras of standard vocal repertoire by measuring sung pitch (fundamental frequency) and vocal dose (amount of phonation) in real-time performance. PURPOSE The purpose of this study was to examine the combined use of tessituragrams, Voice Range Profiles (VRPs), a singer's self-perception of a performance, and expert listeners perception of a performance towards the goal of a more systematic way of selecting appropriate voice repertoire for singers. The following research questions guided this investigation: 1) How do Performance Range Profiles (PRPs, performance-based tessituragrams computed from neck skin surface vibration during singing), compare to score-based tessituragrams of the same selection in the same key? 2) How do PRPs of the same vocal score compare when performed in three different keys? 3) How do singer VRPs compare with PRPs of three performances of a score, each sung in a different key? and 4) How do singer and expert panel perceptions of the selection's "fit" in three different keys compare with the alignment of each singer's VRP to their PRPs? Four female singers and five expert voice pedagogues were enlisted to address these questions. RESULTS The distribution (histogram) of the score-based tessituragram closely matched the distribution of performance-based tessituragrams (PRP), suggesting that score-based tessituragrams have promise in accurately reflecting the performance-based tessitura of a musical vocal work. Acquired data revealed relationships of practical importance between score-derived tessituragrams, PRPs, acquired VRPs, and singer perceptions of ease in singing. These data suggest that score-based tessituragrams aligned with singer VRPs show promise in repertoire selection. However, there was only a minor relationship between expert panel perceptions of ease in singing and the acquired PRPs or singer perceptions of ease. Creation of a score-based tessituragram database could be combined with singer VRPs to assist in appropriate repertoire selection.
Collapse
Affiliation(s)
- Matthew Schloneger
- Division of Fine Arts, Friends University, 2100 W. University Avenue, Wichita, Kansas 67213.
| | - Eric J Hunter
- Dept of Communicative Sciences and Disorders, Michigan State University, 404 Wilson Rd, Room 291, Communication Arts and Sciences Building, East Lansing, MI 48824
| | - Lynn Maxfield
- Utah Center for Vocology, University of Utah, 240 S. 1500 E., Room 206, Salt Lake City, UT 84112
| |
Collapse
|
7
|
Eastman SE, Ghasemzadeh H, Van Stan J, Hillman RE, Mehta DD. Quantifying the Occupational Voice Use of Teachers. PERSPECTIVES OF THE ASHA SPECIAL INTEREST GROUPS 2023; 8:1363-1379. [PMID: 38312372 PMCID: PMC10836712 DOI: 10.1044/2023_persp-23-00084] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/06/2024]
Abstract
Purpose The teaching profession is a high-voice use occupation at elevated risk for developing voice disorders. Continued research on teachers' vocal demands is necessary to advocate for and establish vocal health programs. This study quantified ambulatory vocal dose measures for teachers during both on- and off-work periods, comparing their occupational voice use to that in other studies that have reported percent phonation ranging from 17% to 30%. Method Participants included 26 full-time, female school teachers between 23 and 55 years of age across multiple grades and subjects, including individuals with and without a voice disorder. Ambulatory voice data were collected from weeklong voice monitoring that recorded phonatory activity through anterior neck-surface vibration. Three vocal dose measures-time, cycle, and distance doses-were computed for each participant for three time periods: on-work weekdays, off-work weekdays, and off-work weekend days. Results The teachers' average percent phonation was 16.2% on-work weekdays, 8.4% off-work weekdays, and 8.0% off-work weekend days. No statistically significant differences for vocal dose measures were found between off-work weekdays and weekend days. Overall, all vocal dose measures were approximately 2 times higher during work relative to off-work time periods. Conclusions This study provides values for vocal dose measures for school teachers using ambulatory voice-monitoring technology. The vocal demands of this particular teacher sample and voice activity detection algorithm are potential factors contributing to percent phonation values on the lower end of the range reported in the literature. Future work is needed to continue to understand occupational voice use and its associated risks related to voice health, with the ultimate goal of preventing and managing voice disorders in individuals engaged in high-risk occupations.
Collapse
Affiliation(s)
- Sarah E. Eastman
- Rehabilitation Sciences Program, MGH Institute of Health Professions, Boston, MA
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston
| | - Hamzeh Ghasemzadeh
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston
- Department of Surgery, Harvard Medical School, Boston, MA
| | - Jarrad Van Stan
- Rehabilitation Sciences Program, MGH Institute of Health Professions, Boston, MA
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston
- Department of Surgery, Harvard Medical School, Boston, MA
| | - Robert E. Hillman
- Rehabilitation Sciences Program, MGH Institute of Health Professions, Boston, MA
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston
- Department of Surgery, Harvard Medical School, Boston, MA
- Speech and Hearing Bioscience and Technology Program, Division of Medical Sciences, Harvard University, Boston, MA
| | - Daryush D. Mehta
- Rehabilitation Sciences Program, MGH Institute of Health Professions, Boston, MA
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston
- Department of Surgery, Harvard Medical School, Boston, MA
- Speech and Hearing Bioscience and Technology Program, Division of Medical Sciences, Harvard University, Boston, MA
| |
Collapse
|
8
|
van Mersbergen M, Marchetta J, Foti D, Pillow E, Dasgupta A, Cain C, Morvant S. Comparison of Aerosol Emissions during Specific Speech Tasks. J Voice 2023:S0892-1997(23)00156-X. [PMID: 37423794 DOI: 10.1016/j.jvoice.2023.05.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2023] [Revised: 05/09/2023] [Accepted: 05/09/2023] [Indexed: 07/11/2023]
Abstract
OBJECTIVES/HYPOTHESIS Recent investigations into the behavior of aerosolized emissions from the oral cavity have shown that particulate emissions do indeed occur during speech. To date, there is little information about the relative contribution of different speech sounds in producing particle emissions in a free field. This study compares airborne aerosol generation in participants producing isolated speech sounds: fricative consonants, plosive consonants, and vowel sounds. STUDY DESIGN Prospective, reversal experimental design, where each participant served as their own control and all participants were exposed to all stimuli. METHODS While participants produced isolated speech tasks, a planar beam of laser light, a high-speed camera, and image software calculated the number of particulates detected over time. This study compared airborne aerosols emitted by human participants at a distance of 2.54 cm between the laser sheet and the mouth. RESULTS Statistically significant increases in particulate count over ambient dust distribution for all speech sounds. When collapsed across loudness levels, emitted particles in vowel sounds were statistically greater than consonants, suggesting that mouth opening, as opposed to the place of vocal tract constriction or manner of sound production, might also be influential in the degree to which particulates become aerosolized during speech. CONCLUSIONS The results of this research will inform boundary conditions for computational models of aerosolized particulates during speech.
Collapse
Affiliation(s)
- Miriam van Mersbergen
- School of Communication Sciences and Disorders, University of Memphis, Memphis, Tennessee; Department of Otolaryngology, Head and Neck Surgery, University of Tennessee Health Sciences Center, Memphis, Tennessee; Institute for Intelligent Systems, University of Memphis, Tennessee.
| | - Jeffrey Marchetta
- Department of Mechanical Engineering, University of Memphis, Memphis, Tennessee
| | - Daniel Foti
- Department of Mechanical Engineering, University of Memphis, Memphis, Tennessee
| | - Eric Pillow
- Department of Mechanical Engineering, University of Memphis, Memphis, Tennessee
| | - Apratim Dasgupta
- Department of Mechanical Engineering, University of Memphis, Memphis, Tennessee
| | - Chandler Cain
- Department of Mechanical Engineering, University of Memphis, Memphis, Tennessee
| | - Stephen Morvant
- Department of Otolaryngology, Head and Neck Surgery, University of Tennessee Health Sciences Center, Memphis, Tennessee
| |
Collapse
|
9
|
Bottalico P, Nudelman CJ. Do-It-Yourself Voice Dosimeter Device: A Tutorial and Performance Results. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2023:1-15. [PMID: 37263017 DOI: 10.1044/2023_jslhr-23-00060] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]
Abstract
PURPOSE Voice dosimeters gather voice production data in the daily lives of individuals with voice disorders. Additionally, voice dosimeters aid in understanding the pathophysiology of voice disorders. Previously, several voice dosimeters were commercially available. However, these devices have been discontinued and are not available to clinicians and researchers alike. In this tutorial, instructions for a low-cost, easy-to-assemble voice dosimeter are provided. This do-it-yourself (DIY) voice dosimeter is further validated based on performance results. METHOD Ten vocally healthy participants wore the DIY voice dosimeter. They produced a sustained /a/ vowel and read a text with three different vocal efforts. These tasks were recorded by the DIY voice dosimeter and a reference microphone simultaneously. The expanded uncertainty of the mean error in the estimation of four voice acoustic parameters as measured by the DIY dosimeter was performed by comparing the signals acquired through the reference microphone and the dosimeter. RESULTS For measures of sound pressure level, the DIY voice dosimeter had a mean error of -0.68 dB with an uncertainty of 0.56 dB. For fundamental frequency, the mean error was 1.56 Hz for female participants and 1.11 Hz for male participants, with an uncertainty of 0.62 Hz and 0.34 Hz for female and male participants, respectively. Cepstral peak prominence smoothed and L1 minus L2 had mean errors (uncertainty) of -0.06 dB (0.27 dB) and 2.20 dB (0.72 dB). CONCLUSION The mean error and uncertainties for the DIY voice dosimeter are comparable to those for the most accurate voice dosimeters that were previously on the market.
Collapse
Affiliation(s)
- Pasquale Bottalico
- Department of Speech and Hearing Science, University of Illinois at Urbana-Champaign
| | - Charles J Nudelman
- Department of Speech and Hearing Science, University of Illinois at Urbana-Champaign
| |
Collapse
|
10
|
Nudelman CJ, Codino J, Fry AC, Bottalico P, Rubin AD. Voice Biofeedback via Bone Conduction Headphones: Effects on Acoustic Voice Parameters and Self-Reported Vocal Effort in Individuals With Voice Disorders. J Voice 2022:S0892-1997(22)00320-4. [PMID: 36372674 DOI: 10.1016/j.jvoice.2022.10.014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2022] [Revised: 10/17/2022] [Accepted: 10/18/2022] [Indexed: 11/13/2022]
Abstract
PURPOSE This study explores sidetone amplification (amplified playback of one's own voice) provided via bone conduction in participants with voice disorders. The effects of bone conduction feedback on acoustic voice parameters and vocal effort ratings are examined. METHODS Speech samples of 47 participants with voice disorders were recorded in three auditory feedback conditions: two with sidetone amplification delivered via bone conduction and one condition with no alteration of the feedback. After each task, the participants rated their vocal effort on a visual analog scale. The voice recordings were evaluated by a speech-language pathologist through the GRBAS scale and processed to calculate the within-participant centered sound pressure level (SPL) values, the mean pitch strength (PS), the time dose (Dt%), and cepstral peak prominence smoothed (CPPS). The effects of the feedback conditions on these acoustic parameters and vocal effort ratings were analyzed. RESULTS The high sidetone amplification condition resulted in a statistically significant decrease in the within-participant centered SPL values and mean pitch strength across all participants. The feedback conditions had no statistically significant effects on the vocal effort ratings, time dose (Dt%), or CPPS. CONCLUSIONS This study provides an evidence that bone conduction sidetone amplification contributes to a consistent adaptation in the within-participant centered SPL values (ΔSPL) in patients with vocal hyperfunction, glottal insufficiency, and organic/neurological laryngeal pathologies compared to conditions with no feedback.
Collapse
Affiliation(s)
- Charles J Nudelman
- Department of Speech and Hearing Science, University of Illinois Urbana-Champaign, Champaign, Illinois.
| | - Juliana Codino
- Lakeshore Professional Voice Center, Lakeshore Ear, Nose, and Throat Center, St. Clair Shores, Michigan
| | - Adam C Fry
- Lakeshore Professional Voice Center, Lakeshore Ear, Nose, and Throat Center, St. Clair Shores, Michigan
| | - Pasquale Bottalico
- Department of Speech and Hearing Science, University of Illinois Urbana-Champaign, Champaign, Illinois
| | - Adam D Rubin
- Lakeshore Professional Voice Center, Lakeshore Ear, Nose, and Throat Center, St. Clair Shores, Michigan
| |
Collapse
|
11
|
Cortés JP, Lin JZ, Marks KL, Espinoza VM, Ibarra EJ, Zañartu M, Hillman RE, Mehta DD. Ambulatory Monitoring of Subglottal Pressure Estimated from Neck-Surface Vibration in Individuals with and without Voice Disorders. APPLIED SCIENCES (BASEL, SWITZERLAND) 2022; 12:10692. [PMID: 36777332 PMCID: PMC9910342 DOI: 10.3390/app122110692] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/18/2023]
Abstract
The aerodynamic voice assessment of subglottal air pressure can discriminate between speakers with typical voices from patients with voice disorders, with further evidence validating subglottal pressure as a clinical outcome measure. Although estimating subglottal pressure during phonation is an important component of a standard voice assessment, current methods for estimating subglottal pressure rely on non-natural speech tasks in a clinical or laboratory setting. This study reports on the validation of a method for subglottal pressure estimation in individuals with and without voice disorders that can be translated to connected speech to enable the monitoring of vocal function and behavior in real-world settings. During a laboratory calibration session, a participant-specific multiple regression model was derived to estimate subglottal pressure from a neck-surface vibration signal that can be recorded during natural speech production. The model was derived for vocally typical individuals and patients diagnosed with phonotraumatic vocal fold lesions, primary muscle tension dysphonia, and unilateral vocal fold paralysis. Estimates of subglottal pressure using the developed method exhibited significantly lower error than alternative methods in the literature, with average errors ranging from 1.13 to 2.08 cm H2O for the participant groups. The model was then applied during activities of daily living, thus yielding ambulatory estimates of subglottal pressure for the first time in these populations. Results point to the feasibility and potential of real-time monitoring of subglottal pressure during an individual's daily life for the prevention, assessment, and treatment of voice disorders.
Collapse
Affiliation(s)
- Juan P. Cortés
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA 02114, USA
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso 2390123, Chile
| | - Jon Z. Lin
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA 02114, USA
| | - Katherine L. Marks
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA 02114, USA
- Communication Sciences and Disorders, MGH Institute of Health Professions, Boston, MA 02129, USA
- Speech, Language & Hearing Sciences Department, College of Health & Rehabilitation: Sargent College, Boston University, Boston, MA 02215, USA
| | | | - Emiro J. Ibarra
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso 2390123, Chile
| | - Matías Zañartu
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso 2390123, Chile
| | - Robert E. Hillman
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA 02114, USA
- Communication Sciences and Disorders, MGH Institute of Health Professions, Boston, MA 02129, USA
- Department of Surgery, Massachusetts General Hospital–Harvard Medical School, Boston, MA 02114, USA
- Speech and Hearing Bioscience and Technology, Division of Medical Sciences, Harvard Medical School, Boston, MA 02115, USA
| | - Daryush D. Mehta
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA 02114, USA
- Communication Sciences and Disorders, MGH Institute of Health Professions, Boston, MA 02129, USA
- Department of Surgery, Massachusetts General Hospital–Harvard Medical School, Boston, MA 02114, USA
- Speech and Hearing Bioscience and Technology, Division of Medical Sciences, Harvard Medical School, Boston, MA 02115, USA
| |
Collapse
|
12
|
Llorente-Ortega M, Podhorski A, Fernandez S. Introducing a New Dosimeter for the Assessment and Monitoring of Vocal Risk Situations and Voice Disorders. J Voice 2022:S0892-1997(22)00239-9. [PMID: 36210222 DOI: 10.1016/j.jvoice.2022.08.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2022] [Revised: 08/01/2022] [Accepted: 08/03/2022] [Indexed: 11/06/2022]
Abstract
PURPOSE There are many physiological parameters recorded by devices that are becoming more affordable, precise and accurate. However, the lack of development in the recording of voice parameters from the physiological or medical point of view is striking, given that it is a fundamental tool for the work of many people and given the high incidence and prevalence of voice pathologies that affect people's communication. In this paper we perform a complete literature review on the dosimeters used in voice research and to present a prototype dosimeter with a pilot study to show its capabilities. METHOD We conducted a literature review using the keywords [MONITORING], [PHONATION], [ACCUMULATOR], [PORTABLE], [DOSIMETRY], [VOICE] searching in PubMed, Trip Database, HONcode, and SciELO search engines. From our review of dosimeter designs, we created our own prototype consisting of two main components: a Knowles Electronics BU-7135-0000 accelerometer mounted on a neck brace; and the ultra-low power MSP430FR5994 microcontroller. The selected sampling frequency was 2048 Hz. The device calculates the F0 every 250 ms and the amplitude and phonation activity every 31.25 ms. A pilot study was conducted using 2 subjects: one male during 11 days and one female during 14 days. RESULTS This work includes devices that have been created during the last 45 years as tools for the diagnosis and monitoring of the treatment of cases of vocal pathology and for the detection of phonatory patterns or risk situations for developing voice disorders or vocal pathologies. We also present recordings with our new device on the pattern of daily talk time, the fundamental frequency and the relative intensity of two subjects on different days. CONCLUSIONS Interesting work has been done in the development of voice dosimeters with different approaches. In our experience it is not possible to access them for research and they are not yet in clinical use. It is possible that a joint approach with voice and voice disorders professionals and engineers working closely together could take advantage of current technology to develop a fully portable, useful, and efficient system.
Collapse
Affiliation(s)
| | - Adam Podhorski
- Biomedical Engineering and Sciences Department, TECNUN, School of Engineering, University of Navarra, San Sebastian, Spain
| | - Secundino Fernandez
- Medical Engineering Laboratory, School of Medicine, University of Navarra, Spain; Voice Laboratory, Department of Otorhinolaryngology, School of Medicine. University of Navarra, Spain
| |
Collapse
|
13
|
Lee S, Roh H, Kim J, Chung S, Seo D, Moon W, Cho K. An Electret-Powered Skin-Attachable Auditory Sensor that Functions in Harsh Acoustic Environments. ADVANCED MATERIALS (DEERFIELD BEACH, FLA.) 2022; 34:e2205537. [PMID: 35973438 DOI: 10.1002/adma.202205537] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/17/2022] [Revised: 07/31/2022] [Indexed: 06/15/2023]
Abstract
Auditory sensors have shortcomings with respect to not only personalization with wearability and portability but also detecting a human voice clearly in a noisy environment or when a mask covers the mouth. In this work, an electret-powered and hole-patterned polymer diaphragm is exploited into a skin-attachable auditory sensor. The optimized charged electret diaphragm induces a voltage bias of >400 V against the counter electrode, which reduces the necessity of a bulky power source and enables the capacitive sensor to show high sensitivity (2.2 V Pa-1 ) with incorporation of an elastomer nanodroplet seismic mass. The sophisticated capacitive structure with low mechanical damping enables a flat frequency response (80-3000 Hz) and good linearity (50-80 dBSPL ). The hole-patterned electret diaphragms help the skin-attachable sensor detect only neck-skin vibration rather than dynamic air pressure, enabling a person's voice to be detected in a harsh acoustic environment. The sensor operates reliably even in the presence of surrounding noise and when the user is wearing a gas mask. Therefore, the sensor shows strong potential of a communication tool for disaster response and quarantine activities, and of diagnosis tool for vocal healthcare applications such as cough monitoring and voice dosimetry.
Collapse
Affiliation(s)
- Siyoung Lee
- Department of Chemical Engineering, Pohang University of Science and Technology, Pohang, 37673, Korea
| | - Hajung Roh
- Department of Chemical Engineering, Pohang University of Science and Technology, Pohang, 37673, Korea
| | - Junsoo Kim
- Department of Mechanical Engineering, Pohang University of Science and Technology, Pohang, 37673, Korea
| | - Sein Chung
- Department of Chemical Engineering, Pohang University of Science and Technology, Pohang, 37673, Korea
| | - Donghwan Seo
- Department of Mechanical Engineering, Pohang University of Science and Technology, Pohang, 37673, Korea
| | - Wonkyu Moon
- Department of Mechanical Engineering, Pohang University of Science and Technology, Pohang, 37673, Korea
| | - Kilwon Cho
- Department of Chemical Engineering, Pohang University of Science and Technology, Pohang, 37673, Korea
| |
Collapse
|
14
|
Włodarczak M, Ludusan B, Sundberg J, Heldner M. Classification of voice quality using neck-surface acceleration: Comparison with glottal flow and radiated sound. J Voice 2022:S0892-1997(22)00198-9. [PMID: 36028369 DOI: 10.1016/j.jvoice.2022.06.034] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2022] [Revised: 06/30/2022] [Accepted: 06/30/2022] [Indexed: 10/15/2022]
Abstract
OBJECTIVES The aim of the present study is to investigate the usefulness of features extracted from miniature accelerometers attached to speaker's tracheal wall below the glottis for classification of phonation type. The performance of the accelerometer features is evaluated relative to features obtained from inverse filtered and radiated sound. While the former is a good proxy for the voice source, obtaining robust voice source features from the latter is considered difficult since it also contains information about the vocal tract filter. By contrast, the accelerometer signal is largely unaffected by the vocal tract and although it is shaped by subglottal resonances and the transfer properties of the neck tissue, these properties remain constant within a speaker. For this reason, we expect it to provide a better approximation of the voice source than the raw audio. We also investigate which aspects of the voice source are derivable from the accelerometer and microphone signals. METHODS Five trained singers (two females and three males) were recorded producing the syllable [pæ:] in three voice qualities (neutral, breathy and pressed) and at three pitch levels as determined by the participants' personal preference. Features extracted from the three signals were used for classification of phonation type using a random forest classifier. In addition, accelerometer and microphone features with highest correlation with the voice source features were identified. RESULTS The three signals showed comparable classification error rates, with considerable differences across speakers both with respect to the overall performance and the importance of individual features. The speaker-specific differences notwithstanding, variation of phonation type had consistent effects on the voice source, accelerometer and audio signals. With regard to the voice source, AQ, NAQ, L1L2 and CQ all showed a monotonic variation along the breathy - neutral - pressed continuum. Several features were also found to vary systematically in the accelerometer and audio signals: HRF, L1L2 and CPPS (both the accelerometer and the audio), as well as the sound level (for the audio). The random forest analysis revealed that all of these features were also among the most important for the classification of voice quality. CONCLUSION Both the accelerometer and the audio signals were found to discriminate between phonation types with an accuracy approaching that of the voice source. Thus, the accelerometer signal, which is largely uncontaminated by vocal tract resonances, offered no advantage over the signal collected with a normal microphone.
Collapse
Affiliation(s)
| | - Bogdan Ludusan
- Faculty of Linguistics and Literary Studies, Bielefeld University, Germany
| | - Johan Sundberg
- Department of Speech, Music and Hearing, KTH Royal Institute of Technology, Sweden
| | | |
Collapse
|
15
|
Lei Z, Martignetti L, Ridgway C, Peacock S, Sakata JT, Li-Jessen NYK. Wearable Neck Surface Accelerometers for Occupational Vocal Health Monitoring: Instrument and Analysis Validation Study. JMIR Form Res 2022; 6:e39789. [PMID: 35930317 PMCID: PMC9391979 DOI: 10.2196/39789] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2022] [Revised: 07/04/2022] [Accepted: 07/06/2022] [Indexed: 11/13/2022] Open
Abstract
BACKGROUND Neck surface accelerometer (NSA) wearable devices have been developed for voice and upper airway health monitoring. As opposed to acoustic sounds, NSA senses mechanical vibrations propagated from the vocal tract to neck skin, which are indicative of a person's voice and airway conditions. NSA signals do not carry identifiable speech information and a speaker's privacy is thus protected, which is important and necessary for continuous wearable monitoring. Our device was already tested for its durable endurance and signal processing algorithms in controlled laboratory conditions. OBJECTIVE This study aims to further evaluate both instrument and analysis validity in a group of occupational vocal users, namely, voice actors, who use their voices extensively at work in an ecologically valid setting. METHODS A total of 16 professional voice actors (age range 21-50 years; 11 females and 5 males) participated in this study. All participants were mounted with an NSA on their sternal notches during the voice acting and voice assessment sessions. The voice acting session was 4-hour long, directed by a voice director in a professional sound studio. Voice assessment sessions were conducted before, during, and 48 hours after the acting session. The assessment included phonation tasks of passage reading, sustained vowels, maximum vowel phonation, and pitch glides. Clinical acoustic metrics (eg, fundamental frequency, cepstral measures) and a vocal dose measure (ie, accumulated distance dose from acting) were computed from NSA signals. A commonly used online questionnaire (Self-Administered Voice Rating questionnaire) was also implemented to track participants' perception of vocal fatigue. RESULTS The NSA wearables stayed in place for all participants despite active body movements during the acting. The ensued body noise did not interfere with the NSA signal quality. All planned acoustic metrics were successfully derived from NSA signals and their numerical values were comparable with literature data. For a 4-hour long voice acting, the averaged distance dose was about 8354 m with no gender differences. Participants perceived vocal fatigue as early as 2 hours after the start of voice acting, with recovery 24-48 hours after the acting session. Among all acoustic metrics across phonation tasks, cepstral peak prominence and spectral tilt from the passage reading most closely mirrored trends in perceived fatigue. CONCLUSIONS The ecological validity of an in-house NSA wearable was vetted in a workplace setting. One key application of this wearable is to prompt occupational voice users when their vocal safety limits are reached for duly protection. Signal processing algorithms can thus be further developed for near real-time estimation of clinically relevant metrics, such as accumulated distance dose, cepstral peak prominence, and spectral tilt. This functionality will enable continuous self-awareness of vocal behavior and protection of vocal safety in occupational voice users.
Collapse
Affiliation(s)
- Zhengdong Lei
- School of Communication Sciences and Disorders, McGill University, Montreal, QC, Canada
| | - Lisa Martignetti
- School of Communication Sciences and Disorders, McGill University, Montreal, QC, Canada
| | - Chelsea Ridgway
- School of Medicine, University of Montreal, Quebec, QC, Canada
| | - Simon Peacock
- The Alliance of Canadian Cinema, Television and Radio Artists - Montreal, Montreal, QC, Canada
| | - Jon T Sakata
- Department of Biology, McGill University, Montreal, QC, Canada
- The Centre for Research on Brain, Language and Music, McGill University, Montreal, QC, Canada
| | - Nicole Y K Li-Jessen
- School of Communication Sciences and Disorders, McGill University, Montreal, QC, Canada
- The Centre for Research on Brain, Language and Music, McGill University, Montreal, QC, Canada
- Department of Biomedical Engineering, McGill University, Montreal, QC, Canada
- Department of Otolaryngology - Head and Neck Surgery, McGill University, Montreal, QC, Canada
- Research Institute of McGill University Health Center, Montreal, QC, Canada
| |
Collapse
|
16
|
Weerathunge HR, Tomassi NE, Stepp CE. What Can Altered Auditory Feedback Paradigms Tell Us About Vocal Motor Control in Individuals With Voice Disorders? PERSPECTIVES OF THE ASHA SPECIAL INTEREST GROUPS 2022; 7:959-976. [PMID: 37397620 PMCID: PMC10312128 DOI: 10.1044/2022_persp-21-00195] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/04/2023]
Abstract
Purpose The goal of this review article is to provide a summary of the progression of altered auditory feedback (AAF) as a method to understand the pathophysiology of voice disorders. This review article focuses on populations with voice disorders that have thus far been studied using AAF, including individuals with Parkinson's disease, cerebellar degeneration, hyperfunctional voice disorders, vocal fold paralysis, and laryngeal dystonia. Studies using AAF have found that individuals with Parkinson's disease, cerebellar degeneration, and laryngeal dystonia have hyperactive auditory feedback responses due to differing underlying causes. In persons with PD, the hyperactivity may be a compensatory mechanism for atypically weak feedforward motor control. In individuals with cerebellar degeneration and laryngeal dystonia, the reasons for hyperactivity remain unknown. Individuals with hyperfunctional voice disorders may have auditory-motor integration deficits, suggesting atypical updating of feedforward motor control. Conclusions These findings have the potential to provide critical insights to clinicians in selecting the most effective therapy techniques for individuals with voice disorders. Future collaboration between clinicians and researchers with the shared objective of improving AAF as an ecologically feasible and valid tool for clinical assessment may provide more personalized therapy targets for individuals with voice disorders.
Collapse
Affiliation(s)
- Hasini R. Weerathunge
- Department of Biomedical Engineering, Boston University, MA
- Department of Speech, Language, and Hearing Sciences, Boston University, MA
| | - Nicole E. Tomassi
- Department of Speech, Language, and Hearing Sciences, Boston University, MA
- Graduate Program for Neuroscience, Boston University, MA
| | - Cara E. Stepp
- Department of Biomedical Engineering, Boston University, MA
- Department of Speech, Language, and Hearing Sciences, Boston University, MA
- Department of Otolaryngology—Head and Neck Surgery, Boston University School of Medicine, MA
| |
Collapse
|
17
|
Nudelman CJ, Ortiz AJ, Fox AB, Mehta DD, Hillman RE, Van Stan JH. Daily Phonotrauma Index: An Objective Indicator of Large Differences in Self-Reported Vocal Status in the Daily Life of Females With Phonotraumatic Vocal Hyperfunction. AMERICAN JOURNAL OF SPEECH-LANGUAGE PATHOLOGY 2022; 31:1412-1423. [PMID: 35394805 PMCID: PMC9567307 DOI: 10.1044/2022_ajslp-21-00285] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/09/2021] [Revised: 12/08/2021] [Accepted: 02/03/2022] [Indexed: 06/03/2023]
Abstract
PURPOSE The purpose of this study is to evaluate if the Daily Phonotrauma Index (DPI) can quantitatively discriminate large differences in overall vocal status in the daily life of patients with phonotraumatic vocal hyperfunction (PVH). METHOD For 1-4 weeks, 23 females with PVH wore an ambulatory voice monitor and answered three vocal status questions (i.e., difficulty producing soft, high-pitched phonation; discomfort; and fatigue) at the beginning, at 5-hr intervals, and the end of each day. DPI values were obtained for each patient's time periods of worst and best self-rated vocal status, and data for the group were analyzed for significant changes using a linear mixed-effects regression model. RESULTS The DPI was significantly lower during periods self-rated as "best vocal status" compared to during periods self-rated as "worst vocal status" (mean difference in DPI = 0.53) with a medium-to-large effect size (Cohen's d = -0.68). CONCLUSIONS In a group of patients with phonotraumatic lesions, the DPI indicated lower potential for phonotrauma during time periods of better vocal status compared to time periods of worse vocal status. Assuming that a large portion of variance in vocal status for patients with PVH is associated with the extent to which voicing is phonotraumatic, these results support the validity of obtaining estimates of DPI for much shorter time periods (i.e., an estimate every 2 min of voicing) than previous studies (i.e., a single estimate for the entire day or week). Future work can investigate the DPI's use for in-clinic assessment/treatment and ambulatory biofeedback and can gain further insights into phonatory mechanisms that underlie DPI via comparisons with other physiologically relevant measures and computational vocal fold modeling.
Collapse
Affiliation(s)
| | | | | | - Daryush D. Mehta
- MGH Institute of Health Professions, Boston, MA
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| | - Robert E. Hillman
- MGH Institute of Health Professions, Boston, MA
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| | - Jarrad H. Van Stan
- MGH Institute of Health Professions, Boston, MA
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| |
Collapse
|
18
|
Toles LE, Ortiz AJ, Marks KL, Mehta DD, Van Stan JH, Hillman RE. Amount and Characteristics of Speaking and Singing Voice Use in Vocally Healthy Female College Student Singers During a Typical Week. J Voice 2022; 36:203-211. [PMID: 32451254 PMCID: PMC7680342 DOI: 10.1016/j.jvoice.2020.04.029] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2020] [Revised: 04/14/2020] [Accepted: 04/15/2020] [Indexed: 11/30/2022]
Abstract
OBJECTIVES Singers, college students, and females are groups known to be at an elevated risk of developing functional/hyperfunctional voice disorders; therefore, female college students majoring in vocal performance may be at an even higher risk. To mitigate this risk, it would be helpful to know the "safe limits" for voice use that would help maintain vocal health in this vulnerable group, but there is a paucity of high-quality objective information upon which to base such limits. This study employed weeklong ambulatory voice monitoring in a large group of vocally healthy female college student singers to begin providing the types of objective data that could be used to help develop improved vocal health guidelines. METHODS Participants included 64 vocally healthy females currently enrolled in a vocal performance or similar program at a college or university. An ambulatory voice monitor recorded neck-surface acceleration throughout a typical week. A singing classifier was applied to the data to separate singing from speech. Weeklong vocal dose measures and distributional characteristics for standard voice measures were computed separately for singing and speech, and for both types of phonation combined. RESULTS Participants spent 6.2% of the total monitoring time speaking and 2.1% singing (with total phonation time being 8.4%). Singing had a higher fo mode, more pitch variability, higher average sound pressure level (SPL), negatively skewed SPL distributions, lower average CPP, and higher H1-H2 values than speaking. CONCLUSIONS These results provide a basis for beginning to establish vocal health guidelines for female students enrolled in college-level vocal performance programs and for future studies of the types of voice disorders that are common in this group. Results also demonstrate the potential value that ambulatory voice monitoring may have in helping to objectively identify vocal behaviors that could contribute to voice problems in this population.
Collapse
Affiliation(s)
- Laura E. Toles
- Massachusetts General Hospital, Center for Laryngeal Surgery and Voice Rehabilitation, 1 Bowdoin Square, 11 Floor, Boston, MA 02114,MGH Institute of Health Professions, Rehabilitation Science, 36 1 Ave, Charlestown, MA 02129
| | - Andrew J. Ortiz
- Massachusetts General Hospital, Center for Laryngeal Surgery and Voice Rehabilitation, 1 Bowdoin Square, 11 Floor, Boston, MA 02114
| | - Katherine L. Marks
- Massachusetts General Hospital, Center for Laryngeal Surgery and Voice Rehabilitation, 1 Bowdoin Square, 11 Floor, Boston, MA 02114,MGH Institute of Health Professions, Rehabilitation Science, 36 1 Ave, Charlestown, MA 02129
| | - Daryush D. Mehta
- Massachusetts General Hospital, Center for Laryngeal Surgery and Voice Rehabilitation, 1 Bowdoin Square, 11 Floor, Boston, MA 02114,MGH Institute of Health Professions, Rehabilitation Science, 36 1 Ave, Charlestown, MA 02129,Harvard Medical School, Department of Surgery, 25 Shattuck St., Boston, MA 02115
| | - Jarrad H. Van Stan
- Massachusetts General Hospital, Center for Laryngeal Surgery and Voice Rehabilitation, 1 Bowdoin Square, 11 Floor, Boston, MA 02114,MGH Institute of Health Professions, Rehabilitation Science, 36 1 Ave, Charlestown, MA 02129,Harvard Medical School, Department of Surgery, 25 Shattuck St., Boston, MA 02115
| | - Robert E. Hillman
- Massachusetts General Hospital, Center for Laryngeal Surgery and Voice Rehabilitation, 1 Bowdoin Square, 11 Floor, Boston, MA 02114,MGH Institute of Health Professions, Rehabilitation Science, 36 1 Ave, Charlestown, MA 02129,Harvard Medical School, Department of Surgery, 25 Shattuck St., Boston, MA 02115
| |
Collapse
|
19
|
Groll MD, Vojtech JM, Hablani S, Mehta DD, Buckley DP, Noordzij JP, Stepp CE. Automated Relative Fundamental Frequency Algorithms for Use With Neck-Surface Accelerometer Signals. J Voice 2022; 36:156-169. [PMID: 32653267 PMCID: PMC7790853 DOI: 10.1016/j.jvoice.2020.06.001] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2020] [Accepted: 06/04/2020] [Indexed: 10/23/2022]
Abstract
OBJECTIVE Relative fundamental frequency (RFF) has been suggested as a potential acoustic measure of vocal effort. However, current clinical standards for RFF measures require time-consuming manual markings. Previous semi-automated algorithms have been developed to calculate RFF from microphone signals. The current study aimed to develop fully automated algorithms to calculate RFF from neck-surface accelerometer signals for ecological momentary assessment and ambulatory monitoring of voice. METHODS Training a set of 2646 /vowel-fricative-vowel/ utterances from 317 unique speakers, with and without voice disorders, was used to develop automated algorithms to calculate RFF values from neck-surface accelerometer signals. The algorithms first rejected utterances with poor vowel-to-noise ratios, then identified fricative locations, then used signal features to determine voicing boundary cycles, and finally calculated corresponding RFF values. These automated RFF values were compared to the clinical gold-standard of manual RFF calculated from simultaneously collected microphone signals in a novel test set of 639 utterances from 77 unique speakers. RESULTS Automated accelerometer-based RFF values resulted in an average mean bias error (MBE) across all cycles of 0.027 ST, with an MBE of 0.152 ST and -0.252 ST in the offset and onset cycles closest to the fricative, respectively. CONCLUSION All MBE values were smaller than the expected changes in RFF values following successful voice therapy, suggesting that the current algorithms could be used for ecological momentary assessment and ambulatory monitoring via neck-surface accelerometer signals.
Collapse
Affiliation(s)
- Matti D. Groll
- Department of Biomedical Engineering, Boston University, Boston, 02215, Massachusetts,Department of Speech, Language and Hearing Sciences, Boston University, Boston, 02215, Massachusetts
| | - Jennifer M. Vojtech
- Department of Biomedical Engineering, Boston University, Boston, 02215, Massachusetts,Department of Speech, Language and Hearing Sciences, Boston University, Boston, 02215, Massachusetts
| | - Surbhi Hablani
- Department of Speech, Language and Hearing Sciences, Boston University, Boston, 02215, Massachusetts
| | - Daryush D. Mehta
- Center for Laryngeal Surgery and Voice Rehabilitation and MGH Institute of Health Professions, Massachusetts General Hospital, Boston, 02114, Massachusetts,Department of Surgery, Harvard Medical School, Boston, 02144, Massachusetts,Program in Rehabilitation Sciences, MGH Institute of Health Professions, Boston, 02129, Massachusetts,Speech and Hearing Bioscience and Technology Program, Division of Medical Sciences, Harvard Medical School, Boston, 02144, Massachusetts
| | - Daniel P. Buckley
- Department of Speech, Language and Hearing Sciences, Boston University, Boston, 02215, Massachusetts,Department of Otolaryngology – Head and Neck Surgery, Boston University School of Medicine, Boston, 02118, Massachusetts
| | - J. Pieter Noordzij
- Department of Otolaryngology – Head and Neck Surgery, Boston University School of Medicine, Boston, 02118, Massachusetts
| | - Cara E. Stepp
- Department of Biomedical Engineering, Boston University, Boston, 02215, Massachusetts,Department of Speech, Language and Hearing Sciences, Boston University, Boston, 02215, Massachusetts,Department of Otolaryngology – Head and Neck Surgery, Boston University School of Medicine, Boston, 02118, Massachusetts
| |
Collapse
|
20
|
Cunsolo F, Ottaviani V, Capobianco S, Calcinoni O, Dellacà RL. Simultaneous monitoring of vocal doses and breathing patterns in professional singers. Comput Biol Med 2022; 144:105352. [DOI: 10.1016/j.compbiomed.2022.105352] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2021] [Revised: 02/22/2022] [Accepted: 02/23/2022] [Indexed: 11/28/2022]
|
21
|
Van Stan JH, Ortiz AJ, Sternad D, Mehta DD, Huo C, Hillman RE. Ambulatory Voice Biofeedback: Acquisition and Retention of Modified Daily Voice Use in Patients With Phonotraumatic Vocal Hyperfunction. AMERICAN JOURNAL OF SPEECH-LANGUAGE PATHOLOGY 2022; 31:409-418. [PMID: 34843372 PMCID: PMC9135013 DOI: 10.1044/2021_ajslp-21-00141] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/13/2021] [Revised: 08/02/2021] [Accepted: 09/01/2021] [Indexed: 06/03/2023]
Abstract
PURPOSE Voice ambulatory biofeedback (VAB) has potential to improve carryover of therapeutic voice use into daily life. Previous work in vocally healthy participants demonstrated that motor learning inspired variations to VAB produced expected differences in acquisition and retention of modified daily voice use. This proof-of-concept study was designed to evaluate whether these VAB variations have the same desired effects on acquisition and retention in patients with phonotraumatic vocal hyperfunction (PVH). METHOD Seventeen female patients with PVH wore an ambulatory voice monitor for 6 days: three baseline days, one biofeedback day, one short-term retention day, and one long-term retention day. Short- and long-term retention were 1- and 7-days postbiofeedback, respectively. Patients were block-randomized to receive one of three types of VAB: 100%, 25%, and Summary. Performance was measured in terms of adherence time below a subject-specific vocal intensity threshold. RESULTS All three types of VAB produced a biofeedback effect with 13 out of 17 patients displaying an increase in adherence time compared to baseline days. Additionally, multiple patients from each VAB group increased their adherence time during short- and/or long-term retention monitoring compared to baseline. CONCLUSIONS These findings show that VAB can be associated with acquisition and retention of desired voice use in patients with PVH. Specifically, all three feedback types improved multiple patients' performance and retention for up to 1 week after biofeedback removal. Future work can investigate the impact of incorporating VAB into voice therapy.
Collapse
Affiliation(s)
- Jarrad H. Van Stan
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| | | | | | - Daryush D. Mehta
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| | - Chuanbing Huo
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Robert E. Hillman
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| |
Collapse
|
22
|
Selosse G, Grandjean D, Ceravolo L. Influence of bodily resonances on emotional prosody perception. Front Psychol 2022; 13:1061930. [PMID: 36571062 PMCID: PMC9773097 DOI: 10.3389/fpsyg.2022.1061930] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2022] [Accepted: 11/22/2022] [Indexed: 12/13/2022] Open
Abstract
Introduction Emotional prosody is defined as suprasegmental and segmental changes in the human voice and related acoustic parameters that can inform the listener about the emotional state of the speaker. While the processing of emotional prosody is well represented in the literature, the mechanism of embodied cognition in emotional voice perception is very little studied. This study aimed to investigate the influence of induced bodily vibrations-through a vibrator placed close to the vocal cords-in the perception of emotional vocalizations. The main hypothesis was that induced body vibrations would constitute a potential interoceptive feedback that can influence the auditory perception of emotions. It was also expected that these effects would be greater for stimuli that are more ambiguous. Methods Participants were presented with emotional vocalizations expressing joy or anger which varied from low-intensity vocalizations, considered as ambiguous, to high-intensity ones, considered as non-ambiguous. Vibrations were induced simultaneously in half of the trials and expressed joy or anger congruently with the voice stimuli. Participants had to evaluate each voice stimulus using four visual analog scales (joy, anger, and surprise, sadness as control scales). Results A significant effect of the vibrations was observed on the three behavioral indexes-discrimination, confusion and accuracy-with vibrations confusing rather than facilitating vocal emotion processing. Conclusion Over all, this study brings new light on a poorly documented topic, namely the potential use of vocal cords vibrations as an interoceptive feedback allowing humans to modulate voice production and perception during social interactions.
Collapse
Affiliation(s)
- Garance Selosse
- Neuroscience of Emotion and Affective Dynamics Lab, Department of Psychology, University of Geneva, Geneva, Switzerland
- Swiss Center for Affective Sciences, University of Geneva, Geneva, Switzerland
- *Correspondence: Garance Selosse,
| | - Didier Grandjean
- Neuroscience of Emotion and Affective Dynamics Lab, Department of Psychology, University of Geneva, Geneva, Switzerland
- Swiss Center for Affective Sciences, University of Geneva, Geneva, Switzerland
| | - Leonardo Ceravolo
- Neuroscience of Emotion and Affective Dynamics Lab, Department of Psychology, University of Geneva, Geneva, Switzerland
- Swiss Center for Affective Sciences, University of Geneva, Geneva, Switzerland
| |
Collapse
|
23
|
Toles LE, Roy N, Sogg S, Marks KL, Ortiz AJ, Fox AB, Mehta DD, Hillman RE. Relationships Among Personality, Daily Speaking Voice Use, and Phonotrauma in Adult Female Singers. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2021; 64:4580-4598. [PMID: 34731578 PMCID: PMC9150681 DOI: 10.1044/2021_jslhr-21-00274] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/14/2021] [Revised: 06/21/2021] [Accepted: 07/19/2021] [Indexed: 05/12/2023]
Abstract
PURPOSE This study sought to determine whether personality traits related to extraversion and impulsivity are more strongly associated with singers with nodules compared to vocally healthy singers and to understand the relationship between personality and the types of daily speaking voice use. METHOD Weeklong ambulatory voice recordings and personality inventories were obtained for 47 female singers with nodules and 47 vocally healthy female singers. Paired t tests investigated trait differences between groups. Relationships between traits and weeklong speaking voice measures (vocal dose, sound pressure level [SPL], neck surface acceleration magnitude [NSAM], fundamental frequency, cepstral peak prominence [CPP], and the ratio of the first two harmonic magnitudes [H 1 -H 2]) were examined using pairwise Pearson r coefficients. Multiple regressions were performed to estimate voice parameters that correlated with two or more traits. RESULTS Singers with nodules scored higher on the Social Potency scale (reflecting a tendency toward social dominance) and lower on the Control scale (reflecting impulsivity) compared to the vocally healthy singers. In vocally healthy singers, vocal dose measures were positively correlated with a combination of Wellbeing (i.e., happiness) and Social Potency, mean SPL was positively correlated with Wellbeing, SPL variability was positively correlated with Social Potency and negatively with Harm Avoidance, and CPP mean was positively correlated with Wellbeing. Singers with nodules had a negative correlation between NSAM skewness and Social Potency. Both groups had negative correlations between H 1 -H 2 mean and Social Potency and Social Closeness. CONCLUSIONS Singers with nodules are more socially dominant and impulsive than vocally healthy singers. Personality traits are related to daily speaking voice use, particularly in vocally healthy singers. Individuals with higher levels of traits related to happiness and social dominance and lower Harm Avoidance tended to speak more, with higher laryngeal forces, with more SPL variability, and with more pressed glottal closure, which could increase risk of phonotrauma.
Collapse
Affiliation(s)
- Laura E. Toles
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | | | - Stephanie Sogg
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| | - Katherine L. Marks
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | | | - Annie B. Fox
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Daryush D. Mehta
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
- Harvard Medical School, Boston, MA
| | - Robert E. Hillman
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
- Harvard Medical School, Boston, MA
| |
Collapse
|
24
|
Marks KL, Verdi A, Toles LE, Stipancic KL, Ortiz AJ, Hillman RE, Mehta DD. Psychometric Analysis of an Ecological Vocal Effort Scale in Individuals With and Without Vocal Hyperfunction During Activities of Daily Living. AMERICAN JOURNAL OF SPEECH-LANGUAGE PATHOLOGY 2021; 30:2589-2604. [PMID: 34665647 PMCID: PMC9132024 DOI: 10.1044/2021_ajslp-21-00111] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/15/2021] [Revised: 06/11/2021] [Accepted: 07/07/2021] [Indexed: 05/29/2023]
Abstract
Objective The purpose of this study was to examine the psychometric properties of an ecological vocal effort scale linked to a voicing task. Method Thirty-eight patients with nodules, 18 patients with muscle tension dysphonia, and 45 vocally healthy control individuals participated in a week of ambulatory voice monitoring. A global vocal status question was asked hourly throughout the day. Participants produced a vowel-consonant-vowel syllable string and rated the vocal effort needed to produce the task on a visual analog scale. Test-retest reliability was calculated for a subset using the intraclass correlation coefficient, ICC(A, 1). Construct validity was assessed by (a) comparing the weeklong vocal effort ratings between the patient and control groups and (b) comparing weeklong vocal effort ratings before and after voice rehabilitation in a subset of 25 patients. Cohen's d, the standard error of measurement (SEM), and the minimal detectable change (MDC) assessed sensitivity. The minimal clinically important difference (MCID) assessed responsiveness. Results Test-retest reliability was excellent, ICC(A, 1) = .96. Weeklong mean effort was statistically higher in the patients than in controls (d = 1.62) and lower after voice rehabilitation (d = 1.75), supporting construct validity and sensitivity. SEM was 4.14, MDC was 11.47, and MCID was 9.74. Since the MCID was within the error of the measure, we must rely upon the MDC to detect real changes in ecological vocal effort. Conclusion The ecological vocal effort scale offers a reliable, valid, and sensitive method of monitoring vocal effort changes during the daily life of individuals with and without vocal hyperfunction.
Collapse
Affiliation(s)
- Katherine L. Marks
- MGH Institute of Health Professions, Boston, MA
- Massachusetts General Hospital, Boston
| | - Alessandra Verdi
- MGH Institute of Health Professions, Boston, MA
- Massachusetts General Hospital, Boston
| | - Laura E. Toles
- MGH Institute of Health Professions, Boston, MA
- Massachusetts General Hospital, Boston
| | - Kaila L. Stipancic
- MGH Institute of Health Professions, Boston, MA
- University at Buffalo, NY
| | - Andrew J. Ortiz
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| | - Robert E. Hillman
- MGH Institute of Health Professions, Boston, MA
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| | - Daryush D. Mehta
- MGH Institute of Health Professions, Boston, MA
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| |
Collapse
|
25
|
Song Y, Kim Y, Yun I, Jeung J, Kang J, Chung Y. Study on Optimal Position and Covering Pressure of Wearable Neck Microphone for Continuous Voice Monitoring. ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL INTERNATIONAL CONFERENCE 2021; 2021:7340-7343. [PMID: 34892793 DOI: 10.1109/embc46164.2021.9629724] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Vocal cord disorder is one of the important health problems, especially in noisy industrial sites where excessive voice is required. A convenient and reliable communication method is required in a noisy environment to prevent the related disorders. However, the signal sensitivity of previous neck microphones is still insufficient to accurately convey the voice. In this study, we developed a skin-attachable neck microphone with a lightweight and flexible form factor. Also, we optimized the attachment position and covering pressure to maximize the signal sensitivity. As a result, we obtained the optimal position near the thyroid cartilage and confirmed that the signal sensitivity is the highest when the covering pressure is approximately 4 mmHg.Clinical Relevance- People can measure the voice status using a wearable neck microphone at the optimal position and covering pressure. It provides a solution to keep the vocal cords in good health even in a noisy environment.
Collapse
|
26
|
Van Stan JH, Ortiz AJ, Marks KL, Toles LE, Mehta DD, Burns JA, Hron T, Stadelman-Cohen T, Krusemark C, Muise J, Fox AB, Nudelman C, Zeitels S, Hillman RE. Changes in the Daily Phonotrauma Index Following the Use of Voice Therapy as the Sole Treatment for Phonotraumatic Vocal Hyperfunction in Females. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2021; 64:3446-3455. [PMID: 34463536 PMCID: PMC8642084 DOI: 10.1044/2021_jslhr-21-00082] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/12/2021] [Revised: 04/26/2021] [Accepted: 05/24/2021] [Indexed: 05/31/2023]
Abstract
Purpose The aim of this study was to use the Daily Phonotrauma Index (DPI) to quantify group-based changes in the daily voice use of patients with phonotraumatic vocal hyperfunction (PVH) after receiving voice therapy as the sole treatment. This is part of an ongoing effort to validate an updated theoretical framework for PVH. Method A custom-designed ambulatory voice monitor was used to collect 1 week of pre- and posttreatment data from 52 female patients with PVH. Normative weeklong data were also obtained from 52 matched controls. Each week was represented by the DPI, which is a combination of neck-surface acceleration magnitude skewness and the standard deviation of the difference between the first and second harmonic magnitudes. Results Compared to pretreatment, the DPI statistically decreased towards normal in the patient group after treatment (Cohen's d = -0.25). The posttreatment patient group's DPI was still significantly higher than the control group (d = 0.68). Conclusions The DPI showed the pattern of improved ambulatory voice use in a group of patients with PVH following voice therapy that was predicted by the updated theoretical framework. Per the prediction, voice therapy was associated with a decreased potential for phonotrauma in daily voice use, but the posttreatment patient group data were still significantly different from the normative control group data. This posttreatment difference is interpreted as reflecting the impact on voice use of the persistence of phonotrauma-induced structural changes to the vocal folds. Further validation of the DPI is needed to better understand its potential clinical use.
Collapse
Affiliation(s)
- Jarrad H. Van Stan
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| | | | - Katherine L. Marks
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Laura E. Toles
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Daryush D. Mehta
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| | - James A. Burns
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| | - Tiffiny Hron
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| | - Tara Stadelman-Cohen
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Carol Krusemark
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Jason Muise
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | | | - Charles Nudelman
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Steven Zeitels
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| | - Robert E. Hillman
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| |
Collapse
|
27
|
Ibarra EJ, Parra JA, Alzamendi GA, Cortés JP, Espinoza VM, Mehta DD, Hillman RE, Zañartu M. Estimation of Subglottal Pressure, Vocal Fold Collision Pressure, and Intrinsic Laryngeal Muscle Activation From Neck-Surface Vibration Using a Neural Network Framework and a Voice Production Model. Front Physiol 2021; 12:732244. [PMID: 34539451 PMCID: PMC8440844 DOI: 10.3389/fphys.2021.732244] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2021] [Accepted: 08/09/2021] [Indexed: 11/23/2022] Open
Abstract
The ambulatory assessment of vocal function can be significantly enhanced by having access to physiologically based features that describe underlying pathophysiological mechanisms in individuals with voice disorders. This type of enhancement can improve methods for the prevention, diagnosis, and treatment of behaviorally based voice disorders. Unfortunately, the direct measurement of important vocal features such as subglottal pressure, vocal fold collision pressure, and laryngeal muscle activation is impractical in laboratory and ambulatory settings. In this study, we introduce a method to estimate these features during phonation from a neck-surface vibration signal through a framework that integrates a physiologically relevant model of voice production and machine learning tools. The signal from a neck-surface accelerometer is first processed using subglottal impedance-based inverse filtering to yield an estimate of the unsteady glottal airflow. Seven aerodynamic and acoustic features are extracted from the neck surface accelerometer and an optional microphone signal. A neural network architecture is selected to provide a mapping between the seven input features and subglottal pressure, vocal fold collision pressure, and cricothyroid and thyroarytenoid muscle activation. This non-linear mapping is trained solely with 13,000 Monte Carlo simulations of a voice production model that utilizes a symmetric triangular body-cover model of the vocal folds. The performance of the method was compared against laboratory data from synchronous recordings of oral airflow, intraoral pressure, microphone, and neck-surface vibration in 79 vocally healthy female participants uttering consecutive /pæ/ syllable strings at comfortable, loud, and soft levels. The mean absolute error and root-mean-square error for estimating the mean subglottal pressure were 191 Pa (1.95 cm H2O) and 243 Pa (2.48 cm H2O), respectively, which are comparable with previous studies but with the key advantage of not requiring subject-specific training and yielding more output measures. The validation of vocal fold collision pressure and laryngeal muscle activation was performed with synthetic values as reference. These initial results provide valuable insight for further vocal fold model refinement and constitute a proof of concept that the proposed machine learning method is a feasible option for providing physiologically relevant measures for laboratory and ambulatory assessment of vocal function.
Collapse
Affiliation(s)
- Emiro J. Ibarra
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso, Chile
- School of Electrical Engineering, University of the Andes, Mérida, Venezuela
| | - Jesús A. Parra
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso, Chile
| | - Gabriel A. Alzamendi
- Institute for Research and Development on Bioengineering and Bioinformatics, Consejo Nacional de Investigaciones Científicas y Técnicas - Universidad Nacional de Entre Ríos, Oro Verde, Argentina
| | - Juan P. Cortés
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso, Chile
- Center for Laryngeal Surgery and Voice Rehabilitation Laboratory, Massachusetts General Hospital–Harvard Medical School, Boston, MA, United States
| | - Víctor M. Espinoza
- Department of Sound, Faculty of Arts, University of Chile, Santiago, Chile
| | - Daryush D. Mehta
- Center for Laryngeal Surgery and Voice Rehabilitation Laboratory, Massachusetts General Hospital–Harvard Medical School, Boston, MA, United States
| | - Robert E. Hillman
- Center for Laryngeal Surgery and Voice Rehabilitation Laboratory, Massachusetts General Hospital–Harvard Medical School, Boston, MA, United States
| | - Matías Zañartu
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso, Chile
| |
Collapse
|
28
|
Mehta DD, Kobler JB, Zeitels SM, Zañartu M, Ibarra EJ, Alzamendi GA, Manriquez R, Erath BD, Peterson SD, Petrillo RH, Hillman RE. Direct measurement and modeling of intraglottal, subglottal, and vocal fold collision pressures during phonation in an individual with a hemilaryngectomy. APPLIED SCIENCES (BASEL, SWITZERLAND) 2021; 11:7256. [PMID: 36210866 PMCID: PMC9541559 DOI: 10.3390/app11167256] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
The purpose of this paper is to report on the first in vivo application of a recently developed transoral, dual-sensor pressure probe that directly measures intraglottal, subglottal, and vocal fold collision pressures during phonation. Synchronous measurement of intraglottal and subglottal pressures was accomplished using two miniature pressure sensors mounted on the end of the probe and inserted transorally in a 78-year-old male who had previously undergone surgical removal of his right vocal fold for treatment of laryngeal cancer. The endoscopist used one hand to position the custom probe against the surgically medialized scar band that replaced the right vocal fold and used the other hand to position a transoral endoscope to record laryngeal high-speed videoendoscopy of the vibrating left vocal fold contacting the pressure probe. Visualization of the larynx during sustained phonation allowed the endoscopist to place the dual-sensor pressure probe such that the proximal sensor was positioned intraglottally and the distal sensor subglottally. The proximal pressure sensor was verified to be in the strike zone of vocal fold collision during phonation when the intraglottal pressure signal exhibited three characteristics: an impulsive peak at the start of the closed phase, rounded peak during the open phase, and minimum value around zero immediately preceding the impulsive peak of the subsequent phonatory cycle. Numerical voice production modeling was applied to validate model-based predictions of vocal fold collision pressure using kinematic vocal fold measures. The results successfully demonstrated feasibility of in vivo measurement of vocal fold collision pressure in an individual with a hemilaryngectomy, motivating ongoing data collection that is designed to aid in the development of vocal dose measures that incorporate vocal fold impact collision and stresses.
Collapse
Affiliation(s)
- Daryush D. Mehta
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA, USA
- Department of Surgery, Massachusetts General Hospital–Harvard Medical School, Boston, MA
- Speech and Hearing Bioscience and Technology, Division of Medical Sciences, Harvard Medical School, Boston, MA, USA
- MGH Institute of Health Professions, Boston, MA, USA
| | - James B. Kobler
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA, USA
- Department of Surgery, Massachusetts General Hospital–Harvard Medical School, Boston, MA
- Speech and Hearing Bioscience and Technology, Division of Medical Sciences, Harvard Medical School, Boston, MA, USA
| | - Steven M. Zeitels
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA, USA
- Department of Surgery, Massachusetts General Hospital–Harvard Medical School, Boston, MA
- Speech and Hearing Bioscience and Technology, Division of Medical Sciences, Harvard Medical School, Boston, MA, USA
| | - Matías Zañartu
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso, Chile
| | - Emiro J. Ibarra
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso, Chile
| | - Gabriel A. Alzamendi
- Institute for Research and Development on Bioengineering and Bioinformatics, National University of Entre Rios–CONICET, Entre Ríos, Argentina
| | - Rodrigo Manriquez
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso, Chile
| | - Byron D. Erath
- Department of Mechanical & Aeronautical Engineering, Clarkson University, Potsdam, NY, USA
| | - Sean D. Peterson
- Department of Mechanical and Mechatronics Engineering, University of Waterloo, Ontario, Canada
| | - Robert H. Petrillo
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA, USA
| | - Robert E. Hillman
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA, USA
- Department of Surgery, Massachusetts General Hospital–Harvard Medical School, Boston, MA
- Speech and Hearing Bioscience and Technology, Division of Medical Sciences, Harvard Medical School, Boston, MA, USA
- MGH Institute of Health Professions, Boston, MA, USA
| |
Collapse
|
29
|
Hunter EJ, Berardi ML, van Mersbergen M. Relationship Between Tasked Vocal Effort Levels and Measures of Vocal Intensity. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2021; 64:1829-1840. [PMID: 34057833 PMCID: PMC8740752 DOI: 10.1044/2021_jslhr-20-00465] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/07/2020] [Revised: 01/04/2021] [Accepted: 02/19/2021] [Indexed: 06/12/2023]
Abstract
Purpose Patients with voice problems commonly report increased vocal effort, regardless of the underlying pathophysiology. Previous studies investigating vocal effort and voice production have used a range of methods to quantify vocal effort. The goals of the current study were to use the Borg CR100 effort scale to (a) demonstrate the relation between vocal intensity or vocal level (dB) and tasked vocal effort goals and (b) investigate the repeated measure reliability of vocal level at tasked effort level goals. Method Three types of speech (automatic, read, and structured spontaneous) were elicited at four vocal effort level goals on the Borg CR100 scale (2, 13, 25, and 50) from 20 participants (10 females and 10 males). Results Participants' vocal level reliably changed approximately 5 dB between the elicited effort level goals; this difference was statistically significant and repeatable. Biological females produced a voice with consistently less intensity for a vocal effort level goal compared to biological males. Conclusions The results indicate the utility of the Borg CR100 in tracking effort in voice production that is repeatable with respect to vocal level (dB). Future research will investigate other metrics of voice production with the goal of understanding the mechanisms underlying vocal effort and the external environmental influences on the perception of vocal effort.
Collapse
Affiliation(s)
- Eric J. Hunter
- Department of Communicative Sciences and Disorders, Michigan State University, East Lansing
| | - Mark L. Berardi
- Department of Communicative Sciences and Disorders, Michigan State University, East Lansing
| | | |
Collapse
|
30
|
Umatani M, Ogawa M, Hosokawa K, Kato C, Okajima E, Iwahashi T, Inohara H. Skin Acceleration Levels Estimated by a Neck-surface Accelerometer during Phonation Are Affected by The Mechanical Properties of The Anterior Cervical Skin. J Voice 2021:S0892-1997(21)00111-9. [PMID: 34011459 DOI: 10.1016/j.jvoice.2021.03.016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2021] [Revised: 03/25/2021] [Accepted: 03/30/2021] [Indexed: 10/21/2022]
Abstract
OBJECTIVES The aim of the present study was to assess whether skin acceleration levels (SAL) estimated by a neck-surface accelerometer (ACC) are affected by the anterior neck skin condition. METHODS Each of six healthy non-dysphonic participants wore a headset microphone (MIC), had an ACC sensor fitted on the skin over the cervical trachea, and were subsequently asked to gradually increase the vocal loudness during sustained phonation of the vowel /e:/ (crescendo task), while the sound pressure levels on a sound level meter (SPLSLM) and MIC/ACC signals were simultaneously recorded. Root mean squared values were calculated from the MIC and ACC signals as the sound pressure level (SPLMIC) and SAL, respectively, and the relationships between SPLSLM and SAL were compared between neck anteflexion and retroflexion or between outward and inward skin retraction on both sides of the sensor. RESULTS In the total samples for the successful crescendo performance in a natural head position, the SPLMIC and SPLSLM showed a strong linear correlation (r=0.980), whereas the correlation between the SAL and SPLSLM showed a distorted regression line (r=0.765) with individual differences. In all participants, the anteflexion and inward skin retraction decreased the SAL value at the same SPLSLM value, whereas the retroflexion increased the SAL value at the same SPLSLM value. CONCLUSION These results demonstrate that the signal intensity of a neck-surface ACC is affected by the condition of the anterior cervical skin, perhaps leading to inter-individual variability in SAL measurements.
Collapse
Affiliation(s)
- Masanori Umatani
- Department of Otorhinolaryngology - Head and Neck Surgery, Osaka University Graduate School of Medicine, Suita, Japan
| | - Makoto Ogawa
- Department of Otorhinolaryngology - Head and Neck Surgery, Osaka University Graduate School of Medicine, Suita, Japan; Department of Otorhinolaryngology, JCHO Osaka Hospital, Osaka, Japan.
| | - Kiyohito Hosokawa
- Department of Otorhinolaryngology - Head and Neck Surgery, Osaka University Graduate School of Medicine, Suita, Japan
| | - Chieri Kato
- Department of Otorhinolaryngology - Head and Neck Surgery, Osaka University Graduate School of Medicine, Suita, Japan
| | - Eri Okajima
- Department of Otorhinolaryngology - Head and Neck Surgery, Osaka University Graduate School of Medicine, Suita, Japan
| | - Toshihiko Iwahashi
- Department of Otorhinolaryngology - Head and Neck Surgery, Osaka University Graduate School of Medicine, Suita, Japan
| | - Hidenori Inohara
- Department of Otorhinolaryngology - Head and Neck Surgery, Osaka University Graduate School of Medicine, Suita, Japan
| |
Collapse
|
31
|
Van Stan JH, Ortiz AJ, Cortes JP, Marks KL, Toles LE, Mehta DD, Burns JA, Hron T, Stadelman-Cohen T, Krusemark C, Muise J, Fox-Galalis AB, Nudelman C, Zeitels S, Hillman RE. Differences in Daily Voice Use Measures Between Female Patients With Nonphonotraumatic Vocal Hyperfunction and Matched Controls. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2021; 64:1457-1470. [PMID: 33900807 PMCID: PMC8608188 DOI: 10.1044/2021_jslhr-20-00538] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/15/2023]
Abstract
Purpose The purpose of this study was to obtain a more comprehensive understanding of the pathophysiology and impact on daily voice use of nonphonotraumatic vocal hyperfunction (NPVH). Method An ambulatory voice monitor collected 1 week of data from 36 patients with NPVH and 36 vocally healthy matched controls. A subset of 11 patients with NPVH were monitored after voice therapy. Daily voice use measures included neck-skin acceleration magnitude, fundamental frequency (f o), cepstral peak prominence (CPP), and the difference between the first and second harmonic magnitudes (H1-H2). Additional comparisons included 118 patients with phonotraumatic vocal hyperfunction (PVH) and 89 additional vocally healthy controls. Results The NPVH group, compared to the matched control group, exhibited increased f o (Cohen's d = 0.6), reduced CPP (d = -0.9), and less positive H1-H2 skewness (d = -1.1). Classifiers used CPP mean and H1-H2 mode to maximally differentiate the NPVH and matched control groups (area under the receiver operating characteristic curve of 0.78). Classifiers performed well on unseen data: the logit decreased in patients with NPVH after therapy; ≥ 85% of the control and PVH groups were identified as "normal" or "not NPVH," respectively. Conclusions The NPVH group's daily voice use is less periodic (CPP), is higher pitched (f o), and has less abrupt vocal fold closure (H1-H2 skew) compared to the matched control group. The combination of CPP mean and H1-H2 mode appears to reflect a pathophysiological continuum in NPVH patients of inefficient phonation with minimal potential for phonotrauma. Further validation of the classification model is needed to better understand potential clinical uses. Supplemental Material https://doi.org/10.23641/asha.14390771.
Collapse
Affiliation(s)
- Jarrad H. Van Stan
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| | | | - Juan P. Cortes
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| | - Katherine L. Marks
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Laura E. Toles
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Daryush D. Mehta
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| | - James A. Burns
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| | - Tiffiny Hron
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| | - Tara Stadelman-Cohen
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Carol Krusemark
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Jason Muise
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | | | - Charles Nudelman
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Steven Zeitels
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| | - Robert E. Hillman
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| |
Collapse
|
32
|
Frič M, Hruška V, Dlask P. Full-field face vibration measurement in singing—Case study. Biomed Signal Process Control 2021. [DOI: 10.1016/j.bspc.2021.102427] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
33
|
Vojtech JM, Cilento DD, Luong AT, Noordzij JP, Diaz-Cadiz M, Groll MD, Buckley DP, McKenna VS, Noordzij JP, Stepp CE. Acoustic Identification of the Voicing Boundary during Intervocalic Offsets and Onsets based on Vocal Fold Vibratory Measures. APPLIED SCIENCES (BASEL, SWITZERLAND) 2021; 11:3816. [PMID: 36188437 PMCID: PMC9524108 DOI: 10.3390/app11093816] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/26/2023]
Abstract
Methods for automating relative fundamental frequency (RFF)-an acoustic estimate of laryngeal tension-rely on manual identification of voiced/unvoiced boundaries from acoustic signals. This study determined the effect of incorporating features derived from vocal fold vibratory transitions for acoustic boundary detection. Simultaneous microphone and flexible nasendoscope recordings were collected from adults with typical voices (N=69) and with voices characterized by excessive laryngeal tension (N=53) producing voiced-unvoiced-voiced utterances. Acoustic features that coincided with vocal fold vibratory transitions were identified and incorporated into an automated RFF algorithm ("aRFF-APH"). Voiced/unvoiced boundary detection accuracy was compared between the aRFF-APH algorithm, a recently published version of the automated RFF algorithm ("aRFF-AP"), and gold-standard, manual RFF estimation. Chi-square tests were performed to characterize differences in boundary cycle identification accuracy among the three RFF estimation methods. Voiced/unvoiced boundary detection accuracy significantly differed by RFF estimation method for voicing offsets and onsets. Of 7721 productions, 76.0% of boundaries were accurately identified via the aRFF-APH algorithm, compared to 70.3% with the aRFF-AP algorithm and 20.4% with manual estimation. Incorporating acoustic features that corresponded with voiced/unvoiced boundaries led to improvements in boundary detection accuracy that surpassed the gold-standard method for calculating RFF.
Collapse
Affiliation(s)
- Jennifer M. Vojtech
- Department of Biomedical Engineering, Boston University, Boston, MA 02215, USA
- Department of Speech, Language, and Hearing Sciences, Boston University, Boston, MA 02215, USA
- Delsys, Inc. and Altec, Inc., Natick, MA, 01760, USA
| | - Dante D. Cilento
- Department of Speech, Language, and Hearing Sciences, Boston University, Boston, MA 02215, USA
| | - Austin T. Luong
- Department of Speech, Language, and Hearing Sciences, Boston University, Boston, MA 02215, USA
| | - Jacob P. Noordzij
- Department of Speech, Language, and Hearing Sciences, Boston University, Boston, MA 02215, USA
| | - Manuel Diaz-Cadiz
- Department of Speech, Language, and Hearing Sciences, Boston University, Boston, MA 02215, USA
| | - Matti D. Groll
- Department of Biomedical Engineering, Boston University, Boston, MA 02215, USA
- Department of Speech, Language, and Hearing Sciences, Boston University, Boston, MA 02215, USA
| | - Daniel P. Buckley
- Department of Speech, Language, and Hearing Sciences, Boston University, Boston, MA 02215, USA
- Department of Otolaryngology – Head and Neck Surgery, Boston University School of Medicine, Boston, Massachusetts, USA
| | - Victoria S. McKenna
- Department of Speech, Language, and Hearing Sciences, Boston University, Boston, MA 02215, USA
| | - J. Pieter Noordzij
- Department of Otolaryngology – Head and Neck Surgery, Boston University School of Medicine, Boston, Massachusetts, USA
| | - Cara E. Stepp
- Department of Biomedical Engineering, Boston University, Boston, MA 02215, USA
- Department of Speech, Language, and Hearing Sciences, Boston University, Boston, MA 02215, USA
- Department of Otolaryngology – Head and Neck Surgery, Boston University School of Medicine, Boston, Massachusetts, USA
| |
Collapse
|
34
|
Umatani M, Ogawa M, Hosokawa K, Kato C, Okajima E, Iwahashi T, Inohara H. The Feasibility of Gastroesophageal Manometry for Continuously Evaluating the Degree of Expiratory Effort During Successful Crescendo Phonation. J Voice 2021; 37:470.e7-470.e16. [PMID: 33707030 DOI: 10.1016/j.jvoice.2021.02.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2020] [Revised: 01/23/2021] [Accepted: 02/01/2021] [Indexed: 10/22/2022]
Abstract
OBJECTIVES This study aimed to assess the feasibility of gastroesophageal manometry for continuously evaluating the degree of expiratory effort by measuring the pressures in the digestive tract during crescendo phonation. METHODS Each of 18 healthy nondysphonic speakers had a probe with a four-channel gastroesophageal manometer inserted through the nasal cavity to place four pressure sensors in the hypopharynx, cervical-/thoracic esophagus, and stomach, and was asked to gradually increase the vocal loudness during sustained phonation of the vowel /e:/ (vowel-crescendo task), while the sound pressure level and the pressures were simultaneously recorded. RESULTS 50% of the successful vowel-crescendo task samples with a gradual and adequate sound pressure level increase showed a concomitant gradual increase in both the intra-thoracic-esophageal/intra-gastric pressure values from approximately -5 mmHg /6 mmHg to -10 mmHg/20 mmHg, respectively. The maximum pressure value was the highest in the intra-gastric pressure followed by the intra-thoracic-esophageal and intra-cervical-esophageal pressures in order. However, most of the samples showed less than one of atypical pressure changes, such as fluctuations in the intra-thoracic-esophageal and intra-gastric pressure changes and dispersion in the intra-cervical-esophageal and intra-hypopharyngeal pressure values (perhaps due to the peristaltic motions, and the contact of the sensors to the membranous wall). CONCLUSION These results show that, during successful crescendo phonation, gastroesophageal manometry reveals a gradual increase in the intra-thoracic and intra-abdominal pressures with increasing the vocal intensity, even though showing some systematic errors, suggesting the usefulness of gastroesophageal manometry for continuously evaluating the degree of expiratory effort without influence by the laryngeal condition.
Collapse
Affiliation(s)
- Masanori Umatani
- Department of Otorhinolaryngology - Head and Neck Surgery, Osaka University Graduate School of Medicine, Suita, Japan
| | - Makoto Ogawa
- Department of Otorhinolaryngology - Head and Neck Surgery, Osaka University Graduate School of Medicine, Suita, Japan; Department of Otorhinolaryngology, JCHO Osaka Hospital, Osaka, Japan.
| | - Kiyohito Hosokawa
- Department of Otorhinolaryngology - Head and Neck Surgery, Osaka University Graduate School of Medicine, Suita, Japan
| | - Chieri Kato
- Department of Otorhinolaryngology - Head and Neck Surgery, Osaka University Graduate School of Medicine, Suita, Japan
| | - Eri Okajima
- Department of Otorhinolaryngology - Head and Neck Surgery, Osaka University Graduate School of Medicine, Suita, Japan
| | - Toshihiko Iwahashi
- Department of Otorhinolaryngology - Head and Neck Surgery, Osaka University Graduate School of Medicine, Suita, Japan
| | - Hidenori Inohara
- Department of Otorhinolaryngology - Head and Neck Surgery, Osaka University Graduate School of Medicine, Suita, Japan
| |
Collapse
|
35
|
Toles LE, Ortiz AJ, Marks KL, Burns JA, Hron T, Van Stan JH, Mehta DD, Hillman RE. Differences Between Female Singers With Phonotrauma and Vocally Healthy Matched Controls in Singing and Speaking Voice Use During 1 Week of Ambulatory Monitoring. AMERICAN JOURNAL OF SPEECH-LANGUAGE PATHOLOGY 2021; 30:199-209. [PMID: 33472007 PMCID: PMC8740583 DOI: 10.1044/2020_ajslp-20-00227] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/15/2023]
Abstract
Purpose Previous ambulatory voice monitoring studies have included many singers and have combined speech and singing in the analyses. This study applied a singing classifier to the ambulatory recordings of singers with phonotrauma and healthy controls to determine if analyzing speech and singing separately would reveal voice use differences that could provide new insights into the etiology and pathophysiology of phonotrauma in this at-risk population. Method Forty-two female singers with phonotrauma (vocal fold nodules or polyps) and 42 healthy matched controls were monitored using an ambulatory voice monitor. Weeklong statistics (average, standard deviation, skewness, kurtosis) for sound pressure level (SPL), fundamental frequency, cepstral peak prominence, the magnitude ratio of the first two harmonics (H1-H2 ), and three vocal dose measures were computed from the neck surface acceleration signal and separated into singing and speech using a singing classifier. Results Mixed analysis of variance models found expected differences between singing and speech in each voice parameter, except SPL kurtosis. SPL skewness, SPL kurtosis, and all H1-H2 distributional parameters differentiated patients and controls when singing and speech were combined. Interaction effects were found in H1-H2 kurtosis and all vocal dose measures. Patients had significantly higher vocal doses in speech compared to controls. Conclusions Consistent with prior work, the pathophysiology of phonotrauma in singers is characterized by more abrupt/complete glottal closure (decreased mean and variation for H1-H2 ) and increased laryngeal forces (negatively skewed SPL distribution) during phonation. Application of a singing classifier to weeklong data revealed that singers with phonotrauma spent more time speaking on a weekly basis, but not more time singing, compared to controls. Results are used as a basis for hypothesizing about the role of speaking voice in the etiology of phonotraumatic vocal hyperfunction in singers.
Collapse
Affiliation(s)
- Laura E. Toles
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | | | - Katherine L. Marks
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - James A. Burns
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| | - Tiffiny Hron
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| | - Jarrad H. Van Stan
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
- Harvard Medical School, Boston, MA
| | - Daryush D. Mehta
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
- Harvard Medical School, Boston, MA
| | - Robert E. Hillman
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
- Harvard Medical School, Boston, MA
| |
Collapse
|
36
|
Humming Facilitates a Gradual Increase in Vocal Intensity by Alleviating the Enhancement of Vocal Fold Contact and Supraglottic Constriction. J Voice 2021; 35:156.e1-156.e13. [DOI: 10.1016/j.jvoice.2019.05.004] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2019] [Revised: 05/09/2019] [Accepted: 05/09/2019] [Indexed: 11/23/2022]
|
37
|
Van Stan JH, Mehta DD, Ortiz AJ, Burns JA, Marks KL, Toles LE, Stadelman-Cohen T, Krusemark C, Muise J, Hron T, Zeitels SM, Fox AB, Hillman RE. Changes in a Daily Phonotrauma Index After Laryngeal Surgery and Voice Therapy: Implications for the Role of Daily Voice Use in the Etiology and Pathophysiology of Phonotraumatic Vocal Hyperfunction. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2020; 63:3934-3944. [PMID: 33197360 PMCID: PMC8608140 DOI: 10.1044/2020_jslhr-20-00168] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/15/2023]
Abstract
Purpose This study attempts to gain insights into the role of daily voice use in the etiology and pathophysiology of phonotraumatic vocal hyperfunction (PVH) by applying a logistic regression-based daily phonotrauma index (DPI) to predict group-based improvements in patients with PVH after laryngeal surgery and/or postsurgical voice therapy. Method A custom-designed ambulatory voice monitor was used to collect 1 week of pre- and postsurgery data from 27 female patients with PVH; 13 of these patients were also monitored after postsurgical voice therapy. Normative weeklong data were obtained from 27 matched controls. Each week was represented by the DPI, standard deviation of the difference between the first and second harmonic amplitudes (H1-H2). Results Compared to pretreatment, the DPI significantly decreased in the patient group after surgery (Cohen's d effect size = -0.86) and voice therapy (d = -1.06). The patient group DPI only normalized after voice therapy. Conclusions The DPI produced the expected pattern of improved ambulatory voice use across laryngeal surgery and postsurgical voice therapy in a group of patients with PVH. The results were interpreted as providing new objective information about the role of daily voice use in the etiology and pathophysiology of PVH. The DPI is viewed as an estimate of potential vocal fold trauma that relies on combining the long-term distributional characteristics of two parameters representing the magnitude of phonatory forces (neck-surface acceleration magnitude) and vocal fold closure dynamics (H1-H2). Further validation of the DPI is needed to better understand its potential clinical use.
Collapse
Affiliation(s)
- Jarrad H. Van Stan
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| | - Daryush D. Mehta
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| | | | - James A. Burns
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| | - Katherine L. Marks
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Laura E. Toles
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Tara Stadelman-Cohen
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Carol Krusemark
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Jason Muise
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Tiffiny Hron
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| | - Steven M. Zeitels
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| | | | - Robert E. Hillman
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| |
Collapse
|
38
|
Espinoza VM, Mehta DD, Van Stan JH, Hillman RE, Zañartu M. Glottal Aerodynamics Estimated From Neck-Surface Vibration in Women With Phonotraumatic and Nonphonotraumatic Vocal Hyperfunction. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2020; 63:2861-2869. [PMID: 32755502 PMCID: PMC7890221 DOI: 10.1044/2020_jslhr-20-00189] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/20/2020] [Revised: 05/29/2020] [Accepted: 06/10/2020] [Indexed: 05/06/2023]
Abstract
Purpose The purpose of this study was to determine whether estimates of glottal aerodynamic measures based on neck-surface vibration are comparable to those previously obtained using oral airflow and air pressure signals (Espinoza et al., 2017) in terms of discriminating patients with phonotraumatic and nonphonotraumatic vocal hyperfunction (PVH and NPVH) from vocally healthy controls. Method Consecutive /pae/ syllables at comfortable and loud level were produced by 16 women with PVH (organic vocal fold lesions), 16 women with NPVH (primary muscle tension dysphonia), and 32 vocally healthy women who were each matched to a patient according to age and occupation. Subglottal impedance-based inverse filtering of the anterior neck-surface accelerometer (ACC) signal yielded estimates of peak-to-peak glottal airflow, open quotient, and maximum flow declination rate. Average subglottal pressure and microphone-based sound pressure level (SPL) were also estimated from the ACC signal using subject-specific linear regression models. The ACC-based measures of glottal aerodynamics were normalized for SPL and statistically compared between each patient and matched-control group. Results Patients with PVH and NPVH exhibited lower SPL-normalized glottal aerodynamics values than their respective control subjects (p values ranging from < .01 to .07) with very large effect sizes (1.04-2.16), regardless of loudness condition or measurement method (i.e., ACC-based values maintained discriminatory power). Conclusions The results of this study demonstrate that ACC-based estimates of most glottal aerodynamic measures are comparable to those previously obtained from oral airflow and air pressure (Espinoza et al., 2017) in terms of differentiating between hyperfunctional (PVH and NPVH) and normal vocal function. ACC-based estimates of glottal aerodynamic measures may be used to assess vocal function during continuous speech and enables this assessment of daily voice use during ambulatory monitoring to provide better insight into the pathophysiological mechanisms associated with vocal hyperfunction.
Collapse
Affiliation(s)
| | - Daryush D. Mehta
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston
- Speech and Hearing Bioscience and Technology Program, Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| | - Jarrad H. Van Stan
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston
- Speech and Hearing Bioscience and Technology Program, Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| | - Robert E. Hillman
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston
- Speech and Hearing Bioscience and Technology Program, Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| | - Matías Zañartu
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso, Chile
| |
Collapse
|
39
|
The Feasibility of a Neck-Surface Accelerometer for Estimating the Amount of Acoustic Output During Phonation Regardless of the Difference in the Mouth Configuration. J Voice 2020; 36:297-308. [DOI: 10.1016/j.jvoice.2020.06.002] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2020] [Revised: 06/10/2020] [Accepted: 06/10/2020] [Indexed: 11/23/2022]
|
40
|
Whittico TH, Ortiz AJ, Marks KL, Toles LE, Van Stan JH, Hillman RE, Mehta DD. Ambulatory monitoring of Lombard-related vocal characteristics in vocally healthy female speakers. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2020; 147:EL552. [PMID: 32611177 PMCID: PMC7316514 DOI: 10.1121/10.0001446] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/12/2023]
Abstract
Speakers typically modify their voice in the presence of increased background noise levels, exhibiting the classic Lombard effect. Lombard-related characteristics during everyday activities were recorded from 17 vocally healthy women who wore an acoustic noise dosimeter and ambulatory voice monitor. The linear relationship between vocal sound pressure level and environmental noise level exhibited an average slope of 0.54 dB/dB and value of 72.8 dB SPL at 50 dBA when correlation coefficients were greater than 0.4. These results, coupled with analyses of spectral and cepstral vocal function measures, provide normative ambulatory Lombard characteristics for comparison with patients with voice-use related disorders.
Collapse
Affiliation(s)
- Thomas H Whittico
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, Massachusetts 02114, , , , , , ,
| | - Andrew J Ortiz
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, Massachusetts 02114, , , , , , ,
| | - Katherine L Marks
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, Massachusetts 02114, , , , , , ,
| | - Laura E Toles
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, Massachusetts 02114, , , , , , ,
| | - Jarrad H Van Stan
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, Massachusetts 02114, , , , , , ,
| | - Robert E Hillman
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, Massachusetts 02114, , , , , , ,
| | - Daryush D Mehta
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, Massachusetts 02114, , , , , , ,
| |
Collapse
|
41
|
Abstract
Vocal loading tasks are often used to investigate the relationship between voice use and vocal fatigue in laboratory settings. The present study investigated the concept of a novel quantitative dose-based vocal loading task for vocal fatigue evaluation. Ten female subjects participated in the study. Voice use was monitored and quantified using an online vocal distance dose calculator during six consecutive 30-min long sessions. Voice quality was evaluated subjectively using the CAPE-V and SAVRa before, between, and after each vocal loading task session. Fatigue-indicative symptoms, such as cough, swallowing, and voice clearance, were recorded. Statistical analysis of the results showed that the overall severity, the roughness, and the strain ratings obtained from CAPE-V obeyed similar trends as the three ratings from the SAVRa. These metrics increased over the first two thirds of the sessions to reach a maximum, and then decreased slightly near the session end. Quantitative metrics obtained from surface neck accelerometer signals were found to obey similar trends. The results consistently showed that an initial adjustment of voice quality was followed by vocal saturation, supporting the effectiveness of the proposed loading task.
Collapse
|
42
|
Echternach M, Huseynov J, Döllinger M, Nusseck M, Richter B. The impact of a standardized vocal loading test on vocal fold oscillations. Eur Arch Otorhinolaryngol 2020; 277:1699-1705. [PMID: 32107614 PMCID: PMC7198646 DOI: 10.1007/s00405-020-05791-5] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2019] [Accepted: 01/07/2020] [Indexed: 12/04/2022]
Abstract
Introduction Vocal loading capacity is an important aspect of vocal health and is measured using standardized vocal loading tests. However, it remains unclear how vocal fold oscillation patterns are influenced by a standardized vocal loading task. Methods 21 (10 male, 11 female) vocally healthy subjects were analyzed concerning the dysphonia severity index (DSI) and high speed videolaryngoscopy (HSV) on the vowel /i/ at a comfortable pitch and loudness before and after a standardized vocal loading test (10 min standardized text reading, at a level higher than 80 dB (A) measured at 30 cm from the mouth). Results Changes in DSI were statistically significant, diminishing by 1.2 points after the vocal loading test, which was mainly caused by an increase of the minimum intensity. However, the pre-post comparison of HSV derived measures failed to show any statistically significant changes. Conclusion It seems necessary to analyze the effects of a standardized vocal loading test on vocal fold oscillation patterns with respect to softest phonation and phonation threshold pressure rather than comfortable pitch and loudness. Level of evidence 2c
Collapse
Affiliation(s)
- Matthias Echternach
- Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Munich University Hospital (LMU), Campus Großhadern, Marchioninistraße 15, 81377, Munich, Germany.
| | - Jamal Huseynov
- Freiburg Institute of Musicians' Medicine, Freiburg University, Freiburg, Germany
- Department of Otolaryngology/Head and Neck Surgery, Clinics of Villingen-Schwenningen, Villingen Schwenningen, Germany
| | - Michael Döllinger
- Division of Phoniatrics and Pediatric Audiology, Department of Otolaryngology Head and Neck Surgery, University Hospital Erlangen, Friedrich-Alexander-University Erlangen-Nürnberg, Erlangen, Germany
| | - Manfred Nusseck
- Freiburg Institute of Musicians' Medicine, Freiburg University, Freiburg, Germany
| | - Bernhard Richter
- Freiburg Institute of Musicians' Medicine, Freiburg University, Freiburg, Germany
- Medical Faculty, Freiburg University, Freiburg, Germany
| |
Collapse
|
43
|
Van Stan JH, Mehta DD, Ortiz AJ, Burns JA, Toles LE, Marks KL, Vangel M, Hron T, Zeitels S, Hillman RE. Differences in Weeklong Ambulatory Vocal Behavior Between Female Patients With Phonotraumatic Lesions and Matched Controls. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2020; 63:372-384. [PMID: 31995428 PMCID: PMC7210443 DOI: 10.1044/2019_jslhr-19-00065] [Citation(s) in RCA: 37] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/09/2023]
Abstract
Purpose Previous work using ambulatory voice recordings has shown no differences in average vocal behavior between patients with phonotraumatic vocal hyperfunction and matched controls. This study used larger groups to replicate these results and expanded the analysis to include distributional characteristics of ambulatory voice use and measures indicative of glottal closure. Method Subjects included 180 adult women: 90 diagnosed with vocal fold nodules or polyps and 90 age-, sex-, and occupation-matched controls with no history of voice disorders. Weeklong summary statistics (average, variability, skewness, kurtosis) of voice use were computed from neck-surface acceleration recorded using an ambulatory voice monitor. Voice measures included estimates of sound pressure level (SPL), fundamental frequency (f o), cepstral peak prominence, and the difference between the first and second harmonic magnitudes (H1-H2). Results Statistical comparisons resulted in medium-large differences (Cohen's d ≥ 0.5) between groups for SPL skewness, f o variability, and H1-H2 variability. Two logistic regressions (theory-based and stepwise) found SPL skewness and H1-H2 variability to classify patients and controls based on their weekly voice data, with an area under the receiver operating characteristic curve of 0.85 and 0.82 on training and test sets, respectively. Conclusion Compared to controls, the weekly voice use of patients with phonotraumatic vocal hyperfunction reflected higher SPL tendencies (negatively skewed SPL) with more abrupt glottal closure (reduced H1-H2 variability, especially toward higher values). Further work could examine posttreatment data (e.g., after surgery and/or therapy) to determine the extent to which these differences are associated with the etiology and pathophysiology of phonotraumatic vocal fold lesions.
Collapse
Affiliation(s)
- Jarrad H. Van Stan
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| | - Daryush D. Mehta
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| | | | - James A. Burns
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| | - Laura E. Toles
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Katherine L. Marks
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
| | - Mark Vangel
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| | - Tiffiny Hron
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| | - Steven Zeitels
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
| | - Robert E. Hillman
- Massachusetts General Hospital, Boston
- Harvard Medical School, Boston, MA
- MGH Institute of Health Professions, Boston, MA
| |
Collapse
|
44
|
Lin JZ, Espinoza VM, Marks KL, Zañartu M, Mehta DD. Improved subglottal pressure estimation from neck-surface vibration in healthy speakers producing non-modal phonation. IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING 2020; 14:449-460. [PMID: 34079612 PMCID: PMC8168553 DOI: 10.1109/jstsp.2019.2959267] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/15/2023]
Abstract
Subglottal air pressure plays a major role in voice production and is a primary factor in controlling voice onset, offset, sound pressure level, glottal airflow, vocal fold collision pressures, and variations in fundamental frequency. Previous work has shown promise for the estimation of subglottal pressure from an unobtrusive miniature accelerometer sensor attached to the anterior base of the neck during typical modal voice production across multiple pitch and vowel contexts. This study expands on that work to incorporate additional accelerometer-based measures of vocal function to compensate for non-modal phonation characteristics and achieve an improved estimation of subglottal pressure. Subjects with normal voices repeated /p/-vowel syllable strings from loud-to-soft levels in multiple vowel contexts (/ɑ/, /i/, and /u/), pitch conditions (comfortable, lower than comfortable, higher than comfortable), and voice quality types (modal, breathy, strained, and rough). Subject-specific, stepwise regression models were constructed using root-mean-square (RMS) values of the accelerometer signal alone (baseline condition) and in combination with cepstral peak prominence, fundamental frequency, and glottal airflow measures derived using subglottal impedance-based inverse filtering. Five-fold cross-validation assessed the robustness of model performance using the root-mean-square error metric for each regression model. Each cross-validation fold exhibited up to a 25% decrease in prediction error when the model incorporated multidimensional aspects of the accelerometer signal compared with RMS-only models. Improved estimation of subglottal pressure for non-modal phonation was thus achievable, lending to future studies of subglottal pressure estimation in patients with voice disorders and in ambulatory voice recordings.
Collapse
Affiliation(s)
- Jon Z Lin
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA 02114 USA
| | | | - Katherine L Marks
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA 02114 USA
| | - Matías Zañartu
- Department of Electronic Engineering, Universidad Técnica Federico Santa Maria, Valparaíso, Chile
| | - Daryush D Mehta
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital-Harvard Medical School, Boston, MA 02114 USA
| |
Collapse
|
45
|
Marks KL, Lin JZ, Fox AB, Toles LE, Mehta DD. Impact of Nonmodal Phonation on Estimates of Subglottal Pressure From Neck-Surface Acceleration in Healthy Speakers. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2019; 62:3339-3358. [PMID: 31518510 PMCID: PMC6808343 DOI: 10.1044/2019_jslhr-s-19-0067] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/15/2023]
Abstract
Purpose The purpose of this study was to evaluate the effects of nonmodal phonation on estimates of subglottal pressure (Ps) derived from the magnitude of a neck-surface accelerometer (ACC) signal and to confirm previous findings regarding the impact of vowel contexts and pitch levels in a larger cohort of participants. Method Twenty-six vocally healthy participants (18 women, 8 men) were asked to produce a series of p-vowel syllables with descending loudness in 3 vowel contexts (/a/, /i/, and /u/), 3 pitch levels (comfortable, high, and low), and 4 elicited phonatory conditions (modal, breathy, strained, and rough). Estimates of Ps for each vowel segment were obtained by averaging the intraoral air pressure plateau before and after each segment. The root-mean-square magnitude of the neck-surface ACC signal was computed for each vowel segment. Three linear mixed-effects models were used to statistically assess the effects of vowel, pitch, and phonatory condition on the linear relationship (slope and intercept) between Ps and ACC signal magnitude. Results Results demonstrated statistically significant linear relationships between ACC signal magnitude and Ps within participants but with increased intercepts for the nonmodal phonatory conditions; slopes were affected to a lesser extent. Vowel and pitch contexts did not significantly affect the linear relationship between ACC signal magnitude and Ps. Conclusion The classic linear relationship between ACC signal magnitude and Ps is significantly affected when nonmodal phonation is produced by a speaker. Future work is warranted to further characterize nonmodal phonatory characteristics to improve the ACC-based prediction of Ps during naturalistic speech production.
Collapse
Affiliation(s)
- Katherine L. Marks
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston
- PhD Program in Rehabilitation Sciences, MGH Institute of Health Professions, Boston, MA
| | - Jonathan Z. Lin
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston
| | - Annie B. Fox
- Center for Interprofessional Studies and Innovation, MGH Institute of Health Professions, Boston, MA
| | - Laura E. Toles
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston
- PhD Program in Rehabilitation Sciences, MGH Institute of Health Professions, Boston, MA
| | - Daryush D. Mehta
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston
- PhD Program in Rehabilitation Sciences, MGH Institute of Health Professions, Boston, MA
- Department of Surgery, Harvard Medical School, Boston, MA
- Speech and Hearing Bioscience and Technology Program, Division of Medical Sciences, Harvard Medical School, Boston, MA
| |
Collapse
|
46
|
Dhaliwal SS, Doyle PC, Failla S, Hawkins S, Fung K. Role of voice rest following laser resection of vocal fold lesions: A randomized controlled trial. Laryngoscope 2019; 130:1750-1755. [PMID: 31498467 DOI: 10.1002/lary.28287] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2019] [Revised: 07/17/2019] [Accepted: 08/19/2019] [Indexed: 12/13/2022]
Abstract
OBJECTIVES/HYPOTHESIS Voice rest is often prescribed following phonosurgery by most surgeons despite limited empiric evidence to support its practice. This study assessed the effect of postphonosurgery voice rest on vocal outcomes. STUDY DESIGN Prospective, randomized controlled trial. METHODS Patients with unilateral vocal fold lesions undergoing CO2 laser excision were recruited in a prospective manner and randomized into one of two groups: 1) an experimental arm consisting of 7 days of absolute voice rest, or 2) a control arm consisting of no voice rest. The primary outcome measure was the Voice Handicap Index-10 (VHI-10) questionnaire. Secondary outcomes included aerodynamic measurements (maximum phonation time), acoustic measures (fundamental frequency, jitter, shimmer, and harmonic-to-noise ratio), and auditory-perceptual measures. Primary and secondary outcomes were assessed preoperatively and reassessed postoperatively at the 1- and 3-month follow-up. Patient compliance to voice rest instructions were controlled for using subjective and objective parameters. RESULTS Thirty patients were enrolled with 15 randomized to each arm of the study. Statistical analysis for the entire cohort showed a significant improvement in the mean preoperative VHI-10 compared to postoperative assessments at 1-month (19.0 vs. 7.3, P < .05) and 3-month (19.0 vs. 6.2, P < .05) follow-up. However, between-group comparisons showed no significant difference in postoperative VHI-10 at either time point. Similarly, secondary outcome measures yielded no significant difference in between-group comparisons. CONCLUSIONS Our study shows no significant benefit to voice rest on postoperative voice outcomes as determined by patient self-perception, acoustic variables, and auditory-perceptual analysis. LEVEL OF EVIDENCE 1b CLINICAL TRIAL NUMBER: NCT02788435 (clinicaltrials.gov) Laryngoscope, 130:1750-1755, 2020.
Collapse
Affiliation(s)
- Sandeep S Dhaliwal
- Department of Otolaryngology-Head and Neck Surgery, Western University, London, Ontario, Canada
| | - Philip C Doyle
- Department of Otolaryngology-Head and Neck Surgery, Western University, London, Ontario, Canada.,Voice Production and Perception Laboratory, and Department of Health and Rehabilitation Sciences, Western University, London, Ontario, Canada
| | - Sebastiano Failla
- Voice Production and Perception Laboratory, and Department of Health and Rehabilitation Sciences, Western University, London, Ontario, Canada
| | - Sarah Hawkins
- Department of Speech-Language Pathology, London Health Sciences Centre, London, Ontario, Canada
| | - Kevin Fung
- Department of Otolaryngology-Head and Neck Surgery, Western University, London, Ontario, Canada
| |
Collapse
|
47
|
Patel RR, Lulich SM, Verdi A. Vocal tract shape and acoustic adjustments of children during phonation into narrow flow-resistant tubes. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2019; 146:352. [PMID: 31370566 DOI: 10.1121/1.5116681] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/29/2018] [Accepted: 06/25/2019] [Indexed: 06/10/2023]
Abstract
The goal of the study is to quantify the salient vocal tract acoustic, subglottal acoustic, and vocal tract physiological characteristics during phonation into a narrow flow-resistant tube with 2.53 mm inner diameter and 124 mm length in typically developing vocally healthy children using simultaneous microphone, accelerometer, and 3D/4D ultrasound recordings. Acoustic measurements included fundamental frequency (fo), first formant frequency (F1), second formant frequency (F2), first subglottal resonance (FSg1), and peak-to-peak amplitude ratio (Pvt:Psg). Physiological measurements included posterior tongue height (D1), tongue dorsum height (D2), tongue tip height (D3), tongue length (D4), oral cavity width (D5), hyoid elevation (D6), pharynx width (D7). All measurements were made on eight boys and ten girls (6-9 years) during sustained /o:/ production at typical pitch and loudness, with and without flow-resistant tube. Phonation with the flow-resistant tube resulted in a significant decrease in F1, F2, and Pvt:Psg and a significant increase in D2, D3, and FSg1. A statistically significant gender effect was observed for D1, with D1 higher in boys. These findings agree well with reported findings from adults, suggesting common acoustic and articulatory mechanisms for narrow flow-resistant tube phonation. Theoretical implications of the findings are discussed.
Collapse
Affiliation(s)
- Rita R Patel
- Department of Speech and Hearing Sciences, Indiana University, 200 South Jordan Avenue, Bloomington, Indiana 47405-7002, USA
| | - Steven M Lulich
- Department of Speech and Hearing Sciences, Indiana University, 200 South Jordan Avenue, Bloomington, Indiana 47405-7002, USA
| | - Alessandra Verdi
- Department of Speech and Hearing Sciences, Indiana University, 200 South Jordan Avenue, Bloomington, Indiana 47405-7002, USA
| |
Collapse
|
48
|
Application of Ambulatory Phonation Monitoring (APM) in the measurement of daily speaking-time and voice intensity before and after cochlear implant in deaf adult patients. Auris Nasus Larynx 2019; 46:844-852. [PMID: 31005362 DOI: 10.1016/j.anl.2019.03.009] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2018] [Revised: 03/12/2019] [Accepted: 03/28/2019] [Indexed: 11/24/2022]
Abstract
OBJECTIVE to evaluate the changes in daily voice production, analysed through the Ambulatory Phonation Monitoring (APM), and their relationship with Quality of Life (QOL) measurements in a group of profound deaf patients treated with Cochlear Implant (CI). METHODS A total of 12 consecutive post-lingual deaf patients (8 females and 4 males) treated with CI for bilateral severe-to-profound hearing loss were enrolled. Each patient was evaluated before and after 6 months of CI use. In particular, the daily voice production evaluation was performed using the APM, while QOL information were gathered from the Italian version of the Nijmegen Cochlear Implant Questionnaire (I-NCIQ). RESULTS Significant differences in the APM results obtained before and after CI were found. In particular, a significant decrease of the mean amplitude and a significant increase of the daily phonation time and percentage of phonation time were demonstrated after CI use in all the patients. A significant improvement in the I-NCIQ scores was demonstrated after CI use and significant correlations among I-NCIQ scores and the APM parameters were found. CONCLUSIONS The APM could be useful in the evaluation of the benefits of cochlear implantation and may represents an indicator of deaf patient participation. In addition, the daily voice production's modifications after CI and their significant relations with the changes in QOL measurements could be useful in treatment planning as well as during pre- and post-operative counselling.
Collapse
|
49
|
Lei Z, Kennedy E, Fasanella L, Li-Jessen NYK, Mongeau L. Discrimination between Modal, Breathy and Pressed Voice for Single Vowels Using Neck-Surface Vibration Signals. APPLIED SCIENCES (BASEL, SWITZERLAND) 2019; 9:10.3390/app9071505. [PMID: 32133204 PMCID: PMC7055909 DOI: 10.3390/app9071505] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
The purpose of this study was to investigate the feasibility of using neck-surface acceleration signals to discriminate between modal, breathy and pressed voice. Voice data for five English single vowels were collected from 31 female native Canadian English speakers using a portable Neck Surface Accelerometer (NSA) and a condenser microphone. Firstly, auditory-perceptual ratings were conducted by five clinically-certificated Speech Language Pathologists (SLPs) to categorize voice type using the audio recordings. Intra- and inter-rater analyses were used to determine the SLPs' reliability for the perceptual categorization task. Mixed-type samples were screened out, and congruent samples were kept for the subsequent classification task. Secondly, features such as spectral harmonics, jitter, shimmer and spectral entropy were extracted from the NSA data. Supervised learning algorithms were used to map feature vectors to voice type categories. A feature wrapper strategy was used to evaluate the contribution of each feature or feature combinations to the classification between different voice types. The results showed that the highest classification accuracy on a full set was 82.5%. The breathy voice classification accuracy was notably greater (approximately 12%) than those of the other two voice types. Shimmer and spectral entropy were the best correlated metrics for the classification accuracy.
Collapse
Affiliation(s)
- Zhengdong Lei
- Department of Mechanical Engineering, McGill University, Montreal, QC H3A 0G4, Canada
| | - Evan Kennedy
- School of Communication Sciences and Disorders, McGill University, Montreal, QC H3A 0G4, Canada
| | - Laura Fasanella
- Department of Mechanical Engineering, McGill University, Montreal, QC H3A 0G4, Canada
| | | | - Luc Mongeau
- Department of Mechanical Engineering, McGill University, Montreal, QC H3A 0G4, Canada
| |
Collapse
|
50
|
Astolfi A, Castellana A, Puglisi GE, Fugiglando U, Carullo A. Speech level parameters in very low and excessive reverberation measured with a contact-sensor-based device and a headworn microphone. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2019; 145:2540. [PMID: 31046351 DOI: 10.1121/1.5098942] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/18/2018] [Accepted: 04/03/2019] [Indexed: 06/09/2023]
Abstract
This work deals with the effect of very low and excessive reverberation on speech sound pressure levels. Measurements were carried out in a semi-anechoic and reverberant room using a contact-sensor-based device fixed at the base of the neck and a headworn microphone placed at 2.5 cm from the speaker's mouth. Speakers produced short monologues and also described a map with the intent of correctly explaining directions to a listener 6 m away. A significant increase of about 2 dB in mean, equivalent, and mode speech levels in semi-anechoic compared to reverberant room was on average observed for the map description from recordings with the contact device, thus, highlighting an increased vocal intensity in dead rooms when the speech task had a communicative intent. Such reliable results were not found with the headworn microphone because speech levels were affected by noise recorded in the pauses. Particularly, the 30 ms logging interval brought to more severe errors in mode and mean speech levels compared to 1 s. When the pauses were removed and a logging interval of 30 ms was adopted, results were more consistent with those obtained with the contact sensor, but level differences between the two rooms still remained not significant.
Collapse
Affiliation(s)
- Arianna Astolfi
- Politecnico di Torino, Department of Energy, Corso Duca degli Abruzzi, 24, 10129, Torino, Italy
| | - Antonella Castellana
- Politecnico di Torino, Department of Electronics and Telecommunications, Corso Duca degli Abruzzi, 24, 10129, Torino, Italy
| | - Giuseppina Emma Puglisi
- Politecnico di Torino, Department of Energy, Corso Duca degli Abruzzi, 24, 10129, Torino, Italy
| | - Umberto Fugiglando
- Massachusetts Institute of Technology, Senseable City Lab, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139, USA
| | - Alessio Carullo
- Politecnico di Torino, Department of Electronics and Telecommunications, Corso Duca degli Abruzzi, 24, 10129, Torino, Italy
| |
Collapse
|