1
|
Bi MS, Nguyen DD, Arias-Vergara T, Döllinger M, Holik J, Madill CJ. Effects of Instructed Laryngeal Manipulation on Vocal Rise Time. J Voice 2024:S0892-1997(24)00352-7. [PMID: 39537447 DOI: 10.1016/j.jvoice.2024.10.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2024] [Revised: 10/05/2024] [Accepted: 10/08/2024] [Indexed: 11/16/2024]
Abstract
OBJECTIVES Previous research has shown that instructed manipulation of the false vocal fold activity (FVFA), true vocal fold mass (TVFM), and larynx height (LH) impacted on voice quality. It is not known whether these manipulations have any effect on voice onset. Vocal Rise Time (VRT) is an objective acoustic measure of voice onset, which has potential as an assessment tool in clinical settings. The present study aimed to investigate the effects of instructed manipulation of FVFA, TVFM, and LH on VRT. STUDY DESIGN Retrospective, observational study. METHODS Nine vocally trained participants (five females, four males) aged between 19 and 36years were instructed to perform differential manipulation of FVFA, TVFM, and LH while phonating the prolonged /ɑ/ vowel. Recorded voice samples were edited and analyzed using a novel Python-based application, the Voice Onset Analysis Tool (VOAT) to obtain VRT results. The VRT data were compared across conditions using repeated-measures analysis of variance, and were correlated against perceptual ratings of tone onset. RESULTS Reliability analysis showed excellent intra- and inter-rater agreement in VRT measurements using VOAT. All laryngeal parameters (FVFA, TVFM, and LH) showed statistically significant main effects on VRT. There was a consistent trend for thin TVFM, constricted FVFA, and lower LH to increase VRT values. However, post hoc analysis showed some statistically insignificant results possibly due to the small sample size. There was a weak positive correlation between VRT and perceptual tone onset ratings. CONCLUSION VRT measurements using VOAT are highly reliable. All three laryngeal parameters were contributors to determining voice onset. Given the limited sample size, careful definition and standardization of VRT measurement protocol is needed for it to become a useful and reliable measure of voice onset in research and clinical settings.
Collapse
Affiliation(s)
- Mingxuan Sophie Bi
- University of Sydney Voice Research Laboratory, Faculty of Medicine and Health, University of Sydney, Sydney, New South Wales, Australia
| | - Duy Duong Nguyen
- University of Sydney Voice Research Laboratory, Faculty of Medicine and Health, University of Sydney, Sydney, New South Wales, Australia; National Hospital of Otorhinolaryngology, Hanoi, Vietnam
| | - Tomás Arias-Vergara
- Pattern Recognition Lab, Friedrich-Alexander-University Erlangen-Nuernberg, Erlangen, Germany
| | - Micheal Döllinger
- Division of Phoniatrics and Pediatric Audiology at the Department of Otorhinolaryngology Head & Neck Surgery, University Hospital Erlangen, Friedrich-Alexander-University Erlangen-Nuernberg, Erlangen, Germany
| | - John Holik
- University of Sydney Voice Research Laboratory, Faculty of Medicine and Health, University of Sydney, Sydney, New South Wales, Australia
| | - Catherine J Madill
- University of Sydney Voice Research Laboratory, Faculty of Medicine and Health, University of Sydney, Sydney, New South Wales, Australia.
| |
Collapse
|
2
|
Converse AK, Krasko MN, Rudisch DM, Lunaris CL, Nisbet AF, Slesarev MS, Szot JC, Hoerst AG, Leverson GE, Gallagher CL, Ciucci MR. Positron emission tomography neuroimaging of [ 18F]fluorodeoxyglucose uptake and related behavior in the Pink1-/- rat model of Parkinson disease. Front Neurosci 2024; 18:1451118. [PMID: 39474461 PMCID: PMC11520326 DOI: 10.3389/fnins.2024.1451118] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2024] [Accepted: 08/30/2024] [Indexed: 11/02/2024] Open
Abstract
Introduction Parkinson disease (PD) is a neurodegenerative condition affecting multiple sensorimotor and cognitive systems. The Pink1-/- rat model exhibits vocal, cognitive, and limb use deficits seen in idiopathic PD. We sought to measure glucose metabolism in brain regions in Pink1-/- and wild type (WT) rats, and to associate these to measures of ultrasonic vocalization, cognition, and limb use behavior. Methods Pink1-/- (n = 12) and WT (n = 14) rats were imaged by [18F]fluorodeoxyglucose (FDG) positron emission tomography (PET) in a repeated measures design at approximately 10 months of age and 6 weeks later. Relative regional glucose metabolism was indexed by whole brain normalized FDG uptake, which was calculated for 18 regions identified a priori for comparison. Behavioral measures included tests of communication via ultrasonic vocalization, cognition with 5-Choice Serial Reaction Time Test (5-CSRTT), and limb use with Cylinder Test and Challenge Beam. Results Relative glucose metabolism was significantly different in Pink1-/- rats in prelimbic area, striatum, nucleus ambiguus, globus pallidus, and posterior parietal association cortex compared to WT controls. For behavioral measures, Pink1-/- rats demonstrated quieter vocalizations with a restricted frequency range, and they showed increased number of foot-faults and hindlimb steps (shuffling) in limb motor tests. Significant behavior vs. brain correlations included associations of ultrasonic vocalization parameters with glucose metabolism indices in locus coeruleus and substantia nigra. Conclusion FDG PET reveals abnormalities in relative regional brain glucose metabolism in Pink1-/- rats in brain regions that are important to cognition, vocalization, and limb motor control that are also impacted by Parkinson disease. This method may be useful for mechanistic studies of behavioral deficits and therapeutic interventions in translational studies in the Pink1-/- PD model.
Collapse
Affiliation(s)
| | - Maryann N. Krasko
- Division of Otolaryngology, Department of Surgery, University of Wisconsin-Madison, Madison, WI, United States
- Department of Communication Science and Disorders, University of Wisconsin-Madison, Madison, WI, United States
| | - Denis Michael Rudisch
- Division of Otolaryngology, Department of Surgery, University of Wisconsin-Madison, Madison, WI, United States
- Department of Communication Science and Disorders, University of Wisconsin-Madison, Madison, WI, United States
- Institute for Clinical and Translational Research, University of Wisconsin-Madison, Madison, WI, United States
| | - Charlie Lenell Lunaris
- Division of Otolaryngology, Department of Surgery, University of Wisconsin-Madison, Madison, WI, United States
| | - Alex F. Nisbet
- Division of Otolaryngology, Department of Surgery, University of Wisconsin-Madison, Madison, WI, United States
| | - Maxim S. Slesarev
- Waisman Center, University of Wisconsin-Madison, Madison, WI, United States
| | - John C. Szot
- Division of Otolaryngology, Department of Surgery, University of Wisconsin-Madison, Madison, WI, United States
| | - Andrew G. Hoerst
- Division of Otolaryngology, Department of Surgery, University of Wisconsin-Madison, Madison, WI, United States
| | - Glen E. Leverson
- Division of Otolaryngology, Department of Surgery, University of Wisconsin-Madison, Madison, WI, United States
| | | | - Michelle R. Ciucci
- Division of Otolaryngology, Department of Surgery, University of Wisconsin-Madison, Madison, WI, United States
- Department of Communication Science and Disorders, University of Wisconsin-Madison, Madison, WI, United States
- Neuroscience Training Program, University of Wisconsin-Madison, Madison, WI, United States
| |
Collapse
|
3
|
Vojtech JM, Stepp CE. Effects of Age and Parkinson's Disease on the Relationship between Vocal Fold Abductory Kinematics and Relative Fundamental Frequency. J Voice 2024; 38:1008-1022. [PMID: 35393167 PMCID: PMC9532464 DOI: 10.1016/j.jvoice.2022.03.007] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2022] [Revised: 03/07/2022] [Accepted: 03/08/2022] [Indexed: 11/16/2022]
Abstract
PURPOSE This study reports on two experiments to examine vocal fold abduction and its relationship with relative fundamental frequency (RFF), considering two attributes that have been shown to elicit group differences in RFF: age (Experiment 1) and Parkinson's disease (PD; Experiment 2). METHODS For both experiments, simultaneous acoustic and nasendoscopic recordings were collected as participants produced the utterance, /ifi/. RFF values were computed from the acoustic signal, whereas abduction duration and glottic angle at voicing offset were identified from the laryngoscopic images. In Experiment 1, 50 speakers with typical voices (18-83 years) were analyzed to examine (1A) the effects of speaker age on individual outcome measures (RFF, abduction duration, glottic angle) via Pearson's correlation coefficients, and (1B) the effects of abductory measures and age on RFF via an analysis of covariance. In Experiment 2, 20 speakers with PD and 20 matched controls were analyzed to examine (2A) the effects of group (with/without PD) on outcome measures via an analysis of variance, and (2B) the relationship of RFF with abduction duration, glottic angle, and age when considering group via an analysis of covariance. RESULTS Age demonstrated a significant, negative relationship with glottic angle (1A) but was not a significant factor when examining the relationship of vocal fold abduction and RFF (1B). Speaker group (with/without PD) demonstrated a significant effect on measures of RFF and abduction duration (2A) but was not a significant factor when examining the relationship of vocal fold abduction and RFF (2B). CONCLUSIONS RFF is sensitive to changes in vocal fold abductory patterns during devoicing, irrespective of speaker age or PD status.
Collapse
Affiliation(s)
- Jennifer M Vojtech
- Department of Biomedical Engineering, Boston University, Boston, Massachusetts; Department of Speech, Language, and Hearing Sciences, Boston University, Boston, Massachusetts; Delsys, Inc., Natick, Massachusetts; Altec, Inc., Natick, Massachusetts.
| | - Cara E Stepp
- Department of Biomedical Engineering, Boston University, Boston, Massachusetts; Department of Speech, Language, and Hearing Sciences, Boston University, Boston, Massachusetts; Department of Otolaryngology - Head and Neck Surgery, Boston University School of Medicine, Boston, Massachusetts
| |
Collapse
|
4
|
Hoffmeister JD, Broadfoot CK, Schaen-Heacock NE, Lechner SA, Krasko MN, Nisbet AF, Russell J, Szot J, Glass TJ, Connor NP, Kelm-Nelson CA, Ciucci MR. Vocal and tongue exercise in early to mid-stage Parkinson disease using the Pink1-/- rat. Brain Res 2024; 1837:148958. [PMID: 38685371 PMCID: PMC11166513 DOI: 10.1016/j.brainres.2024.148958] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2024] [Revised: 03/27/2024] [Accepted: 04/23/2024] [Indexed: 05/02/2024]
Abstract
Vocal and swallowing deficits are common in Parkinson disease (PD). Because these impairments are resistant to dopamine replacement therapies, vocal and lingual exercise are the primary treatment, but not all individuals respond to exercise and neural mechanisms of treatment response are unclear. To explore putative mechanisms, we used the progressive Pink1-/- rat model of early to mid-stage PD and employed vocal and lingual exercises at 6- and 10-months of age in male Pink1-/- and wild type (WT) rats. We hypothesized that vocal and lingual exercise would improve vocal and tongue use dynamics and increase serotonin (5HT) immunoreactivity in related brainstem nuclei. Rats were tested at baseline and after 8 weeks of exercise or sham exercise. At early-stage PD (6 months), vocal exercise resulted in increased call complexity, but did not change intensity, while at mid-stage (10 months), vocal exercise no longer influenced vocalization complexity. Lingual exercise increased tongue force generation and reduced relative optical density of 5HT in the hypoglossal nucleus at both time points. The effects of vocal and lingual exercise at these time points are less robust than in prodromal stages observed in previous work, suggesting that early exercise interventions may yield greater benefit. Future work targeting optimization of exercise at later time points may facilitate clinical translation.
Collapse
Affiliation(s)
- J D Hoffmeister
- University of Minnesota, Dept. of Otolaryngology, 420 Delaware Street SE, Minneapolis, MN 55422, USA; University of Wisconsin-Madison, Dept. of Communication Sciences and Disorders, 1975 Willow Drive, Madison, WI 53706, USA.
| | - C K Broadfoot
- University of South Alabama, Dept. of Speech Pathology and Audiology, 5721 USA Drive N, HAHN 1119, Mobile, AL 36688, USA; University of Wisconsin-Madison, Dept. of Surgery, Div. of Otolaryngology, 1300 University Avenue, 483 Medical Sciences Building, Madison, WI 53706, USA.
| | - N E Schaen-Heacock
- University of Wisconsin-Madison, Dept. of Communication Sciences and Disorders, 1975 Willow Drive, Madison, WI 53706, USA; University of Wisconsin-Madison, Dept. of Surgery, Div. of Otolaryngology, 1300 University Avenue, 483 Medical Sciences Building, Madison, WI 53706, USA.
| | - S A Lechner
- University of Wisconsin-Madison, Dept. of Surgery, Div. of Otolaryngology, 1300 University Avenue, 483 Medical Sciences Building, Madison, WI 53706, USA.
| | - M N Krasko
- University of Wisconsin-Madison, Dept. of Communication Sciences and Disorders, 1975 Willow Drive, Madison, WI 53706, USA; University of Wisconsin-Madison, Dept. of Surgery, Div. of Otolaryngology, 1300 University Avenue, 483 Medical Sciences Building, Madison, WI 53706, USA.
| | - A F Nisbet
- University of Wisconsin-Madison, Dept. of Surgery, Div. of Otolaryngology, 1300 University Avenue, 483 Medical Sciences Building, Madison, WI 53706, USA.
| | - J Russell
- University of Wisconsin-Madison, Dept. of Surgery, Div. of Otolaryngology, 1300 University Avenue, 483 Medical Sciences Building, Madison, WI 53706, USA.
| | - J Szot
- University of Wisconsin-Madison, Dept. of Surgery, Div. of Otolaryngology, 1300 University Avenue, 483 Medical Sciences Building, Madison, WI 53706, USA.
| | - T J Glass
- University of Wisconsin-Madison, Dept. of Surgery, Div. of Otolaryngology, 1300 University Avenue, 483 Medical Sciences Building, Madison, WI 53706, USA.
| | - N P Connor
- University of Wisconsin-Madison, Dept. of Communication Sciences and Disorders, 1975 Willow Drive, Madison, WI 53706, USA; University of Wisconsin-Madison, Dept. of Surgery, Div. of Otolaryngology, 1300 University Avenue, 483 Medical Sciences Building, Madison, WI 53706, USA.
| | - C A Kelm-Nelson
- University of Wisconsin-Madison, Dept. of Surgery, Div. of Otolaryngology, 1300 University Avenue, 483 Medical Sciences Building, Madison, WI 53706, USA.
| | - M R Ciucci
- University of Wisconsin-Madison, Dept. of Communication Sciences and Disorders, 1975 Willow Drive, Madison, WI 53706, USA; University of Wisconsin-Madison, Dept. of Surgery, Div. of Otolaryngology, 1300 University Avenue, 483 Medical Sciences Building, Madison, WI 53706, USA; University of Wisconsin-Madison, Neuroscience Training Program, 9531 WIMR II, 1111 Highland Ave., Madison, WI 53705, USA.
| |
Collapse
|
5
|
Buckley DP, Vojtech JM, Stepp CE. Relative Fundamental Frequency in Individuals with Globus Syndrome and Muscle Tension Dysphagia. J Voice 2024; 38:612-618. [PMID: 34823980 PMCID: PMC9124719 DOI: 10.1016/j.jvoice.2021.10.013] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2021] [Revised: 10/01/2021] [Accepted: 10/05/2021] [Indexed: 12/19/2022]
Abstract
OBJECTIVE Relative fundamental frequency (RFF) has been investigated as an acoustic measure to assess for changes in laryngeal tension. This study aimed to assess RFF in individuals with globus syndrome, individuals with muscle tension dysphagia (MTDg), and individuals with typical voices. METHODS RFF values were calculated from the speech acoustics of individuals with globus syndrome (n = 12), individuals with MTDg (n = 12), and age- and sex-matched controls with typical voices (n = 24). An analysis of variance was performed on RFF values to assess the effect of group. RESULTS There was no statistically significant effect of group on RFF values, with similar values for individuals with globus syndrome, individuals with MTDg, and control participants. CONCLUSIONS These results suggest that individuals with these disorders do not appear to possess paralaryngeal muscle tension in a locus and/or manner that directly impacts voice production.
Collapse
Affiliation(s)
- Daniel P Buckley
- Department of Speech, Language, and Hearing Sciences, Boston University, Boston, Massachusetts; Department of Otolaryngology - Head and Neck Surgery, Boston University School of Medicine, Boston, Massachusetts.
| | - Jennifer M Vojtech
- Department of Speech, Language, and Hearing Sciences, Boston University, Boston, Massachusetts; Department of Biomedical Engineering, Boston University, Boston, Massachusetts; Delsys, Inc., Natick, Massachusetts; Altec, Inc., Natick, Massachusetts
| | - Cara E Stepp
- Department of Speech, Language, and Hearing Sciences, Boston University, Boston, Massachusetts; Department of Biomedical Engineering, Boston University, Boston, Massachusetts
| |
Collapse
|
6
|
Dragicevic DA, Dahl KL, Perkins Z, Abur D, Stepp CE. Effects of a Concurrent Working Memory Task on Speech Acoustics in Parkinson's Disease. AMERICAN JOURNAL OF SPEECH-LANGUAGE PATHOLOGY 2024; 33:418-434. [PMID: 38081054 PMCID: PMC11001185 DOI: 10.1044/2023_ajslp-23-00214] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/13/2023] [Revised: 08/30/2023] [Accepted: 10/26/2023] [Indexed: 01/05/2024]
Abstract
PURPOSE The purpose of this study was to determine the effect of a concurrent working memory task on acoustic measures of speech in individuals with Parkinson's disease (PD). METHOD Individuals with PD and age- and sex-matched controls performed a speaking task with and without a Stroop-like concurrent working memory task. Cepstral peak prominence, low-to-high spectral energy ratio, fundamental frequency (fo) standard deviation, articulation rate, pause duration, articulatory-acoustic vowel space, relative fo, mean voice onset time (VOT), and VOT variability were calculated for each condition. Mixed-model analyses of variance were performed to determine the effects of group, condition (presence of the concurrent working memory task), and their interaction on the acoustic measures. RESULTS All measures except for VOT variability, mean pause duration, and relative fo offset differed between people with and without PD. Cepstral peak prominence, articulation rate, and relative fo offset differed as a function of condition. However, no measures indicated disparate effects of condition as a function of group. CONCLUSION Although differentially impactful on limb motor function in PD, here a concurrent working memory task was not found to be differentially disruptive to speech acoustics in PD. SUPPLEMENTAL MATERIAL https://doi.org/10.23641/asha.24759648.
Collapse
Affiliation(s)
| | - Kimberly L. Dahl
- Department of Speech, Language and Hearing Sciences, Boston University, MA
| | - Zoe Perkins
- Department of Speech, Language and Hearing Sciences, Boston University, MA
| | - Defne Abur
- Department of Speech, Language and Hearing Sciences, Boston University, MA
- Center for Language and Cognition Groningen, University of Groningen, the Netherlands
| | - Cara E. Stepp
- Department of Speech, Language and Hearing Sciences, Boston University, MA
- Department of Biomedical Engineering, Boston University, MA
- Department of Otolaryngology—Head and Neck Surgery, Boston University School of Medicine, MA
| |
Collapse
|
7
|
Pyfrom M, Lister J, Anand S. Influence of Cognitive Load on Voice Production: A Scoping Review. J Voice 2023:S0892-1997(23)00271-0. [PMID: 37770330 DOI: 10.1016/j.jvoice.2023.08.024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2023] [Revised: 08/24/2023] [Accepted: 08/25/2023] [Indexed: 09/30/2023]
Abstract
Cognitive-motor interactions in speech production have a strong theoretical basis. However, majority of the existing literature has primarily focused on subjective and objective measures related to speech and not voice. This systematic review gathered evidence on the potential relationship between cognitive load and voice production. A search of five databases, website, citation review, and author search were completed in a sequential order to find published and unpublished literature from 1992 to 2022 using a combination of search terms including voice, cognitive load/demand/effort/flexibility, dual task, and speech production/motor. Studies for which the primary dependent variables were linguistic, or speech measures were included if voice acoustics was also measured and described. A final sample of nine articles were identified as meeting inclusion criteria: completed between 1992 and 2022, healthy adults (18+), and American English speakers. The review indicated that existing literature on the influence of cognitive load on voice production is limited. Acoustic measures, such as fundamental frequency, sound pressure level, and cepstral peak prominence, do not show consistent patterns of change with an increase in cognitive load. It is likely that the inconsistencies in the speech or cognitive task type and measurement of individual reaction to cognitive load changes may have led to these varied results. Further research using a range/continuum of cognitive tasks varying in load/difficulty level and physiological measurements is warranted to understand the underlying mechanisms of behavioral performance with implications for clinical voice assessment and rehabilitation.
Collapse
Affiliation(s)
- Mary Pyfrom
- Department of Communication Sciences and Disorders, University of South Florida, Tampa, Florida
| | - Jennifer Lister
- Department of Communication Sciences and Disorders, University of South Florida, Tampa, Florida
| | - Supraja Anand
- Department of Communication Sciences and Disorders, University of South Florida, Tampa, Florida.
| |
Collapse
|
8
|
Groll MD, Peterson SD, Zañartu M, Vojtech JM, Stepp CE. Empirical Evaluation of the Role of Vocal Fold Collision on Relative Fundamental Frequency in Voicing Offset. J Voice 2022:S0892-1997(22)00291-0. [PMID: 36336485 PMCID: PMC10154433 DOI: 10.1016/j.jvoice.2022.09.016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2022] [Revised: 09/16/2022] [Accepted: 09/19/2022] [Indexed: 11/06/2022]
Abstract
OBJECTIVES Relative fundamental frequency (RFF) is an acoustic measure of changes in fundamental frequency during voicing transitions. The physiological mechanisms underlying RFF remain unclear. Recent modeling suggests that changes in RFF during voicing offset are due to decreases in overall system stiffness as a direct result of the cessation of vocal fold collision. To evaluate this finding empirically, here we examined whether variable timing between the end of vocal fold collision and the final voicing cycle used to calculate RFF explained the variability in RFF across individual voicing offset utterances. METHODS RFF during voicing offset was calculated from /ifi/ utterances produced by 35 participants under endoscopy, with and without vocal effort. RFF was calculated via two methods, in which utterances were aligned by (1) the end of vocal fold collision, or (2) the end of voicing. Analyses of variance were used to determine the effects of vocal effort and RFF method on the mean and standard deviation of RFF. RESULTS Aligning by vocal fold collision resulted in statistically significantly lower standard deviations. RFF means were statistically higher using the collision method; however, the degree of vocal effort was statistically significant regardless of the method. CONCLUSIONS These results provide empirical evidence to support that decreases in RFF during voicing offset are a result of decreases in system stiffness due to termination of vocal fold collision.
Collapse
Affiliation(s)
- Matti D Groll
- Department of Biomedical Engineering, Boston University, Boston, Massachusetts; Department of Speech, Language, and Hearing Sciences, Boston University, Boston, Massachusetts.
| | - Sean D Peterson
- Department of Mechanical and Mechatronics Engineering, University of Waterloo, Ontario, Canada
| | - Matías Zañartu
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso, Chile
| | - Jennifer M Vojtech
- Department of Biomedical Engineering, Boston University, Boston, Massachusetts; Department of Speech, Language, and Hearing Sciences, Boston University, Boston, Massachusetts
| | - Cara E Stepp
- Department of Biomedical Engineering, Boston University, Boston, Massachusetts; Department of Speech, Language, and Hearing Sciences, Boston University, Boston, Massachusetts; Department of Otolaryngology-Head and Neck Surgery, Boston University School of Medicine, Boston, Massachusetts
| |
Collapse
|
9
|
Groll MD, Vojtech JM, Hablani S, Mehta DD, Buckley DP, Noordzij JP, Stepp CE. Automated Relative Fundamental Frequency Algorithms for Use With Neck-Surface Accelerometer Signals. J Voice 2022; 36:156-169. [PMID: 32653267 PMCID: PMC7790853 DOI: 10.1016/j.jvoice.2020.06.001] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2020] [Accepted: 06/04/2020] [Indexed: 10/23/2022]
Abstract
OBJECTIVE Relative fundamental frequency (RFF) has been suggested as a potential acoustic measure of vocal effort. However, current clinical standards for RFF measures require time-consuming manual markings. Previous semi-automated algorithms have been developed to calculate RFF from microphone signals. The current study aimed to develop fully automated algorithms to calculate RFF from neck-surface accelerometer signals for ecological momentary assessment and ambulatory monitoring of voice. METHODS Training a set of 2646 /vowel-fricative-vowel/ utterances from 317 unique speakers, with and without voice disorders, was used to develop automated algorithms to calculate RFF values from neck-surface accelerometer signals. The algorithms first rejected utterances with poor vowel-to-noise ratios, then identified fricative locations, then used signal features to determine voicing boundary cycles, and finally calculated corresponding RFF values. These automated RFF values were compared to the clinical gold-standard of manual RFF calculated from simultaneously collected microphone signals in a novel test set of 639 utterances from 77 unique speakers. RESULTS Automated accelerometer-based RFF values resulted in an average mean bias error (MBE) across all cycles of 0.027 ST, with an MBE of 0.152 ST and -0.252 ST in the offset and onset cycles closest to the fricative, respectively. CONCLUSION All MBE values were smaller than the expected changes in RFF values following successful voice therapy, suggesting that the current algorithms could be used for ecological momentary assessment and ambulatory monitoring via neck-surface accelerometer signals.
Collapse
Affiliation(s)
- Matti D. Groll
- Department of Biomedical Engineering, Boston University, Boston, 02215, Massachusetts,Department of Speech, Language and Hearing Sciences, Boston University, Boston, 02215, Massachusetts
| | - Jennifer M. Vojtech
- Department of Biomedical Engineering, Boston University, Boston, 02215, Massachusetts,Department of Speech, Language and Hearing Sciences, Boston University, Boston, 02215, Massachusetts
| | - Surbhi Hablani
- Department of Speech, Language and Hearing Sciences, Boston University, Boston, 02215, Massachusetts
| | - Daryush D. Mehta
- Center for Laryngeal Surgery and Voice Rehabilitation and MGH Institute of Health Professions, Massachusetts General Hospital, Boston, 02114, Massachusetts,Department of Surgery, Harvard Medical School, Boston, 02144, Massachusetts,Program in Rehabilitation Sciences, MGH Institute of Health Professions, Boston, 02129, Massachusetts,Speech and Hearing Bioscience and Technology Program, Division of Medical Sciences, Harvard Medical School, Boston, 02144, Massachusetts
| | - Daniel P. Buckley
- Department of Speech, Language and Hearing Sciences, Boston University, Boston, 02215, Massachusetts,Department of Otolaryngology – Head and Neck Surgery, Boston University School of Medicine, Boston, 02118, Massachusetts
| | - J. Pieter Noordzij
- Department of Otolaryngology – Head and Neck Surgery, Boston University School of Medicine, Boston, 02118, Massachusetts
| | - Cara E. Stepp
- Department of Biomedical Engineering, Boston University, Boston, 02215, Massachusetts,Department of Speech, Language and Hearing Sciences, Boston University, Boston, 02215, Massachusetts,Department of Otolaryngology – Head and Neck Surgery, Boston University School of Medicine, Boston, 02118, Massachusetts
| |
Collapse
|
10
|
Hoffmeister JD, Kelm-Nelson CA, Ciucci MR. Manipulation of vocal communication and anxiety through pharmacologic modulation of norepinephrine in the Pink1-/- rat model of Parkinson disease. Behav Brain Res 2022; 418:113642. [PMID: 34755639 PMCID: PMC8671235 DOI: 10.1016/j.bbr.2021.113642] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2021] [Revised: 10/21/2021] [Accepted: 10/24/2021] [Indexed: 11/23/2022]
Abstract
Vocal deficits and anxiety are common, co-occurring, and interacting signs of Parkinson Disease (PD) that have a devastating impact on quality of life. Both manifest early in the disease process. Unlike hallmark motor signs of PD, neither respond adequately to dopamine replacement therapies, suggesting that their disease-specific mechanisms are at least partially extra-dopaminergic. Because noradrenergic dysfunction is also a defining feature of PD, especially early in the disease progression, drug therapies targeting norepinephrine are being trialed for treatment of motor and non-motor impairments in PD. Research assessing the effects of noradrenergic manipulation on anxiety and vocal impairment in PD, however, is sparse. In this pre-clinical study, we quantified the influence of pharmacologic manipulation of norepinephrine on vocal impairment and anxiety in Pink1-/- rats, a translational model of PD that demonstrates both vocal deficits and anxiety. Ultrasonic vocalization acoustics, anxiety behavior, and limb motor activity were tested twice for each rat: after injection of saline and after one of three drugs. We hypothesized that norepinephrine reuptake inhibitors (atomoxetine and reboxetine) and a β receptor antagonist (propranolol) would decrease vocal impairment and anxiety compared to saline, without affecting spontaneous motor activity. Our results demonstrated that atomoxetine and reboxetine decreased anxiety behavior. Atomoxetine also modulated ultrasonic vocalization acoustics, including an increase in vocal intensity, which is almost always reduced in animal models and patients with PD. Propranolol did not affect anxiety or vocalization. Drug condition did not influence spontaneous motor activity. These studies demonstrate relationships among vocal impairment, anxiety, and noradrenergic systems in the Pink1-/- rat model of PD.
Collapse
Affiliation(s)
- Jesse D Hoffmeister
- Department of Communication Sciences and Disorders, University of Wisconsin-Madison, 1975 Willow Drive, Madison, WI 53706, USA; Department of Surgery, Division of Otolaryngology-Head & Neck Surgery, University of Wisconsin-Madison, 600 Highland Avenue, Madison, WI 53792-7375, USA.
| | - Cynthia A Kelm-Nelson
- Department of Surgery, Division of Otolaryngology-Head & Neck Surgery, University of Wisconsin-Madison, 600 Highland Avenue, Madison, WI 53792-7375, USA.
| | - Michelle R Ciucci
- Department of Communication Sciences and Disorders, University of Wisconsin-Madison, 1975 Willow Drive, Madison, WI 53706, USA; Department of Surgery, Division of Otolaryngology-Head & Neck Surgery, University of Wisconsin-Madison, 600 Highland Avenue, Madison, WI 53792-7375, USA; Neuroscience Training Program, University of Wisconsin-Madison, 9531 WIMR II, 1111 Highland Avenue, Madison, WI 53705, USA.
| |
Collapse
|
11
|
Hoffmeister JD, Kelm-Nelson CA, Ciucci MR. Quantification of brainstem norepinephrine relative to vocal impairment and anxiety in the Pink1-/- rat model of Parkinson disease. Behav Brain Res 2021; 414:113514. [PMID: 34358571 PMCID: PMC8393386 DOI: 10.1016/j.bbr.2021.113514] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2021] [Revised: 07/23/2021] [Accepted: 07/30/2021] [Indexed: 12/31/2022]
Abstract
Vocal communication impairment and anxiety are co-occurring and interacting signs of Parkinson Disease (PD) that are common, poorly understood, and under-treated. Both vocal communication and anxiety are influenced by the noradrenergic system. In light of this shared neural substrate and considering that noradrenergic dysfunction is a defining characteristic of PD, tandem investigation of vocal impairment and anxiety in PD relative to noradrenergic mechanisms is likely to yield insights into the underlying disease-specific causes of these impairments. In order to address this gap in knowledge, we assessed vocal impairment and anxiety behavior relative to brainstem noradrenergic markers in a genetic rat model of early-onset PD (Pink1-/-) and wild type controls (WT). We hypothesized that 1) brainstem noradrenergic markers would be disrupted in Pink1-/-, and 2) brainstem noradrenergic markers would be associated with vocal acoustic changes and anxiety level. Rats underwent testing of ultrasonic vocalization and anxiety (elevated plus maze) at 4, 8, and 12 months of age. At 12 months, brainstem norepinephrine markers were quantified with immunohistochemistry. Results demonstrated that vocal impairment and anxiety were increased in Pink1-/- rats, and increased anxiety was associated with greater vocal deficit in this model of PD. Further, brainstem noradrenergic markers including TH and α1 adrenoreceptor immunoreactivity in the locus coeruleus, and β1 adrenoreceptor immunoreactivity in vagal nuclei differed by genotype, and were associated with vocalization and anxiety behavior. These findings demonstrate statistically significant relationships among vocal impairment, anxiety, and brainstem norepinephrine in the Pink1-/- rat model of PD.
Collapse
Affiliation(s)
- Jesse D Hoffmeister
- Department of Communication Sciences and Disorders, University of Wisconsin-Madison, 1975 Willow Drive, Madison, WI, 53706, USA; Department of Surgery, Division of Otolaryngology-Head & Neck Surgery, University of Wisconsin-Madison, 600 Highland Avenue, Madison, WI, 53792-7375, USA.
| | - Cynthia A Kelm-Nelson
- Department of Surgery, Division of Otolaryngology-Head & Neck Surgery, University of Wisconsin-Madison, 600 Highland Avenue, Madison, WI, 53792-7375, USA.
| | - Michelle R Ciucci
- Department of Communication Sciences and Disorders, University of Wisconsin-Madison, 1975 Willow Drive, Madison, WI, 53706, USA; Department of Surgery, Division of Otolaryngology-Head & Neck Surgery, University of Wisconsin-Madison, 600 Highland Avenue, Madison, WI, 53792-7375, USA; Neuroscience Training Program, University of Wisconsin-Madison, 9531 WIMR II, 1111 Highland Avenue, Madison, WI, 53705, USA.
| |
Collapse
|
12
|
Vojtech JM, Cilento DD, Luong AT, Noordzij JP, Diaz-Cadiz M, Groll MD, Buckley DP, McKenna VS, Noordzij JP, Stepp CE. Acoustic Identification of the Voicing Boundary during Intervocalic Offsets and Onsets based on Vocal Fold Vibratory Measures. APPLIED SCIENCES (BASEL, SWITZERLAND) 2021; 11:3816. [PMID: 36188437 PMCID: PMC9524108 DOI: 10.3390/app11093816] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/26/2023]
Abstract
Methods for automating relative fundamental frequency (RFF)-an acoustic estimate of laryngeal tension-rely on manual identification of voiced/unvoiced boundaries from acoustic signals. This study determined the effect of incorporating features derived from vocal fold vibratory transitions for acoustic boundary detection. Simultaneous microphone and flexible nasendoscope recordings were collected from adults with typical voices (N=69) and with voices characterized by excessive laryngeal tension (N=53) producing voiced-unvoiced-voiced utterances. Acoustic features that coincided with vocal fold vibratory transitions were identified and incorporated into an automated RFF algorithm ("aRFF-APH"). Voiced/unvoiced boundary detection accuracy was compared between the aRFF-APH algorithm, a recently published version of the automated RFF algorithm ("aRFF-AP"), and gold-standard, manual RFF estimation. Chi-square tests were performed to characterize differences in boundary cycle identification accuracy among the three RFF estimation methods. Voiced/unvoiced boundary detection accuracy significantly differed by RFF estimation method for voicing offsets and onsets. Of 7721 productions, 76.0% of boundaries were accurately identified via the aRFF-APH algorithm, compared to 70.3% with the aRFF-AP algorithm and 20.4% with manual estimation. Incorporating acoustic features that corresponded with voiced/unvoiced boundaries led to improvements in boundary detection accuracy that surpassed the gold-standard method for calculating RFF.
Collapse
Affiliation(s)
- Jennifer M. Vojtech
- Department of Biomedical Engineering, Boston University, Boston, MA 02215, USA
- Department of Speech, Language, and Hearing Sciences, Boston University, Boston, MA 02215, USA
- Delsys, Inc. and Altec, Inc., Natick, MA, 01760, USA
| | - Dante D. Cilento
- Department of Speech, Language, and Hearing Sciences, Boston University, Boston, MA 02215, USA
| | - Austin T. Luong
- Department of Speech, Language, and Hearing Sciences, Boston University, Boston, MA 02215, USA
| | - Jacob P. Noordzij
- Department of Speech, Language, and Hearing Sciences, Boston University, Boston, MA 02215, USA
| | - Manuel Diaz-Cadiz
- Department of Speech, Language, and Hearing Sciences, Boston University, Boston, MA 02215, USA
| | - Matti D. Groll
- Department of Biomedical Engineering, Boston University, Boston, MA 02215, USA
- Department of Speech, Language, and Hearing Sciences, Boston University, Boston, MA 02215, USA
| | - Daniel P. Buckley
- Department of Speech, Language, and Hearing Sciences, Boston University, Boston, MA 02215, USA
- Department of Otolaryngology – Head and Neck Surgery, Boston University School of Medicine, Boston, Massachusetts, USA
| | - Victoria S. McKenna
- Department of Speech, Language, and Hearing Sciences, Boston University, Boston, MA 02215, USA
| | - J. Pieter Noordzij
- Department of Otolaryngology – Head and Neck Surgery, Boston University School of Medicine, Boston, Massachusetts, USA
| | - Cara E. Stepp
- Department of Biomedical Engineering, Boston University, Boston, MA 02215, USA
- Department of Speech, Language, and Hearing Sciences, Boston University, Boston, MA 02215, USA
- Department of Otolaryngology – Head and Neck Surgery, Boston University School of Medicine, Boston, Massachusetts, USA
| |
Collapse
|
13
|
Dahl KL, Stepp CE. Changes in Relative Fundamental Frequency Under Increased Cognitive Load in Individuals With Healthy Voices. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2021; 64:1189-1196. [PMID: 33788635 PMCID: PMC8608166 DOI: 10.1044/2021_jslhr-20-00134] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/26/2020] [Revised: 11/19/2020] [Accepted: 01/06/2021] [Indexed: 05/26/2023]
Abstract
Purpose The purpose of this study was to determine the effect of cognitive load on relative fundamental frequency (RFF) in individuals with healthy voices. Method Twenty adults with healthy voices read sentences under different cognitive load conditions. Each sentence contained color terms printed in colored ink, creating an embedded Stroop task. Participants read the ink color in which a word was printed, rather than the color term itself. Sentences with mismatched ink colors and printed words constituted an increased cognitive load. RFF, an acoustic correlate of laryngeal tension, was calculated for the 10 voicing cycles preceding (i.e., offset) and following (i.e., onset) voiceless consonants. Repeated measures analyses of variance were constructed to assess the effects of RFF cycle, cognitive load, and their interaction on mean RFF offset and onset. Results There was a significant effect of cognitive load condition on RFF offset. There was no significant effect of condition on RFF onset nor significant interaction between cycle and condition on RFF onset or offset values. Conclusion Reduced mean RFF offset may indicate an increase in laryngeal muscle tension during a cognitively demanding task.
Collapse
Affiliation(s)
- Kimberly L. Dahl
- Department of Speech, Language and Hearing Sciences, Boston University, MA
| | - Cara E. Stepp
- Department of Speech, Language and Hearing Sciences, Boston University, MA
- Department of Biomedical Engineering, Boston University, MA
- Department of Otolaryngology–Head and Neck Surgery, Boston University School of Medicine, MA
| |
Collapse
|
14
|
Park Y, Wang F, Díaz-Cádiz M, Vojtech JM, Groll MD, Stepp CE. Vocal fold kinematics and relative fundamental frequency as a function of obstruent type and speaker age. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2021; 149:2189. [PMID: 33940922 PMCID: PMC8018794 DOI: 10.1121/10.0003961] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/16/2020] [Revised: 03/02/2021] [Accepted: 03/11/2021] [Indexed: 06/12/2023]
Abstract
The acoustic measure, relative fundamental frequency (RFF), has been proposed as an objective metric for assessing vocal hyperfunction; however, its underlying physiological mechanisms have not yet been fully characterized. This study aimed to characterize the relationship between RFF and vocal fold kinematics. Simultaneous acoustic and high-speed videoendoscopic (HSV) recordings were collected as younger and older speakers repeated the utterances /ifi/ and /iti/. RFF values at voicing offsets and onsets surrounding the obstruents were estimated from acoustic recordings, whereas glottal angles, durations of voicing offset and onset, and a kinematic estimate of laryngeal stiffness (KS) were obtained from HSV images. No differences were found between younger and older speakers for any measure. RFF did not differ between the two obstruents at voicing offset; however, fricatives necessitated larger glottal angles and longer durations to devoice. RFF values were lower and glottal angles were greater for stops relative to fricatives at voicing onset. KS values were greater in stops relative to fricatives. The less adducted vocal folds with greater KS and lower RFF at voicing onset for stops relative to fricatives in this study were in accordance with prior speculations that decreased vocal fold contact area and increased laryngeal stiffness may decrease RFF.
Collapse
Affiliation(s)
- Yeonggwang Park
- Department of Speech, Language, and Hearing Sciences, Boston University, 635 Commonwealth Avenue, Boston, Massachusetts 02215, USA
| | - Feng Wang
- Department of Biomedical Engineering, Boston University, 44 Cummington Mall, Boston, Massachusetts 02215, USA
| | - Manuel Díaz-Cádiz
- Department of Speech, Language, and Hearing Sciences, Boston University, 635 Commonwealth Avenue, Boston, Massachusetts 02215, USA
| | - Jennifer M Vojtech
- Department of Biomedical Engineering, Boston University, 44 Cummington Mall, Boston, Massachusetts 02215, USA
| | - Matti D Groll
- Department of Biomedical Engineering, Boston University, 44 Cummington Mall, Boston, Massachusetts 02215, USA
| | - Cara E Stepp
- Department of Speech, Language, and Hearing Sciences, Boston University, 635 Commonwealth Avenue, Boston, Massachusetts 02215, USA
| |
Collapse
|
15
|
Lin H, Karjadi C, Ang TFA, Prajakta J, McManus C, Alhanai TW, Glass J, Au R. Identification of digital voice biomarkers for cognitive health. EXPLORATION OF MEDICINE 2020; 1:406-417. [PMID: 33665648 PMCID: PMC7929495 DOI: 10.37349/emed.2020.00028] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2020] [Accepted: 09/04/2020] [Indexed: 01/03/2023] Open
Abstract
AIM Human voice contains rich information. Few longitudinal studies have been conducted to investigate the potential of voice to monitor cognitive health. The objective of this study is to identify voice biomarkers that are predictive of future dementia. METHODS Participants were recruited from the Framingham Heart Study. The vocal responses to neuropsychological tests were recorded, which were then diarized to identify participant voice segments. Acoustic features were extracted with the OpenSMILE toolkit (v2.1). The association of each acoustic feature with incident dementia was assessed by Cox proportional hazards models. RESULTS Our study included 6, 528 voice recordings from 4, 849 participants (mean age 63 ± 15 years old, 54.6% women). The majority of participants (71.2%) had one voice recording, 23.9% had two voice recordings, and the remaining participants (4.9%) had three or more voice recordings. Although all asymptomatic at the time of examination, participants who developed dementia tended to have shorter segments than those who were dementia free (P < 0.001). Additionally, 14 acoustic features were significantly associated with dementia after adjusting for multiple testing (P < 0.05/48 = 1 × 10-3). The most significant acoustic feature was jitterDDP_sma_de (P = 7.9 × 10-7), which represents the differential frame-to-frame Jitter. A voice based linear classifier was also built that was capable of predicting incident dementia with area under curve of 0.812. CONCLUSIONS Multiple acoustic and linguistic features are identified that are associated with incident dementia among asymptomatic participants, which could be used to build better prediction models for passive cognitive health monitoring.
Collapse
Affiliation(s)
- Honghuang Lin
- Section of Computational Biomedicine, Department of Medicine, Boston University School of Medicine, Boston, MA 02118, USA
- The Framingham Heart Study, Boston University School of Medicine, Boston, MA 02118, USA
| | - Cody Karjadi
- The Framingham Heart Study, Boston University School of Medicine, Boston, MA 02118, USA
- Department of Anatomy and Neurobiology, Boston University School of Medicine, Boston, MA 02118, USA
| | - Ting F. A. Ang
- The Framingham Heart Study, Boston University School of Medicine, Boston, MA 02118, USA
- Department of Anatomy and Neurobiology, Boston University School of Medicine, Boston, MA 02118, USA
- Department of Epidemiology, Boston University School of Public Health, Boston, MA 02118, USA
- Slone Epidemiology Center, Boston University School of Medicine, Boston, MA 02118, USA
| | - Joshi Prajakta
- The Framingham Heart Study, Boston University School of Medicine, Boston, MA 02118, USA
- Department of Anatomy and Neurobiology, Boston University School of Medicine, Boston, MA 02118, USA
| | - Chelsea McManus
- The Framingham Heart Study, Boston University School of Medicine, Boston, MA 02118, USA
- Department of Anatomy and Neurobiology, Boston University School of Medicine, Boston, MA 02118, USA
| | - Tuka W. Alhanai
- Department of Electrical and Computer Engineering, New York University Abu Dhabi, Abu Dhabi, UAE
| | - James Glass
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | - Rhoda Au
- The Framingham Heart Study, Boston University School of Medicine, Boston, MA 02118, USA
- Department of Anatomy and Neurobiology, Boston University School of Medicine, Boston, MA 02118, USA
- Department of Epidemiology, Boston University School of Public Health, Boston, MA 02118, USA
- Slone Epidemiology Center, Boston University School of Medicine, Boston, MA 02118, USA
- Department of Neurology, Boston University School of Medicine, Boston, MA 02118, USA
| |
Collapse
|
16
|
Park Y, Cádiz MD, Nagle KF, Stepp CE. Perceptual and Acoustic Assessment of Strain Using Synthetically Modified Voice Samples. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2020; 63:3897-3908. [PMID: 33151770 PMCID: PMC8608200 DOI: 10.1044/2020_jslhr-20-00294] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/28/2020] [Revised: 07/23/2020] [Accepted: 08/17/2020] [Indexed: 06/11/2023]
Abstract
Purpose Assessment of strained voice quality is difficult due to the weak reliability of auditory-perceptual evaluation and lack of strong acoustic correlates. This study evaluated the contributions of relative fundamental frequency (RFF) and mid-to-high frequency noise to the perception of strain. Method Stimuli were created using recordings of speakers producing /ifi/ with a comfortable voice and with maximum vocal effort. RFF values of the comfortable voice samples were synthetically lowered, and RFF values of the maximum vocal effort samples were synthetically raised. Mid-to-high frequency noise was added to the samples. Twenty listeners rated strain in a visual sort-and-rate task. The effects of RFF modification and added noise on strain were assessed using an analysis of variance; intra- and interrater reliability were compared with and without noise. Results Lowering RFF in the comfortable voice samples increased their perceived strain, whereas raising RFF in the maximum vocal effort samples decreased their strain. Adding noise increased strain and decreased intra- and interrater reliability relative to samples without added noise. Conclusions Both RFF and mid-to-high frequency noise contribute to the perception of strain. The presence of dysphonia may decrease the reliability of auditory-perceptual evaluation of strain, which supports the need for complementary objective assessments. Supplemental Material https://doi.org/10.23641/asha.13172252.
Collapse
Affiliation(s)
- Yeonggwang Park
- Department of Speech, Language and Hearing Sciences, Boston University, MA
| | - Manuel Díaz Cádiz
- Department of Speech, Language and Hearing Sciences, Boston University, MA
| | - Kathleen F. Nagle
- Department of Speech-Language Pathology, Seton Hall University, South Orange, NJ
| | - Cara E. Stepp
- Department of Speech, Language and Hearing Sciences, Boston University, MA
- Department of Biomedical Engineering, Boston University, MA
- Department of Otolaryngology – Head and Neck Surgery, Boston University School of Medicine, MA
| |
Collapse
|
17
|
Buckley DP, Cadiz MD, Eadie TL, Stepp CE. Acoustic Model of Perceived Overall Severity of Dysphonia in Adductor-Type Laryngeal Dystonia. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2020; 63:2713-2722. [PMID: 32692616 PMCID: PMC7872728 DOI: 10.1044/2020_jslhr-19-00354] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/22/2019] [Revised: 01/28/2020] [Accepted: 05/19/2020] [Indexed: 05/19/2023]
Abstract
Purpose This study is a secondary analysis of existing data. The goal of the study was to construct an acoustic model of perceived overall severity of dysphonia in adductory laryngeal dystonia (AdLD). We predicted that acoustic measures (a) related to voice and pitch breaks and (b) related to vocal effort would form the primary elements of a model corresponding to auditory-perceptual ratings of overall severity of dysphonia. Method Twenty inexperienced listeners evaluated the overall severity of dysphonia of speech stimuli from 19 individuals with AdLD. Acoustic features related to primary signs of AdLD (hyperadduction resulting in pitch and voice breaks) and to a potential secondary symptom of AdLD (vocal effort, measures of relative fundamental frequency) were computed from the speech stimuli. Multiple linear regression analysis was applied to construct an acoustic model of the overall severity of dysphonia. Results The acoustic model included an acoustic feature related to pitch and voice breaks and three acoustic measures derived from relative fundamental frequency; it explained 84.9% of the variance in the auditory-perceptual ratings of overall severity of dysphonia in the speech samples. Conclusions Auditory-perceptual ratings of overall severity of dysphonia in AdLD were related to acoustic features of primary signs (pitch and voice breaks, hyperadduction associated with laryngeal spasms) and were also related to acoustic features of vocal effort. This suggests that compensatory vocal effort may be a secondary symptom in AdLD. Future work to generalize this acoustic model to a larger, independent data set is necessary before clinical translation is warranted.
Collapse
Affiliation(s)
- Daniel P. Buckley
- Department of Speech, Language, and Hearing Sciences, Boston University, MA
- Department of Otolaryngology–Head and Neck Surgery, Boston University School of Medicine, MA
| | - Manuel Diaz Cadiz
- Department of Speech, Language, and Hearing Sciences, Boston University, MA
| | - Tanya L. Eadie
- Department of Speech and Hearing Sciences, University of Washington, Seattle
| | - Cara E. Stepp
- Department of Speech, Language, and Hearing Sciences, Boston University, MA
- Department of Otolaryngology–Head and Neck Surgery, Boston University School of Medicine, MA
- Department of Biomedical Engineering, Boston University, MA
| |
Collapse
|
18
|
Heller Murray ES, Segina RK, Woodnorth GH, Stepp CE. Relative Fundamental Frequency in Children With and Without Vocal Fold Nodules. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2020; 63:361-371. [PMID: 32073342 PMCID: PMC7210445 DOI: 10.1044/2019_jslhr-19-00058] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
Purpose Relative fundamental frequency (RFF) is an acoustic measure that is sensitive to functional voice differences in adults. The aim of the current study was to evaluate RFF in children, as there are known structural and functional differences between the pediatric and adult vocal mechanisms. Method RFF was analyzed in 28 children with vocal fold nodules (CwVN, M = 9.0 years) and 28 children with typical voices (CwTV, M = 8.9 years). RFF is the instantaneous fundamental frequency (f 0) of the 10 vocalic cycles during devoicing (vocal offset) and 10 vocalic cycles during the revoicing (vocal onset) of the vowels that surround a voiceless consonant. Each cycle's f 0 was normalized to a steady-state portion of the vowel. RFF values for the cycles closest to the voiceless consonant, that is, Offset Cycle 10 and Onset Cycle 1, were examined. Results Average RFF values for Offset Cycle 10 and Onset Cycle 1 did not differ between CwVN and CwTV; however, within-subject variability of Offset Cycle 10 was decreased in CwVN. Across both groups, male children had lower Offset Cycle 10 RFF values as compared to female children. Additionally, Onset Cycle 1 values were decreased in younger children as compared to those of older children. Conclusions Unlike previous work with adults, CwVN did not have significantly different RFF values than CwTV. Younger children had lower RFF values for Onset Cycle 1 than older children, suggesting that vocal onset f 0 may provide information on the maturity of the laryngeal motor system.
Collapse
Affiliation(s)
- Elizabeth S. Heller Murray
- Department of Speech, Language & Hearing Sciences, Boston University, MA
- Department of Otolaryngology and Communication Enhancement, Boston Children's Hospital, MA
| | - Roxanne K. Segina
- Department of Speech, Language & Hearing Sciences, Boston University, MA
| | | | - Cara E. Stepp
- Department of Speech, Language & Hearing Sciences, Boston University, MA
- Department of Otolaryngology—Head & Neck Surgery, Boston University School of Medicine, MA
- Department of Biomedical Engineering, Boston University, MA
| |
Collapse
|
19
|
Vojtech JM, Segina RK, Buckley DP, Kolin KR, Tardif MC, Noordzij JP, Stepp CE. Refining algorithmic estimation of relative fundamental frequency: Accounting for sample characteristics and fundamental frequency estimation method. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2019; 146:3184. [PMID: 31795681 PMCID: PMC6847943 DOI: 10.1121/1.5131025] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/30/2019] [Revised: 10/07/2019] [Accepted: 10/08/2019] [Indexed: 05/26/2023]
Abstract
Relative fundamental frequency (RFF) is a promising acoustic measure for evaluating voice disorders. Yet, the accuracy of the current RFF algorithm varies across a broad range of vocal signals. The authors investigated how fundamental frequency (fo) estimation and sample characteristics impact the relationship between manual and semi-automated RFF estimates. Acoustic recordings were collected from 227 individuals with and 256 individuals without voice disorders. Common fo estimation techniques were compared to the autocorrelation method currently implemented in the RFF algorithm. Pitch strength-based categories were constructed using a training set (1158 samples), and algorithm thresholds were tuned to each category. RFF was then computed on an independent test set (291 samples) using category-specific thresholds and compared against manual RFF via mean bias error (MBE) and root-mean-square error (RMSE). Auditory-SWIPE' for fo estimation led to the greatest correspondence with manual RFF and was implemented in concert with category-specific thresholds. Refining fo estimation and accounting for sample characteristics led to increased correspondence with manual RFF [MBE = 0.01 semitones (ST), RMSE = 0.28 ST] compared to the unmodified algorithm (MBE = 0.90 ST, RMSE = 0.34 ST), reducing the MBE and RMSE of semi-automated RFF estimates by 88.4% and 17.3%, respectively.
Collapse
Affiliation(s)
- Jennifer M Vojtech
- Department of Biomedical Engineering, Boston University, 44 Cummington Mall, Boston, Massachusetts 02215, USA
| | - Roxanne K Segina
- Department of Speech, Language, and Hearing Sciences, Boston University, 635 Commonwealth Avenue, Boston, Massachusetts 02215, USA
| | - Daniel P Buckley
- Department of Otolaryngology-Head and Neck Surgery, Boston University School of Medicine, 72 East Concord Street, Boston, Massachusetts 02118, USA
| | - Katharine R Kolin
- Department of Speech, Language, and Hearing Sciences, Boston University, 635 Commonwealth Avenue, Boston, Massachusetts 02215, USA
| | - Monique C Tardif
- Department of Speech, Language, and Hearing Sciences, Boston University, 635 Commonwealth Avenue, Boston, Massachusetts 02215, USA
| | - J Pieter Noordzij
- Department of Otolaryngology-Head and Neck Surgery, Boston University School of Medicine, 72 East Concord Street, Boston, Massachusetts 02118, USA
| | - Cara E Stepp
- Department of Speech, Language, and Hearing Sciences, Boston University, 635 Commonwealth Avenue, Boston, Massachusetts 02215, USA
| |
Collapse
|
20
|
Park Y, Stepp CE. Test-Retest Reliability of Relative Fundamental Frequency and Conventional Acoustic, Aerodynamic, and Perceptual Measures in Individuals With Healthy Voices. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2019; 62:1707-1718. [PMID: 31181173 PMCID: PMC6808369 DOI: 10.1044/2019_jslhr-s-18-0507] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/20/2018] [Revised: 02/14/2019] [Accepted: 02/18/2019] [Indexed: 05/31/2023]
Abstract
Purpose Recent studies have shown that an acoustic measure, relative fundamental frequency (RFF), has potential for the assessment of excessive laryngeal tension and vocal effort associated with functional and neurological voice disorders. This study presents an analysis of the test-retest reliability of RFF in individuals with healthy voices and a comparison of reliability between RFF and conventional measures of voice. Method Acoustic and aerodynamic measurements and Consensus Auditory-Perceptual Evaluation of Voice (CAPE-V) were performed on 28 individuals with healthy voices on 5 consecutive days. Participants produced RFF stimuli, a sustained /ɑ/, and a reading passage to allow for extraction of acoustic measures and CAPE-V ratings; /pa/ trains were produced to allow for extraction of aerodynamic measures. Results Moderate reliabilities (intraclass correlation coefficient [ICC] = .64-.71) were found for RFF values. Mean vocal fundamental frequency, smoothed cepstral peak prominence, shimmer, harmonics-to-noise ratio, and mean airflow rate exhibited good-to-excellent reliabilities (ICC = .76-.99). ICCs for jitter and phonation threshold pressure were moderately reliable (ICC = .67-.74). ICCs for subglottal pressure estimates and all CAPE-V parameters showed poor reliabilities (ICC = .31-.58). Conclusion RFF has comparable reliability to conventional measures of voice. This expands the potential for clinical application of RFF. Supplemental Material https://doi.org/10.23641/asha.8233376.
Collapse
Affiliation(s)
- Yeonggwang Park
- Department of Speech, Language, and Hearing Sciences, Boston University, MA
| | - Cara E. Stepp
- Department of Speech, Language, and Hearing Sciences, Boston University, MA
- Department of Biomedical Engineering, Boston University, MA
- Department of Otolaryngology–Head and Neck Surgery, Boston University School of Medicine, MA
| |
Collapse
|
21
|
Hamdan AL, Khalifee E, Tabet G. Unilateral Vocal Fold Paralysis in Parkinson Disease: Case Report and Review of the Literature. J Voice 2018; 32:763-766. [DOI: 10.1016/j.jvoice.2017.09.007] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2017] [Revised: 09/05/2017] [Accepted: 09/12/2017] [Indexed: 11/25/2022]
|
22
|
Park Y, Stepp CE. The Effects of Stress Type, Vowel Identity, Baseline f 0, and Loudness on the Relative Fundamental Frequency of Individuals With Healthy Voices. J Voice 2018; 33:603-610. [PMID: 30078521 DOI: 10.1016/j.jvoice.2018.04.004] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2017] [Accepted: 04/04/2018] [Indexed: 11/26/2022]
Abstract
OBJECTIVE Relative fundamental frequency (RFF) has been investigated as a possible acoustic measure to assess laryngeal tension. This study aimed to identify possible factors in RFF stimuli (stress type, vowel identity, baseline f0, and loudness) that might also affect RFF values. METHODS Fifteen speakers with healthy voices produced short RFF stimuli (vowel-/f/-vowel; eg, /ɑfɑ/) in different conditions. They produced the stimuli with three different stress types and four different vowels. Participants also produced stimuli in three different baseline f0 conditions and three different loudness conditions. The mean RFF and within- and between-subject standard deviation (SD) of RFF were estimated for each stimuli condition. RESULTS Stress type had a statistically significant effect on RFF means and within-subject SDs with a large effect size (P < 0.001). A significant but small effect of vowel identity was observed: onset 1 RFF values from /ɑ/ were higher than onset 1 RFF values from /u/ (P < 0.01). Baseline f0 had a significant effect on RFF values with a medium effect size (P < 0.05). Loudness did not have any significant effect on RFF, but onset 1 RFF values produced with soft voice showed an unexpectedly high between-subject SD. CONCLUSIONS This evidence suggests that stress type is the most important factor to consider in RFF measurement. We also conclude that RFF may be somewhat resistant to vowel variation and small differences in baseline f0 and loudness, which may be beneficial in clinical settings.
Collapse
Affiliation(s)
- Yeonggwang Park
- Department of Speech, Language, and Hearing Sciences, Boston University, Boston, Massachusetts.
| | - Cara E Stepp
- Department of Speech, Language, and Hearing Sciences, Boston University, Boston, Massachusetts; Department of Biomedical Engineering, Boston University, Boston, Massachusetts; Department of Otolaryngology-Head and Neck Surgery, Boston University School of Medicine, Boston, Massachusetts
| |
Collapse
|
23
|
Rusz J, Hlavnicka J, Tykalova T, Novotny M, Dusek P, Sonka K, Ruzicka E. Smartphone Allows Capture of Speech Abnormalities Associated With High Risk of Developing Parkinson’s Disease. IEEE Trans Neural Syst Rehabil Eng 2018; 26:1495-1507. [DOI: 10.1109/tnsre.2018.2851787] [Citation(s) in RCA: 51] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
|
24
|
Analysis of speaker recognition methodologies and the influence of kinetic changes to automatically detect Parkinson's Disease. Appl Soft Comput 2018. [DOI: 10.1016/j.asoc.2017.11.001] [Citation(s) in RCA: 50] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
25
|
Godino-Llorente JI, Shattuck-Hufnagel S, Choi JY, Moro-Velázquez L, Gómez-García JA. Towards the identification of Idiopathic Parkinson's Disease from the speech. New articulatory kinetic biomarkers. PLoS One 2017; 12:e0189583. [PMID: 29240814 PMCID: PMC5730127 DOI: 10.1371/journal.pone.0189583] [Citation(s) in RCA: 39] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2017] [Accepted: 11/29/2017] [Indexed: 11/22/2022] Open
Abstract
Although a large amount of acoustic indicators have already been proposed in the literature to evaluate the hypokinetic dysarthria of people with Parkinson's Disease, the goal of this work is to identify and interpret new reliable and complementary articulatory biomarkers that could be applied to predict/evaluate Parkinson's Disease from a diadochokinetic test, contributing to the possibility of a further multidimensional analysis of the speech of parkinsonian patients. The new biomarkers proposed are based on the kinetic behaviour of the envelope trace, which is directly linked with the articulatory dysfunctions introduced by the disease since the early stages. The interest of these new articulatory indicators stands on their easiness of identification and interpretation, and their potential to be translated into computer based automatic methods to screen the disease from the speech. Throughout this paper, the accuracy provided by these acoustic kinetic biomarkers is compared with the one obtained with a baseline system based on speaker identification techniques. Results show accuracies around 85% that are in line with those obtained with the complex state of the art speaker recognition techniques, but with an easier physical interpretation, which open the possibility to be transferred to a clinical setting.
Collapse
Affiliation(s)
- J. I. Godino-Llorente
- Speech Communication Group, Research Laboratory of Electronics, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
| | - S. Shattuck-Hufnagel
- Speech Communication Group, Research Laboratory of Electronics, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
| | - J. Y. Choi
- Speech Communication Group, Research Laboratory of Electronics, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
| | - L. Moro-Velázquez
- Centre for Biomedical Technology, Universidad Politécnica de Madrid, Madrid, Spain
| | - J. A. Gómez-García
- Centre for Biomedical Technology, Universidad Politécnica de Madrid, Madrid, Spain
| |
Collapse
|
26
|
Sachs AM, Bielamowicz SA, Stager SV. Treatment effectiveness for aging changes in the larynx. Laryngoscope 2017; 127:2572-2577. [PMID: 28681923 DOI: 10.1002/lary.26706] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2016] [Revised: 04/24/2017] [Accepted: 05/01/2017] [Indexed: 11/11/2022]
Abstract
OBJECTIVES To compare the effectiveness of injection augmentation and bilateral thyroplasty surgery in managing age-related changes of the larynx. STUDY DESIGN Retrospective chart review of patients treated with bilateral thyroplasty and/or injection augmentation. METHODS We evaluated 22 patients before and after treatment using: 1) normalized glottal gap area and normalized true vocal fold width from endoscopic images; 2) patient self-rating questionnaires; and 3) acoustic and aerodynamic measures. RESULTS Thyroplasty surgery resulted in 38% of patients demonstrating less bowing compared to 33% after injection, and 63% demonstrated less supraglottic activity compared to 43% after injection (P = 0.09). Change in mean Voice-Related Quality of Life (V-RQOL) scores was 25.5 after thyroplasty compared to -16.4 after injection (P < 0.05). Those exhibiting a greater than 20 change in V-RQOL after treatment were significantly more likely to report swallowing symptoms pretreatment. CONCLUSION Patients postinjection did not rate themselves on any questionnaires as significantly better compared to pretreatment, whereas patients post-thyroplasty rated themselves as significantly improved on all questionnaires. Patients post-thyroplasty rated their voices as significantly closer to their best voice than patients postinjection. Likewise, 64% of patients who had thyroplasty surgery reported a significant treatment effect compared to 33% for injection augmentation. LEVEL OF EVIDENCE 4. Laryngoscope, 127:2572-2577, 2017.
Collapse
Affiliation(s)
- Aaron M Sachs
- The George Washington University School of Medicine, Washington, DC
| | - Steven A Bielamowicz
- Voice Treatment Center, Medical Faculty Associates, Department of Surgery, Division of Otolaryngology, The George Washington University School of Medicine, Washington, DC
| | - Sheila V Stager
- Voice Treatment Center, Medical Faculty Associates, Department of Surgery, Division of Otolaryngology, The George Washington University School of Medicine, Washington, DC
| |
Collapse
|
27
|
McKenna VS, Heller Murray ES, Lien YAS, Stepp CE. The Relationship Between Relative Fundamental Frequency and a Kinematic Estimate of Laryngeal Stiffness in Healthy Adults. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2016; 59:1283-1294. [PMID: 27936279 PMCID: PMC5399757 DOI: 10.1044/2016_jslhr-s-15-0406] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/24/2015] [Revised: 02/21/2016] [Accepted: 05/02/2016] [Indexed: 05/19/2023]
Abstract
PURPOSE This study examined the relationship between the acoustic measure relative fundamental frequency (RFF) and a kinematic estimate of laryngeal stiffness. METHOD Twelve healthy adults (mean age = 22.7 years, SD = 4.4; 10 women, 2 men) produced repetitions of /ifi/ while varying their vocal effort during simultaneous acoustic and video nasendoscopic recordings. RFF was determined from the last 10 voicing cycles before the voiceless obstruent (RFF offset) and the first 10 cycles of revoicing (RFF onset). A kinematic stiffness ratio was calculated for the vocal fold adductory gesture during revoicing by normalizing the maximum angular velocity by the maximum glottic angle during the voiceless obstruent. RESULTS A linear mixed effect model indicated that RFF offset and onset were significant predictors of the kinematic stiffness ratios. The model accounted for 52% of the variance in the kinematic data. Individual relationships between RFF and kinematic stiffness ratios varied across participants, with at least moderate negative correlations in 83% of participants for RFF offset but only 40% of participants for RFF onset. CONCLUSIONS RFF significantly predicted kinematic estimates of laryngeal stiffness in healthy speakers and has the potential to be a useful clinical indicator of laryngeal tension. Further research is needed in individuals with voice disorders.
Collapse
Affiliation(s)
| | | | - Yu-An S. Lien
- Department of Biomedical Engineering, Boston University, MA
| | - Cara E. Stepp
- Department of Speech, Language, and Hearing Sciences, Boston University, MA
- Department of Biomedical Engineering, Boston University, MA
- Department of Otolaryngology—Head and Neck Surgery, Boston University School of Medicine, MA
| |
Collapse
|
28
|
Roy N, Fetrow RA, Merrill RM, Dromey C. Exploring the Clinical Utility of Relative Fundamental Frequency as an Objective Measure of Vocal Hyperfunction. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2016; 59:1002-1017. [PMID: 27768175 DOI: 10.1044/2016_jslhr-s-15-0354] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/09/2015] [Accepted: 03/28/2016] [Indexed: 05/16/2023]
Abstract
PURPOSE Vocal hyperfunction, related to abnormal laryngeal muscle activity, is considered the proximal cause of primary muscle tension dysphonia (pMTD). Relative fundamental frequency (RFF) has been proposed as an objective acoustic marker of vocal hyperfunction. This study examined (a) the ability of RFF to track changes in vocal hyperfunction after treatment for pMTD and (b) the influence of dysphonia severity, among other factors, on the feasibility of RFF computation. METHOD RFF calculations and dysphonia severity ratings were derived from pre- and posttreatment recordings from 111 women with pMTD and 20 healthy controls. Three vowel-voiceless consonant-vowel stimuli were analyzed. RESULTS RFF onset slope consistently varied as a function of group (pMTD vs. controls) and time (pretherapy vs. posttherapy). Significant correlations between RFF onset cycle 1 and dysphonia severity were observed. However, in many samples, RFF could not be computed, and adjusted odds ratios revealed that these unanalyzable data were linked to dysphonia severity, phonetic (vowel-voiceless consonant-vowel) context, and group (pMTD vs. control). CONCLUSIONS RFF onset appears to be sensitive to the presence and degree of suspected vocal hyperfunction before and after therapy. The large number of unanalyzable samples (related especially to dysphonia severity in the pMTD group) represents an important limitation.
Collapse
Affiliation(s)
- Nelson Roy
- Department of Communication Sciences and Disorders, The University of Utah, Salt Lake City
| | - Rebecca A Fetrow
- Department of Communication Sciences and Disorders, The University of Utah, Salt Lake City
| | - Ray M Merrill
- Department of Health Science, Brigham Young University, Provo, UT
| | - Christopher Dromey
- Department of Communication Disorders, Brigham Young University, Provo, UT
| |
Collapse
|
29
|
Kunduk M, Ikuma T, Blouin DC, McWhorter AJ. Effects of Volume, Pitch, and Phonation Type on Oscillation Initiation and Termination Phases Investigated With High-speed Videoendoscopy. J Voice 2016; 31:313-322. [PMID: 27671752 DOI: 10.1016/j.jvoice.2016.08.016] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2016] [Revised: 08/21/2016] [Accepted: 08/23/2016] [Indexed: 11/29/2022]
Abstract
OBJECTIVES This study aimed to investigate the effects of varying volume, pitch, and phonation types on the initiation and termination phases of vocal fold oscillation using high-speed digital videoendoscopy. Specifically, it addressed the effects of the variation of volume, pitch, and phonation type (normal, pressed, and breathy) on the transient duration of the vibrating glottal length (length transient duration, Tlen), the transient duration of the glottal area waveform (area transient duration, Tarea), the time offset between the beginning (or the end) of the full-length vibration and the full-amplitude vibration, TΔ, and the variation of the fundamental frequency during the vocal fold oscillation initiation and termination segments (pitch instability, %PI). METHODS A female subject with no voice problem produced voices with varying pitch and loudness, including comfortable pitch and comfortable loudness, normal pitch loud, high pitch and comfortable loudness, and high pitch and loud. Breathy and pressed phonations were also recorded. Each of the six phonation types was recorded six times, which resulted in 72 transient segments (each recording included both initiation and termination phases). Mixed model statistical analyses were employed to the five objective high-speed digital videoendoscopy parameters. RESULTS Preliminary findings demonstrated significant findings for voice type effects for the length and area transient durations for the oscillation initiation segment but not for the oscillation termination segment. CONCLUSIONS This study demonstrates that voice types appear to influence vibration initiation patterns more than the vibration termination patterns.
Collapse
Affiliation(s)
- Melda Kunduk
- The Department of Communication Sciences and Disorders, Louisiana State University, Baton Rouge, Louisiana.
| | - Takeshi Ikuma
- Department of Otolaryngology-Head and Neck Surgery, Louisiana State University Health Sciences Center, New Orleans, Louisiana
| | - David C Blouin
- Department of Experimental Statistics, Louisiana State University, Baton Rouge, Louisiana
| | - Andrew J McWhorter
- Department of Otolaryngology-Head and Neck Surgery, Louisiana State University Health Sciences Center, New Orleans, Louisiana
| |
Collapse
|
30
|
Ikuma T, Kunduk M, Fink D, McWhorter AJ. A Spatiotemporal Approach to the Objective Analysis of Initiation and Termination of Vocal-fold Oscillation With High-speed Videoendoscopy. J Voice 2015; 30:756.e21-756.e30. [PMID: 26654851 DOI: 10.1016/j.jvoice.2015.09.007] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2015] [Accepted: 09/15/2015] [Indexed: 11/26/2022]
Abstract
High-speed videoendoscopy excels in the ability to observe the vocal-fold oscillatory patterns during voice initiation and termination. The initial and most critical step in the analysis of these transient regions is to identify the locations of these transient periods, that is, determining when the vocal-fold oscillation is absent and when the oscillation has reached its steady-state behavior. The latter is more challenging as the "steady" oscillation during sustained phonation is not truly steady and is expected to vary over time. This variation may cause unreliable identification of the transient periods, possibly resulting in less accurate or less reliable parameter measurements. An oscillation feature that is relatively consistent in the steady state is the glottal length, that is, the extent of the oscillation along vocal folds. This paper proposes an autonomous algorithm to estimate the vocal-fold oscillation length and its use to detect four transient events: oscillation onset and offset, and attainment and loss of full-length oscillation. The detected event markers are intended to be used to improve the transient parameter measurements. The autonomous algorithm manipulates the set of glottal width waveforms spatiotemporally to estimate the oscillation length. Examples with in vivo high-speed videoendoscopy recordings of both normal and pathological cases are included to show the efficacy of the proposed algorithm to identify the transient markers.
Collapse
Affiliation(s)
- Takeshi Ikuma
- Department of Otolaryngology-Head and Neck Surgery, Louisiana State University Health Sciences Center, 533 Bolivar Street, New Orleans, Louisiana 70112.
| | - Melda Kunduk
- Department of Otolaryngology-Head and Neck Surgery, Louisiana State University Health Sciences Center, 533 Bolivar Street, New Orleans, Louisiana 70112; Department of Communication Sciences and Disorders, Louisiana State University, 64 Hatcher Hall, Baton Rouge, Louisiana 70803; Our Lady of the Lake Regional Medical Center-Voice Center, 4950 Essen Lane, Suite B, Baton Rouge, Louisiana 70808
| | - Daniel Fink
- Department of Otolaryngology-Head and Neck Surgery, Louisiana State University Health Sciences Center, 533 Bolivar Street, New Orleans, Louisiana 70112; Our Lady of the Lake Regional Medical Center-Voice Center, 4950 Essen Lane, Suite B, Baton Rouge, Louisiana 70808
| | - Andrew J McWhorter
- Department of Otolaryngology-Head and Neck Surgery, Louisiana State University Health Sciences Center, 533 Bolivar Street, New Orleans, Louisiana 70112; Our Lady of the Lake Regional Medical Center-Voice Center, 4950 Essen Lane, Suite B, Baton Rouge, Louisiana 70808
| |
Collapse
|
31
|
Lien YAS, Calabrese CR, Michener CM, Murray EH, Van Stan JH, Mehta DD, Hillman RE, Noordzij JP, Stepp CE. Voice Relative Fundamental Frequency Via Neck-Skin Acceleration in Individuals With Voice Disorders. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2015; 58:1482-7. [PMID: 26134171 PMCID: PMC4686308 DOI: 10.1044/2015_jslhr-s-15-0126] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/04/2015] [Accepted: 06/25/2015] [Indexed: 05/20/2023]
Abstract
PURPOSE This study investigated the use of neck-skin acceleration for relative fundamental frequency (RFF) analysis. METHOD Forty individuals with voice disorders associated with vocal hyperfunction and 20 age- and sex-matched control participants were recorded with a subglottal neck-surface accelerometer and a microphone while producing speech stimuli appropriate for RFF. Rater reliabilities, RFF means, and RFF standard deviations derived from the accelerometer were compared with those derived from the microphone. RESULTS RFF estimated from the accelerometer had slightly higher intrarater reliability and identical interrater reliability compared with values estimated with the microphone. Although sensor type and the Vocal Cycle × Sensor and Vocal Cycle × Sensor × Group interactions showed significant effects on RFF means, the typical RFF pattern could be derived from either sensor. For both sensors, the RFF of individuals with vocal hyperfunction was lower than that of the controls. Sensor type and its interactions did not have significant effects on RFF standard deviations. CONCLUSIONS RFF can be reliably estimated using an accelerometer, but these values cannot be compared with those collected via microphone. Future studies are needed to determine the physiological basis of RFF and examine the effect of sensors on RFF in practical voice assessment and monitoring settings.
Collapse
Affiliation(s)
| | | | | | | | - Jarrad H. Van Stan
- MGH Institute of Health Professions, Boston, MA
- Center for Laryngeal Surgery & Voice Rehabilitation, Massachusetts General Hospital, Boston
| | - Daryush D. Mehta
- MGH Institute of Health Professions, Boston, MA
- Center for Laryngeal Surgery & Voice Rehabilitation, Massachusetts General Hospital, Boston
- Harvard Medical School, Cambridge, MA
| | - Robert E. Hillman
- MGH Institute of Health Professions, Boston, MA
- Center for Laryngeal Surgery & Voice Rehabilitation, Massachusetts General Hospital, Boston
- Harvard Medical School, Cambridge, MA
| | - J. Pieter Noordzij
- Boston University, Boston, MA
- Boston University School of Medicine, Boston, MA
| | - Cara E. Stepp
- Boston University, Boston, MA
- Boston University School of Medicine, Boston, MA
| |
Collapse
|
32
|
Characteristic laryngoscopic findings in Parkinson’s disease patients after subthalamic nucleus deep brain stimulation and its correlation with voice disorder. J Neural Transm (Vienna) 2015; 122:1663-72. [DOI: 10.1007/s00702-015-1436-y] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2015] [Accepted: 07/29/2015] [Indexed: 10/23/2022]
|
33
|
Characterizing gait asymmetry via frequency sub-band components of the ground reaction force. Biomed Signal Process Control 2015. [DOI: 10.1016/j.bspc.2014.11.008] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
|
34
|
Eklund E, Qvist J, Sandström L, Viklund F, Van Doorn J, Karlsson F. Perceived articulatory precision in patients with Parkinson's disease after deep brain stimulation of subthalamic nucleus and caudal zona incerta. CLINICAL LINGUISTICS & PHONETICS 2015; 29:150-166. [PMID: 25333411 DOI: 10.3109/02699206.2014.971192] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/04/2023]
Abstract
The effect of deep brain stimulation (DBS) of the subthalamic nucleus (STN) and caudal zona incerta (cZi) on speech articulation in patients with Parkinson's disease (PD) was investigated. Read speech samples were collected from nine patients with STN-DBS and 10 with cZi-DBS. The recordings were made pre-operatively and 12 months post-operatively with stimulator on and off (on medication). Blinded, randomised, repeated perceptual assessments were performed on words and isolated fricatives extracted from the recordings to assess (1) overall articulatory quality ratings, (2) frequency of occurrence of misarticulation patterns and (3) fricative production. Statistically significant worsening of articulatory measures on- compared with off-stimulation occurred in the cZi-DBS group, with deteriorated articulatory precision ratings, increased presence of misarticulations (predominately altered realisations of plosives and fricatives) and a reduced accuracy in fricative production. A similar, but not significant, trend was found for the STN-DBS group.
Collapse
Affiliation(s)
- Elisabeth Eklund
- Department of Clinical Sciences, Division of Speech and Language Pathology and
| | | | | | | | | | | |
Collapse
|
35
|
Lien YAS, Gattuccio CI, Stepp CE. Effects of phonetic context on relative fundamental frequency. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2014; 57:1259-67. [PMID: 24686466 PMCID: PMC4119098 DOI: 10.1044/2014_jslhr-s-13-0158] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/20/2023]
Abstract
PURPOSE The effect of phonetic context on relative fundamental frequency (RFF) was examined, in order to develop stimuli sets with minimal within-speaker variability that can be implemented in future clinical protocols. METHOD Sixteen speakers with healthy voices produced RFF stimuli. Uniform utterances consisted of 3 repetitions of the same voiced sonorant-voiceless consonant-voiced sonorant speech sequence; moderately variable sentences contained speech sequences with a single voiceless phoneme (/f/, /s/, /ʃ/, /p/, /t/, or /k/); highly variable sentences were loaded with speech sequences using multiple phonemes. Effects of stimulus type (uniform, moderately variable, and highly variable) and phoneme identity (/f/, /s/, /ʃ/, /p/, /t/, and /k/) on RFF means and standard deviations were determined. RESULTS Stimulus type and the interaction of vocal cycle and stimulus type were significant for RFF means and standard deviations but with small effect sizes. Phoneme identity and the interaction of vocal cycle and phoneme identity on RFF means and standard deviations were also significant with small to medium effect sizes. CONCLUSIONS For speakers with healthy voices, uniform utterances with /f/ and /ʃ/ have the lowest standard deviations and thus are recommended for RFF-based assessments. Future work is necessary to extend these findings to disordered voices.
Collapse
|
36
|
Lien YAS, Stepp CE. Comparison of voice relative fundamental frequency estimates derived from an accelerometer signal and low-pass filtered and unprocessed microphone signals. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2014; 135:2977-85. [PMID: 24815277 PMCID: PMC4032403 DOI: 10.1121/1.4870488] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/12/2023]
Abstract
The relative fundamental frequency (RFF) surrounding the production of a voiceless consonant has previously been estimated using unprocessed and low-pass filtered microphone signals, but it can also be estimated using a neck-placed accelerometer signal that is less affected by vocal tract formants. Determining the effects of signal type on RFF will allow for comparisons across studies and aid in establishing a standard protocol with minimal within-speaker variability. Here RFF was estimated in 12 speakers with healthy voices using unprocessed microphone, low-pass filtered microphone, and unprocessed accelerometer signals. Unprocessed microphone and accelerometer signals were recorded simultaneously using a microphone and neck-placed accelerometer. The unprocessed microphone signal was filtered at 350 Hz to construct the low-pass filtered microphone signal. Analyses of variance showed that signal type and the interaction of vocal cycle × signal type had significant effects on both RFF means and standard deviations, but with small effect sizes. The overall RFF trend was preserved regardless of signal type and the intra-speaker variability of RFF was similar among the signal types. Thus, RFF can be estimated using either a microphone or an accelerometer signal in individuals with healthy voices. Future work extending these findings to individuals with disordered voices is warranted.
Collapse
Affiliation(s)
- Yu-An S Lien
- Department of Biomedical Engineering, Boston University, Boston, Massachusetts 02215
| | - Cara E Stepp
- Departments of Speech, Language, and Hearing Sciences and Biomedical Engineering, Boston University, Boston, Massachusetts 02215
| |
Collapse
|