1
Frankford SA, Estrada A, Stepp CE. Contributions of Speech Timing and Articulatory Precision to Listener Perceptions of Intelligibility and Naturalness in Parkinson's Disease. Journal of Speech, Language, and Hearing Research: JSLHR 2024:1-13. [PMID: 39116309 DOI: 10.1044/2024_jslhr-23-00802] [Indexed: 08/10/2024]
Abstract
PURPOSE Parkinson's disease (PD) results in hypokinetic dysarthria in as many as 90% of cases. Among the most distinctive features of hypokinetic dysarthria are atypical timing and articulatory imprecision in speech production. Here, we examined the contributions of perceived speech timing typicality and articulatory precision, both on their own and while controlling for the other, on intelligibility and naturalness in speakers with PD. METHOD Twenty speakers with PD and four healthy older adults read aloud the first paragraph of the Rainbow Passage. Twenty inexperienced listeners with typical hearing listened to these recordings and rated intelligibility, naturalness, timing typicality, and articulatory precision using separate visual analog scales. Ratings were averaged across listeners and entered into linear regression models with intelligibility and naturalness as dependent variables and timing typicality and articulatory precision as independent variables in each. RESULTS Articulatory precision, but not timing typicality, was positively correlated with intelligibility on its own, but neither was associated with intelligibility after accounting for the other. Both timing typicality and articulatory precision were positively correlated with naturalness on their own as well as after controlling for the other variable. CONCLUSION These results contribute to the overall understanding of speech factors associated with intelligibility and naturalness in speakers with PD and indicate that considering the unique contributions of related perceptual constructs may provide more information than bivariate relationships alone.
2
Moya-Galé G, Hernández JD, Goudarzi A, Walsh SJ. ¿Notas La Diferencia? [Do You Hear the Difference?]: Perceptual Consequences of Intensive Voice Treatment in Spanish Speakers With Parkinson's Disease. Journal of Speech, Language, and Hearing Research: JSLHR 2024:1-21. [PMID: 38513128 DOI: 10.1044/2023_jslhr-23-00379] [Indexed: 03/23/2024]
Abstract
PURPOSE The primary objective of this study was to explore the effects of intensive voice-focused treatment on speech parameters in Spanish speakers with dysarthria associated with Parkinson's disease (PD) as perceived by naïve listeners. METHOD Fifteen Spanish speakers with dysarthria associated with PD received the Lee Silverman Voice Treatment (LSVT LOUD) for a month. Voice and speech recordings were conducted pretreatment, posttreatment, and at a 1-month follow-up. Thirty naïve adult listeners rated the perceptual dimensions of ease of understanding (EoU), resonance, articulatory precision, prosody, and voice quality from sentences extracted from an emotional monologue on a visual analogue scale. RESULTS EoU, resonance, articulatory precision, and voice quality significantly improved pre- to posttreatment, but gains were not maintained at follow-up. Speech severity was a significant source of variance in mean listener response for all perceptual dimensions, although the interaction between speech severity and time was only significant for resonance and voice quality. CONCLUSIONS LSVT LOUD may be beneficial to improve perceptual speech domains affected by PD in Spanish speakers with dysarthria. Its impact on the different speech subsystems may reflect a universal distribution of effects when directly targeting the glottal source. Language-specific contributions of each perceptual domain to speech intelligibility should be explored in further research to determine linguistically sensitive treatment targets.
3
Rowe HP, Stipancic KL, Campbell TF, Yunusova Y, Green JR. The association between longitudinal declines in speech sound accuracy and speech intelligibility in speakers with amyotrophic lateral sclerosis. Clinical Linguistics & Phonetics 2024; 38:227-248. [PMID: 37122073 PMCID: PMC10613582 DOI: 10.1080/02699206.2023.2202297] [Received: 10/11/2022] [Revised: 04/01/2023] [Accepted: 04/03/2023] [Indexed: 05/27/2023]
Abstract
The purpose of this study was to examine how neurodegeneration secondary to amyotrophic lateral sclerosis (ALS) impacts speech sound accuracy over time and how speech sound accuracy, in turn, is related to speech intelligibility. Twenty-one participants with ALS read the Bamboo Passage over multiple data collection sessions across several months. Phonemic and orthographic transcriptions were completed for all speech samples. The percentage of phonemes accurately produced was calculated across each phoneme, sound class (i.e. consonants versus vowels), and distinctive feature (i.e. features involved in Manner of Articulation, Place of Articulation, Laryngeal Voicing, Tongue Height, and Tongue Advancement). Intelligibility was determined by calculating the percentage of words correctly transcribed orthographically by naive listeners. Linear mixed effects models were conducted to assess the decline of each distinctive feature over time and its impact on intelligibility. The results demonstrated that overall phonemic production accuracy had a nonlinear relationship with speech intelligibility and that a subset of features (i.e. those dependent on precise lingual and labial constriction and/or extensive lingual and labial movement) were more important for intelligibility and were more impacted over time than other features. Furthermore, findings revealed that consonants were more strongly associated with intelligibility than vowels, but consonants did not significantly differ from vowels in their decline over time. These findings have the potential to (1) strengthen mechanistic understanding of the physiological constraints imposed by neuronal degeneration on speech production and (2) inform the timing and selection of treatment and assessment targets for individuals with ALS.
Affiliation(s)
- Hannah P Rowe
  - Department of Rehabilitation Sciences, MGH Institute of Health Professions, Boston, Massachusetts, USA
- Kaila L Stipancic
  - Department of Communicative Disorders and Sciences, The State University of New York, Buffalo, New York, USA
- Thomas F Campbell
  - Callier Center for Communication Disorders, University of Texas, Dallas, Texas, USA
- Yana Yunusova
  - Department of Speech-Language Pathology and Rehabilitation Sciences Institute, University of Toronto, Toronto, Ontario, Canada
  - Hurvitz Brain Sciences Research Program, Sunnybrook Research Institute, Toronto, Ontario, Canada
  - KITE Research Center, Toronto Rehabilitation Institute, Toronto, Ontario, Canada
- Jordan R Green
  - Department of Rehabilitation Sciences, MGH Institute of Health Professions, Boston, Massachusetts, USA
4
Rong P, Heidrick L. Hierarchical Temporal Structuring of Speech: A Multiscale, Multimodal Framework to Inform the Assessment and Management of Neuromotor Speech Disorder. Journal of Speech, Language, and Hearing Research: JSLHR 2024; 67:92-115. [PMID: 38099851 DOI: 10.1044/2023_jslhr-23-00219] [Indexed: 01/10/2024]
Abstract
PURPOSE Hierarchical temporal structuring of speech is the key to multiscale linguistic information transfer toward effective communication. This study investigated and linked the hierarchical temporal cues of the kinematic and acoustic modalities of natural, unscripted speech in neurologically healthy and impaired speakers. METHOD Thirteen individuals with amyotrophic lateral sclerosis (ALS) and 10 age-matched healthy controls performed a story-telling task. The hierarchical temporal structure of the speech stimulus was measured by (a) 26 articulatory-kinematic features characterizing the depth, phase synchronization, and coherence of temporal modulation of the tongue tip, tongue body, lower lip, and jaw, at three hierarchically nested timescales corresponding to prosodic stress, syllables, and onset-rime/phonemes, and (b) 25 acoustic features characterizing the parallel aspects of temporal modulation of five critical-spectral-band envelopes. All features were compared between groups. For each aspect of temporal modulation, the contributions of all articulatory features to the parallel acoustic features were evaluated by group. RESULTS Generally consistent disease impacts were identified on the articulatory and acoustic features, manifested by reduced modulation depths of most articulators and critical-spectral-band envelopes, primarily at the timescales of syllables and onset-rime/phonemes. For healthy speakers, the strongest articulatory-acoustic relationships were found for (a) jaw and lip, in modulating stress timing, and (b) tongue tip, in modulating the timing relation between onset-rime/phonemes and syllables. For speakers with ALS, the tongue body, tongue tip, and jaw all showed the greatest contributions to modulating syllable timing. CONCLUSIONS The observed disease impacts likely reflect reduced entrainment of speech motor activities to finer-grained linguistic events, presumably due to the dynamic constraints of the neuromuscular system. To accommodate these restrictions, speakers with ALS appear to use their residual articulatory motor capacities to accentuate and convey the perceptually most salient temporal cues underpinned by the syllable-centric parsing mechanism. This adaptive strategy has potential implications in managing neuromotor speech disorders.
Affiliation(s)
- Panying Rong
  - Department of Speech-Language-Hearing: Sciences & Disorders, The University of Kansas, Lawrence
- Lindsey Heidrick
  - Department of Hearing and Speech, The University of Kansas Medical Center, Kansas City
5
Coy AG, Whitfield JA. Effect of clear speech on acoustic measures of dysprosody in Parkinson disease for different reading tasks. International Journal of Speech-Language Pathology 2023:1-14. [PMID: 37668056 DOI: 10.1080/17549507.2023.2240041] [Indexed: 09/06/2023]
Abstract
PURPOSE The purpose of the study was to determine the effect of clear speech instruction on acoustic measures of dysprosody between reading passages of differing linguistic content for speakers with and without Parkinson Disease (PD). METHOD Ten speakers with PD and 10 controls served as participants and read five simple and three standard reading stimuli twice. First, speakers read habitually and then following clear speech instruction. Acoustic measures of fundamental frequency variation (semitone standard deviation, STSD), articulation rate, and between-complex pause durations were calculated. RESULT Results indicated speakers with PD exhibited less fundamental frequency variation than controls across reading stimuli and instructions. All speakers exhibited lower STSD and longer between-complex pause durations for the standard compared to simple reading stimuli. For clear speech, all speakers reduced articulation rate and increased between-complex pause durations in both simple and standard reading stimuli. However, speakers with PD exhibited a significantly less robust reduction in articulation rate for clear speech than control speakers for all reading stimuli. CONCLUSION Linguistic content of reading stimuli contributes to differences in fundamental frequency variation and pause duration for all speakers. All speakers reduced articulation rate for clear speech compared to habitual instruction, but speakers with PD did so to a lesser extent than controls. Linguistic content of reading stimuli to examine dysprosody in PD should be considered for clinical application.
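The semitone standard deviation (STSD) named in this abstract is conventionally computed by converting each voiced fundamental frequency sample to semitones relative to a reference frequency and taking the standard deviation of the result. The following is a minimal sketch under that assumption; the function name `semitone_sd`, the 100 Hz default reference, and the convention that unvoiced frames are coded as 0 are illustrative choices, not details taken from the study.

```python
import math
import statistics


def semitone_sd(f0_hz, ref_hz=100.0):
    """Sample standard deviation of an f0 track expressed in semitones.

    f0_hz  : iterable of f0 values in Hz; values <= 0 are treated as
             unvoiced frames and skipped (an assumed convention).
    ref_hz : reference frequency for the semitone conversion (assumed).
    """
    voiced = [f for f in f0_hz if f > 0]
    if len(voiced) < 2:
        raise ValueError("need at least two voiced f0 samples")
    # 12 semitones per octave: st = 12 * log2(f / ref)
    semitones = [12.0 * math.log2(f / ref_hz) for f in voiced]
    return statistics.stdev(semitones)
```

Because changing the reference only shifts every semitone value by the same constant, the resulting standard deviation does not depend on `ref_hz`; a flatter pitch contour yields a smaller value.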
Affiliation(s)
- Anna G Coy
  - Division of Communication Sciences and Disorders, West Virginia University, Morgantown, WV, USA
- Jason A Whitfield
  - Department of Communication Sciences and Disorders, Bowling Green State University, Bowling Green, OH, USA
6
Stipancic KL, Wilding G, Tjaden K. Lexical Characteristics of the Speech Intelligibility Test: Effects on Transcription Intelligibility for Speakers With Multiple Sclerosis and Parkinson's Disease. Journal of Speech, Language, and Hearing Research: JSLHR 2023; 66:3115-3131. [PMID: 36931064 PMCID: PMC10555462 DOI: 10.1044/2023_jslhr-22-00279] [Received: 05/16/2022] [Revised: 09/19/2022] [Accepted: 01/01/2023] [Indexed: 06/18/2023]
Abstract
PURPOSE Lexical characteristics of speech stimuli can significantly impact intelligibility. However, lexical characteristics of the widely used Speech Intelligibility Test (SIT) are unknown. We aimed to (a) define variation in neighborhood density, word frequency, grammatical word class, and type-token ratio across a large corpus of SIT sentences and tests and (b) determine the relationship of lexical characteristics to speech intelligibility in speakers with multiple sclerosis (MS), Parkinson's disease (PD), and neurologically healthy controls. METHOD Using an extant database of 92 speakers (32 controls, 30 speakers with MS, and 30 speakers with PD), percent correct intelligibility scores were obtained for the SIT. Neighborhood density, word frequency, word class, and type-token ratio were calculated and summed for each of the 11 sentences of each SIT test. The distribution of each characteristic across SIT sentences and tests was examined. Linear mixed-effects models were performed to assess the relationship between intelligibility and the lexical characteristics. RESULTS There was large variability in the distribution of lexical characteristics across this large corpus of SIT sentences and tests. Modeling revealed a relationship between intelligibility and the lexical characteristics, with word frequency and word class significantly contributing to the model. CONCLUSIONS Three primary findings emerged: (a) There was considerable variability in lexical characteristics both within and across the large corpus of SIT tests; (b) there was not a robust association between intelligibility and the lexical characteristics; and (c) findings from a study demonstrating an effect of neighborhood density and word frequency on intelligibility were replicated. Clinical and research implications of the findings are discussed, and three exemplar SIT tests systematically controlling for neighborhood density and word frequency are provided.
Affiliation(s)
- Kaila L. Stipancic
  - Department of Communicative Disorders and Sciences, University at Buffalo, The State University of New York
- Gregory Wilding
  - Department of Biostatistics, University at Buffalo, The State University of New York
- Kris Tjaden
  - Department of Communicative Disorders and Sciences, University at Buffalo, The State University of New York
7
McAllister T, Nightingale C, Moya-Galé G, Kawamura A, Ramig LO. Crowdsourced Perceptual Ratings of Voice Quality in People With Parkinson's Disease Before and After Intensive Voice and Articulation Therapies: Secondary Outcome of a Randomized Controlled Trial. Journal of Speech, Language, and Hearing Research: JSLHR 2023; 66:1541-1562. [PMID: 37059078 PMCID: PMC10457080 DOI: 10.1044/2023_jslhr-22-00694] [Received: 12/06/2022] [Revised: 01/25/2023] [Accepted: 01/26/2023] [Indexed: 05/11/2023]
Abstract
PURPOSE Limited research has examined the suitability of crowdsourced ratings to measure treatment effects in speakers with Parkinson's disease (PD), particularly for constructs such as voice quality. This study obtained measures of reliability and validity for crowdsourced listeners' ratings of voice quality in speech samples from a published study. We also investigated whether aggregated listener ratings would replicate the original study's findings of treatment effects based on the Acoustic Voice Quality Index (AVQI) measure. METHOD This study reports a secondary outcome measure of a randomized controlled trial with speakers with dysarthria associated with PD, including two active comparators (Lee Silverman Voice Treatment [LSVT LOUD] and LSVT ARTIC), an inactive comparator (untreated PD), and a healthy control group. Speech samples from three time points (pretreatment, posttreatment, and 6-month follow-up) were presented in random order for rating as "typical" or "atypical" with respect to voice quality. Untrained listeners were recruited through the Amazon Mechanical Turk crowdsourcing platform until each sample had at least 25 ratings. RESULTS Intrarater reliability for tokens presented repeatedly was substantial (Cohen's κ = .65-.70), and interrater agreement significantly exceeded chance level. There was a significant correlation of moderate magnitude between the AVQI and the proportion of listeners classifying a given sample as "typical." Consistent with the original study, we found a significant interaction between group and time point, with the LSVT LOUD group alone showing significantly higher perceptually rated voice quality at posttreatment and follow-up relative to the pretreatment time point. CONCLUSIONS These results suggest that crowdsourcing can be a valid means to evaluate clinical speech samples, even for less familiar constructs such as voice quality. The findings also replicate the results of the study by Moya-Galé et al. (2022) and support their functional relevance by demonstrating that the effects of treatment measured acoustically in that study are perceptually apparent to everyday listeners.
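The intrarater reliability figure in this abstract is a Cohen's kappa, which corrects observed agreement for the agreement expected by chance: κ = (p_o − p_e) / (1 − p_e). The sketch below applies that standard formula to two rating passes over the same tokens. The function name `cohens_kappa` and the example labels are hypothetical; this is not the authors' analysis code, and the example data are made up purely for illustration.

```python
from collections import Counter


def cohens_kappa(first_pass, second_pass):
    """Cohen's kappa for two equally long sequences of categorical labels."""
    if len(first_pass) != len(second_pass) or not first_pass:
        raise ValueError("rating sequences must be non-empty and equal length")
    n = len(first_pass)
    # Observed agreement: proportion of tokens given the same label twice.
    observed = sum(a == b for a, b in zip(first_pass, second_pass)) / n
    # Chance agreement: product of the marginal label proportions, summed.
    c1, c2 = Counter(first_pass), Counter(second_pass)
    expected = sum(c1[label] * c2[label] for label in set(c1) | set(c2)) / (n * n)
    if expected == 1.0:
        raise ValueError("kappa is undefined when chance agreement is 1")
    return (observed - expected) / (1.0 - expected)


# Hypothetical example: one listener rates the same four tokens twice.
first = ["typical", "typical", "atypical", "atypical"]
second = ["typical", "atypical", "atypical", "atypical"]
kappa = cohens_kappa(first, second)  # 0.5 for this made-up data
```

Here the listener agrees with themselves on three of four tokens (p_o = 0.75), chance agreement from the marginals is 0.5, and kappa is therefore 0.5.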
Affiliation(s)
- Lorraine Olson Ramig
  - University of Colorado Boulder
  - National Center for Voice and Speech, Denver, CO
  - Columbia University, New York, NY
  - LSVT Global, Inc., Tucson, AZ
8
Stipancic KL, van Brenk F, Kain A, Wilding G, Tjaden K. Clear Speech Variants: An Investigation of Intelligibility and Speaker Effort in Speakers With Parkinson's Disease. American Journal of Speech-Language Pathology 2022; 31:2789-2805. [PMID: 36327495 PMCID: PMC9911091 DOI: 10.1044/2022_ajslp-22-00189] [Received: 06/13/2022] [Revised: 08/02/2022] [Accepted: 08/04/2022] [Indexed: 05/03/2023]
Abstract
PURPOSE This study investigated the effects of three clear speech variants on sentence intelligibility and speaking effort for speakers with Parkinson's disease (PD) and age- and sex-matched neurologically healthy controls. METHOD Fourteen speakers with PD and 14 neurologically healthy speakers participated. Each speaker was recorded reading 18 sentences from the Speech Intelligibility Test in their habitual speaking style and for three clear speech variants: clear (SC; given instructions to speak clearly), hearing impaired (HI; given instructions to speak with someone with a hearing impairment), and overenunciate (OE; given instructions to overenunciate each word). Speakers rated the amount of physical and mental effort exerted during each speaking condition using visual analog scales (averaged to yield a metric of overall speaking effort). Sentence productions were orthographically transcribed by 50 naive listeners. Linear mixed-effects models were used to compare intelligibility and speaking effort across the clear speech variants. RESULTS Intelligibility was reduced for the PD group in comparison to the control group only in the habitual condition. All clear speech variants significantly improved intelligibility above habitual levels for the PD group, with OE maximizing intelligibility, followed by the SC and HI conditions. Both groups rated speaking effort to be significantly higher for both the OE and HI conditions versus the SC and habitual conditions. DISCUSSION For speakers with PD, all clear speech variants increased intelligibility to a level comparable to that of healthy controls. All clear speech variants were also associated with higher levels of speaking effort than habitual speech for the speakers with PD. Clinically, findings suggest that clear speech training programs consider using the instruction "overenunciate" for maximizing intelligibility. Future research is needed to identify if high levels of speaking effort elicited by the clear speech variants affect long-term sustainability of the intelligibility benefit.
Affiliation(s)
- Kaila L. Stipancic
  - Department of Communicative Disorders and Sciences, University at Buffalo, NY
- Frits van Brenk
  - Department of Communicative Disorders and Sciences, University at Buffalo, NY
  - Utrecht Institute of Linguistics OTS, Utrecht University, the Netherlands
- Alexander Kain
  - Department of Pediatrics, Oregon Health & Science University, Portland
- Kris Tjaden
  - Department of Communicative Disorders and Sciences, University at Buffalo, NY