1
Hyppa-Martin J, Lilley J, Chen M, Friese J, Schmidt C, Bunnell HT. A large-scale comparison of two voice synthesis techniques on intelligibility, naturalness, preferences, and attitudes toward voices banked by individuals with amyotrophic lateral sclerosis. Augment Altern Commun 2024; 40:31-45. [PMID: 37791834 DOI: 10.1080/07434618.2023.2262032] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/05/2022] [Accepted: 09/12/2023] [Indexed: 10/05/2023]
Abstract
Amyotrophic lateral sclerosis (ALS) commonly results in the inability to produce natural speech, making speech-generating devices (SGDs) important. Historically, synthetic voices generated by SGDs were neither unique, nor age- or dialect-appropriate, which depersonalized SGD use. Voices generated by SGDs can now be customized via voice banking and should ideally sound uniquely like the individual's natural speech, be intelligible, and elicit positive reactions from communication partners. This large-scale 2 x 2 mixed between- and within-participants design examined perceptions of 831 adult listeners regarding custom synthetic voices created for two individuals diagnosed with ALS via two synthesis systems in common clinical use (waveform concatenation and statistical parametric synthesis). The study explored relationships among synthesis system, dysarthria severity, synthetic speech intelligibility, naturalness, and preferences, and also provided a preliminary examination of attitudes regarding the custom synthetic voices. Synthetic voices generated via statistical parametric synthesis trained on deep neural networks were more intelligible, natural, and preferred than voices produced via waveform concatenation, and were associated with more positive attitudes. The custom synthetic voice created from moderately dysarthric speech was more intelligible than the voice created from mildly dysarthric speech. Clinical implications and factors that may have contributed to the relative intelligibilities are discussed.
Affiliation(s)
- Jolene Hyppa-Martin
  - Department of Communication Sciences and Disorders, University of Minnesota Duluth, Duluth, MN, USA
- Mo Chen
  - National Institute of Education, Nanyang Technological University, Singapore, Singapore
- Jaclyn Friese
  - Department of Communication Sciences and Disorders, University of Minnesota Duluth, Duluth, MN, USA
- Corinne Schmidt
  - Department of Communication Sciences and Disorders, University of Minnesota Duluth, Duluth, MN, USA

2
Borrie SA, Hepworth TJ, Wynn CJ, Hustad KC, Barrett TS, Lansford KL. Perceptual Learning of Dysarthria in Adolescence. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2023; 66:3791-3803. [PMID: 37616225 PMCID: PMC10713018 DOI: 10.1044/2023_jslhr-23-00231] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/06/2023] [Revised: 05/28/2023] [Accepted: 06/20/2023] [Indexed: 08/26/2023]
Abstract
PURPOSE As evidenced by perceptual learning studies involving adult listeners and speakers with dysarthria, adaptation to dysarthric speech is driven by signal predictability (speaker property) and a flexible speech perception system (listener property). Here, we extend adaptation investigations to adolescent populations and examine whether adult and adolescent listeners can learn to better understand an adolescent speaker with dysarthria. METHOD Classified by developmental stage, adult (n = 42) and adolescent (n = 40) listeners completed a three-phase perceptual learning protocol (pretest, familiarization, and posttest). During pretest and posttest, all listeners transcribed speech produced by a 13-year-old adolescent with spastic dysarthria associated with cerebral palsy. During familiarization, half of the adult and adolescent listeners engaged in structured familiarization (audio and lexical feedback) with the speech of the adolescent speaker with dysarthria; and the other half, with the speech of a neurotypical adolescent speaker (control). RESULTS Intelligibility scores increased from pretest to posttest for all listeners. However, listeners who received dysarthria familiarization achieved greater intelligibility improvements than those who received control familiarization. Furthermore, there was a significant effect of developmental stage, where the adults achieved greater intelligibility improvements relative to the adolescents. CONCLUSIONS This study provides the first tranche of evidence that adolescent dysarthric speech is learnable, a finding that holds even for adolescent listeners whose speech perception systems are not yet fully developed. Given the formative role that social interactions play during adolescence, these findings of improved intelligibility afford important clinical implications.
Affiliation(s)
- Stephanie A. Borrie
  - Department of Communicative Disorders and Deaf Education, Utah State University, Logan
- Taylor J. Hepworth
  - Department of Communicative Disorders and Deaf Education, Utah State University, Logan
- Camille J. Wynn
  - Department of Communication Science and Disorders, University of Houston
- Katherine C. Hustad
  - Waisman Center, University of Wisconsin–Madison
  - Department of Communication Sciences and Disorders, University of Wisconsin–Madison
- Kaitlin L. Lansford
  - Department of Communication Science and Disorders, Florida State University, Tallahassee

3
Yoho SE, Barrett TS, Borrie SA. The Influence of Sensorineural Hearing Loss on the Relationship Between the Perception of Speech in Noise and Dysarthric Speech. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2023; 66:4025-4036. [PMID: 37652059 PMCID: PMC10713019 DOI: 10.1044/2023_jslhr-23-00115] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/14/2023] [Revised: 04/25/2023] [Accepted: 06/14/2023] [Indexed: 09/02/2023]
Abstract
PURPOSE The ability to understand speech under adverse listening conditions is highly variable across listeners. Despite this, studies have found that listeners with normal hearing display consistency in their ability to perceive speech across different types of degraded speech, suggesting that, for at least these listeners, global skills may be involved in navigating the ambiguity in speech signals. However, there are substantial differences in the perceptual challenges faced by listeners with normal and impaired hearing. This study examines whether listeners with sensorineural hearing loss demonstrate the same type of consistency as normal-hearing listeners when processing neurotypical (i.e., control) speech that has been degraded by external noise and speech that is neurologically degraded such as dysarthria. METHOD Listeners with normal hearing (n = 31) and listeners with sensorineural hearing loss (n = 36) completed an intelligibility task with neurotypical speech in noise and with dysarthric speech in quiet. RESULTS Findings were consistent with previous work demonstrating a relationship between the ability to perceive neurotypical speech in noise and dysarthric speech for listeners with normal hearing, albeit at a higher intelligibility level than previously observed. This relationship was also observed for listeners with hearing loss, although listeners with more severe hearing losses performed better with dysarthric speech than with neurotypical speech in noise. CONCLUSIONS This study demonstrated a high level of consistency in intelligibility performance for listeners across two different types of degraded speech, even when those listeners were further challenged by the presence of sensorineural hearing loss. Clinical implications for both listeners with hearing loss and their communication partners with dysarthria are discussed.
Affiliation(s)
- Sarah E. Yoho
  - Department of Communicative Disorders and Deaf Education, Utah State University, Logan
  - Department of Speech and Hearing Science, The Ohio State University, Columbus
- Stephanie A. Borrie
  - Department of Communicative Disorders and Deaf Education, Utah State University, Logan

4
Wolfrum V, Lehner K, Heim S, Ziegler W. Clinical Assessment of Communication-Related Speech Parameters in Dysarthria: The Impact of Perceptual Adaptation. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2023:1-21. [PMID: 37486782 DOI: 10.1044/2023_jslhr-23-00105] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Indexed: 07/26/2023]
Abstract
PURPOSE In current clinical practice, intelligibility of dysarthric speech is commonly assessed by speech-language therapists (SLTs), in most cases by the therapist caring for the patient being diagnosed. Since SLTs are familiar with dysarthria in general and with the speech of the individual patient to be assessed in particular, they have an adaptation advantage in understanding the patient's utterances. We examined whether and how listeners' assessments of communication-related speech parameters vary as a function of their familiarity with dysarthria in general and with the diagnosed patients in particular. METHOD Intelligibility, speech naturalness, and perceived listener effort were assessed in 20 persons with dysarthria (PWD). Patients' speech samples were judged by the individual treating therapists, five dysarthria experts who were unfamiliar with the patients, and crowdsourced naïve listeners. Adaptation effects were analyzed using (a) linear mixed models of overall scoring levels, (b) regression models of severity dependence, (c) network analyses of between-listener and between-parameter relationships, and (d) measures of intra- and interobserver consistency. RESULTS Significant advantages of dysarthria experts over laypeople were found in all parameters. An overall advantage of the treating therapists over nonfamiliar experts was only seen in listening effort. Severity-dependent adaptation effects occurred in all parameters. The therapists' responses were heterogeneous and inconsistent with those of the unfamiliar experts and the naïve listeners. CONCLUSIONS The way SLTs evaluate communication-relevant speech parameters of the PWD whom they care for is influenced not only by adaptation benefits but also by therapeutic biases. This finding weakens the validity of assessments of communication-relevant speech parameters by the treating therapists themselves and encourages the development and use of alternative methods.
Affiliation(s)
- Vera Wolfrum
  - Department of Neurology, Faculty of Medicine, RWTH Aachen University, Germany
- Katharina Lehner
  - Clinical Neuropsychology Research Group, Institute for Phonetics and Speech Processing, Ludwig Maximilian University of Munich, Germany
- Stefan Heim
  - Department of Psychiatry, Psychotherapy, and Psychosomatics, Faculty of Medicine, RWTH Aachen University, Germany
  - Research Center Jülich, Institute of Neurosciences and Medicine (INM-1), Germany
  - JARA - Translational Brain Medicine, Aachen, Germany
- Wolfram Ziegler
  - Clinical Neuropsychology Research Group, Institute for Phonetics and Speech Processing, Ludwig Maximilian University of Munich, Germany

5
Borrie SA, Yoho SE, Healy EW, Barrett TS. The Application of Time-Frequency Masking To Improve Intelligibility of Dysarthric Speech in Background Noise. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2023; 66:1853-1866. [PMID: 36944186 PMCID: PMC10457087 DOI: 10.1044/2023_jslhr-22-00558] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/24/2022] [Revised: 12/13/2022] [Accepted: 01/10/2023] [Indexed: 05/11/2023]
Abstract
PURPOSE Background noise reduces speech intelligibility. Time-frequency (T-F) masking is an established signal processing technique that improves intelligibility of neurotypical speech in background noise. Here, we investigated a novel application of T-F masking, assessing its potential to improve intelligibility of neurologically degraded speech in background noise. METHOD Listener participants (N = 422) completed an intelligibility task either in the laboratory or online, listening to and transcribing audio recordings of neurotypical (control) and neurologically degraded (dysarthria) speech under three different processing types: speech in quiet (quiet), speech mixed with cafeteria noise (noise), and speech mixed with cafeteria noise and then subsequently processed by an ideal quantized mask (IQM) to remove the noise. RESULTS We observed significant reductions in intelligibility of dysarthric speech, even at highly favorable signal-to-noise ratios (+11 to +23 dB) that did not impact neurotypical speech. We also observed significant intelligibility improvements from speech in noise to IQM-processed speech for both control and dysarthric speech across a wide range of noise levels. Furthermore, the overall benefit of IQM processing for dysarthric speech was comparable with that of the control speech in background noise, as was the intelligibility data collected in the laboratory versus online. CONCLUSIONS This study demonstrates proof of concept, validating the application of T-F masks to a neurologically degraded speech signal. Given that intelligibility challenges greatly impact communication, and thus the lives of people with dysarthria and their communication partners, the development of clinical tools to enhance intelligibility in this clinical population is critical.
Affiliation(s)
- Stephanie A. Borrie
  - Department of Communicative Disorders and Deaf Education, Utah State University, Logan
- Sarah E. Yoho
  - Department of Communicative Disorders and Deaf Education, Utah State University, Logan
  - Department of Speech and Hearing Science, The Ohio State University, Columbus
- Eric W. Healy
  - Department of Speech and Hearing Science, The Ohio State University, Columbus

6
Hashemi Hosseinabad H, Ishikawa K, Washington K. Agreements between speech language pathologists and naïve listeners' judgements of intelligibility in children with cleft palate. CLINICAL LINGUISTICS & PHONETICS 2022; 36:1010-1028. [PMID: 34597196 DOI: 10.1080/02699206.2021.1983021] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 12/05/2020] [Revised: 09/07/2021] [Accepted: 09/13/2021] [Indexed: 06/13/2023]
Abstract
Speech-language pathologists (SLPs) working in interdisciplinary craniofacial teams need to know how much their judgments of speech intelligibility could predict the patient's communication difficulties with non-clinical communicative partners. This study examines the relationship between experienced SLPs and naïve listeners in judgments of speech intelligibility in speakers with cleft palate. A speech perception study was conducted using speech samples from 20 speakers with velopharyngeal insufficiency (VPI) following cleft palate. Speech samples were presented to 70 naïve listeners and 10 SLPs who were experts in cleft palate treatment. Speech intelligibility scores from naïve listeners' orthographic transcriptions were obtained as the percentage of correctly identified words. Speech intelligibility scores from SLPs were obtained using a five-point rating scale. Spearman rank correlation indicated a very high level of overall agreement between naïve listeners and SLPs at the speaker level scores (rs = -.94, p < .001). While the listeners' judgment seems highly related across highly intelligible speakers, the differences in agreements increase when the speaker is unintelligible. The high correlations between scores for naïve listeners and SLPs suggest that speech intelligibility in children with VPI could be predicted by ratings done in the clinic by expert SLPs.
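The agreement statistic named in this abstract, a Spearman rank correlation between transcription-based scores and clinician ratings, can be sketched as follows. This is a minimal illustration, not the authors' analysis: the speaker-level values are invented, and the direction of the five-point scale (1 = most intelligible) is an assumption, chosen because an inverse scale is one way a strong agreement can show up as a negative coefficient.

```python
def average_ranks(values):
    """Return 1-based ranks, giving tied values the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        tied_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = tied_rank
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman's rho: the Pearson correlation of the two rank vectors."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    sx = sum((a - mx) ** 2 for a in rx) ** 0.5
    sy = sum((b - my) ** 2 for b in ry) ** 0.5
    return cov / (sx * sy)


# Hypothetical speaker-level data (not from the study).
percent_words_correct = [95, 88, 70, 52, 30]  # naive listeners' transcriptions
slp_rating = [1, 2, 2, 4, 5]                  # assumed scale: 1 = most intelligible

rho = spearman(percent_words_correct, slp_rating)
print(f"Spearman rho = {rho:.2f}")
```

Because the coefficient depends only on rank order, it can compare a percentage score with an ordinal rating without assuming the two scales are linearly related.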
Affiliation(s)
- Hedieh Hashemi Hosseinabad
  - Department of Communication Sciences and Disorders, College of Health Science and Public Health, Eastern Washington University, Spokane, Washington, USA
- Keiko Ishikawa
  - Department of Speech and Hearing Science, College of Applied Health Science, University of Illinois at Urbana-Champaign, Illinois, USA
- Karla Washington
  - Department of Communication Sciences and Disorders, University of Cincinnati, Cincinnati, Ohio, USA

7
Olmstead AJ, Lee J, Chen J. Perceptual Learning of Altered Vowel Space Improves Identification of Vowels Produced by Individuals With Dysarthria Secondary to Amyotrophic Lateral Sclerosis. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2022; 65:2204-2214. [PMID: 35623135 DOI: 10.1044/2022_jslhr-21-00567] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/15/2023]
Abstract
PURPOSE This study examines the efficacy of perceptual training for improving typical listeners' identification of vowels produced by individuals with dysarthria. We examined whether training on a subset of vowels can generalize to (a) untrained vowels and (b) other speakers with similar overall intelligibility. METHOD Sixty naive listeners completed a pretest/posttest perceptual learning task. In the pretraining test and posttraining test, participants identified nine American English monophthongs produced by two speakers with dysarthria secondary to amyotrophic lateral sclerosis (ALS). In the 20-min training task, a two-alternative forced choice (2AFC) task with feedback trained listeners on a subset of the vowels and speakers presented in the pretraining test. RESULTS Vowel identification accuracy improved overall as a function of training. However, patterns of generalization between speakers and vowel types were not symmetric. Specifically, listeners generalized training from front vowels to back vowels but not vice versa. Likewise, listeners generalized from one speaker to another but not in the opposite direction. Examination of confusion matrices for the pretraining and posttraining revealed complex patterns of vowel-specific improvement. CONCLUSIONS This study demonstrates that listeners benefit from a very simple training paradigm targeting vowels. Additionally, error patterns revealed that vowels are both resistant to and responsive to perceptual learning. Implications for future research and clinical training paradigms are discussed.
Affiliation(s)
- Annie J Olmstead
  - Department of Communication Sciences and Disorders, The Pennsylvania State University, University Park
- Jimin Lee
  - Department of Communication Sciences and Disorders, The Pennsylvania State University, University Park
- Janice Chen
  - Department of Communication Sciences and Disorders, The Pennsylvania State University, University Park

8
Lehner K, Ziegler W. Indicators of Communication Limitation in Dysarthria and Their Relation to Auditory-Perceptual Speech Symptoms: Construct Validity of the KommPaS Web App. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2022; 65:22-42. [PMID: 34890213 DOI: 10.1044/2021_jslhr-21-00215] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Indexed: 06/13/2023]
Abstract
PURPOSE Despite extensive research into communication-related parameters in dysarthria, such as intelligibility, naturalness, and perceived listener effort, the existing evidence has not been translated into a clinically applicable, comprehensive, and valid diagnostic tool so far. This study addresses Communication-Related Parameters in Speech Disorders (KommPaS), a new web-based diagnostic instrument for measuring indices of communication limitation in individuals with dysarthria through online crowdsourcing. More specifically, it answers questions about the construct validity of KommPaS. In the first part, the interrelationship of the KommPaS variables intelligibility, naturalness, perceived listener effort, and speech rate were explored in order to draw a comprehensive picture of a patient's limitations and avoid the collection of redundant information. Second, the influences of motor speech symptoms on the KommPaS variables were studied in order to delineate the structural relationships between two complementary diagnostic perspectives. METHOD One hundred persons with dysarthria of different etiologies and varying degrees of severity were examined with KommPaS to obtain layperson-based data on communication-level parameters, and with the Bogenhausen Dysarthria Scale (BoDyS) to obtain expert-based, function-level data on dysarthria symptoms. The internal structure of the KommPaS variables and their dependence on the BoDyS variables were analyzed using structural equation modeling. RESULTS Despite a high multicollinearity, all KommPaS variables were shown to provide complementary diagnostic information and their mutual interconnections were delineated in a path graph model. Regarding the influence of the BoDyS scales on the KommPaS variables, separate linear regression models revealed plausible predictor sets. 
A complete path model of KommPaS and BoDyS variables was developed to map the complex interplay between variables at the functional and the communication levels of dysarthria assessment. CONCLUSION In validating a new clinical tool for the diagnostics of communication limitations in dysarthria, this study is the first to draw a comprehensive picture of how auditory-perceptual characteristics of dysarthria interact at the levels of expert-based functional and layperson-based communicative assessments.
Affiliation(s)
- Katharina Lehner
  - Clinical Neuropsychology Research Group, Institute for Phonetics and Speech Processing, Ludwig-Maximilians-University Munich, Germany
- Wolfram Ziegler
  - Clinical Neuropsychology Research Group, Institute for Phonetics and Speech Processing, Ludwig-Maximilians-University Munich, Germany

9
Ziegler W, Lehner K. Crowdsourcing as a tool in the clinical assessment of intelligibility in dysarthria: How to deal with excessive variation. JOURNAL OF COMMUNICATION DISORDERS 2021; 93:106135. [PMID: 34214758 DOI: 10.1016/j.jcomdis.2021.106135] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 11/20/2020] [Revised: 06/03/2021] [Accepted: 06/07/2021] [Indexed: 06/13/2023]
Abstract
PURPOSE Independent laypersons are essential in the assessment of intelligibility in persons with dysarthria (PWD), as they reflect intelligibility limitations in the most ecologically valid way, without being influenced by familiarity with the speaker. The present work investigated online crowdsourcing as a convenient method to involve lay people as listeners, with the objective of exploring how to constrain the expected variability of crowd-based judgements to make them applicable in clinical diagnostics. METHOD Intelligibility was assessed using a word transcription task administered via crowdsourcing. In study 1, speech samples of 23 PWD were transcribed by 18 crowdworkers each. Four methods of aggregating the intelligibility scores of randomly sampled panels of 4 to 14 listeners were compared for accuracy, i.e. the stability of the resulting intelligibility estimates across different panels, and their validity, i.e. the degree to which they matched data obtained under controlled laboratory conditions ("gold standard"). In addition, we determined an economically acceptable number of crowdworkers per speaker which is needed to obtain accurate and valid intelligibility estimates. Study 2 examined the robustness of the chosen aggregation method against downward outliers due to spamming in a larger sample of 100 PWD. RESULTS In study 1, an interworker aggregation method based on negative exponential weightings of the scores as a function of their distance from the "best" listener's score (exponentially weighted mean) outperformed three other methods (median value, arithmetic mean, maximum). Under cost-benefit considerations, an optimum panel size of 9 crowd listeners per examination was determined. Study 2 demonstrated the robustness of this aggregation method against spamming crowd listeners. 
CONCLUSION Though intelligibility data collected through online crowdsourcing are noisy, accurate and valid intelligibility estimates can be obtained by appropriate aggregation of the raw data. This makes crowdsourcing a suitable method for incorporating real-world perspectives into clinical dysarthria assessment.
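The aggregation idea described in this abstract, weighting each crowdworker's score by a negative exponential of its distance from the "best" listener's score, can be sketched as below. The abstract does not give the formula, so this is only one plausible reading: the choice of the panel maximum as the "best" score, the `decay` parameter and its value, and the example panel are all assumptions made for illustration.

```python
import math


def exp_weighted_mean(scores, decay=0.1):
    """Aggregate one speaker's panel scores (percent words correct).

    Each score gets weight exp(-decay * distance), where distance is the gap
    to the highest score in the panel. `decay` is a hypothetical tuning
    parameter, not a value taken from the paper.
    """
    if not scores:
        raise ValueError("need at least one score")
    best = max(scores)  # assumption: the "best" listener is the top scorer
    weights = [math.exp(-decay * (best - s)) for s in scores]
    return sum(w * s for w, s in zip(weights, scores)) / sum(weights)


# Hypothetical panel of 9 crowd listeners; the last one is spamming.
panel = [82.0, 78.0, 80.0, 75.0, 79.0, 81.0, 77.0, 83.0, 5.0]

plain_mean = sum(panel) / len(panel)
robust = exp_weighted_mean(panel)
print(f"arithmetic mean: {plain_mean:.1f}, exponentially weighted mean: {robust:.1f}")
```

Under these assumptions a single very low score receives a weight close to zero, so it pulls the estimate down far less than it pulls the arithmetic mean, which is the kind of robustness against downward outliers the abstract reports.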
Affiliation(s)
- Wolfram Ziegler
  - Clinical Neuropsychology Research Group, Institute of Phonetics and Speech Processing, Ludwig-Maximilians-University, Germany.
- Katharina Lehner
  - Clinical Neuropsychology Research Group, Institute of Phonetics and Speech Processing, Ludwig-Maximilians-University, Germany.

10
Borrie SA, Lansford KL. A Perceptual Learning Approach for Dysarthria Remediation: An Updated Review. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2021; 64:3060-3073. [PMID: 34289312 PMCID: PMC8740677 DOI: 10.1044/2021_jslhr-21-00012] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Received: 01/07/2021] [Revised: 03/15/2021] [Accepted: 03/29/2021] [Indexed: 05/19/2023]
Abstract
Purpose Early studies of perceptual learning of dysarthric speech, those summarized in Borrie, McAuliffe, and Liss (2012), yielded preliminary evidence that listeners could learn to better understand the speech of a person with dysarthria, revealing a potentially promising avenue for future intelligibility interventions. Since then, a programmatic body of research grounded in models of perceptual processing has unfolded. The current review provides an updated account of the state of the evidence in this area and offers direction for moving this work toward clinical implementation. Method The studies that have investigated perceptual learning of dysarthric speech (N = 24) are summarized and synthesized first according to the proposed learning source and then by highlighting the parameters that appear to mediate learning, culminating with additional learning outcomes. Results The recent literature has established strong empirical evidence of intelligibility improvements following familiarization with dysarthric speech and a theoretical account of the mechanisms that facilitate improved processing of the neurologically degraded acoustic signal. Conclusions There are no existing intelligibility interventions for individuals with dysarthria who cannot behaviorally modify their speech. However, there is now robust support for the development of an approach that shifts the weight of behavioral change from speaker to listener, exploiting perceptual learning to ease the intelligibility burden of dysarthria. To move this work from bench to bedside, recommendations for translational studies that establish best practices and candidacy for listener-targeted dysarthria remediation, perceptual training, are provided.
Affiliation(s)
- Stephanie A. Borrie
  - Department of Communicative Disorders and Deaf Education, Utah State University, Logan
- Kaitlin L. Lansford
  - Department of Communication Science and Disorders, Florida State University, Tallahassee

11
Lagerberg TB, Anrep-Nordin E, Emanuelsson H, Strömbergsson S. Parent rating of intelligibility: A discussion of the construct validity of the Intelligibility in Context Scale (ICS) and normative data of the Swedish version of the ICS. INTERNATIONAL JOURNAL OF LANGUAGE & COMMUNICATION DISORDERS 2021; 56:873-886. [PMID: 34125476 DOI: 10.1111/1460-6984.12634] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 02/17/2020] [Revised: 03/25/2021] [Accepted: 05/05/2021] [Indexed: 06/12/2023]
Abstract
BACKGROUND Intelligibility can be defined as the speakers' ability to convey a message to the listener and it is considered the key functional measure of speech. The Intelligibility in Context Scale (ICS) is a parent rating scale used to assess intelligibility in children. AIMS To describe normative and validation data on the ICS in Swedish and to investigate how these are related to age, gender and multilingualism. METHODS & PROCEDURES Two studies were included. Study 1 included ICS forms from 319 Swedish-speaking children (3:2-9:2 years:months). Study 2 included video recordings and ICS forms from 14 children with speech sound disorder (SSD) and two with typical speech. The video recordings were transcribed in the validation process, resulting in intelligibility reference scores to which ICS scores were correlated. OUTCOMES & RESULTS Study 1: The mean value of the ICS for the 319 children was 4.73. There were no differences in ICS score related to age or gender. The children in the multilingual group were significantly older than the monolingual group and had significantly lower ICS scores than the group of monolinguals. Study 2: There was a moderate correlation between the ICS score and the transcription-based intelligibility score, with the two children with typical speech excluded; however, this correlation was not significant. CONCLUSIONS & IMPLICATIONS We contribute mean scores and percentiles on the ICS for Swedish-speaking children. The finding that the ICS does not provide valid measures of intelligibility for the included children with SSD suggests that the instrument measures a different construct. WHAT THIS PAPER ADDS What is already known on the subject: The ICS has been translated to numerous languages and validated against articulation measures in several previous studies. The validity of the Swedish version has been investigated against intelligibility based on transcription of single words. 
What this paper adds to existing knowledge: The study provides normative values of the Swedish version of the ICS for children aged 3-9 years. This is the first study to use a gold standard measure of intelligibility in continuous speech to validate the ICS. The results show a somewhat dubious validity regarding ICS for the group of children with SSD included in the study. What are the potential or actual clinical implications of this work? The ICS's suitability as a measure of intelligibility is questionable; however, it might be of use for speech and language pathologists to give an overview of the parents' view of their child's ability to communicate, in order to make a decision on possible further assessment and intervention. The normative values of the Swedish version of the ICS could be of use in this decision process.
Affiliation(s)
- Tove B Lagerberg
  - Institute of Neuroscience and Physiology, Division of Speech and Language Pathology, The Sahlgrenska Academy, University of Gothenburg, Gothenburg, Sweden
- Elin Anrep-Nordin
  - Institute of Neuroscience and Physiology, Division of Speech and Language Pathology, The Sahlgrenska Academy, University of Gothenburg, Gothenburg, Sweden
- Helena Emanuelsson
  - Institute of Neuroscience and Physiology, Division of Speech and Language Pathology, The Sahlgrenska Academy, University of Gothenburg, Gothenburg, Sweden
- Sofia Strömbergsson
  - Division of Speech and Language Pathology, Department of Clinical Science, Intervention and Technology (CLINTEC), Karolinska Institutet (KI), Stockholm, Sweden

12
Fletcher A, McAuliffe M. Comparing Lexical Cues in Listener Processing of Dysarthria and Speech in Noise. AMERICAN JOURNAL OF SPEECH-LANGUAGE PATHOLOGY 2021; 30:1572-1579. [PMID: 33630661 DOI: 10.1044/2020_ajslp-20-00137] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Purpose The frequency of a word and its number of phonologically similar neighbors can dramatically affect how likely it is to be accurately identified in adverse listening conditions. This study compares how these two cues affect listeners' processing of speech in noise and dysarthric speech. Method Seven speakers with moderate hypokinetic dysarthria and eight healthy control speakers were recorded producing the same set of phrases. Statements from control speakers were mixed with noise at a level selected to match the intelligibility range of the speakers with dysarthria. A binomial mixed-effects model quantified the effects of word frequency and phonological density on word identification. Results The model revealed significant effects of word frequency (b = 0.37, SE = 0.12, p = .002) and phonological neighborhood density (b = 0.40, SE = 0.12, p = .001). There was no effect of speaking condition (i.e., dysarthric speech vs. speech in noise). However, a significant interaction was observed between speaking condition and word frequency (b = 0.26, SE = 0.04, p < .001). Conclusions The model's interactions indicated that listeners were more strongly influenced by the effects of word frequency when decoding moderate hypokinetic dysarthria as compared to speech in noise. Differences in listener reliance on lexical cues may have important implications for the selection of communication-based treatment strategies for speakers with dysarthria.
Affiliation(s)
- Annalise Fletcher
- Department of Audiology and Speech-Language Pathology, University of North Texas, Denton
- Megan McAuliffe
- School of Psychology, Speech and Hearing, University of Canterbury, New Zealand
|
13
|
Lubold N, Willi MM, Borrie SA, Barrett TS, Berisha V. Healthy Communication Partners Modify Their Speech When Conversing With Individuals With Parkinson's Disease. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2021; 64:1539-1549. [PMID: 33831330 PMCID: PMC8608161 DOI: 10.1044/2021_jslhr-20-00233] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/07/2020] [Revised: 10/02/2020] [Accepted: 01/06/2021] [Indexed: 05/22/2023]
Abstract
Purpose For individuals with Parkinson's disease (PD), conversational interactions can be challenging. Efforts to improve the success of these interactions have largely fallen on the individual with PD. Successful communication, however, involves contributions from both the individual with PD and their communication partner. The current study examines whether healthy communication partners naturally engage in different acoustic-prosodic behavior (speech compensations) when conversing with an individual with PD and, further, whether such behavior aids communication success. Method Measures of articulatory precision, speaking rate, and pitch variability were extracted from the speech of healthy speakers engaged in goal-directed dialogue with other healthy speakers (healthy-healthy dyads) and with individuals with PD (healthy-PD dyads). Speech compensations, operationally defined as significant differences in healthy speakers' acoustic-prosodic behavior in healthy-healthy dyads versus healthy-PD dyads, were calculated for the three speech behaviors. Finally, the relationships between speech behaviors and an objective measure of communicative efficiency were examined. Results Healthy speakers engaged in speech characterized by greater articulatory precision and slower speaking rate when conversing with individuals with PD relative to conversations with other healthy individuals. However, these adaptive speech compensations were not predictive of communicative efficiency. Conclusions Evidence that healthy speakers naturally engage in speech compensations when conversing with individuals with PD is novel, yet consistent with findings from studies with other populations in which conversation can be challenging. In the case of PD, these compensatory behaviors did not support communication outcomes. While preliminary in nature, the results raise important questions regarding the speech behavior of healthy communication partners and provide directions for future work.
Affiliation(s)
- Nichola Lubold
- School of Electrical, Computer, and Energy Engineering, Arizona State University, Tempe
- Megan M. Willi
- Communication Sciences and Disorders Program, California State University, Chico
- Stephanie A. Borrie
- Department of Communicative Disorders and Deaf Education, Utah State University, Logan
- Visar Berisha
- School of Electrical, Computer, and Energy Engineering, Arizona State University, Tempe
|
14
|
Borrie SA, Lansford KL, Barrett TS. A Clinical Advantage: Experience Informs Recognition and Adaptation to a Novel Talker With Dysarthria. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2021; 64:1503-1514. [PMID: 33831307 PMCID: PMC8608160 DOI: 10.1044/2021_jslhr-20-00663] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/17/2020] [Revised: 12/22/2020] [Accepted: 01/21/2021] [Indexed: 05/19/2023]
Abstract
Purpose Perceptual training paradigms, which leverage the mechanism of perceptual learning, show that naïve listeners, those with no prior experience with dysarthria, benefit from explicit familiarization with a talker with dysarthria. It is theorized that familiarization affords listeners an opportunity to acquire distributional knowledge of the degraded speech signal. Here, we extend investigations to clinically experienced listeners, speech-language pathologists (SLPs), and advance models of listener recognition and adaptation to dysarthric speech. Method Forty-seven SLPs completed a standard three-phase perceptual training protocol (pretest, familiarization, and posttest) with a novel talker with dysarthria. Intelligibility scores were compared with historical data from naïve listeners. Potential relationships between intelligibility scores and characteristics of clinical experience were examined. Results Intelligibility scores of SLPs improved by an average of 19% from pretest to posttest. This intelligibility improvement was lower than that of naïve listeners, although the difference was small. Moreover, clinical characteristics related to level of dysarthria experience (e.g., percent of caseload composed of dysarthria) predicted pretest/initial intelligibility. No predictive relationships between clinical characteristics and intelligibility improvement were revealed. Conclusions As a group, SLPs benefitted from perceptual training, suggesting that, despite prior experience, the opportunity to acquire knowledge of talker-specific cue distributions is crucial for optimal adaptation. However, SLPs with greater dysarthria experience were better at initially understanding the talker with dysarthria. This suggests that, through regular interaction with individuals with dysarthria, clinicians acquire knowledge of the cue distributions of dysarthric speech more generally and can generalize this group-specific knowledge to aid in understanding other talkers with dysarthria.
Consistent with theoretical models of perceptual learning, both talker- and group-specific knowledge informed recognition and adaptation to dysarthric speech.
Affiliation(s)
- Stephanie A. Borrie
- Department of Communicative Disorders and Deaf Education, Utah State University, Logan
- Kaitlin L. Lansford
- School of Communication Science and Disorders, Florida State University, Tallahassee
|
15
|
Bieber RE, Gordon-Salant S. Improving older adults' understanding of challenging speech: Auditory training, rapid adaptation and perceptual learning. Hear Res 2021; 402:108054. [PMID: 32826108 PMCID: PMC7880302 DOI: 10.1016/j.heares.2020.108054] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/27/2020] [Revised: 07/21/2020] [Accepted: 08/02/2020] [Indexed: 12/13/2022]
Abstract
The literature surrounding auditory perceptual learning and auditory training for challenging speech signals in older adult listeners is highly varied, in terms of both study methodology and reported outcomes. In this review, we discuss some of the pertinent features of listener, stimulus, and training protocol. Literature regarding the elicitation of auditory perceptual learning for time-compressed speech, non-native speech, and noise-vocoded speech is reviewed, as are auditory training protocols designed to improve speech-in-noise recognition. The literature is synthesized to establish some over-arching findings for the aging population, including an intact capacity for auditory perceptual learning, but a limited transfer of learning to untrained stimuli.
Affiliation(s)
- Rebecca E Bieber
- Department of Hearing and Speech Sciences, University of Maryland, 0100 LeFrak Hall, 7251 Preinkert Drive, College Park, MD 20742, United States
- Sandra Gordon-Salant
- Department of Hearing and Speech Sciences, University of Maryland, 0100 LeFrak Hall, 7251 Preinkert Drive, College Park, MD 20742, United States
|
16
|
Hirsch ME, Lansford KL, Barrett TS, Borrie SA. Generalized Learning of Dysarthric Speech Between Male and Female Talkers. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2021; 64:444-451. [PMID: 33508210 PMCID: PMC8632474 DOI: 10.1044/2020_jslhr-20-00313] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/04/2020] [Revised: 09/28/2020] [Accepted: 10/21/2020] [Indexed: 05/29/2023]
Abstract
Purpose Perceptual training is a listener-targeted means for improving intelligibility of dysarthric speech. Recent work has shown that training with one talker generalizes to a novel talker of the same sex and that the magnitude of benefit is maximized when the talkers are perceptually similar. The current study expands previous findings by investigating whether perceptual training effects generalize between talkers of different sex. Method Forty new listeners were recruited for this study and completed a pretest, familiarization, and posttest perceptual training paradigm. Historical data collected using the same three-phase protocol were included in the data analysis. All listeners were exposed to the same talker with dysarthria during the pretest and posttest phases. For the familiarization phase, listeners were exposed to one of four talkers with dysarthria, differing in sex and level of perceptual similarity to the test talker or a control talker. During the testing phases, listeners transcribed phrases produced by the test talker with dysarthria. Listener transcriptions were then used to calculate a percent words correct intelligibility score. Results Multiple linear regression analysis revealed that intelligibility at posttest was not predicted by sex of the training talker. Consistent with earlier work, the magnitude of intelligibility gain was greater when the familiarization and test talkers were perceptually similar. Additional analyses revealed greater between-listeners variability in the dissimilar conditions as compared to the similar conditions. Conclusions Learning as a result of perceptual training with one talker with dysarthria generalized to another talker regardless of sex. In addition, listeners trained with perceptually similar talkers had greater and more consistent intelligibility improvement.
Together, these results add to previous evidence demonstrating that learning generalizes to novel talkers with dysarthria and that perceptual training is suitable for many listeners.
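The percent words correct score mentioned in the abstract above can be illustrated with a minimal sketch. The abstract does not give the scoring rules, so the choices here are assumptions: the function name `percent_words_correct` is hypothetical, matching is exact after lowercasing, word order is ignored, and each target word can be matched at most once. Published scoring protocols may be more lenient, for example by accepting certain misspellings or tense changes.

```python
import re
from collections import Counter


def percent_words_correct(target: str, transcript: str) -> float:
    """Percentage of target words that also appear in the listener transcript.

    Assumption: exact, case-insensitive matching with no credit for near
    misses; each target word is matched at most once.
    """
    def tokenize(text: str) -> list[str]:
        # Keep letters and apostrophes so contractions stay one token.
        return re.findall(r"[a-z']+", text.lower())

    target_counts = Counter(tokenize(target))
    heard_counts = Counter(tokenize(transcript))
    total = sum(target_counts.values())
    if total == 0:
        raise ValueError("target phrase contains no words")
    # Multiset intersection counts each target word at most as often
    # as the listener actually wrote it.
    matched = sum((target_counts & heard_counts).values())
    return 100.0 * matched / total


# Hypothetical phrase and transcript, not taken from any study.
score = percent_words_correct("address her meeting time",
                              "a dress her meeting time")
```

In this made-up example three of the four target words are matched, so `score` is 75.0. A gain from pretest to posttest would then be the difference between two such scores averaged over phrases.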
Affiliation(s)
- Micah E. Hirsch
- School of Communication Science and Disorders, Florida State University, Tallahassee
- Kaitlin L. Lansford
- School of Communication Science and Disorders, Florida State University, Tallahassee
- Stephanie A. Borrie
- Department of Communicative Disorders and Deaf Education, Utah State University, Logan
|
17
|
Lansford KL, Borrie SA, Barrett TS, Flechaus C. When Additional Training Isn't Enough: Further Evidence That Unpredictable Speech Inhibits Adaptation. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2020; 63:1700-1711. [PMID: 32437259 PMCID: PMC7839029 DOI: 10.1044/2020_jslhr-19-00380] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/05/2019] [Revised: 02/21/2020] [Accepted: 02/24/2020] [Indexed: 05/29/2023]
Abstract
Purpose Robust improvements in intelligibility following familiarization, a listener-targeted perceptual training paradigm, have been revealed for talkers diagnosed with spastic, ataxic, and hypokinetic dysarthria but not for talkers with hyperkinetic dysarthria. While the theoretical explanation for the lack of intelligibility improvement following training with hyperkinetic talkers is that there is insufficient distributional regularity in the speech signals to support perceptual adaptation, it could simply be that the standard training protocol was inadequate to facilitate learning of the unpredictable talker. In a pair of experiments, we addressed this possible alternate explanation by modifying the levels of exposure and feedback provided by the perceptual training protocol to offer listeners a more robust training experience. Method In Experiment 1, we examined the exposure modifications, testing whether perceptual adaptation to an unpredictable talker with hyperkinetic dysarthria could be achieved with greater or more diverse exposure to dysarthric speech during the training phase. In Experiment 2, we examined feedback modifications, testing whether perceptual adaptation to the unpredictable talker could be achieved with the addition of internally generated somatosensory feedback, via vocal imitation, during the training phase. Results Neither task modification led to improved intelligibility of the unpredictable talker with hyperkinetic dysarthria. Furthermore, listeners who completed the vocal imitation task demonstrated significantly reduced intelligibility at posttest. Conclusion Together, the results from Experiments 1 and 2 replicate and extend findings from our previous work, suggesting perceptual adaptation is inhibited for talkers whose speech is largely characterized by unpredictable degradations. Collectively, these results underscore the importance of integrating signal predictability into theoretical models of perceptual learning.
Affiliation(s)
- Kaitlin L. Lansford
- School of Communication Science & Disorders, Florida State University, Tallahassee
- Stephanie A. Borrie
- Department of Communicative Disorders and Deaf Education, Utah State University, Logan
- Cassidy Flechaus
- School of Communication Science & Disorders, Florida State University, Tallahassee
|
18
|
Borrie SA, Barrett TS, Yoho SE. Autoscore: An open-source automated tool for scoring listener perception of speech. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2019; 145:392. [PMID: 30710955 PMCID: PMC6347573 DOI: 10.1121/1.5087276] [Citation(s) in RCA: 29] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/12/2018] [Revised: 11/26/2018] [Accepted: 12/10/2018] [Indexed: 05/19/2023]
Abstract
Speech perception studies typically rely on trained research assistants to score orthographic listener transcripts for words correctly identified. While the accuracy of the human scoring protocol has been validated with strong intra- and inter-rater reliability, the process of hand-scoring the transcripts is time-consuming and resource intensive. Here, an open-source computer-based tool for automated scoring of listener transcripts is built (Autoscore) and validated on three different human-scored data sets. Results show that Autoscore is not only highly accurate, achieving approximately 99% accuracy, but also extremely efficient. Thus, Autoscore affords a practical research tool, with clinical application, for scoring listener intelligibility of speech.
Affiliation(s)
- Stephanie A Borrie
- Department of Communicative Disorders and Deaf Education, Utah State University, Logan, Utah 84322, USA
- Tyson S Barrett
- Department of Psychology, Utah State University, Logan, Utah 84322, USA
- Sarah E Yoho
- Department of Communicative Disorders and Deaf Education, Utah State University, Logan, Utah 84322, USA
|
19
|
Coping with adversity: Individual differences in the perception of noisy and accented speech. Atten Percept Psychophys 2018; 80:1559-1570. [DOI: 10.3758/s13414-018-1537-4] [Citation(s) in RCA: 32] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
20
|
Borrie SA, Lansford KL, Barrett TS. Understanding dysrhythmic speech: When rhythm does not matter and learning does not happen. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2018; 143:EL379. [PMID: 29857710 PMCID: PMC5959736 DOI: 10.1121/1.5037620] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/05/2018] [Revised: 04/19/2018] [Accepted: 04/24/2018] [Indexed: 05/22/2023]
Abstract
A positive relationship between rhythm perception and improved understanding of a naturally dysrhythmic speech signal, ataxic dysarthria, has been previously reported [Borrie, Lansford, and Barrett. (2017). J. Speech Lang. Hear. Res. 60, 3110-3117]. The current follow-on investigation suggests that this relationship depends on the nature of the dysrhythmia. When the corrupted rhythm cues are relatively predictable, affording some learnable acoustic regularity, the relationship is replicated. However, this relationship is nonexistent, along with any intelligibility improvements, when the corrupted rhythm cues are unpredictable. Findings highlight a key role for rhythm perception and distributional regularities in adaptation to dysrhythmic speech.
Affiliation(s)
- Stephanie A Borrie
- Department of Communicative Disorders and Deaf Education, Utah State University, Logan, Utah 84322, USA
- Kaitlin L Lansford
- School of Communication Sciences and Disorders, Florida State University, Tallahassee, Florida 32306, USA
- Tyson S Barrett
- Department of Kinesiology and Health Sciences, Utah State University, Logan, Utah 84322, USA
|
21
|
Nagle KF, Eadie TL. Perceived listener effort as an outcome measure for disordered speech. JOURNAL OF COMMUNICATION DISORDERS 2018; 73:34-49. [PMID: 29567465 DOI: 10.1016/j.jcomdis.2018.03.003] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/27/2017] [Revised: 03/05/2018] [Accepted: 03/12/2018] [Indexed: 06/08/2023]
Abstract
PURPOSE Perceived listening effort is a perceptual dimension used to identify the amount of work necessary to understand disordered speech. The purpose of this study was to investigate the utility of perceived listening effort to provide unique information about disordered speech. The relationships between perceived listening effort and two current outcome measures (speech acceptability, intelligibility) were examined for listeners rating electrolaryngeal speech, along with their reliability and intra-rater agreement. METHODS Ten healthy male speakers read low-context sentences using an electrolarynx. Twenty-five inexperienced listeners orthographically transcribed and rated the stimuli for perceived listening effort and speech acceptability using a visual analog scale. Strict reliability and agreement criteria were set. RESULTS Perceived listening effort was moderately to strongly correlated with intelligibility (r = -0.76) and acceptability (r = -0.80), each of which contributed uniquely to ratings of perceived listening effort. However, only 17 listeners met stringent reliability and agreement criteria. CONCLUSIONS Ratings of perceived listening effort may provide unique information about the communicative success of individuals with communication disorders. There is great variability, however, among inexperienced listeners' perceptual ratings of electrolaryngeal speech. Future research should investigate variables that may affect perceived listening effort specifically and auditory-perceptual ratings in general.
Affiliation(s)
- Kathleen F Nagle
- Department of Speech and Hearing Sciences, University of Washington, Seattle, WA, 98105, USA
- Tanya L Eadie
- Department of Speech and Hearing Sciences, University of Washington, Seattle, WA, 98105, USA
|
22
|
Chiu YF, Forrest K. The Impact of Lexical Characteristics and Noise on Intelligibility of Parkinsonian Speech. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2018; 61:837-846. [PMID: 29587306 DOI: 10.1044/2017_jslhr-s-17-0205] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/31/2017] [Accepted: 12/26/2017] [Indexed: 06/08/2023]
Abstract
PURPOSE This study investigated the impact of lexical characteristics on the intelligibility of speakers with Parkinson's disease (PD). Intelligibility was compared for listening in a quiet versus a noisy environment. METHOD A total of 192 young listeners participated in the study, with 96 listeners listening in quiet and 96 listening in noise in which the spoken sentences were mixed with 10-talker babble. The listeners transcribed spoken sentences with target words controlled for word frequency and neighborhood density. The sentences were produced by 12 speakers with PD and 12 healthy control (HC) speakers. Transcription accuracy was compared across lexical categories between PD and HC and between the two listening conditions. RESULTS Transcription accuracy was higher for high-frequency words than low-frequency words for both speaker groups in quiet and in noise. The neighborhood density effect was evident only in the noise condition with better intelligibility for words from sparse neighborhoods than from dense neighborhoods. Regardless of listening condition, the PD group was less intelligible with a significant decrease in intelligibility in noise relative to quiet. CONCLUSIONS The lexical properties of the words affected listeners' understanding of Parkinsonian speech. Frequent words in sparse neighborhoods enhanced intelligibility, especially under adverse listening conditions. Babble noise had a detrimental impact on the intelligibility of Parkinsonian speech. The extent of the lexical effect on intelligibility increased in noise.
Affiliation(s)
- Yi-Fang Chiu
- Department of Communication Sciences and Disorders, Saint Louis University, MO
- Karen Forrest
- Department of Speech and Hearing Sciences, Indiana University Bloomington
|
23
|
Lansford KL, Luhrsen S, Ingvalson EM, Borrie SA. Effects of Familiarization on Intelligibility of Dysarthric Speech in Older Adults With and Without Hearing Loss. AMERICAN JOURNAL OF SPEECH-LANGUAGE PATHOLOGY 2018; 27:91-98. [PMID: 29305612 PMCID: PMC5968332 DOI: 10.1044/2017_ajslp-17-0090] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/14/2017] [Revised: 07/17/2017] [Accepted: 07/25/2017] [Indexed: 05/25/2023]
Abstract
PURPOSE Familiarization tasks offer a promising platform for listener-targeted remediation of intelligibility disorders associated with dysarthria. To date, the body of work demonstrating improved understanding of dysarthric speech following a familiarization experience has been carried out on younger adults. The primary purpose of the present study was to examine the intelligibility effects of familiarization in older adults. METHOD Nineteen older adults, with and without hearing loss, completed a familiarization protocol consisting of three phases: pretest, familiarization, and posttest. The older adults' initial intelligibility and intelligibility improvement scores were compared with previously reported data collected from 50 younger adults (Borrie, Lansford, & Barrett, 2017a). RESULTS Relative to younger adults, initial intelligibility scores were significantly lower for older adults, although additional analysis revealed that the difference was limited to older adults with hearing loss. Key, however, is that irrespective of hearing status, the older and younger adults achieved comparable intelligibility improvement following familiarization (gain of roughly 20 percentage points). CONCLUSION This study extends previous findings of improved intelligibility of dysarthria following familiarization to a group of listeners who are critical to consider in listener-targeted remediation, namely, aging caregivers and/or spouses of individuals with dysarthria.
Affiliation(s)
- Kaitlin L. Lansford
- School of Communication Sciences and Disorders, Florida State University, Tallahassee
- Stephani Luhrsen
- School of Communication Sciences and Disorders, Florida State University, Tallahassee
- Erin M. Ingvalson
- School of Communication Sciences and Disorders, Florida State University, Tallahassee
- Stephanie A. Borrie
- Department of Communicative Disorders and Deaf Education, Utah State University, Logan
|
24
|
Yoho SE, Borrie SA. Combining degradations: The effect of background noise on intelligibility of disordered speech. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2018; 143:281. [PMID: 29390797 PMCID: PMC5775095 DOI: 10.1121/1.5021254] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
The effect of background noise on intelligibility of disordered speech was assessed. Speech-shaped noise was mixed with neurologically healthy (control) and disordered (dysarthric) speech at a series of signal-to-noise ratios. In addition, bandpass filtered control and dysarthric speech conditions were assessed to determine the effect of noise on both naturally and artificially degraded speech. While significant effects of both the amount of noise and the type of speech were revealed, no interaction between the two factors was observed, in either the broadband or filtered testing conditions. Thus, it appears that there is no multiplicative effect of the presence of background noise on intelligibility of disordered speech relative to control speech. That is, the decrease in intelligibility due to increasing levels of noise is similar for both types of speech, and both types of testing conditions, and the function for dysarthric speech is simply shifted downward due to the inherent source degradations of the speech itself. Last, large-scale online crowdsourcing via Amazon Mechanical Turk was utilized to collect data for the current study. Findings and implications for these data and this data collection approach are discussed.
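Mixing noise with speech at a chosen signal-to-noise ratio, as described in the abstract above, is commonly done by scaling the noise so that the ratio of root-mean-square (RMS) levels matches the target. The sketch below is an illustration under assumptions, not the study's procedure: the helper names `rms` and `mix_at_snr` are hypothetical, signals are plain lists of samples, and SNR is defined over the whole signal as 20·log10(RMS speech / RMS noise).

```python
import math


def rms(samples):
    """Root-mean-square level of a sequence of samples."""
    return math.sqrt(sum(v * v for v in samples) / len(samples))


def mix_at_snr(speech, noise, snr_db):
    """Scale `noise` so that speech-to-noise RMS ratio equals `snr_db`, then add.

    Returns (mixture, scaled_noise). Assumes whole-signal RMS is an
    acceptable level measure, which real stimuli may not satisfy
    (e.g. long pauses in the speech lower its RMS).
    """
    if len(noise) < len(speech):
        raise ValueError("noise must be at least as long as speech")
    noise = noise[:len(speech)]
    noise_rms = rms(noise)
    if noise_rms == 0:
        raise ValueError("noise is silent and cannot be scaled")
    # Target noise RMS is speech RMS divided by 10^(SNR/20).
    gain = rms(speech) / (noise_rms * 10 ** (snr_db / 20))
    scaled = [gain * n for n in noise]
    mixture = [s + n for s, n in zip(speech, scaled)]
    return mixture, scaled


# Synthetic stand-ins for speech and noise, for illustration only.
speech = [math.sin(0.1 * i) for i in range(1000)]
noise = [math.cos(0.37 * i) for i in range(1000)]
mixture, scaled_noise = mix_at_snr(speech, noise, snr_db=-3.0)
achieved_db = 20 * math.log10(rms(speech) / rms(scaled_noise))
```

Here `achieved_db` recovers the requested -3 dB up to floating-point error. Repeating the call over a list of `snr_db` values would produce the kind of SNR series the abstract refers to.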
Affiliation(s)
- Sarah E Yoho
- Department of Communicative Disorders and Deaf Education, Utah State University, Logan, Utah 84321, USA
- Stephanie A Borrie
- Department of Communicative Disorders and Deaf Education, Utah State University, Logan, Utah 84321, USA
|
25
|
Borrie SA, Lansford KL, Barrett TS. Generalized Adaptation to Dysarthric Speech. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2017; 60:3110-3117. [PMID: 29075754 PMCID: PMC5945074 DOI: 10.1044/2017_jslhr-s-17-0127] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/09/2017] [Accepted: 06/02/2017] [Indexed: 05/04/2023]
Abstract
PURPOSE Generalization of perceptual learning has received limited attention in listener adaptation studies with dysarthric speech. This study investigated whether adaptation to a talker with dysarthria could be predicted by the nature of the listener's prior familiarization experience, specifically similarity of perceptual features, and level of intelligibility. METHOD Following an intelligibility pretest involving a talker with ataxic dysarthria, 160 listeners were familiarized with 1 of 7 talkers with dysarthria, who differed from the test talker in terms of perceptual similarity (same, similar, dissimilar) and level of intelligibility (low, mid, high), or with a talker with no neurological impairment (control). Listeners then completed an intelligibility posttest on the test talker. RESULTS All listeners benefited from familiarization with a talker with dysarthria; however, adaptation to the test talker was superior when the familiarization talker had similar perceptual features and reduced when the familiarization talker had low intelligibility. CONCLUSION Evidence for both generalization and specificity of learning highlights the differential value of listeners' prior experiences for adaptation to, and improved understanding of, a talker with dysarthria. These findings broaden our theoretical knowledge of adaptation to degraded speech, as well as the clinical application of training paradigms that exploit perceptual processes for therapeutic gain.
Affiliation(s)
- Stephanie A. Borrie
- Department of Communicative Disorders and Deaf Education, Utah State University, Logan
- Kaitlin L. Lansford
- School of Communication Sciences and Disorders, Florida State University, Tallahassee
|
26
|
Rusiewicz HL, Rivera JL. The Effect of Hand Gesture Cues Within the Treatment of /r/ for a College-Aged Adult With Persisting Childhood Apraxia of Speech. AMERICAN JOURNAL OF SPEECH-LANGUAGE PATHOLOGY 2017; 26:1236-1243. [PMID: 29114768 DOI: 10.1044/2017_ajslp-15-0172] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/03/2015] [Accepted: 05/16/2017] [Indexed: 06/07/2023]
Abstract
PURPOSE Despite the widespread use of hand movements as visual and kinesthetic cues to facilitate accurate speech produced by individuals with speech sound disorders (SSDs), no experimental investigation of gestural cues that mimic the spatiotemporal parameters of speech sounds (e.g., holding fingers and thumb together and "popping" them to cue /p/) currently exists. The purpose of this study was to examine the effectiveness of manual mimicry cues within a multisensory intervention of persisting childhood apraxia of speech (CAS). METHOD A single-subject ABAB withdrawal design was implemented to assess the accuracy of vowel + /r/ combinations produced by a 21-year-old woman with persisting CAS. The effect of manual mimicry gestures paired with multisensory therapy consisting of verbal instructions and visual modeling was assessed via clinician and naïve listener ratings of target sound accuracy. RESULTS According to the perceptual ratings of the treating clinician and 28 naïve listeners, the participant demonstrated improved speech sound accuracy as a function of the manual mimicry/multisensory therapy. CONCLUSIONS These data offer preliminary support for the incorporation of gestural cues in therapy for CAS and other SSDs. The need for continued research on the interaction of speech and manual movements for individuals with SSDs is discussed.
|
27
|
Borrie SA, Schäfer MCM. Effects of Lexical and Somatosensory Feedback on Long-Term Improvements in Intelligibility of Dysarthric Speech. Journal of Speech, Language, and Hearing Research: JSLHR 2017; 60:2151-2158. [PMID: 28687828 DOI: 10.1044/2017_jslhr-s-16-0411] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Received: 10/29/2016] [Accepted: 02/09/2017] [Indexed: 06/07/2023]
Abstract
PURPOSE Intelligibility improvements immediately following perceptual training with dysarthric speech using lexical feedback are comparable to those observed when training uses somatosensory feedback (Borrie & Schäfer, 2015). In this study, we investigated if these lexical and somatosensory guided improvements in listener intelligibility of dysarthric speech remain comparable and stable over the course of 1 month. METHOD Following an intelligibility pretest, 60 participants were trained with dysarthric speech stimuli under one of three conditions: lexical feedback, somatosensory feedback, or no training (control). Participants then completed a series of intelligibility posttests, which took place immediately (immediate posttest), 1 week (1-week posttest) following training, and 1 month (1-month posttest) following training. RESULTS As per our previous study, intelligibility improvements at immediate posttest were equivalent between lexical and somatosensory feedback conditions. Condition differences, however, emerged over time. Improvements guided by lexical feedback deteriorated over the month whereas those guided by somatosensory feedback remained robust. CONCLUSIONS Somatosensory feedback, internally generated by vocal imitation, may be required to affect long-term perceptual gain in processing dysarthric speech. Findings are discussed in relation to underlying learning mechanisms and offer insight into how externally and internally generated feedback may differentially affect perceptual learning of disordered speech.
Affiliation(s)
- Stephanie A Borrie
- Department of Communicative Disorders and Deaf Education, Utah State University, Logan
- Martina C M Schäfer
- New Zealand Institute of Language, Brain and Behaviour, University of Canterbury, Christchurch
|
28
|
Borrie SA, Baese-Berk M, Van Engen K, Bent T. A relationship between processing speech in noise and dysarthric speech. The Journal of the Acoustical Society of America 2017; 141:4660. [PMID: 28679257 DOI: 10.1121/1.4986746] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Indexed: 05/21/2023]
Abstract
There is substantial individual variability in understanding speech in adverse listening conditions. This study examined whether a relationship exists between processing speech in noise (environmental degradation) and dysarthric speech (source degradation), with regard to intelligibility performance and the use of metrical stress to segment the degraded speech signals. Ninety native speakers of American English transcribed speech in noise and dysarthric speech. For each type of listening adversity, transcriptions were analyzed for proportion of words correct and lexical segmentation errors indicative of stress cue utilization. Consistent with the hypotheses, intelligibility performance for speech in noise was correlated with intelligibility performance for dysarthric speech, suggesting similar cognitive-perceptual processing mechanisms may support both. The segmentation results also support this postulation. While stress-based segmentation was stronger for speech in noise relative to dysarthric speech, listeners utilized metrical stress to parse both types of listening adversity. In addition, reliance on stress cues for parsing speech in noise was correlated with reliance on stress cues for parsing dysarthric speech. Taken together, the findings demonstrate a preference to deploy the same cognitive-perceptual strategy in conditions where metrical stress offers a route to segmenting degraded speech.
Affiliation(s)
- Stephanie A Borrie
- Department of Communicative Disorders and Deaf Education, Utah State University, Logan, Utah 84322, USA
- Melissa Baese-Berk
- Department of Linguistics, University of Oregon, Eugene, Oregon 97403, USA
- Kristin Van Engen
- Department of Psychological and Brain Sciences, Washington University, St. Louis, Missouri 63130, USA
- Tessa Bent
- Department of Speech and Hearing Sciences, Indiana University, Bloomington, Indiana 47405, USA
|
29
|
Borrie SA, Lansford KL, Barrett TS. Rhythm Perception and Its Role in Perception and Learning of Dysrhythmic Speech. Journal of Speech, Language, and Hearing Research: JSLHR 2017; 60:561-570. [PMID: 28241307 DOI: 10.1044/2016_jslhr-s-16-0094] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Received: 03/08/2016] [Accepted: 09/11/2016] [Indexed: 05/13/2023]
Abstract
PURPOSE The perception of rhythm cues plays an important role in recognizing spoken language, especially in adverse listening conditions. Indeed, this has been shown to hold true even when the rhythm cues themselves are dysrhythmic. This study investigates whether expertise in rhythm perception provides a processing advantage for perception (initial intelligibility) and learning (intelligibility improvement) of naturally dysrhythmic speech, dysarthria. METHOD Fifty young adults with typical hearing participated in 3 key tests, including a rhythm perception test, a receptive vocabulary test, and a speech perception and learning test, with standard pretest, familiarization, and posttest phases. Initial intelligibility scores were calculated as the proportion of correct pretest words, while intelligibility improvement scores were calculated by subtracting this proportion from the proportion of correct posttest words. RESULTS Rhythm perception scores predicted intelligibility improvement scores but not initial intelligibility. On the other hand, receptive vocabulary scores predicted initial intelligibility scores but not intelligibility improvement. CONCLUSIONS Expertise in rhythm perception appears to provide an advantage for processing dysrhythmic speech, but a familiarization experience is required for the advantage to be realized. Findings are discussed in relation to the role of rhythm in speech processing and shed light on processing models that consider the consequence of rhythm abnormalities in dysarthria.
Affiliation(s)
- Stephanie A Borrie
- Department of Communicative Disorders and Deaf Education, Utah State University, Logan
- Kaitlin L Lansford
- School of Communication Sciences and Disorders, Florida State University, Tallahassee
|
30
|
McAuliffe MJ, Fletcher AR, Kerr SE, O'Beirne GA, Anderson T. Effect of Dysarthria Type, Speaking Condition, and Listener Age on Speech Intelligibility. American Journal of Speech-Language Pathology 2017; 26:113-123. [PMID: 28124068 DOI: 10.1044/2016_ajslp-15-0182] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.6] [Received: 11/19/2015] [Accepted: 07/11/2016] [Indexed: 06/06/2023]
Abstract
PURPOSE The aim of this study was to examine the effect of loud and slow speech cues on younger and older listeners' comprehension of dysarthric speech, specifically, (a) whether one strategy, as opposed to the other, promoted greater intelligibility gains for different speaker groups; (b) whether older and younger listeners' understandings were differentially affected by these strategies; and (c) which acoustic changes best predicted intelligibility gain in individual speakers. METHOD Twenty younger and 40 older listeners completed a perceptual task. Six individuals with dysarthria produced phrases across habitual, loud, and slow conditions. The primary dependent variable was proportion of words correct; follow-up acoustic analyses linked perceptual outcomes to changes in acoustic speech features. RESULTS Regardless of dysarthria type, the loud condition produced significant intelligibility gains. Overall, older listeners' comprehension was reduced relative to younger listeners. Follow-up analysis revealed considerable interspeaker differences in intelligibility outcomes across conditions. Although the most successful speaking mode varied, intelligibility gains were strongly associated with the degree of change participants made to their vowel formants. CONCLUSIONS Perceptual outcomes vary across speaking modes, even when speakers with dysarthria are grouped according to similar perceptual profiles. Further investigation of interspeaker differences is needed to inform individually tailored intervention approaches.
Affiliation(s)
- Megan J McAuliffe
- Department of Communication Disorders, University of Canterbury, Christchurch, New Zealand; New Zealand Institute of Language, Brain & Behaviour, University of Canterbury, Christchurch, New Zealand
- Annalise R Fletcher
- Department of Communication Disorders, University of Canterbury, Christchurch, New Zealand; New Zealand Institute of Language, Brain & Behaviour, University of Canterbury, Christchurch, New Zealand
- Sarah E Kerr
- Department of Communication Disorders, University of Canterbury, Christchurch, New Zealand; New Zealand Institute of Language, Brain & Behaviour, University of Canterbury, Christchurch, New Zealand
- Greg A O'Beirne
- Department of Communication Disorders, University of Canterbury, Christchurch, New Zealand; New Zealand Institute of Language, Brain & Behaviour, University of Canterbury, Christchurch, New Zealand
- Tim Anderson
- New Zealand Brain Research Institute, Christchurch, New Zealand
|
31
|
Bent T, Baese-Berk M, Borrie SA, McKee M. Individual differences in the perception of regional, nonnative, and disordered speech varieties. The Journal of the Acoustical Society of America 2016; 140:3775. [PMID: 27908060 DOI: 10.1121/1.4966677] [Citation(s) in RCA: 44] [Impact Index Per Article: 5.5] [Indexed: 05/28/2023]
Abstract
Speech perception abilities vary substantially across listeners, particularly in adverse conditions including those stemming from environmental degradation (e.g., noise) or from talker-related challenges (e.g., nonnative or disordered speech). This study examined adult listeners' recognition of words in phrases produced by six talkers representing three speech varieties: a nonnative accent (Spanish-accented English), a regional dialect (Irish English), and a disordered variety (ataxic dysarthria). Semantically anomalous phrases from these talkers were presented in a transcription task and intelligibility scores, percent words correct, were compared across the three speech varieties. Three cognitive-linguistic areas (receptive vocabulary, cognitive flexibility, and inhibitory control of attention) were assessed as possible predictors of individual word recognition performance. Intelligibility scores for the Spanish accent were significantly correlated with scores for the Irish English and ataxic dysarthria. Scores for the Irish English and dysarthric speech, in contrast, were not correlated. Furthermore, receptive vocabulary was the only cognitive-linguistic assessment that significantly predicted intelligibility scores. These results suggest that, rather than a global skill of perceiving speech that deviates from native dialect norms, listeners may possess specific abilities to overcome particular types of acoustic-phonetic deviation. Furthermore, vocabulary size offers performance benefits for intelligibility of speech that deviates from one's typical dialect norms.
Affiliation(s)
- Tessa Bent
- Department of Speech and Hearing Sciences, Indiana University, Bloomington, Indiana 47405, USA
- Melissa Baese-Berk
- Department of Linguistics, University of Oregon, Eugene, Oregon 97403, USA
- Stephanie A Borrie
- Department of Communicative Disorders and Deaf Education, Utah State University, Logan, Utah 84332, USA
- Megan McKee
- Department of Speech and Hearing Sciences, Indiana University, Bloomington, Indiana 47405, USA
|
32
|
Lansford KL, Borrie SA, Bystricky L. Use of Crowdsourcing to Assess the Ecological Validity of Perceptual-Training Paradigms in Dysarthria. American Journal of Speech-Language Pathology 2016; 25:233-9. [PMID: 27145295 DOI: 10.1044/2015_ajslp-15-0059] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Received: 05/26/2015] [Accepted: 11/30/2015] [Indexed: 05/13/2023]
Abstract
PURPOSE It has been documented in laboratory settings that familiarizing listeners with dysarthric speech improves intelligibility of that speech. If these findings can be replicated in real-world settings, the ability to improve communicative function by focusing on communication partners has major implications for extending clinical practice in dysarthria rehabilitation. An important step toward development of a listener-targeted treatment approach requires establishment of its ecological validity. To this end, the present study leveraged the mechanism of crowdsourcing to determine whether perceptual-training benefits achieved by listeners in the laboratory could be elicited in an at-home computer-based scenario. METHOD Perceptual-training data (i.e., intelligibility scores from a posttraining transcription task) were collected from listeners in 2 settings: the laboratory and the crowdsourcing website Amazon Mechanical Turk. RESULTS Consistent with previous findings, results revealed a main effect of training condition (training vs. control) on intelligibility scores. There was, however, no effect of training setting (Mechanical Turk vs. laboratory). Thus, the perceptual benefit achieved via Mechanical Turk was comparable to that achieved in the laboratory. CONCLUSION This study provides evidence regarding the ecological validity of perceptual-training paradigms designed to improve intelligibility of dysarthric speech, thereby supporting their continued advancement as a listener-targeted treatment option.
|
33
|
Kim H. Familiarization Effects on Consonant Intelligibility in Dysarthric Speech. Folia Phoniatr Logop 2016; 67:245-52. [PMID: 26906426 DOI: 10.1159/000444255] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Indexed: 11/19/2022] Open
Abstract
BACKGROUND/AIMS This study investigates the effects of familiarization on naïve listeners' ability to identify consonants in dysarthric speech. METHODS A total of 120 listeners (30 listeners/speaker) participated in experiments over a 6-week period. Listeners were randomly assigned to one of the three familiarization conditions: a passive condition in which listeners heard audio recordings of words, an active condition in which listeners heard audio recordings of words while viewing the written material of words, and a control condition in which listeners had no exposure to the audio signal prior to identification tasks. RESULTS Familiarization improved naïve listeners' ability to identify consonants produced by a speaker with dysarthria. The active familiarization method exhibited an advantage over the other conditions, in terms of the magnitude and rapidness of improvement. One-month delayed test scores were higher than pre-familiarization scores, but the advantage of active familiarization was not present for all speakers. CONCLUSION This study supports familiarization benefits in enhancing consonant intelligibility in dysarthria and suggests that perceptual learning mechanisms be harnessed for developing effective listener-oriented intervention techniques in the management of dysarthria. Current findings call for further research on a familiarization protocol that can subserve segmental learning with maximum efficacy.
Affiliation(s)
- Heejin Kim
- Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, Ill., USA
|
34
|
Borrie SA, Schäfer MCM. The Role of Somatosensory Information in Speech Perception: Imitation Improves Recognition of Disordered Speech. Journal of Speech, Language, and Hearing Research: JSLHR 2015; 58:1708-16. [PMID: 26536172 DOI: 10.1044/2015_jslhr-s-15-0163] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Received: 05/01/2015] [Accepted: 09/14/2015] [Indexed: 05/13/2023]
Abstract
PURPOSE Perceptual learning paradigms involving written feedback appear to be a viable clinical tool to reduce the intelligibility burden of dysarthria. The underlying theoretical assumption is that pairing the degraded acoustics with the intended lexical targets facilitates a remapping of existing mental representations in the lexicon. This study investigated whether ties to mental representations can be strengthened by way of a somatosensory motor trace. METHOD Following an intelligibility pretest, 100 participants were assigned to 1 of 5 experimental groups. The control group received no training, but the other 4 groups received training with dysarthric speech under conditions involving a unique combination of auditory targets, written feedback, and/or a vocal imitation task. All participants then completed an intelligibility posttest. RESULTS Training improved intelligibility of dysarthric speech, with the largest improvements observed when the auditory targets were accompanied by both written feedback and an imitation task. Further, a significant relationship between intelligibility improvement and imitation accuracy was identified. CONCLUSIONS This study suggests that somatosensory information can strengthen the activation of speech sound maps of dysarthric speech. The findings, therefore, implicate a bidirectional relationship between speech perception and speech production as well as advance our understanding of the mechanisms that underlie perceptual learning of degraded speech.
|
35
|
Hustad KC, Oakes A, Allison K. Variability and Diagnostic Accuracy of Speech Intelligibility Scores in Children. Journal of Speech, Language, and Hearing Research: JSLHR 2015; 58:1695-707. [PMID: 26381119 PMCID: PMC4987026 DOI: 10.1044/2015_jslhr-s-14-0365] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.0] [Received: 12/29/2014] [Accepted: 08/17/2015] [Indexed: 05/15/2023]
Abstract
PURPOSE We examined variability of speech intelligibility scores and how well intelligibility scores predicted group membership among 5-year-old children with speech motor impairment (SMI) secondary to cerebral palsy and an age-matched group of typically developing (TD) children. METHOD Speech samples varying in length from 1-4 words were elicited from 24 children with cerebral palsy (mean age 60.50 months) and 20 TD children (mean age 60.33 months). Two hundred twenty adult listeners made orthographic transcriptions of speech samples (n = 5 per child). RESULTS Variability associated with listeners made a significant contribution to explaining the variance in intelligibility scores for TD and SMI children, but the magnitude was greater for TD children. Intelligibility scores differentiated very well between children who have SMI and TD children when intelligibility was at or below approximately 75% and above approximately 85%. CONCLUSIONS Intelligibility seems to be a useful clinical tool for differentiating between TD children and children with SMI at 5 years of age; however, there is considerable variability within and between listeners, highlighting the need for more than one listener per child to ensure validity of an intelligibility measure.
Affiliation(s)
- Katherine C. Hustad
- University of Wisconsin–Madison
- Waisman Center, University of Wisconsin–Madison
- Kristen Allison
- University of Wisconsin–Madison
- Waisman Center, University of Wisconsin–Madison
|
36
|
Borrie SA, Lubold N, Pon-Barry H. Disordered speech disrupts conversational entrainment: a study of acoustic-prosodic entrainment and communicative success in populations with communication challenges. Front Psychol 2015; 6:1187. [PMID: 26321996 PMCID: PMC4532918 DOI: 10.3389/fpsyg.2015.01187] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.7] [Received: 05/16/2015] [Accepted: 07/27/2015] [Indexed: 11/13/2022] Open
Abstract
Conversational entrainment, a pervasive communication phenomenon in which dialogue partners adapt their behaviors to align more closely with one another, is considered essential for successful spoken interaction. While well-established in other disciplines, this phenomenon has received limited attention in the field of speech pathology and the study of communication breakdowns in clinical populations. The current study examined acoustic-prosodic entrainment, as well as a measure of communicative success, in three distinctly different dialogue groups: (i) healthy native vs. healthy native speakers (Control), (ii) healthy native vs. foreign-accented speakers (Accented), and (iii) healthy native vs. dysarthric speakers (Disordered). Dialogue group comparisons revealed significant differences in how the groups entrain on particular acoustic-prosodic features, including pitch, intensity, and jitter. Most notably, the Disordered dialogues were characterized by significantly less acoustic-prosodic entrainment than the Control dialogues. Further, a positive relationship between entrainment indices and communicative success was identified. These results suggest that the study of conversational entrainment in speech pathology will have essential implications for both scientific theory and clinical application in this domain.
Affiliation(s)
- Stephanie A Borrie
- Human Interaction Lab, Department of Communicative Disorders and Deaf Education, Utah State University, Logan, UT, USA
- Nichola Lubold
- School of Computing, Informatics, and Decision Systems Engineering, Arizona State University, Tempe, AZ, USA
- Heather Pon-Barry
- Department of Computer Science, Mount Holyoke College, South Hadley, MA, USA
|
37
|
Lagerberg TB, Johnels JÅ, Hartelius L, Persson C. Effect of the number of presentations on listener transcriptions and reliability in the assessment of speech intelligibility in children. International Journal of Language & Communication Disorders 2015; 50:476-487. [PMID: 25588966 DOI: 10.1111/1460-6984.12149] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Received: 02/06/2014] [Accepted: 11/03/2014] [Indexed: 06/04/2023]
Abstract
BACKGROUND The assessment of intelligibility is an essential part of establishing the severity of a speech disorder. The intelligibility of a speaker is affected by a number of different variables relating, inter alia, to the speech material, the listener and the listener task. AIMS To explore the impact of the number of presentations of the utterances on assessments of intelligibility based on orthographic transcription of spontaneous speech, specifically the impact on intelligibility scores, reliability and intra-listener variability. METHODS & PROCEDURES Speech from 12 children (aged 4:6-8:3 years; mean = 5:10 years) with percentage consonants correct (PCC) scores ranging from 49 to 81 was listened to by 18 students on the speech-language pathology (SLP) programme and by two recent graduates from that programme. Three conditions were examined during the transcription phase: (1) listening to each utterance once; (2) listening to each utterance a second time; and (3) listening to all utterances from a given child a third time after having heard all of its utterances twice. OUTCOMES & RESULTS Statistically significant differences between intelligibility scores were found across the three conditions, i.e. the intelligibility score increased with the number of presentations while inter-judge reliability was unchanged. The results differed markedly across listeners, but each individual listener's results were very consistent across conditions. CONCLUSIONS & IMPLICATIONS Information about the number of times an utterance is presented to the listener is important and should therefore always be included in reports of research involving intelligibility assessment. There is a need for further research and discussion on listener abilities and strategies.
Affiliation(s)
- Tove B Lagerberg
- The Sahlgrenska Academy, University of Gothenburg, Institute of Neuroscience and Physiology, Division of Speech and Language Pathology, Gothenburg, Sweden
- Jakob Åsberg Johnels
- The Sahlgrenska Academy, University of Gothenburg, Institute of Neuroscience and Physiology, Division of Speech and Language Pathology, Gothenburg, Sweden
- Lena Hartelius
- The Sahlgrenska Academy, University of Gothenburg, Institute of Neuroscience and Physiology, Division of Speech and Language Pathology, Gothenburg, Sweden
- Christina Persson
- The Sahlgrenska Academy, University of Gothenburg, Institute of Neuroscience and Physiology, Division of Speech and Language Pathology, Gothenburg, Sweden
|
38
|
Borrie SA. Visual speech information: a help or hindrance in perceptual processing of dysarthric speech. The Journal of the Acoustical Society of America 2015; 137:1473-80. [PMID: 25786958 DOI: 10.1121/1.4913770] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Indexed: 05/21/2023]
Abstract
This study investigated the influence of visual speech information on perceptual processing of neurologically degraded speech. Fifty listeners identified spastic dysarthric speech under both audio (A) and audiovisual (AV) conditions. Condition comparisons revealed that the addition of visual speech information enhanced processing of the neurologically degraded input in terms of (a) acuity (percent phonemes correct) of vowels and consonants and (b) recognition (percent words correct) of predictive and nonpredictive phrases. Listeners exploited stress-based segmentation strategies more readily in AV conditions, suggesting that the perceptual benefit associated with adding visual speech information to the auditory signal (the AV advantage) has both segmental and suprasegmental origins. Results also revealed that the magnitude of the AV advantage can be predicted, to some degree, by the extent to which an individual utilizes syllabic stress cues to inform word recognition in AV conditions. Findings inform the development of a listener-specific model of speech perception that applies to processing of dysarthric speech in everyday communication contexts.
Affiliation(s)
- Stephanie A Borrie
- Department of Communicative Disorders and Deaf Education, Utah State University, Logan, Utah 84322
|
39
|
Kim H, Nanney S. Familiarization effects on word intelligibility in dysarthric speech. Folia Phoniatr Logop 2015; 66:258-64. [PMID: 25676630 DOI: 10.1159/000369799] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Indexed: 11/19/2022] Open
Abstract
BACKGROUND/AIMS This study investigated the effects of familiarization on naïve listeners' ability to perceive dysarthric speech produced by speakers with cerebral palsy and evaluated the degree of intelligibility improvement, both in the short and long term, as a function of (1) familiarization paradigms and (2) the number of familiarization phases. METHODS A total of 120 listeners (30 listeners/speaker) were recruited to complete word transcription tasks over a 6-week period. The listeners were assigned to one of the following familiarization paradigms: passive familiarization with audio signal only, active familiarization with both audio and orthography, and no explicit familiarization. Intelligibility scores were measured as the percentage of words correctly transcribed. RESULTS The active familiarization paradigm that provided listeners with both audio and orthography resulted in higher intelligibility scores compared to the passive familiarization and no explicit familiarization conditions. The degree of intelligibility improvement as a function of passive familiarization varied depending on the speaker. Last, the advantage of active familiarization was also found as a long-term effect. CONCLUSION Our findings provide evidence for the benefits of familiarization in enhancing the intelligibility of dysarthric speech and support the efficacy of familiarization paradigms as an intervention technique in the management of dysarthria.
Affiliation(s)
- Heejin Kim
- Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana-Champaign, Ill., USA
|
40
|
Lansford KL, Liss JM, Norton RE. Free-classification of perceptually similar speakers with dysarthria. Journal of Speech, Language, and Hearing Research: JSLHR 2014; 57:2051-64. [PMID: 25057892 PMCID: PMC9589826 DOI: 10.1044/2014_jslhr-s-13-0177] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Received: 07/06/2013] [Accepted: 06/27/2014] [Indexed: 05/15/2023]
Abstract
PURPOSE In this investigation, the construct of perceptual similarity was explored in the dysarthrias. Specifically, we employed an auditory free-classification task to determine whether listeners could cluster speakers by perceptual similarity, whether the clusters mapped to acoustic metrics, and whether the clusters were constrained by dysarthria subtype diagnosis. METHOD Twenty-three listeners blinded to speakers' medical and dysarthria subtype diagnoses participated. The task was to group together (drag and drop) the icons corresponding to 33 speakers with dysarthria on the basis of how similar they sounded. Cluster analysis and multidimensional scaling (MDS) modeled the perceptual dimensions underlying similarity. Acoustic metrics and perceptual judgments were used in correlation analyses to facilitate interpretation of the derived dimensions. RESULTS Six clusters of similar-sounding speakers and 3 perceptual dimensions underlying similarity were revealed. The clusters of similar-sounding speakers were not constrained by dysarthria subtype diagnosis. The 3 perceptual dimensions revealed by MDS were correlated with metrics for articulation rate, intelligibility, and vocal quality, respectively. CONCLUSIONS This study shows (a) feasibility of a free-classification approach for studying perceptual similarity in dysarthria, (b) correspondence between acoustic and perceptual metrics to clusters of similar-sounding speakers, and (c) similarity judgments transcended dysarthria subtype diagnosis.
|
41
|
Landa S, Pennington L, Miller N, Robson S, Thompson V, Steen N. Association between objective measurement of the speech intelligibility of young people with dysarthria and listener ratings of ease of understanding. International Journal of Speech-Language Pathology 2014; 16:408-416. [PMID: 25011400 DOI: 10.3109/17549507.2014.927922] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Indexed: 06/03/2023]
Abstract
PURPOSE This study aimed to investigate the association between listeners' ratings of how much effort it took to understand the speech of young people with cerebral palsy and the percentage of words listeners actually understood. METHOD Thirty-one young people with dysarthria and cerebral palsy (16 males, 15 females; mean age = 11 years, SD = 3) were audio recorded repeating single words and producing speech. Objective measures of intelligibility were calculated for multiple familiar and unfamiliar listeners using a forced choice paradigm for single words and verbatim orthographic transcriptions for connected speech. Listeners rated how much effort it took to understand speech in each condition using a 5-point ordinal ease of listening (EOL) scale. RESULTS Agreement on EOL within rater groups was high (ICC > 0.71). An effect of listener was observed for familiar listeners, but not for unfamiliar listeners. EOL agreement between familiar and unfamiliar listeners was weak-moderate (ICC = 0.46). EOL predicted the percentage of speech actually understood by familiar and unfamiliar listeners (r > 0.56, p < 0.001 for all predictions). Strongest associations between EOL and intelligibility were observed for speakers with mild and profound impairments. CONCLUSIONS The findings of this study demonstrate that listeners can judge how well they have understood dysarthric speech. EOL is associated with listener familiarity, speech task and speech impairment severity. EOL is appropriate for use in clinical practice as a measure of communication activity.
|
42
|
McAuliffe MJ, Kerr SE, Gibson EMR, Anderson T, LaShell PJ. Cognitive-perceptual examination of remediation approaches to hypokinetic dysarthria. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2014; 57:1268-1283. [PMID: 24687031 DOI: 10.1044/2014_jslhr-s-12-0349] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Indexed: 06/03/2023]
Abstract
PURPOSE To determine how increased vocal loudness and reduced speech rate affect listeners' cognitive-perceptual processing of hypokinetic dysarthric speech associated with Parkinson's disease. METHOD Fifty-one healthy listener participants completed a speech perception experiment. Listeners repeated phrases produced by 5 individuals with dysarthria across habitual, loud, and slow speaking modes. Listeners were allocated to habitual (n = 17), loud (n = 17), or slow (n = 17) experimental conditions. Transcripts derived from the phrase repetition task were coded for overall accuracy (i.e., intelligibility), and perceptual error analyses examined how these conditions affected listeners' phonemic mapping (i.e., syllable resemblance) and lexical segmentation (i.e., lexical boundary error analysis). RESULTS Both speech conditions provided obvious perceptual benefits to listeners. Overall, transcript accuracy was highest in the slow condition. In the loud condition, however, improvement was evidenced across the experiment. An error analysis suggested that listeners in the loud condition prioritized acoustic-phonetic cues in their attempts to resolve the degraded signal, whereas those in the slow condition appeared to preferentially weight lexical stress cues. CONCLUSIONS Increased loudness and reduced rate exhibited differential effects on listeners' perceptual processing of dysarthric speech. The current study highlights the insights that may be gained from a cognitive-perceptual approach.
|
43
|
Borrie SA, Liss JM. Rhythm as a coordinating device: entrainment with disordered speech. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2014; 57:815-24. [PMID: 24686410 PMCID: PMC4084711 DOI: 10.1044/2014_jslhr-s-13-0149] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.9] [Indexed: 05/15/2023]
Abstract
PURPOSE The rhythmic entrainment (coordination) of behavior during human interaction is a powerful phenomenon, considered essential for successful communication, supporting social and emotional connection, and facilitating sense-making and information exchange. Disruption in entrainment likely occurs in conversations involving those with speech and language impairment, but its contribution to communication disorders has not been defined. As a first step to exploring this phenomenon in clinical populations, the present investigation examined the influence of disordered speech on the speech production properties of healthy interactants. METHOD Twenty-nine neurologically healthy interactants participated in a quasi-conversational paradigm, in which they read sentences (response) in response to hearing prerecorded sentences (exposure) from speakers with dysarthria (n = 4) and healthy controls (n = 4). Recordings of read sentences prior to the task were also collected (habitual). RESULTS Findings revealed that interactants modified their speaking rate and pitch variation to align more closely with the disordered speech. Production shifts in these rhythmic properties, however, remained significantly different from corresponding properties in dysarthric speech. CONCLUSION Entrainment offers a new avenue for exploring speech and language impairment, addressing a communication process not currently explained by existing frameworks. This article offers direction for advancing this line of inquiry.
|
44
|
Miller N. Measuring up to speech intelligibility. INTERNATIONAL JOURNAL OF LANGUAGE & COMMUNICATION DISORDERS 2013; 48:601-612. [PMID: 24119170 DOI: 10.1111/1460-6984.12061] [Citation(s) in RCA: 45] [Impact Index Per Article: 4.1] [Indexed: 06/02/2023]
Abstract
Improvement or maintenance of speech intelligibility is a central aim in a whole range of conditions in speech-language therapy, both developmental and acquired. Best clinical practice and pursuance of the evidence base for interventions would suggest measurement of intelligibility forms a vital role in clinical decision-making and monitoring. However, what should be measured to gauge intelligibility and how this is achieved and relates to clinical planning continues to be a topic of debate. This review considers the strengths and weaknesses of selected clinical approaches to intelligibility assessment, stressing the importance of explanatory, diagnostic testing as both a more sensitive and a clinically informative method. The worth of this, and any approach, is predicated, though, on awareness and control of key design, elicitation, transcription and listening/listener variables to maximize validity and reliability of assessments. These are discussed. A distinction is drawn between signal-dependent and -independent factors in intelligibility evaluation. Discussion broaches how these different perspectives might be reconciled to deliver comprehensive insights into intelligibility levels and their clinical/educational significance. The paper ends with a call for wider implementation of best practice around intelligibility assessment.
Affiliation(s)
- Nick Miller
- Institute of Health and Society, Speech and Language Sciences, Newcastle University, Newcastle upon Tyne, UK
|
45
|
McAuliffe MJ, Gibson EMR, Kerr SE, Anderson T, LaShell PJ. Vocabulary influences older and younger listeners' processing of dysarthric speech. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2013; 134:1358-1368. [PMID: 23927132 DOI: 10.1121/1.4812764] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Indexed: 06/02/2023]
Abstract
This study examined younger (n = 16) and older (n = 16) listeners' processing of dysarthric speech, a naturally occurring form of signal degradation. It aimed to determine how age, hearing acuity, memory, and vocabulary knowledge interacted in speech recognition and lexical segmentation. Listener transcripts were coded for accuracy and pattern of lexical boundary errors. For younger listeners, transcription accuracy was predicted by receptive vocabulary. For older listeners, this same effect existed but was moderated by pure-tone hearing thresholds. While both groups employed syllabic stress cues to inform lexical segmentation, older listeners were less reliant on this perceptual strategy. The results were interpreted to suggest that individuals with larger receptive vocabularies, with their presumed greater language familiarity, were better able to leverage cue redundancies within the speech signal to form lexical hypotheses, leading to an improved ability to comprehend dysarthric speech. This advantage was minimized as hearing thresholds increased. While the differing levels of reliance on stress cues across the listener groups could not be attributed to specific individual differences, it was hypothesized that some combination of larger vocabularies and reduced hearing thresholds in the older participant group led them to prioritize lexical cues as a segmentation frame.
Affiliation(s)
- Megan J McAuliffe
- Department of Communication Disorders and New Zealand Institute of Language, Brain and Behaviour, University of Canterbury, Private Bag 4800, Christchurch 8140, New Zealand
|
46
|
Borrie SA, McAuliffe MJ, Liss JM, O'Beirne GA, Anderson TJ. The role of linguistic and indexical information in improved recognition of dysarthric speech. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2013; 133:474-82. [PMID: 23297919 PMCID: PMC4109309 DOI: 10.1121/1.4770239] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Received: 01/12/2012] [Revised: 11/08/2012] [Accepted: 11/12/2012] [Indexed: 05/15/2023]
Abstract
This investigation examined perceptual learning of dysarthric speech. Forty listeners were randomly assigned to one of two identification training tasks, aimed at highlighting either the linguistic (word identification task) or indexical (speaker identification task) properties of the neurologically degraded signal. Twenty additional listeners served as a control group, passively exposed to the training stimuli. Immediately following exposure to dysarthric speech, all three listener groups completed an identical phrase transcription task. Analysis of listener transcripts revealed remarkably similar intelligibility improvements for listeners trained to attend to either the linguistic or the indexical properties of the signal. Perceptual learning effects were also evaluated with regard to underlying error patterns indicative of segmental and suprasegmental processing. The findings of this study suggest that elements within both the linguistic and indexical properties of the dysarthric signal are learnable and interact to promote improved processing of this type and severity of speech degradation. Thus, the current study extends support for the development of a model of perceptual processing in which the learning of indexical properties is encoded and retained in conjunction with linguistic properties of the signal.
Affiliation(s)
- Stephanie A Borrie
- Department of Communication Disorders and New Zealand Institute of Language, Brain and Behaviour, University of Canterbury, Private Bag 4800, Christchurch 8140, New Zealand.
|
47
|
Lansford KL, Liss JM, Caviness JN, Utianski RL. A cognitive-perceptual approach to conceptualizing speech intelligibility deficits and remediation practice in hypokinetic dysarthria. PARKINSONS DISEASE 2011; 2011:150962. [PMID: 21918728 PMCID: PMC3171761 DOI: 10.4061/2011/150962] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.2] [Received: 04/09/2011] [Revised: 06/14/2011] [Accepted: 07/13/2011] [Indexed: 11/20/2022]
Abstract
Hypokinetic dysarthria is a common manifestation of Parkinson's disease, which negatively influences quality of life. Behavioral techniques that aim to improve speech intelligibility constitute the bulk of intervention strategies for this population, as the dysarthria does not often respond vigorously to medical interventions. Although several case and group studies generally support the efficacy of behavioral treatment, much work remains to establish a rigorous evidence base. This absence of definitive research leaves both the speech-language pathologist and referring physician with the task of determining the feasibility and nature of therapy for intelligibility remediation in PD. The purpose of this paper is to introduce a novel framework for medical practitioners in which to conceptualize and justify potential targets for speech remediation. The most commonly targeted deficits (e.g., speaking rate and vocal loudness) can be supported by this approach, as well as underutilized and novel treatment targets that aim at the listener's perceptual skills.
Affiliation(s)
- Kaitlin L Lansford
- Motor Speech Disorders Laboratory, Department of Speech and Hearing Science, Arizona State University, P.O. Box 870102, Tempe, AZ 85287-0102, USA
|