1
|
Okalidou A, Peng ZE, Banioti A, Fourakis M, Kyriafinis G. The Lombard effect in children with cochlear implants: suprasegmental aspects. CLINICAL LINGUISTICS & PHONETICS 2024; 38:699-719. [PMID: 38679889 DOI: 10.1080/02699206.2024.2340096] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/02/2023] [Revised: 04/02/2024] [Accepted: 04/02/2024] [Indexed: 05/01/2024]
Abstract
Children with cochlear implants (CI) communicate in noisy environments, such as in classrooms, where multiple talkers and reverberation are present. Speakers compensate for noise via the 'Lombard effect'. The present study examined the Lombard effect on the intensity and duration of stressed vowels in the speech of children with Cochlear Implants (CIs) as compared to children with Normal Hearing (NH), focusing on the effects of speech-shaped noise (SSN) and speech-shaped noise with reverberation (SSN+Reverberation). The sample consisted of 7 children with CIs and 7 children with NH, aged 7-12 years. Regarding intensity, a) children with CIs produced stressed vowels with an overall greater intensity across acoustic conditions as compared to NH peers, b) both groups increased their stressed vowel intensity for all vowels from Quiet to both noise conditions, and c) children with NH further increased their intensity when reverberation was added to SSN, esp. for the vowel /u/. Regarding duration, longer stressed vowels were produced by children with CIs as compared to NH in Quiet and SSN conditions but the effect was retained only for the vowels /i/, /o/ and /u/ when reverberation was added to noise. The SSN+Reverberation condition induced systematic lengthening in stressed vowels for children with NH. Furthermore, although greater intensity and duration ratios of stressed/unstressed syllables were observed for children with NH as compared to CIs in Quiet condition, they diminished with noise. The differences observed across groups have implications for speaking in classroom noise.
Collapse
Affiliation(s)
- Areti Okalidou
- Department of Educational & Social Policy, Graduate Program of Communication Disorders & Sciences, University of Macedonia, Thessaloniki, Greece
| | - Z Ellen Peng
- Functional Hearing Laboratory, Boys Town National Research Hospital, Omaha, Nebraska, USA
| | - Aggeliki Banioti
- Department of Educational & Social Policy, Graduate Program of Communication Disorders & Sciences, University of Macedonia, Thessaloniki, Greece
| | - Marios Fourakis
- Department of Hearing & Speech Sciences, University of Maryland, College Park, Maryland, USA
| | - Georgios Kyriafinis
- 1st University Otolaryngology Clinic of AHEPA Hospital, Medical School, Aristotle University of Thessaloniki, Thessaloniki, Greece
| |
Collapse
|
2
|
Littlejohn M, Maas E. How to cut the pie is no piece of cake: Toward a process-oriented approach to assessment and diagnosis of speech sound disorders. INTERNATIONAL JOURNAL OF LANGUAGE & COMMUNICATION DISORDERS 2023. [PMID: 37483105 DOI: 10.1111/1460-6984.12934] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/24/2023] [Accepted: 06/29/2023] [Indexed: 07/25/2023]
Abstract
BACKGROUND 'Speech sound disorder' is an umbrella term that encompasses dysarthria, articulation disorders, childhood apraxia of speech and phonological disorders. However, differential diagnosis between these disorders is a persistent challenge in speech pathology, as many diagnostic procedures use symptom clusters instead of identifying an origin of breakdown in the speech and language system. AIMS This article reviews typical and disordered speech through the lens of two well-developed models of production-one focused on phonological encoding and one focused on speech motor planning. We illustrate potential breakdown locations within these models that may relate to childhood apraxia of speech and phonological disorders. MAIN CONTRIBUTION This paper presents an overview of an approach to conceptualisation of speech sound disorders that is grounded in current models of speech production and emphasises consideration of underlying processes. The paper also sketches a research agenda for the development of valid, reliable and clinically feasible assessment protocols for children with speech sound disorders. CONCLUSION The process-oriented approach outlined here is in the early stages of development but holds promise for developing a more detailed and comprehensive understanding of, and assessment protocols for speech sound disorders that go beyond broad diagnostic labels based on error analysis. Directions for future research are discussed. WHAT THIS PAPER ADDS What is already known on the subject Speech sound disorders (SSD) are heterogeneous, and there is agreement that some children have a phonological impairment (phonological disorders, PD) whereas others have an impairment of speech motor planning (childhood apraxia of speech, CAS). There is also recognition that speech production involves multiple processes, and several approaches to the assessment and diagnosis of SSD have been proposed. What this paper adds to existing knowledge This paper provides a more detailed conceptualisation of potential impairments in children with SSD that is grounded in current models of speech production and encourages greater consideration of underlying processes. The paper illustrates this approach and provides guidance for further development. One consequence of this perspective is the notion that broad diagnostic category labels (PD, CAS) may each comprise different subtypes or profiles depending on the processes that are affected. What are the potential or actual clinical implications of this work? Although the approach is in the early stages of development and no comprehensive validated set of tasks and measures is available to assess all processes, clinicians may find the conceptualisation of different underlying processes and the notion of potential subtypes within PD and CAS informative when evaluating SSD. In addition, this perspective discourages either/or thinking (PD or CAS) and instead encourages consideration of the possibility that children may have different combinations of impairments at different processing stages.
Collapse
Affiliation(s)
- Meghan Littlejohn
- Department of Communication Sciences and Disorders, Temple University, Philadelphia, Pennsylvania, USA
| | - Edwin Maas
- Department of Communication Sciences and Disorders, Temple University, Philadelphia, Pennsylvania, USA
| |
Collapse
|
3
|
Cuadros J, Z-Rivera L, Castro C, Whitaker G, Otero M, Weinstein A, Martínez-Montes E, Prado P, Zañartu M. DIVA Meets EEG: Model Validation Using Formant-Shift Reflex. APPLIED SCIENCES (BASEL, SWITZERLAND) 2023; 13:7512. [PMID: 38435340 PMCID: PMC10906992 DOI: 10.3390/app13137512] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 03/05/2024]
Abstract
The neurocomputational model 'Directions into Velocities of Articulators' (DIVA) was developed to account for various aspects of normal and disordered speech production and acquisition. The neural substrates of DIVA were established through functional magnetic resonance imaging (fMRI), providing physiological validation of the model. This study introduces DIVA_EEG an extension of DIVA that utilizes electroencephalography (EEG) to leverage the high temporal resolution and broad availability of EEG over fMRI. For the development of DIVA_EEG, EEG-like signals were derived from original equations describing the activity of the different DIVA maps. Synthetic EEG associated with the utterance of syllables was generated when both unperturbed and perturbed auditory feedback (first formant perturbations) were simulated. The cortical activation maps derived from synthetic EEG closely resembled those of the original DIVA model. To validate DIVA_EEG, the EEG of individuals with typical voices (N = 30) was acquired during an altered auditory feedback paradigm. The resulting empirical brain activity maps significantly overlapped with those predicted by DIVA_EEG. In conjunction with other recent model extensions, DIVA_EEG lays the foundations for constructing a complete neurocomputational framework to tackle vocal and speech disorders, which can guide model-driven personalized interventions.
Collapse
Affiliation(s)
- Jhosmary Cuadros
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso 2390123, Chile
- Advanced Center for Electrical and Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso 2390123, Chile
- Grupo de Bioingeniería, Decanato de Investigación, Universidad Nacional Experimental del Táchira, San Cristóbal 5001, Venezuela
| | - Lucía Z-Rivera
- Advanced Center for Electrical and Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso 2390123, Chile
- Escuela de Ingeniería Civil Biomédica, Facultad de Ingeniería, Universidad de Valparaíso, Valparaíso 2350026, Chile
| | - Christian Castro
- Advanced Center for Electrical and Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso 2390123, Chile
- Escuela de Ingeniería Civil Biomédica, Facultad de Ingeniería, Universidad de Valparaíso, Valparaíso 2350026, Chile
| | - Grace Whitaker
- Advanced Center for Electrical and Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso 2390123, Chile
| | - Mónica Otero
- Facultad de Ingeniería, Arquitectura y Diseño, Universidad San Sebastián, Santiago 8420524, Chile
- Centro Basal Ciencia & Vida, Universidad San Sebastián, Santiago 8580000, Chile
| | - Alejandro Weinstein
- Advanced Center for Electrical and Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso 2390123, Chile
- Escuela de Ingeniería Civil Biomédica, Facultad de Ingeniería, Universidad de Valparaíso, Valparaíso 2350026, Chile
| | | | - Pavel Prado
- Escuela de Fonoaudiología, Facultad de Odontología y Ciencias de la Rehabilitación, Universidad San Sebastián, Santiago 7510602, Chile
| | - Matías Zañartu
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso 2390123, Chile
- Advanced Center for Electrical and Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso 2390123, Chile
| |
Collapse
|
4
|
Arias-Vergara T, Batliner A, Rader T, Polterauer D, Högerle C, Müller J, Orozco-Arroyave JR, Nöth E, Schuster M. Adult Cochlear Implant Users Versus Typical Hearing Persons: An Automatic Analysis of Acoustic-Prosodic Parameters. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2022; 65:4623-4636. [PMID: 36417788 DOI: 10.1044/2022_jslhr-21-00116] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
PURPOSE The aim of this study was to investigate the speech prosody of postlingually deaf cochlear implant (CI) users compared with control speakers without hearing or speech impairment. METHOD Speech recordings of 74 CI users (37 males and 37 females) and 72 age-balanced control speakers (36 males and 36 females) are considered. All participants are German native speakers and read Der Nordwind und die Sonne (The North Wind and the Sun), a standard text in pathological speech analysis and phonetic transcriptions. Automatic acoustic analysis is performed considering pitch, loudness, and duration features, including speech rate and rhythm. RESULTS In general, duration and rhythm features differ between CI users and control speakers. CI users read slower and have a lower voiced segment ratio compared with control speakers. A lower voiced ratio goes along with a prolongation of the voiced segments' duration in male and with a prolongation of pauses in female CI users. Rhythm features in CI users have higher variability in the duration of vowels and consonants than in control speakers. The use of bilateral CIs showed no advantages concerning speech prosody features in comparison to unilateral use of CI. CONCLUSIONS Even after cochlear implantation and rehabilitation, the speech of postlingually deaf adults deviates from the speech of control speakers, which might be due to changed auditory feedback. We suggest considering changes in temporal aspects of speech in future rehabilitation strategies. SUPPLEMENTAL MATERIAL https://doi.org/10.23641/asha.21579171.
Collapse
Affiliation(s)
- Tomás Arias-Vergara
- Department of Otorhinolaryngology and Head and Neck Surgery, Ludwig Maximilians University of Munich, Germany
- Faculty of Engineering, Universidad de Antioquia, Medellín, Colombia
- Pattern Recognition Lab, Friedrich-Alexander University, Erlangen-Nuremberg, Germany
| | - Anton Batliner
- Pattern Recognition Lab, Friedrich-Alexander University, Erlangen-Nuremberg, Germany
- Chair of Embedded Intelligence for Health Care and Wellbeing, University of Augsburg, Germany
| | - Tobias Rader
- Department of Otorhinolaryngology and Head and Neck Surgery, Ludwig Maximilians University of Munich, Germany
| | - Daniel Polterauer
- Department of Otorhinolaryngology and Head and Neck Surgery, Ludwig Maximilians University of Munich, Germany
| | - Catalina Högerle
- Department of Otorhinolaryngology and Head and Neck Surgery, Ludwig Maximilians University of Munich, Germany
| | - Joachim Müller
- Department of Otorhinolaryngology and Head and Neck Surgery, Ludwig Maximilians University of Munich, Germany
| | - Juan-Rafael Orozco-Arroyave
- Faculty of Engineering, Universidad de Antioquia, Medellín, Colombia
- Pattern Recognition Lab, Friedrich-Alexander University, Erlangen-Nuremberg, Germany
| | - Elmar Nöth
- Pattern Recognition Lab, Friedrich-Alexander University, Erlangen-Nuremberg, Germany
| | - Maria Schuster
- Department of Otorhinolaryngology and Head and Neck Surgery, Ludwig Maximilians University of Munich, Germany
| |
Collapse
|
5
|
Senthinathan A, Adams S, Page AD, Jog M. Speech Intensity Response to Altered Intensity Feedback in Individuals With Parkinson's Disease. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2021; 64:2261-2275. [PMID: 33830820 DOI: 10.1044/2021_jslhr-20-00278] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Purpose Hypophonia (low speech intensity) is the most common speech symptom experienced by individuals with Parkinson's disease (IWPD). Previous research suggests that, in IWPD, there may be abnormal integration of sensory information for motor production of speech intensity. In the current study, intensity of auditory feedback was systematically manipulated (altered in both positive and negative directions) during sensorimotor conditions that are known to modulate speech intensity in everyday contexts in order to better understand the role of auditory feedback for speech intensity regulation. Method Twenty-six IWPD and 24 neurologically healthy controls were asked to complete the following tasks: converse with the experimenter, start vowel production, and read sentences at a comfortable loudness, while hearing their own speech intensity randomly altered. Altered intensity feedback conditions included 5-, 10-, and 15-dB reductions and increases in the feedback intensity. Speech tasks were completed in no noise and in background noise. Results IWPD displayed a reduced response to the altered intensity feedback compared to control participants. This reduced response was most apparent when participants were speaking in background noise. Specific task-based differences in responses were observed such that the reduced response by IWPD was most pronounced during the conversation task. Conclusions The current study suggests that IWPD have abnormal processing of auditory information for speech intensity regulation, and this disruption particularly impacts their ability to regulate speech intensity in the context of speech tasks with clear communicative goals (i.e., conversational speech) and speaking in background noise.
Collapse
Affiliation(s)
| | - Scott Adams
- Department of Health and Rehabilitation Sciences, Western University, London, Ontario, Canada
- School of Communication Sciences and Disorders, Western University, London, Ontario, Canada
- Department of Clinical Neurological Sciences, Western University, London, Ontario, Canada
| | - Allyson D Page
- Department of Health and Rehabilitation Sciences, Western University, London, Ontario, Canada
- School of Communication Sciences and Disorders, Western University, London, Ontario, Canada
| | - Mandar Jog
- Department of Clinical Neurological Sciences, Western University, London, Ontario, Canada
| |
Collapse
|
6
|
Terband H, Rodd J, Maas E. Testing hypotheses about the underlying deficit of apraxia of speech through computational neural modelling with the DIVA model. INTERNATIONAL JOURNAL OF SPEECH-LANGUAGE PATHOLOGY 2020; 22:475-486. [PMID: 31630555 DOI: 10.1080/17549507.2019.1669711] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Purpose: A recent behavioural experiment featuring a noise masking paradigm suggests that Apraxia of Speech (AOS) reflects a disruption of feedforward control, whereas feedback control is spared and plays a more prominent role in achieving and maintaining segmental contrasts. The present study set out to validate the interpretation of AOS as a possible feedforward impairment using computational neural modelling with the DIVA (Directions Into Velocities of Articulators) model.Method: In a series of computational simulations with the DIVA model featuring a noise-masking paradigm mimicking the behavioural experiment, we investigated the effect of a feedforward, feedback, feedforward + feedback, and an upper motor neuron dysarthria impairment on average vowel spacing and dispersion in the production of six/bVt/speech targets.Result: The simulation results indicate that the output of the model with the simulated feedforward deficit resembled the group findings for the human speakers with AOS best.Conclusion: These results provide support to the interpretation of the human observations, corroborating the notion that AOS can be conceptualised as a deficit in feedforward control.
Collapse
Affiliation(s)
- Hayo Terband
- Utrecht Institute of Linguistics-OTS, Utrecht University, Utrecht, The Netherlands
| | - Joe Rodd
- Max Planck Institute for Psycholinguistics, Nijmegen, The Netherlands
| | - Edwin Maas
- Department of Communication Sciences and Disorders, Temple University, Philadelphia, PA, USA
| |
Collapse
|
7
|
Smith DJ, Stepp C, Guenther FH, Kearney E. Contributions of Auditory and Somatosensory Feedback to Vocal Motor Control. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2020; 63:2039-2053. [PMID: 32603626 PMCID: PMC7838841 DOI: 10.1044/2020_jslhr-19-00296] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/13/2023]
Abstract
Purpose To better define the contributions of somatosensory and auditory feedback in vocal motor control, a laryngeal perturbation experiment was conducted with and without masking of auditory feedback. Method Eighteen native speakers of English produced a sustained vowel while their larynx was physically and externally displaced on a subset of trials. For the condition with auditory masking, speech-shaped noise was played via earphones at 90 dB SPL. Responses to the laryngeal perturbation were compared to responses by the same participants to an auditory perturbation experiment that involved a 100-cent downward shift in fundamental frequency (f o). Responses were also examined in relation to a measure of auditory acuity. Results Compensatory responses to the laryngeal perturbation were observed with and without auditory masking. The level of compensation was greatest in the laryngeal perturbation condition without auditory masking, followed by the condition with auditory masking; the level of compensation was smallest in the auditory perturbation experiment. No relationship was found between the degree of compensation to auditory versus laryngeal perturbations, and the variation in responses in both perturbation experiments was not related to auditory acuity. Conclusions The findings indicate that somatosensory and auditory feedback control mechanisms work together to compensate for laryngeal perturbations, resulting in the greatest degree of compensation when both sources of feedback are available. In contrast, these two control mechanisms work in competition in response to auditory perturbations, resulting in an overall smaller degree of compensation. Supplemental Material https://doi.org/10.23641/asha.12559628.
Collapse
Affiliation(s)
- Dante J. Smith
- Graduate Program for Neuroscience, Boston University, MA
| | - Cara Stepp
- Graduate Program for Neuroscience, Boston University, MA
- Department of Speech, Language and Hearing Science, Boston University, MA
- Department of Biomedical Engineering, Boston University, MA
| | - Frank H. Guenther
- Graduate Program for Neuroscience, Boston University, MA
- Department of Speech, Language and Hearing Science, Boston University, MA
- Department of Biomedical Engineering, Boston University, MA
| | - Elaine Kearney
- Department of Speech, Language and Hearing Science, Boston University, MA
| |
Collapse
|
8
|
Reilly KJ. Vowel and Sibilant Production in Noise: Effects of Noise Frequency and Phonological Similarity. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2020; 63:1002-1017. [PMID: 32293944 PMCID: PMC7242990 DOI: 10.1044/2020_jslhr-19-00345] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/04/2019] [Revised: 11/20/2019] [Accepted: 12/30/2019] [Indexed: 06/11/2023]
Abstract
Purpose This study investigated vowel and sibilant productions in noise to determine whether responses to noise (a) are sensitive to the spectral characteristics of the noise signal and (b) are modulated by the contribution of vowel or sibilant contrasts to word discrimination. Method Vowel and sibilant productions were elicited during serial recall of three-word sequences that were produced in quiet or during exposure to speaker-specific noise signals. These signals either masked a speaker's productions of the sibilants /s/ and /ʃ/ or their productions of the vowels /a/ and /æ/. The contribution of the vowel and sibilant contrasts to word discrimination in a sequence was manipulated by varying the number of times that the target sibilant and vowel pairs occurred in the same word position in each sequence. Results Spectral noise effects were observed for both sibilants and vowels: Responses to noise were larger and/or involved to more acoustic features when the noise signal masked the acoustic characteristics of that phoneme class. Word discrimination effects were limited and consisted of only small increases in vowel duration. Interaction effects between noise and similarity indicated that the phonological similarity of sequences containing both sibilants and/or both vowels influenced articulation in ways not related to speech clarity. Conclusion The findings of this study indicate that sensorimotor control of speech exhibits some sensitivity to noise spectral characteristics. However, productions of sibilants and vowels were not sensitive to their importance in discriminating the words in a sequence. In addition, phonological similarity effects were observed that likely reflected processing demands related to the recall and sequencing of high-similarity words.
Collapse
Affiliation(s)
- Kevin J. Reilly
- Department of Audiology and Speech Pathology, College of Health Professions, University of Tennessee Health Science Center, Memphis
| |
Collapse
|
9
|
van Brenk F, Terband H. Compensatory and adaptive responses to real-time formant shifts in adults and children. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2020; 147:2261. [PMID: 32359278 DOI: 10.1121/10.0001018] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/12/2019] [Accepted: 03/17/2020] [Indexed: 06/11/2023]
Abstract
Auditory feedback plays an important role in speech motor learning, yet, little is known about the strength of motor learning and feedback control in speech development. This study investigated compensatory and adaptive responses to auditory feedback perturbation in children (aged 4-9 years old) and young adults (aged 18-29 years old). Auditory feedback was perturbed by near-real-time shifting F1 and F2 of the vowel /ɪː/ during the production of consonant-vowel-consonant words. Children were able to compensate and adapt in a similar or larger degree compared to young adults. Higher token-to-token variability was found in children compared to adults but not disproportionately higher during the perturbation phases compared to the unperturbed baseline. The added challenge to auditory-motor integration did not influence production variability in children, and compensation and adaptation effects were found to be strong and sustainable. Significant group differences were absent in the proportions of speakers displaying a compensatory or adaptive response, an amplifying response, or no consistent response. Within these categories, children produced significantly stronger compensatory, adaptive, or amplifying responses, which could be explained by less-ingrained existing representations. The results are interpreted as both auditory-motor integration and learning capacities are stronger in young children compared to adults.
Collapse
Affiliation(s)
- Frits van Brenk
- Department of Communicative Disorders and Sciences, University at Buffalo, 122 Cary Hall, 3435 Main Street, Buffalo, New York 14214, USA
| | - Hayo Terband
- Utrecht Institute of Linguistics-OTS, Utrecht University, Trans 10, Room 0.25, 3512 JK Utrecht, The Netherlands
| |
Collapse
|
10
|
Taitelbaum-Swead R, Avivi M, Gueta B, Fostick L. The effect of delayed auditory feedback (DAF) and frequency altered feedback (FAF) on speech production: cochlear implanted versus normal hearing individuals. CLINICAL LINGUISTICS & PHONETICS 2019; 33:628-640. [PMID: 30704295 DOI: 10.1080/02699206.2019.1574313] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/18/2018] [Revised: 01/17/2019] [Accepted: 01/21/2019] [Indexed: 06/09/2023]
Abstract
Normal auditory feedback contributes to moment-to-moment control of speech production. Effects of auditory feedback's absence on hearing-impaired individuals are widely documented, but auditory perturbation has not been investigated. Our objective was to evaluate the effect of delayed auditory feedback (DAF) and frequency altered feedback (FAF) on speech production among prelingual cochlear implant (CI) users and normal hearing (NH) individuals, to evaluate CI users' reliance on auditory feedback. Twenty young adults (10 CI, 10 NH), without developmental and cognitive impairments, participated in the study. Under variable auditory feedback conditions, speech production (spontaneous or reading aloud) was measured using speech rate, percentage of interruptions, fundamental frequency (F0), and relative intensity. Results showed that (1) both DAF and FAF caused slower speech rates and more interruptions while reading aloud, with DAF having larger effect; (2) altered feedback produced no differences between groups, except an increase in F0 for CI users during DAF; and (3) CI users' ability to understand speech via phone and without lip-reading was positively correlated with performance under DAF. These findings suggest that auditory perturbation similarly affects speech production among prelingual CI users and NH individuals, indicating CI users depend on auditory feedback to the same degree as normal hearing individuals.
Collapse
Affiliation(s)
- Riki Taitelbaum-Swead
- a Department of Communication Disorders , Ariel University , Ariel , Israel
- b Meuhedet Health Services , Tel Aviv , Israel
| | - Michal Avivi
- a Department of Communication Disorders , Ariel University , Ariel , Israel
| | - Batel Gueta
- a Department of Communication Disorders , Ariel University , Ariel , Israel
| | - Leah Fostick
- a Department of Communication Disorders , Ariel University , Ariel , Israel
| |
Collapse
|
11
|
Garnier M, Ménard L, Alexandre B. Hyper-articulation in Lombard speech: An active communicative strategy to enhance visible speech cues? THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2018; 144:1059. [PMID: 30180713 DOI: 10.1121/1.5051321] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/22/2017] [Accepted: 08/02/2018] [Indexed: 06/08/2023]
Abstract
This study investigates the hypothesis that speakers make active use of the visual modality in production to improve their speech intelligibility in noisy conditions. Six native speakers of Canadian French produced speech in quiet conditions and in 85 dB of babble noise, in three situations: interacting face-to-face with the experimenter (AV), using the auditory modality only (AO), or reading aloud (NI, no interaction). The audio signal was recorded with the three-dimensional movements of their lips and tongue, using electromagnetic articulography. All the speakers reacted similarly to the presence vs absence of communicative interaction, showing significant speech modifications with noise exposure in both interactive and non-interactive conditions, not only for parameters directly related to voice intensity or for lip movements (very visible) but also for tongue movements (less visible); greater adaptation was observed in interactive conditions, though. However, speakers reacted differently to the availability or unavailability of visual information: only four speakers enhanced their visible articulatory movements more in the AV condition. These results support the idea that the Lombard effect is at least partly a listener-oriented adaptation. However, to clarify their speech in noisy conditions, only some speakers appear to make active use of the visual modality.
Collapse
Affiliation(s)
- Maëva Garnier
- Centre National de la Recherche Scientifique, Laboratoire Grenoble Images Parole Signal Automatique, 11 rue des Mathématiques, Grenoble Campus, Boîte Postale 46, F-38402 Saint Martin d'Hères Cedex, France
| | - Lucie Ménard
- Département de Linguistique, Laboratoire de Phonétique, Center for Research on Brain, Language, and Music, Université du Québec à Montréal, 320, Ste-Catherine Est, Montréal, Quebec H2X 1L7, Canada
| | - Boris Alexandre
- Centre National de la Recherche Scientifique, Laboratoire Grenoble Images Parole Signal Automatique, 11 rue des Mathématiques, Grenoble Campus, Boîte Postale 46, F-38402 Saint Martin d'Hères Cedex, France
| |
Collapse
|
12
|
Reilly KJ, Pettibone C. Vowel generalization and its relation to adaptation during perturbations of auditory feedback. J Neurophysiol 2017; 118:2925-2934. [PMID: 28835529 PMCID: PMC5686240 DOI: 10.1152/jn.00702.2016] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2016] [Revised: 08/21/2017] [Accepted: 08/22/2017] [Indexed: 11/22/2022] Open
Abstract
Repeated perturbations of auditory feedback during vowel production elicit changes not only in the production of the perturbed vowel (adaptation) but also in the production of nearby vowels that were not perturbed (generalization). The finding that adaptation generalizes to other, nonperturbed vowels suggests that sensorimotor representations for vowels are not independent; instead, the goals for producing any one vowel may depend in part on the goals for other vowels. The present study investigated the dependence or independence of vowel representations by evaluating adaptation and generalization in two groups of speakers exposed to auditory perturbations of their first formant (F1) during different vowels. The speakers in both groups who adapted to the perturbation exhibited generalization in two nonperturbed vowels that were produced under masking noise. Correlation testing was performed to evaluate the relations between adaptation and generalization as well as between the generalization in the two nonperturbed vowels. These tests identified significant coupling between the F1 changes of adjacent vowels but not nonadjacent vowels. The pattern of correlation findings indicates that generalization was due in part to feedforward representations that are partly shared across adjacent vowels, possibly to maintain their acoustic contrast.NEW & NOTEWORTHY Speech adaptations to alterations, or perturbations, of auditory feedback have provided important insights into sensorimotor representations underlying speech. One finding from these studies that is yet to be accounted for is vowel generalization, which describes the effects of repeated perturbations to one vowel on the production of other vowels that were not perturbed. The present study used correlation testing to quantify the effects of changes in a perturbed vowel on neighboring (i.e., similar) nonperturbed vowels. The results identified significant correlations between the changes of adjacent, but not nonadjacent, vowel pairs. This finding suggests that generalization is partly a response to adaptation and not solely due to the auditory perturbation.
Collapse
Affiliation(s)
- Kevin J Reilly
- Department of Audiology and Speech Pathology, University of Tennessee Health Science Center, Knoxville, Tennessee; and
| | | |
Collapse
|
13
|
Maas E, Mailend ML. Fricative Contrast and Coarticulation in Children With and Without Speech Sound Disorders. AMERICAN JOURNAL OF SPEECH-LANGUAGE PATHOLOGY 2017; 26:649-663. [PMID: 28654946 PMCID: PMC5576970 DOI: 10.1044/2017_ajslp-16-0110] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/16/2016] [Revised: 08/22/2016] [Accepted: 10/09/2016] [Indexed: 06/07/2023]
Abstract
PURPOSE The purpose of this study was, first, to expand our understanding of typical speech development regarding segmental contrast and anticipatory coarticulation, and second, to explore the potential diagnostic utility of acoustic measures of fricative contrast and anticipatory coarticulation in children with speech sound disorders (SSD). METHOD In a cross-sectional design, 10 adults, 17 typically developing children, and 11 children with SSD repeated carrier phrases with novel words with fricatives (/s/, /ʃ/). Dependent measures were 2 ratios derived from spectral mean, obtained from perceptually accurate tokens. Group analyses compared adults and typically developing children; individual children with SSD were compared to their respective typically developing peers. RESULTS Typically developing children demonstrated smaller fricative acoustic contrast than adults but similar coarticulatory patterns. Three children with SSD showed smaller fricative acoustic contrast than their typically developing peers, and 2 children showed abnormal coarticulation. The 2 children with abnormal coarticulation both had a clinical diagnosis of childhood apraxia of speech; no clear pattern was evident regarding SSD subtype for smaller fricative contrast. CONCLUSIONS Children have not reached adult-like speech motor control for fricative production by age 10 even when fricatives are perceptually accurate. Present findings also suggest that abnormal coarticulation but not reduced fricative contrast is SSD-subtype-specific. SUPPLEMENTAL MATERIALS S1: https://doi.org/10.23641/asha.5103070. S2 and S3: https://doi.org/10.23641/asha.5106508.
Collapse
Affiliation(s)
- Edwin Maas
- Department of Communication Sciences and Disorders, Temple University, Philadelphia, PA
| | - Marja-Liisa Mailend
- Department of Communication Sciences and Disorders, Temple University, Philadelphia, PA
| |
Collapse
|
14
|
Cler GJ, Lee JC, Mittelman T, Stepp CE, Bohland JW. Kinematic Analysis of Speech Sound Sequencing Errors Induced by Delayed Auditory Feedback. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2017; 60:1695-1711. [PMID: 28655038 PMCID: PMC5544401 DOI: 10.1044/2017_jslhr-s-16-0234] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/14/2016] [Revised: 10/07/2016] [Accepted: 11/16/2016] [Indexed: 06/07/2023]
Abstract
PURPOSE Delayed auditory feedback (DAF) causes speakers to become disfluent and make phonological errors. Methods for assessing the kinematics of speech errors are lacking, with most DAF studies relying on auditory perceptual analyses, which may be problematic, as errors judged to be categorical may actually represent blends of sounds or articulatory errors. METHOD Eight typical speakers produced nonsense syllable sequences under normal and DAF (200 ms). Lip and tongue kinematics were captured with electromagnetic articulography. Time-locked acoustic recordings were transcribed, and the kinematics of utterances with and without perceived errors were analyzed with existing and novel quantitative methods. RESULTS New multivariate measures showed that for 5 participants, kinematic variability for productions perceived to be error free was significantly increased under delay; these results were validated by using the spatiotemporal index measure. Analysis of error trials revealed both typical productions of a nontarget syllable and productions with articulatory kinematics that incorporated aspects of both the target and the perceived utterance. CONCLUSIONS This study is among the first to characterize articulatory changes under DAF and provides evidence for different classes of speech errors, which may not be perceptually salient. New methods were developed that may aid visualization and analysis of large kinematic data sets. SUPPLEMENTAL MATERIAL https://doi.org/10.23641/asha.5103067.
Collapse
Affiliation(s)
- Gabriel J. Cler
- Graduate Program for Neuroscience–Computational Neuroscience, Boston University, MA
- Department of Speech, Language, and Hearing Sciences, Boston University, MA
| | | | | | - Cara E. Stepp
- Graduate Program for Neuroscience–Computational Neuroscience, Boston University, MA
- Department of Speech, Language, and Hearing Sciences, Boston University, MA
- Department of Biomedical Engineering, Boston University, MA
- School of Medicine, Department of Otolaryngology–Head and Neck Surgery, Boston University, MA
| | - Jason W. Bohland
- Graduate Program for Neuroscience–Computational Neuroscience, Boston University, MA
- Department of Speech, Language, and Hearing Sciences, Boston University, MA
- Department of Health Sciences, Boston University, MA
| |
Collapse
|
15
|
Abstract
Speech sound disorders (SSDs) are commonly viewed as involving impaired articulation and/or phonological skills. Speech language pathologists working with individuals with (SSDs) assess the articulation of speech sounds and the coordination of articulatory structures with other components of the speech mechanism, including the phonatory, respiratory, and resonatory subsystems. The sound system of the language and the rules that govern how phonemes are combined are equally critical for clinicians to explore. While the terms “articulation” and “phonology” provide clinicians with a framework for classification, children who are broadly identified with (SSDs) may also display characteristics of a motor speech impairment, which can obscure the decision making process with respect to both diagnosis and treatment. One such motor speech disorder is childhood apraxia of speech (CAS). The focus of this paper is to discuss motor speech deficits in children and to review research that aims to distinguish motor speech patterns in children with (SSDs) with and without CAS. We will also address the relationship between emerging speech motor and linguistic skills.
Collapse
Affiliation(s)
- Maria I. Grigos
- Department of Communicative Sciences and Disorders, New York University
New York, NY
| |
Collapse
|
16
|
Casserly ED. Effects of real-time cochlear implant simulation on speech production. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2015; 137:2791-2800. [PMID: 25994707 PMCID: PMC4441710 DOI: 10.1121/1.4916965] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/15/2014] [Revised: 03/17/2015] [Accepted: 03/24/2015] [Indexed: 05/30/2023]
Abstract
Investigations using normal-hearing subjects listening to simulations of cochlear implant (CI) acoustic processing have provided substantial information about the impact of these distorted listening conditions on the accuracy of auditory perception, but extensions of this method to the domain of speech production have been limited. In the present study, a portable, real-time vocoder was used to simulate conditions of CI auditory feedback during speech production in NH subjects. Acoustic-phonetic characteristics of sibilant fricatives, aspirated stops, and F1/F2 vowel qualities were analyzed for changes as a result of CI simulation of acoustic speech feedback. Significant changes specific to F1 were observed; speakers reduced their phonological vowel height contrast, typically via talker-specific raising of the low vowels [æ] and [ɑ] or lowering of high vowels [i] and [u]. Comparisons to the results of both localized feedback perturbation procedures and investigations of speech production in deaf adults with CIs are discussed.
Collapse
Affiliation(s)
- Elizabeth D Casserly
- Department of Psychological and Brain Sciences, Speech Research Laboratory, Indiana University, 1101 East 10th Street, Bloomington, Indiana 47405
| |
Collapse
|
17
|
Maas E, Mailend ML, Guenther FH. Feedforward and feedback control in apraxia of speech: effects of noise masking on vowel production. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2015; 58:185-200. [PMID: 25565143 PMCID: PMC4398652 DOI: 10.1044/2014_jslhr-s-13-0300] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/01/2013] [Revised: 07/14/2014] [Accepted: 10/29/2014] [Indexed: 05/08/2023]
Abstract
PURPOSE This study was designed to test two hypotheses about apraxia of speech (AOS) derived from the Directions Into Velocities of Articulators (DIVA) model (Guenther et al., 2006): the feedforward system deficit hypothesis and the feedback system deficit hypothesis. METHOD The authors used noise masking to minimize auditory feedback during speech. Six speakers with AOS and aphasia, 4 with aphasia without AOS, and 2 groups of speakers without impairment (younger and older adults) participated. Acoustic measures of vowel contrast, variability, and duration were analyzed. RESULTS Younger, but not older, speakers without impairment showed significantly reduced vowel contrast with noise masking. Relative to older controls, the AOS group showed longer vowel durations overall (regardless of masking condition) and a greater reduction in vowel contrast under masking conditions. There were no significant differences in variability. Three of the 6 speakers with AOS demonstrated the group pattern. Speakers with aphasia without AOS did not differ from controls in contrast, duration, or variability. CONCLUSION The greater reduction in vowel contrast with masking noise for the AOS group is consistent with the feedforward system deficit hypothesis but not with the feedback system deficit hypothesis; however, effects were small and not present in all individual speakers with AOS. Theoretical implications and alternative interpretations of these findings are discussed.
Collapse
|
18
|
Ballard KJ, Tourville JA, Robin DA. Behavioral, computational, and neuroimaging studies of acquired apraxia of speech. Front Hum Neurosci 2014; 8:892. [PMID: 25404911 PMCID: PMC4217373 DOI: 10.3389/fnhum.2014.00892] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2014] [Accepted: 10/16/2014] [Indexed: 12/14/2022] Open
Abstract
A critical examination of speech motor control depends on an in-depth understanding of network connectivity associated with Brodmann areas 44 and 45 and surrounding cortices. Damage to these areas has been associated with two conditions-the speech motor programming disorder apraxia of speech (AOS) and the linguistic/grammatical disorder of Broca's aphasia. Here we focus on AOS, which is most commonly associated with damage to posterior Broca's area (BA) and adjacent cortex. We provide an overview of our own studies into the nature of AOS, including behavioral and neuroimaging methods, to explore components of the speech motor network that are associated with normal and disordered speech motor programming in AOS. Behavioral, neuroimaging, and computational modeling studies are indicating that AOS is associated with impairment in learning feedforward models and/or implementing feedback mechanisms and with the functional contribution of BA6. While functional connectivity methods are not yet routinely applied to the study of AOS, we highlight the need for focusing on the functional impact of localized lesions throughout the speech network, as well as larger scale comparative studies to distinguish the unique behavioral and neurological signature of AOS. By coupling these methods with neural network models, we have a powerful set of tools to improve our understanding of the neural mechanisms that underlie AOS, and speech production generally.
Collapse
Affiliation(s)
| | - Jason A. Tourville
- Department of Speech, Language, and Hearing Sciences, Boston UniversityBoston, MA, USA
- Center for Computational Neuroscience and Neural Technology (CompNet), Boston UniversityBoston, MA, USA
| | - Donald A. Robin
- Departments of Neurology, Radiology, Biomedical Engineering, Research Imaging Institute, University of Texas Health Science Center San AntonioSan Antonio, TX, USA
- Biomedical Engineering, Honor’s College, University of Texas at San AntonioTX, USA
| |
Collapse
|
19
|
Terband H, van Brenk F, van Doornik-van der Zee A. Auditory feedback perturbation in children with developmental speech sound disorders. JOURNAL OF COMMUNICATION DISORDERS 2014; 51:64-77. [PMID: 25127854 DOI: 10.1016/j.jcomdis.2014.06.009] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/19/2014] [Revised: 06/07/2014] [Accepted: 06/30/2014] [Indexed: 06/03/2023]
Abstract
BACKGROUND/PURPOSE Several studies indicate a close relation between auditory and speech motor functions in children with speech sound disorders (SSD). The aim of this study was to investigate the ability to compensate and adapt for perturbed auditory feedback in children with SSD compared to age-matched normally developing children. METHOD 17 normally developing children aged 4.1-8.7 years (mean=5.5, SD=1.4), and 11 children with SSD aged 3.9-7.5 years (mean=5.1, SD=1.0) participated in the study. Auditory feedback was perturbed by real-time shifting the first and second formant of the vowel /e/ during the production of CVC words in a five-step paradigm (practice/familiarization; start/baseline; ramp; hold; end/release). RESULTS At the group level, the normally developing children were better able to compensate and adapt, adjusting their formant frequencies in the direction opposite to the perturbation, while the group of children with SSD followed (amplifying) the perturbation. However, large individual differences lie underneath. Furthermore, strong correlations were found between the amount of compensation and performance on oral motor movement non-word repetition tasks. CONCLUSIONS Results suggested that while most children with SSD can detect incongruencies in auditory feedback and can adapt their target representations, they are unable to compensate for perturbed auditory feedback. These findings suggest that impaired auditory-motor integration may play a key role in SSD. LEARNING OUTCOMES The reader will be able to: (1) describe the potential role of auditory feedback control in developmental speech disorders (SSD); (2) identify the neural control subsystems involved in feedback based speech motor control; (3) describe the differences between compensation and adaptation for perturbed auditory feedback; (4) explain why auditory-motor integration may play a key role in SSD.
Collapse
Affiliation(s)
- Hayo Terband
- Utrecht Institute of Linguistics-OTS, Utrecht University, Utrecht, The Netherlands; Centre for Language and Cognition (CLCG), University of Groningen, Groningen, The Netherlands.
| | - Frits van Brenk
- Utrecht Institute of Linguistics-OTS, Utrecht University, Utrecht, The Netherlands.
| | - Anniek van Doornik-van der Zee
- Utrecht Institute of Linguistics-OTS, Utrecht University, Utrecht, The Netherlands; Division of Speech and Language Therapy, HU University of Applied Sciences Utrecht, Utrecht, The Netherlands.
| |
Collapse
|
20
|
The listening talker: A review of human and algorithmic context-induced modifications of speech. COMPUT SPEECH LANG 2014. [DOI: 10.1016/j.csl.2013.08.003] [Citation(s) in RCA: 61] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
21
|
Reilly KJ, Dougherty KE. The role of vowel perceptual cues in compensatory responses to perturbations of speech auditory feedback. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2013; 134:1314-23. [PMID: 23927128 PMCID: PMC3745484 DOI: 10.1121/1.4812763] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/13/2023]
Abstract
The perturbation of acoustic features in a speaker's auditory feedback elicits rapid compensatory responses that demonstrate the importance of auditory feedback for control of speech output. The current study investigated whether responses to a perturbation of speech auditory feedback vary depending on the importance of the perturbed feature to perception of the vowel being produced. Auditory feedback of speakers' first formant frequency (F1) was shifted upward by 130 mels in randomly selected trials during the speakers' production of consonant-vowel-consonant words containing either the vowel /Λ/ or the vowel /ɝ/. Although these vowels exhibit comparable F1 frequencies, the contribution of F1 to perception of /Λ/ is greater than its contribution to perception of /ɝ/. Compensation to the F1 perturbation was observed during production of both vowels, but compensatory responses during /Λ/ occurred at significantly shorter latencies and exhibited significantly larger magnitudes than compensatory responses during /ɝ/. The finding that perturbation of vowel F1 during /Λ/ and /ɝ/ yielded compensatory differences that mirrored the contributions of F1 to perception of these vowels indicates that some portion of feedback control is weighted toward monitoring and preservation of acoustic cues for speech perception.
Collapse
Affiliation(s)
- Kevin J Reilly
- Department of Speech-Language Pathology and Audiology, Northeastern University, 360 Huntington Avenue, Boston, Massachusetts 02115, USA.
| | | |
Collapse
|
22
|
Perkell JS. Movement goals and feedback and feedforward control mechanisms in speech production. JOURNAL OF NEUROLINGUISTICS 2012; 25:382-407. [PMID: 22661828 PMCID: PMC3361736 DOI: 10.1016/j.jneuroling.2010.02.011] [Citation(s) in RCA: 61] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
Studies of speech motor control are described that support a theoretical framework in which fundamental control variables for phonemic movements are multi-dimensional regions in auditory and somatosensory spaces. Auditory feedback is used to acquire and maintain auditory goals and in the development and function of feedback and feedforward control mechanisms. Several lines of evidence support the idea that speakers with more acute sensory discrimination acquire more distinct goal regions and therefore produce speech sounds with greater contrast. Feedback modification findings indicate that fluently produced sound sequences are encoded as feedforward commands, and feedback control serves to correct mismatches between expected and produced sensory consequences.
Collapse
Affiliation(s)
- Joseph S Perkell
- Speech Communication Group, Massachusetts Institute of Technology, Research Laboratory of Electronics, Room 36-591, 50 Vassar St., Cambridge, MA 02139-4307, United States
| |
Collapse
|
23
|
Tourville JA, Guenther FH. The DIVA model: A neural theory of speech acquisition and production. ACTA ACUST UNITED AC 2011; 26:952-981. [PMID: 23667281 DOI: 10.1080/01690960903498424] [Citation(s) in RCA: 379] [Impact Index Per Article: 29.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
The DIVA model of speech production provides a computationally and neuroanatomically explicit account of the network of brain regions involved in speech acquisition and production. An overview of the model is provided along with descriptions of the computations performed in the different brain regions represented in the model. The latest version of the model, which contains a new right-lateralized feedback control map in ventral premotor cortex, will be described, and experimental results that motivated this new model component will be discussed. Application of the model to the study and treatment of communication disorders will also be briefly described.
Collapse
Affiliation(s)
- Jason A Tourville
- Department of Cognitive and Neural Systems, Boston University, 677 Beacon Street, Boston, MA, 02215, Telephone: (617) 353-5765, Fax Number: (617) 353-7755,
| | | |
Collapse
|
24
|
Ghosh SS, Matthies ML, Maas E, Hanson A, Tiede M, Ménard L, Guenther FH, Lane H, Perkell JS. An investigation of the relation between sibilant production and somatosensory and auditory acuity. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2010; 128:3079-87. [PMID: 21110603 PMCID: PMC3003728 DOI: 10.1121/1.3493430] [Citation(s) in RCA: 51] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/13/2023]
Abstract
The relation between auditory acuity, somatosensory acuity and the magnitude of produced sibilant contrast was investigated with data from 18 participants. To measure auditory acuity, stimuli from a synthetic sibilant continuum ([s]-[ʃ]) were used in a four-interval, two-alternative forced choice adaptive-staircase discrimination task. To measure somatosensory acuity, small plastic domes with grooves of different spacing were pressed against each participant's tongue tip and the participant was asked to identify one of four possible orientations of the grooves. Sibilant contrast magnitudes were estimated from productions of the words 'said,' 'shed,' 'sid,' and 'shid'. Multiple linear regression revealed a significant relation indicating that a combination of somatosensory and auditory acuity measures predicts produced acoustic contrast. When the participants were divided into high- and low-acuity groups based on their median somatosensory and auditory acuity measures, separate ANOVA analyses with sibilant contrast as the dependent variable yielded a significant main effect for each acuity group. These results provide evidence that sibilant productions have auditory as well as somatosensory goals and are consistent with prior results and the theoretical framework underlying the DIVA model of speech production.
Collapse
Affiliation(s)
- Satrajit S Ghosh
- Speech Communication Group, Research Laboratory of Electronics, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA.
| | | | | | | | | | | | | | | | | |
Collapse
|
25
|
Cai S, Ghosh SS, Guenther FH, Perkell JS. Adaptive auditory feedback control of the production of formant trajectories in the Mandarin triphthong /iau/ and its pattern of generalization. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2010; 128:2033-48. [PMID: 20968374 PMCID: PMC2981117 DOI: 10.1121/1.3479539] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/16/2023]
Abstract
In order to test whether auditory feedback is involved in the planning of complex articulatory gestures in time-varying phonemes, the current study examined native Mandarin speakers' responses to auditory perturbations of their auditory feedback of the trajectory of the first formant frequency during their production of the triphthong /iau/. On average, subjects adaptively adjusted their productions to partially compensate for the perturbations in auditory feedback. This result indicates that auditory feedback control of speech movements is not restricted to quasi-static gestures in monophthongs as found in previous studies, but also extends to time-varying gestures. To probe the internal structure of the mechanisms of auditory-motor transformations, the pattern of generalization of the adaptation learned on the triphthong /iau/ to other vowels with different temporal and spatial characteristics (produced only under masking noise) was tested. A broad but weak pattern of generalization was observed; the strength of the generalization diminished with increasing dissimilarity from /iau/. The details and implications of the pattern of generalization are examined and discussed in light of previous sensorimotor adaptation studies of both speech and limb motor control and a neurocomputational model of speech motor control.
Collapse
Affiliation(s)
- Shanqing Cai
- Speech and Hearing Bioscience and Technology Program, Harvard-MIT Division of Health Sciences and Technology, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139, USA
| | | | | | | |
Collapse
|
26
|
Nasir SM, Ostry DJ. Speech motor learning in profoundly deaf adults. Nat Neurosci 2008; 11:1217-22. [PMID: 18794839 PMCID: PMC2601702 DOI: 10.1038/nn.2193] [Citation(s) in RCA: 53] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2008] [Accepted: 08/05/2008] [Indexed: 11/10/2022]
Abstract
Speech production, like other sensorimotor behaviors, relies on multiple sensory inputs--audition, proprioceptive inputs from muscle spindles and cutaneous inputs from mechanoreceptors in the skin and soft tissues of the vocal tract. However, the capacity for intelligible speech by deaf speakers suggests that somatosensory input alone may contribute to speech motor control and perhaps even to speech learning. We assessed speech motor learning in cochlear implant recipients who were tested with their implants turned off. A robotic device was used to alter somatosensory feedback by displacing the jaw during speech. We found that implant subjects progressively adapted to the mechanical perturbation with training. Moreover, the corrections that we observed were for movement deviations that were exceedingly small, on the order of millimeters, indicating that speakers have precise somatosensory expectations. Speech motor learning is substantially dependent on somatosensory input.
Collapse
Affiliation(s)
- Sazzad M Nasir
- Department of Psychology, McGill University, 1205 Dr. Penfield Avenue, Montreal, Quebec H3A1B1, Canada
| | | |
Collapse
|
27
|
Ménard L, Polak M, Denny M, Burton E, Lane H, Matthies ML, Marrone N, Perkell JS, Tiede M, Vick J. Interactions of speaking condition and auditory feedback on vowel production in postlingually deaf adults with cochlear implants. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2007; 121:3790-801. [PMID: 17552727 DOI: 10.1121/1.2710963] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/15/2023]
Abstract
This study investigates the effects of speaking condition and auditory feedback on vowel production by postlingually deafened adults. Thirteen cochlear implant users produced repetitions of nine American English vowels prior to implantation, and at one month and one year after implantation. There were three speaking conditions (clear, normal, and fast), and two feedback conditions after implantation (implant processor turned on and off). Ten normal-hearing controls were also recorded once. Vowel contrasts in the formant space (expressed in mels) were larger in the clear than in the fast condition, both for controls and for implant users at all three time samples. Implant users also produced differences in duration between clear and fast conditions that were in the range of those obtained from the controls. In agreement with prior work, the implant users had contrast values lower than did the controls. The implant users' contrasts were larger with hearing on than off and improved from one month to one year postimplant. Because the controls and implant users responded similarly to a change in speaking condition, it is inferred that auditory feedback, although demonstrably important for maintaining normative values of vowel contrasts, is not needed to maintain the distinctiveness of those contrasts in different speaking conditions.
Collapse
Affiliation(s)
- Lucie Ménard
- Département de Linguistique, Université du Québec à Montréal, Montréal (Québec), H3C 3P8 Canada.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
28
|
Perkell JS, Lane H, Denny M, Matthies ML, Tiede M, Zandipour M, Vick J, Burton E. Time course of speech changes in response to unanticipated short-term changes in hearing state. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2007; 121:2296-311. [PMID: 17471743 DOI: 10.1121/1.2642349] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/15/2023]
Abstract
The timing of changes in parameters of speech production was investigated in six cochlear implant users by switching their implant microphones off and on a number of times in a single experimental session. The subjects repeated four short, two-word utterances, /dV1n#SV2d/ (S = /s/ or /S/), in quasi-random order. The changes between hearing and nonhearing states were introduced by a voice-activated switch at V1 onset. "Postural" measures were made of vowel sound pressure level (SPL), duration, F0; contrast measures were made of vowel separation (distance between pair members in the formant plane) and sibilant separation (difference in spectral means). Changes in parameter values were averaged over multiple utterances, lined up with respect to the switch. No matter whether prosthetic hearing was blocked or restored, contrast measures for vowels and sibilants did not change systematically. Some changes in duration, SPL and F0 were observed during the vowel within which hearing state was changed, V1, as well as during V2 and subsequent utterance repetitions. Thus, sound segment contrasts appear to be controlled differently from the postural parameters of speaking rate and average SPL and F0. These findings are interpreted in terms of the function of hypothesized feedback and feedforward mechanisms for speech motor control.
Collapse
Affiliation(s)
- Joseph S Perkell
- Speech Communication Group, Research Laboratory of Electronics, and Department of Brain and Cognitive Sciences, MIT Room 36-511, Cambridge, MA 02139, USA.
| | | | | | | | | | | | | | | |
Collapse
|