1
|
Tang DL. Using altered auditory feedback to study pitch compensation and adaptation in tonal language speakers. Front Hum Neurosci 2024; 18:1364803. [PMID: 38567000 PMCID: PMC10985180 DOI: 10.3389/fnhum.2024.1364803] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2024] [Accepted: 02/26/2024] [Indexed: 04/04/2024] Open
Abstract
Human speech production is strongly influenced by the auditory feedback it generates. Auditory feedback-what we hear when we speak-enables us to learn and maintain speaking skills and to rapidly correct errors in our speech. Over the last three decades, the real-time altered auditory feedback (AAF) paradigm has gained popularity as a tool to study auditory feedback control during speech production. This method involves changing a speaker's speech and feeding it back to them in near real time. More than 50% of the world's population speak tonal languages, in which the pitch or tone used to pronounce a word can change its meaning. This review article aims to offer an overview of the progression of AAF paradigm as a method to study pitch motor control among speakers of tonal languages. Eighteen studies were included in the current mini review and were compared based on their methodologies and results. Overall, findings from these studies provide evidence that tonal language speakers can compensate and adapt when receiving inconsistent and consistent pitch perturbations. Response magnitude and latency are influenced by a range of factors. Moreover, by combining AAF with brain stimulation and neuroimaging techniques, the neural basis of pitch motor control in tonal language speakers has been investigated. To sum up, AAF has been demonstrated to be an emerging tool for studying pitch motor control in speakers of tonal languages.
Collapse
Affiliation(s)
- Ding-lan Tang
- Human Communication, Development, and Information Sciences, Faculty of Education, The University of Hong Kong, Pokfulam, Hong Kong SAR, China
| |
Collapse
|
2
|
Dahl KL, Cádiz MD, Zuk J, Guenther FH, Stepp CE. Controlling Pitch for Prosody: Sensorimotor Adaptation in Linguistically Meaningful Contexts. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2024; 67:440-454. [PMID: 38241671 PMCID: PMC11000799 DOI: 10.1044/2023_jslhr-23-00460] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/02/2023] [Revised: 10/09/2023] [Accepted: 11/02/2023] [Indexed: 01/21/2024]
Abstract
PURPOSE This study examined how speakers adapt to fundamental frequency (fo) errors that affect the use of prosody to convey linguistic meaning, whether fo adaptation in that context relates to adaptation in linguistically neutral sustained vowels, and whether cue trading is reflected in responses in the prosodic cues of fo and amplitude. METHOD Twenty-four speakers said vowels and sentences while fo was digitally altered to induce predictable errors. Shifts in fo (±200 cents) were applied to the entire sustained vowel and one word (emphasized or unemphasized) in sentences. Two prosodic cues-fo and amplitude-were extracted. The effects of fo shifts, shift direction, and emphasis on fo response magnitude were evaluated with repeated-measures analyses of variance. Relationships between adaptive fo responses in sentences and vowels and between adaptive fo and amplitude responses were evaluated with Spearman correlations. RESULTS Speakers adapted to fo errors in both linguistically meaningful sentences and linguistically neutral vowels. Adaptive fo responses of unemphasized words were smaller than those of emphasized words when fo was shifted upward. There was no relationship between adaptive fo responses in vowels and emphasized words, but adaptive fo and amplitude responses were strongly, positively correlated. CONCLUSIONS Sensorimotor adaptation occurs in response to fo errors regardless of how disruptive the error is to linguistic meaning. Adaptation to fo errors during sustained vowels may not involve the exact same mechanisms as sensorimotor adaptation as it occurs in meaningful speech. The relationship between adaptive responses in fo and amplitude supports an integrated model of prosody. SUPPLEMENTAL MATERIAL https://doi.org/10.23641/asha.25008908.
Collapse
Affiliation(s)
- Kimberly L. Dahl
- Department of Speech, Language and Hearing Sciences, Boston University, MA
| | - Manuel Díaz Cádiz
- Department of Speech, Language and Hearing Sciences, Boston University, MA
| | - Jennifer Zuk
- Department of Speech, Language and Hearing Sciences, Boston University, MA
| | - Frank H. Guenther
- Department of Speech, Language and Hearing Sciences, Boston University, MA
- Department of Biomedical Engineering, Boston University, MA
| | - Cara E. Stepp
- Department of Speech, Language and Hearing Sciences, Boston University, MA
- Department of Biomedical Engineering, Boston University, MA
- Department of Otolaryngology–Head and Neck Surgery, Boston University School of Medicine, MA
| |
Collapse
|
3
|
Alves MDC, Mancini PC, Teixeira LC. Modifications of auditory feedback and its effects on the voice of adult subjects: a scoping review. Codas 2023; 36:e20220202. [PMID: 38126424 PMCID: PMC10750862 DOI: 10.1590/2317-1782/20232022202pt] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2022] [Accepted: 05/29/2023] [Indexed: 12/23/2023] Open
Abstract
INTRODUCTION The auditory perception of voice and its production involve auditory feedback, kinesthetic cues and the feedforward system that produce different effects for the voice. The Lombard, Sidetone and Pitch-Shift-Reflex effects are the most studied. The mapping of scientific experiments on changes in auditory feedback for voice motor control makes it possible to examine the existing literature on the phenomenon and may contribute to voice training or therapies. PURPOSE To map experiments and research results with manipulation of auditory feedback for voice motor control in adults. METHOD Scope review following the Checklist Preferred Reporting Items for Systematic reviews and Meta-Analyses extension (PRISMA-ScR) to answer the question: "What are the investigation methods and main research findings on the manipulation of auditory feedback in voice self-monitoring of adults?". The search protocol was based on the Population, Concept, and Context (PCC) mnemonic strategy, in which the population is adult individuals, the concept is the manipulation of auditory feedback and the context is on motor voice control. Articles were searched in the databases: BVS/Virtual Health Library, MEDLINE/Medical Literature Analysis and Retrieval System online, COCHRANE, CINAHL/Cumulative Index to Nursing and Allied Health Literature, SCOPUS and WEB OF SCIENCE. RESULTS 60 articles were found, 19 on the Lombard Effect, 25 on the Pitch-shift-reflex effect, 12 on the Sidetone effect and four on the Sidetone/Lombard effect. The studies are in agreement that the insertion of a noise that masks the auditory feedback causes an increase in the individual's speech intensity and that the amplification of the auditory feedback promotes the reduction of the sound pressure level in the voice production. A reflex response to the change in pitch is observed in the auditory feedback, however, with particular characteristics in each study. CONCLUSION The material and method of the experiments are different, there are no standardizations in the tasks, the samples are varied and often reduced. The methodological diversity makes it difficult to generalize the results. The main findings of research on auditory feedback on voice motor control confirm that in the suppression of auditory feedback, the individual tends to increase the intensity of the voice. In auditory feedback amplification, the individual decreases the intensity and has greater control over the fundamental frequency, and in frequency manipulations, the individual tends to correct the manipulation. The few studies with dysphonic individuals show that they behave differently from non-dysphonic individuals.
Collapse
Affiliation(s)
- Moisés do Carmo Alves
- Programa de Pós-graduação em Ciências Fonoaudiológicas, Departamento de Fonoaudiologia, Faculdade de Medicina, Universidade Federal de Minas Gerais – UFMG - Belo Horizonte (MG), Brasil.
| | - Patrícia Cotta Mancini
- Programa de Pós-graduação em Ciências Fonoaudiológicas, Departamento de Fonoaudiologia, Faculdade de Medicina, Universidade Federal de Minas Gerais – UFMG - Belo Horizonte (MG), Brasil.
| | - Leticia Caldas Teixeira
- Programa de Pós-graduação em Ciências Fonoaudiológicas, Departamento de Fonoaudiologia, Faculdade de Medicina, Universidade Federal de Minas Gerais – UFMG - Belo Horizonte (MG), Brasil.
| |
Collapse
|
4
|
Hilger A, Cole J, Larson C. Task-dependent pitch auditory feedback control in cerebellar ataxia. RESEARCH SQUARE 2023:rs.3.rs-3186155. [PMID: 37547022 PMCID: PMC10402250 DOI: 10.21203/rs.3.rs-3186155/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/08/2023]
Abstract
Purpose The purpose of this study was to investigate how ataxia affects the task-dependent role of pitch auditory feedback control in speech. In previous research, individuals with ataxia produced over-corrected, hypermetric compensatory responses to unexpected pitch and formant frequency perturbations in auditory feedback in sustained vowels and single words (Houde et al., 2019; Li et al., 2019; Parrell et al., 2017). In this study, we investigated whether ataxia would also affect the task-dependent role of the auditory feedback control system, measuring whether pitch-shift responses would be mediated by speech task or semantic focus pattern as they are in neurologically healthy speakers. Methods Twenty-two adults with ataxia and 29 age- and sex-matched control participants produced sustained vowels and sentences with and without corrective focus while their auditory feedback was briefly and unexpectedly perturbed in pitch by +/-200 cents. The magnitude and latency of the reflexive pitch-shift responses were measured as a reflection of auditory feedback control. Results Individuals with ataxia produced larger reflexive pitch-shift responses in both the sustained-vowel and sentence-production tasks than the control participants. Additionally, a differential response magnitude was observed by task and sentence focus pattern for both groups. Conclusion These findings demonstrate that even though accuracy of auditory feedback control correction is affected by cerebellar damage, as evidenced by the hypermetric responses, the system still retains efficiency in utilizing the task-dependent role of auditory feedback.
Collapse
|
5
|
Hilger AI, Cole J, Larson C. Semantic focus mediates pitch auditory feedback control in phrasal prosody. LANGUAGE, COGNITION AND NEUROSCIENCE 2022; 38:328-345. [PMID: 37065842 PMCID: PMC10103821 DOI: 10.1080/23273798.2022.2116060] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/14/2022]
Abstract
This study investigated the effect of semantic focus on pitch auditory feedback control in the production of phrasal prosody through an experiment using pitch-shifted auditory feedback. We hypothesized that pitch-shift responses would be mediated by semantic focus because highly informative focus types, such as corrective focus, impose more specific constraints on the prosodic form of a phrase and require greater consistency in the production of pitch excursions compared to sentences with no such focus elements. Twenty-eight participants produced sentences with and without corrective focus while their auditory feedback was briefly and unexpectedly perturbed in pitch by +/-200 cents at the start of the sentence. The magnitude and latency of the reflexive pitch-shift responses were measured as a reflection of auditory feedback control. Our results matched our prediction that corrective focus would elicit larger pitch-shift responses, supporting our hypothesis that auditory feedback control is mediated by semantic focus.
Collapse
Affiliation(s)
- Allison I. Hilger
- Department of Communication Sciences and Disorders, Northwestern University, Evanston, IL, United States
- Current department: Department of Speech, Language, and Hearing Sciences, University of Colorado Boulder, Boulder, CO, United States
| | - Jennifer Cole
- Department of Linguistics, Northwestern University, Evanston, IL, United States
| | - Charles Larson
- Department of Communication Sciences and Disorders, Northwestern University, Evanston, IL, United States
| |
Collapse
|
6
|
Ning LH. Comparison of Involuntary and Volitional Responses to Pitch-Shifted Auditory Feedback: Evidence for Tone Speakers' Flexibility to Switch Between Opposing and Following Responses. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2022; 65:2160-2186. [PMID: 35537117 DOI: 10.1044/2022_jslhr-21-00597] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
PURPOSE Our audio-vocal system involves a negative feedback system that functions to correct for fundamental frequency (f 0) errors in production. Therefore, automatic and opposing responses appear when an unexpected change in voice pitch is present in auditory feedback. This study explores following responses to pitch perturbation in auditory feedback in tonal language speakers, which have been commonly overlooked or discarded by past research. We examine whether the number of response types (opposing vs. following) and their dynamic f 0 contours in tone word production vary as a function of instruction (involuntary ["to ignore"] vs. volitional ["to compensate"]). METHOD Twenty-four native speakers of Taiwanese Southern Min (TSM) produced three disyllabic TSM words while receiving pitch perturbation through headphones. The three disyllabic words were tsau55-im55 ("out of tune"; HH), kau33-uann33 ("exchange"; MM), and pan11-an11 ("handle a case"; LL) that carry an identical high-, mid-, or low-level tone. The participants were instructed either "to ignore" or "to compensate" for the pitch shifts. RESULTS Results from a Bayesian Poisson regression show that the number of opposing and following responses were split nearly 50-50 for the "ignore" condition and 55%-35% for the "compensate" condition. The simulation results indicate that the speakers were able to switch between the feedback and the feedforward mode during the testing. On the other hand, f 0 contour analyses using generalized additive models show that pitch-increasing responses (i.e., oppose to downshifts or follow upshifts) were significantly larger than pitch-decreasing responses (i.e., oppose to upshifts or follow downshifts) for the MM and LL words, but not for the HH word. CONCLUSIONS Our results provide support for the view that, for tone speakers, following responses are not uncommon. The magnitudes of pitch shift response trajectories have to do with the available pitch range for moving up or down in tone word production.
Collapse
Affiliation(s)
- Li-Hsin Ning
- Department of English, National Taiwan Normal University, Taipei City
| |
Collapse
|
7
|
Ning LH. The effect of stimulus timing in compensating for pitch perturbation on flat, rising, and falling contours. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2022; 151:2530. [PMID: 35461497 DOI: 10.1121/10.0010237] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/10/2021] [Accepted: 03/28/2022] [Indexed: 06/14/2023]
Abstract
The purpose of this study was to explore vocal responses to pitch perturbation on the flat, rising, and falling contour made of sequences of level tones in Taiwanese Southern Min. Twenty-two native speakers produced nine disyllabic words (flat: high-high, mid-mid, and low-low tone sequences; rising: mid-high, low-high, and low-mid tone sequences; falling: high-mid, high-low, and mid-low tone sequences). Pitch-shift stimuli (200 ms) appeared at either 100 ms (the beginning of the first syllable) or 400 ms (the beginning of the second syllable) after vocal onset. The participants were asked to ignore the pitch perturbation that appeared via auditory feedback. We found their compensation decreased when both syllables had identical level tones (i.e., the flat contour) but was particularly large when the overall contour was falling. Furthermore, pitch compensation at 100 ms was smaller than at 400 ms for the falling contour, but not for the flat and rising contours. Our results suggest that less susceptibility to pitch perturbation in the initial speech planning process is conditioned by the velocity of overall pitch contour.
Collapse
Affiliation(s)
- Li-Hsin Ning
- Department of English, National Taiwan Normal University, 162 Heping East Road, Daan District, Taipei City 106, Taiwan
| |
Collapse
|
8
|
Raharjo I, Kothare H, Nagarajan SS, Houde JF. Speech compensation responses and sensorimotor adaptation to formant feedback perturbations. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2021; 149:1147. [PMID: 33639824 PMCID: PMC7892200 DOI: 10.1121/10.0003440] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/27/2020] [Revised: 01/11/2021] [Accepted: 01/13/2021] [Indexed: 06/11/2023]
Abstract
Control of speech formants is important for the production of distinguishable speech sounds and is achieved with both feedback and learned feedforward control. However, it is unclear whether the learning of feedforward control involves the mechanisms of feedback control. Speakers have been shown to compensate for unpredictable transient mid-utterance perturbations of pitch and loudness feedback, demonstrating online feedback control of these speech features. To determine whether similar feedback control mechanisms exist in the production of formants, responses to unpredictable vowel formant feedback perturbations were examined. Results showed similar within-trial compensatory responses to formant perturbations that were presented at utterance onset and mid-utterance. The relationship between online feedback compensation to unpredictable formant perturbations and sensorimotor adaptation to consistent formant perturbations was further examined. Within-trial online compensation responses were not correlated with across-trial sensorimotor adaptation. A detailed analysis of within-trial time course dynamics across trials during sensorimotor adaptation revealed that across-trial sensorimotor adaptation responses did not result from an incorporation of within-trial compensation response. These findings suggest that online feedback compensation and sensorimotor adaptation are governed by distinct neural mechanisms. These findings have important implications for models of speech motor control in terms of how feedback and feedforward control mechanisms are implemented.
Collapse
Affiliation(s)
- Inez Raharjo
- University of California, Berkeley and University of California, San Francisco, Graduate Program in Bioengineering
| | - Hardik Kothare
- University of California, Berkeley and University of California, San Francisco, Graduate Program in Bioengineering
| | - Srikantan S Nagarajan
- Biomagnetic Imaging Laboratory, Department of Radiology and Biomedical Imaging, University of California San Francisco, San Francisco, California 94143, USA
| | - John F Houde
- Speech Neuroscience Laboratory, Department of Otolaryngology-Head and Neck Surgery, University of California San Francisco, San Francisco, California 94143, USA
| |
Collapse
|
9
|
Hilger A, Cole J, Kim JH, Lester-Smith RA, Larson C. The Effect of Pitch Auditory Feedback Perturbations on the Production of Anticipatory Phrasal Prominence and Boundary. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2020; 63:2185-2201. [PMID: 32615845 DOI: 10.1044/2020_jslhr-19-00043] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Purpose In this study, we investigated how the direction and timing of a perturbation in voice pitch auditory feedback during phrasal production modulated the magnitude and latency of the pitch-shift reflex as well as the scaling of acoustic production of anticipatory intonation targets for phrasal prominence and boundary. Method Brief pitch auditory feedback perturbations (±200 cents for 200-ms duration) were applied during the production of a target phrase on the first or the second word of the phrase. To replicate previous work, we first measured the magnitude and latency of the pitch-shift reflex as a function of the direction and timing of the perturbation within the phrase. As a novel approach, we also measured the adjustment in the production of the phrase-final prominent word as a function of perturbation direction and timing by extracting the acoustic correlates of pitch, loudness, and duration. Results The pitch-shift reflex was greater in magnitude after perturbations on the first word of the phrase, replicating the results from Mandarin speakers in an American English-speaking population. Additionally, the production of the phrase-final prominent word was acoustically enhanced (lengthened vowel duration and increased intensity and fundamental frequency) after perturbations earlier in the phrase, but more so after perturbations on the first word in the phrase. Conclusion The timing of the pitch perturbation within the phrase modulated both the magnitude of the pitch-shift reflex and the production of the prominent word, supporting our hypothesis that speakers use auditory feedback to correct for immediate production errors and to scale anticipatory intonation targets during phrasal production.
Collapse
Affiliation(s)
- Allison Hilger
- Roxelyn and Richard Pepper Department of Communication Sciences and Disorders, Northwestern University, Evanston, IL
| | - Jennifer Cole
- Department of Linguistics, Northwestern University, Evanston, IL
| | - Jason H Kim
- Roxelyn and Richard Pepper Department of Communication Sciences and Disorders, Northwestern University, Evanston, IL
| | | | - Charles Larson
- Roxelyn and Richard Pepper Department of Communication Sciences and Disorders, Northwestern University, Evanston, IL
| |
Collapse
|
10
|
Kim JH, Larson CR. Modulation of auditory-vocal feedback control due to planned changes in voice f o. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2019; 145:1482. [PMID: 31067945 PMCID: PMC6433561 DOI: 10.1121/1.5094414] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/04/2018] [Revised: 02/11/2019] [Accepted: 02/26/2019] [Indexed: 06/09/2023]
Abstract
Previous studies have demonstrated that voice fundamental frequency (fo), or pitch, relies on auditory feedback to monitor and correct for errors in production. When voice-pitch auditory feedback is unexpectedly perturbed, individuals typically produce a compensatory change in fo that opposes the direction of the pitch-perturbation. Studies comparing steady vowel vocalizations and speech tasks have demonstrated task-dependent modulation of the compensatory response, but the effects of planning to volitionally change fo during active vocalization have yet to be explored. Ten musicians and ten non-musicians were asked to perform two vocal tasks. Both tasks started off at a conversational fo. In one task, pitch-shifted feedback was presented when the participants were planning to hold fo constant (steady fo), and in the other, feedback was shifted while participants were in the planning stage prior to raising fo (raised fo) from a steady state. Acoustical analyses of fo were performed to measure the peak magnitude and latency of both the compensatory response as well as the voluntary fo change. Results showed that planning to change pitch modulates the mechanisms controlling feedback-based error correction of fo, and musicality affects how individuals incorporate modulations in auditory feedback with the feedforward plans to increase voice fo.
Collapse
Affiliation(s)
- Jason H Kim
- Department of Communication Sciences and Disorders, Northwestern University, 2240 Campus Drive, Evanston, Illinois 60208, USA
| | - Charles R Larson
- Department of Communication Sciences and Disorders, Northwestern University, 2240 Campus Drive, Evanston, Illinois 60208, USA
| |
Collapse
|
11
|
Ning LH, Loucks TM, Shih C. Suppression of vocal responses to auditory perturbation with real-time visual feedback. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2018; 143:3698. [PMID: 29960493 DOI: 10.1121/1.5043383] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
Speakers can adjust pitch using auditory feedback through a short-latency corrective response known as the pitch-shift response (PSR). Suppression of the PSR denotes on-line stabilization of pitch. In this paper, the hypothesis that pitch-shift responses can be suppressed with real-time visual feedback of vocal F0 is investigated. Mandarin speakers and naive speakers without tonal language experience were instructed to produce the sustained vowel /a/ and Mandarin tone /ma1/ in an audio-only condition and a separate audio-visual condition. Both Mandarin speakers and naive speakers suppressed pitch-shift responses in the audio-visual condition, regardless of task (/a/ or /ma1/) and stimulus magnitude (25 cents or 200 cents). These findings suggest that multisensory feedback (audio-visual) can improve the stability of voice F0. The benefit of audio-visual integration is independent of language experience.
Collapse
Affiliation(s)
- Li-Hsin Ning
- Department of English, National Taiwan Normal University, 162 Heping East Road, Daan District, Taipei City 106, Taiwan
| | - Torrey M Loucks
- Department of Communication Sciences and Disorders, University of Alberta, 8205 114 Saint Edmonton, Alberta, T6G 2G4, Canada
| | - Chilin Shih
- Department of Linguistics, University of Illinois Urbana-Champaign, 707 South Mathews Avenue, MC-168, Urbana, Illinois 61801, USA
| |
Collapse
|
12
|
Li W, Guo Z, Jones JA, Huang X, Chen X, Liu P, Chen S, Liu H. Training of Working Memory Impacts Neural Processing of Vocal Pitch Regulation. Sci Rep 2015; 5:16562. [PMID: 26553373 PMCID: PMC4639724 DOI: 10.1038/srep16562] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2015] [Accepted: 10/15/2015] [Indexed: 11/26/2022] Open
Abstract
Working memory training can improve the performance of tasks that were not trained. Whether auditory-motor integration for voice control can benefit from working memory training, however, remains unclear. The present event-related potential (ERP) study examined the impact of working memory training on the auditory-motor processing of vocal pitch. Trained participants underwent adaptive working memory training using a digit span backwards paradigm, while control participants did not receive any training. Before and after training, both trained and control participants were exposed to frequency-altered auditory feedback while producing vocalizations. After training, trained participants exhibited significantly decreased N1 amplitudes and increased P2 amplitudes in response to pitch errors in voice auditory feedback. In addition, there was a significant positive correlation between the degree of improvement in working memory capacity and the post-pre difference in P2 amplitudes. Training-related changes in the vocal compensation, however, were not observed. There was no systematic change in either vocal or cortical responses for control participants. These findings provide evidence that working memory training impacts the cortical processing of feedback errors in vocal pitch regulation. This enhanced cortical processing may be the result of increased neural efficiency in the detection of pitch errors between the intended and actual feedback.
Collapse
Affiliation(s)
- Weifeng Li
- Department of Rehabilitation Medicine, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, 510080, P. R. China
| | - Zhiqiang Guo
- Department of Biomedical Engineering, School of Engineering, Sun Yat-sen University, Guangzhou, China, 510006
| | - Jeffery A Jones
- Psychology Department and Laurier Centre for Cognitive Neuroscience, Wilfrid Laurier University, Waterloo, Ontario, N2L 3C5, Canada
| | - Xiyan Huang
- Department of Rehabilitation Medicine, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, 510080, P. R. China
| | - Xi Chen
- Department of Rehabilitation Medicine, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, 510080, P. R. China
| | - Peng Liu
- Department of Rehabilitation Medicine, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, 510080, P. R. China
| | - Shaozhen Chen
- Department of Rehabilitation Medicine, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, 510080, P. R. China
| | - Hanjun Liu
- Department of Rehabilitation Medicine, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, 510080, P. R. China.,Department of Biomedical Engineering, School of Engineering, Sun Yat-sen University, Guangzhou, China, 510006
| |
Collapse
|
13
|
Okazaki S, Mori K, Okada M. Principal component analysis reveals differential attentional modulation of the vocal response to pitch perturbation. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2014; 136:334-340. [PMID: 24993217 DOI: 10.1121/1.4881921] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]
Abstract
The auditory-vocal system modifies voice fundamental frequency (F0) with auditory feedback. The responses to F0 changes in auditory feedback are known to depend on the task. The hypothesis explored in this study is that the task dependency is the result of multiple components of the F0 responses differently modulated with different tasks. Attention to audition was manipulated by task condition by the instruction to ignore or to count the number of the F0 shifts heard during vocalization. A synthetic voice with pitch shifts was used as auditory pseudo-feedback. The upward and downward shifts evoked very similar vocal F0 response patterns with polarity reversal. Attention to the auditory feedback caused a reduction in the grand-average response amplitude. By decomposing the F0 responses with principal component analysis (PCA), three principal components (PCs) with different peak latencies were found to have contributions above the criterion of 5%, totaling to 74%. All three PCs contributed to a compensatory response under the "ignore" condition. The slowest PC changed its polarity and the intermediate PC was reduced to almost zero under the "count" condition. Thus, the task-dependency of the F0 response to auditory feedback can be described in terms of different sensitivities of components to attention.
Collapse
Affiliation(s)
- Shuntaro Okazaki
- Research Institute of National Rehabilitation Center for Persons with Disabilities, 4-1 Namiki, Tokorozawa, Saitama, 359-8555, Japan
| | - Koichi Mori
- Research Institute of National Rehabilitation Center for Persons with Disabilities, 4-1 Namiki, Tokorozawa, Saitama, 359-8555, Japan
| | - Minae Okada
- Research Institute of National Rehabilitation Center for Persons with Disabilities, 4-1 Namiki, Tokorozawa, Saitama, 359-8555, Japan
| |
Collapse
|
14
|
Developmental sex-specific change in auditory–vocal integration: ERP evidence in children. Clin Neurophysiol 2013; 124:503-13. [PMID: 23036182 DOI: 10.1016/j.clinph.2012.08.024] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2012] [Revised: 08/27/2012] [Accepted: 08/29/2012] [Indexed: 10/27/2022]
|
15
|
Chen Z, Liu P, Wang EQ, Larson CR, Huang D, Liu H. ERP correlates of language-specific processing of auditory pitch feedback during self-vocalization. BRAIN AND LANGUAGE 2012; 121:25-34. [PMID: 22377260 DOI: 10.1016/j.bandl.2012.02.004] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/02/2011] [Revised: 01/16/2012] [Accepted: 02/03/2012] [Indexed: 05/28/2023]
Abstract
The present study investigated whether the neural correlates for auditory feedback control of vocal pitch can be shaped by tone language experience. Event-related potentials (P2/N1) were recorded from adult native speakers of Mandarin and Cantonese who heard their voice auditory feedback shifted in pitch by -50, -100, -200, or -500 cents when they sustained the vowel sound /u/. Cantonese speakers produced larger P2 amplitudes to -200 or -500 cents stimuli than Mandarin speakers, but this language effect failed to reach significance in the case of -50 or -100 cents. Moreover, Mandarin speakers produced shorter N1 latencies over the left hemisphere than the right hemisphere, whereas Cantonese speakers did not. These findings demonstrate that neural processing of auditory pitch feedback in vocal motor control is subject to language-dependent neural plasticity, suggesting that cortical mechanisms of auditory-vocal integration can be shaped by tone language experience.
Collapse
Affiliation(s)
- Zhaocong Chen
- Department of Rehabilitation Medicine, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, PR China
| | | | | | | | | | | |
Collapse
|
16
|
Liu H, Wang EQ, Metman LV, Larson CR. Vocal responses to perturbations in voice auditory feedback in individuals with Parkinson's disease. PLoS One 2012; 7:e33629. [PMID: 22448258 PMCID: PMC3308986 DOI: 10.1371/journal.pone.0033629] [Citation(s) in RCA: 69] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2011] [Accepted: 02/16/2012] [Indexed: 11/19/2022] Open
Abstract
BACKGROUND One of the most common symptoms of speech deficits in individuals with Parkinson's disease (PD) is significantly reduced vocal loudness and pitch range. The present study investigated whether abnormal vocalizations in individuals with PD are related to sensory processing of voice auditory feedback. Perturbations in loudness or pitch of voice auditory feedback are known to elicit short latency, compensatory responses in voice amplitude or fundamental frequency. METHODOLOGY/PRINCIPAL FINDINGS Twelve individuals with Parkinson's disease and 13 age- and sex-matched healthy control subjects sustained a vowel sound (/α/) and received unexpected, brief (200 ms) perturbations in voice loudness (±3 or 6 dB) or pitch (±100 cents) auditory feedback. Results showed that, while all subjects produced compensatory responses in their voice amplitude or fundamental frequency, individuals with PD exhibited larger response magnitudes than the control subjects. Furthermore, for loudness-shifted feedback, upward stimuli resulted in shorter response latencies than downward stimuli in the control subjects but not in individuals with PD. CONCLUSIONS/SIGNIFICANCE The larger response magnitudes in individuals with PD compared with the control subjects suggest that processing of voice auditory feedback is abnormal in PD. Although the precise mechanisms of the voice feedback processing are unknown, results of this study suggest that abnormal voice control in individuals with PD may be related to dysfunctional mechanisms of error detection or correction in sensory feedback processing.
Collapse
Affiliation(s)
- Hanjun Liu
- Department of Rehabilitation Medicine, The First Affiliated Hospital, Sun Yat-sen University Guangzhou, People's Republic of China
| | - Emily Q. Wang
- Departments of Communication Disorders and Sciences, and Otolaryngology, Rush University Medical Center, Chicago, Illinois, United States of America
| | - Leo Verhagen Metman
- Department of Neurological Sciences, Rush University Medical Center, Chicago, Illinois, United States of America
| | - Charles R. Larson
- Department of Communication Science and Disorders, Northwestern University, Evanston, Illinois, United States of America
- * E-mail:
| |
Collapse
|
17
|
Differential effects of perturbation direction and magnitude on the neural processing of voice pitch feedback. Clin Neurophysiol 2011; 122:951-7. [PMID: 20869305 DOI: 10.1016/j.clinph.2010.08.010] [Citation(s) in RCA: 82] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2010] [Revised: 08/17/2010] [Accepted: 08/18/2010] [Indexed: 11/23/2022]
Abstract
OBJECTIVE The present study examined the differential effects of voice auditory feedback perturbation direction and magnitude on voice fundamental frequency (F(0)) responses and event-related potentials (ERPs) from EEG electrodes on the scalp. METHODS The voice F(0) responses and N1 and P2 components of ERPs were examined from 12 right-handed speakers when they sustained a vowel phonation and their mid-utterance voice pitch feedback was shifted ±100, ±200, and ±500 cents with 200 ms duration. RESULTS Downward voice pitch feedback perturbations led to larger voice F(0) responses than upward perturbations. The amplitudes of N1 and P2 components were larger for downward compared with upward pitch-shifts for 200 and 500 cents stimulus magnitudes. Shorter N1 and P2 latencies were also associated with larger magnitudes of pitch feedback perturbations. CONCLUSIONS Corresponding changes in vocal and neural responses to upward and downward voice pitch feedback perturbations suggest that the N1 and P2 components of ERPs reflect neural concomitants of the vocal responses. SIGNIFICANCE The findings of interactive effects between the magnitude and direction of voice feedback pitch perturbation on N1 and P2 ERP components indicate that the neural mechanisms underlying error detection and correction in voice pitch auditory feedback are differentially sensitive to both the magnitude and direction of pitch perturbations.
Collapse
|
18
|
Liu H, Wang EQ, Chen Z, Liu P, Larson CR, Huang D. Effect of tonal native language on voice fundamental frequency responses to pitch feedback perturbations during sustained vocalizations. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2010; 128:3739-3746. [PMID: 21218905 PMCID: PMC3037774 DOI: 10.1121/1.3500675] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/13/2010] [Revised: 07/20/2010] [Accepted: 09/15/2010] [Indexed: 05/30/2023]
Abstract
The purpose of this cross-language study was to examine whether the online control of voice fundamental frequency (F(0)) during vowel phonation is influenced by language experience. Native speakers of Cantonese and Mandarin, both tonal languages spoken in China, participated in the experiments. Subjects were asked to vocalize a vowel sound /u/at their comfortable habitual F(0), during which their voice pitch was unexpectedly shifted (± 50, ± 100, ± 200, or ± 500 cents, 200 ms duration) and fed back instantaneously to them over headphones. The results showed that Cantonese speakers produced significantly smaller responses than Mandarin speakers when the stimulus magnitude varied from 200 to 500 cents. Further, response magnitudes decreased along with the increase in stimulus magnitude in Cantonese speakers, which was not observed in Mandarin speakers. These findings suggest that online control of voice F(0) during vocalization is sensitive to language experience. Further, systematic modulations of vocal responses across stimulus magnitude were observed in Cantonese speakers but not in Mandarin speakers, which indicates that this highly automatic feedback mechanism is sensitive to the specific tonal system of each language.
Collapse
Affiliation(s)
- Hanjun Liu
- Department of Rehabilitation Medicine, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou 510080, People's Republic of China.
| | | | | | | | | | | |
Collapse
|
19
|
Chen Z, Liu P, Jones JA, Huang D, Liu H. Sex-related differences in vocal responses to pitch feedback perturbations during sustained vocalization. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2010; 128:EL355-EL360. [PMID: 21218857 DOI: 10.1121/1.3509124] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/30/2023]
Abstract
The present study assessed the effect of sex on voice fundamental frequency (F(0)) responses to pitch feedback perturbations during sustained vocalization. Sixty-four native-Mandarin speakers heard their voice pitch feedback shifted at ± 50, ± 100, or ± 200 cents for 200 ms, five times during each vocalization. The results showed that, as compared to female speakers, male speakers produced significantly larger but slower vocal responses to the pitch-shifted stimuli. These findings reveal a modulation of vocal response as a function of sex, and suggest that there may be a differential processing of vocal pitch feedback perturbations between men and women.
Collapse
Affiliation(s)
- Zhaocong Chen
- Department of Rehabilitation Medicine, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou 510080, People's Republic of China.
| | | | | | | | | |
Collapse
|
20
|
Liu H, Wang EQ, Chen Z, Liu P, Larson CR, Huang D. Effect of tonal native language on voice fundamental frequency responses to pitch feedback perturbations during sustained vocalizations. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2010; 128:3739-3746. [PMID: 21218905 DOI: 10.1121/1.3666047] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/28/2023]
Abstract
The purpose of this cross-language study was to examine whether the online control of voice fundamental frequency (F(0)) during vowel phonation is influenced by language experience. Native speakers of Cantonese and Mandarin, both tonal languages spoken in China, participated in the experiments. Subjects were asked to vocalize a vowel sound /u/at their comfortable habitual F(0), during which their voice pitch was unexpectedly shifted (± 50, ± 100, ± 200, or ± 500 cents, 200 ms duration) and fed back instantaneously to them over headphones. The results showed that Cantonese speakers produced significantly smaller responses than Mandarin speakers when the stimulus magnitude varied from 200 to 500 cents. Further, response magnitudes decreased along with the increase in stimulus magnitude in Cantonese speakers, which was not observed in Mandarin speakers. These findings suggest that online control of voice F(0) during vocalization is sensitive to language experience. Further, systematic modulations of vocal responses across stimulus magnitude were observed in Cantonese speakers but not in Mandarin speakers, which indicates that this highly automatic feedback mechanism is sensitive to the specific tonal system of each language.
Collapse
Affiliation(s)
- Hanjun Liu
- Department of Rehabilitation Medicine, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou 510080, People's Republic of China.
| | | | | | | | | | | |
Collapse
|
21
|
Cai S, Ghosh SS, Guenther FH, Perkell JS. Adaptive auditory feedback control of the production of formant trajectories in the Mandarin triphthong /iau/ and its pattern of generalization. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2010; 128:2033-48. [PMID: 20968374 PMCID: PMC2981117 DOI: 10.1121/1.3479539] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/16/2023]
Abstract
In order to test whether auditory feedback is involved in the planning of complex articulatory gestures in time-varying phonemes, the current study examined native Mandarin speakers' responses to auditory perturbations of their auditory feedback of the trajectory of the first formant frequency during their production of the triphthong /iau/. On average, subjects adaptively adjusted their productions to partially compensate for the perturbations in auditory feedback. This result indicates that auditory feedback control of speech movements is not restricted to quasi-static gestures in monophthongs as found in previous studies, but also extends to time-varying gestures. To probe the internal structure of the mechanisms of auditory-motor transformations, the pattern of generalization of the adaptation learned on the triphthong /iau/ to other vowels with different temporal and spatial characteristics (produced only under masking noise) was tested. A broad but weak pattern of generalization was observed; the strength of the generalization diminished with increasing dissimilarity from /iau/. The details and implications of the pattern of generalization are examined and discussed in light of previous sensorimotor adaptation studies of both speech and limb motor control and a neurocomputational model of speech motor control.
Collapse
Affiliation(s)
- Shanqing Cai
- Speech and Hearing Bioscience and Technology Program, Harvard-MIT Division of Health Sciences and Technology, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139, USA
| | | | | | | |
Collapse
|
22
|
Liu H, Russo NM, Larson CR. Age-related differences in vocal responses to pitch feedback perturbations: a preliminary study. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2010; 127:1042-6. [PMID: 20136225 PMCID: PMC2830265 DOI: 10.1121/1.3273880] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/24/2009] [Revised: 11/12/2009] [Accepted: 11/14/2009] [Indexed: 05/24/2023]
Abstract
The present study tested the effect of age on voice fundamental frequency (F(0)) responses to pitch-shifted feedback. Pitch-shift stimuli (-100 cents, 200 ms duration) were presented to 30 native-English speakers: 10 children (7-12 yrs), 10 younger adults (19-21 yrs), and 10 older adults (60-73 yrs). Significantly larger response magnitudes were found in the older group compared to the children and young adult groups, while the longest latencies were associated with the children group as compared to the two adult groups. These findings provide preliminary evidence of an age effect on the modulation of vocal responses to pitch-shifted feedback.
Collapse
Affiliation(s)
- Hanjun Liu
- Department of Rehabilitation Medicine, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou 510080, People's Republic of China
| | | | | |
Collapse
|
23
|
Liu H, Auger J, Larson CR. Voice fundamental frequency modulates vocal response to pitch perturbations during English speech. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2010; 127:EL1-EL5. [PMID: 20058940 PMCID: PMC2803714 DOI: 10.1121/1.3263897] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/10/2009] [Accepted: 10/15/2009] [Indexed: 05/27/2023]
Abstract
Previous research has demonstrated task-dependent vocal responses to pitch perturbations during speech production. The present study investigated the effect of voice fundamental frequency (F(0)) on the modulation of vocal responses during English speech. Randomized pitch shifts of +/-100 or 200 cents during speaking were presented to English speakers. Results indicated larger vocal responses and shorter latencies at a high voice F(0) than at a low voice F(0), but no significance differences were observed for stimulus magnitude or direction. These findings suggest that the pitch-shift reflex during speech can be modulated as a function of voice F(0).
Collapse
Affiliation(s)
- Hanjun Liu
- Department of Rehabilitation Medicine, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou 510080, People's Republic of China.
| | | | | |
Collapse
|