1
|
Yun EWT, Nguyen DD, Carding P, Hodges NJ, Chacon AM, Madill C. The Relationship Between Pitch Discrimination and Acoustic Voice Measures in a Cohort of Female Speakers. J Voice 2024; 38:1023-1034. [PMID: 35317969 DOI: 10.1016/j.jvoice.2022.02.015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2021] [Revised: 02/13/2022] [Accepted: 02/13/2022] [Indexed: 11/28/2022]
Abstract
BACKGROUND Evidence across a range of musically trained, hearing disordered and voice disordered populations present conflicting results regarding the relationship between pitch discrimination (PD) and voice quality. PD characteristics of female speakers with and without a musical training background and no self-reported voice disorder, and the relationship between PD and voice quality in this particular population, have not been investigated. AIMS To evaluate PD characteristics in a cohort of female participants without a self-reported voice disorder and the relationship between PD and acoustic voice measures. METHOD One hundred fourteen female participants were studied, all of whom self-reported as being non-voice disordered. All completed the Newcastle Assessment of Pitch Discrimination which involved a two-tone PD task. Their voices were recorded producing standardized vocal tasks. Voice samples were acoustically analyzed for frequency-domain measures (fundamental frequency and its standard deviation, and harmonics-to-noise ratio) and spectral-domain measures (cepstral peak prominence and the Cepstral/Spectral Index of Dysphonia). Data were analyzed for the whole cohort and for musical and non-musical training backgrounds. RESULTS In the whole cohort, there were no significant correlations between PD and acoustic voice measures. PD accuracy in musically trained speakers was better than in non-trained speakers and correlated with fundamental frequency standard deviation in prolonged vowel tasks. Vocalists demonstrated superior PD accuracy and fundamental frequency standard deviation in prolonged vowels compared to instrumentalists but did not show significant correlations between PD and acoustic measures. The Newcastle Assessment of Pitch Discrimination was a reliable tool, showing moderate-good prediction value in differentiating musical background. CONCLUSIONS There was little evidence of a relationship between PD and acoustic measures of voice quality, regardless of musical training background and superior PD accuracy among the musically trained. These data do not support ideas concerning the co-development of perception and action among individuals identified as having voice quality measures within normal ranges. Numerous measures of voice quality, including measures sensitive to pitch, did not distinguish across musically and non-musically trained individuals, despite individual differences in pitch discrimination.
Collapse
Affiliation(s)
- Emily Wing-Tung Yun
- Discipline of Speech Pathology, Faculty of Medicine and Health, Sydney School of Health Sciences, The University of Sydney, Sydney, Australia; Doctor Liang Voice Program, Faculty of Medicine and Health, Sydney School of Health Sciences, The University of Sydney, Sydney, Australia
| | - Duy Duong Nguyen
- Discipline of Speech Pathology, Faculty of Medicine and Health, Sydney School of Health Sciences, The University of Sydney, Sydney, Australia; Doctor Liang Voice Program, Faculty of Medicine and Health, Sydney School of Health Sciences, The University of Sydney, Sydney, Australia
| | - Paul Carding
- Oxford Institute of Nursing, Midwifery and Allied Health Research, Oxford Brookes University, Oxford, England
| | - Nicola J Hodges
- School of Kinesiology, University of British Columbia, Vancouver, British Columbia, Canada
| | - Antonia Margarita Chacon
- Discipline of Speech Pathology, Faculty of Medicine and Health, Sydney School of Health Sciences, The University of Sydney, Sydney, Australia; Doctor Liang Voice Program, Faculty of Medicine and Health, Sydney School of Health Sciences, The University of Sydney, Sydney, Australia
| | - Catherine Madill
- Discipline of Speech Pathology, Faculty of Medicine and Health, Sydney School of Health Sciences, The University of Sydney, Sydney, Australia; Doctor Liang Voice Program, Faculty of Medicine and Health, Sydney School of Health Sciences, The University of Sydney, Sydney, Australia.
| |
Collapse
|
2
|
Cai X, Ouyang M, Yin Y, Zhang Q. Sensorimotor Adaptation to Formant-Shifted Auditory Feedback Is Predicted by Language-Specific Factors in L1 and L2 Speech Production. LANGUAGE AND SPEECH 2024; 67:846-869. [PMID: 37830332 DOI: 10.1177/00238309231202503] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/14/2023]
Abstract
Auditory feedback plays an important role in the long-term updating and maintenance of speech motor control; thus, the current study explored the unresolved question of how sensorimotor adaptation is predicted by language-specific and domain-general factors in first-language (L1) and second-language (L2) production. Eighteen English-L1 speakers and 22 English-L2 speakers performed the same sensorimotor adaptation experiments and tasks, which measured language-specific and domain-general abilities. The experiment manipulated the language groups (English-L1 and English-L2) and experimental conditions (baseline, early adaptation, late adaptation, and end). Linear mixed-effects model analyses indicated that auditory acuity was significantly associated with sensorimotor adaptation in L1 and L2 speakers. Analysis of vocal responses showed that L1 speakers exhibited significant sensorimotor adaptation under the early adaptation, late adaptation, and end conditions, whereas L2 speakers exhibited significant sensorimotor adaptation only under the late adaptation condition. Furthermore, the domain-general factors of working memory and executive control were not associated with adaptation/aftereffects in either L1 or L2 production, except for the role of working memory in aftereffects in L2 production. Overall, the study empirically supported the hypothesis that sensorimotor adaptation is predicted by language-specific factors such as auditory acuity and language experience, whereas general cognitive abilities do not play a major role in this process.
Collapse
Affiliation(s)
- Xiao Cai
- School of Foreign Languages, Renmin University of China, China; Department of Psychology, Renmin University of China, China
| | - Mingkun Ouyang
- School of Education Science, Guangxi Minzu University, China
| | - Yulong Yin
- School of Psychology, Northwest Normal University, China
| | - Qingfang Zhang
- Department of Psychology, Renmin University of China, China
| |
Collapse
|
3
|
Dahl KL, Cádiz MD, Zuk J, Guenther FH, Stepp CE. Controlling Pitch for Prosody: Sensorimotor Adaptation in Linguistically Meaningful Contexts. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2024; 67:440-454. [PMID: 38241671 PMCID: PMC11000799 DOI: 10.1044/2023_jslhr-23-00460] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/02/2023] [Revised: 10/09/2023] [Accepted: 11/02/2023] [Indexed: 01/21/2024]
Abstract
PURPOSE This study examined how speakers adapt to fundamental frequency (fo) errors that affect the use of prosody to convey linguistic meaning, whether fo adaptation in that context relates to adaptation in linguistically neutral sustained vowels, and whether cue trading is reflected in responses in the prosodic cues of fo and amplitude. METHOD Twenty-four speakers said vowels and sentences while fo was digitally altered to induce predictable errors. Shifts in fo (±200 cents) were applied to the entire sustained vowel and one word (emphasized or unemphasized) in sentences. Two prosodic cues-fo and amplitude-were extracted. The effects of fo shifts, shift direction, and emphasis on fo response magnitude were evaluated with repeated-measures analyses of variance. Relationships between adaptive fo responses in sentences and vowels and between adaptive fo and amplitude responses were evaluated with Spearman correlations. RESULTS Speakers adapted to fo errors in both linguistically meaningful sentences and linguistically neutral vowels. Adaptive fo responses of unemphasized words were smaller than those of emphasized words when fo was shifted upward. There was no relationship between adaptive fo responses in vowels and emphasized words, but adaptive fo and amplitude responses were strongly, positively correlated. CONCLUSIONS Sensorimotor adaptation occurs in response to fo errors regardless of how disruptive the error is to linguistic meaning. Adaptation to fo errors during sustained vowels may not involve the exact same mechanisms as sensorimotor adaptation as it occurs in meaningful speech. The relationship between adaptive responses in fo and amplitude supports an integrated model of prosody. SUPPLEMENTAL MATERIAL https://doi.org/10.23641/asha.25008908.
Collapse
Affiliation(s)
- Kimberly L. Dahl
- Department of Speech, Language and Hearing Sciences, Boston University, MA
| | - Manuel Díaz Cádiz
- Department of Speech, Language and Hearing Sciences, Boston University, MA
| | - Jennifer Zuk
- Department of Speech, Language and Hearing Sciences, Boston University, MA
| | - Frank H. Guenther
- Department of Speech, Language and Hearing Sciences, Boston University, MA
- Department of Biomedical Engineering, Boston University, MA
| | - Cara E. Stepp
- Department of Speech, Language and Hearing Sciences, Boston University, MA
- Department of Biomedical Engineering, Boston University, MA
- Department of Otolaryngology–Head and Neck Surgery, Boston University School of Medicine, MA
| |
Collapse
|
4
|
Miller HE, Kearney E, Nieto-Castañón A, Falsini R, Abur D, Acosta A, Chao SC, Dahl KL, Franken M, Heller Murray ES, Mollaei F, Niziolek CA, Parrell B, Perrachione T, Smith DJ, Stepp CE, Tomassi N, Guenther FH. Do Not Cut Off Your Tail: A Mega-Analysis of Responses to Auditory Perturbation Experiments. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2023; 66:4315-4331. [PMID: 37850867 PMCID: PMC10715843 DOI: 10.1044/2023_jslhr-23-00315] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/18/2023] [Revised: 08/04/2023] [Accepted: 08/06/2023] [Indexed: 10/19/2023]
Abstract
PURPOSE The practice of removing "following" responses from speech perturbation analyses is increasingly common, despite no clear evidence as to whether these responses represent a unique response type. This study aimed to determine if the distribution of responses to auditory perturbation paradigms represents a bimodal distribution, consisting of two distinct response types, or a unimodal distribution. METHOD This mega-analysis pooled data from 22 previous studies to examine the distribution and magnitude of responses to auditory perturbations across four tasks: adaptive pitch, adaptive formant, reflexive pitch, and reflexive formant. Data included at least 150 unique participants for each task, with studies comprising younger adult, older adult, and Parkinson's disease populations. A Silverman's unimodality test followed by a smoothed bootstrap resampling technique was performed for each task to evaluate the number of modes in each distribution. Wilcoxon signed-ranks tests were also performed for each distribution to confirm significant compensation in response to the perturbation. RESULTS Modality analyses were not significant (p > .05) for any group or task, indicating unimodal distributions. Our analyses also confirmed compensatory reflexive responses to pitch and formant perturbations across all groups, as well as adaptive responses to sustained formant perturbations. However, analyses of sustained pitch perturbations only revealed evidence of adaptation in studies with younger adults. CONCLUSION The demonstration of a clear unimodal distribution across all tasks suggests that following responses do not represent a distinct response pattern, but rather the tail of a unimodal distribution. SUPPLEMENTAL MATERIAL https://doi.org/10.23641/asha.24282676.
Collapse
Affiliation(s)
- Hilary E. Miller
- Department of Speech, Language and Hearing Sciences, Boston University, MA
| | - Elaine Kearney
- Department of Speech, Language and Hearing Sciences, Boston University, MA
| | | | - Riccardo Falsini
- Department of Speech, Language and Hearing Sciences, Boston University, MA
| | - Defne Abur
- Department of Speech, Language and Hearing Sciences, Boston University, MA
| | - Alexander Acosta
- Department of Speech, Language and Hearing Sciences, Boston University, MA
| | - Sara-Ching Chao
- College of Health Solutions, Arizona State University, Tempe
| | - Kimberly L. Dahl
- Department of Speech, Language and Hearing Sciences, Boston University, MA
| | - Matthias Franken
- Department of Psychology, McGill University, Montréal, Québec, Canada
| | | | - Fatemeh Mollaei
- School of Psychology and Clinical Language Sciences, University of Reading, England
| | - Caroline A. Niziolek
- Department of Communication Sciences and Disorders, University of Wisconsin–Madison
| | - Benjamin Parrell
- Department of Communication Sciences and Disorders, University of Wisconsin–Madison
| | - Tyler Perrachione
- Department of Speech, Language and Hearing Sciences, Boston University, MA
| | - Dante J. Smith
- Department of Speech, Language and Hearing Sciences, Boston University, MA
| | - Cara E. Stepp
- Department of Speech, Language and Hearing Sciences, Boston University, MA
| | - Nicole Tomassi
- Graduate Program for Neuroscience, Boston University, MA
| | - Frank H. Guenther
- Department of Speech, Language and Hearing Sciences, Boston University, MA
| |
Collapse
|
5
|
Heller Murray ES, Chao A. The Relationships Among Vocal Variability, Vocal-Articulatory Coordination, and Dysphonia in Children. J Voice 2023; 37:969.e43-969.e49. [PMID: 34272144 DOI: 10.1016/j.jvoice.2021.06.008] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2021] [Revised: 06/01/2021] [Accepted: 06/10/2021] [Indexed: 10/20/2022]
Abstract
OBJECTIVE The purpose of this study was to evaluate the relationship between vocal variability and variability of vocal-articulatory coordination in children. Furthermore, this study examined if this relationship was impacted by pediatric dysphonia. STUDY DESIGN Retrospective analysis of speech samples in the Arizona Child Acoustic Database. METHODS Speech samples from children 2-7 years of age were selected for analysis. Vocal variability was defined as the coefficient of variation (CoV) of fundamental frequency, taken from the center of sustained vowels. Variability of vocal-articulatory coordination was defined as the CoV of voice onset time (VOT) of voiceless stop consonants. Both objective and subjective measures of dysphonia were completed for each participant. RESULTS Children had a negative correlation between VOT variability and vocal variability. Further analysis indicated that this relationship was present in children with typical developmental levels of dysphonia but absent for children with moderate to severe dysphonia. Increased dysphonia severity was associated with increased vocal variability. CONCLUSION Increased VOT variability was associated with decreased vocal variability in children with dysphonia severities consistent with typical vocal development. However, this relationship was not present in children with moderate to severe dysphonia. This study suggests that future work is needed to examine the relationships between the vocal system and vocal-articulatory coordination in children with and without diagnosed voice disorders.
Collapse
Affiliation(s)
| | - Andie Chao
- Department of Communication Sciences and Disorders, Temple University, Philadelphia, Pennsylvania
| |
Collapse
|
6
|
Vojtech JM, Mitchell CL, Raiff L, Kline JC, De Luca G. Prediction of Voice Fundamental Frequency and Intensity from Surface Electromyographic Signals of the Face and Neck. VIBRATION 2022; 5:692-710. [PMID: 36299552 PMCID: PMC9592063 DOI: 10.3390/vibration5040041] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
Silent speech interfaces (SSIs) enable speech recognition and synthesis in the absence of an acoustic signal. Yet, the archetypal SSI fails to convey the expressive attributes of prosody such as pitch and loudness, leading to lexical ambiguities. The aim of this study was to determine the efficacy of using surface electromyography (sEMG) as an approach for predicting continuous acoustic estimates of prosody. Ten participants performed a series of vocal tasks including sustained vowels, phrases, and monologues while acoustic data was recorded simultaneously with sEMG activity from muscles of the face and neck. A battery of time-, frequency-, and cepstral-domain features extracted from the sEMG signals were used to train deep regression neural networks to predict fundamental frequency and intensity contours from the acoustic signals. We achieved an average accuracy of 0.01 ST and precision of 0.56 ST for the estimation of fundamental frequency, and an average accuracy of 0.21 dB SPL and precision of 3.25 dB SPL for the estimation of intensity. This work highlights the importance of using sEMG as an alternative means of detecting prosody and shows promise for improving SSIs in future development.
Collapse
Affiliation(s)
| | | | - Laura Raiff
- Delsys, Inc., Natick, MA 01760, USA
- Altec, Inc., Natick, MA 01760, USA
- Department of Biomedical Engineering, Boston University, Boston, MA 02215, USA
| | - Joshua C. Kline
- Delsys, Inc., Natick, MA 01760, USA
- Altec, Inc., Natick, MA 01760, USA
| | - Gianluca De Luca
- Delsys, Inc., Natick, MA 01760, USA
- Altec, Inc., Natick, MA 01760, USA
| |
Collapse
|
7
|
Kearney E, Nieto-Castañón A, Falsini R, Daliri A, Heller Murray ES, Smith DJ, Guenther FH. Quantitatively characterizing reflexive responses to pitch perturbations. Front Hum Neurosci 2022; 16:929687. [PMID: 36405080 PMCID: PMC9666385 DOI: 10.3389/fnhum.2022.929687] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2022] [Accepted: 10/04/2022] [Indexed: 11/06/2022] Open
Abstract
Background Reflexive pitch perturbation experiments are commonly used to investigate the neural mechanisms underlying vocal motor control. In these experiments, the fundamental frequency–the acoustic correlate of pitch–of a speech signal is shifted unexpectedly and played back to the speaker via headphones in near real-time. In response to the shift, speakers increase or decrease their fundamental frequency in the direction opposing the shift so that their perceived pitch is closer to what they intended. The goal of the current work is to develop a quantitative model of responses to reflexive perturbations that can be interpreted in terms of the physiological mechanisms underlying the response and that captures both group-mean data and individual subject responses. Methods A model framework was established that allowed the specification of several models based on Proportional-Integral-Derivative and State-Space/Directions Into Velocities of Articulators (DIVA) model classes. The performance of 19 models was compared in fitting experimental data from two published studies. The models were evaluated in terms of their ability to capture both population-level responses and individual differences in sensorimotor control processes. Results A three-parameter DIVA model performed best when fitting group-mean data from both studies; this model is equivalent to a single-rate state-space model and a first-order low pass filter model. The same model also provided stable estimates of parameters across samples from individual subject data and performed among the best models to differentiate between subjects. The three parameters correspond to gains in the auditory feedback controller’s response to a perceived error, the delay of this response, and the gain of the somatosensory feedback controller’s “resistance” to this correction. Excellent fits were also obtained from a four-parameter model with an additional auditory velocity error term; this model was better able to capture multi-component reflexive responses seen in some individual subjects. Conclusion Our results demonstrate the stereotyped nature of an individual’s responses to pitch perturbations. Further, we identified a model that captures population responses to pitch perturbations and characterizes individual differences in a stable manner with parameters that relate to underlying motor control capabilities. Future work will evaluate the model in characterizing responses from individuals with communication disorders.
Collapse
Affiliation(s)
- Elaine Kearney
- Department of Speech, Language, and Hearing Sciences, Boston University, Boston, MA, United States
- *Correspondence: Elaine Kearney,
| | - Alfonso Nieto-Castañón
- Department of Speech, Language, and Hearing Sciences, Boston University, Boston, MA, United States
- The McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, MA, United States
| | - Riccardo Falsini
- Department of Speech, Language, and Hearing Sciences, Boston University, Boston, MA, United States
| | - Ayoub Daliri
- College of Health Solutions, Arizona State University, Tempe, AZ, United States
| | | | - Dante J. Smith
- Gradutate Program for Neuroscience, Boston University, Boston, MA, United States
| | - Frank H. Guenther
- Department of Speech, Language, and Hearing Sciences, Boston University, Boston, MA, United States
- Department of Biomedical Engineering, Boston University, Boston, MA, United States
- The Picower Institute for Learning and Memory, Massachusetts Institute of Technology, Cambridge, MA, United States
| |
Collapse
|
8
|
Castillo-Allendes A, Delgado-Bravo M, Ponce AR, Hunter EJ. Muscle Activity and Aerodynamic Voice Changes at Different Body Postures: A Pilot Study. J Voice 2022:S0892-1997(22)00300-9. [PMID: 36273960 PMCID: PMC10924224 DOI: 10.1016/j.jvoice.2022.09.024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2022] [Revised: 09/23/2022] [Accepted: 09/26/2022] [Indexed: 11/06/2022]
Abstract
OBJECTIVE Body posture is a commonly discussed component of voice training and therapy. However, body postures, postural changes, related posturing muscle monitoring, and the potential changes in voice production (eg, glottal aerodynamic changes, acoustic differences) have been inconsistently described in the literature, leaving room for free interpretation and possible misunderstandings. The primary purpose of this pilot study was to compare the magnitude of electromyographical activation of muscles involved in phonation-breathing functions and their changes due to four standardized body postures in experienced singers. Secondly, to identify which body posture produces greater changes in aerodynamic parameters, vocal pitch, and loudness. METHODS Eight healthy adults with experience in singing voice performed a vocal task during different body postures commonly used in both voice training and therapy. A 3D-capture system was used to control and quantify the alignment of each posture. During the performances, surface electromyography (sEMG) was used to measure the muscular activity involved in the breathing/phonation and posture processes. A nonparametric Kruskal-Wallis test was used to compare the sEMG activity of phonatory muscles and aerodynamic voice variables between postures. RESULTS Our study did not reveal significant differences in sEMG activity, aerodynamic parameters, vocal pitch, and loudness among body postures during vocal task productions. However, the vocal pitch (in semitones) revealed significant differences in the unstable surface when compared to the upright posture, modified upright, and leaning postures. CONCLUSION The body postures selected did not generate voice aerodynamic modifications of the voice nor in the levels of activation of muscles involved in the phonation-breathing process in individuals with experience in singing voice. Modifications of body posture as a tool for voice therapy should be further investigated, considering the population with voice problems and no voice training experience.
Collapse
Affiliation(s)
- Adrián Castillo-Allendes
- Department of Communicative Sciences and Disorders, Michigan State University, East Lansing, Michigan
| | - Mauricio Delgado-Bravo
- Carrera de Kinesiología, Departamento de Ciencias de la Salud, Facultad de Medicina, Pontificia Universidad Católica de Chile, Santiago, Chile; Human Performance Laboratory, Faculty of Kinesiology, University of Calgary, Calgary, Alberta, Canada.
| | - Alvaro Reyes Ponce
- Exercise and Rehabilitation Sciences Institute, School of Physical Therapy, Faculty of Rehabilitation Sciences, Universidad Andres Bello, Santiago de Chile, Chile
| | - Eric J Hunter
- Department of Communicative Sciences and Disorders, Michigan State University, East Lansing, Michigan
| |
Collapse
|
9
|
Coughler C, Quinn de Launay KL, Purcell DW, Oram Cardy J, Beal DS. Pediatric Responses to Fundamental and Formant Frequency Altered Auditory Feedback: A Scoping Review. Front Hum Neurosci 2022; 16:858863. [PMID: 35664350 PMCID: PMC9157279 DOI: 10.3389/fnhum.2022.858863] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2022] [Accepted: 04/12/2022] [Indexed: 11/13/2022] Open
Abstract
Purpose The ability to hear ourselves speak has been shown to play an important role in the development and maintenance of fluent and coherent speech. Despite this, little is known about the developing speech motor control system throughout childhood, in particular if and how vocal and articulatory control may differ throughout development. A scoping review was undertaken to identify and describe the full range of studies investigating responses to frequency altered auditory feedback in pediatric populations and their contributions to our understanding of the development of auditory feedback control and sensorimotor learning in childhood and adolescence. Method Relevant studies were identified through a comprehensive search strategy of six academic databases for studies that included (a) real-time perturbation of frequency in auditory input, (b) an analysis of immediate effects on speech, and (c) participants aged 18 years or younger. Results Twenty-three articles met inclusion criteria. Across studies, there was a wide variety of designs, outcomes and measures used. Manipulations included fundamental frequency (9 studies), formant frequency (12), frequency centroid of fricatives (1), and both fundamental and formant frequencies (1). Study designs included contrasts across childhood, between children and adults, and between typical, pediatric clinical and adult populations. Measures primarily explored acoustic properties of speech responses (latency, magnitude, and variability). Some studies additionally examined the association of these acoustic responses with clinical measures (e.g., stuttering severity and reading ability), and neural measures using electrophysiology and magnetic resonance imaging. Conclusion Findings indicated that children above 4 years generally compensated in the opposite direction of the manipulation, however, in several cases not as effectively as adults. Overall, results varied greatly due to the broad range of manipulations and designs used, making generalization challenging. Differences found between age groups in the features of the compensatory vocal responses, latency of responses, vocal variability and perceptual abilities, suggest that maturational changes may be occurring in the speech motor control system, affecting the extent to which auditory feedback is used to modify internal sensorimotor representations. Varied findings suggest vocal control develops prior to articulatory control. Future studies with multiple outcome measures, manipulations, and more expansive age ranges are needed to elucidate findings.
Collapse
Affiliation(s)
- Caitlin Coughler
- Graduate Program in Health and Rehabilitation Sciences, Faculty of Health Sciences, The University of Western Ontario, London, ON, Canada
- *Correspondence: Caitlin Coughler,
| | - Keelia L. Quinn de Launay
- Bloorview Research Institute, Holland Bloorview Kids Rehabilitation Hospital, Toronto, ON, Canada
- Rehabilitation Sciences Institute, Temerty Faculty of Medicine, University of Toronto, Toronto, ON, Canada
| | - David W. Purcell
- School of Communication Sciences and Disorders, Faculty of Health Sciences, The University of Western Ontario, London, ON, Canada
- National Centre for Audiology, Faculty of Health Sciences, The University of Western Ontario, London, ON, Canada
| | - Janis Oram Cardy
- School of Communication Sciences and Disorders, Faculty of Health Sciences, The University of Western Ontario, London, ON, Canada
- National Centre for Audiology, Faculty of Health Sciences, The University of Western Ontario, London, ON, Canada
| | - Deryk S. Beal
- Bloorview Research Institute, Holland Bloorview Kids Rehabilitation Hospital, Toronto, ON, Canada
- Rehabilitation Sciences Institute, Temerty Faculty of Medicine, University of Toronto, Toronto, ON, Canada
- Department of Speech-Language Pathology, Temerty Faculty of Medicine, University of Toronto, Toronto, ON, Canada
| |
Collapse
|
10
|
Rhee N, Chen A, Kuang J. Musicality and Age Interaction in Tone Development. Front Neurosci 2022; 16:804042. [PMID: 35264924 PMCID: PMC8901167 DOI: 10.3389/fnins.2022.804042] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2021] [Accepted: 01/26/2022] [Indexed: 11/13/2022] Open
Abstract
Vocal pitch, which involves not only F0 but also multiple covarying acoustic cues is central to linguistic perception and production at various levels of prosodic structure. Recent studies on language development have shown that differences in learners' musicality affect the F0 cue development in perception of sentence-level intonation or in prosodic realization of focus. This study aims to contribute toward a fuller understanding of the effect of musicality on linguistic pitch development via a close investigation of the relationship between musicality, age, and lexical tone production covering both F0 and spectral cues in children. Forty-three native Mandarin-speaking children between the ages of 4 and 6 years are recruited to participate in both a semi-spontaneous tone production task and a musicality test. For each age (4, 5, and 6 years) and musicality (below or above the median score of each age group) group, the contrastivity of the four tones is evaluated by performing automatic tone classification using three sets of acoustic cues (F0, spectral cues, and both). It has been found that higher musicality is associated with higher contrastivity of the tones produced at the age of 4 and 5 years, but not at the age of 6 years. These results suggest that musicality promotes earlier development of tone production only in earlier stages of prosodic development; by the age of 6 years, the musicality advantage in tone production subsides.
Collapse
Affiliation(s)
- Nari Rhee
- Department of Linguistics, University of Pennsylvania, Philadelphia, PA, United States
- *Correspondence: Nari Rhee
| | - Aoju Chen
- Utrecht Institute of Linguistics OTS, Utrecht University, Utrecht, Netherlands
| | - Jianjing Kuang
- Department of Linguistics, University of Pennsylvania, Philadelphia, PA, United States
- Jianjing Kuang
| |
Collapse
|
11
|
Tomassi NE, Weerathunge HR, Cushman MR, Bohland JW, Stepp CE. Assessing Ecologically Valid Methods of Auditory Feedback Measurement in Individuals With Typical Speech. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2022; 65:121-135. [PMID: 34941381 PMCID: PMC9153919 DOI: 10.1044/2021_jslhr-21-00377] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/07/2021] [Revised: 09/15/2021] [Accepted: 09/16/2021] [Indexed: 06/14/2023]
Abstract
PURPOSE Auditory feedback is thought to contribute to the online control of speech production. Yet, the standard method of estimating auditory feedback control (i.e., reflexive responses to auditory-motor perturbations), although sound, requires specialized instrumentation, meticulous calibration, unnatural tasks, and specific acoustic environments. The purpose of this study was to explore more ecologically valid features of speech production to determine their relationships with auditory feedback mechanisms. METHOD Two previously proposed measures of within-utterance variability (centering and baseline variability) were compared with reflexive response magnitudes in 30 adults with typical speech. These three measures were estimated for both the laryngeal and articulatory subsystems of speech. RESULTS Regardless of the speech subsystem, neither centering nor baseline variability was shown to be related to reflexive response magnitudes. Likewise, no relationships were found between centering and baseline variability. CONCLUSIONS Despite previous suggestions that centering and baseline variability may be related to auditory feedback mechanisms, this study did not support these assertions. However, the detection of such relationships may have required a larger degree of variability in responses, relative to that found in those with typical speech. Future research on these relationships is warranted in populations with more heterogeneous responses, such as children or clinical populations. SUPPLEMENTAL MATERIAL https://doi.org/10.23641/asha.17330546.
Collapse
Affiliation(s)
- Nicole E. Tomassi
- Graduate Program for Neuroscience, Boston University, MA
- Department of Speech, Language & Hearing Sciences, Boston University, MA
| | - Hasini R. Weerathunge
- Department of Speech, Language & Hearing Sciences, Boston University, MA
- Department of Biomedical Engineering, Boston University, MA
| | - Megan R. Cushman
- Department of Speech, Language & Hearing Sciences, Boston University, MA
| | - Jason W. Bohland
- Department of Communication Science and Disorders, University of Pittsburgh, PA
| | - Cara E. Stepp
- Graduate Program for Neuroscience, Boston University, MA
- Department of Speech, Language & Hearing Sciences, Boston University, MA
- Department of Biomedical Engineering, Boston University, MA
- Department of Otolaryngology—Head & Neck Surgery, Boston University School of Medicine, MA
| |
Collapse
|
12
|
Coughler C, Hamel EM, Cardy JO, Archibald LMD, Purcell DW. Compensation to Altered Auditory Feedback in Children With Developmental Language Disorder and Typical Development. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2021; 64:2363-2376. [PMID: 33769836 DOI: 10.1044/2020_jslhr-20-00374] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Purpose Developmental language disorder (DLD), an unexplained problem using and understanding spoken language, has been hypothesized to have an underlying auditory processing component. Auditory feedback plays a key role in speech motor control. The current study examined whether auditory feedback is used to regulate speech production in a similar way by children with DLD and their typically developing (TD) peers. Method Participants aged 6-11 years completed tasks measuring hearing, language, first formant (F1) discrimination thresholds, partial vowel space, and responses to altered auditory feedback with F1 perturbation. Results Children with DLD tended to compensate more than TD children for the positive F1 manipulation and compensated less than TD children in the negative shift condition. Conclusion Our findings suggest that children with DLD make atypical use of auditory feedback.
Collapse
Affiliation(s)
- Caitlin Coughler
- Graduate Program in Health and Rehabilitation Sciences, The University of Western Ontario, London, Canada
| | - Emily Michaela Hamel
- Graduate Program in Neuroscience, The University of Western Ontario, London, Canada
- Doctor of Medicine Program, The University of British Columbia, Vancouver, Canada
| | - Janis Oram Cardy
- School of Communication Sciences and Disorders, The University of Western Ontario, London, Canada
- National Centre for Audiology, The University of Western Ontario, London, Canada
| | - Lisa M D Archibald
- School of Communication Sciences and Disorders, The University of Western Ontario, London, Canada
| | - David W Purcell
- School of Communication Sciences and Disorders, The University of Western Ontario, London, Canada
- National Centre for Audiology, The University of Western Ontario, London, Canada
| |
Collapse
|
13
|
Cai X, Yin Y, Zhang Q. Online Control of Voice Intensity in Late Bilinguals' First and Second Language Speech Production: Evidence From Unexpected and Brief Noise Masking. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2021; 64:1471-1489. [PMID: 33830851 DOI: 10.1044/2021_jslhr-20-00330] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Purpose Speech production requires the combined efforts of feedforward control and feedback control subsystems. The primary purpose of this study is to explore whether the relative weighting of auditory feedback control is different between the first language (L1) and the second language (L2) production for late bilinguals. The authors also make an exploratory investigation into how bilinguals' speech fluency and speech perception relate to their auditory feedback control. Method Twenty Chinese-English bilinguals named Chinese or English bisyllabic words, while being exposed to 30- or 60-dB unexpected brief masking noise. Variables of language (L1 or L2) and noise condition (quiet, weak noise, or strong noise) were manipulated in the experiment. L1 and L2 speech fluency tests and an L2 perception test were also included to measure bilinguals' speech fluency and auditory acuity. Results Peak intensity analyses indicated that the intensity increases in the weak noise and strong noise conditions were larger in L2-English than L1-Chinese production. Intensity contour analysis showed that the intensity increases in both languages had an onset around 80-140 ms, a peak around 220-250 ms, and persisted till 400 ms post vocalization onset. Correlation analyses also revealed that poorer speech fluency or L2 auditory acuity was associated with larger Lombard effect. Conclusions For late bilinguals, the reliance on auditory feedback control is heavier in L2 than in L1 production. We empirically supported a relation between speech fluency and the relative weighting of auditory feedback control, and provided the first evidence for the production-perception link in L2 speech motor control.
Collapse
Affiliation(s)
- Xiao Cai
- Department of Psychology, Renmin University of China, Beijing
| | - Yulong Yin
- Department of Psychology, Renmin University of China, Beijing
| | - Qingfang Zhang
- Department of Psychology, Renmin University of China, Beijing
| |
Collapse
|
14
|
Lester-Smith RA, Daliri A, Enos N, Abur D, Lupiani AA, Letcher S, Stepp CE. The Relation of Articulatory and Vocal Auditory-Motor Control in Typical Speakers. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2020; 63:3628-3642. [PMID: 33079610 PMCID: PMC8582832 DOI: 10.1044/2020_jslhr-20-00192] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/21/2020] [Revised: 07/16/2020] [Accepted: 08/12/2020] [Indexed: 05/14/2023]
Abstract
Purpose The purpose of this study was to explore the relationship between feedback and feedforward control of articulation and voice by measuring reflexive and adaptive responses to first formant (F 1) and fundamental frequency (f o) perturbations. In addition, perception of F 1 and f o perturbation was estimated using passive (listening) and active (speaking) just noticeable difference paradigms to assess the relation of auditory acuity to reflexive and adaptive responses. Method Twenty healthy women produced single words and sustained vowels while the F 1 or f o of their auditory feedback was suddenly and unpredictably perturbed to assess reflexive responses or gradually and predictably perturbed to assess adaptive responses. Results Typical speakers' reflexive responses to sudden perturbation of F 1 were related to their adaptive responses to gradual perturbation of F 1. Specifically, speakers with larger reflexive responses to sudden perturbation of F 1 had larger adaptive responses to gradual perturbation of F 1. Furthermore, their reflexive responses to sudden perturbation of F 1 were associated with their passive auditory acuity to F 1 such that speakers with better auditory acuity to F 1 produced larger reflexive responses to sudden perturbations of F 1. Typical speakers' adaptive responses to gradual perturbation of F 1 were not associated with their auditory acuity to F 1. Speakers' reflexive and adaptive responses to perturbation of f o were not related, nor were their responses related to either measure of auditory acuity to f o. Conclusion These findings indicate that there may be disparate feedback and feedforward control mechanisms for articulatory and vocal error correction based on auditory feedback.
Collapse
Affiliation(s)
| | - Ayoub Daliri
- Department of Speech, Language, & Hearing Sciences, Boston University, MA
| | - Nicole Enos
- Department of Biomedical Engineering, Boston University, MA
- Department of Electrical & Computer Engineering, Boston University, MA
| | - Defne Abur
- Department of Speech, Language, & Hearing Sciences, Boston University, MA
| | - Ashling A. Lupiani
- Department of Speech, Language, & Hearing Sciences, Boston University, MA
| | | | - Cara E. Stepp
- Department of Speech, Language, & Hearing Sciences, Boston University, MA
- Department of Biomedical Engineering, Boston University, MA
- Department of Otolaryngology—Head & Neck Surgery, Boston University School of Medicine, MA
| |
Collapse
|
15
|
Adaptation to pitch-altered feedback is independent of one's own voice pitch sensitivity. Sci Rep 2020; 10:16860. [PMID: 33033324 PMCID: PMC7544828 DOI: 10.1038/s41598-020-73932-1] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2020] [Accepted: 09/23/2020] [Indexed: 01/17/2023] Open
Abstract
Monitoring voice pitch is a fine-tuned process in daily conversations as conveying accurately the linguistic and affective cues in a given utterance depends on the precise control of phonation and intonation. This monitoring is thought to depend on whether the error is treated as self-generated or externally-generated, resulting in either a correction or inflation of errors. The present study reports on two separate paradigms of adaptation to altered feedback to explore whether participants could behave in a more cohesive manner once the error is of comparable size perceptually. The vocal behavior of normal-hearing and fluent speakers was recorded in response to a personalized size of pitch shift versus a non-specific size, one semitone. The personalized size of shift was determined based on the just-noticeable difference in fundamental frequency (F0) of each participant’s voice. Here we show that both tasks successfully demonstrated opposing responses to a constant and predictable F0 perturbation (on from the production onset) but these effects barely carried over once the feedback was back to normal, depicting a pattern that bears some resemblance to compensatory responses. Experiencing a F0 shift that is perceived as self-generated (because it was precisely just-noticeable) is not enough to force speakers to behave more consistently and more homogeneously in an opposing manner. On the contrary, our results suggest that the type of the response as well as the magnitude of the response do not depend in any trivial way on the sensitivity of participants to their own voice pitch. Based on this finding, we speculate that error correction could possibly occur even with a bionic ear, typically even when F0 cues are too subtle for cochlear implant users to detect accurately.
Collapse
|