1
|
Haas E, Ziegler W, Schölderle T. Age Estimation and Gender Attribution in Typically Developing Children and Children With Dysarthria. AMERICAN JOURNAL OF SPEECH-LANGUAGE PATHOLOGY 2024; 33:1236-1253. [PMID: 38416062 DOI: 10.1044/2023_ajslp-23-00246] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/29/2024]
Abstract
PURPOSE The purposes of this study were (a) to investigate adult listeners' perceptions of age and gender in typically developing children and children with dysarthria and (b) to identify predictors of their estimates among auditory-perceptual parameters and an acoustic measure of vocal pitch (F0). We aimed to evaluate the influence of dysarthria on the listeners' impressions of age and gender against the background of typical developmental processes. METHOD In a listening experiment, adult listeners completed age and gender estimates of 144 typically developing children (3-9 years of age) and 25 children with dysarthria (5-9 years of age). The Bogenhausen Dysarthria Scales for Childhood Dysarthria (BoDyS-KiD) were applied to record speech samples and to complete auditory-perceptual judgments covering all speech subsystems. Furthermore, each child's mean F0 was determined from samples of four BoDyS-KiD sentences. RESULTS Age estimates for the typically developing children showed a regression to the mean, whereas children with dysarthria were systematically underestimated in their age. The estimates of all children were predicted by developmental speech features; for the children with dysarthria, specific dysarthria symptoms had an additional effect. We found a significantly higher accuracy of gender attribution in the typically developing children than in the children with dysarthria. The prediction accuracy of the listeners' gender attribution in the preadolescent children by the included speech characteristics was limited. CONCLUSIONS Children with dysarthria are more difficult to estimate for their age and gender than their typically developing peers. Dysarthria thus alters the auditory-perceptual impression of indexical speech features in children, which must be considered another facet of the communication disorder associated with childhood dysarthria.
Collapse
Affiliation(s)
- Elisabet Haas
- Clinical Neuropsychology Research Group, Institute for Phonetics and Speech Processing, Ludwig-Maximilians-Universität München, Germany
| | - Wolfram Ziegler
- Clinical Neuropsychology Research Group, Institute for Phonetics and Speech Processing, Ludwig-Maximilians-Universität München, Germany
| | - Theresa Schölderle
- Clinical Neuropsychology Research Group, Institute for Phonetics and Speech Processing, Ludwig-Maximilians-Universität München, Germany
| |
Collapse
|
2
|
Story BH, Bunton K. The relation of velopharyngeal coupling area and vocal tract scaling to identification of stop-nasal cognates. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2023; 154:3741-3759. [PMID: 38099832 DOI: 10.1121/10.0023958] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/12/2023] [Accepted: 11/22/2023] [Indexed: 12/18/2023]
Abstract
The purpose of this study was to determine whether the threshold of velopharyngeal (VP) coupling area at which listeners switch from identifying a consonant as a stop to a nasal in North American English was different for speech produced by a model based on an adult male, an adult female, and a 4-year-old child. V1CV2 stimuli were generated with a speech production model that encodes phonetic segments as relative acoustic targets imposed on an underlying vocal tract and laryngeal structure that can be scaled according to sex and age. Each V1CV2 was synthesized with a set of VP coupling functions whose maximum area ranged from 0 to 0.1 cm2. Results showed that scaling the vocal tract and vocal folds had essentially no effect on the VP coupling area at which listener identification shifted from stop to nasal. The range of coupling areas at which the crossover occurred was 0.037-0.049 cm2 for the male model, 0.040-0.055 cm2 for the female model, and 0.039-0.052 cm2 for the 4-year-old child model, and overall mean was 0.044 cm2. Calculations of band limited peak nasalance indicated that 85% peak nasalance during the consonant was well aligned with listener responses.
Collapse
Affiliation(s)
- Brad H Story
- Speech, Language, and Hearing Sciences, University of Arizona, Tucson, Arizona 85721-0071, USA
| | - Kate Bunton
- Speech, Language, and Hearing Sciences, University of Arizona, Tucson, Arizona 85721-0071, USA
| |
Collapse
|
3
|
Zealouk O, Satori H, Hamidi M, Laaidi N, Salek A, Satori K. Analysis of COVID-19 Resulting Cough Using Formants and Automatic Speech Recognition System. J Voice 2023; 37:971.e9-971.e16. [PMID: 34256982 PMCID: PMC8205259 DOI: 10.1016/j.jvoice.2021.05.015] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2021] [Revised: 05/16/2021] [Accepted: 05/18/2021] [Indexed: 12/22/2022]
Abstract
As part of our contributions to researches on the ongoing COVID-19 pandemic worldwide, we have studied the cough changes to the infected people based on the Hidden Markov Model (HMM) speech recognition classification, formants frequency and pitch analysis. In this paper, An HMM-based cough recognition system was implemented with 5 HMM states, 8 Gaussian Mixture Distributions (GMMs) and 13 dimensions of the basic Mel-Frequency Cepstral Coefficients (MFCC) with 39 dimensions of the overall feature vector. A comparison between formants frequency and pitch extracted values is realized based on the cough of COVID-19 infected people and healthy ones to confirm our cough recognition system results. The experimental results present that the difference between the recognition rates of infected and non-infected people is 6.7%. Whereas, the formant analysis variation based on the cough of infected and non-infected people is clearly observed with F1, F3, and F4 and lower for F0 and F2.
Collapse
Affiliation(s)
- Ouissam Zealouk
- Department of Mathematics and Computer Science, Faculty of Sciences Dhar Mahraz, Sidi Mohammed Ben Abbdallah University, Fez, Morocco.
| | - Hassan Satori
- Department of Mathematics and Computer Science, Faculty of Sciences Dhar Mahraz, Sidi Mohammed Ben Abbdallah University, Fez, Morocco
| | - Mohamed Hamidi
- Department of Mathematics and Computer Science, Faculty of Sciences Dhar Mahraz, Sidi Mohammed Ben Abbdallah University, Fez, Morocco
| | - Naouar Laaidi
- Department of Mathematics and Computer Science, Faculty of Sciences Dhar Mahraz, Sidi Mohammed Ben Abbdallah University, Fez, Morocco
| | - Amine Salek
- Faculty of Medicine and Pharmacy, UMP, Oujda, Morocco
| | - Khalid Satori
- Department of Mathematics and Computer Science, Faculty of Sciences Dhar Mahraz, Sidi Mohammed Ben Abbdallah University, Fez, Morocco
| |
Collapse
|
4
|
Vasquez-Serrano P, Reyes-Moreno J, Guido RC, Sepúlveda-Sepúlveda A. MFCC Parameters of the Speech Signal: An Alternative to Formant-Based Instantaneous Vocal Tract Length Estimation. J Voice 2023:S0892-1997(23)00163-7. [PMID: 37344246 DOI: 10.1016/j.jvoice.2023.05.012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2023] [Revised: 05/22/2023] [Accepted: 05/22/2023] [Indexed: 06/23/2023]
Abstract
On the one hand, the relationship between formant frequencies and vocal tract length (VTL) has been intensively studied over the years. On the other hand, the connection involving mel-frequency cepstral coefficients (MFCCs), which concisely codify the overall shape of a speaker's spectral envelope with just a few cepstral coefficients, and VTL has only been modestly analyzed, being worth of further investigation. Thus, based on different statistical models, this article explores the advantages and disadvantages of the latter approach, which is relatively novel, in contrast to the former which arises from more traditional studies. Additionally, VTL is assumed to be a static and inherent characteristic of speakers, that is, a single length parameter is frequently estimated per speaker. By contrast, in this paper we consider VTL estimation from a dynamic perspective using modern real-time Magnetic Resonance Imaging (rtMRI) to measure VTL in parallel with audio signals. To support the experiments, data obtained from USC-TIMIT magnetic resonance videos were used, allowing for the 2D real-time analysis of articulators in motion. As a result, we observed that the performance of MFCCs in case of speaker-dependent modeling is higher, however, in case of cross-speaker modeling, which uses different speakers' data for training and evaluating, its performance is not significantly different of that obtained with formants. In complement, we note that the estimation based on MFCCs is robust, with an acceptable computational time complexity, coherent with the traditional approach.
Collapse
Affiliation(s)
- P Vasquez-Serrano
- Escuela de Ing. Eléctrica, Electrónica y de Telecomunicaciones (E3T), Universidad Industrial de Santander, Bucaramanga, Colombia
| | - J Reyes-Moreno
- Escuela de Ing. Eléctrica, Electrónica y de Telecomunicaciones (E3T), Universidad Industrial de Santander, Bucaramanga, Colombia
| | - Rodrigo Capobianco Guido
- Instituto de Biociências, Letras e Ciências Exatas, Unesp - Univ Estadual Paulista (São Paulo State University), Rua Cristóvão Colombo 2265, Jd Nazareth, 15054-000 São José do Rio Preto, SP, Brazil.
| | - Alexander Sepúlveda-Sepúlveda
- Escuela de Ing. Eléctrica, Electrónica y de Telecomunicaciones (E3T), Universidad Industrial de Santander, Bucaramanga, Colombia
| |
Collapse
|
5
|
Vorperian HK, Kent RD, Lee Y, Buhr KA. Vowel Production in Children and Adults With Down Syndrome: Fundamental and Formant Frequencies of the Corner Vowels. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2023; 66:1208-1239. [PMID: 37015000 PMCID: PMC10187968 DOI: 10.1044/2022_jslhr-22-00510] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/03/2022] [Revised: 12/01/2022] [Accepted: 12/21/2022] [Indexed: 05/18/2023]
Abstract
PURPOSE Atypical vowel production contributes to reduced speech intelligibility in children and adults with Down syndrome (DS). This study compares the acoustic data of the corner vowels /i/, /u/, /æ/, and /ɑ/ from speakers with DS against typically developing/developed (TD) speakers. METHOD Measurements of the fundamental frequency (f o) and first four formant frequencies (F1-F4) were obtained from single word recordings containing the target vowels from 81 participants with DS (ages 3-54 years) and 293 TD speakers (ages 4-92 years), all native speakers of English. The data were used to construct developmental trajectories and to determine interspeaker and intraspeaker variability. RESULTS Trajectories for DS differed from TD based on age and sex, but the groups were similar with the striking change in f o and F1-F4 frequencies around age 10 years. Findings confirm higher f o in DS, and vowel-specific differences between DS and TD in F1 and F2 frequencies, but not F3 and F4. The measure of F2 differences of front-versus-back vowels was more sensitive of compression than reduced vowel space area/centralization across age and sex. Low vowels had more pronounced F2 compression as related to reduced speech intelligibility. Intraspeaker variability was significantly greater for DS than TD for nearly all frequency values across age. DISCUSSION Vowel production differences between DS and TD are age- and sex-specific, which helps explain contradictory results in previous studies. Increased intraspeaker variability across age in DS confirms the presence of a persisting motor speech disorder. Atypical vowel production in DS is common and related to dysmorphology, delayed development, and disordered motor control.
Collapse
Affiliation(s)
- Houri K. Vorperian
- Vocal Tract Development Lab, Waisman Center, University of Wisconsin–Madison
| | - Raymond D. Kent
- Vocal Tract Development Lab, Waisman Center, University of Wisconsin–Madison
| | - Yen Lee
- Department of Educational Leadership, Edgewood College, Madison, Wisconsin
| | - Kevin A. Buhr
- Department of Biostatistics and Medical Informatics, University of Wisconsin–Madison
| |
Collapse
|
6
|
Kent SAK, Fletcher TL, Morgan A, Morton M, Hall RJ, Sandage MJ. Updated Acoustic Normative Data through the Lifespan: A Scoping Review. J Voice 2023:S0892-1997(23)00066-8. [PMID: 36941164 DOI: 10.1016/j.jvoice.2023.02.011] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2022] [Revised: 02/09/2023] [Accepted: 02/10/2023] [Indexed: 03/23/2023]
Abstract
OBJECTIVE To assess the recent literature for voice acoustic data values reported for individuals without voice disorder through the lifespan as a means to develop an updated normative acoustic data resource for children and adults. METHODS A scoping review was conducted using the Preferred Reporting Items for Systematic reviews and Meta-Analyses extension for Scoping Reviews (PRISMA-ScR) Checklist. English language, full-text publications were identified through Medline (EBSCO & OVID), PubMed, APA PsycINFO, Web of Science, Google Scholar, and ProQuest Theses and Dissertations Global. RESULTS A total of 903 sources were retrieved; of these 510 were duplicates. Abstracts of 393 were screened, with 68 full-text review. From the eligible studies, citation review yielded 51 additional resources. Twenty-eight sources were included for data extraction. For the normative acoustic data extracted for males and females across the lifespan, lower fundamental frequency for adult females was observed and few studies collected semitone range, sound level range, or frequency range. Data extraction also indicated a predominately gender binary reporting of acoustic measures with few studies reporting gender identity, race, or ethnicity as variables of interest. CONCLUSIONS The scoping review yielded updated acoustic normative data that is of value for clinicians and researchers who rely on this normative data to make determinations about vocal function. The limited availability of acoustic data by gender, race, and ethnicity creates barriers for generalization of these normative values across all patients, clients, and research volunteers.
Collapse
Affiliation(s)
- Samantha A K Kent
- Department of Speech, Language & Hearing Sciences, Auburn University, Auburn, Alabama
| | - T Laine Fletcher
- Department of Speech, Language & Hearing Sciences, Auburn University, Auburn, Alabama
| | - Abigail Morgan
- Department of Speech, Language & Hearing Sciences, Auburn University, Auburn, Alabama
| | - Mariah Morton
- School of Kinesiology, Auburn University, Auburn, Alabama
| | - Rebecca J Hall
- Department of Speech, Language & Hearing Sciences, Auburn University, Auburn, Alabama
| | - Mary J Sandage
- Department of Speech, Language & Hearing Sciences, Auburn University, Auburn, Alabama.
| |
Collapse
|
7
|
Novotny M, Cmejla R, Tykalova T. Automated prediction of children's age from voice acoustics. Biomed Signal Process Control 2023. [DOI: 10.1016/j.bspc.2022.104490] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
|
8
|
Jibson J. Formant detail needed for identifying, rating, and discriminating vowels in Wisconsin English. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2022; 151:4004. [PMID: 35778208 DOI: 10.1121/10.0011539] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/29/2021] [Accepted: 05/12/2022] [Indexed: 06/15/2023]
Abstract
Neel [(2004). Acoust. Res. Lett. Online 5, 125-131] asked how much time-varying formant detail is needed for vowel identification. In that study, multiple stimuli were synthesized for each vowel: 1-point (monophthongal with midpoint frequencies), 2-point (linear from onset to offset), 3-point, 5-point, and 11-point. Results suggested that a 3-point model was optimal. This conflicted with the dual-target hypothesis of vowel inherent spectral change research, which has found that two targets are sufficient to model vowel identification. The present study replicates and expands upon the work of Neel. Ten English monophthongs were chosen for synthesis. One-, two-, three-, and five-point vowels were created as described above, and another 1-point stimulus was created with onset frequencies rather than midpoint frequencies. Three experiments were administered (n = 18 for each): vowel identification, goodness rating, and discrimination. The results ultimately align with the dual-target hypothesis, consistent with most vowel inherent spectral change studies.
Collapse
Affiliation(s)
- Jonathan Jibson
- English Department, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| |
Collapse
|
9
|
Chuang YJ, Hwang SJ, Buhr KA, Miller CA, Avey GD, Story BH, Vorperian HK. Anatomic development of the upper airway during the first five years of life: A three-dimensional imaging study. PLoS One 2022; 17:e0264981. [PMID: 35275939 PMCID: PMC8916633 DOI: 10.1371/journal.pone.0264981] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2021] [Accepted: 02/21/2022] [Indexed: 12/05/2022] Open
Abstract
Purpose Normative data on the growth and development of the upper airway across the sexes is needed for the diagnosis and treatment of congenital and acquired respiratory anomalies and to gain insight on developmental changes in speech acoustics and disorders with craniofacial anomalies. Methods The growth of the upper airway in children ages birth to 5 years, as compared to adults, was quantified using an imaging database with computed tomography studies from typically developing individuals. Methodological criteria for scan inclusion and airway measurements included: head position, histogram-based airway segmentation, anatomic landmark placement, and development of a semi-automatic centerline for data extraction. A comprehensive set of 2D and 3D supra- and sub-glottal measurements from the choanae to tracheal opening were obtained including: naso-oro-laryngo-pharynx subregion volume and length, each subregion’s superior and inferior cross-sectional-area, and antero-posterior and transverse/width distances. Results Growth of the upper airway during the first 5 years of life was more pronounced in the vertical and transverse/lateral dimensions than in the antero-posterior dimension. By age 5 years, females have larger pharyngeal measurement than males. Prepubertal sex-differences were identified in the subglottal region. Conclusions Our findings demonstrate the importance of studying the growth of the upper airway in 3D. As the lumen length increases, its shape changes, becoming increasingly elliptical during the first 5 years of life. This study also emphasizes the importance of methodological considerations for both image acquisition and data extraction, as well as the use of consistent anatomic structures in defining pharyngeal regions.
Collapse
Affiliation(s)
- Ying Ji Chuang
- Vocal Tract Development Lab, Waisman Center, University of Wisconsin-Madison, Madison, Wisconsin, United States of America
| | - Seong Jae Hwang
- Department of Computer Science, University of Pittsburgh, Pittsburg, Pennsylvania, United States of America
| | - Kevin A. Buhr
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, Wisconsin, United States of America
| | - Courtney A. Miller
- Vocal Tract Development Lab, Waisman Center, University of Wisconsin-Madison, Madison, Wisconsin, United States of America
| | - Gregory D. Avey
- Department of Radiology, University of Wisconsin School of Medicine and Public Health, Madison, Wisconsin, United States of America
| | - Brad H. Story
- Speech, Language, and Hearing Sciences, University of Arizona, Tucson, Arizona, United States of America
| | - Houri K. Vorperian
- Vocal Tract Development Lab, Waisman Center, University of Wisconsin-Madison, Madison, Wisconsin, United States of America
- * E-mail:
| |
Collapse
|
10
|
Merritt B, Bent T. Revisiting the acoustics of speaker gender perception: A gender expansive perspective. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2022; 151:484. [PMID: 35105035 DOI: 10.1121/10.0009282] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/23/2021] [Accepted: 12/21/2021] [Indexed: 06/14/2023]
Abstract
Examinations of speaker gender perception have primarily focused on the roles of fundamental frequency (fo) and formant frequencies from structured speech tasks using cisgender speakers. Yet, there is evidence to suggest that fo and formants do not fully account for listeners' perceptual judgements of gender, particularly from connected speech. This study investigated the perceptual importance of fo, formant frequencies, articulation, and intonation in listeners' judgements of gender identity and masculinity/femininity from spontaneous speech from cisgender male and female speakers as well as transfeminine and transmasculine speakers. Stimuli were spontaneous speech samples from 12 speakers who are cisgender (6 female and 6 male) and 12 speakers who are transgender (6 transfeminine and 6 transmasculine). Listeners performed a two-alternative forced choice (2AFC) gender identification task and masculinity/femininity rating task in two experiments that manipulated which acoustic cues were available. Experiment 1 confirmed that fo and formant frequency manipulations were insufficient to alter listener judgements across all speakers. Experiment 2 demonstrated that articulatory cues had greater weighting than intonation cues on the listeners' judgements when the fo and formant frequencies were in a gender ambiguous range. These findings counter the assumptions that fo and formant manipulations are sufficient to effectively alter perceived speaker gender.
Collapse
Affiliation(s)
- Brandon Merritt
- Department of Speech, Language, and Hearing Sciences, Indiana University, Bloomington, Indiana 47408, USA
| | - Tessa Bent
- Department of Speech, Language, and Hearing Sciences, Indiana University, Bloomington, Indiana 47408, USA
| |
Collapse
|
11
|
Kent RD, Eichhorn JT, Vorperian HK. Acoustic parameters of voice in typically developing children ages 4-19 years. Int J Pediatr Otorhinolaryngol 2021; 142:110614. [PMID: 33450527 PMCID: PMC7902385 DOI: 10.1016/j.ijporl.2021.110614] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/19/2020] [Revised: 12/31/2020] [Accepted: 12/31/2020] [Indexed: 11/26/2022]
Abstract
OBJECTIVES Report data on acoustic measures of voice in sustained vowels produced by typically developing children, aged 4-19 years, to add to the cross-sectional reference values in a pediatric database. METHODS Recordings of sustained vowel/ɑ/phonation were obtained from 158 children (80 males, 78 females) aged 4-19 years who were judged to be typically developing with respect to speech and voice. Acoustic analyses were performed with the Multidimensional Voice Program (MDVP™) and the Analysis of Dysphonia in Speech and Voice (ADSV™), both from Pentax Medical. RESULTS Values from both MDVP and ADSV are reported for children in the following age cohorts: 4-6 years, 7-9 years, 10-12 years, 13-15 years, and 16-19 years. CONCLUSION The data in this study complement previously published data and contribute to a pediatric reference database useful for research and for clinical practice related to children's voice. Acoustic parameters most sensitive to age and sex are identified.
Collapse
Affiliation(s)
- Raymond D. Kent
- Waisman Center, University of Wisconsin-Madison, 1500 Highland Ave., Madison, WI 53705
| | - Julie T. Eichhorn
- Waisman Center, University of Wisconsin-Madison, 1500 Highland Ave., Madison, WI 53705
| | - Houri K. Vorperian
- Waisman Center, University of Wisconsin-Madison, 1500 Highland Ave., Madison, WI 53705
| |
Collapse
|
12
|
Fung P, Schertz J, Johnson EK. The development of gendered speech in children: Insights from adult L1 and L2 perceptions. JASA EXPRESS LETTERS 2021; 1:014407. [PMID: 36154089 DOI: 10.1121/10.0003322] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
Past studies have shown that boys and girls sound distinct by 4 years old, long before sexual dimorphisms in vocal anatomy develop. These gender differences are thought to be learned within a particular speech community. However, no study has asked whether listeners' sensitivity to gender in child speech is modulated by language experience. This study shows that gendered speech emerges at 2.5 years old, and that L1 listeners outperform L2 listeners in detecting these differences. The findings highlight the role of language-specific sociolinguistic factors in both speech perception and production, and show that gendered speech emerges earlier than previously suggested.
Collapse
Affiliation(s)
- Priscilla Fung
- Department of Psychology, University of Toronto, Mississauga, Ontario L5L 1C6, Canada
| | - Jessamyn Schertz
- Department of Language Studies, University of Toronto, Mississauga, Ontario L5L 1C6, Canada , ,
| | - Elizabeth K Johnson
- Department of Psychology, University of Toronto, Mississauga, Ontario L5L 1C6, Canada
| |
Collapse
|
13
|
Comparison of Habitual and High Pitch Phonation in Teachers With and Without Vocal Fatigue. J Voice 2020; 36:141.e1-141.e9. [DOI: 10.1016/j.jvoice.2020.04.016] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2019] [Revised: 03/28/2020] [Accepted: 04/01/2020] [Indexed: 11/23/2022]
|