1
|
Lavechin M, de Seyssel M, Métais M, Metze F, Mohamed A, Bredin H, Dupoux E, Cristia A. Modeling early phonetic acquisition from child-centered audio data. Cognition 2024; 245:105734. [PMID: 38335906 DOI: 10.1016/j.cognition.2024.105734] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2023] [Revised: 12/29/2023] [Accepted: 01/22/2024] [Indexed: 02/12/2024]
Abstract
Infants learn their native language(s) at an amazing speed. Before they even talk, their perception adapts to the language(s) they hear. However, the mechanisms responsible for this perceptual attunement and the circumstances in which it takes place remain unclear. This paper presents the first attempt to study perceptual attunement using ecological child-centered audio data. We show that a simple prediction algorithm exhibits perceptual attunement when applied on unrealistic clean audio-book data, but fails to do so when applied on ecologically-valid child-centered data. In the latter scenario, perceptual attunement only emerges when the prediction mechanism is supplemented with inductive biases that force the algorithm to focus exclusively on speech segments while learning speaker-, pitch-, and room-invariant representations. We argue these biases are plausible given previous research on infants and non-human animals. More generally, we show that what our model learns and how it develops through exposure to speech depends exquisitely on the details of the input signal. By doing so, we illustrate the importance of considering ecologically valid input data when modeling language acquisition.
Collapse
Affiliation(s)
- Marvin Lavechin
- Laboratoire de Sciences Cognitives et Psycholinguistique, Département d'Etudes Cognitives, ENS, EHESS, CNRS, PSL University, Paris, France; Cognitive Machine Learning Team, INRIA, Paris, France; Meta AI Research, Paris, France.
| | - Maureen de Seyssel
- Laboratoire de Sciences Cognitives et Psycholinguistique, Département d'Etudes Cognitives, ENS, EHESS, CNRS, PSL University, Paris, France; Cognitive Machine Learning Team, INRIA, Paris, France; Laboratoire de linguistique formelle, Université de Paris, CNRS, Paris, France
| | - Marianne Métais
- Laboratoire de Sciences Cognitives et Psycholinguistique, Département d'Etudes Cognitives, ENS, EHESS, CNRS, PSL University, Paris, France; Cognitive Machine Learning Team, INRIA, Paris, France
| | | | | | - Hervé Bredin
- Institut de Recherche en Informatique de Toulouse, Université de Toulouse, CNRS, Toulouse, France
| | - Emmanuel Dupoux
- Laboratoire de Sciences Cognitives et Psycholinguistique, Département d'Etudes Cognitives, ENS, EHESS, CNRS, PSL University, Paris, France; Cognitive Machine Learning Team, INRIA, Paris, France; Meta AI Research, Paris, France
| | - Alejandrina Cristia
- Laboratoire de Sciences Cognitives et Psycholinguistique, Département d'Etudes Cognitives, ENS, EHESS, CNRS, PSL University, Paris, France; Cognitive Machine Learning Team, INRIA, Paris, France
| |
Collapse
|
2
|
Bundgaard-Nielsen RL, O'Shannessy C, Wang Y, Nelson A, Bartlett J, Davis V. Two-part vowel modifications in Child Directed Speech in Warlpiri may enhance child attention to speech and scaffold noun acquisition. PHONETICA 2023; 0:phon-2022-0039. [PMID: 37314963 DOI: 10.1515/phon-2022-0039] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/28/2022] [Accepted: 05/08/2023] [Indexed: 06/16/2023]
Abstract
Study 1 compared vowels in Child Directed Speech (CDS; child ages 25-46 months) to vowels in Adult Directed Speech (ADS) in natural conversation in the Australian Indigenous language Warlpiri, which has three vowels (/i/, /a/, /u). Study 2 compared the vowels of the child interlocutors from Study 1 to caregiver ADS and CDS. Study 1 indicates that Warlpiri CDS vowels are characterised by fronting, /a/-lowering, f o -raising, and increased duration, but not vowel space expansion. Vowels in CDS nouns, however, show increased between-contrast differentiation and reduced within-contrast variation, similar to what has been reported for other languages. We argue that this two-part CDS modification process serves a dual purpose: Vowel space shifting induces IDS/CDS that sounds more child-like, which may enhance child attention to speech, while increased between-contrast differentiation and reduced within-contrast variation in nouns may serve didactic purposes by providing high-quality information about lexical specifications. Study 2 indicates that Warlpiri CDS vowels are more like child vowels, providing indirect evidence that aspects of CDS may serve non-linguistic purposes simultaneously with other aspects serving linguistic-didactic purposes. The studies have novel implications for the way CDS vowel modifications are considered and highlight the necessity of naturalistic data collection, novel analyses, and typological diversity.
Collapse
Affiliation(s)
- Rikke L Bundgaard-Nielsen
- MARCS Institute of Brain, Behaviour and Development, Western Sydney University, Westmead, NSW, Australia
- School of Literature, Languages and Linguistics, Australian National University, Canberra, ACT, Australia
| | - Carmel O'Shannessy
- School of Literature, Languages and Linguistics, Australian National University, Canberra, ACT, Australia
| | - Yizhou Wang
- School of Literature, Languages and Linguistics, Australian National University, Canberra, ACT, Australia
| | - Alice Nelson
- Red Dust Role Models, Alice Springs, NT, Australia
| | | | - Vanessa Davis
- School of Literature, Languages and Linguistics, Australian National University, Canberra, ACT, Australia
- Tangentyere Council Research Hub, Alice Springs, NT, Australia
| |
Collapse
|
3
|
Fukui H, Toyoshima K. Testosterone, oxytocin and co-operation: A hypothesis for the origin and function of music. Front Psychol 2023; 14:1055827. [PMID: 36860786 PMCID: PMC9968751 DOI: 10.3389/fpsyg.2023.1055827] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2022] [Accepted: 01/25/2023] [Indexed: 02/15/2023] Open
Abstract
Since the time of Darwin, theories have been proposed on the origin and functions of music; however, the subject remains enigmatic. The literature shows that music is closely related to important human behaviours and abilities, namely, cognition, emotion, reward and sociality (co-operation, entrainment, empathy and altruism). Notably, studies have deduced that these behaviours are closely related to testosterone (T) and oxytocin (OXT). The association of music with important human behaviours and neurochemicals is closely related to the understanding of reproductive and social behaviours being unclear. In this paper, we describe the endocrinological functions of human social and musical behaviour and demonstrate its relationship to T and OXT. We then hypothesised that the emergence of music is associated with behavioural adaptations and emerged as humans socialised to ensure survival. Moreover, the proximal factor in the emergence of music is behavioural control (social tolerance) through the regulation of T and OXT, and the ultimate factor is group survival through co-operation. The "survival value" of music has rarely been approached from the perspective of musical behavioural endocrinology. This paper provides a new perspective on the origin and functions of music.
Collapse
Affiliation(s)
- Hajime Fukui
- Nara University of Education, Nara, Japan,*Correspondence: Hajime Fukui, ✉
| | | |
Collapse
|
4
|
Naturalistic speech supports distributional learning across contexts. Proc Natl Acad Sci U S A 2022; 119:e2123230119. [PMID: 36095175 PMCID: PMC9499502 DOI: 10.1073/pnas.2123230119] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
At birth, infants discriminate most of the sounds of the world's languages, but by age 1, infants become language-specific listeners. This has generally been taken as evidence that infants have learned which acoustic dimensions are contrastive, or useful for distinguishing among the sounds of their language(s), and have begun focusing primarily on those dimensions when perceiving speech. However, speech is highly variable, with different sounds overlapping substantially in their acoustics, and after decades of research, we still do not know what aspects of the speech signal allow infants to differentiate contrastive from noncontrastive dimensions. Here we show that infants could learn which acoustic dimensions of their language are contrastive, despite the high acoustic variability. Our account is based on the cross-linguistic fact that even sounds that overlap in their acoustics differ in the contexts they occur in. We predict that this should leave a signal that infants can pick up on and show that acoustic distributions indeed vary more by context along contrastive dimensions compared with noncontrastive dimensions. By establishing this difference, we provide a potential answer to how infants learn about sound contrasts, a question whose answer in natural learning environments has remained elusive.
Collapse
|
5
|
Cychosz M. LANGUAGE EXPOSURE PREDICTS CHILDREN'S PHONETIC PATTERNING: EVIDENCE FROM LANGUAGE SHIFT. LANGUAGE 2022; 98:461-509. [PMID: 37034148 PMCID: PMC10079255 DOI: 10.1353/lan.0.0269] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
Although understanding the role of the environment is central to language acquisition theory, rarely has this been studied for children's phonetic development, and receptive and expressive language experiences in the environment are not distinguished. This last distinction may be crucial for child speech production in particular, because production requires coordination of low-level speech-motor planning with high-level linguistic knowledge. In this study, the role of the environment is evaluated in a novel way-by studying phonetic development in a bilingual community undergoing rapid language shift. This sociolinguistic context provides a naturalistic gradient of the amount of children's exposure to two languages and the ratio of expressive to receptive experiences. A large-scale child language corpus encompassing over 500 hours of naturalistic South Bolivian Quechua and Spanish speech was efficiently annotated for children's and their caregivers' bilingual language use. These estimates were correlated with children's patterns in a series of speech production tasks. The role of the environment varied by outcome: children's expressive language experience best predicted their performance on a coarticulation-morphology measure, while their receptive experience predicted performance on a lower-level measure of vowel variability. Overall these bilingual exposure effects suggest a pathway for children's role in language change whereby language shift can result in different learning outcomes within a single speech community. Appropriate ways to model language exposure in development are discussed.
Collapse
|
6
|
Piazza G, Martin CD, Kalashnikova M. The Acoustic Features and Didactic Function of Foreigner-Directed Speech: A Scoping Review. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2022; 65:2896-2918. [PMID: 35914012 DOI: 10.1044/2022_jslhr-21-00609] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
PURPOSE This scoping review considers the acoustic features of a clear speech register directed to nonnative listeners known as foreigner-directed speech (FDS). We identify vowel hyperarticulation and low speech rate as the most representative acoustic features of FDS; other features, including wide pitch range and high intensity, are still under debate. We also discuss factors that may influence the outcomes and characteristics of FDS. We start by examining accommodation theories, outlining the reasons why FDS is likely to serve a didactic function by helping listeners acquire a second language (L2). We examine how this speech register adapts to listeners' identities and linguistic needs, suggesting that FDS also takes listeners' L2 proficiency into account. To confirm the didactic function of FDS, we compare it to other clear speech registers, specifically infant-directed speech and Lombard speech. CONCLUSIONS Our review reveals that research has not yet established whether FDS succeeds as a didactic tool that supports L2 acquisition. Moreover, a complex set of factors determines specific realizations of FDS, which need further exploration. We conclude by summarizing open questions and indicating directions and recommendations for future research.
Collapse
Affiliation(s)
- Giorgio Piazza
- Basque Center on Cognition, Brain and Language (BCBL), Donostia-San Sebastián, Spain
- Department of Social Sciences and Law, Universidad del País Vasco/Euskal Herriko Unibertsitatea, Donostia-San Sebastián, Spain
| | - Clara D Martin
- Basque Center on Cognition, Brain and Language (BCBL), Donostia-San Sebastián, Spain
- Ikerbasque, Basque Foundation for Science, Bilbao, Spain
| | - Marina Kalashnikova
- Basque Center on Cognition, Brain and Language (BCBL), Donostia-San Sebastián, Spain
- Ikerbasque, Basque Foundation for Science, Bilbao, Spain
| |
Collapse
|
7
|
Zacharaki K, Sebastian-Galles N. Before perceptual narrowing: The emergence of the native sounds of language. INFANCY 2022; 27:900-915. [PMID: 35632983 DOI: 10.1111/infa.12486] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2021] [Revised: 04/15/2022] [Accepted: 04/20/2022] [Indexed: 11/28/2022]
Abstract
The present study investigates the precursors of representations of phonemes in 4.5-month-olds. The emergence of phonemes has been mainly studied within the framework of perceptual narrowing, that is, infants tuning to their native language and losing sensitivity to non-native speech. One of the mechanisms behind this phenomenon is distributional learning. In this article, we tested the preference of 4.5-month-old infants using lists of pseudowords that resemble the vowel distribution of the native or a non-native language. We found that infants prefer listening to the lists mirroring the native language. The results suggest that infants can extract vowel information from novel stimuli, and they can map it on pre-existing knowledge on vowels that leads to a preference for the native lists.
Collapse
|
8
|
Long-term priors constrain category learning in the context of short-term statistical regularities. Psychon Bull Rev 2022; 29:1925-1937. [PMID: 35524011 DOI: 10.3758/s13423-022-02114-z] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/27/2022] [Indexed: 11/08/2022]
Abstract
Cognitive systems face a constant tension of maintaining existing representations that have been fine-tuned to long-term input regularities and adapting representations to meet the needs of short-term input that may deviate from long-term norms. Systems must balance the stability of long-term representations with plasticity to accommodate novel contexts. We investigated the interaction between perceptual biases or priors acquired across the long-term and sensitivity to statistical regularities introduced in the short-term. Participants were first passively exposed to short-term acoustic regularities and then learned categories in a supervised training task that either conflicted or aligned with long-term perceptual priors. We found that the long-term priors had robust and pervasive impact on categorization behavior. In contrast, behavior was not influenced by the nature of the short-term passive exposure. These results demonstrate that perceptual priors place strong constraints on the course of learning and that short-term passive exposure to acoustic regularities has limited impact on directing subsequent category learning.
Collapse
|
9
|
Natural Infant-Directed Speech Facilitates Neural Tracking of Prosody. Neuroimage 2022; 251:118991. [PMID: 35158023 DOI: 10.1016/j.neuroimage.2022.118991] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2021] [Revised: 02/02/2022] [Accepted: 02/10/2022] [Indexed: 01/04/2023] Open
Abstract
Infants prefer to be addressed with infant-directed speech (IDS). IDS benefits language acquisition through amplified low-frequency amplitude modulations. It has been reported that this amplification increases electrophysiological tracking of IDS compared to adult-directed speech (ADS). It is still unknown which particular frequency band triggers this effect. Here, we compare tracking at the rates of syllables and prosodic stress, which are both critical to word segmentation and recognition. In mother-infant dyads (n=30), mothers described novel objects to their 9-month-olds while infants' EEG was recorded. For IDS, mothers were instructed to speak to their children as they typically do, while for ADS, mothers described the objects as if speaking with an adult. Phonetic analyses confirmed that pitch features were more prototypically infant-directed in the IDS-condition compared to the ADS-condition. Neural tracking of speech was assessed by speech-brain coherence, which measures the synchronization between speech envelope and EEG. Results revealed significant speech-brain coherence at both syllabic and prosodic stress rates, indicating that infants track speech in IDS and ADS at both rates. We found significantly higher speech-brain coherence for IDS compared to ADS in the prosodic stress rate but not the syllabic rate. This indicates that the IDS benefit arises primarily from enhanced prosodic stress. Thus, neural tracking is sensitive to parents' speech adaptations during natural interactions, possibly facilitating higher-level inferential processes such as word segmentation from continuous speech.
Collapse
|
10
|
Lovcevic I, Burnham D, Kalashnikova M. Language development in infants with hearing loss: Benefits of infant-directed speech. Infant Behav Dev 2022; 67:101699. [PMID: 35123319 DOI: 10.1016/j.infbeh.2022.101699] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2021] [Revised: 01/25/2022] [Accepted: 01/27/2022] [Indexed: 11/28/2022]
Abstract
The majority of infants with permanent congenital hearing loss fall significantly behind their normal hearing peers in the development of receptive and expressive oral communication skills. Independent of any prosthetic intervention ("hardware") for infants with hearing loss, the social and linguistic environment ("software") can still be optimal or sub-optimal and so can exert significant positive or negative effects on speech and language acquisition, with far-reaching beneficial or adverse effects, respectively. This review focusses on the nature of the social and linguistic environment of infants with hearing loss, in particular others' speech to infants. The nature of this "infant-directed speech" and its effects on language development has been studied extensively in hearing infants but far less comprehensively in infants with hearing loss. Here, literature on the nature of infant-directed speech and its impact on the speech perception and language acquisition in infants with hearing loss is reviewed. The review brings together evidence on the little-studied effects of infant-directed speech on speech and language development in infants with hearing loss, and provides suggestions, over and above early screening and external treatment, for a natural intervention at the level of the carer-infant microcosm that may well optimize the early linguistic experiences and mitigate later adverse effects for infants born with hearing loss.
Collapse
Affiliation(s)
- Irena Lovcevic
- International Research Center for Neurointelligence (IRCN), The University of Tokyo, 7-3-1 Hongo Bunkyo-ku, Tokyo 113-0033, Japan.
| | - Denis Burnham
- The MARCS Institute for Brain, Behaviour and Development, Western Sydney University, Locked Bag 1797, Penrith, NSW 2751, Australia
| | - Marina Kalashnikova
- The MARCS Institute for Brain, Behaviour and Development, Western Sydney University, Locked Bag 1797, Penrith, NSW 2751, Australia; BCBL. Basque Center for Cognition, Brain and Language, Paseo Mikeletegi 69, San Sebastian-Donostia, Guipuzcoa 2004, Spain; IKERBASQUE, Basque Foundation for Science, Bilbao, Spain
| |
Collapse
|
11
|
Feldman NH, Goldwater S, Dupoux E, Schatz T. Do Infants Really Learn Phonetic Categories? OPEN MIND 2022; 5:113-131. [PMID: 35024527 PMCID: PMC8746127 DOI: 10.1162/opmi_a_00046] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2020] [Accepted: 08/06/2021] [Indexed: 11/16/2022] Open
Abstract
Early changes in infants’ ability to perceive native and nonnative speech sound contrasts are typically attributed to their developing knowledge of phonetic categories. We critically examine this hypothesis and argue that there is little direct evidence of category knowledge in infancy. We then propose an alternative account in which infants’ perception changes because they are learning a perceptual space that is appropriate to represent speech, without yet carving up that space into phonetic categories. If correct, this new account has substantial implications for understanding early language development.
Collapse
Affiliation(s)
- Naomi H Feldman
- Department of Linguistics and UMIACS, University of Maryland, College Park, MD, USA
| | | | - Emmanuel Dupoux
- Cognitive Machine Learning (ENS - EHESS - PSL Research University - CNRS - INRIA), Paris, France
| | - Thomas Schatz
- Department of Linguistics and UMIACS, University of Maryland, College Park, MD, USA
| |
Collapse
|
12
|
Wang L, Kalashnikova M, Kager R, Lai R, Wong PCM. Lexical and Prosodic Pitch Modifications in Cantonese Infant-directed Speech. JOURNAL OF CHILD LANGUAGE 2021; 48:1235-1261. [PMID: 33531090 DOI: 10.1017/s0305000920000707] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
The functions of acoustic-phonetic modifications in infant-directed speech (IDS) remain a question: do they specifically serve to facilitate language learning via enhanced phonemic contrasts (the hyperarticulation hypothesis) or primarily to improve communication via prosodic exaggeration (the prosodic hypothesis)? The study of lexical tones provides a unique opportunity to shed light on this, as lexical tones are phonemically contrastive, yet their primary cue, pitch, is also a prosodic cue. This study investigated Cantonese IDS and found increased intra-talker variation of lexical tones, which more likely posed a challenge to rather than facilitated phonetic learning. Although tonal space was expanded which could facilitate phonetic learning, its expansion was a function of overall intonational modifications. Similar findings were observed in speech to pets who should not benefit from larger phonemic distinction. We conclude that lexical-tone adjustments in IDS mainly serve to broadly enhance communication rather than specifically increase phonemic contrast for learners.
Collapse
Affiliation(s)
- Luchang Wang
- Department of Linguistics and Modern Languages, The Chinese University of Hong Kong, Hong Kong, China
- Brain and Mind Institute, The Chinese University of Hong Kong, Hong Kong, China
| | - Marina Kalashnikova
- Basque Center on Cognition, Brain and Language, San Sebastian, Spain
- IKERBASQUE, Basque Foundation for Science, Bilbao, Spain
| | - René Kager
- Utrecht Institute of Linguistics OTS, Utrecht University, Utrecht, Netherlands
| | - Regine Lai
- Department of Linguistics and Modern Languages, The Chinese University of Hong Kong, Hong Kong, China
| | - Patrick C M Wong
- Department of Linguistics and Modern Languages, The Chinese University of Hong Kong, Hong Kong, China
- Brain and Mind Institute, The Chinese University of Hong Kong, Hong Kong, China
| |
Collapse
|
13
|
McClay EK, Cebioglu S, Broesch T, Yeung HH. Rethinking the phonetics of baby-talk: Differences across Canada and Vanuatu in the articulation of mothers' speech to infants. Dev Sci 2021; 25:e13180. [PMID: 34633716 DOI: 10.1111/desc.13180] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2021] [Revised: 09/04/2021] [Accepted: 09/15/2021] [Indexed: 11/29/2022]
Abstract
Infant-directed speech (IDS) is phonetically distinct from adult-directed speech (ADS): It is typically considered to have special prosody-like higher pitch and slower speaking rates-as well as unique speech sound properties, for example, more breathy, hyperarticulated, and/or variable consonant and vowel articulation. These phonetic features are widely observed in the IDS of caregivers from urbanized contexts who speak a handful of very well-researched languages. Yet studies with more diverse socio-cultural and linguistic samples show that this "typical" IDS prosody is not consistently observed across cultures. We extended cross-cultural work by examining IDS speech segment articulation, which-like prosody-is also thought to be a characteristic phonetic feature of IDS that might aid speech and language development. Here we asked whether IDS vowels have different articulatory features compared to ADS vowels in two distinct linguistic and socio-cultural contexts: urban English-speaking Canadian mothers, and rural Lenakel- and Southwest Tanna-speaking ni-Vanuatu mothers (n = 57, 20-46 years of age). Replicating prior work, Canadian mothers had more variable vowels in IDS compared to ADS, but also did not show clear register differences for breathiness or hyperarticulation. Vowels spoken by ni-Vanuatu mothers showed very distinct articulatory tendencies, using less variable (and less breathy) IDS vowels. Along with other work showing diversity in IDS phonetics across populations, this paper suggests that any understanding of how IDS might aid speech and language development are best examined through a culturally- and linguistically-specific lens.
Collapse
Affiliation(s)
- Elise K McClay
- Department of Linguistics, Simon Fraser University, Burnaby, BC, Canada
| | - Senay Cebioglu
- Department of Psychology, Simon Fraser University, Burnaby, BC, Canada.,Department of Human Behavior, Ecology and Culture, Max Planck Institute for Evolutionary Anthropology, Deutscher Platz 6, Leipzig, 04103, Germany
| | - Tanya Broesch
- Department of Human Behavior, Ecology and Culture, Max Planck Institute for Evolutionary Anthropology, Deutscher Platz 6, Leipzig, 04103, Germany
| | - H Henny Yeung
- Department of Linguistics, Simon Fraser University, Burnaby, BC, Canada.,Integrative Neuroscience and Cognition Center, UMR 8002, Centre national de la recherche scientifique (CNRS) & Université de Paris, Paris, 75006, France
| |
Collapse
|
14
|
Kostilainen K, Partanen E, Mikkola K, Wikström V, Pakarinen S, Fellman V, Huotilainen M. Repeated Parental Singing During Kangaroo Care Improved Neural Processing of Speech Sound Changes in Preterm Infants at Term Age. Front Neurosci 2021; 15:686027. [PMID: 34539329 PMCID: PMC8446605 DOI: 10.3389/fnins.2021.686027] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2021] [Accepted: 07/20/2021] [Indexed: 11/13/2022] Open
Abstract
Preterm birth carries a risk for adverse neurodevelopment. Cognitive dysfunctions, such as language disorders may manifest as atypical sound discrimination already in early infancy. As infant-directed singing has been shown to enhance language acquisition in infants, we examined whether parental singing during skin-to-skin care (kangaroo care) improves speech sound discrimination in preterm infants. Forty-five preterm infants born between 26 and 33 gestational weeks (GW) and their parents participated in this cluster-randomized controlled trial (ClinicalTrials ID IRB00003181SK). In both groups, parents conducted kangaroo care during 33-40 GW. In the singing intervention group (n = 24), a certified music therapist guided parents to sing or hum during daily kangaroo care. In the control group (n = 21), parents conducted standard kangaroo care and were not instructed to use their voices. Parents in both groups reported the duration of daily intervention. Auditory event-related potentials were recorded with electroencephalogram at term age using a multi-feature paradigm consisting of phonetic and emotional speech sound changes and a one-deviant oddball paradigm with pure tones. In the multi-feature paradigm, prominent mismatch responses (MMR) were elicited to the emotional sounds and many of the phonetic deviants in the singing intervention group and in the control group to some of the emotional and phonetic deviants. A group difference was found as the MMRs were larger in the singing intervention group, mainly due to larger MMRs being elicited to the emotional sounds, especially in females. The overall duration of the singing intervention (range 15-63 days) was positively associated with the MMR amplitudes for both phonetic and emotional stimuli in both sexes, unlike the daily singing time (range 8-120 min/day). In the oddball paradigm, MMRs for the non-speech sounds were elicited in both groups and no group differences nor connections between the singing time and the response amplitudes were found. These results imply that repeated parental singing during kangaroo care improved auditory discrimination of phonetic and emotional speech sounds in preterm infants at term age. Regular singing routines can be recommended for parents to promote the development of the auditory system and auditory processing of speech sounds in preterm infants.
Collapse
Affiliation(s)
- Kaisamari Kostilainen
- Cognitive Brain Research Unit, Department of Psychology and Logopedics, Faculty of Medicine, University of Helsinki, Helsinki, Finland
| | - Eino Partanen
- Cognitive Brain Research Unit, Department of Psychology and Logopedics, Faculty of Medicine, University of Helsinki, Helsinki, Finland
| | - Kaija Mikkola
- New Children's Hospital, Pediatric Research Center, Neonatology, Department of Pediatrics, University of Helsinki and Helsinki University Hospital, Helsinki, Finland
| | - Valtteri Wikström
- Cognitive Brain Research Unit, Department of Psychology and Logopedics, Faculty of Medicine, University of Helsinki, Helsinki, Finland
| | - Satu Pakarinen
- Cognitive Brain Research Unit, Department of Psychology and Logopedics, Faculty of Medicine, University of Helsinki, Helsinki, Finland
| | - Vineta Fellman
- Pediatrics, Department of Clinical Sciences, Lund University, Lund, Sweden.,Children's Hospital, University of Helsinki, Helsinki, Finland
| | - Minna Huotilainen
- Cognitive Brain Research Unit, Department of Psychology and Logopedics, Faculty of Medicine, University of Helsinki, Helsinki, Finland.,CICERO Learning Network, Faculty of Educational Sciences, University of Helsinki, Helsinki, Finland
| |
Collapse
|
15
|
Ludusan B, Mazuka R, Dupoux E. Does Infant-Directed Speech Help Phonetic Learning? A Machine Learning Investigation. Cogn Sci 2021; 45:e12946. [PMID: 34018231 DOI: 10.1111/cogs.12946] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2019] [Revised: 12/17/2020] [Accepted: 12/31/2020] [Indexed: 11/28/2022]
Abstract
A prominent hypothesis holds that by speaking to infants in infant-directed speech (IDS) as opposed to adult-directed speech (ADS), parents help them learn phonetic categories. Specifically, two characteristics of IDS have been claimed to facilitate learning: hyperarticulation, which makes the categories more separable, and variability, which makes the generalization more robust. Here, we test the separability and robustness of vowel category learning on acoustic representations of speech uttered by Japanese adults in ADS, IDS (addressed to 18- to 24-month olds), or read speech (RS). Separability is determined by means of a distance measure computed between the five short vowel categories of Japanese, while robustness is assessed by testing the ability of six different machine learning algorithms trained to classify vowels to generalize on stimuli spoken by a novel speaker in ADS. Using two different speech representations, we find that hyperarticulated speech, in the case of RS, can yield better separability, and that increased between-speaker variability in ADS can yield, for some algorithms, more robust categories. However, these conclusions do not apply to IDS, which turned out to yield neither more separable nor more robust categories compared to ADS inputs. We discuss the usefulness of machine learning algorithms run on real data to test hypotheses about the functional role of IDS.
Collapse
Affiliation(s)
- Bogdan Ludusan
- Laboratory for Language Development, RIKEN Center for Brain Science.,Phonetics Workgroup, Faculty of Linguistics and Literary Studies, Bielefeld University
| | - Reiko Mazuka
- Laboratory for Language Development, RIKEN Center for Brain Science.,Department of Psychology and Neuroscience, Duke University
| | - Emmanuel Dupoux
- Laboratoire de Sciences Cognitives et Psycholiguistique, EHESS/ENS/PSL/CNRS/INRIA
| |
Collapse
|
16
|
Woodard K, Plate RC, Morningstar M, Wood A, Pollak SD. Categorization of Vocal Emotion Cues Depends on Distributions of Input. AFFECTIVE SCIENCE 2021; 2:301-310. [PMID: 33870212 PMCID: PMC8035059 DOI: 10.1007/s42761-021-00038-w] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/27/2020] [Accepted: 02/09/2021] [Indexed: 01/23/2023]
Abstract
Learners use the distributional properties of stimuli to identify environmentally relevant categories in a range of perceptual domains, including words, shapes, faces, and colors. We examined whether similar processes may also operate on affective information conveyed through the voice. In Experiment 1, we tested how adults (18–22-year-olds) and children (8–10-year-olds) categorized affective states communicated by vocalizations varying continuously from “calm” to “upset.” We found that the threshold for categorizing both verbal (i.e., spoken word) and nonverbal (i.e., a yell) vocalizations as “upset” depended on the statistical distribution of the stimuli participants encountered. In Experiment 2, we replicated and extended these findings in adults using vocalizations that conveyed multiple negative affect states. These results suggest perceivers’ flexibly and rapidly update their interpretation of affective vocal cues based upon context.
Collapse
Affiliation(s)
- Kristina Woodard
- Department of Psychology, University of Wisconsin – Madison, 1500 Highland Avenue, Madison, WI 53705 USA
| | - Rista C. Plate
- Department of Psychology, University of Wisconsin – Madison, 1500 Highland Avenue, Madison, WI 53705 USA
- Department of Psychology, University of Pennsylvania, 3720 Walnut Street, Philadelphia, PA 19104 USA
| | | | - Adrienne Wood
- Department of Psychology, University of Virginia, 485 McCormick Rd, Charlottesville, VA 22904 USA
| | - Seth D. Pollak
- Department of Psychology, University of Wisconsin – Madison, 1500 Highland Avenue, Madison, WI 53705 USA
| |
Collapse
|
17
|
Schatz T, Feldman NH, Goldwater S, Cao XN, Dupoux E. Early phonetic learning without phonetic categories: Insights from large-scale simulations on realistic input. Proc Natl Acad Sci U S A 2021; 118:e2001844118. [PMID: 33510040 PMCID: PMC7924220 DOI: 10.1073/pnas.2001844118] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Before they even speak, infants become attuned to the sounds of the language(s) they hear, processing native phonetic contrasts more easily than nonnative ones. For example, between 6 to 8 mo and 10 to 12 mo, infants learning American English get better at distinguishing English and [l], as in "rock" vs. "lock," relative to infants learning Japanese. Influential accounts of this early phonetic learning phenomenon initially proposed that infants group sounds into native vowel- and consonant-like phonetic categories-like and [l] in English-through a statistical clustering mechanism dubbed "distributional learning." The feasibility of this mechanism for learning phonetic categories has been challenged, however. Here, we demonstrate that a distributional learning algorithm operating on naturalistic speech can predict early phonetic learning, as observed in Japanese and American English infants, suggesting that infants might learn through distributional learning after all. We further show, however, that, contrary to the original distributional learning proposal, our model learns units too brief and too fine-grained acoustically to correspond to phonetic categories. This challenges the influential idea that what infants learn are phonetic categories. More broadly, our work introduces a mechanism-driven approach to the study of early phonetic learning, together with a quantitative modeling framework that can handle realistic input. This allows accounts of early phonetic learning to be linked to concrete, systematic predictions regarding infants' attunement.
Collapse
Affiliation(s)
- Thomas Schatz
- Department of Linguistics, University of Maryland, College Park, MD 20742;
- University of Maryland Institute for Advanced Computer Studies, University of Maryland, College Park, MD 20742
| | - Naomi H Feldman
- Department of Linguistics, University of Maryland, College Park, MD 20742
- University of Maryland Institute for Advanced Computer Studies, University of Maryland, College Park, MD 20742
| | - Sharon Goldwater
- School of Informatics, University of Edinburgh, Edinburgh EH8 9AB, United Kingdom
| | - Xuan-Nga Cao
- Cognitive Machine Learning, École Normale Supérieure-École des Hautes Études en Sciences Sociales-Paris Sciences et Lettres Research University-CNRS-Institut National de Recherche en Informatique et en Automatique, 75012 Paris, France
| | - Emmanuel Dupoux
- Cognitive Machine Learning, École Normale Supérieure-École des Hautes Études en Sciences Sociales-Paris Sciences et Lettres Research University-CNRS-Institut National de Recherche en Informatique et en Automatique, 75012 Paris, France
- Facebook A.I. Research, 75002 Paris, France
| |
Collapse
|
18
|
Falk S, Fasolo M, Genovese G, Romero‐Lauro L, Franco F. Sing for me, Mama! Infants' discrimination of novel vowels in song. INFANCY 2021; 26:248-270. [DOI: 10.1111/infa.12387] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2020] [Revised: 12/08/2020] [Accepted: 12/14/2020] [Indexed: 01/22/2023]
Affiliation(s)
- Simone Falk
- Department of Linguistics and Translation University of Montreal, Montreal Quebec Canada
- Laboratory for Brain, Music and Sound Research (BRAMS) University of Montreal, Montreal Quebec Canada
| | - Mirco Fasolo
- Department of Neuroscience, Imaging, and Clinical Sciences University "G. d'Annunzio" of Chieti‐Pescara Chieti Italy
| | | | - Leonor Romero‐Lauro
- Department of Psychology University of Milan‐Bicocca Milano Italy
- Neuromi Milan Center for Neuroscience Milano Italy
| | - Fabia Franco
- Department of Psychology Faculty of Science and Technology Middlesex University London London UK
| |
Collapse
|
19
|
Lovcevic I, Kalashnikova M, Burnham D. Acoustic features of infant-directed speech to infants with hearing loss. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2020; 148:3399. [PMID: 33379914 DOI: 10.1121/10.0002641] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/10/2020] [Accepted: 10/23/2020] [Indexed: 06/12/2023]
Abstract
This study investigated the effects of hearing loss and hearing experience on the acoustic features of infant-directed speech (IDS) to infants with hearing loss (HL) compared to controls with normal hearing (NH) matched by either chronological or hearing age (experiment 1) and across development in infants with hearing loss as well as the relation between IDS features and infants' developing lexical abilities (experiment 2). Both experiments included detailed acoustic analyses of mothers' productions of the three corner vowels /a, i, u/ and utterance-level pitch in IDS and in adult-directed speech. Experiment 1 demonstrated that IDS to infants with HL was acoustically more variable than IDS to hearing-age matched infants with NH. Experiment 2 yielded no changes in IDS features over development; however, the results did show a positive relationship between formant distances in mothers' speech and infants' concurrent receptive vocabulary size, as well as between vowel hyperarticulation and infants' expressive vocabulary. These findings suggest that despite infants' HL and thus diminished access to speech input, infants with HL are exposed to IDS with generally similar acoustic qualities as are infants with NH. However, some differences persist, indicating that infants with HL might receive less intelligible speech.
Collapse
Affiliation(s)
- Irena Lovcevic
- The MARCS Institute for Brain, Behaviour and Development, Western Sydney University, Locked Bag 1797, Penrith NSW 2751, Australia
| | - Marina Kalashnikova
- BCBL, Basque Center on Cognition, Brain and Language, Mikeletegi Pasealekua, 69, Donostia, Gipuzkoa 20009, Spain
| | - Denis Burnham
- The MARCS Institute for Brain, Behaviour and Development, Western Sydney University, Locked Bag 1797, Penrith NSW 2751, Australia
| |
Collapse
|
20
|
Laing C, Bergelson E. From babble to words: Infants' early productions match words and objects in their environment. Cogn Psychol 2020; 122:101308. [PMID: 32504852 PMCID: PMC7572567 DOI: 10.1016/j.cogpsych.2020.101308] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2020] [Revised: 04/30/2020] [Accepted: 05/15/2020] [Indexed: 11/25/2022]
Abstract
Infants' early babbling allows them to engage in proto-conversations with caretakers, well before clearly articulated, meaningful words are part of their productive lexicon. Moreover, the well-rehearsed sounds from babble serve as a perceptual 'filter', drawing infants' attention towards words that match the sounds they can reliably produce. Using naturalistic home recordings of 44 10-11-month-olds (an age with high variability in early speech sound production), this study tests whether infants' early consonant productions match words and objects in their environment. We find that infants' babble matches the consonants produced in their caregivers' speech. Infants with a well-established consonant repertoire also match their babble to objects in their environment. Our findings show that infants' early consonant productions are shaped by their input: by 10 months, the sounds of babble match what infants see and hear.
Collapse
Affiliation(s)
- Catherine Laing
- Centre for Language and Communication Research, Cardiff University, UK; Department of Psychology and Neuroscience, Duke University, USA.
| | - Elika Bergelson
- Centre for Language and Communication Research, Cardiff University, UK; Department of Psychology and Neuroscience, Duke University, USA
| |
Collapse
|
21
|
Choi J, Kim S, Cho T. An apparent-time study of an ongoing sound change in Seoul Korean: A prosodic account. PLoS One 2020; 15:e0240682. [PMID: 33091043 PMCID: PMC7580931 DOI: 10.1371/journal.pone.0240682] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2020] [Accepted: 10/01/2020] [Indexed: 11/20/2022] Open
Abstract
In present-day Seoul Korean, the primary phonetic feature for the lenis–aspirated stop distinction is shifting from VOT to F0. Some previous studies have considered this sound change to be a tonogenesis, whereby the low-level F0 perturbation has developed into tonal features (L for the lenis and H for the aspirated) in the segmental phonology. They, however, have examined the stop distinction only at a phrase- or utterance-initial position. We newly explore the sound change in relation to various prosodic structural factors (position and prominence). Apparent-time production data were recorded from four speaker groups: young female, young male, old female, old male. The way the speakers use VOT versus F0 indeed varies as a function of position and prominence. Crucially, in all groups, VOT is still used for the lenis–aspirated distinction phrase-medially due to the lenis stop voicing. This role of VOT, however, is found only in the non-prominent (unfocused) condition, in which the F0 difference is reduced to a low-level perturbation effect. In the prominent (focused) context in which tones come into play, the role of VOT diminishes, led by young female speakers. These can be interpreted as a prosodically-conditioned, complementary use of the features to maintain sufficient contrast. Importantly, however, the tonal difference under focus is not bidirectionally polarized, so that F0 is not lowered for the lenis stop. A lack of direct enhancement of the distinctive L tone weakens a possibility that F0 is transphonologized to the phonemic feature system of the language. As an alternative to the view that tonal features are newly introduced in the segmental phonology, we propose a prosodic account: the sound change is best characterized as a prosodically-conditioned change in the use of the segmental voicing feature (implemented by VOT) versus already available post-lexical tones in the intonational phonology of Korean.
Collapse
Affiliation(s)
- Jiyoun Choi
- Department of Social Psychology, Sookmyung Women’s University, Seoul, Korea
| | - Sahyang Kim
- Department of English Education, Hongik University, Seoul, Korea
| | - Taehong Cho
- Department of English Language and Literature, Hanyang Institute for Phonetics and Cognitive Science, Hanyang University, Seoul, Korea
- * E-mail:
| |
Collapse
|
22
|
Kostilainen K, Partanen E, Mikkola K, Wikström V, Pakarinen S, Fellman V, Huotilainen M. Neural processing of changes in phonetic and emotional speech sounds and tones in preterm infants at term age. Int J Psychophysiol 2020; 148:111-118. [DOI: 10.1016/j.ijpsycho.2019.10.009] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2018] [Revised: 09/10/2019] [Accepted: 10/23/2019] [Indexed: 10/25/2022]
|
23
|
Dilley L, Gamache J, Wang Y, Houston DM, Bergeson TR. Statistical distributions of consonant variants in infant-directed speech: evidence that /t/ may be exceptional. JOURNAL OF PHONETICS 2019; 75:73-87. [PMID: 32884162 PMCID: PMC7467459 DOI: 10.1016/j.wocn.2019.05.004] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Statistical distributions of phonetic variants in spoken language influence speech perception for both language learners and mature users. We theorized that patterns of phonetic variant processing of consonants demonstrated by adults might stem in part from patterns of early exposure to statistics of phonetic variants in infant-directed (ID) speech. In particular, we hypothesized that ID speech might involve greater proportions of canonical /t/ pronunciations compared to adult-directed (AD) speech in at least some phonological contexts. This possibility was tested using a corpus of spontaneous speech of mothers speaking to other adults, or to their typically-developing infant. Tokens of word-final alveolar stops - including /t/, /d/, and the nasal stop /n/ - were examined in assimilable contexts (i.e., those followed by a word-initial labial and/or velar); these were classified as canonical, assimilated, deleted, or glottalized. Results confirmed that there were significantly more canonical pronunciations in assimilable contexts in ID compared with AD speech, an effect which was driven by the phoneme /t/. These findings suggest that at least in phonological contexts involving possible assimilation, children are exposed to more canonical /t/ variant pronunciations than adults are. This raises the possibility that perceptual processing of canonical /t/ may be partly attributable to exposure to canonical /t/ variants in ID speech. Results support the need for further research into how statistics of variant pronunciations in early language input may shape speech processing across the lifespan.
Collapse
Affiliation(s)
- Laura Dilley
- Department of Communicative Sciences and Disorders, Michigan State University
| | - Jessica Gamache
- Department of Linguistics and Germanic, Slavic, Asian and African Languages, Michigan State University
| | - Yuanyuan Wang
- Department of Otolaryngology, The Ohio State University
| | | | - Tonya R. Bergeson
- Dept. of Otolaryngology – Head & Neck Surgery, Indiana University School of Medicine
- Department of Communication Sciences and Disorders, Butler University
| |
Collapse
|
24
|
Swingley D. Learning phonology from surface distributions, considering Dutch and English vowel duration. LANGUAGE LEARNING AND DEVELOPMENT : THE OFFICIAL JOURNAL OF THE SOCIETY FOR LANGUAGE DEVELOPMENT 2019; 15:199-216. [PMID: 31607832 PMCID: PMC6788768 DOI: 10.1080/15475441.2018.1562927] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
In learning language, children must discover how to interpret the linguistic significance of phonetic variation. On some accounts, receptive phonology is grounded in perceptual learning of phonetic categories from phonetic distributions drawn over the infant's sample of speech. On other accounts, receptive phonology is instead based on phonetic generalizations over the words in the lexicon. Tests of these hypotheses have been rare and indirect, usually making use of idealized estimates of phonetic variation. Here we evaluated these hypotheses, using as our test case English and Dutch toddlers' different interpretation of the lexical significance of vowel duration. Analysis of thousands of vowels of one Dutch and three English mothers' speech suggests that children's language-specific differences in interpretation of vowel duration are likely due to detection of lexically specific patterns, rather than bimodality in raw phonetic distributions.
Collapse
|
25
|
Chong AJ, Vicenik C, Sundara M. Intonation Plays a Role in Language Discrimination by Infants. INFANCY 2018. [DOI: 10.1111/infa.12257] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
Affiliation(s)
- Adam J. Chong
- Department of Linguistics Queen Mary University of London
- Department of Linguistics University of California Los Angeles
| | - Chad Vicenik
- Department of Linguistics University of California Los Angeles
| | - Megha Sundara
- Department of Linguistics University of California Los Angeles
| |
Collapse
|
26
|
Adriaans F. Effects of consonantal context on the learnability of vowel categories from infant-directed speech. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2018; 144:EL20. [PMID: 30075685 DOI: 10.1121/1.5045192] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
Recent studies have shown that vowels in infant-directed speech (IDS) are characterized by highly variable formant distributions. The current study investigates whether vowel variability is partially due to consonantal context, and explores whether consonantal context could support the learning of vowel categories from IDS. A computational model is presented which selects contexts based on frequency in the input and generalizes across contextual categories. Improved categorization performance was found on a vowel contrast in American-English IDS. The findings support a view in which the infant's learning mechanism is anchored in context, in order to cope with acoustic variability in the input.
Collapse
Affiliation(s)
- Frans Adriaans
- Utrecht Institute of Linguistics OTS, Utrecht University, Trans 10, 3512 JK Utrecht, the Netherlands
| |
Collapse
|
27
|
Swingley D, Alarcon C. Lexical Learning May Contribute to Phonetic Learning in Infants: A Corpus Analysis of Maternal Spanish. Cogn Sci 2018; 42:10.1111/cogs.12620. [PMID: 29785714 PMCID: PMC7224410 DOI: 10.1111/cogs.12620] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2017] [Revised: 02/06/2018] [Accepted: 02/09/2018] [Indexed: 11/26/2022]
Abstract
In their first year, infants begin to learn the speech sounds of their language. This process is typically modeled as an unsupervised clustering problem in which phonetically similar speech-sound tokens are grouped into phonetic categories by infants using their domain-general inference abilities. We argue here that maternal speech is too phonetically variable for this account to be plausible, and we provide phonetic evidence from Spanish showing that infant-directed Spanish vowels are more readily clustered over word types than over vowel tokens. The results suggest that infants' early adaptation to native-language phonetics depends on their word-form lexicon, implicating a much wider range of potential sources of influence on infants' developmental trajectories in language learning.
Collapse
|
28
|
Han M, de Jong NH, Kager R. Lexical Tones in Mandarin Chinese Infant-Directed Speech: Age-Related Changes in the Second Year of Life. Front Psychol 2018; 9:434. [PMID: 29670555 PMCID: PMC5893784 DOI: 10.3389/fpsyg.2018.00434] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2017] [Accepted: 03/15/2018] [Indexed: 11/13/2022] Open
Abstract
Tonal information is essential to early word learning in tone languages. Although numerous studies have investigated the intonational and segmental properties of infant-directed speech (IDS), only a few studies have explored the properties of lexical tones in IDS. These studies mostly focused on the first year of life; thus little is known about how lexical tones in IDS change as children's vocabulary acquisition accelerates in the second year (Goldfield and Reznick, 1990; Bloom, 2001). The present study examines whether Mandarin Chinese mothers hyperarticulate lexical tones in IDS addressing 18- and 24-month-old children-at which age children are learning words at a rapid speed-vs. adult-directed speech (ADS). Thirty-nine Mandarin Chinese-speaking mothers were tested in a semi-spontaneous picture-book-reading task, in which they told the same story to their child (IDS condition) and to an adult (ADS condition). Results for the F0 measurements (minimum F0, maximum F0, and F0 range) of tone in the speech data revealed a continuum of differences among IDS addressing 18-month-olds, IDS addressing 24-month-olds, and ADS. Lexical tones in IDS addressing 18-month-old children had a higher minimum F0, higher maximum F0, and larger pitch range than lexical tones in ADS. Lexical tones in IDS addressing 24-month-old children showed more similarity to ADS tones with respect to pitch height: there were no differences in minimum F0 and maximum F0 between ADS and IDS. However, F0 range was still larger. These results suggest that lexical tones are generally hyperarticulated in Mandarin Chinese IDS addressing 18- and 24- month-old children despite the change in pitch level over time. Mandarin Chinese mothers hyperarticulate lexical tones in IDS when talking to toddlers and potentially facilitate tone acquisition and word learning.
Collapse
Affiliation(s)
- Mengru Han
- Utrecht Institute of Linguistics (OTS), Utrecht University, Utrecht, Netherlands
| | - Nivja H. de Jong
- Leiden University Centre for Linguistics (LUCL), Leiden University, Leiden, Netherlands
- Leiden University Graduate School of Teaching (ICLON), Leiden University, Leiden, Netherlands
| | - René Kager
- Utrecht Institute of Linguistics (OTS), Utrecht University, Utrecht, Netherlands
| |
Collapse
|
29
|
Kalashnikova M, Carignan C, Burnham D. The origins of babytalk: smiling, teaching or social convergence? ROYAL SOCIETY OPEN SCIENCE 2017; 4:170306. [PMID: 28878980 PMCID: PMC5579095 DOI: 10.1098/rsos.170306] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/31/2017] [Accepted: 06/28/2017] [Indexed: 05/21/2023]
Abstract
When addressing their young infants, parents systematically modify their speech. Such infant-directed speech (IDS) contains exaggerated vowel formants, which have been proposed to foster language development via articulation of more distinct speech sounds. Here, this assumption is rigorously tested using both acoustic and, for the first time, fine-grained articulatory measures. Mothers were recorded speaking to their infant and to another adult, and measures were taken of their acoustic vowel space, their tongue and lip movements and the length of their vocal tract. Results showed that infant- but not adult-directed speech contains acoustically exaggerated vowels, and these are not the product of adjustments to tongue or to lip movements. Rather, they are the product of a shortened vocal tract due to a raised larynx, which can be ascribed to speakers' unconscious effort to appear smaller and more non-threatening to the young infant. This adjustment in IDS may be a vestige of early mother-infant interactions, which had as its primary purpose the transmission of non-aggressiveness and/or a primitive manifestation of pre-linguistic vocal social convergence of the mother to her infant. With the advent of human language, this vestige then acquired a secondary purpose-facilitating language acquisition via the serendipitously exaggerated vowels.
Collapse
Affiliation(s)
- Marina Kalashnikova
- The MARCS Institute for Brain, Behaviour and Development, Western Sydney University, Locked Bag 1957, Penrith 2527, Australia
| | | | | |
Collapse
|