1
|
Liu W, Vicario DS. Dynamic encoding of phonetic categories in zebra finch auditory forebrain. Sci Rep 2023; 13:11172. [PMID: 37430030 DOI: 10.1038/s41598-023-37982-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2023] [Accepted: 06/30/2023] [Indexed: 07/12/2023] Open
Abstract
Vocal communication requires the formation of acoustic categories to enable invariant representations of sounds despite superficial variations. Humans form acoustic categories for speech phonemes, enabling the listener to recognize words independent of speakers; animals can also discriminate speech phonemes. We investigated the neural mechanisms of this process using electrophysiological recordings from the zebra finch secondary auditory area, caudomedial nidopallium (NCM), during passive exposure to human speech stimuli consisting of two naturally spoken words produced by multiple speakers. Analysis of neural distance and decoding accuracy showed improvements in neural discrimination between word categories over the course of exposure, and this improved representation transferred to the same words by novel speakers. We conclude that NCM neurons formed generalized representations of word categories independent of speaker-specific variations that became more refined over the course of passive exposure. The discovery of this dynamic encoding process in NCM suggests a general processing mechanism for forming categorical representations of complex acoustic signals that humans share with other animals.
Collapse
Affiliation(s)
- Wanyi Liu
- Department of Psychology, Rutgers, The State University of New Jersey, Piscataway, NJ, 08854, USA.
| | - David S Vicario
- Department of Psychology, Rutgers, The State University of New Jersey, Piscataway, NJ, 08854, USA.
| |
Collapse
|
2
|
Who is there? Captive western gorillas distinguish human voices based on familiarity and nature of previous interactions. Anim Cogn 2021; 25:217-228. [PMID: 34390429 DOI: 10.1007/s10071-021-01543-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2020] [Revised: 06/28/2021] [Accepted: 07/31/2021] [Indexed: 10/20/2022]
Abstract
The ability to recognize conspecifics by their acoustic signals is of crucial importance to social animals, especially where visibility is limited, because it allows for discrimination between familiar and unfamiliar individuals and facilitates associations with and the avoidance of particular conspecifics. Animals may also benefit from an ability to recognize and use the information coded into the auditory signals of other species. Companion species such as dogs, cats, and horses are able to discriminate between familiar and unfamiliar human voices; however, whether this ability is widespread across vertebrates is still unknown. Using playback experiments, we tested whether western gorillas living at Zoo Atlanta were able to discriminate between the voices of subgroups of people: i.e., unfamiliar individuals, familiar individuals with whom the gorillas had positive interactions, and familiar individuals with whom they had negative interactions. Gorillas responded significantly more often (longer gazing duration, higher gazing frequency, shorter latency, and larger number of distress behaviors) to the voices of unfamiliar and familiar-negative individuals than to those of familiar-positive individuals, indicating that they recognized the voices of subgroup of people based on familiarity and possibly the nature of the relationship with them. Future studies should determine whether this is also the case in the wild, where interspecific associations with humans are less intense than they are in captive settings.
Collapse
|
3
|
Pike CD, Kriengwatana BP. Vocal tract constancy in birds and humans. Behav Processes 2018; 163:99-112. [PMID: 30145277 DOI: 10.1016/j.beproc.2018.08.001] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2017] [Revised: 07/30/2018] [Accepted: 08/10/2018] [Indexed: 12/30/2022]
Abstract
Humans perceive speech as being relatively stable despite acoustic variation caused by vocal tract (VT) differences between speakers. Humans use perceptual 'vocal tract normalisation' (VTN) and other processes to achieve this stability. Similarity in vocal apparatus/acoustics between birds and humans means that birds might also experience VT variation. This has the potential to impede bird communication. No known studies have explicitly examined this, but a number of studies show perceptual stability or 'perceptual constancy' in birds similar to that seen in humans when dealing with VT variation. This review explores similarities between birds and humans and concludes that birds show sufficient evidence of perceptual constancy to warrant further research in this area. Future work should 1) quantify the multiple sources of variation in bird vocalisations, including, but not limited to VT variations, 2) determine whether vocalisations are perniciously disrupted by any of these and 3) investigate how birds reduce variation to maintain perceptual constancy and perceptual efficiency.
Collapse
Affiliation(s)
- Cleopatra Diana Pike
- School of Psychology and Neuroscience, University of St Andrews, St Mary's Quad, South Street, St Andrews, Fife, KY16 9JP, UK.
| | - Buddhamas Pralle Kriengwatana
- School of Psychology and Neuroscience, University of St Andrews, St Mary's Quad, South Street, St Andrews, Fife, KY16 9JP, UK
| |
Collapse
|
4
|
Burgering MA, Ten Cate C, Vroomen J. Mechanisms underlying speech sound discrimination and categorization in humans and zebra finches. Anim Cogn 2018; 21:285-299. [PMID: 29435769 PMCID: PMC5818571 DOI: 10.1007/s10071-018-1165-3] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2017] [Revised: 01/29/2018] [Accepted: 02/01/2018] [Indexed: 11/28/2022]
Abstract
Speech sound categorization in birds seems in many ways comparable to that by humans, but it is unclear what mechanisms underlie such categorization. To examine this, we trained zebra finches and humans to discriminate two pairs of edited speech sounds that varied either along one dimension (vowel or speaker sex) or along two dimensions (vowel and speaker sex). Sounds could be memorized individually or categorized based on one dimension or by integrating or combining both dimensions. Once training was completed, we tested generalization to new speech sounds that were either more extreme, more ambiguous (i.e., close to the category boundary), or within-category intermediate between the trained sounds. Both humans and zebra finches learned the one-dimensional stimulus-response mappings faster than the two-dimensional mappings. Humans performed higher on the trained, extreme and within-category intermediate test-sounds than on the ambiguous ones. Some individual birds also did so, but most performed higher on the trained exemplars than on the extreme, within-category intermediate and ambiguous test-sounds. These results suggest that humans rely on rule learning to form categories and show poor performance when they cannot apply a rule. Birds rely mostly on exemplar-based memory with weak evidence for rule learning.
Collapse
Affiliation(s)
- Merel A Burgering
- Department of Cognitive Neuropsychology, Tilburg University, Warandelaan 2, P.O. Box 90153, 5000 LE, Tilburg, The Netherlands.
- Institute Biology Leiden (IBL) Leiden University, P.O. Box 9505, 2300 RA, Leiden, The Netherlands.
| | - Carel Ten Cate
- Institute Biology Leiden (IBL) Leiden University, P.O. Box 9505, 2300 RA, Leiden, The Netherlands
- Leiden Institute for Brain and Cognition (LIBC), Leiden University, Leiden, The Netherlands
| | - Jean Vroomen
- Department of Cognitive Neuropsychology, Tilburg University, Warandelaan 2, P.O. Box 90153, 5000 LE, Tilburg, The Netherlands
| |
Collapse
|
5
|
Kriengwatana B, Terry J, Chládková K, Escudero P. Speaker and Accent Variation Are Handled Differently: Evidence in Native and Non-Native Listeners. PLoS One 2016; 11:e0156870. [PMID: 27309889 PMCID: PMC4911083 DOI: 10.1371/journal.pone.0156870] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2015] [Accepted: 05/20/2016] [Indexed: 11/22/2022] Open
Abstract
Listeners are able to cope with between-speaker variability in speech that stems from anatomical sources (i.e. individual and sex differences in vocal tract size) and sociolinguistic sources (i.e. accents). We hypothesized that listeners adapt to these two types of variation differently because prior work indicates that adapting to speaker/sex variability may occur pre-lexically while adapting to accent variability may require learning from attention to explicit cues (i.e. feedback). In Experiment 1, we tested our hypothesis by training native Dutch listeners and Australian-English (AusE) listeners without any experience with Dutch or Flemish to discriminate between the Dutch vowels /I/ and /ε/ from a single speaker. We then tested their ability to classify /I/ and /ε/ vowels of a novel Dutch speaker (i.e. speaker or sex change only), or vowels of a novel Flemish speaker (i.e. speaker or sex change plus accent change). We found that both Dutch and AusE listeners could successfully categorize vowels if the change involved a speaker/sex change, but not if the change involved an accent change. When AusE listeners were given feedback on their categorization responses to the novel speaker in Experiment 2, they were able to successfully categorize vowels involving an accent change. These results suggest that adapting to accents may be a two-step process, whereby the first step involves adapting to speaker differences at a pre-lexical level, and the second step involves adapting to accent differences at a contextual level, where listeners have access to word meaning or are given feedback that allows them to appropriately adjust their perceptual category boundaries.
Collapse
Affiliation(s)
- Buddhamas Kriengwatana
- Institute for Biology Leiden, Leiden University, Leiden, the Netherlands
- Department of Psychology, University of Amsterdam, Amsterdam, the Netherlands
| | - Josephine Terry
- The MARCS Institute for Brain, Behaviour, and Development, Western Sydney University, Sydney, Australia
- ARC Centre of Excellence for the Dynamics of Language, Australian National University, Canberra, Australia
| | - Kateřina Chládková
- Amsterdam Center for Language and Communication, Phonetic Sciences, University of Amsterdam, Amsterdam, the Netherlands
- Institute of Psychology, University of Leipzig, Leipzig, Germany
| | - Paola Escudero
- The MARCS Institute for Brain, Behaviour, and Development, Western Sydney University, Sydney, Australia
- ARC Centre of Excellence for the Dynamics of Language, Australian National University, Canberra, Australia
| |
Collapse
|
6
|
Kriengwatana B, Spierings MJ, ten Cate C. Auditory discrimination learning in zebra finches: effects of sex, early life conditions and stimulus characteristics. Anim Behav 2016. [DOI: 10.1016/j.anbehav.2016.03.028] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
|
7
|
Elie JE, Theunissen FE. The vocal repertoire of the domesticated zebra finch: a data-driven approach to decipher the information-bearing acoustic features of communication signals. Anim Cogn 2016; 19:285-315. [PMID: 26581377 PMCID: PMC5973879 DOI: 10.1007/s10071-015-0933-6] [Citation(s) in RCA: 61] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2015] [Revised: 10/19/2015] [Accepted: 10/19/2015] [Indexed: 12/18/2022]
Abstract
Although a universal code for the acoustic features of animal vocal communication calls may not exist, the thorough analysis of the distinctive acoustical features of vocalization categories is important not only to decipher the acoustical code for a specific species but also to understand the evolution of communication signals and the mechanisms used to produce and understand them. Here, we recorded more than 8000 examples of almost all the vocalizations of the domesticated zebra finch, Taeniopygia guttata: vocalizations produced to establish contact, to form and maintain pair bonds, to sound an alarm, to communicate distress or to advertise hunger or aggressive intents. We characterized each vocalization type using complete representations that avoided any a priori assumptions on the acoustic code, as well as classical bioacoustics measures that could provide more intuitive interpretations. We then used these acoustical features to rigorously determine the potential information-bearing acoustical features for each vocalization type using both a novel regularized classifier and an unsupervised clustering algorithm. Vocalization categories are discriminated by the shape of their frequency spectrum and by their pitch saliency (noisy to tonal vocalizations) but not particularly by their fundamental frequency. Notably, the spectral shape of zebra finch vocalizations contains peaks or formants that vary systematically across categories and that would be generated by active control of both the vocal organ (source) and the upper vocal tract (filter).
Collapse
Affiliation(s)
- Julie E Elie
- Department of Psychology and Helen Wills Neuroscience Institute, University of California Berkeley, 3210 Tolman Hall, Berkeley, CA, 94720, USA.
| | - Frédéric E Theunissen
- Department of Psychology and Helen Wills Neuroscience Institute, University of California Berkeley, 3210 Tolman Hall, Berkeley, CA, 94720, USA
| |
Collapse
|