1
|
Terranova F, Baciadonna L, Maccarone C, Isaja V, Gamba M, Favaro L. Penguins perceive variations of source- and filter-related vocal parameters of species-specific vocalisations. Anim Cogn 2023; 26:1613-1622. [PMID: 37401990 PMCID: PMC10442253 DOI: 10.1007/s10071-023-01806-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2023] [Revised: 05/22/2023] [Accepted: 06/27/2023] [Indexed: 07/05/2023]
Abstract
Animal vocalisations encode a wide range of biological information about the age, sex, body size, and social status of the emitter. Moreover, vocalisations play a significant role in signalling the identity of the emitter to conspecifics. Recent studies have shown that, in the African penguin (Spheniscus demersus), acoustic cues to individual identity are encoded in the fundamental frequency (F0) and resonance frequencies (formants) of the vocal tract. However, although penguins are known to produce vocalisations where F0 and formants vary among individuals, it remains to be tested whether the receivers can perceive and use such information in the individual recognition process. In this study, using the Habituation-Dishabituation (HD) paradigm, we tested the hypothesis that penguins perceive and respond to a shift of ± 20% (corresponding to the natural inter-individual variation observed in ex-situ colonies) of F0 and formant dispersion (ΔF) of species-specific calls. We found that penguins were more likely to look rapidly and for longer at the source of the sound when F0 and formants of the calls were manipulated, indicating that they could perceive variations of these parameters in the vocal signals. Our findings provide the first experimental evidence that, in the African penguin, listeners can perceive changes in F0 and formants, which can be used by the receiver as potential cues for the individual discrimination of the emitter.
Collapse
Affiliation(s)
- Francesca Terranova
- Department of Life Sciences and Systems Biology, University of Turin, Turin, Italy
| | - Luigi Baciadonna
- Department of Life Sciences and Systems Biology, University of Turin, Turin, Italy.
| | - Chiara Maccarone
- Department of Life Sciences and Systems Biology, University of Turin, Turin, Italy
| | | | - Marco Gamba
- Department of Life Sciences and Systems Biology, University of Turin, Turin, Italy
| | - Livio Favaro
- Department of Life Sciences and Systems Biology, University of Turin, Turin, Italy.
| |
Collapse
|
2
|
Gammino B, Palacios V, Root-Gutteridge H, Reby D, Gamba M. Grey wolves (Canis lupus) discriminate between familiar and unfamiliar human voices. Anim Cogn 2023; 26:1589-1600. [PMID: 37338632 DOI: 10.1007/s10071-023-01796-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2022] [Revised: 05/20/2023] [Accepted: 05/24/2023] [Indexed: 06/21/2023]
Abstract
The ability to discriminate between different individuals based on identity cues, which is important to support the social behaviour of many animal species, has mostly been investigated in conspecific contexts. A rare example of individual heterospecific discrimination is found in domestic dogs, who are capable of recognising their owners' voices. Here, we test whether grey wolves, the nearest wild relative of dogs, also have the ability to distinguish familiar human voices, which would indicate that dogs' ability is not a consequence of domestication. Using the habituation-dishabituation paradigm, we presented captive wolves with playback recordings of their keepers' and strangers' voices producing either familiar or unfamiliar phrases. The duration of their response was significantly longer when presented with keepers' voices than with strangers' voices, demonstrating that wolves discriminated between familiar and unfamiliar speakers. This suggests that dogs' ability to discriminate between human voices was probably present in their common ancestor and may support the idea that this is a general ability of vertebrates to recognise heterospecific individuals. Our study also provides further evidence for familiar voice discrimination in a wild animal in captivity, indicating that this ability may be widespread across vertebrate species.
Collapse
Affiliation(s)
- Beatrice Gammino
- Department of Life Sciences and Systems Biology, University of Turin, Turin, Italy.
| | - Vicente Palacios
- Cavanilles Institute of Biodiversity and Evolutionary Biology, University of Valencia, Valencia, Spain
- ARCA, People and Nature, SL, Oviedo, Spain
| | - Holly Root-Gutteridge
- University of Sussex, Brighton, Sussex, UK
- Animal Behaviour, Cognition and Welfare Group, University of Lincoln, Lincoln, UK
| | - David Reby
- Equipe de Neuro-Ethologie Sensorielle ENES / CRNL, CNRS, Inserm, University of Saint-Etienne, Saint-Etienne, France
| | - Marco Gamba
- Department of Life Sciences and Systems Biology, University of Turin, Turin, Italy
| |
Collapse
|
3
|
The acoustic bases of human voice identity processing in dogs. Anim Cogn 2022; 25:905-916. [PMID: 35142977 PMCID: PMC9334438 DOI: 10.1007/s10071-022-01601-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2021] [Revised: 01/03/2022] [Accepted: 01/19/2022] [Indexed: 11/22/2022]
Abstract
Speech carries identity-diagnostic acoustic cues that help individuals recognize each other during vocal–social interactions. In humans, fundamental frequency, formant dispersion and harmonics-to-noise ratio serve as characteristics along which speakers can be reliably separated. The ability to infer a speaker’s identity is also adaptive for members of other species (like companion animals) for whom humans (as owners) are relevant. The acoustic bases of speaker recognition in non-humans are unknown. Here, we tested whether dogs can recognize their owner’s voice and whether they rely on the same acoustic parameters for such recognition as humans use to discriminate speakers. Stimuli were pre-recorded sentences spoken by the owner and control persons, played through loudspeakers placed behind two non-transparent screens (with each screen hiding a person). We investigated the association between acoustic distance of speakers (examined along several dimensions relevant in intraspecific voice identification) and dogs’ behavior. Dogs chose their owner’s voice more often than that of control persons’, suggesting that they can identify it. Choosing success and time spent looking in the direction of the owner’s voice were positively associated, showing that looking time is an index of the ease of choice. Acoustic distance of speakers in mean fundamental frequency and jitter were positively associated with looking time, indicating that the shorter the acoustic distance between speakers with regard to these parameters, the harder the decision. So, dogs use these cues to discriminate their owner’s voice from unfamiliar voices. These findings reveal that dogs use some but probably not all acoustic parameters that humans use to identify speakers. Although dogs can detect fine changes in speech, their perceptual system may not be fully attuned to identity-diagnostic cues in the human voice.
Collapse
|
4
|
Cuaya LV, Hernández-Pérez R, Boros M, Deme A, Andics A. Speech naturalness detection and language representation in the dog brain. Neuroimage 2021; 248:118811. [PMID: 34906714 DOI: 10.1016/j.neuroimage.2021.118811] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2021] [Revised: 12/08/2021] [Accepted: 12/10/2021] [Indexed: 12/11/2022] Open
Abstract
Family dogs are exposed to a continuous flow of human speech throughout their lives. However, the extent of their abilities in speech perception is unknown. Here, we used functional magnetic resonance imaging (fMRI) to test speech detection and language representation in the dog brain. Dogs (n = 18) listened to natural speech and scrambled speech in a familiar and an unfamiliar language. Speech scrambling distorts auditory regularities specific to speech and to a given language, but keeps spectral voice cues intact. We hypothesized that if dogs can extract auditory regularities of speech, and of a familiar language, then there will be distinct patterns of brain activity for natural speech vs. scrambled speech, and also for familiar vs. unfamiliar language. Using multivoxel pattern analysis (MVPA) we found that bilateral auditory cortical regions represented natural speech and scrambled speech differently; with a better classifier performance in longer-headed dogs in a right auditory region. This neural capacity for speech detection was not based on preferential processing for speech but rather on sensitivity to sound naturalness. Furthermore, in case of natural speech, distinct activity patterns were found for the two languages in the secondary auditory cortex and in the precruciate gyrus; with a greater difference in responses to the familiar and unfamiliar languages in older dogs, indicating a role for the amount of language exposure. No regions represented differently the scrambled versions of the two languages, suggesting that the activity difference between languages in natural speech reflected sensitivity to language-specific regularities rather than to spectral voice cues. These findings suggest that separate cortical regions support speech naturalness detection and language representation in the dog brain.
Collapse
Affiliation(s)
- Laura V Cuaya
- Department of Ethology, Institute of Biology, Eötvös Loránd University, Budapest, Hungary; MTA-ELTE 'Lendület' Neuroethology of Communication Research Group, Hungarian Academy of Sciences - Eötvös Loránd University, Budapest, Hungary.
| | - Raúl Hernández-Pérez
- Department of Ethology, Institute of Biology, Eötvös Loránd University, Budapest, Hungary; MTA-ELTE 'Lendület' Neuroethology of Communication Research Group, Hungarian Academy of Sciences - Eötvös Loránd University, Budapest, Hungary
| | - Marianna Boros
- Department of Ethology, Institute of Biology, Eötvös Loránd University, Budapest, Hungary; MTA-ELTE 'Lendület' Neuroethology of Communication Research Group, Hungarian Academy of Sciences - Eötvös Loránd University, Budapest, Hungary
| | - Andrea Deme
- Department of Applied Linguistics and Phonetics, Faculty of Humanities, Eötvös Loránd University, Budapest, Hungary; MTA-ELTE 'Lendület' Lingual Articulation Research Group, Budapest, Hungary
| | - Attila Andics
- Department of Ethology, Institute of Biology, Eötvös Loránd University, Budapest, Hungary; MTA-ELTE 'Lendület' Neuroethology of Communication Research Group, Hungarian Academy of Sciences - Eötvös Loránd University, Budapest, Hungary
| |
Collapse
|
5
|
Domestic dogs (Canis lupus familiaris) are sensitive to the correlation between pitch and timbre in human speech. Anim Cogn 2021; 25:545-554. [PMID: 34714438 PMCID: PMC9107418 DOI: 10.1007/s10071-021-01567-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2021] [Revised: 09/14/2021] [Accepted: 10/15/2021] [Indexed: 12/01/2022]
Abstract
The perceived pitch of human voices is highly correlated with the fundamental frequency (f0) of the laryngeal source, which is determined largely by the length and mass of the vocal folds. The vocal folds are larger in adult males than in adult females, and men’s voices consequently have a lower pitch than women’s. The length of the supralaryngeal vocal tract (vocal-tract length; VTL) affects the resonant frequencies (formants) of speech which characterize the timbre of the voice. Men’s longer vocal tracts produce lower frequency, and less dispersed, formants than women’s shorter vocal tracts. Pitch and timbre combine to influence the perception of speaker characteristics such as size and age. Together, they can be used to categorize speaker sex with almost perfect accuracy. While it is known that domestic dogs can match a voice to a person of the same sex, there has been no investigation into whether dogs are sensitive to the correlation between pitch and timbre. We recorded a female voice giving three commands (‘Sit’, ‘Lay down’, ‘Come here’), and manipulated the recordings to lower the fundamental frequency (thus lowering pitch), increase simulated VTL (hence affecting timbre), or both (synthesized adult male voice). Dogs responded to the original adult female and synthesized adult male voices equivalently. Their tendency to obey the commands was, however, reduced when either pitch or timbre was manipulated alone. These results suggest that dogs are sensitive to both the pitch and timbre of human voices, and that they learn about the natural covariation of these perceptual attributes.
Collapse
|
6
|
Root-Gutteridge H, Brown LP, Forman J, Korzeniowska AT, Simner J, Reby D. Using a new video rating tool to crowd-source analysis of behavioural reaction to stimuli. Anim Cogn 2021; 24:947-956. [PMID: 33751273 PMCID: PMC8360862 DOI: 10.1007/s10071-021-01490-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2020] [Revised: 02/08/2021] [Accepted: 02/10/2021] [Indexed: 12/31/2022]
Abstract
Quantifying the intensity of animals' reaction to stimuli is notoriously difficult as classic unidimensional measures of responses such as latency or duration of looking can fail to capture the overall strength of behavioural responses. More holistic rating can be useful but have the inherent risks of subjective bias and lack of repeatability. Here, we explored whether crowdsourcing could be used to efficiently and reliably overcome these potential flaws. A total of 396 participants watched online videos of dogs reacting to auditory stimuli and provided 23,248 ratings of the strength of the dogs' responses from zero (default) to 100 using an online survey form. We found that raters achieved very high inter-rater reliability across multiple datasets (although their responses were affected by their sex, age, and attitude towards animals) and that as few as 10 raters could be used to achieve a reliable result. A linear mixed model applied to PCA components of behaviours discovered that the dogs' facial expressions and head orientation influenced the strength of behaviour ratings the most. Further linear mixed models showed that that strength of behaviour ratings was moderately correlated to the duration of dogs' reactions but not to dogs' reaction latency (from the stimulus onset). This suggests that observers' ratings captured consistent dimensions of animals' responses that are not fully represented by more classic unidimensional metrics. Finally, we report that overall participants strongly enjoyed the experience. Thus, we suggest that using crowdsourcing can offer a useful, repeatable tool to assess behavioural intensity in experimental or observational studies where unidimensional coding may miss nuance, or where coding multiple dimensions may be too time-consuming.
Collapse
Affiliation(s)
- Holly Root-Gutteridge
- Mammal Vocal Communication and Cognition Research Group, School of Psychology, University of Sussex, Brighton, BN1 9RH, UK.
- School of Life Sciences, Joseph Banks Laboratories, University of Lincoln, Beevor Street, Lincoln, LN6 7DL, UK.
| | - Louise P Brown
- Mammal Vocal Communication and Cognition Research Group, School of Psychology, University of Sussex, Brighton, BN1 9RH, UK
| | - Jemma Forman
- Mammal Vocal Communication and Cognition Research Group, School of Psychology, University of Sussex, Brighton, BN1 9RH, UK
| | - Anna T Korzeniowska
- Mammal Vocal Communication and Cognition Research Group, School of Psychology, University of Sussex, Brighton, BN1 9RH, UK
| | - Julia Simner
- MULTISENSE Lab, School of Psychology, University of Sussex, Brighton, BN1 9RH, UK
| | - David Reby
- Mammal Vocal Communication and Cognition Research Group, School of Psychology, University of Sussex, Brighton, BN1 9RH, UK
- Equipe Neuro-Ethologie Sensorielle, ENES, CRNL, CNRS UMR5292, INSERM UMR_S 1028, University of Lyon, Saint-Etienne, France
| |
Collapse
|
7
|
Carlson NV, Kelly EM, Couzin I. Individual vocal recognition across taxa: a review of the literature and a look into the future. Philos Trans R Soc Lond B Biol Sci 2020; 375:20190479. [PMID: 32420840 PMCID: PMC7331019 DOI: 10.1098/rstb.2019.0479] [Citation(s) in RCA: 36] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/12/2020] [Indexed: 11/12/2022] Open
Abstract
Individual vocal recognition (IVR) has been well studied in mammals and birds. These studies have primarily delved into understanding IVR in specific limited contexts (e.g. parent-offspring and mate recognition) where individuals discriminate one individual from all others. However, little research has examined IVR in more socially demanding circumstances, such as when an individual discriminates all individuals in their social or familial group apart. In this review, we describe what IVR is and suggest splitting studies of IVR into two general types based on what questions they answer (IVR-singular, and IVR-multiple). We explain how we currently test for IVR, and many of the benefits and drawbacks of different methods. We address why IVR is so prevalent in the animal kingdom, and the circumstances in which it is often found. Finally, we explain current weaknesses in IVR research including temporality, specificity, and taxonomic bias, and testing paradigms, and provide some solutions to address these weaknesses. This article is part of the theme issue 'Signal detection theory in recognition systems: from evolving models to experimental tests'.
Collapse
Affiliation(s)
- Nora V. Carlson
- Department of Collective Behaviour, Max Planck Institute of Animal Behavior, D-78457 Konstanz, Germany
- Centre for the Advanced Study of Collective Behaviour, University of Konstanz, Universitätsstraße 10, D-78457 Konstanz, Germany
- Department of Biology, University of Konstanz, Universitätsstraße 10, D-78457 Konstanz, Germany
| | - E. McKenna Kelly
- Department of Collective Behaviour, Max Planck Institute of Animal Behavior, D-78457 Konstanz, Germany
- Centre for the Advanced Study of Collective Behaviour, University of Konstanz, Universitätsstraße 10, D-78457 Konstanz, Germany
- Department of Biology, University of Konstanz, Universitätsstraße 10, D-78457 Konstanz, Germany
| | - Iain Couzin
- Department of Collective Behaviour, Max Planck Institute of Animal Behavior, D-78457 Konstanz, Germany
- Centre for the Advanced Study of Collective Behaviour, University of Konstanz, Universitätsstraße 10, D-78457 Konstanz, Germany
- Department of Biology, University of Konstanz, Universitätsstraße 10, D-78457 Konstanz, Germany
| |
Collapse
|
8
|
Multimodal signaling in the visuo-acoustic mismatch paradigm: similarities between dogs and children in the communicative approach. Anim Cogn 2020; 23:833-841. [PMID: 32451634 DOI: 10.1007/s10071-020-01398-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2020] [Revised: 05/18/2020] [Accepted: 05/20/2020] [Indexed: 10/24/2022]
Abstract
In this review, we have analyzed the studies on the "mismatch paradigm" or "contrasting paradigm", in which the word indicates an intent that is opposite to the gesture in dogs and children. The studies on children highlighted the importance of the type of gestural messages that, when delivered in a non-ostensive manner, assume less value than the verbal indication; whereas, when more emphasis is given to the gestures, it produces opposite results. Word-trained dogs appear to rely more on words, but in the absence of such specific training, dogs rely more on gestures either in transitive or intransitive actions. Moreover, gestural communication appears easier to generalize, since dogs respond equally well to the gestural messages of familiar persons and strangers, whereas their performance lowers when a stranger provides a vocal message. Visual signals trigger faster responses than auditory signals, whereas verbal indications can at most equal the gestural latencies, but never overcome them. Female dogs appeared to be more proficient in the interpretation of gestural commands, while males performed better in the case of verbal commands. Based on a PRISMA analyses from the Web of Science database, three papers on children and four on dogs were retrieved. Our analyses revealed that gestures are more reliable reference points than words for dogs and children. Future studies should focus on choices related to objects of different values for the subjects. Moreover, the choices of dogs should be compared using known and unknown objects, which might help clarify how familiarity with the objects could differently influence their responses.
Collapse
|
9
|
Corballis MC. Crossing the Rubicon: Behaviorism, Language, and Evolutionary Continuity. Front Psychol 2020; 11:653. [PMID: 32373020 PMCID: PMC7186390 DOI: 10.3389/fpsyg.2020.00653] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2020] [Accepted: 03/18/2020] [Indexed: 01/09/2023] Open
Abstract
Euan Macphail's work and ideas captured a pivotal time in the late 20th century when behavioral laws were considered to apply equally across vertebrates, implying equal intelligence, but it was also a time when behaviorism was challenged by the view that language was unique to humans, and bestowed a superior mental status. Subsequent work suggests greater continuity between humans and their forebears, challenging the Chomskyan assumption that language evolved in a single step ("the great leap forward") in humans. Language is now understood to be based on an amalgam of cognitive functions, including mental time travel, theory of mind, and what may be more broadly defined as imagination. These functions probably evolved gradually in hominin evolution and are present in varying degrees in non-human species. The blending of language into cognition provides for both interspecies differences in mental function, and continuity between humans and other species. What does seem to be special to humans is the ability to communicate the contents of imagination, although even this is not absolute, and is perhaps less adaptive than we like to think.
Collapse
Affiliation(s)
- Michael C. Corballis
- Faculty of Science, School of Psychology, The University of Auckland, Auckland, New Zealand
| |
Collapse
|
10
|
Root-Gutteridge H, Ratcliffe VF, Korzeniowska AT, Reby D. Dogs perceive and spontaneously normalize formant-related speaker and vowel differences in human speech sounds. Biol Lett 2019; 15:20190555. [PMID: 31795850 DOI: 10.1098/rsbl.2019.0555] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Domesticated animals have been shown to recognize basic phonemic information from human speech sounds and to recognize familiar speakers from their voices. However, whether animals can spontaneously identify words across unfamiliar speakers (speaker normalization) or spontaneously discriminate between unfamiliar speakers across words remains to be investigated. Here, we assessed these abilities in domestic dogs using the habituation-dishabituation paradigm. We found that while dogs habituated to the presentation of a series of different short words from the same unfamiliar speaker, they significantly dishabituated to the presentation of a novel word from a new speaker of the same gender. This suggests that dogs spontaneously categorized the initial speaker across different words. Conversely, dogs who habituated to the same short word produced by different speakers of the same gender significantly dishabituated to a novel word, suggesting that they had spontaneously categorized the word across different speakers. Our results indicate that the ability to spontaneously recognize both the same phonemes across different speakers, and cues to identity across speech utterances from unfamiliar speakers, is present in domestic dogs and thus not a uniquely human trait.
Collapse
Affiliation(s)
- Holly Root-Gutteridge
- Mammal Vocal Communication and Cognition Research Group, School of Psychology, University of Sussex, Brighton BN1 9QH, UK
| | | | - Anna T Korzeniowska
- Mammal Vocal Communication and Cognition Research Group, School of Psychology, University of Sussex, Brighton BN1 9QH, UK
| | - David Reby
- Mammal Vocal Communication and Cognition Research Group, School of Psychology, University of Sussex, Brighton BN1 9QH, UK.,Equipe de Neuro-Ethologie Sensorielle ENES/CRNL, University of Lyon/Saint-Etienne, CNRS UMR5292, INSERM UMR_S 1028, Saint-Etienne, France
| |
Collapse
|