1
|
Sharma P, Gero S, Payne R, Gruber DF, Rus D, Torralba A, Andreas J. Contextual and combinatorial structure in sperm whale vocalisations. Nat Commun 2024; 15:3617. [PMID: 38714699 PMCID: PMC11076547 DOI: 10.1038/s41467-024-47221-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2024] [Accepted: 03/25/2024] [Indexed: 05/10/2024] Open
Abstract
Sperm whales (Physeter macrocephalus) are highly social mammals that communicate using sequences of clicks called codas. While a subset of codas have been shown to encode information about caller identity, almost everything else about the sperm whale communication system, including its structure and information-carrying capacity, remains unknown. We show that codas exhibit contextual and combinatorial structure. First, we report previously undescribed features of codas that are sensitive to the conversational context in which they occur, and systematically controlled and imitated across whales. We call these rubato and ornamentation. Second, we show that codas form a combinatorial coding system in which rubato and ornamentation combine with two context-independent features we call rhythm and tempo to produce a large inventory of distinguishable codas. Sperm whale vocalisations are more expressive and structured than previously believed, and built from a repertoire comprising nearly an order of magnitude more distinguishable codas. These results show context-sensitive and combinatorial vocalisation can appear in organisms with divergent evolutionary lineage and vocal apparatus.
Collapse
Grants
- This analysis was funded by Project CETI via grants from Dalio Philanthropies and Ocean X; Sea Grape Foundation; Virgin Unite, Rosamund Zander/Hansjorg Wyss, Chris Anderson/Jacqueline Novogratz through The Audacious Project: a collaborative funding initiative housed at TED to PS, SG, RP, DFG, DR, AT and JA. Further funding was provided by the J.H.\ and E.V.\ Wade Fund at MIT. Fieldwork for The Dominica Sperm Whale Project was supported by through a FNU fellowship for the Danish Council for Independent Research supplemented by a Sapere Aude Research Talent Award (1325-00047A), a Carlsberg Foundation expedition grant (CF14-0789), two Explorer Grants from the National Geographic Society (WW-218R-17 and NGS-64863R-19), a grant from Focused on Nature, and supplementary grants from the Arizona Center for Nature Conservation, Quarters For Conservation, the Dansk Akustisks Selskab, Oticon Foundation, and the Dansk Tennis Fond all to SG. Further funding was provided by a Discovery and Equipment grants from the Natural Sciences and Engineering Research Council of Canada (NSERC) to Hal Whitehead of Dalhousie University and a FNU large frame grant and a Villum Foundation Grant (13273) to Peter Madsen of Aarhus University.
Collapse
Affiliation(s)
- Pratyusha Sharma
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA, USA
- Project CETI, New York, NY, USA
| | - Shane Gero
- Project CETI, New York, NY, USA
- Department of Biology, Carleton University, 1125 Colonel By Drive, Ottawa, ON, K1S 5B6, Canada
- The Dominica Sperm Whale Project, Roseau, Dominica
| | | | - David F Gruber
- Project CETI, New York, NY, USA
- Baruch College and The Graduate Center, City University of New York, New York, NY, USA
| | - Daniela Rus
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA, USA.
- Project CETI, New York, NY, USA.
| | - Antonio Torralba
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA, USA.
- Project CETI, New York, NY, USA.
| | - Jacob Andreas
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA, USA.
- Project CETI, New York, NY, USA.
| |
Collapse
|
2
|
Hickok G, Venezia J, Teghipco A. Beyond Broca: neural architecture and evolution of a dual motor speech coordination system. Brain 2023; 146:1775-1790. [PMID: 36746488 PMCID: PMC10411947 DOI: 10.1093/brain/awac454] [Citation(s) in RCA: 18] [Impact Index Per Article: 18.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2022] [Revised: 11/04/2022] [Accepted: 11/19/2022] [Indexed: 02/08/2023] Open
Abstract
Classical neural architecture models of speech production propose a single system centred on Broca's area coordinating all the vocal articulators from lips to larynx. Modern evidence has challenged both the idea that Broca's area is involved in motor speech coordination and that there is only one coordination network. Drawing on a wide range of evidence, here we propose a dual speech coordination model in which laryngeal control of pitch-related aspects of prosody and song are coordinated by a hierarchically organized dorsolateral system while supralaryngeal articulation at the phonetic/syllabic level is coordinated by a more ventral system posterior to Broca's area. We argue further that these two speech production subsystems have distinguishable evolutionary histories and discuss the implications for models of language evolution.
Collapse
Affiliation(s)
- Gregory Hickok
- Department of Cognitive Sciences, University of California, Irvine, CA 92697, USA
- Department of Language Science, University of California, Irvine, CA 92697, USA
| | - Jonathan Venezia
- Auditory Research Laboratory, VA Loma Linda Healthcare System, Loma Linda, CA 92357, USA
- Department of Otolaryngology—Head and Neck Surgery, Loma Linda University School of Medicine, Loma Linda, CA 92350, USA
| | - Alex Teghipco
- Department of Psychology, University of South Carolina, Columbia, SC 29208, USA
| |
Collapse
|
3
|
Berthet M, Coye C, Dezecache G, Kuhn J. Animal linguistics: a primer. Biol Rev Camb Philos Soc 2023; 98:81-98. [PMID: 36189714 PMCID: PMC10091714 DOI: 10.1111/brv.12897] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2022] [Revised: 08/10/2022] [Accepted: 08/12/2022] [Indexed: 01/12/2023]
Abstract
The evolution of language has been investigated by several research communities, including biologists and linguists, striving to highlight similar linguistic capacities across species. To date, however, no consensus exists on the linguistic capacities of non-human species. Major controversies remain on the use of linguistic terminology, analysis methods and behavioural data collection. The field of 'animal linguistics' has emerged to overcome these difficulties and attempt to reach uniform methods and terminology. This primer is a tutorial review of 'animal linguistics'. It describes the linguistic concepts of semantics, pragmatics and syntax, and proposes minimal criteria to be fulfilled to claim that a given species displays a particular linguistic capacity. Second, it reviews relevant methods successfully applied to the study of communication in animals and proposes a list of useful references to detect and overcome major pitfalls commonly observed in the collection of animal behaviour data. This primer represents a step towards mutual understanding and fruitful collaborations between linguists and biologists.
Collapse
Affiliation(s)
- Mélissa Berthet
- Institut Jean Nicod, Département d'études cognitives, ENS, EHESS, CNRS, PSL University, 75005, Paris, France.,Center for the Interdisciplinary Study of Language Evolution, University of Zürich, Affolternstrasse 56, 8050, Zurich, Switzerland.,Department of Comparative Language Science, University of Zürich, Affolternstrasse 56, 8050, Zurich, Switzerland
| | - Camille Coye
- Institut Jean Nicod, Département d'études cognitives, ENS, EHESS, CNRS, PSL University, 75005, Paris, France.,Center for Ecology and Conservation, Bioscience Department, University of Exeter, Penryn Campus, Penryn, TR10 9FE, UK
| | | | - Jeremy Kuhn
- Institut Jean Nicod, Département d'études cognitives, ENS, EHESS, CNRS, PSL University, 75005, Paris, France
| |
Collapse
|
4
|
Cognitive control of song production by humpback whales. Anim Cogn 2022; 25:1133-1149. [PMID: 36058997 DOI: 10.1007/s10071-022-01675-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2022] [Revised: 08/04/2022] [Accepted: 08/12/2022] [Indexed: 11/01/2022]
Abstract
Singing humpback whales are highly versatile vocalizers, producing complex sequences of sounds that they vary throughout adulthood. Past analyses of humpback whale song have emphasized yearly variations in structural features of songs made collectively by singers within a population with comparatively little attention given to the ways that individual singers vary consecutive songs. As a result, many researchers describe singing by humpback whales as a process in which singers produce sequences of repeating sound patterns. Here, we show that such characterizations misrepresent the degree to which humpback whales flexibly and dynamically control the production of sounds and sound patterns within song sessions. Singers recorded off the coast of Hawaii continuously morphed units along multiple acoustic dimensions, with the degree and direction of morphing varying across parallel streams of successive units. Individual singers also produced multiple phrase variants (structurally similar, but acoustically distinctive sequences) within song sessions. The precision with which individual singers maintained some acoustic properties of phrases and morphing trajectories while flexibly changing others suggests that singing humpback whales actively select and adjust acoustic elements of their songs in real time rather than simply repeating stereotyped sound patterns within song sessions.
Collapse
|
5
|
He L. Characterizing first and second language rhythm in English using spectral coherence between temporal envelope and mouth opening-closing movements. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2022; 152:567. [PMID: 35931510 DOI: 10.1121/10.0012694] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/29/2021] [Accepted: 06/29/2022] [Indexed: 06/15/2023]
Abstract
This study investigated the rhythmic differences between first and second language English from 19 native speakers of American English and an equal number of native speakers of Mandarin. Speech rhythm was viewed from MacNeilage's frame/content theory. The spectral coherence between the temporal envelope and the mouth opening and closing kinematics was computed to operationalize the rhythmic frame. The spectral centroid, spread, rolloff, flatness, and entropy were calculated to reveal the frequency distribution patterns in the coherence. Using a binary logistic regression model, these measures were collectively found to be effective in characterizing rhythmic differences between native and non-native groups (A' = 0.71 and B″D = -0.06). Specifically, the native group was significantly higher than the non-native group in terms of spectral centroid and spread, whereas the native group was significantly lower than its non-native counterpart in terms of spectral flatness and entropy. Both groups were not significantly different in spectral rolloff. Possible explanations for the result as well as the efficacy of employing the aforesaid coherence in speech rhythm research in general were discussed.
Collapse
Affiliation(s)
- Lei He
- Department of Computational Linguistics, University of Zurich (UZH), Zurich-Oerlikon 8050, Switzerland
| |
Collapse
|
6
|
Ter Haar SM, Fernandez AA, Gratier M, Knörnschild M, Levelt C, Moore RK, Vellema M, Wang X, Oller DK. Cross-species parallels in babbling: animals and algorithms. Philos Trans R Soc Lond B Biol Sci 2021; 376:20200239. [PMID: 34482727 PMCID: PMC8419573 DOI: 10.1098/rstb.2020.0239] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
A key feature of vocal ontogeny in a variety of taxa with extensive vocal repertoires is a developmental pattern in which vocal exploration is followed by a period of category formation that results in a mature species-specific repertoire. Vocal development preceding the adult repertoire is often called ‘babbling’, a term used to describe aspects of vocal development in species of vocal-learning birds, some marine mammals, some New World monkeys, some bats and humans. The paper summarizes the results of research on babbling in examples from five taxa and proposes a unifying definition facilitating their comparison. There are notable similarities across these species in the developmental pattern of vocalizations, suggesting that vocal production learning might require babbling. However, the current state of the literature is insufficient to confirm this suggestion. We suggest directions for future research to elucidate this issue, emphasizing the importance of (i) expanding the descriptive data and seeking species with complex mature repertoires where babbling may not occur or may occur only to a minimal extent; (ii) (quasi-)experimental research to tease apart possible mechanisms of acquisition and/or self-organizing development; and (iii) computational modelling as a methodology to test hypotheses about the origins and functions of babbling. This article is part of the theme issue ‘Vocal learning in animals and humans’.
Collapse
Affiliation(s)
- Sita M Ter Haar
- Cognitive Neurobiology and Helmholtz Institute, Department of Psychology, Utrecht University, PO Box 80086, 3508 TB Utrecht, The Netherlands
| | - Ahana A Fernandez
- Museum für Naturkunde - Leibniz Institute for Evolution and Biodiversity Science, Invalidenstrasse 43, 10115 Berlin, Germany
| | - Maya Gratier
- Laboratoire Ethologie, Cognition, Développement, Paris Nanterre University, Nanterre, France
| | - Mirjam Knörnschild
- Museum für Naturkunde - Leibniz Institute for Evolution and Biodiversity Science, Invalidenstrasse 43, 10115 Berlin, Germany.,Animal Behavior Lab, Freie Universität, Berlin, Germany.,Smithsonian Tropical Research Institute, Balboa, Ancón, Panama
| | - Claartje Levelt
- Leiden University Centre for Linguistics, Leiden University, Leiden, The Netherlands.,Leiden Institute for Brain and Cognition, Leiden University, Leiden, The Netherlands
| | - Roger K Moore
- Department Computer Science, University of Sheffield, Sheffield, UK
| | - Michiel Vellema
- Cognitive Neurobiology and Helmholtz Institute, Department of Psychology, Utrecht University, PO Box 80086, 3508 TB Utrecht, The Netherlands
| | - Xiaoqin Wang
- Laboratory of Auditory Neurophysiology, Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - D Kimbrough Oller
- School of Communication Sciences and Disorders, University of Memphis, Memphis, TN, USA.,Institute for Intelligent Systems, University of Memphis, Memphis, TN, USA.,Konrad Lorenz Institute for Evolution and Cognition Research, Klosterneuburg, Austria
| |
Collapse
|
7
|
Beeck VC, Heilmann G, Kerscher M, Stoeger AS. A novel theory of Asian elephant high-frequency squeak production. BMC Biol 2021; 19:121. [PMID: 34134675 PMCID: PMC8210382 DOI: 10.1186/s12915-021-01026-z] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2020] [Accepted: 04/14/2021] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Anatomical and cognitive adaptations to overcome morpho-mechanical limitations of laryngeal sound production, where body size and the related vocal apparatus dimensions determine the fundamental frequency, increase vocal diversity across taxa. Elephants flexibly use laryngeal and trunk-based vocalizations to form a repertoire ranging from infrasonic rumbles to higher-pitched trumpets. Moreover, they are among the few evolutionarily distantly related animals (humans, pinnipeds, cetaceans, birds) capable of imitating species-atypical sounds. Yet, their vocal plasticity has so far not been related to functions within their natural communicative system, in part because not all call types have been systematically studied. Here, we reveal how Asian elephants (Elephas maximus) produce species-specific squeaks (F0 300-2300 Hz) by using acoustic camera recordings to visualize sound emission and examining this alongside acoustic, behavioral, and morphological data across seven captive groups. RESULTS We found that squeaks were emitted through the closed mouth in synchrony with cheek depression and retraction of the labial angles. The simultaneous emission of squeaks with nasal snorts (biphonation) in one individual confirmed that squeak production was independent of nasal passage involvement and this implicated oral sound production. The squeaks' spectral structure is incongruent with laryngeal sound production and aerodynamic whistles, pointing to tissue vibration as the sound source. Anatomical considerations suggest that the longitudinal closed lips function as the vibrators. Acoustic and temporal parameters exhibit high intra- and inter-individual variability that enables individual but no call-subtype classification. Only 19 of 56 study subjects were recorded to squeak, mostly during alarming contexts and social arousal but some also on command. CONCLUSION Our results strongly suggest that Asian elephants force air from the small oral cavity through the tensed lips, inducing self-sustained lip vibration. Besides human brass players, lip buzzing is not described elsewhere in the animal kingdom. Given the complexity of the proposed mechanism, the surprising absence of squeaking in most of the unrelated subjects and the indication for volitional control, we hypothesize that squeak production involves social learning. Our study offers new insights into how vocal and cognitive flexibility enables mammals to overcome size-related limitations of laryngeal sound production. This flexibility enables Asian elephants to exploit a frequency range spanning seven octaves within their communicative system.
Collapse
Affiliation(s)
- Veronika C Beeck
- Department of Behavioural and Cognitive Biology, Mammal Communication Lab, University of Vienna, Vienna, Austria.
| | | | | | - Angela S Stoeger
- Department of Behavioural and Cognitive Biology, Mammal Communication Lab, University of Vienna, Vienna, Austria.
| |
Collapse
|
8
|
Ordin M. Speech rhythm as naturally occurring and culturally transmitted behavioral patterns. Ann N Y Acad Sci 2019; 1453:5-11. [PMID: 31502260 DOI: 10.1111/nyas.14234] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2019] [Revised: 08/19/2019] [Accepted: 08/27/2019] [Indexed: 11/29/2022]
Abstract
Rhythm is fundamental to every motor activity. Neural and physiological mechanisms that underlie rhythmic cognition, in general, and rhythmic pattern generation, in particular, are evolutionarily ancient. As speech production is a kind of motor activity, investigating speech rhythm can provide insight into how general motor patterns have been adapted for more specific use in articulation and speech production. Studies on speech rhythm may further provide insight into the development of speech capacity in humans. As speech capacity is putatively a prerequisite for developing a language faculty, studies on speech rhythm may cast some light on the mystery of language evolution in the human genus. Hereby, we propose an approach to exploring speech rhythm as a window on speech emergence in ontogenesis and phylogenesis, as well as on diachronic linguistic changes.
Collapse
Affiliation(s)
- Mikhail Ordin
- Basque Centre on Cognition, Brain and Language (BCBL) and Ikerbasque - Basque Foundation for Science
| |
Collapse
|