1
Burchardt LS, van de Sande Y, Kehy M, Gamba M, Ravignani A, Pouw W. A toolkit for the dynamic study of air sacs in siamang and other elastic circular structures. PLoS Comput Biol 2024; 20:e1012222. [PMID: 38913743 PMCID: PMC11226135 DOI: 10.1371/journal.pcbi.1012222] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/16/2023] [Revised: 07/05/2024] [Accepted: 06/03/2024] [Indexed: 06/26/2024] Open
Abstract
Biological structures are defined by rigid elements, such as bones, and elastic elements, like muscles and membranes. Computer vision advances have enabled automatic tracking of moving animal skeletal poses. Such developments provide insights into complex time-varying dynamics of biological motion. Conversely, the elastic soft-tissues of organisms, like the nose of elephant seals, or the buccal sac of frogs, are poorly studied and no computer vision methods have been proposed. This leaves major gaps in different areas of biology. In primatology, most critically, the function of air sacs is widely debated; many open questions on the role of air sacs in the evolution of animal communication, including human speech, remain unanswered. To support the dynamic study of soft-tissue structures, we present a toolkit for the automated tracking of semi-circular elastic structures in biological video data. The toolkit contains unsupervised computer vision tools (using Hough transform) and supervised deep learning (by adapting DeepLabCut) methodology to track inflation of laryngeal air sacs or other biological spherical objects (e.g., gular cavities). Confirming the value of elastic kinematic analysis, we show that air sac inflation correlates with acoustic markers that likely inform about body size. Finally, we present a pre-processed audiovisual-kinematic dataset of 7+ hours of closeup audiovisual recordings of siamang (Symphalangus syndactylus) singing. This toolkit (https://github.com/WimPouw/AirSacTracker) aims to revitalize the study of non-skeletal morphological structures across multiple species.
Affiliation(s)
- Lara S. Burchardt
  - Donders Institute for Brain, Cognition, and Behaviour, Radboud University, Nijmegen, Netherlands
  - Leibniz-Zentrum Allgemeine Sprachwissenschaft, Berlin, Germany
- Yana van de Sande
  - Donders Institute for Brain, Cognition, and Behaviour, Radboud University, Nijmegen, Netherlands
- Mounia Kehy
  - Equipe de Neuro-Ethologie Sensorielle, Université Jean Monnet, France
- Marco Gamba
  - Department of Life Sciences and Systems Biology, University of Turin, Turin, Italy
- Andrea Ravignani
  - Comparative Bioacoustics Group, Max Planck Institute for Psycholinguistics, Nijmegen, Netherlands
  - Center for Music in the Brain, Department of Clinical Medicine, Aarhus University & The Royal Academy of Music, Aarhus, Denmark
  - Department of Human Neurosciences, Sapienza University of Rome, Rome, Italy
- Wim Pouw
  - Donders Institute for Brain, Cognition, and Behaviour, Radboud University, Nijmegen, Netherlands
2
Ma H, Wang Z, Han P, Fan P, Chapman CA, Garber PA, Fan P. Small apes adjust rhythms to facilitate song coordination. Curr Biol 2024; 34:935-945.e3. [PMID: 38266649 DOI: 10.1016/j.cub.2023.12.071] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/07/2023] [Revised: 11/03/2023] [Accepted: 12/22/2023] [Indexed: 01/26/2024]
Abstract
Song coordination is a universal characteristic of human music. Many animals also produce well-coordinated duets or choruses that resemble human music. However, the mechanism and evolution of song coordination have only recently been studied in animals. Here, we studied the mechanism of song coordination in three closely related species of wild Nomascus gibbons that live in polygynous groups. In each species, song bouts were dominated by male solo sequences (referred to hereafter as male sequence), and females contributed stereotyped great calls to coordinate with males. Considering the function of rhythm in facilitating song coordination in human music and animal vocalizations, we predicted that adult males adjust their song rhythm to facilitate song coordination with females. In support of this prediction, we found that adult males produced significantly more isochronous rhythms with a faster tempo in male sequences that were followed by successful female great calls (a complete sequence with "introductory" and "wa" notes). The difference in isochrony and tempos between successful great call sequences and male sequences was smaller in N. concolor compared with the other two species, which may make it difficult for females to predict a male's precise temporal pattern. Consequently, adult females of N. concolor produced more failed great call (an incomplete sequence with only introductory notes) sequences. We propose that the high degree of rhythm change functions as an unambiguous signal that can be easily perceived by receivers. In this regard, gibbon vocalizations offer an instructive model to understand the origins and evolution of human music.
Affiliation(s)
- Haigang Ma
  - School of Life Sciences, Sun Yat-Sen University, Guangzhou 510275, Guangdong, China
- Zidi Wang
  - School of Life Sciences, Sun Yat-Sen University, Guangzhou 510275, Guangdong, China
- Pu Han
  - School of Life Sciences, Sun Yat-Sen University, Guangzhou 510275, Guangdong, China
- Penglai Fan
  - Key Laboratory of Ecology of Rare and Endangered Species and Environmental Protection (Guangxi Normal University), Ministry of Education, Guilin 541006, Guangxi, China; Endangered Animal Ecology, College of Life Sciences, Guangxi Normal University, Guilin 541006, Guangxi, China
- Colin A Chapman
  - Biology Department, Vancouver Island University, Nanaimo, BC V9R 5S5, Canada; Wilson Center, 1300 Pennsylvania Avenue NW, Washington, DC 20004, USA; School of Life Sciences, University of KwaZulu-Natal, Scottsville, Pietermaritzburg 3209, South Africa; Shanxi Key Laboratory for Animal Conservation, Northwest University, Xi'an 710127, China
- Paul A Garber
  - Department of Anthropology, Program in Ecology and Evolutionary Biology, University of Illinois, Urbana, IL 61801, USA; International Centre of Biodiversity and Primate Conservation, Dali University, Dali 671003, Yunnan, China
- Pengfei Fan
  - School of Life Sciences, Sun Yat-Sen University, Guangzhou 510275, Guangdong, China
3
Nakamura K, Kanaya M, Matsushima D, Dunn JC, Hirabayashi H, Sato K, Tokuda IT, Nishimura T. Twin vocal folds as a novel evolutionary adaptation for vocal communications in lemurs. Sci Rep 2024; 14:3631. [PMID: 38351102 PMCID: PMC10864409 DOI: 10.1038/s41598-024-54172-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/03/2023] [Accepted: 02/09/2024] [Indexed: 02/16/2024] Open
Abstract
Primates have varied vocal repertoires to communicate with conspecifics and sometimes other species. The larynx has a central role in vocal source generation, where a pair of vocal folds vibrates to modify the air flow. Here, we show that Madagascan lemurs have a unique additional pair of folds in the vestibular region, parallel to the vocal folds. The additional fold has a rigid body of a vocal muscle branch and it is covered by a stratified squamous epithelium, equal to those of the vocal fold. Such anatomical features support the hypothesis that it also vibrates in a manner like the vibrations that occur in the vocal folds. To examine the acoustic function of the two pairs of folds, we made a silicone compound model to demonstrate that they can simultaneously vibrate to lower the fundamental frequency and increase vocal efficiency. Similar acoustic effects are achieved using different features of the larynx for the other primates, e.g., by vibrating multiple sets of ventricular folds in several species and further by an evolutionary modification of enlarged larynx in howler monkeys. Our multidisciplinary approaches found that these functions were acquired through a unique evolutionary adaptation of the twin vocal folds in Madagascan lemurs.
Affiliation(s)
- Kanta Nakamura
  - Center for the Evolutionary Origins of Human Behavior, Kyoto University, Inuyama, Aichi, Japan
- Mayuka Kanaya
  - College of Science and Engineering, Ritsumeikan University, Kusatsu, Shiga, Japan
- Daisuke Matsushima
  - College of Science and Engineering, Ritsumeikan University, Kusatsu, Shiga, Japan
- Jacob C Dunn
  - Behavioural Ecology Research Group, Anglia Ruskin University, Cambridge, UK
  - Biological Anthropology, The University of Cambridge, Cambridge, UK
  - Department of Cognitive Biology, University of Vienna, Vienna, Austria
- Kiminori Sato
  - Department of Otolaryngology-Head and Neck Surgery, Kurume University School of Medicine, Kurume, Fukuoka, Japan
- Isao T Tokuda
  - College of Science and Engineering, Ritsumeikan University, Kusatsu, Shiga, Japan
- Takeshi Nishimura
  - Center for the Evolutionary Origins of Human Behavior, Kyoto University, Inuyama, Aichi, Japan
4
Anikin A, Canessa-Pollard V, Pisanski K, Massenet M, Reby D. Beyond speech: Exploring diversity in the human voice. iScience 2023; 26:108204. [PMID: 37908309 PMCID: PMC10613903 DOI: 10.1016/j.isci.2023.108204] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/03/2023] [Revised: 07/20/2023] [Accepted: 10/11/2023] [Indexed: 11/02/2023] Open
Abstract
Humans have evolved voluntary control over vocal production for speaking and singing, while preserving the phylogenetically older system of spontaneous nonverbal vocalizations such as laughs and screams. To test for systematic acoustic differences between these vocal domains, we analyzed a broad, cross-cultural corpus representing over 2 h of speech, singing, and nonverbal vocalizations. We show that, while speech is relatively low-pitched and tonal with mostly regular phonation, singing and especially nonverbal vocalizations vary enormously in pitch and often display harsh-sounding, irregular phonation owing to nonlinear phenomena. The evolution of complex supralaryngeal articulatory spectro-temporal modulation has been critical for speech, yet has not significantly constrained laryngeal source modulation. In contrast, articulation is very limited in nonverbal vocalizations, which predominantly contain minimally articulated open vowels and rapid temporal modulation in the roughness range. We infer that vocal source modulation works best for conveying affect, while vocal filter modulation mainly facilitates semantic communication.
Affiliation(s)
- Andrey Anikin
  - Division of Cognitive Science, Lund University, Lund, Sweden
  - ENES Bioacoustics Research Lab, CRNL, University of Saint-Etienne, CNRS, Inserm, 23 rue Michelon, 42023 Saint-Etienne, France
- Valentina Canessa-Pollard
  - ENES Bioacoustics Research Lab, CRNL, University of Saint-Etienne, CNRS, Inserm, 23 rue Michelon, 42023 Saint-Etienne, France
  - Psychology, Institute of Psychology, Business and Human Sciences, University of Chichester, Chichester, West Sussex PO19 6PE, UK
- Katarzyna Pisanski
  - ENES Bioacoustics Research Lab, CRNL, University of Saint-Etienne, CNRS, Inserm, 23 rue Michelon, 42023 Saint-Etienne, France
  - CNRS French National Centre for Scientific Research, DDL Dynamics of Language Lab, University of Lyon 2, 69007 Lyon, France
  - Institute of Psychology, University of Wrocław, Dawida 1, 50-527 Wrocław, Poland
- Mathilde Massenet
  - ENES Bioacoustics Research Lab, CRNL, University of Saint-Etienne, CNRS, Inserm, 23 rue Michelon, 42023 Saint-Etienne, France
- David Reby
  - ENES Bioacoustics Research Lab, CRNL, University of Saint-Etienne, CNRS, Inserm, 23 rue Michelon, 42023 Saint-Etienne, France
5
Herbst CT, Elemans CPH, Tokuda IT, Chatziioannou V, Švec JG. Dynamic System Coupling in Voice Production. J Voice 2023:S0892-1997(22)00310-1. [PMID: 36737267 DOI: 10.1016/j.jvoice.2022.10.004] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 09/08/2022] [Revised: 10/07/2022] [Accepted: 10/07/2022] [Indexed: 02/04/2023]
Abstract
Voice is a major means of communication for humans, non-human mammals and many other vertebrates like birds and anurans. The physical and physiological principles of voice production are described by two theories: the MyoElastic-AeroDynamic (MEAD) theory and the Source-Filter Theory (SFT). While MEAD employs a multiphysics approach to understand the motor control and dynamics of self-sustained vibration of vocal folds or analogous tissues, SFT predominantly uses acoustics to understand spectral changes of the source via linear propagation through the vocal tract. Because the two theories focus on different aspects of voice production, they are often applied distinctly in specific areas of science and engineering. Here, we argue that the MEAD and the SFT are linked integral aspects of a holistic theory of voice production, describing a dynamically coupled system. The aim of this manuscript is to provide a comprehensive review of both the MEAD and the source-filter theory with its nonlinear extension, the latter of which suggests a number of conceptual similarities to sound production in brass instruments. We discuss the application of both theories to voice production of humans as well as of animals. An appraisal of voice production in the light of non-linear dynamics supports the notion that voice production can best be described with a systems view, considering coupled systems rather than isolated contributions of individual sub-systems.
Affiliation(s)
- Christian T Herbst
  - Department of Vocal Studies, Mozarteum University, Salzburg, Austria; Janette Ogg Voice Research Center, Shenandoah Conservatory, Winchester, Virginia. http://www.christian-herbst.org
- Coen P H Elemans
  - Vocal Neuromechanics Lab, Department of Biology, University of Southern Denmark, Odense M, Denmark
- Isao T Tokuda
  - Department of Mechanical Engineering, Ritsumeikan University, Kusatsu, Shiga, Japan
- Jan G Švec
  - Voice Research Laboratory, Department of Experimental Physics, Faculty of Science, Palacky University Olomouc, Olomouc, Czech Republic
6
Adret P. Developmental Plasticity in Primate Coordinated Song: Parallels and Divergences With Duetting Songbirds. Front Ecol Evol 2022. [DOI: 10.3389/fevo.2022.862196] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/13/2022] Open
Abstract
Homeothermic animals (birds and mammals) are prime model systems for investigating the developmental plasticity and neural mechanisms of vocal duetting, a cooperative acoustic signal that prevails in family-living and pair-bonded species including humans. This review focuses on the nature of this trait and its nurturing during ontogeny and extending into adulthood. I begin by outlining the underpinning concepts of duet codes and pair-specific answering rules as used by birds to develop their learned coordinated song, driven by a complex interaction between self-generated and socially mediated auditory feedback. The more tractable avian model of duetting helps identify research gaps in singing primates that also use duetting as a type of intraspecific vocal interaction. Nevertheless, it has become clear that primate coordinated song, whether overlapping or antiphonal, is subject to some degree of vocal flexibility. This is reflected in the ability of lesser apes, titi monkeys, tarsiers, and lemurs to adjust the structure and timing of their calls through (1) social influence, (2) coordinated duetting both before and after mating, (3) the repair of vocal mistakes, (4) the production of heterosexual song early in life, (5) vocal accommodation in call rhythm, (6) conditioning, and (7) innovation. Furthermore, experimental work on the neural underpinnings of avian and mammalian antiphonal duets points to a hierarchical (cortico-subcortical) control mechanism that regulates, via inhibition, the temporal segregation of rapid vocal exchanges. I discuss some weaknesses in this growing field of research and highlight prospective avenues for future investigation.
7
Iwasaki SI, Yoshimura K, Asami T, Erdoğan S. Comparative morphology and physiology of the vocal production apparatus and the brain in the extant primates. Ann Anat 2022; 240:151887. [PMID: 35032565 DOI: 10.1016/j.aanat.2022.151887] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 12/01/2021] [Revised: 12/26/2021] [Accepted: 12/28/2021] [Indexed: 01/04/2023]
Abstract
Objective data, mainly from the comparative anatomy of various organs related to human speech and language, are thought to provide clues about the mechanisms behind language development. The larynx and the hyoid bone are considered to have evolved towards suitable positions and forms in preparation for the large repertoire of vocalization necessary for human speech. However, some researchers have asserted that there is no significant difference in these organs between humans and non-human primates. Speech production is dependent on the voluntary control of the respiratory, laryngeal, and vocal tract musculature. Such control is fully present in humans but only partially so in non-human primates, which appear to be able to voluntarily control only supralaryngeal articulators. Both humans and non-human primates have direct cortical innervation of motor neurons controlling the supralaryngeal vocal tract, but only humans appear to have direct cortical innervation of motor neurons controlling the larynx. In this review, we investigate the comparative morphology and function of the wide range of components involved in vocal production, including the larynx, the hyoid bone, the tongue, and the vocal brain. We would like to emphasize the importance of the tongue in the primary development of human speech and language. It is now time to reconsider the possibility of the tongue playing a definitive role in the emergence of human speech.
Affiliation(s)
- Shin-Ichi Iwasaki
  - Faculty of Health Science, Gunma PAZ University, Takasaki, Japan; The Nippon Dental University, Tokyo and Niigata, Japan
- Ken Yoshimura
  - Department of Anatomy, The Nippon Dental University School of Life Dentistry at Niigata, Niigata, Japan
- Tomoichiro Asami
  - Faculty of Rehabilitation, Gunma Paz University, Takasaki, Japan
- Serkan Erdoğan
  - Department of Anatomy, Faculty of Veterinary Medicine, Tekirdağ Namık Kemal University, Tekirdağ, Turkey
8
The transition from the female-like great calls to male calls during ontogeny in southern yellow-cheeked gibbon males (Nomascus gabriellae). Sci Rep 2021; 11:22040. [PMID: 34764403 PMCID: PMC8586049 DOI: 10.1038/s41598-021-01648-x] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 07/21/2021] [Accepted: 10/29/2021] [Indexed: 11/09/2022] Open
Abstract
It is well known that gibbons emit a pattern of vocalizations, which is specific for species and sex. A previous study showed, however, that immature southern yellow-cheeked gibbon (Nomascus gabriellae) males produce only female-like great calls from 2.3 to 5.3 years of age in co-singing interactions with their mothers. To date, nothing is known about how the vocal repertoire of a male changes from the female-like call (great call) to the male call (staccato notes and multi-modulation phrase) during vocal ontogeny. The goal of this study was to describe the transition from the female-like great call to the male call and the ontogeny of the male call. We predicted that the transition from the female-like great call to the male-specific call and the development of the male call is a normal part of the aging process. If this is the case, the following phenomena will occur: (a) female vocalization should no longer be produced with the mature form of the multi-modulation phrase and (b) all stages of the male vocalization should occur gradually as the young male ages. Young males regularly emitted both female-like great calls and male-specific calls between the ages of 5.6 and 7.1 years. Once the young males reached 7.1 years of age, they emitted male calls exclusively, and they continued to do so until the end of the observation period (at 8.11 years of age). It was confirmed that the young males emitted only female-like great calls during periods when they produced non-mature forms of a multi-modulation phrase (Fm0,1: none or one frequency modulation in second notes). Furthermore, the decrease in the number of female-like great calls was attributed to the development of the mature form of the multi-modulation phrase (Fm2: two or more frequency modulations in second notes), which developed with age. We also confirmed that the multi-modulation phrase developed gradually, while the development of the staccato notes occurred in leaps.
A multi-modulation phrase developed as the initial part of the male-specific call. It evolved from a simpler to a more complex form as the maximum frequency and age of the young males increased. Staccato notes subsequently developed in certain young males. Possible explanations for such vocal ontogeny in young males are discussed in this work.
9
Matsumoto T, Kanaya M, Ishimura K, Tokuda IT. Experimental study of vocal-ventricular fold oscillations in voice production. J Acoust Soc Am 2021; 149:271. [PMID: 33514158 DOI: 10.1121/10.0003211] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Received: 06/22/2020] [Accepted: 12/15/2020] [Indexed: 06/12/2023]
Abstract
Ventricular folds are located in the supraglottal region above the vocal folds. Although the ventricular folds do not vibrate under normal vocalizations, they vibrate under certain conditions, e.g., throat singing or ventricular fold dysphonia. In throat singing, the ventricular folds vibrate at the same frequency as (or at integer ratios of) the vocal fold vibration frequency. In ventricular fold dysphonia, on the other hand, the ventricular folds interfere with the vocal folds, giving rise to a hoarse voice. In the present study, the synthetic larynx model was utilized to examine the vocal-ventricular fold oscillations. Our experiments revealed that the vocal and ventricular folds can co-oscillate at the same frequency with an out-of-phase relation. Compared to the control condition, under which no ventricular folds exist, the phonation threshold pressure was increased in the presence of the ventricular folds. Acoustic analysis indicated that jitter was reduced and vocal efficiency was increased by the ventricular folds. Distance between the vocal and ventricular folds did not alter these oscillation properties. A computational model was further simulated to elucidate the mechanism underlying the observed vocal-ventricular fold oscillations. It has been suggested that out-of-phase oscillations of the vocal and ventricular folds are important for sustaining periodic laryngeal vibrations.
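The abstract above reports that jitter was reduced, but does not say which jitter measure was used. As a rough illustration only, the sketch below computes one common variant, often called local jitter: the mean absolute difference between consecutive glottal periods divided by the mean period. The function name `local_jitter` and the example period values are hypothetical and are not taken from the study.

```python
def local_jitter(periods):
    """Local jitter: mean |T[i] - T[i+1]| divided by the mean period.

    `periods` is a sequence of consecutive cycle durations in seconds.
    This is one common definition, assumed here for illustration; the
    study summarized above may have used a different measure.
    """
    if len(periods) < 2:
        raise ValueError("need at least two periods")
    # Absolute cycle-to-cycle differences between neighbouring periods
    diffs = [abs(a - b) for a, b in zip(periods, periods[1:])]
    mean_diff = sum(diffs) / len(diffs)
    mean_period = sum(periods) / len(periods)
    return mean_diff / mean_period


# Invented example values: a perfectly regular 100 Hz oscillation
# and one whose period alternates between 10 ms and 11 ms.
steady = [0.010, 0.010, 0.010, 0.010, 0.010]
alternating = [0.010, 0.011, 0.010, 0.011]

jitter_steady = local_jitter(steady)
jitter_alternating = local_jitter(alternating)
```

A perfectly periodic signal gives a jitter of zero, and larger cycle-to-cycle variation gives a larger value, so lower jitter corresponds to more regular oscillation.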
Affiliation(s)
- Takuma Matsumoto
  - Graduate School of Science and Engineering, Ritsumeikan University, Noji-higashi, Kusatsu, Shiga 525-8577, Japan
- Mayuka Kanaya
  - Graduate School of Science and Engineering, Ritsumeikan University, Noji-higashi, Kusatsu, Shiga 525-8577, Japan
- Kazuyoshi Ishimura
  - Graduate School of Science and Engineering, Ritsumeikan University, Noji-higashi, Kusatsu, Shiga 525-8577, Japan
- Isao T Tokuda
  - Graduate School of Science and Engineering, Ritsumeikan University, Noji-higashi, Kusatsu, Shiga 525-8577, Japan
10
Herbst CT, Nishimura T, Garcia M, Migimatsu K, Tokuda IT. Effect of Ventricular Folds on Vocalization Fundamental Frequency in Domestic Pigs (Sus scrofa domesticus). J Voice 2020; 35:805.e1-805.e15. [PMID: 33388229 DOI: 10.1016/j.jvoice.2020.01.013] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Received: 12/04/2019] [Revised: 01/10/2020] [Accepted: 01/16/2020] [Indexed: 10/22/2022]
Abstract
This study investigates the effect of the ventricular folds on fundamental frequency (fo) in the voice production of domestic pigs (Sus scrofa domesticus). The excised larynges of six subadult pigs were phonated in two preparation stages, with the ventricular folds present (PS1) and removed (PS2). Vocal fold resonances were tested with a laser vibrometer, and a four-mass computational model was created. Highly significant fo differences were found between PS1 and PS2 (means at 93.7 and 409.3 Hz, respectively). Two tissue resonances were found at 115 Hz and 250-290 Hz. The computational model had unique solutions for abducted and adducted ventricular folds at about 150 and 400 Hz, roughly matching the fo measured ex vivo for PS1 and PS2. The differing fo encountered across preparation stages PS1 and PS2 is explained by distinct activation of either a high or a low eigenfrequency mode, depending on the engagement of the ventricular folds. The inability of the investigated larynges to vibrate at frequencies below 250 Hz in PS2 suggests that in vivo low-frequency calls of domestic pigs (pre-eminently grunts) are likely produced with engaged ventricular folds. Allometric comparison suggests that the special, mechanically coupled "double oscillator" has evolved to prevent signaling disadvantages. Given these traits, the porcine larynx might - apart from special applications relating to the involvement of ventricular folds - not be an ideal candidate for emulating human voice production in excised larynx experimentation.
Affiliation(s)
- Christian T Herbst
  - Antonio Salieri Department of Vocal Studies and Vocal Research in Music Education, University of Music and Performing Arts Vienna, Vienna, Austria
- Maxime Garcia
  - ENES Lab, Université Lyon/Saint-Etienne, Neuro-PSI, CNRS UMR 9197, Saint-Etienne, France; Department of Evolutionary Biology and Environmental Studies, University of Zurich, Zürich, Switzerland
- Kishin Migimatsu
  - Department of Mechanical Engineering, Ritsumeikan University, Kusatsu, Shiga, Japan
- Isao T Tokuda
  - Department of Mechanical Engineering, Ritsumeikan University, Kusatsu, Shiga, Japan
11
Toyoda A, Maruhashi T, Malaivijitnond S, Koda H. Dominance status and copulatory vocalizations among male stump-tailed macaques in Thailand. Primates 2020; 61:685-694. [PMID: 32318928 DOI: 10.1007/s10329-020-00820-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 06/30/2019] [Accepted: 04/08/2020] [Indexed: 11/25/2022]
Abstract
Male copulation calls sometimes play important roles in sexual strategies, attracting conspecific females or advertising their social status to conspecific males. These calls generally occur in sexually competitive societies such as harem groups and multi-male and multi-female societies. However, the call functions remain unclear because of limited availability of data sets that include a large number of male and female animals in naturalistic environments, particularly in primates. Here, we examined the possible function of male-specific copulation calls in wild stump-tailed macaques (Macaca arctoides) by analyzing the contexts and acoustic features of vocalizations. We observed 395 wild stump-tailed macaques inhabiting the Khao Krapuk Khao Taomor Non-Hunting Area in Thailand and recorded all occurrences of observed copulations. We counted 446 male-specific calls in 383 copulations recorded, and measured their acoustic characteristics. Data were categorized into three groups depending on their social status: dominant (alpha and coalition) males and non-dominant males. When comparing male status, alpha males most frequently produced copulation calls at ejaculation, coalition males produced less frequent calls than alpha males, and other non-dominant males rarely vocalized, maintaining silence even when mounting females. Acoustic analysis indicated no significant influence of status (alpha or coalition) on call number, bout duration, or further formant dispersion parameters. Our results suggest that male copulation calls of this species are social status-dependent signals. Furthermore, dominant males might actively transmit their social status and copulations to other male rivals to impede their challenging attacks, while other non-dominant males maintain silence to prevent the interference of dominants.
Affiliation(s)
- Aru Toyoda
  - Academy of Emerging Science, Chubu University, Inuyama, Aichi, 484-8506, Japan
  - Core Research for Evolutional Science and Technology, Japan Science and Technology Agency, Saitama, Japan
  - Primate Research Institute, Kyoto University, Kyoto, Japan
- Suchinda Malaivijitnond
  - Department of Biology, Faculty of Science, Chulalongkorn University, Bangkok, 10330, Thailand
  - National Primate Research Center of Thailand, Chulalongkorn University, Saraburi, 18110, Thailand
- Hiroki Koda
  - Primate Research Institute, Kyoto University, Kyoto, Japan
12
Clink DJ, Ahmad AH, Klinck H. Brevity is not a universal in animal communication: evidence for compression depends on the unit of analysis in small ape vocalizations. R Soc Open Sci 2020; 7:200151. [PMID: 32431905 PMCID: PMC7211885 DOI: 10.1098/rsos.200151] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Received: 01/25/2020] [Accepted: 03/10/2020] [Indexed: 05/04/2023]
Abstract
Evidence for compression, or minimization of code length, has been found across biological systems from genomes to human language and music. Two linguistic laws-Menzerath's Law (which states that longer sequences consist of shorter constituents) and Zipf's Law of abbreviation (a negative relationship between signal length and frequency of use)-are predictions of compression. It has been proposed that compression is a universal in animal communication, but there have been mixed results, particularly in reference to Zipf's Law of abbreviation. Like songbirds, male gibbons (Hylobates muelleri) engage in long solo bouts with unique combinations of notes which combine into phrases. We found strong support for Menzerath's Law as the longer a phrase, the shorter the notes. To identify phrase types, we used state-of-the-art affinity propagation clustering, and were able to predict phrase types using support vector machines with a mean accuracy of 74%. Based on unsupervised phrase type classification, we did not find support for Zipf's Law of abbreviation. Our results indicate that adherence to linguistic laws in male gibbon solos depends on the unit of analysis. We conclude that principles of compression are applicable outside of human language, but may act differently across levels of organization in biological systems.
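Menzerath's Law as stated in the abstract above (longer sequences consist of shorter constituents) predicts a negative association between the number of notes in a phrase and the mean duration of those notes. The following is a minimal sketch of one way to check that prediction, using a Spearman rank correlation written with the Python standard library only. It is not the authors' analysis: the function names and the toy note durations are invented for illustration.

```python
from statistics import mean


def ranks(values):
    """Return 1-based ranks; tied values share the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = average_rank
        i = j + 1
    return result


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / (var_x * var_y) ** 0.5


def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))


def menzerath_correlation(phrases):
    """Correlate phrase size (note count) with mean note duration.

    `phrases` is a list of phrases, each a list of note durations in
    seconds. A negative value is in the direction Menzerath's Law predicts.
    """
    sizes = [len(phrase) for phrase in phrases]
    mean_durations = [mean(phrase) for phrase in phrases]
    return spearman(sizes, mean_durations)


# Invented toy data: longer phrases are built from shorter notes.
toy_phrases = [
    [0.50, 0.48],
    [0.40, 0.42, 0.38],
    [0.30, 0.33, 0.31, 0.29],
    [0.25, 0.22, 0.24, 0.26, 0.23],
]
rho = menzerath_correlation(toy_phrases)
```

A rank correlation is used here because it only assumes a monotonic relationship. A real analysis would also need to account for repeated measures per individual, which this sketch ignores.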
Affiliation(s)
- Dena J. Clink
  - Center for Conservation Bioacoustics, Cornell Laboratory of Ornithology, Cornell University, Ithaca, NY, USA
- Abdul Hamid Ahmad
  - Faculty of Sustainable Agriculture, Universiti Malaysia Sabah, Sandakan Campus, Sabah, Malaysia
- Holger Klinck
  - Center for Conservation Bioacoustics, Cornell Laboratory of Ornithology, Cornell University, Ithaca, NY, USA
|
13
|
Male gibbon loud morning calls conform to Zipf's law of brevity and Menzerath's law: insights into the origin of human language. Anim Behav 2020. [DOI: 10.1016/j.anbehav.2019.11.017] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Indexed: 12/27/2022]
|
14
|
Prieur J, Barbu S, Blois‐Heulin C, Lemasson A. The origins of gestures and language: history, current advances and proposed theories. Biol Rev Camb Philos Soc 2019; 95:531-554. [DOI: 10.1111/brv.12576] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Received: 01/24/2019] [Revised: 11/30/2019] [Accepted: 12/03/2019] [Indexed: 12/16/2022]
Affiliation(s)
- Jacques Prieur
  - Department of Education and Psychology, Comparative Developmental Psychology, Freie Universität Berlin, Berlin, Germany
  - Univ Rennes, Normandie Univ, CNRS, EthoS (Ethologie animale et humaine) – UMR 6552, F‐35380 Paimpont, France
- Stéphanie Barbu
  - Univ Rennes, Normandie Univ, CNRS, EthoS (Ethologie animale et humaine) – UMR 6552, F‐35380 Paimpont, France
- Catherine Blois‐Heulin
  - Univ Rennes, Normandie Univ, CNRS, EthoS (Ethologie animale et humaine) – UMR 6552, F‐35380 Paimpont, France
- Alban Lemasson
  - Univ Rennes, Normandie Univ, CNRS, EthoS (Ethologie animale et humaine) – UMR 6552, F‐35380 Paimpont, France
|
15
|
Migimatsu K, Tokuda IT. Experimental study on nonlinear source-filter interaction using synthetic vocal fold models. The Journal of the Acoustical Society of America 2019; 146:983. [PMID: 31472538 DOI: 10.1121/1.5120618] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Received: 12/12/2018] [Accepted: 07/18/2019] [Indexed: 06/10/2023]
Abstract
Under certain conditions, e.g., singing voice, the fundamental frequency of the vocal folds can go up and interfere with the formant frequencies. Acoustic feedback from the vocal tract filter to the vocal fold source then becomes strong and non-negligible. An experimental study was presented on such source-filter interaction using three types of synthetic vocal fold models. Asymmetry was also created between the left and right vocal folds. The experiment reproduced various nonlinear phenomena, such as frequency jump and quenching, as reported in humans. Increase in phonation threshold pressure was also observed when resonant frequency of the vocal tract and fundamental frequency of the vocal folds crossed each other. As a combined effect, the phonation threshold pressure was further increased by the left-right asymmetry. Simulation of the asymmetric two-mass model reproduced the experiments to some extent. One of the intriguing findings of this study is the variable strength of the source-filter interaction over different model types. Among the three models, two models were strongly influenced by the vocal tract, while no clear effect of the vocal tract was observed in the other model. This implies that the level of source-filter interaction may vary considerably from one subject to another in humans.
Affiliation(s)
- Kishin Migimatsu
  - Graduate School of Science and Engineering, Ritsumeikan University, Noji-higashi, Kusatsu, Shiga 525-8577, Japan
- Isao T Tokuda
  - Graduate School of Science and Engineering, Ritsumeikan University, Noji-higashi, Kusatsu, Shiga 525-8577, Japan
|
16
|
Morita T, Koda H. Superregular grammars do not provide additional explanatory power but allow for a compact analysis of animal song. Royal Society Open Science 2019; 6:190139. [PMID: 31417719 PMCID: PMC6689648 DOI: 10.1098/rsos.190139] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Received: 01/23/2019] [Accepted: 06/14/2019] [Indexed: 06/10/2023]
Abstract
A pervasive belief with regard to the differences between human language and animal vocal sequences (song) is that they belong to different classes of computational complexity, with animal song belonging to regular languages, whereas human language is superregular. This argument, however, lacks empirical evidence since superregular analyses of animal song are understudied. The goal of this paper is to perform a superregular analysis of animal song, using data from gibbons as a case study, and demonstrate that a superregular analysis can be effectively used with non-human data. A key finding is that a superregular analysis does not increase explanatory power but rather provides for compact analysis: fewer grammatical rules are necessary once superregularity is allowed. This pattern is analogous to a previous computational analysis of human language, and accordingly, the null hypothesis, that human language and animal song are governed by the same type of grammatical systems, cannot be rejected.
Affiliation(s)
- T. Morita
  - Primate Research Institute, Kyoto University, 41-2 Kanrin, Inuyama, Aichi 484-8506, Japan
- H. Koda
  - Primate Research Institute, Kyoto University, 41-2 Kanrin, Inuyama, Aichi 484-8506, Japan
|
17
|
Reby D, Wyman MT, Frey R, Charlton BD, Dalmont JP, Gilbert J. Vocal tract modelling in fallow deer: are male groans nasalized? J Exp Biol 2018; 221:jeb.179416. [PMID: 29941611 DOI: 10.1242/jeb.179416] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Received: 02/16/2018] [Accepted: 06/21/2018] [Indexed: 11/20/2022]
Abstract
Males of several species of deer have a descended and mobile larynx, resulting in an unusually long vocal tract, which can be further extended by lowering the larynx during call production. Formant frequencies are lowered as the vocal tract is extended, as predicted when approximating the vocal tract as a uniform quarter wavelength resonator. However, formant frequencies in polygynous deer follow uneven distribution patterns, indicating that the vocal tract configuration may in fact be rather complex. We CT-scanned the head and neck region of two adult male fallow deer specimens with artificially extended vocal tracts and measured the cross-sectional areas of the supra-laryngeal vocal tract along the oral and nasal tracts. The CT data were then used to predict the resonances produced by three possible configurations, including the oral vocal tract only, the nasal vocal tract only, or combining the two. We found that the area functions from the combined oral and nasal vocal tracts produced resonances more closely matching the formant pattern and scaling observed in fallow deer groans than those predicted by the area functions of the oral vocal tract only or of the nasal vocal tract only. This indicates that the nasal and oral vocal tracts are both simultaneously involved in the production of a non-human mammal vocalization, and suggests that the potential for nasalization in putative oral loud calls should be carefully considered.
Affiliation(s)
- D Reby
  - School of Psychology, University of Sussex, Falmer, Brighton BN1 9QH, UK
- M T Wyman
  - School of Psychology, University of Sussex, Falmer, Brighton BN1 9QH, UK
  - Department of Evolutionary Biology and Environmental Studies, University of Zurich, Winterthurerstrasse 190, 8057 Zurich, Switzerland
- R Frey
  - Department of Reproduction Management, Leibniz Institute for Zoo and Wildlife Research (IZW), 10315 Berlin, Germany
- B D Charlton
  - San Diego Zoo's Institute for Conservation Research, Escondido 92027, CA, USA
- J P Dalmont
  - Laboratoire d'Acoustique de l'Université du Mans, CNRS, 72085 le Mans, France
- J Gilbert
  - Laboratoire d'Acoustique de l'Université du Mans, CNRS, 72085 le Mans, France
|
18
|
Clink DJ, Charif RA, Crofoot MC, Marshall AJ. Evidence for vocal performance constraints in a female nonhuman primate. Anim Behav 2018. [DOI: 10.1016/j.anbehav.2018.05.002] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Indexed: 12/27/2022]
|
19
|
Frey R, Reby D, Fritsch G, Charlton BD. The remarkable vocal anatomy of the koala (Phascolarctos cinereus): insights into low-frequency sound production in a marsupial species. J Anat 2018; 232:575-595. [PMID: 29460389 DOI: 10.1111/joa.12770] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Accepted: 12/05/2017] [Indexed: 11/27/2022] Open
Abstract
Koalas are characterised by a highly unusual vocal anatomy, with a descended larynx and velar vocal folds, allowing them to produce calls at disproportionately low frequencies. Here we use advanced imaging techniques, histological data, classical macroscopic dissection and behavioural observations to provide the first detailed description and interpretation of male and female koala vocal anatomy. We show that both males and females have an elongated pharynx and soft palate, resulting in a permanently descended larynx. In addition, the hyoid apparatus has a human-like configuration in which paired dorsal, resilient ligaments suspend the hyoid apparatus from the skull, while the ventral parts tightly connect to the descended larynx. We also show that koalas can retract the larynx down into the thoracic inlet, facilitated by a dramatic evolutionary transformation of the ventral neck muscles. First, the usual retractors of the larynx and the hyoid have their origins deep in the thorax. Secondly, three hyoid muscles have lost their connection to the hyoid skeleton. Thirdly, the genioglossus and geniohyoid muscles are greatly increased in length. Finally, the digastric, omohyoid and sternohyoid muscles, connected by a common tendinous intersection, form a guiding channel for the dynamic down-and-up movements of the ventral hyoid parts and the larynx. We suggest that these features evolved to accommodate the low resting position of the larynx and assist in its retraction during call production. We also confirm that the edges of the intra-pharyngeal ostium have specialised to form the novel, extra-laryngeal velar vocal folds, which are much larger than the true intra-laryngeal vocal folds in both sexes, but more developed and specialised for low frequency sound production in males than in females. 
Our findings illustrate that strong selection pressures on acoustic signalling not only lead to the specialisation of existing vocal organs but can also result in the evolution of novel vocal structures in both sexes.
Affiliation(s)
- Roland Frey
  - Leibniz Institute for Zoo and Wildlife Research (IZW), Berlin, Germany
- David Reby
  - School of Psychology, University of Sussex, Brighton, UK
- Guido Fritsch
  - Leibniz Institute for Zoo and Wildlife Research (IZW), Berlin, Germany
- Benjamin D Charlton
  - Behavioural Ecology Research Group, Department of Biology, Anglia Ruskin University, Cambridge, UK
|
20
|
Affiliation(s)
- Isao T. Tokuda
  - Department of Mechanical Engineering, Ritsumeikan University, Kusatsu
|
21
|
|
22
|
Riede T. Peripheral Vocal Motor Dynamics and Combinatory Call Complexity of Ultrasonic Vocal Production in Rats. Handbook of Ultrasonic Vocalization - A Window into the Emotional Brain 2018. [DOI: 10.1016/b978-0-12-809600-0.00005-6] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Indexed: 12/23/2022]
|
23
|
Garcia M, Herbst CT, Bowling DL, Dunn JC, Fitch WT. Acoustic allometry revisited: morphological determinants of fundamental frequency in primate vocal production. Sci Rep 2017; 7:10450. [PMID: 28874852 PMCID: PMC5585385 DOI: 10.1038/s41598-017-11000-x] [Citation(s) in RCA: 31] [Impact Index Per Article: 4.4] [Received: 06/22/2017] [Accepted: 08/15/2017] [Indexed: 11/08/2022] Open
Abstract
A fundamental issue in the evolution of communication is the degree to which signals convey accurate ("honest") information about the signaler. In bioacoustics, the assumption that fundamental frequency (fo) should correlate with the body size of the caller is widespread, but this belief has been challenged by various studies, possibly because larynx size and body size can vary independently. In the present comparative study, we conducted excised larynx experiments to investigate this hypothesis rigorously and explore the determinants of fo. Using specimens from eleven primate species, we carried out an inter-specific investigation, examining correlations between the minimum fo produced by the sound source, body size and vocal fold length (VFL). We found that, across species, VFL predicted minimum fo much better than body size, clearly demonstrating the potential for decoupling between larynx size and body size in primates. These findings shed new light on the diversity of primate vocalizations and vocal morphology, highlighting the importance of vocal physiology in understanding the evolution of mammal vocal communication.
Affiliation(s)
- Maxime Garcia
  - Department of Cognitive Biology, University of Vienna, Althanstrasse 14, 1090, Vienna, Austria.
  - ENES Lab, Université Lyon/Saint-Etienne, NEURO-PSI, CNRS UMR 9197, Saint-Etienne, France.
- Christian T Herbst
  - Department of Cognitive Biology, University of Vienna, Althanstrasse 14, 1090, Vienna, Austria
- Daniel L Bowling
  - Department of Cognitive Biology, University of Vienna, Althanstrasse 14, 1090, Vienna, Austria
- Jacob C Dunn
  - Division of Biological Anthropology, University of Cambridge, Pembroke Street, Cambridge, CB2 3QG, UK
  - Animal and Environment Research Group, Anglia Ruskin University, East Road, Cambridge, CB1 1PT, UK
- W Tecumseh Fitch
  - Department of Cognitive Biology, University of Vienna, Althanstrasse 14, 1090, Vienna, Austria.
|
24
|
|
25
|
Bouchet H, Koda H, Masataka N, Lemasson A. Vocal flexibility in nonhuman primates and the origins of human language. Revue de Primatologie 2016. [DOI: 10.4000/primatologie.2637] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Indexed: 11/16/2022] Open
|
26
|
Reber SA, Nishimura T, Janisch J, Robertson M, Fitch WT. A Chinese alligator in heliox: formant frequencies in a crocodilian. J Exp Biol 2016; 218:2442-7. [PMID: 26246611 PMCID: PMC4528706 DOI: 10.1242/jeb.119552] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Indexed: 11/20/2022]
Abstract
Crocodilians are among the most vocal non-avian reptiles. Adults of both sexes produce loud vocalizations known as ‘bellows’ year round, with the highest rate during the mating season. Although the specific function of these vocalizations remains unclear, they may advertise the caller's body size, because relative size differences strongly affect courtship and territorial behaviour in crocodilians. In mammals and birds, a common mechanism for producing honest acoustic signals of body size is via formant frequencies (vocal tract resonances). To our knowledge, formants have to date never been documented in any non-avian reptile, and formants do not seem to play a role in the vocalizations of anurans. We tested for formants in crocodilian vocalizations by using playbacks to induce a female Chinese alligator (Alligator sinensis) to bellow in an airtight chamber. During vocalizations, the animal inhaled either normal air or a helium/oxygen mixture (heliox) in which the velocity of sound is increased. Although heliox allows normal respiration, it alters the formant distribution of the sound spectrum. An acoustic analysis of the calls showed that the source signal components remained constant under both conditions, but an upward shift of high-energy frequency bands was observed in heliox. We conclude that these frequency bands represent formants. We suggest that crocodilian vocalizations could thus provide an acoustic indication of body size via formants. Because birds and crocodilians share a common ancestor with all dinosaurs, a better understanding of their vocal production systems may also provide insight into the communication of extinct Archosaurians. Highlighted Article: Frequency peaks in bellows of a Chinese alligator breathing a helium–oxygen mixture instead of air shift to significantly higher frequencies, showing that crocodilian vocalizations contain vocal resonance frequencies or ‘formants’.
Affiliation(s)
- Stephan A Reber
  - Department of Cognitive Biology, University of Vienna, Vienna 1150, Austria
- Takeshi Nishimura
  - Primate Research Institute, Kyoto University, Inuyama, Aichi 484-8506, Japan
- Judith Janisch
  - Department of Cognitive Biology, University of Vienna, Vienna 1150, Austria
- Mark Robertson
  - St Augustine Alligator Farm Zoological Park, St Augustine, FL 32080, USA
- W Tecumseh Fitch
  - Department of Cognitive Biology, University of Vienna, Vienna 1150, Austria
|
27
|
|
28
|
|
29
|
Reby D, Wyman MT, Frey R, Passilongo D, Gilbert J, Locatelli Y, Charlton BD. Evidence of biphonation and source–filter interactions in the bugles of male North American wapiti (Cervus canadensis). J Exp Biol 2016; 219:1224-36. [DOI: 10.1242/jeb.131219] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.1] [Received: 09/16/2015] [Accepted: 02/12/2016] [Indexed: 11/20/2022]
Abstract
With an average male body mass of 320 kg, the wapiti, Cervus canadensis, is the largest extant species of Old World deer (Cervinae). Despite this large body size, male wapiti produce whistle-like sexual calls called bugles characterised by an extremely high fundamental frequency. Investigations of the biometry and physiology of the male wapiti's relatively large larynx have so far failed to account for the production of such a high fundamental frequency. Our examination of spectrograms of male bugles suggested that the complex harmonic structure is best explained by a dual-source model (biphonation), with one source oscillating at a mean of 145 Hz (F0) and the other oscillating independently at an average of 1426 Hz (G0). A combination of anatomical investigations and acoustical modelling indicated that the F0 of male bugles is consistent with the vocal fold dimensions reported in this species, whereas the secondary, much higher source at G0 is more consistent with an aerodynamic whistle produced as air flows rapidly through a narrow supraglottic constriction. We also report a possible interaction between the higher frequency G0 and vocal tract resonances, as G0 transiently locks onto individual formants as the vocal tract is extended. We speculate that male wapiti have evolved such a dual-source phonation to advertise body size at close range (with a relatively low-frequency F0 providing a dense spectrum to highlight size-related information contained in formants) while simultaneously advertising their presence over greater distances using the very high-amplitude G0 whistle component.
Affiliation(s)
- D. Reby
  - Mammal Vocal Communication and Cognition Research Group, School of Psychology, University of Sussex, Brighton BN1 9QH, UK
- M. T. Wyman
  - Mammal Vocal Communication and Cognition Research Group, School of Psychology, University of Sussex, Brighton BN1 9QH, UK
- R. Frey
  - Leibniz Institute for Zoo and Wildlife Research (IZW), Berlin 10315, Germany
- D. Passilongo
  - Department of Science for Nature and Environmental Resources, University of Sassari, Sassari 07100, Italy
- J. Gilbert
  - Laboratoire d'Acoustique de l'Université du Maine – UMR CNRS, le Mans 72085, France
- Y. Locatelli
  - Réserve de la Haute Touche, Muséum National d'Histoire Naturelle, Obterre 36290, France
- B. D. Charlton
  - School of Biology and Environmental Science, Science Centre West, University College Dublin (UCD), Belfield, Dublin 4, Ireland
|
30
|
Frey R, Volodin IA, Fritsch G, Volodina EV. Potential Sources of High Frequency and Biphonic Vocalization in the Dhole (Cuon alpinus). PLoS One 2016; 11:e0146330. [PMID: 26730952 PMCID: PMC4701476 DOI: 10.1371/journal.pone.0146330] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Received: 04/23/2015] [Accepted: 12/16/2015] [Indexed: 11/19/2022] Open
Abstract
Biphonation, i.e. two independent fundamental frequencies in a call spectrum, is a prominent feature of vocal activity in dog-like canids. Dog-like canids can produce a low (f0) and a high (g0) fundamental frequency simultaneously. In contrast, fox-like canids are only capable of producing the low fundamental frequency (f0). Using a comparative anatomical approach for revealing macroscopic structures potentially responsible for canid biphonation, we investigated the vocal anatomy for 4 (1 male, 3 female) captive dholes (Cuon alpinus) and for 2 (1 male, 1 female) wild red fox (Vulpes vulpes). In addition, we analyzed the acoustic structure of vocalizations in the same dholes that served postmortem as specimens for the anatomical investigation. All study dholes produced both high-frequency and biphonic calls. The anatomical reconstructions revealed that the vocal morphologies of the dhole are very similar to those of the red fox. These results suggest that the high-frequency and biphonic calls in dog-like canids can be produced without specific anatomical adaptations of the sound-producing structures. We discuss possible production modes for the high-frequency and biphonic calls involving laryngeal and nasal structures.
Affiliation(s)
- Roland Frey
  - Leibniz Institute for Zoo and Wildlife Research (IZW), Berlin, Germany
- Ilya A. Volodin
  - Department of Vertebrate Zoology, Faculty of Biology, Lomonosov Moscow State University, Moscow, Russia
  - Scientific Research Department, Moscow Zoo, Moscow, Russia
- Guido Fritsch
  - Leibniz Institute for Zoo and Wildlife Research (IZW), Berlin, Germany
|
31
|
Vocal Production by Terrestrial Mammals: Source, Filter, and Function. Vertebrate Sound Production and Acoustic Communication 2016. [DOI: 10.1007/978-3-319-27721-9_8] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.6] [Indexed: 12/23/2022]
|
32
|
Vertebrate Bioacoustics: Prospects and Open Problems. Vertebrate Sound Production and Acoustic Communication 2016. [DOI: 10.1007/978-3-319-27721-9_10] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Indexed: 12/05/2022]
|
33
|
Koda H, Tokuda IT, Wakita M, Ito T, Nishimura T. The source-filter theory of whistle-like calls in marmosets: Acoustic analysis and simulation of helium-modulated voices. The Journal of the Acoustical Society of America 2015; 137:3068-3076. [PMID: 26093398 DOI: 10.1121/1.4921607] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Indexed: 06/04/2023]
Abstract
Whistle-like high-pitched "phee" calls are often used as long-distance vocal advertisements by small-bodied marmosets and tamarins in the dense forests of South America. While the source-filter theory proposes that vibration of the vocal fold is modified independently from the resonance of the supralaryngeal vocal tract (SVT) in human speech, a source-filter coupling that constrains the vibration frequency to SVT resonance effectively produces loud tonal sounds in some musical instruments. Here, a combined approach of acoustic analyses and simulation with helium-modulated voices was used to show that phee calls are produced principally with the same mechanism as in human speech. The animal keeps the fundamental frequency (f0) close to the first formant (F1) of the SVT, to amplify f0. Although f0 and F1 are primarily independent, the degree of their tuning can be strengthened further by a flexible source-filter interaction, the variable strength of which depends upon the cross-sectional area of the laryngeal cavity. The results highlight the evolutionary antiquity and universality of the source-filter model in primates, but the study can also explore the diversification of vocal physiology, including source-filter interaction and its anatomical basis in non-human primates.
Affiliation(s)
- Hiroki Koda
  - Primate Research Institute, Kyoto University, Inuyama, Aichi 484-8506, Japan
- Isao T Tokuda
  - Department of Mechanical Engineering, Ritsumeikan University, Kusatsu, Shiga 525-8577, Japan
- Masumi Wakita
  - Primate Research Institute, Kyoto University, Inuyama, Aichi 484-8506, Japan
- Tsuyoshi Ito
  - Department of Human Biology and Anatomy, Graduate School of Medicine, University of the Ryukyus, Nishihara, Okinawa 903-0215, Japan
- Takeshi Nishimura
  - Primate Research Institute, Kyoto University, Inuyama, Aichi 484-8506, Japan
|
34
|
Clarke E, Reichard UH, Zuberbühler K. Context-specific close-range "hoo" calls in wild gibbons (Hylobates lar). BMC Evol Biol 2015; 15:56. [PMID: 25888361 PMCID: PMC4389582 DOI: 10.1186/s12862-015-0332-2] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Received: 11/18/2014] [Accepted: 03/09/2015] [Indexed: 11/23/2022] Open
Abstract
Background: Close range calls are produced by many animals during intra-specific interactions, such as during home range defence, playing, begging for food, and directing others. In this study, we investigated the most common close range vocalisation of lar gibbons (Hylobates lar), the 'hoo' call. Gibbons and siamangs (family Hylobatidae) are known for their conspicuous and elaborate songs, while quieter, close range vocalisations have received almost no empirical attention, perhaps due to the difficult observation conditions in their natural forest habitats.
Results: We found that 'hoo' calls were emitted by both sexes in a variety of contexts, including feeding, separation from group members, encountering predators, interacting with neighbours, or as part of duet songs by the mated pair. Acoustic analyses revealed that 'hoo' calls varied in a number of spectral parameters as a function of the different contexts. Males' and females' 'hoo' calls showed similar variation in these context-specific parameter differences, although there were also consistent sex differences in frequency across contexts.
Conclusions: Our study provides evidence that lar gibbons are able to generate significant, context-dependent acoustic variation within their main social call, which potentially allows recipients to make inferences about the external events experienced by the caller. Communicating about different events by producing subtle acoustic variation within some call types appears to be a general feature of primate communication, which can increase the expressive power of vocal signals within the constraints of limited vocal tract flexibility that is typical for all non-human primates. In this sense, this study is of direct relevance for the on-going debate about the nature and origins of vocally-based referential communication and the evolution of human speech.
Affiliation(s)
- Esther Clarke
  - Evolutionary Anthropology Research Group, Dawson Building, Durham University, Durham, DH1 3LE, UK.
  - School of Psychology and Neuroscience, University of St Andrews, St Andrews, KY16 9JP, UK.
- Ulrich H Reichard
  - Department of Anthropology and Center for Ecology, University of Southern Illinois, Carbondale, USA.
- Klaus Zuberbühler
  - School of Psychology and Neuroscience, University of St Andrews, St Andrews, KY16 9JP, UK.
  - Cognitive Science Centre, University of Neuchâtel, Neuchâtel, Switzerland.
|
35
|
Poliva O. From where to what: a neuroanatomically based evolutionary model of the emergence of speech in humans. F1000Res 2015; 4:67. [PMID: 28928931 PMCID: PMC5600004 DOI: 10.12688/f1000research.6175.1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Accepted: 03/03/2015] [Indexed: 03/28/2024] Open
Abstract
In the brain of primates, the auditory cortex connects with the frontal lobe via the temporal pole (auditory ventral stream; AVS) and via the inferior parietal lobule (auditory dorsal stream; ADS). The AVS is responsible for sound recognition, and the ADS for sound-localization, voice detection and audio-visual integration. I propose that the primary role of the ADS in monkeys/apes is the perception and response to contact calls. These calls are exchanged between tribe members (e.g., mother-offspring) and are used for monitoring location. Perception of contact calls occurs by the ADS detecting a voice, localizing it, and verifying that the corresponding face is out of sight. The auditory cortex then projects to parieto-frontal visuospatial regions (visual dorsal stream) for searching the caller, and via a series of frontal lobe-brainstem connections, a contact call is produced in return. Because the human ADS processes also speech production and repetition, I further describe a course for the development of speech in humans. I propose that, due to duplication of a parietal region and its frontal projections, and strengthening of direct frontal-brainstem connections, the ADS converted auditory input directly to vocal regions in the frontal lobe, which endowed early Hominans with partial vocal control. This enabled offspring to modify their contact calls with intonations for signaling different distress levels to their mother. Vocal control could then enable question-answer conversations, by offspring emitting a low-level distress call for inquiring about the safety of objects, and mothers responding with high- or low-level distress calls. Gradually, the ADS and the direct frontal-brainstem connections became more robust and vocal control became more volitional. Eventually, individuals were capable of inventing new words and offspring were capable of inquiring about objects in their environment and learning their names via mimicry.
|
36
|
Poliva O. From where to what: a neuroanatomically based evolutionary model of the emergence of speech in humans. F1000Res 2015; 4:67. [PMID: 28928931 PMCID: PMC5600004 DOI: 10.12688/f1000research.6175.3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Accepted: 09/21/2017] [Indexed: 12/28/2022] Open
Abstract
In the brain of primates, the auditory cortex connects with the frontal lobe via the temporal pole (auditory ventral stream; AVS) and via the inferior parietal lobe (auditory dorsal stream; ADS). The AVS is responsible for sound recognition, and the ADS for sound-localization, voice detection and integration of calls with faces. I propose that the primary role of the ADS in non-human primates is the detection and response to contact calls. These calls are exchanged between tribe members (e.g., mother-offspring) and are used for monitoring location. Detection of contact calls occurs by the ADS identifying a voice, localizing it, and verifying that the corresponding face is out of sight. Once a contact call is detected, the primate produces a contact call in return via descending connections from the frontal lobe to a network of limbic and brainstem regions. Because the ADS of present day humans also performs speech production, I further propose an evolutionary course for the transition from contact call exchange to an early form of speech. In accordance with this model, structural changes to the ADS endowed early members of the genus Homo with partial vocal control. This development was beneficial as it enabled offspring to modify their contact calls with intonations for signaling high or low levels of distress to their mother. Eventually, individuals were capable of participating in yes-no question-answer conversations. In these conversations the offspring emitted a low-level distress call for inquiring about the safety of objects (e.g., food), and his/her mother responded with a high- or low-level distress call to signal approval or disapproval of the interaction. Gradually, the ADS and its connections with brainstem motor regions became more robust and vocal control became more volitional. Speech emerged once vocal control was sufficient for inventing novel calls.
|
37
|
Poliva O. From where to what: a neuroanatomically based evolutionary model of the emergence of speech in humans. F1000Res 2015; 4:67. [PMID: 28928931 PMCID: PMC5600004.2 DOI: 10.12688/f1000research.6175.2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Accepted: 01/12/2016] [Indexed: 03/28/2024] Open
Abstract
In the brain of primates, the auditory cortex connects with the frontal lobe via the temporal pole (auditory ventral stream; AVS) and via the inferior parietal lobe (auditory dorsal stream; ADS). The AVS is responsible for sound recognition, and the ADS for sound-localization, voice detection and integration of calls with faces. I propose that the primary role of the ADS in non-human primates is the detection and response to contact calls. These calls are exchanged between tribe members (e.g., mother-offspring) and are used for monitoring location. Detection of contact calls occurs by the ADS identifying a voice, localizing it, and verifying that the corresponding face is out of sight. Once a contact call is detected, the primate produces a contact call in return via descending connections from the frontal lobe to a network of limbic and brainstem regions. Because the ADS of present day humans also performs speech production, I further propose an evolutionary course for the transition from contact call exchange to an early form of speech. In accordance with this model, structural changes to the ADS endowed early members of the genus Homo with partial vocal control. This development was beneficial as it enabled offspring to modify their contact calls with intonations for signaling high or low levels of distress to their mother. Eventually, individuals were capable of participating in yes-no question-answer conversations. In these conversations the offspring emitted a low-level distress call for inquiring about the safety of objects (e.g., food), and his/her mother responded with a high- or low-level distress call to signal approval or disapproval of the interaction. Gradually, the ADS and its connections with brainstem motor regions became more robust and vocal control became more volitional. Speech emerged once vocal control was sufficient for inventing novel calls.
|
38
|
Affiliation(s)
- Michael J O'Neill
- Institute for Systems Genomics and Department of Molecular and Cell Biology, University of Connecticut, Storrs, Connecticut 06269, USA
- Rachel J O'Neill
- Institute for Systems Genomics and Department of Molecular and Cell Biology, University of Connecticut, Storrs, Connecticut 06269, USA
|
39
|
Krubitzer L, Stolzenberg DS. The evolutionary masquerade: genetic and epigenetic contributions to the neocortex. Curr Opin Neurobiol 2014; 24:157-65. [DOI: 10.1016/j.conb.2013.11.010] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.3] [Received: 08/09/2013] [Revised: 10/30/2013] [Accepted: 11/22/2013] [Indexed: 01/25/2023]
|
40
|
Barelli C, Mundry R, Heistermann M, Hammerschmidt K. Cues to androgens and quality in male gibbon songs. PLoS One 2013; 8:e82748. [PMID: 24367551 PMCID: PMC3867390 DOI: 10.1371/journal.pone.0082748] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.5] [Received: 09/12/2013] [Accepted: 10/28/2013] [Indexed: 11/19/2022] Open
Abstract
Animal vocal signals may provide information about senders and mediate important social interactions like sexual competition, territory maintenance and mate selection. Hence, it is important to understand whether vocal signals provide accurate information about animal attributes or status. Gibbons are non-human primates that produce loud, distinctive and melodious vocalizations resembling more those of birds than of other non-human primates. Wild gibbons are characterized by flexibility in social organization (i.e., pairs and multimale units) as well as in mating system (i.e., monogamy and polyandry). Such features make them a suitable model to investigate whether the physiology (hormonal status) and socio-demographic features find their correspondence in the structure of their songs. By combining male solo song recordings, endocrine outputs using non-invasive fecal androgen measures and behavioral observations, we studied 14 groups (10 pair-living, 4 multimale) of wild white-handed gibbons (Hylobates lar) residing at Khao Yai National Park, Thailand. We collected a total of 322 fecal samples and recorded 48 songs from 18 adult animals. Our results confirmed inter-individuality in male gibbon songs, and showed a clear correlation between androgen levels and song structures. Gibbons with higher androgen levels produced calls having higher pitch, and similarly adult individuals produced longer calls than senior males. Thus, it is plausible that gibbon vocalizations provide receivers with information about singers' attributes.
Affiliation(s)
- Claudia Barelli
- Sezione di Biodiversità Tropicale, MUSE - Museo delle Scienze, Trento, Italy
- Reproductive Biology Unit, German Primate Center, Göttingen, Germany
- * E-mail: (CB); (KH)
- Roger Mundry
- Department of Primatology and Department of Developmental and Comparative Psychology, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
- Kurt Hammerschmidt
- Cognitive Ethology Laboratory, German Primate Center, Göttingen, Germany
- * E-mail: (CB); (KH)
|
41
|
Koda H, Lemasson A, Oyakawa C, Rizaldi, Pamungkas J, Masataka N. Possible role of mother-daughter vocal interactions on the development of species-specific song in gibbons. PLoS One 2013; 8:e71432. [PMID: 23951160 PMCID: PMC3741147 DOI: 10.1371/journal.pone.0071432] [Citation(s) in RCA: 67] [Impact Index Per Article: 6.1] [Received: 01/07/2013] [Accepted: 06/30/2013] [Indexed: 11/18/2022] Open
Abstract
Mother-infant vocal interactions play a crucial role in the development of human language. However, comparatively little is known about the maternal role during vocal development in nonhuman primates. Here, we report the first evidence of mother-daughter vocal interactions contributing to vocal development in gibbons, a singing and monogamous ape species. Gibbons are well known for their species-specific duets sung between mates, yet little is known about the role of intergenerational duets in gibbon song development. We observed singing interactions between free-ranging mothers and their sub-adult daughters prior to emigration. Daughters sang simultaneously with their mothers at different rates. First, we observed significant acoustic variation between daughters. Co-singing rates between mother and daughter were negatively correlated with the temporal precision of the song's synchronization. In addition, songs of daughters who co-sang less with their mothers were acoustically more similar to the maternal song than any other adult female's song. All variables have been reported to be influenced by social relationships of pairs. Therefore those correlations would be mediated by mother-daughter social relationship, which would be modifiable in daughter's development. Here we hypothesized that daughters who co-sing less often, well-synchronize, and converge acoustically with the maternal acoustic pattern would be at a more advanced stage of social independence in sub-adult females prior to emigration. Second, we observed acoustic matching between mothers and daughters when co-singing, suggesting short-term vocal flexibility. Third, we found that mothers adjusted songs to a more stereotyped pattern when co-singing than when singing alone. This vocal adjustment was stronger for mothers with daughters who co-sang less. 
These results indicate the presence of socially mediated vocal flexibility in gibbon sub-adults and adults, and that mother-daughter co-singing interactions may enhance vocal development. More comparative work, notably longitudinal and experimental, is now needed to clarify maternal roles during song development.
Affiliation(s)
- Hiroki Koda
- Primate Research Institute, Kyoto University, Inuyama, Aichi, Japan.
|
42
|
Helium reveals gibbon's soprano skill. Nature 2012. [DOI: 10.1038/nature.2012.11257] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/08/2022]
|