1
|
Costalunga G, Vallentin D, Benichov JI. A neuroethological view of the multifaceted sensory influences on birdsong. Curr Opin Neurobiol 2024; 86:102867. [PMID: 38520789 DOI: 10.1016/j.conb.2024.102867] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2023] [Revised: 02/13/2024] [Accepted: 03/07/2024] [Indexed: 03/25/2024]
Abstract
Learning and execution of complex motor skills are often modulated by sensory feedback and contextual cues arriving across multiple sensory modalities. Vocal motor behaviors, in particular, are primarily influenced by auditory inputs, both during learning and mature vocal production. The importance of auditory input in shaping vocal output has been investigated in several songbird species that acquire their adult song based on auditory exposure to a tutor during development. Recent studies have highlighted the influences of stimuli arriving through other sensory channels in juvenile song learning and in adult song production. Here, we review changes induced by diverse sensory stimuli during the song learning process and the production of adult song, considering the neuroethological significance of sensory channels in different species of songbirds. Additionally, we highlight advances, open questions, and possible future approaches for understanding the neural circuits that enable the multimodal shaping of singing behavior.
Collapse
Affiliation(s)
- Giacomo Costalunga
- Max Planck Institute for Biological Intelligence, 82319, Seewiesen, Germany
| | - Daniela Vallentin
- Max Planck Institute for Biological Intelligence, 82319, Seewiesen, Germany.
| | | |
Collapse
|
2
|
Kawaji T, Fujibayashi M, Abe K. Goal-directed and flexible modulation of syllable sequence within birdsong. Nat Commun 2024; 15:3419. [PMID: 38658545 PMCID: PMC11043396 DOI: 10.1038/s41467-024-47824-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2023] [Accepted: 04/09/2024] [Indexed: 04/26/2024] Open
Abstract
Songs constitute a complex system of vocal signals for inter-individual communication in songbirds. Here, we elucidate the flexibility which songbirds exhibit in the organizing and sequencing of syllables within their songs. Utilizing a newly devised song decoder for quasi-real-time annotation, we execute an operant conditioning paradigm, with rewards contingent upon specific syllable syntax. Our analysis reveals that birds possess the capacity to modify the contents of their songs, adjust the repetition length of particular syllables and employing specific motifs. Notably, birds altered their syllable sequence in a goal-directed manner to obtain rewards. We demonstrate that such modulation occurs within a distinct song segment, with adjustments made within 10 minutes after cue presentation. Additionally, we identify the involvement of the parietal-basal ganglia pathway in orchestrating these flexible modulations of syllable sequences. Our findings unveil an unappreciated aspect of songbird communication, drawing parallels with human speech.
Collapse
Affiliation(s)
- Takuto Kawaji
- Lab of Brain Development, Graduate School of Life Sciences, Tohoku University, Katahira 2-1-1, Sendai, Miyagi, 980-8577, Japan
| | - Mizuki Fujibayashi
- Lab of Brain Development, Graduate School of Life Sciences, Tohoku University, Katahira 2-1-1, Sendai, Miyagi, 980-8577, Japan
| | - Kentaro Abe
- Lab of Brain Development, Graduate School of Life Sciences, Tohoku University, Katahira 2-1-1, Sendai, Miyagi, 980-8577, Japan.
- Division for the Establishment of Frontier Sciences of the Organization for Advanced Studies, Tohoku University, Sendai, Miyagi, 980-8577, Japan.
| |
Collapse
|
3
|
Koparkar A, Warren TL, Charlesworth JD, Shin S, Brainard MS, Veit L. Lesions in a songbird vocal circuit increase variability in song syntax. eLife 2024; 13:RP93272. [PMID: 38635312 PMCID: PMC11026095 DOI: 10.7554/elife.93272] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/19/2024] Open
Abstract
Complex skills like speech and dance are composed of ordered sequences of simpler elements, but the neuronal basis for the syntactic ordering of actions is poorly understood. Birdsong is a learned vocal behavior composed of syntactically ordered syllables, controlled in part by the songbird premotor nucleus HVC (proper name). Here, we test whether one of HVC's recurrent inputs, mMAN (medial magnocellular nucleus of the anterior nidopallium), contributes to sequencing in adult male Bengalese finches (Lonchura striata domestica). Bengalese finch song includes several patterns: (1) chunks, comprising stereotyped syllable sequences; (2) branch points, where a given syllable can be followed probabilistically by multiple syllables; and (3) repeat phrases, where individual syllables are repeated variable numbers of times. We found that following bilateral lesions of mMAN, acoustic structure of syllables remained largely intact, but sequencing became more variable, as evidenced by 'breaks' in previously stereotyped chunks, increased uncertainty at branch points, and increased variability in repeat numbers. Our results show that mMAN contributes to the variable sequencing of vocal elements in Bengalese finch song and demonstrate the influence of recurrent projections to HVC. Furthermore, they highlight the utility of species with complex syntax in investigating neuronal control of ordered sequences.
Collapse
Affiliation(s)
- Avani Koparkar
- Neurobiology of Vocal Communication, Institute for Neurobiology, University of TübingenTübingenGermany
| | - Timothy L Warren
- Howard Hughes Medical Institute and Center for Integrative Neuroscience, University of California San FranciscoSan FranciscoUnited States
- Departments of Horticulture and Integrative Biology, Oregon State UniversityCorvallisUnited States
| | - Jonathan D Charlesworth
- Howard Hughes Medical Institute and Center for Integrative Neuroscience, University of California San FranciscoSan FranciscoUnited States
| | - Sooyoon Shin
- Howard Hughes Medical Institute and Center for Integrative Neuroscience, University of California San FranciscoSan FranciscoUnited States
| | - Michael S Brainard
- Howard Hughes Medical Institute and Center for Integrative Neuroscience, University of California San FranciscoSan FranciscoUnited States
| | - Lena Veit
- Neurobiology of Vocal Communication, Institute for Neurobiology, University of TübingenTübingenGermany
| |
Collapse
|
4
|
Montgomery JC. Roles for cerebellum and subsumption architecture in central pattern generation. J Comp Physiol A Neuroethol Sens Neural Behav Physiol 2024; 210:315-324. [PMID: 37130955 PMCID: PMC10994996 DOI: 10.1007/s00359-023-01634-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2022] [Revised: 04/13/2023] [Accepted: 04/14/2023] [Indexed: 05/04/2023]
Abstract
Within vertebrates, central pattern generators drive rhythmical behaviours, such as locomotion and ventilation. Their pattern generation is also influenced by sensory input and various forms of neuromodulation. These capabilities arose early in vertebrate evolution, preceding the evolution of the cerebellum in jawed vertebrates. This later evolution of the cerebellum is suggestive of subsumption architecture that adds functionality to a pre-existing network. From a central-pattern-generator perspective, what additional functionality might the cerebellum provide? The suggestion is that the adaptive filter capabilities of the cerebellum may be able to use error learning to appropriately repurpose pattern output. Examples may include head and eye stabilization during locomotion, song learning, and context-dependent alternation between learnt motor-control sequences.
Collapse
Affiliation(s)
- John C Montgomery
- Institute of Marine Science, University of Auckland, Auckland, New Zealand.
| |
Collapse
|
5
|
Roemschied FA, Pacheco DA, Aragon MJ, Ireland EC, Li X, Thieringer K, Pang R, Murthy M. Flexible circuit mechanisms for context-dependent song sequencing. Nature 2023; 622:794-801. [PMID: 37821705 PMCID: PMC10600009 DOI: 10.1038/s41586-023-06632-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2021] [Accepted: 09/11/2023] [Indexed: 10/13/2023]
Abstract
Sequenced behaviours, including locomotion, reaching and vocalization, are patterned differently in different contexts, enabling animals to adjust to their environments. How contextual information shapes neural activity to flexibly alter the patterning of actions is not fully understood. Previous work has indicated that this could be achieved via parallel motor circuits, with differing sensitivities to context1,2. Here we demonstrate that a single pathway operates in two regimes dependent on recent sensory history. We leverage the Drosophila song production system3 to investigate the role of several neuron types4-7 in song patterning near versus far from the female fly. Male flies sing 'simple' trains of only one mode far from the female fly but complex song sequences comprising alternations between modes when near her. We find that ventral nerve cord (VNC) circuits are shaped by mutual inhibition and rebound excitability8 between nodes driving the two song modes. Brief sensory input to a direct brain-to-VNC excitatory pathway drives simple song far from the female, whereas prolonged input enables complex song production via simultaneous recruitment of functional disinhibition of VNC circuitry. Thus, female proximity unlocks motor circuit dynamics in the correct context. We construct a compact circuit model to demonstrate that the identified mechanisms suffice to replicate natural song dynamics. These results highlight how canonical circuit motifs8,9 can be combined to enable circuit flexibility required for dynamic communication.
Collapse
Affiliation(s)
- Frederic A Roemschied
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ, USA
- European Neuroscience Institute, Göttingen, Germany
| | - Diego A Pacheco
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ, USA
- Harvard Medical School, Boston, MA, USA
| | - Max J Aragon
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ, USA
| | - Elise C Ireland
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ, USA
| | - Xinping Li
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ, USA
| | - Kyle Thieringer
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ, USA
| | - Rich Pang
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ, USA
| | - Mala Murthy
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ, USA.
| |
Collapse
|
6
|
Goffman L, Gerken L. A developmental account of the role of sequential dependencies in typical and atypical language learners. Cogn Neuropsychol 2023; 40:243-264. [PMID: 37963089 PMCID: PMC10939949 DOI: 10.1080/02643294.2023.2275837] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2023] [Accepted: 10/23/2023] [Indexed: 11/16/2023]
Abstract
The Gerken lab has shown that infants are able to learn sound patterns that obligate local sequential dependencies that are no longer readily accessible to adults. The Goffman lab has shown that children with developmental language disorder (DLD) exhibit deficits in learning sequential dependencies that influence the acquisition of words and grammar, as well as other types of domain general sequences. Thus, DLD appears to be an impaired ability to detect and deploy sequential dependencies over multiple domains. We meld these two lines of research to propose a novel account in which sequential dependency learning is required for many phonological and morphosyntactic patterns in natural language and is also central to the language and domain general deficits that are attested in DLD. However, patterns that are not dependent on sequential dependencies but rather on networks of stored forms are learnable by children with DLD as well as by adults.
Collapse
Affiliation(s)
- Lisa Goffman
- Callier Center, Speech, Language, & Hearing, University of Texas at Dallas, Richardson, USA
| | - LouAnn Gerken
- Psychology & Cognitive Science, University of Arizona, Tucson, USA
| |
Collapse
|
7
|
Brecht KF, Westendorff S, Nieder A. Neural correlates of cognitively controlled vocalizations in a corvid songbird. Cell Rep 2023; 42:112113. [PMID: 36821443 DOI: 10.1016/j.celrep.2023.112113] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2022] [Revised: 01/13/2023] [Accepted: 01/28/2023] [Indexed: 02/24/2023] Open
Abstract
The neuronal basis of the songbird's song system is well understood. However, little is known about the neuronal correlates of the executive control of songbird vocalizations. Here, we record single-unit activity from the pallial endbrain region "nidopallium caudolaterale" (NCL) of crows that vocalize to the presentation of a visual go-cue but refrain from vocalizing during trials without a go-cue. We find that the preparatory activity of single vocalization-correlated neurons, but also of the entire population of NCL neurons, before vocal onset predicts whether or not the crows will produce an instructed vocalization. Fluctuations in baseline neuronal activity prior to the go-cue influence the premotor activity of such vocalization-correlated neurons and seemingly bias the crows' decision to vocalize. Neuronal response modulation significantly differs between volitional and task-unrelated vocalizations. This suggests that the NCL can take control over the vocal motor network during the production of volitional vocalizations in a corvid songbird.
Collapse
Affiliation(s)
- Katharina F Brecht
- Animal Physiology Unit, Institute of Neurobiology, University of Tübingen, 72076 Tübingen, Germany
| | - Stephanie Westendorff
- Animal Physiology Unit, Institute of Neurobiology, University of Tübingen, 72076 Tübingen, Germany
| | - Andreas Nieder
- Animal Physiology Unit, Institute of Neurobiology, University of Tübingen, 72076 Tübingen, Germany.
| |
Collapse
|
8
|
Zhang Y, Zhou L, Zuo J, Wang S, Meng W. Analogies of human speech and bird song: From vocal learning behavior to its neural basis. Front Psychol 2023; 14:1100969. [PMID: 36910811 PMCID: PMC9992734 DOI: 10.3389/fpsyg.2023.1100969] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2022] [Accepted: 02/06/2023] [Indexed: 02/24/2023] Open
Abstract
Vocal learning is a complex acquired social behavior that has been found only in very few animals. The process of animal vocal learning requires the participation of sensorimotor function. By accepting external auditory input and cooperating with repeated vocal imitation practice, a stable pattern of vocal information output is eventually formed. In parallel evolutionary branches, humans and songbirds share striking similarities in vocal learning behavior. For example, their vocal learning processes involve auditory feedback, complex syntactic structures, and sensitive periods. At the same time, they have evolved the hierarchical structure of special forebrain regions related to vocal motor control and vocal learning, which are organized and closely associated to the auditory cortex. By comparing the location, function, genome, and transcriptome of vocal learning-related brain regions, it was confirmed that songbird singing and human language-related neural control pathways have certain analogy. These common characteristics make songbirds an ideal animal model for studying the neural mechanisms of vocal learning behavior. The neural process of human language learning may be explained through similar neural mechanisms, and it can provide important insights for the treatment of language disorders.
Collapse
Affiliation(s)
- Yutao Zhang
- Jiangxi Key Laboratory of Organic Chemistry, Jiangxi Science and Technology Normal University, Nanchang, China
| | - Lifang Zhou
- Jiangxi Key Laboratory of Organic Chemistry, Jiangxi Science and Technology Normal University, Nanchang, China
| | - Jiachun Zuo
- Jiangxi Key Laboratory of Organic Chemistry, Jiangxi Science and Technology Normal University, Nanchang, China
| | - Songhua Wang
- Jiangxi Key Laboratory of Organic Chemistry, Jiangxi Science and Technology Normal University, Nanchang, China
| | - Wei Meng
- Jiangxi Key Laboratory of Organic Chemistry, Jiangxi Science and Technology Normal University, Nanchang, China
| |
Collapse
|
9
|
Desmet DM, Cusumano JP, Dingwell JB. Adaptive multi-objective control explains how humans make lateral maneuvers while walking. PLoS Comput Biol 2022; 18:e1010035. [PMID: 36374914 PMCID: PMC9704766 DOI: 10.1371/journal.pcbi.1010035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2022] [Revised: 11/28/2022] [Accepted: 10/26/2022] [Indexed: 11/15/2022] Open
Abstract
To successfully traverse their environment, humans often perform maneuvers to achieve desired task goals while simultaneously maintaining balance. Humans accomplish these tasks primarily by modulating their foot placements. As humans are more unstable laterally, we must better understand how humans modulate lateral foot placement. We previously developed a theoretical framework and corresponding computational models to describe how humans regulate lateral stepping during straight-ahead continuous walking. We identified goal functions for step width and lateral body position that define the walking task and determine the set of all possible task solutions as Goal Equivalent Manifolds (GEMs). Here, we used this framework to determine if humans can regulate lateral stepping during non-steady-state lateral maneuvers by minimizing errors consistent with these goal functions. Twenty young healthy adults each performed four lateral lane-change maneuvers in a virtual reality environment. Extending our general lateral stepping regulation framework, we first re-examined the requirements of such transient walking tasks. Doing so yielded new theoretical predictions regarding how steps during any such maneuver should be regulated to minimize error costs, consistent with the goals required at each step and with how these costs are adapted at each step during the maneuver. Humans performed the experimental lateral maneuvers in a manner consistent with our theoretical predictions. Furthermore, their stepping behavior was well modeled by allowing the parameters of our previous lateral stepping models to adapt from step to step. To our knowledge, our results are the first to demonstrate humans might use evolving cost landscapes in real time to perform such an adaptive motor task and, furthermore, that such adaptation can occur quickly-over only one step. Thus, the predictive capabilities of our general stepping regulation framework extend to a much greater range of walking tasks beyond just normal, straight-ahead walking.
Collapse
Affiliation(s)
- David M. Desmet
- Department of Kinesiology, Pennsylvania State University, University Park, Pennsylvania, United States of America
| | - Joseph P. Cusumano
- Department of Engineering Science & Mechanics, Pennsylvania State University, University Park, Pennsylvania, United States of America
| | - Jonathan B. Dingwell
- Department of Kinesiology, Pennsylvania State University, University Park, Pennsylvania, United States of America
| |
Collapse
|
10
|
McGregor JN, Grassler AL, Jaffe PI, Jacob AL, Brainard MS, Sober SJ. Shared mechanisms of auditory and non-auditory vocal learning in the songbird brain. eLife 2022; 11:75691. [PMID: 36107757 PMCID: PMC9522248 DOI: 10.7554/elife.75691] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2021] [Accepted: 09/14/2022] [Indexed: 01/18/2023] Open
Abstract
Songbirds and humans share the ability to adaptively modify their vocalizations based on sensory feedback. Prior studies have focused primarily on the role that auditory feedback plays in shaping vocal output throughout life. In contrast, it is unclear how non-auditory information drives vocal plasticity. Here, we first used a reinforcement learning paradigm to establish that somatosensory feedback (cutaneous electrical stimulation) can drive vocal learning in adult songbirds. We then assessed the role of a songbird basal ganglia thalamocortical pathway critical to auditory vocal learning in this novel form of vocal plasticity. We found that both this circuit and its dopaminergic inputs are necessary for non-auditory vocal learning, demonstrating that this pathway is critical for guiding adaptive vocal changes based on both auditory and somatosensory signals. The ability of this circuit to use both auditory and somatosensory information to guide vocal learning may reflect a general principle for the neural systems that support vocal plasticity across species.
Collapse
Affiliation(s)
- James N McGregor
- Neuroscience Graduate Program, Graduate Division of Biological and Biomedical Sciences, Laney Graduate School, Emory University, Atlanta, United States
| | | | - Paul I Jaffe
- Center for Integrative Neuroscience, University of California, San Francisco, San Francisco, United States
| | | | - Michael S Brainard
- Center for Integrative Neuroscience, University of California, San Francisco, San Francisco, United States.,Howard Hughes Medical Institute, University of California, San Francisco, San Francisco, United States
| | - Samuel J Sober
- Department of Biology, Emory University, Atlanta, United States
| |
Collapse
|
11
|
Cognitive control of song production by humpback whales. Anim Cogn 2022; 25:1133-1149. [PMID: 36058997 DOI: 10.1007/s10071-022-01675-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2022] [Revised: 08/04/2022] [Accepted: 08/12/2022] [Indexed: 11/01/2022]
Abstract
Singing humpback whales are highly versatile vocalizers, producing complex sequences of sounds that they vary throughout adulthood. Past analyses of humpback whale song have emphasized yearly variations in structural features of songs made collectively by singers within a population with comparatively little attention given to the ways that individual singers vary consecutive songs. As a result, many researchers describe singing by humpback whales as a process in which singers produce sequences of repeating sound patterns. Here, we show that such characterizations misrepresent the degree to which humpback whales flexibly and dynamically control the production of sounds and sound patterns within song sessions. Singers recorded off the coast of Hawaii continuously morphed units along multiple acoustic dimensions, with the degree and direction of morphing varying across parallel streams of successive units. Individual singers also produced multiple phrase variants (structurally similar, but acoustically distinctive sequences) within song sessions. The precision with which individual singers maintained some acoustic properties of phrases and morphing trajectories while flexibly changing others suggests that singing humpback whales actively select and adjust acoustic elements of their songs in real time rather than simply repeating stereotyped sound patterns within song sessions.
Collapse
|
12
|
Kim Y, Mori C, Kojima S. Effect of Darkness on Intrinsic Motivation for Undirected Singing in Bengalese Finch (Lonchura striata Domestica): A Comparative Study With Zebra Finch (Taeniopygia guttata). Front Physiol 2022; 13:884404. [PMID: 35694395 PMCID: PMC9174599 DOI: 10.3389/fphys.2022.884404] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2022] [Accepted: 05/12/2022] [Indexed: 11/13/2022] Open
Abstract
The zebra finch (ZF) and the Bengalese finch (BF) are animal models that have been commonly used for neurobiological studies on vocal learning. Although they largely share the brain structure for vocal learning and production, BFs produce more complex and variable songs than ZFs, providing a great opportunity for comparative studies to understand how animals learn and control complex motor behaviors. Here, we performed a comparative study between the two species by focusing on intrinsic motivation for non-courtship singing (“undirected singing”), which is critical for the development and maintenance of song structure. A previous study has demonstrated that ZFs dramatically increase intrinsic motivation for undirected singing when singing is temporarily suppressed by a dark environment. We found that the same procedure in BFs induced the enhancement of intrinsic singing motivation to much smaller degrees than that in ZFs. Moreover, unlike ZFs that rarely sing in dark conditions, substantial portion of BFs exhibited frequent singing in darkness, implying that such “dark singing” may attenuate the enhancement of intrinsic singing motivation during dark periods. In addition, measurements of blood corticosterone levels in dark and light conditions provided evidence that although BFs have lower stress levels than ZFs in dark conditions, such lower stress levels in BFs are not the major factor responsible for their frequent dark singing. Our findings highlight behavioral and physiological differences in spontaneous singing behaviors of BFs and ZFs and provide new insights into the interactions between singing motivation, ambient light, and environmental stress.
Collapse
Affiliation(s)
- Yunbok Kim
- Sensory and Motor Systems Research Group, Korea Brain Research Institute, Daegu, South Korea
| | - Chihiro Mori
- Department of Molecular Biology, Faculty of Pharmaceutical Sciences, Teikyo University, Tokyo, Japan
| | - Satoshi Kojima
- Sensory and Motor Systems Research Group, Korea Brain Research Institute, Daegu, South Korea
- *Correspondence: Satoshi Kojima,
| |
Collapse
|
13
|
Cohen Y, Nicholson DA, Sanchioni A, Mallaber EK, Skidanova V, Gardner TJ. Automated annotation of birdsong with a neural network that segments spectrograms. eLife 2022; 11:63853. [PMID: 35050849 PMCID: PMC8860439 DOI: 10.7554/elife.63853] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2020] [Accepted: 01/19/2022] [Indexed: 11/13/2022] Open
Abstract
Songbirds provide a powerful model system for studying sensory-motor learning. However, many analyses of birdsong require time-consuming, manual annotation of its elements, called syllables. Automated methods for annotation have been proposed, but these methods assume that audio can be cleanly segmented into syllables, or they require carefully tuning multiple statistical models. Here we present TweetyNet: a single neural network model that learns how to segment spectrograms of birdsong into annotated syllables. We show that TweetyNet mitigates limitations of methods that rely on segmented audio. We also show that TweetyNet performs well across multiple individuals from two species of songbirds, Bengalese finches and canaries. Lastly, we demonstrate that using TweetyNet we can accurately annotate very large datasets containing multiple days of song, and that these predicted annotations replicate key findings from behavioral studies. In addition, we provide open-source software to assist other researchers, and a large dataset of annotated canary song that can serve as a benchmark. We conclude that TweetyNet makes it possible to address a wide range of new questions about birdsong.
Collapse
Affiliation(s)
- Yarden Cohen
- Department of Brain Sciences, Weizmann Institute of Science, Rehovot, Israel
| | | | - Alexa Sanchioni
- Department of Biology, Boston University, Boston, United States
| | | | | | - Timothy J Gardner
- Phil and Penny Knight Campus for Accelerating Scientific Impact, University of Oregon, Eugene, United States
| |
Collapse
|