1
|
Burk DC, Taswell C, Tang H, Averbeck BB. Computational Mechanisms Underlying Motivation to Earn Symbolic Reinforcers. J Neurosci 2024; 44:e1873232024. [PMID: 38670805 PMCID: PMC11170943 DOI: 10.1523/jneurosci.1873-23.2024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2023] [Revised: 02/27/2024] [Accepted: 04/11/2024] [Indexed: 04/28/2024] Open
Abstract
Reinforcement learning is a theoretical framework that describes how agents learn to select options that maximize rewards and minimize punishments over time. We often make choices, however, to obtain symbolic reinforcers (e.g., money, points) that are later exchanged for primary reinforcers (e.g., food, drink). Although symbolic reinforcers are ubiquitous in our daily lives, widely used in laboratory tasks because they can be motivating, mechanisms by which they become motivating are less understood. In the present study, we examined how monkeys learn to make choices that maximize fluid rewards through reinforcement with tokens. The question addressed here is how the value of a state, which is a function of multiple task features (e.g., the current number of accumulated tokens, choice options, task epoch, trials since the last delivery of primary reinforcer, etc.), drives value and affects motivation. We constructed a Markov decision process model that computes the value of task states given task features to then correlate with the motivational state of the animal. Fixation times, choice reaction times, and abort frequency were all significantly related to values of task states during the tokens task (n = 5 monkeys, three males and two females). Furthermore, the model makes predictions for how neural responses could change on a moment-by-moment basis relative to changes in the state value. Together, this task and model allow us to capture learning and behavior related to symbolic reinforcement.
Collapse
Affiliation(s)
- Diana C Burk
- Laboratory of Neuropsychology, National Institute of Mental Health, National Institutes of Health, Bethesda, Maryland 20892-4415
| | - Craig Taswell
- Laboratory of Neuropsychology, National Institute of Mental Health, National Institutes of Health, Bethesda, Maryland 20892-4415
| | - Hua Tang
- Laboratory of Neuropsychology, National Institute of Mental Health, National Institutes of Health, Bethesda, Maryland 20892-4415
| | - Bruno B Averbeck
- Laboratory of Neuropsychology, National Institute of Mental Health, National Institutes of Health, Bethesda, Maryland 20892-4415
| |
Collapse
|
2
|
Burk DC, Taswell C, Tang H, Averbeck BB. Computational mechanisms underlying motivation to earn symbolic reinforcers. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.10.11.561900. [PMID: 37873311 PMCID: PMC10592730 DOI: 10.1101/2023.10.11.561900] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/25/2023]
Abstract
Reinforcement learning (RL) is a theoretical framework that describes how agents learn to select options that maximize rewards and minimize punishments over time. We often make choices, however, to obtain symbolic reinforcers (e.g. money, points) that can later be exchanged for primary reinforcers (e.g. food, drink). Although symbolic reinforcers are motivating, little is understood about the neural or computational mechanisms underlying the motivation to earn them. In the present study, we examined how monkeys learn to make choices that maximize fluid rewards through reinforcement with tokens. The question addressed here is how the value of a state, which is a function of multiple task features (e.g. current number of accumulated tokens, choice options, task epoch, trials since last delivery of primary reinforcer, etc.), drives value and affects motivation. We constructed a Markov decision process model that computes the value of task states given task features to capture the motivational state of the animal. Fixation times, choice reaction times, and abort frequency were all significantly related to values of task states during the tokens task (n=5 monkeys). Furthermore, the model makes predictions for how neural responses could change on a moment-by-moment basis relative to changes in state value. Together, this task and model allow us to capture learning and behavior related to symbolic reinforcement.
Collapse
Affiliation(s)
- Diana C. Burk
- Laboratory of Neuropsychology, National Institute of Mental Health, National Institutes of Health, Bethesda MD, 20892-4415
| | - Craig Taswell
- Laboratory of Neuropsychology, National Institute of Mental Health, National Institutes of Health, Bethesda MD, 20892-4415
| | - Hua Tang
- Laboratory of Neuropsychology, National Institute of Mental Health, National Institutes of Health, Bethesda MD, 20892-4415
| | - Bruno B. Averbeck
- Laboratory of Neuropsychology, National Institute of Mental Health, National Institutes of Health, Bethesda MD, 20892-4415
| |
Collapse
|
3
|
Reznikova Z. Information Theory Opens New Dimensions in Experimental Studies of Animal Behaviour and Communication. Animals (Basel) 2023; 13:ani13071174. [PMID: 37048430 PMCID: PMC10093743 DOI: 10.3390/ani13071174] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2023] [Revised: 03/15/2023] [Accepted: 03/24/2023] [Indexed: 03/29/2023] Open
Abstract
Over the last 40–50 years, ethology has become increasingly quantitative and computational. However, when analysing animal behavioural sequences, researchers often need help finding an adequate model to assess certain characteristics of these sequences while using a relatively small number of parameters. In this review, I demonstrate that the information theory approaches based on Shannon entropy and Kolmogorov complexity can furnish effective tools to analyse and compare animal natural behaviours. In addition to a comparative analysis of stereotypic behavioural sequences, information theory can provide ideas for particular experiments on sophisticated animal communications. In particular, it has made it possible to discover the existence of a developed symbolic “language” in leader-scouting ant species based on the ability of these ants to transfer abstract information about remote events.
Collapse
|
4
|
Horn L, Zewald JS, Bugnyar T, Massen JJM. Carrion Crows and Azure-Winged Magpies Show No Prosocial Tendencies When Tested in a Token Transfer Paradigm. Animals (Basel) 2021; 11:1526. [PMID: 34073851 PMCID: PMC8225188 DOI: 10.3390/ani11061526] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2021] [Revised: 05/12/2021] [Accepted: 05/16/2021] [Indexed: 11/17/2022] Open
Abstract
To study the evolution of humans' cooperative nature, researchers have recently sought comparisons with other species. Studies investigating corvids, for example, showed that carrion crows and azure-winged magpies delivered food to group members when tested in naturalistic or simple experimental paradigms. Here, we investigated whether we could replicate these positive findings when testing the same two species in a token transfer paradigm. After training the birds to exchange tokens with an experimenter for food rewards, we tested whether they would also transfer tokens to other birds, when they did not have the opportunity to exchange the tokens themselves. To control for the effects of motivation, and of social or stimulus enhancement, we tested each individual in three additional control conditions. We witnessed very few attempts and/or successful token transfers, and those few instances did not occur more frequently in the test condition than in the controls, which would suggest that the birds lack prosocial tendencies. Alternatively, we propose that this absence of prosociality may stem from the artificial nature and cognitive complexity of the token transfer task. Consequently, our findings highlight the strong impact of methodology on animals' capability to exhibit prosocial tendencies and stress the importance of comparing multiple experimental paradigms.
Collapse
Affiliation(s)
- Lisa Horn
- Department of Behavioral and Cognitive Biology, University of Vienna, Althanstraße 14, 1090 Vienna, Austria; (T.B.); (J.J.M.M.)
| | - Jeroen S. Zewald
- Animal Behavior and Cognition, Department of Biology, Utrecht University, Padualaan 8, 3584 CH Utrecht, The Netherlands;
| | - Thomas Bugnyar
- Department of Behavioral and Cognitive Biology, University of Vienna, Althanstraße 14, 1090 Vienna, Austria; (T.B.); (J.J.M.M.)
| | - Jorg J. M. Massen
- Department of Behavioral and Cognitive Biology, University of Vienna, Althanstraße 14, 1090 Vienna, Austria; (T.B.); (J.J.M.M.)
- Animal Behavior and Cognition, Department of Biology, Utrecht University, Padualaan 8, 3584 CH Utrecht, The Netherlands;
| |
Collapse
|
5
|
Bourgeois-Gironde S, Addessi E, Boraud T. Economic behaviours among non-human primates. Philos Trans R Soc Lond B Biol Sci 2021; 376:20190676. [PMID: 33423625 PMCID: PMC7815433 DOI: 10.1098/rstb.2019.0676] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/12/2020] [Indexed: 12/24/2022] Open
Abstract
Do we have any valid reasons to affirm that non-human primates display economic behaviour in a sufficiently rich and precise sense of the phrase? To address this question, we have to develop a set of criteria to assess the vast array of experimental studies and field observations on individual cognitive and behavioural competences as well as the collective organization of non-human primates. We review a sample of these studies and assess how they answer to the following four main challenges. (i) Do we see any economic organization or institutions emerge among groups of non-human primates? (ii) Are the cognitive abilities, and often biases, that have been evidenced as underlying typical economic decision-making among humans, also present among non-human primates? (iii) Can we draw positive lessons from performance comparisons among primate species, humans and non-humans but also across non-human primate species, as elicited by canonical game-theoretical experimental paradigms, especially as far as economic cooperation and coordination are concerned? And (iv) in which way should we improve models and paradigms to obtain more ecological data and conclusions? Articles discussed in this paper most often bring about positive answers and promising perspectives to support the existence and prevalence of economic behaviours among non-human primates. This article is part of the theme issue 'Existence and prevalence of economic behaviours among non-human primates'.
Collapse
Affiliation(s)
- Sacha Bourgeois-Gironde
- Institut Jean Nicod, Département d’études cognitives, ENS, EHESS, CNRS, PSLUniversity, France
| | - Elsa Addessi
- Unità di Primatologia Cognitiva e Centro Primati, Istituto di Scienze e Tecnologie della Cognizione, Consiglio Nazionale delle Ricerche, 00197 Rome, Italy
| | - Thomas Boraud
- CNRS, UMR 5293, IMN, 33000 Bordeaux, France
- University of Bordeaux, UMR 5293, IMN, 33000, Bordeaux, France
- CHU de Bordeaux, IMN Clinique, 33000 Bordeaux, France
| |
Collapse
|
6
|
Brosnan SF. What behaviour in economic games tells us about the evolution of non-human species' economic decision-making behaviour. Philos Trans R Soc Lond B Biol Sci 2021; 376:20190670. [PMID: 33423638 DOI: 10.1098/rstb.2019.0670] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023] Open
Abstract
In the past decade, there has been a surge of interest in using games derived from experimental economics to test decision-making behaviour across species. In most cases, researchers are using the games as a tool, for instance, to understand what factors influence decision-making, how decision-making differs across species or contexts, or to ask broader questions about species' propensities to cooperate or compete. These games have been quite successful in this regard. To what degree, however, do these games tap into species' economic decision-making? For the purpose of understanding the evolution of economic systems in humans, this is the key question. To study this, we can break economic decision-making down into smaller components, each of which is a potential step in the evolution of human economic behaviour. We can then use data from economic games, which are simplified, highly structured models of decision-making and therefore ideal for the comparative approach, to directly compare these components across species and contexts, as well as in relation to more naturalistic behaviours, to better understand the evolution of economic behaviour and the social and ecological contexts that influenced it. The comparative approach has successfully informed us about the evolution of other complex traits, such as language and morality, and should help us more deeply understand why and how human economic systems evolved. This article is part of the theme issue 'Existence and prevalence of economic behaviours among non-human primates'.
Collapse
Affiliation(s)
- Sarah F Brosnan
- Departments of Psychology & Philosophy, Neuroscience Institute, Center for Behavioral Neuroscience, Language Research Center, Georgia State University, PO Box 5010, Atlanta, GA 30302-5010, USA
| |
Collapse
|