1
|
Bergerot C, Barfuss W, Romanczuk P. Moderate confirmation bias enhances decision-making in groups of reinforcement-learning agents. PLoS Comput Biol 2024; 20:e1012404. [PMID: 39231162 DOI: 10.1371/journal.pcbi.1012404] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2024] [Accepted: 08/09/2024] [Indexed: 09/06/2024] Open
Abstract
Humans tend to give more weight to information confirming their beliefs than to information that disconfirms them. Nevertheless, this apparent irrationality has been shown to improve individual decision-making under uncertainty. However, little is known about this bias' impact on decision-making in a social context. Here, we investigate the conditions under which confirmation bias is beneficial or detrimental to decision-making under social influence. To do so, we develop a Collective Asymmetric Reinforcement Learning (CARL) model in which artificial agents observe others' actions and rewards, and update this information asymmetrically. We use agent-based simulations to study how confirmation bias affects collective performance on a two-armed bandit task, and how resource scarcity, group size and bias strength modulate this effect. We find that a confirmation bias benefits group learning across a wide range of resource-scarcity conditions. Moreover, we discover that, past a critical bias strength, resource abundance favors the emergence of two different performance regimes, one of which is suboptimal. In addition, we find that this regime bifurcation comes with polarization in small groups of agents. Overall, our results suggest the existence of an optimal, moderate level of confirmation bias for decision-making in a social context.
Collapse
Affiliation(s)
- Clémence Bergerot
- Department of Biology, Humboldt Universität zu Berlin, Berlin, Germany
- Charité - Universitätsmedizin Berlin, Einstein Center for Neurosciences Berlin, Berlin, Germany
| | - Wolfram Barfuss
- Transdisciplinary Research Area: Sustainable Futures, University of Bonn, Bonn, Germany
- Center for Development Research (ZEF), University of Bonn, Bonn, Germany
| | - Pawel Romanczuk
- Department of Biology, Humboldt Universität zu Berlin, Berlin, Germany
- Science of Intelligence, Research Cluster of Excellence, Berlin, Germany
| |
Collapse
|
2
|
Koch C, Zika O, Bruckner R, Schuck NW. Influence of surprise on reinforcement learning in younger and older adults. PLoS Comput Biol 2024; 20:e1012331. [PMID: 39141681 PMCID: PMC11346965 DOI: 10.1371/journal.pcbi.1012331] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2023] [Revised: 08/26/2024] [Accepted: 07/16/2024] [Indexed: 08/16/2024] Open
Abstract
Surprise is a key component of many learning experiences, and yet its precise computational role, and how it changes with age, remain debated. One major challenge is that surprise often occurs jointly with other variables, such as uncertainty and outcome probability. To assess how humans learn from surprising events, and whether aging affects this process, we studied choices while participants learned from bandits with either Gaussian or bi-modal outcome distributions, which decoupled outcome probability, uncertainty, and surprise. A total of 102 participants (51 older, aged 50-73; 51 younger, 19-30 years) chose between three bandits, one of which had a bimodal outcome distribution. Behavioral analyses showed that both age-groups learned the average of the bimodal bandit less well. A trial-by-trial analysis indicated that participants performed choice reversals immediately following large absolute prediction errors, consistent with heightened sensitivity to surprise. This effect was stronger in older adults. Computational models indicated that learning rates in younger as well as older adults were influenced by surprise, rather than uncertainty, but also suggested large interindividual variability in the process underlying learning in our task. Our work bridges between behavioral economics research that has focused on how outcomes with low probability affect choice in older adults, and reinforcement learning work that has investigated age differences in the effects of uncertainty and suggests that older adults overly adapt to surprising events, even when accounting for probability and uncertainty effects.
Collapse
Affiliation(s)
- Christoph Koch
- Max Planck Institute for Human Development, Berlin, Germany
- Institute of Psychology, Universität Hamburg, Hamburg, Germany
| | - Ondrej Zika
- Max Planck Institute for Human Development, Berlin, Germany
- Max Planck UCL Centre for Computational Psychiatry and Aging Research, Berlin, Germany, and London, United Kingdom
| | - Rasmus Bruckner
- Max Planck Institute for Human Development, Berlin, Germany
- Department of Education and Psychology, Freie Universität Berlin, Berlin, Germany
| | - Nicolas W. Schuck
- Max Planck Institute for Human Development, Berlin, Germany
- Institute of Psychology, Universität Hamburg, Hamburg, Germany
- Max Planck UCL Centre for Computational Psychiatry and Aging Research, Berlin, Germany, and London, United Kingdom
| |
Collapse
|
3
|
Stoll FM, Rudebeck PH. Preferences reveal dissociable encoding across prefrontal-limbic circuits. Neuron 2024; 112:2241-2256.e8. [PMID: 38640933 PMCID: PMC11223984 DOI: 10.1016/j.neuron.2024.03.020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2023] [Revised: 12/04/2023] [Accepted: 03/19/2024] [Indexed: 04/21/2024]
Abstract
Individual preferences for the flavor of different foods and fluids exert a strong influence on behavior. Most current theories posit that preferences are integrated with other state variables in the orbitofrontal cortex (OFC), which is thought to derive the relative subjective value of available options to guide choice behavior. Here, we report that instead of a single integrated valuation system in the OFC, another complementary one is centered in the ventrolateral prefrontal cortex (vlPFC) in macaques. Specifically, we found that the OFC and vlPFC preferentially represent outcome flavor and outcome probability, respectively, and that preferences are separately integrated into value representations in these areas. In addition, the vlPFC, but not the OFC, represented the probability of receiving the available outcome flavors separately, with the difference between these representations reflecting the degree of preference for each flavor. Thus, both the vlPFC and OFC exhibit dissociable but complementary representations of subjective value, both of which are necessary for decision-making.
Collapse
Affiliation(s)
- Frederic M Stoll
- Nash Family Department of Neuroscience, Lipschultz Center for Cognitive Neuroscience and Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA.
| | - Peter H Rudebeck
- Nash Family Department of Neuroscience, Lipschultz Center for Cognitive Neuroscience and Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA.
| |
Collapse
|
4
|
Feng YY, Bromberg-Martin ES, Monosov IE. Dorsal raphe neurons integrate the values of reward amount, delay, and uncertainty in multi-attribute decision-making. Cell Rep 2024; 43:114341. [PMID: 38878290 DOI: 10.1016/j.celrep.2024.114341] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2023] [Revised: 03/27/2024] [Accepted: 05/23/2024] [Indexed: 06/25/2024] Open
Abstract
The dorsal raphe nucleus (DRN) is implicated in psychiatric disorders that feature impaired sensitivity to reward amount, impulsivity when facing reward delays, and risk-seeking when confronting reward uncertainty. However, it has been unclear whether and how DRN neurons signal reward amount, reward delay, and reward uncertainty during multi-attribute value-based decision-making, where subjects consider these attributes to make a choice. We recorded DRN neurons as monkeys chose between offers whose attributes, namely expected reward amount, reward delay, and reward uncertainty, varied independently. Many DRN neurons signaled offer attributes, and this population tended to integrate the attributes in a manner that reflected monkeys' preferences for amount, delay, and uncertainty. After decision-making, in response to post-decision feedback, these same neurons signaled signed reward prediction errors, suggesting a broader role in tracking value across task epochs and behavioral contexts. Our data illustrate how the DRN participates in value computations, guiding theories about the role of the DRN in decision-making and psychiatric disease.
Collapse
Affiliation(s)
- Yang-Yang Feng
- Department of Neuroscience, Washington University School of Medicine, St. Louis, MO, USA; Department of Biomedical Engineering, Washington University, St. Louis, MO, USA
| | | | - Ilya E Monosov
- Department of Neuroscience, Washington University School of Medicine, St. Louis, MO, USA; Department of Biomedical Engineering, Washington University, St. Louis, MO, USA; Washington University Pain Center, Washington University, St. Louis, MO, USA; Department of Neurosurgery, Washington University, St. Louis, MO, USA; Department of Electrical Engineering, Washington University, St. Louis, MO, USA.
| |
Collapse
|
5
|
Russek EM, Moran R, Liu Y, Dolan RJ, Huys QJM. Heuristics in risky decision-making relate to preferential representation of information. Nat Commun 2024; 15:4269. [PMID: 38769095 PMCID: PMC11106265 DOI: 10.1038/s41467-024-48547-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2023] [Accepted: 05/03/2024] [Indexed: 05/22/2024] Open
Abstract
When making choices, individuals differ from one another, as well as from normativity, in how they weigh different types of information. One explanation for this relates to idiosyncratic preferences in what information individuals represent when evaluating choice options. Here, we test this explanation with a simple risky-decision making task, combined with magnetoencephalography (MEG). We examine the relationship between individual differences in behavioral markers of information weighting and neural representation of stimuli pertinent to incorporating that information. We find that the extent to which individuals (N = 19) behaviorally weight probability versus reward information is related to how preferentially they neurally represent stimuli most informative for making probability and reward comparisons. These results are further validated in an additional behavioral experiment (N = 88) that measures stimulus representation as the latency of perceptual detection following priming. Overall, the results suggest that differences in the information individuals consider during choice relate to their risk-taking tendencies.
Collapse
Affiliation(s)
- Evan M Russek
- Max Planck University College London Centre for Computational Psychiatry and Ageing Research, University College London, Queen Square Institute of Neurology, London, UK.
- Wellcome Centre for Human Neuroimaging, University College London, Queen Square Institute of Neurology, London, UK.
- Departments of Computer Science and Psychology, Princeton University, Princeton, NJ, USA.
| | - Rani Moran
- Max Planck University College London Centre for Computational Psychiatry and Ageing Research, University College London, Queen Square Institute of Neurology, London, UK
- Wellcome Centre for Human Neuroimaging, University College London, Queen Square Institute of Neurology, London, UK
- Department of Psychology, School of Biological and Behavioural Sciences, Queen Mary University of London, London, UK
| | - Yunzhe Liu
- State Key Laboratory of Cognitive Neuroscience and Learning, IDG/McGovern Institute for Brain Research, Beijing Normal University, Beijing, China
- Chinese Institute for Brain Research, Beijing, China
| | - Raymond J Dolan
- Max Planck University College London Centre for Computational Psychiatry and Ageing Research, University College London, Queen Square Institute of Neurology, London, UK
- Wellcome Centre for Human Neuroimaging, University College London, Queen Square Institute of Neurology, London, UK
| | - Quentin J M Huys
- Max Planck University College London Centre for Computational Psychiatry and Ageing Research, University College London, Queen Square Institute of Neurology, London, UK
- Wellcome Centre for Human Neuroimaging, University College London, Queen Square Institute of Neurology, London, UK
- Camden and Islington NHS Foundation Trust, London, UK
- Division of Psychiatry, University College London, London, UK
| |
Collapse
|
6
|
Lee DG, D'Alessandro M, Iodice P, Calluso C, Rustichini A, Pezzulo G. Risky decisions are influenced by individual attributes as a function of risk preference. Cogn Psychol 2023; 147:101614. [PMID: 37837926 DOI: 10.1016/j.cogpsych.2023.101614] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2022] [Revised: 09/13/2023] [Accepted: 10/05/2023] [Indexed: 10/16/2023]
Abstract
It has long been assumed in economic theory that multi-attribute decisions involving several attributes or dimensions - such as probabilities and amounts of money to be earned during risky choices - are resolved by first combining the attributes of each option to form an overall expected value and then comparing the expected values of the alternative options, using a unique evidence accumulation process. A plausible alternative would be performing independent comparisons between the individual attributes and then integrating the results of the comparisons afterwards. Here, we devise a novel method to disambiguate between these types of models, by orthogonally manipulating the expected value of choice options and the relative salience of their attributes. Our results, based on behavioral measures and drift-diffusion models, provide evidence in favor of the framework where information about individual attributes independently impacts deliberation. This suggests that risky decisions are resolved by running in parallel multiple comparisons between the separate attributes - possibly alongside an additional comparison of expected value. This result stands in contrast with the assumption of standard economic theory that choices require a unique comparison of expected values and suggests that at the cognitive level, decision processes might be more distributed than commonly assumed. Beyond our planned analyses, we also discovered that attribute salience affects people of different risk preference type in different ways: risk-averse participants seem to focus more on probability, except when monetary amount is particularly high; risk-neutral/seeking participants, in contrast, seem to focus more on monetary amount, except when probability is particularly low.
Collapse
Affiliation(s)
- Douglas G Lee
- Tel Aviv University, School of Psychological Sciences, Tel Aviv, Israel; Institute of Cognitive Sciences and Technologies, National Research Council, Rome, Italy
| | - Marco D'Alessandro
- Institute of Cognitive Sciences and Technologies, National Research Council, Rome, Italy
| | - Pierpaolo Iodice
- Université de Rouen, Rouen, France; Movement Interactions Performance Lab, Le Mans Université, Le Mans, France
| | | | | | - Giovanni Pezzulo
- Institute of Cognitive Sciences and Technologies, National Research Council, Rome, Italy.
| |
Collapse
|
7
|
Bao C, Zhu X, Mōller-Mara J, Li J, Dubroqua S, Erlich JC. The rat frontal orienting field dynamically encodes value for economic decisions under risk. Nat Neurosci 2023; 26:1942-1952. [PMID: 37857772 PMCID: PMC10620098 DOI: 10.1038/s41593-023-01461-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2022] [Accepted: 09/11/2023] [Indexed: 10/21/2023]
Abstract
Frontal and parietal cortex are implicated in economic decision-making, but their causal roles are untested. Here we silenced the frontal orienting field (FOF) and posterior parietal cortex (PPC) while rats chose between a cued lottery and a small stable surebet. PPC inactivations produced minimal short-lived effects. FOF inactivations reliably reduced lottery choices. A mixed-agent model of choice indicated that silencing the FOF caused a change in the curvature of the rats' utility function (U = Vρ). Consistent with this finding, single-neuron and population analyses of neural activity confirmed that the FOF encodes the lottery value on each trial. A dynamical model, which accounts for electrophysiological and silencing results, suggests that the FOF represents the current lottery value to compare against the remembered surebet value. These results demonstrate that the FOF is a critical node in the neural circuit for the dynamic representation of action values for choice under risk.
Collapse
Affiliation(s)
- Chaofei Bao
- NYU-ECNU Institute of Brain and Cognitive Science at NYU Shanghai, Shanghai, China
- NYU Shanghai, Shanghai, China
- Shanghai Key Laboratory of Brain Functional Genomics (Ministry of Education), East China Normal University, Shanghai, China
- Sainsbury Wellcome Centre, University College London, London, UK
| | - Xiaoyue Zhu
- NYU-ECNU Institute of Brain and Cognitive Science at NYU Shanghai, Shanghai, China
- NYU Shanghai, Shanghai, China
- Shanghai Key Laboratory of Brain Functional Genomics (Ministry of Education), East China Normal University, Shanghai, China
| | - Joshua Mōller-Mara
- NYU-ECNU Institute of Brain and Cognitive Science at NYU Shanghai, Shanghai, China
- NYU Shanghai, Shanghai, China
- Shanghai Key Laboratory of Brain Functional Genomics (Ministry of Education), East China Normal University, Shanghai, China
| | - Jingjie Li
- NYU-ECNU Institute of Brain and Cognitive Science at NYU Shanghai, Shanghai, China
- NYU Shanghai, Shanghai, China
- Shanghai Key Laboratory of Brain Functional Genomics (Ministry of Education), East China Normal University, Shanghai, China
- Sainsbury Wellcome Centre, University College London, London, UK
| | - Sylvain Dubroqua
- NYU-ECNU Institute of Brain and Cognitive Science at NYU Shanghai, Shanghai, China
- NYU Shanghai, Shanghai, China
- Shanghai Key Laboratory of Brain Functional Genomics (Ministry of Education), East China Normal University, Shanghai, China
| | - Jeffrey C Erlich
- NYU-ECNU Institute of Brain and Cognitive Science at NYU Shanghai, Shanghai, China.
- NYU Shanghai, Shanghai, China.
- Shanghai Key Laboratory of Brain Functional Genomics (Ministry of Education), East China Normal University, Shanghai, China.
- Sainsbury Wellcome Centre, University College London, London, UK.
| |
Collapse
|
8
|
Johnston WJ, Fine JM, Yoo SBM, Ebitz RB, Hayden BY. Semi-orthogonal subspaces for value mediate a tradeoff between binding and generalization. ARXIV 2023:arXiv:2309.07766v1. [PMID: 37744462 PMCID: PMC10516109] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Subscribe] [Scholar Register] [Indexed: 09/26/2023]
Abstract
When choosing between options, we must associate their values with the action needed to select them. We hypothesize that the brain solves this binding problem through neural population subspaces. To test this hypothesis, we examined neuronal responses in five reward-sensitive regions in macaques performing a risky choice task with sequential offers. Surprisingly, in all areas, the neural population encoded the values of offers presented on the left and right in distinct subspaces. We show that the encoding we observe is sufficient to bind the values of the offers to their respective positions in space while preserving abstract value information, which may be important for rapid learning and generalization to novel contexts. Moreover, after both offers have been presented, all areas encode the value of the first and second offers in orthogonal subspaces. In this case as well, the orthogonalization provides binding. Our binding-by-subspace hypothesis makes two novel predictions borne out by the data. First, behavioral errors should correlate with putative spatial (but not temporal) misbinding in the neural representation. Second, the specific representational geometry that we observe across animals also indicates that behavioral errors should increase when offers have low or high values, compared to when they have medium values, even when controlling for value difference. Together, these results support the idea that the brain makes use of semi-orthogonal subspaces to bind features together.
Collapse
Affiliation(s)
- W. Jeffrey Johnston
- Center for Theoretical Neuroscience and Mortimer B. Zuckerman Mind, Brain, and Behavior Institute, Columbia University, New York, New York, United States of America
| | - Justin M. Fine
- Department of Neurosurgery, Baylor College of Medicine, Houston, Texas, United States of America
| | - Seng Bum Michael Yoo
- Department of Biomedical Engineering, Sunkyunkwan University, and Center for Neuroscience Imaging Research, Institute of Basic Sciences, Suwon, South Korea, Republic of Korea, 16419
| | - R. Becket Ebitz
- Department of Neuroscience, Université de Montréal, Montréal, Quebec, Canada
| | - Benjamin Y. Hayden
- Department of Neurosurgery, Baylor College of Medicine, Houston, Texas, United States of America
| |
Collapse
|
9
|
Maisson DJN, Cervera RL, Voloh B, Conover I, Zambre M, Zimmermann J, Hayden BY. Widespread coding of navigational variables in prefrontal cortex. Curr Biol 2023; 33:3478-3488.e3. [PMID: 37541250 PMCID: PMC10984098 DOI: 10.1016/j.cub.2023.07.024] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2023] [Revised: 06/01/2023] [Accepted: 07/13/2023] [Indexed: 08/06/2023]
Abstract
To navigate effectively, we must represent information about our location in the environment. Traditional research highlights the role of the hippocampal complex in this process. Spurred by recent research highlighting the widespread cortical encoding of cognitive and motor variables previously thought to have localized function, we hypothesized that navigational variables would be likewise encoded widely, especially in the prefrontal cortex, which is associated with volitional behavior. We recorded neural activity from six prefrontal regions while macaques performed a foraging task in an open enclosure. In all regions, we found strong encoding of allocentric position, allocentric head direction, boundary distance, and linear and angular velocity. These encodings were not accounted for by distance, time to reward, or motor factors. The strength of coding of all variables increased along a ventral-to-dorsal gradient. Together, these results argue that encoding of navigational variables is not localized to the hippocampus and support the hypothesis that navigation is continuous with other forms of flexible cognition in the service of action.
Collapse
Affiliation(s)
- David J-N Maisson
- Department of Neuroscience, Center for Magnetic Resonance Research, Center for Neuroengineering, Department of Biomedical Engineering, University of Minnesota, Minneapolis, MN 55455, USA
| | - Roberto Lopez Cervera
- Department of Neuroscience, Center for Magnetic Resonance Research, Center for Neuroengineering, Department of Biomedical Engineering, University of Minnesota, Minneapolis, MN 55455, USA
| | - Benjamin Voloh
- Department of Neuroscience, Center for Magnetic Resonance Research, Center for Neuroengineering, Department of Biomedical Engineering, University of Minnesota, Minneapolis, MN 55455, USA
| | - Indirah Conover
- Department of Neuroscience, Center for Magnetic Resonance Research, Center for Neuroengineering, Department of Biomedical Engineering, University of Minnesota, Minneapolis, MN 55455, USA
| | - Mrunal Zambre
- Department of Neuroscience, Center for Magnetic Resonance Research, Center for Neuroengineering, Department of Biomedical Engineering, University of Minnesota, Minneapolis, MN 55455, USA
| | - Jan Zimmermann
- Department of Neuroscience, Center for Magnetic Resonance Research, Center for Neuroengineering, Department of Biomedical Engineering, University of Minnesota, Minneapolis, MN 55455, USA
| | - Benjamin Y Hayden
- Department of Neuroscience, Center for Magnetic Resonance Research, Center for Neuroengineering, Department of Biomedical Engineering, University of Minnesota, Minneapolis, MN 55455, USA.
| |
Collapse
|
10
|
Fine JM, Maisson DJN, Yoo SBM, Cash-Padgett TV, Wang MZ, Zimmermann J, Hayden BY. Abstract Value Encoding in Neural Populations But Not Single Neurons. J Neurosci 2023; 43:4650-4663. [PMID: 37208178 PMCID: PMC10286943 DOI: 10.1523/jneurosci.1954-22.2023] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2022] [Revised: 05/07/2023] [Accepted: 05/11/2023] [Indexed: 05/21/2023] Open
Abstract
An important open question in neuroeconomics is how the brain represents the value of offers in a way that is both abstract (allowing for comparison) and concrete (preserving the details of the factors that influence value). Here, we examine neuronal responses to risky and safe options in five brain regions that putatively encode value in male macaques. Surprisingly, we find no detectable overlap in the neural codes used for risky and safe options, even when the options have identical subjective values (as revealed by preference) in any of the regions. Indeed, responses are weakly correlated and occupy distinct (semi-orthogonal) encoding subspaces. Notably, however, these subspaces are linked through a linear transform of their constituent encodings, a property that allows for comparison of dissimilar option types. This encoding scheme allows these regions to multiplex decision related processes: they can encode the detailed factors that influence offer value (here, risky and safety) but also directly compare dissimilar offer types. Together these results suggest a neuronal basis for the qualitatively different psychological properties of risky and safe options and highlight the power of population geometry to resolve outstanding problems in neural coding.SIGNIFICANCE STATEMENT To make economic choices, we must have some mechanism for comparing dissimilar offers. We propose that the brain uses distinct neural codes for risky and safe offers, but that these codes are linearly transformable. This encoding scheme has the dual advantage of allowing for comparison across offer types while preserving information about offer type, which in turn allows for flexibility in changing circumstances. We show that responses to risky and safe offers exhibit these predicted properties in five different reward-sensitive regions. Together, these results highlight the power of population coding principles for solving representation problems in economic choice.
Collapse
Affiliation(s)
- Justin M Fine
- Department of Neuroscience and Center for Magnetic Resonance Research, University of Minnesota, Minneapolis, Minnesota 55455
| | - David J-N Maisson
- Department of Neuroscience and Center for Magnetic Resonance Research, University of Minnesota, Minneapolis, Minnesota 55455
| | - Seng Bum Michael Yoo
- Department of Neuroscience and Center for Magnetic Resonance Research, University of Minnesota, Minneapolis, Minnesota 55455
| | - Tyler V Cash-Padgett
- Department of Neuroscience and Center for Magnetic Resonance Research, University of Minnesota, Minneapolis, Minnesota 55455
| | - Maya Zhe Wang
- Department of Neuroscience and Center for Magnetic Resonance Research, University of Minnesota, Minneapolis, Minnesota 55455
| | - Jan Zimmermann
- Department of Neuroscience and Center for Magnetic Resonance Research, University of Minnesota, Minneapolis, Minnesota 55455
| | - Benjamin Y Hayden
- Department of Neuroscience and Center for Magnetic Resonance Research, University of Minnesota, Minneapolis, Minnesota 55455
| |
Collapse
|
11
|
Voloh B, Eisenreich BR, Maisson DJN, Ebitz RB, Park HS, Hayden BY, Zimmermann J. Hierarchical organization of rhesus macaque behavior. OXFORD OPEN NEUROSCIENCE 2023; 2:kvad006. [PMID: 37577290 PMCID: PMC10421634 DOI: 10.1093/oons/kvad006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/05/2023] [Revised: 06/11/2023] [Accepted: 06/12/2023] [Indexed: 08/15/2023]
Abstract
Primatologists, psychologists and neuroscientists have long hypothesized that primate behavior is highly structured. However, delineating that structure has been impossible due to the difficulties of precision behavioral tracking. Here we analyzed a dataset consisting of continuous measures of the 3D position of two male rhesus macaques (Macaca mulatta) performing three different tasks in a large unrestrained environment over several hours. Using an unsupervised embedding approach on the tracked joints, we identified commonly repeated pose patterns, which we call postures. We found that macaques' behavior is characterized by 49 distinct postures, lasting an average of 0.6 seconds. We found evidence that behavior is hierarchically organized, in that transitions between poses tend to occur within larger modules, which correspond to identifiable actions; these actions are further organized hierarchically. Our behavioral decomposition allows us to identify universal (cross-individual and cross-task) and unique (specific to each individual and task) principles of behavior. These results demonstrate the hierarchical nature of primate behavior, provide a method for the automated ethogramming of primate behavior, and provide important constraints on neural models of pose generation.
Collapse
Affiliation(s)
- Benjamin Voloh
- Department of Neuroscience, Center for Magnetic Resonance Research, Center for Neuroengineering, 1 Baylor Plaza, Houston, TX 77030
| | - Benjamin R Eisenreich
- Department of Neuroscience, Center for Magnetic Resonance Research, Center for Neuroengineering, 1 Baylor Plaza, Houston, TX 77030
| | - David J-N Maisson
- Department of Neuroscience, Center for Magnetic Resonance Research, Center for Neuroengineering, 1 Baylor Plaza, Houston, TX 77030
| | - R Becket Ebitz
- Department of Neuroscience, Center for Magnetic Resonance Research, Center for Neuroengineering, 1 Baylor Plaza, Houston, TX 77030
| | - Hyun Soo Park
- Department of Computer Science and Engineering, University of Minnesota, 40 Church St, Minneapolis, MN 55455, USA
| | - Benjamin Y Hayden
- Department of Neuroscience, Center for Magnetic Resonance Research, Center for Neuroengineering, 1 Baylor Plaza, Houston, TX 77030
| | - Jan Zimmermann
- Department of Neuroscience, Center for Magnetic Resonance Research, Center for Neuroengineering, 1 Baylor Plaza, Houston, TX 77030
| |
Collapse
|
12
|
Woo JH, Aguirre CG, Bari BA, Tsutsui KI, Grabenhorst F, Cohen JY, Schultz W, Izquierdo A, Soltani A. Mechanisms of adjustments to different types of uncertainty in the reward environment across mice and monkeys. COGNITIVE, AFFECTIVE & BEHAVIORAL NEUROSCIENCE 2023; 23:600-619. [PMID: 36823249 PMCID: PMC10444905 DOI: 10.3758/s13415-022-01059-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Accepted: 12/22/2022] [Indexed: 02/25/2023]
Abstract
Despite being unpredictable and uncertain, reward environments often exhibit certain regularities, and animals navigating these environments try to detect and utilize such regularities to adapt their behavior. However, successful learning requires that animals also adjust to uncertainty associated with those regularities. Here, we analyzed choice data from two comparable dynamic foraging tasks in mice and monkeys to investigate mechanisms underlying adjustments to different types of uncertainty. In these tasks, animals selected between two choice options that delivered reward probabilistically, while baseline reward probabilities changed after a variable number (block) of trials without any cues to the animals. To measure adjustments in behavior, we applied multiple metrics based on information theory that quantify consistency in behavior, and fit choice data using reinforcement learning models. We found that in both species, learning and choice were affected by uncertainty about reward outcomes (in terms of determining the better option) and by expectation about when the environment may change. However, these effects were mediated through different mechanisms. First, more uncertainty about the better option resulted in slower learning and forgetting in mice, whereas it had no significant effect in monkeys. Second, expectation of block switches accompanied slower learning, faster forgetting, and increased stochasticity in choice in mice, whereas it only reduced learning rates in monkeys. Overall, while demonstrating the usefulness of metrics based on information theory in examining adaptive behavior, our study provides evidence for multiple types of adjustments in learning and choice behavior according to uncertainty in the reward environment.
Collapse
Affiliation(s)
- Jae Hyung Woo
- Department of Psychological and Brain Sciences, Dartmouth College, Hanover, NH, USA
| | - Claudia G Aguirre
- Department of Psychology, University of California, Los Angeles, Los Angeles, CA, USA
| | - Bilal A Bari
- Department of Psychiatry, Massachusetts General Hospital, Boston, MA, USA
| | - Ken-Ichiro Tsutsui
- Department of Physiology, Development & Neuroscience, University of Cambridge, Cambridge, UK
- Laboratory of Systems Neuroscience, Tohoku University Graduate School of Life Sciences, Sendai, Japan
| | - Fabian Grabenhorst
- Department of Physiology, Development & Neuroscience, University of Cambridge, Cambridge, UK
- Department of Experimental Psychology, University of Oxford, Oxford, UK
| | - Jeremiah Y Cohen
- The Solomon H. Snyder Department of Neuroscience, Brain Science Institute, Kavli Neuroscience Discovery Institute, The Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Allen Institute for Neural Dynamics, Seattle, WA, USA
| | - Wolfram Schultz
- Department of Physiology, Development & Neuroscience, University of Cambridge, Cambridge, UK
| | - Alicia Izquierdo
- Department of Psychology, University of California, Los Angeles, Los Angeles, CA, USA
- The Brain Research Institute, University of California, Los Angeles, Los Angeles, CA, USA
| | - Alireza Soltani
- Department of Psychological and Brain Sciences, Dartmouth College, Hanover, NH, USA.
| |
Collapse
|
13
|
Overman MJ, Sarrazin V, Browning M, O'Shea J. Stimulating human prefrontal cortex increases reward learning. Neuroimage 2023; 271:120029. [PMID: 36925089 DOI: 10.1016/j.neuroimage.2023.120029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2022] [Revised: 03/07/2023] [Accepted: 03/13/2023] [Indexed: 03/17/2023] Open
Abstract
Work in computational psychiatry suggests that mood disorders may stem from aberrant reinforcement learning processes. Specifically, it has been proposed that depressed individuals believe that negative events are more informative than positive events, resulting in higher learning rates from negative outcomes (Pulcu and Browning, 2019). In this proof-of-concept study, we investigated whether transcranial direct current stimulation (tDCS) applied to dorsolateral prefrontal cortex, as commonly used in depression treatment trials, might change learning rates for affective outcomes. Healthy adults completed an established reinforcement learning task (Pulcu and Browning, 2017) in which the information content of reward and loss outcomes was manipulated by varying the volatility of stimulus-outcome associations. Learning rates on the tasks were quantified using computational models. Stimulation over dorsolateral prefrontal cortex (DLPFC) but not motor cortex (M1) increased learning rates specifically for reward outcomes. The effects of prefrontal tDCS were cognitive state-dependent: tDCS applied during task performance increased learning rates for wins; tDCS applied before task performance decreased both win and loss learning rates. A replication study confirmed the key finding that tDCS to DLPFC during task performance increased learning rates specifically for rewards. Taken together, these findings demonstrate the potential of tDCS for modulating computational parameters of reinforcement learning that are relevant to mood disorders.
Collapse
Affiliation(s)
- Margot Juliëtte Overman
- Wellcome Centre for Integrative Neuroimaging, University of Oxford, OX3 9DU, United Kingdom; Department of Psychiatry, Warneford Hospital, University of Oxford, OX3 7JX, United Kingdom; Oxford Centre for Human Brain Activity (OHBA), University of Oxford, OX3 7JX, United Kingdom
| | - Verena Sarrazin
- Wellcome Centre for Integrative Neuroimaging, University of Oxford, OX3 9DU, United Kingdom; Department of Psychiatry, Warneford Hospital, University of Oxford, OX3 7JX, United Kingdom; Oxford Centre for Human Brain Activity (OHBA), University of Oxford, OX3 7JX, United Kingdom
| | - Michael Browning
- Department of Psychiatry, Warneford Hospital, University of Oxford, OX3 7JX, United Kingdom
| | - Jacinta O'Shea
- Wellcome Centre for Integrative Neuroimaging, University of Oxford, OX3 9DU, United Kingdom; Department of Psychiatry, Warneford Hospital, University of Oxford, OX3 7JX, United Kingdom; Oxford Centre for Human Brain Activity (OHBA), University of Oxford, OX3 7JX, United Kingdom.
| |
Collapse
|
14
|
Bakst L, McGuire JT. Experience-driven recalibration of learning from surprising events. Cognition 2023; 232:105343. [PMID: 36481590 PMCID: PMC9851993 DOI: 10.1016/j.cognition.2022.105343] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2022] [Revised: 10/13/2022] [Accepted: 11/21/2022] [Indexed: 12/12/2022]
Abstract
Different environments favor different patterns of adaptive learning. A surprising event that in one context would accelerate belief updating might, in another context, be downweighted as a meaningless outlier. Here, we investigated whether people would spontaneously regulate the influence of surprise on learning in response to event-by-event experiential feedback. Across two experiments, we examined whether participants performing a perceptual judgment task under spatial uncertainty (n = 29, n = 63) adapted their patterns of predictive gaze according to the informativeness or uninformativeness of surprising events in their current environment. Uninstructed predictive eye movements exhibited a form of metalearning in which surprise came to modulate event-by-event learning rates in opposite directions across contexts. Participants later appropriately readjusted their patterns of adaptive learning when the statistics of the environment underwent an unsignaled reversal. Although significant adjustments occurred in both directions, performance was consistently superior in environments in which surprising events reflected meaningful change, potentially reflecting a bias towards interpreting surprise as informative and/or difficulty ignoring salient outliers. Our results provide evidence for spontaneous, context-appropriate recalibration of the role of surprise in adaptive learning.
Collapse
Affiliation(s)
- Leah Bakst
- Department of Psychological & Brain Sciences, Boston University, 64 Cummington Mall, Boston, MA 02215, USA; Center for Systems Neuroscience, Boston University, 610 Commonwealth Avenue, Boston, MA 02215, USA.
| | - Joseph T McGuire
- Department of Psychological & Brain Sciences, Boston University, 64 Cummington Mall, Boston, MA 02215, USA; Center for Systems Neuroscience, Boston University, 610 Commonwealth Avenue, Boston, MA 02215, USA.
| |
Collapse
|
15
|
Seak LCU, Ferrari-Toniolo S, Jain R, Nielsen K, Schultz W. Systematic comparison of risky choices in humans and monkeys. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.02.07.527517. [PMID: 36798272 PMCID: PMC9934584 DOI: 10.1101/2023.02.07.527517] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/18/2023]
Abstract
The past decades have seen tremendous progress in fundamental studies on economic choice in humans. However, elucidation of the underlying neuronal processes requires invasive neurophysiological studies that are met with difficulties in humans. Monkeys as evolutionary closest relatives offer a solution. The animals display sophisticated and well-controllable behavior that allows to implement key constructs of proven economic choice theories. However, the similarity of economic choice between the two species has never been systematically investigated. We investigated compliance with the independence axiom (IA) of expected utility theory as one of the most demanding choice tests and compared IA violations between humans and monkeys. Using generalized linear modeling and cumulative prospect theory (CPT), we found that humans and monkeys made comparable risky choices, although their subjective values (utilities) differed. These results suggest similar fundamental choice mechanism across these primate species and encourage to study their underlying neurophysiological mechanisms.
Collapse
Affiliation(s)
- Leo Chi U Seak
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge CB2 3DY, United Kingdom
| | - Simone Ferrari-Toniolo
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge CB2 3DY, United Kingdom
| | - Ritesh Jain
- Management School, University of Liverpool, Liverpool L697ZY, United Kingdom
| | - Kirby Nielsen
- Division of the Humanities and Social Sciences, California Institute of Technology, Pasadena CA 91125, USA
| | - Wolfram Schultz
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge CB2 3DY, United Kingdom
| |
Collapse
|
16
|
He L, Bhatia S. Complex economic decisions from simple neurocognitive processes: the role of interactive attention. Proc Biol Sci 2023; 290:20221593. [PMID: 36750198 PMCID: PMC9904951 DOI: 10.1098/rspb.2022.1593] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/09/2023] Open
Abstract
Neurocognitive theories of value-based choice propose that people additively accumulate choice attributes when making decisions. These theories cannot explain the emergence of complex multiplicative preferences such as those assumed by prospect theory and other economic models. We investigate an interactive attention mechanism, according to which attention to attributes (like payoffs) depends on other attributes (like probabilities) attended to previously. We formalize this mechanism using a Markov attention model combined with an accumulator decision process, and test our model on eye-tracking and mouse-tracking data in risky choice. Our tests show that interactive attention is necessary to make good choices, that most participants display interactive attention and that allowing for interactive attention in accumulation-based decision models improves their predictions. By equipping established decision models with sophisticated attentional dynamics, we extend these models to describe complex economic choice, and in the process, we unify two prominent theoretical approaches to studying value-based decision making.
Collapse
Affiliation(s)
- Lisheng He
- SILC Business School, Shanghai University, Shanghai, People's Republic of China
| | - Sudeep Bhatia
- Department of Psychology, University of Pennsylvania, Philadelphia, PA, USA
| |
Collapse
|
17
|
Neural mechanisms underlying the hierarchical construction of perceived aesthetic value. Nat Commun 2023; 14:127. [PMID: 36693833 PMCID: PMC9873760 DOI: 10.1038/s41467-022-35654-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2021] [Accepted: 12/15/2022] [Indexed: 01/26/2023] Open
Abstract
Little is known about how the brain computes the perceived aesthetic value of complex stimuli such as visual art. Here, we used computational methods in combination with functional neuroimaging to provide evidence that the aesthetic value of a visual stimulus is computed in a hierarchical manner via a weighted integration over both low and high level stimulus features contained in early and late visual cortex, extending into parietal and lateral prefrontal cortices. Feature representations in parietal and lateral prefrontal cortex may in turn be utilized to produce an overall aesthetic value in the medial prefrontal cortex. Such brain-wide computations are not only consistent with a feature-based mechanism for value construction, but also resemble computations performed by a deep convolutional neural network. Our findings thus shed light on the existence of a general neurocomputational mechanism for rapidly and flexibly producing value judgements across an array of complex novel stimuli and situations.
Collapse
|
18
|
Woo JH, Azab H, Jahn A, Hayden B, Brown JW. The PRO model accounts for the anterior cingulate cortex role in risky decision-making and monitoring. COGNITIVE, AFFECTIVE & BEHAVIORAL NEUROSCIENCE 2022; 22:952-968. [PMID: 35332510 PMCID: PMC11059203 DOI: 10.3758/s13415-022-00992-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Accepted: 02/17/2022] [Indexed: 11/08/2022]
Abstract
The anterior cingulate cortex (ACC) has been implicated in a number of functions, including performance monitoring and decision-making involving effort. The prediction of responses and outcomes (PRO) model has provided a unified account of much human and monkey ACC data involving anatomy, neurophysiology, EEG, fMRI, and behavior. We explored the computational nature of ACC with the PRO model, extending it to account specifically for both human and macaque monkey decision-making under risk, including both behavioral and neural data. We show that the PRO model can account for a number of additional effects related to outcome prediction, decision-making under risk, gambling behavior. In particular, we show that the ACC represents the variance of uncertain outcomes, suggesting a link between ACC function and mean-variance theories of decision making. The PRO model provides a unified account of a large set of data regarding the ACC.
Collapse
Affiliation(s)
- Jae Hyung Woo
- Department of Psychological & Brain Sciences, Indiana University, Bloomington, IN, USA
- Department of Psychological & Brain Sciences, Dartmouth College, Hanover, NH, USA
| | - Habiba Azab
- Baylor College of Medicine, Houston, TX, USA
| | - Andrew Jahn
- Department of Psychological & Brain Sciences, Indiana University, Bloomington, IN, USA
- fMRI Laboratory, University of Michigan, Ann Arbor, MI, USA
| | - Benjamin Hayden
- Department of Neuroscience, University of Minnesota, Minneapolis, MN, USA
| | - Joshua W Brown
- Department of Psychological & Brain Sciences, Indiana University, Bloomington, IN, USA.
| |
Collapse
|
19
|
Cicerale A, Blanzieri E, Sacco K. How does decision-making change during challenging times? PLoS One 2022; 17:e0270117. [PMID: 35905131 PMCID: PMC9337702 DOI: 10.1371/journal.pone.0270117] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2021] [Accepted: 06/05/2022] [Indexed: 11/18/2022] Open
Abstract
Prospect Theory, proposed and developed by Kahneman and Tversky, demonstrated that people do not make rational decisions based on expected utility, but are instead biased by specific cognitive tendencies leading to neglect, under- or over- consider information, depending on the context of presentation. In this vein, the present paper focuses on whether and how individual decision-making attitudes are prone to change in the presence of globally challenging events. We ran three partial replications of the Kahneman and Tversky (1979) paper, focusing on a set of eight prospects, after a terror attack (Paris, November 2015, 134 subjects) and during the Covid-19 pandemic, both during the first lockdown in Italy (Spring 2020, 176 subjects) and after the first reopening (140 subjects). The results confirm patterns of choice characterizing uncertain times, as shown by previous literature. In particular, we note significant increase of risk aversion, both in the gain and in the loss domains, that consistently emerged in the three replications. Given the nature of our sample, and the heterogeneity between the three periods investigated, we suggest that the phenomenon we present can be explained stress-related effects on decision making rather than by other economic effects, such as the income effect.
Collapse
Affiliation(s)
- Alessandro Cicerale
- BraIn Plasticity and Behavior Changes @Department of Psychology, and Neuroscience Institute of Turin—University of Turin, Turin, Italy
| | - Enrico Blanzieri
- Department of Engineering & Information Sciences, University of Trento, Trento, Italy
| | - Katiuscia Sacco
- BraIn Plasticity and Behavior Changes @Department of Psychology, and Neuroscience Institute of Turin—University of Turin, Turin, Italy
- * E-mail:
| |
Collapse
|
20
|
Knaebe B, Weiss CC, Zimmermann J, Hayden BY. The Promise of Behavioral Tracking Systems for Advancing Primate Animal Welfare. Animals (Basel) 2022; 12:1648. [PMID: 35804547 PMCID: PMC9265027 DOI: 10.3390/ani12131648] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2022] [Revised: 06/17/2022] [Accepted: 06/21/2022] [Indexed: 11/16/2022] Open
Abstract
Recent years have witnessed major advances in the ability of computerized systems to track the positions of animals as they move through large and unconstrained environments. These systems have so far been a great boon in the fields of primatology, psychology, neuroscience, and biomedicine. Here, we discuss the promise of these technologies for animal welfare. Their potential benefits include identifying and reducing pain, suffering, and distress in captive populations, improving laboratory animal welfare within the context of the three Rs of animal research (reduction, refinement, and replacement), and applying our understanding of animal behavior to increase the "natural" behaviors in captive and wild populations facing human impact challenges. We note that these benefits are often incidental to the designed purpose of these tracking systems, a reflection of the fact that animal welfare is not inimical to research progress, but instead, that the aligned interests between basic research and welfare hold great promise for improvements to animal well-being.
Collapse
Affiliation(s)
- Brenna Knaebe
- Department of Neuroscience and Center for Magnetic Resonance Research, University of Minnesota, Minneapolis, MN 55455, USA; (C.C.W.); (J.Z.); (B.Y.H.)
| | | | | | | |
Collapse
|
21
|
Wang MZ, Hayden BY, Heilbronner SR. A structural and functional subdivision in central orbitofrontal cortex. Nat Commun 2022; 13:3623. [PMID: 35750659 PMCID: PMC9232485 DOI: 10.1038/s41467-022-31273-9] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2021] [Accepted: 06/07/2022] [Indexed: 11/09/2022] Open
Abstract
Economic choice requires many cognitive subprocesses, including stimulus detection, valuation, motor output, and outcome monitoring; many of these subprocesses are associated with the central orbitofrontal cortex (cOFC). Prior work has largely assumed that the cOFC is a single region with a single function. Here, we challenge that unified view with convergent anatomical and physiological results from rhesus macaques. Anatomically, we show that the cOFC can be subdivided according to its much stronger (medial) or weaker (lateral) bidirectional anatomical connectivity with the posterior cingulate cortex (PCC). We call these subregions cOFCm and cOFCl, respectively. These two subregions have notable functional differences. Specifically, cOFCm shows enhanced functional connectivity with PCC, as indicated by both spike-field coherence and mutual information. The cOFCm-PCC circuit, but not the cOFCl-PCC circuit, shows signatures of relaying choice signals from a non-spatial comparison framework to a spatially framed organization and shows a putative bidirectional mutually excitatory pattern.
Collapse
Affiliation(s)
- Maya Zhe Wang
- Department of Neuroscience, University of Minnesota, Minneapolis, MN, 55455, USA.
- Center for Magnetic Resonance Research, University of Minnesota, Minneapolis, MN, 55455, USA.
| | - Benjamin Y Hayden
- Department of Neuroscience, University of Minnesota, Minneapolis, MN, 55455, USA
- Center for Magnetic Resonance Research, University of Minnesota, Minneapolis, MN, 55455, USA
| | - Sarah R Heilbronner
- Department of Neuroscience, University of Minnesota, Minneapolis, MN, 55455, USA
| |
Collapse
|
22
|
Palminteri S, Lebreton M. The computational roots of positivity and confirmation biases in reinforcement learning. Trends Cogn Sci 2022; 26:607-621. [PMID: 35662490 DOI: 10.1016/j.tics.2022.04.005] [Citation(s) in RCA: 24] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2021] [Revised: 04/13/2022] [Accepted: 04/18/2022] [Indexed: 12/16/2022]
Abstract
Humans do not integrate new information objectively: outcomes carrying a positive affective value and evidence confirming one's own prior belief are overweighed. Until recently, theoretical and empirical accounts of the positivity and confirmation biases assumed them to be specific to 'high-level' belief updates. We present evidence against this account. Learning rates in reinforcement learning (RL) tasks, estimated across different contexts and species, generally present the same characteristic asymmetry, suggesting that belief and value updating processes share key computational principles and distortions. This bias generates over-optimistic expectations about the probability of making the right choices and, consequently, generates over-optimistic reward expectations. We discuss the normative and neurobiological roots of these RL biases and their position within the greater picture of behavioral decision-making theories.
Collapse
Affiliation(s)
- Stefano Palminteri
- Laboratoire de Neurosciences Cognitives et Computationnelles, Institut National de la Santé et Recherche Médicale, Paris, France; Département d'Études Cognitives, Ecole Normale Supérieure, Paris, France; Université de Recherche Paris Sciences et Lettres, Paris, France.
| | - Maël Lebreton
- Paris School of Economics, Paris, France; LabNIC, Department of Fundamental Neurosciences, University of Geneva, Geneva, Switzerland; Swiss Center for Affective Science, Geneva, Switzerland.
| |
Collapse
|
23
|
Grossman CD, Bari BA, Cohen JY. Serotonin neurons modulate learning rate through uncertainty. Curr Biol 2022; 32:586-599.e7. [PMID: 34936883 PMCID: PMC8825708 DOI: 10.1016/j.cub.2021.12.006] [Citation(s) in RCA: 40] [Impact Index Per Article: 20.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2020] [Revised: 10/11/2021] [Accepted: 12/03/2021] [Indexed: 12/20/2022]
Abstract
Regulating how fast to learn is critical for flexible behavior. Learning about the consequences of actions should be slow in stable environments, but accelerate when that environment changes. Recognizing stability and detecting change are difficult in environments with noisy relationships between actions and outcomes. Under these conditions, theories propose that uncertainty can be used to modulate learning rates ("meta-learning"). We show that mice behaving in a dynamic foraging task exhibit choice behavior that varied as a function of two forms of uncertainty estimated from a meta-learning model. The activity of dorsal raphe serotonin neurons tracked both types of uncertainty in the foraging task as well as in a dynamic Pavlovian task. Reversible inhibition of serotonin neurons in the foraging task reproduced changes in learning predicted by a simulated lesion of meta-learning in the model. We thus provide a quantitative link between serotonin neuron activity, learning, and decision making.
Collapse
Affiliation(s)
- Cooper D Grossman
- The Solomon H. Snyder Department of Neuroscience, Brain Science Institute, Kavli Neuroscience Discovery Institute, The Johns Hopkins University School of Medicine, 725 N. Wolfe Street, Baltimore, MD 21205, USA
| | - Bilal A Bari
- The Solomon H. Snyder Department of Neuroscience, Brain Science Institute, Kavli Neuroscience Discovery Institute, The Johns Hopkins University School of Medicine, 725 N. Wolfe Street, Baltimore, MD 21205, USA
| | - Jeremiah Y Cohen
- The Solomon H. Snyder Department of Neuroscience, Brain Science Institute, Kavli Neuroscience Discovery Institute, The Johns Hopkins University School of Medicine, 725 N. Wolfe Street, Baltimore, MD 21205, USA.
| |
Collapse
|
24
|
Lefebvre G, Summerfield C, Bogacz R. A Normative Account of Confirmation Bias During Reinforcement Learning. Neural Comput 2022; 34:307-337. [PMID: 34758486 PMCID: PMC7612695 DOI: 10.1162/neco_a_01455] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2021] [Accepted: 07/26/2021] [Indexed: 11/04/2022]
Abstract
Reinforcement learning involves updating estimates of the value of states and actions on the basis of experience. Previous work has shown that in humans, reinforcement learning exhibits a confirmatory bias: when the value of a chosen option is being updated, estimates are revised more radically following positive than negative reward prediction errors, but the converse is observed when updating the unchosen option value estimate. Here, we simulate performance on a multi-arm bandit task to examine the consequences of a confirmatory bias for reward harvesting. We report a paradoxical finding: that confirmatory biases allow the agent to maximize reward relative to an unbiased updating rule. This principle holds over a wide range of experimental settings and is most influential when decisions are corrupted by noise. We show that this occurs because on average, confirmatory biases lead to overestimating the value of more valuable bandits and underestimating the value of less valuable bandits, rendering decisions overall more robust in the face of noise. Our results show how apparently suboptimal learning rules can in fact be reward maximizing if decisions are made with finite computational precision.
Collapse
Affiliation(s)
- Germain Lefebvre
- MRC Brain Network Dynamics Unit, Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford OX3 9DU, U.K.
| | | | - Rafal Bogacz
- MRC Brain Network Dynamics Unit, Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford OX3 9DU, U.K.
| |
Collapse
|
25
|
Soltani A, Koechlin E. Computational models of adaptive behavior and prefrontal cortex. Neuropsychopharmacology 2022; 47:58-71. [PMID: 34389808 PMCID: PMC8617006 DOI: 10.1038/s41386-021-01123-1] [Citation(s) in RCA: 27] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/10/2021] [Revised: 07/19/2021] [Accepted: 07/20/2021] [Indexed: 02/07/2023]
Abstract
The real world is uncertain, and while ever changing, it constantly presents itself in terms of new sets of behavioral options. To attain the flexibility required to tackle these challenges successfully, most mammalian brains are equipped with certain computational abilities that rely on the prefrontal cortex (PFC). By examining learning in terms of internal models associating stimuli, actions, and outcomes, we argue here that adaptive behavior relies on specific interactions between multiple systems including: (1) selective models learning stimulus-action associations through rewards; (2) predictive models learning stimulus- and/or action-outcome associations through statistical inferences anticipating behavioral outcomes; and (3) contextual models learning external cues associated with latent states of the environment. Critically, the PFC combines these internal models by forming task sets to drive behavior and, moreover, constantly evaluates the reliability of actor task sets in predicting external contingencies to switch between task sets or create new ones. We review different models of adaptive behavior to demonstrate how their components map onto this unifying framework and specific PFC regions. Finally, we discuss how our framework may help to better understand the neural computations and the cognitive architecture of PFC regions guiding adaptive behavior.
Collapse
Affiliation(s)
- Alireza Soltani
- Department of Psychological and Brain Sciences, Dartmouth College, Hanover, NH, USA.
| | - Etienne Koechlin
- Institut National de la Sante et de la Recherche Medicale, Universite Pierre et Marie Curie, Ecole Normale Superieure, Paris, France.
| |
Collapse
|
26
|
Overman MJ, Browning M, O'Shea J. Inducing Affective Learning Biases with Cognitive Training and Prefrontal tDCS: A Proof-of-Concept Study. COGNITIVE THERAPY AND RESEARCH 2021; 45:869-884. [PMID: 34720259 PMCID: PMC8550254 DOI: 10.1007/s10608-020-10146-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/28/2020] [Indexed: 11/27/2022]
Abstract
Background Cognitive models of mood disorders emphasize a causal role of negative affective biases in depression. Computational work suggests that these biases may stem from a belief that negative events have a higher information content than positive events, resulting in preferential processing of and learning from negative outcomes. Learning biases therefore represent a promising target for therapeutic interventions. In this proof-of-concept study in healthy volunteers, we assessed the malleability of biased reinforcement learning using a novel cognitive training paradigm and concurrent transcranial direct current stimulation (tDCS). Methods In two studies, young healthy adults completed two sessions of negative (n = 20) or positive (n = 20) training designed to selectively increase learning from loss or win outcomes, respectively. During training active or sham tDCS was applied bilaterally to dorsolateral prefrontal cortex. Analyses tested for changes both in learning rates and win- and loss-driven behaviour. Potential positive/negative emotional transfer of win/loss learning was assessed by a facial emotion recognition task and mood questionnaires. Results Negative and positive training increased learning rates for losses and wins, respectively. With negative training, there was also a trend for win (but not loss) learning rates to decrease over successive task blocks. After negative training, there was evidence for near transfer in the form of an increase in loss-driven choices when participants performed a similar (untrained) task. There was no change in far transfer measures of emotional face processing or mood. tDCS had no effect on any aspect of behaviour. Discussion and Conclusions Negative training induced a mild negative bias in healthy adults as reflected in loss-driven choice behaviour. Prefrontal tDCS had no effect. Further research is needed to assess if this training procedure can be adapted to enhance learning from positive outcomes and whether effects translate to affective disorders. Electronic supplementary material The online version of this article (10.1007/s10608-020-10146-9) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Margot Juliëtte Overman
- Wellcome Centre for Integrative Neuroimaging, FMRIB, Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, OX3 9DU England
| | - Michael Browning
- Department of Psychiatry, University of Oxford, Warneford Hospital, Oxford, OX3 7JX England.,Oxford Health NHS Foundation Trust, Warneford Hospital, Warneford Lane, Oxford, OX3 7JX England
| | - Jacinta O'Shea
- Wellcome Centre for Integrative Neuroimaging, FMRIB, Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, OX3 9DU England.,Department of Psychiatry, University of Oxford, Warneford Hospital, Oxford, OX3 7JX England.,Oxford Centre for Human Brain Activity (OHBA), Wellcome Centre for Integrative Neuroimaging, Department of Psychiatry, University of Oxford, Oxford, OX3 7JX England
| |
Collapse
|
27
|
Abstract
We live in a world that changes on many timescales. To learn and make decisions appropriately, the human brain has evolved to integrate various types of information, such as sensory evidence and reward feedback, on multiple timescales. This is reflected in cortical hierarchies of timescales consisting of heterogeneous neuronal activities and expression of genes related to neurotransmitters critical for learning. We review the recent findings on how timescales of sensory and reward integration are affected by the temporal properties of sensory and reward signals in the environment. Despite existing evidence linking behavioral and neuronal timescales, future studies must examine how neural computations at multiple timescales are adjusted and combined to influence behavior flexibly.
Collapse
Affiliation(s)
- Alireza Soltani
- Department of Psychological and Brain Sciences, Dartmouth College, Moore Hall, 3 Maynard St, Hanover, NH 03755
| | - John D. Murray
- Department of Psychiatry, Yale School of Medicine, 300 George Street, New Haven, CT 06511
| | - Hyojung Seo
- Department of Psychiatry, Yale School of Medicine, 300 George Street, New Haven, CT 06511
| | - Daeyeol Lee
- The Zanvyl Krieger Mind/Brain Institute, Department of Neuroscience, Department of Psychological Sciences, Kavli Neuroscience Discovery Institute, Johns Hopkins University, 3400 North Charles Street, Baltimore, MD 21218
| |
Collapse
|
28
|
Ghambaryan A, Gutkin B, Klucharev V, Koechlin E. Additively Combining Utilities and Beliefs: Research Gaps and Algorithmic Developments. Front Neurosci 2021; 15:704728. [PMID: 34658760 PMCID: PMC8517513 DOI: 10.3389/fnins.2021.704728] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2021] [Accepted: 09/13/2021] [Indexed: 11/20/2022] Open
Abstract
Value-based decision making in complex environments, such as those with uncertain and volatile mapping of reward probabilities onto options, may engender computational strategies that are not necessarily optimal in terms of normative frameworks but may ensure effective learning and behavioral flexibility in conditions of limited neural computational resources. In this article, we review a suboptimal strategy - additively combining reward magnitude and reward probability attributes of options for value-based decision making. In addition, we present computational intricacies of a recently developed model (named MIX model) representing an algorithmic implementation of the additive strategy in sequential decision-making with two options. We also discuss its opportunities; and conceptual, inferential, and generalization issues. Furthermore, we suggest future studies that will reveal the potential and serve the further development of the MIX model as a general model of value-based choice making.
Collapse
Affiliation(s)
- Anush Ghambaryan
- Centre for Cognition and Decision Making, HSE University, Moscow, Russia
- Ecole Normale Supérieure, PSL Research University, Paris, France
| | - Boris Gutkin
- Centre for Cognition and Decision Making, HSE University, Moscow, Russia
- Ecole Normale Supérieure, PSL Research University, Paris, France
| | - Vasily Klucharev
- Centre for Cognition and Decision Making, HSE University, Moscow, Russia
| | - Etienne Koechlin
- Ecole Normale Supérieure, PSL Research University, Paris, France
| |
Collapse
|
29
|
Abstract
Here we argue that the assignment of subjective value to potential outcomes at the time of decision-making is an active process, in which individual features of a potential outcome of varying degrees of abstraction are represented hierarchically and integrated in a weighted fashion to produce an overall value judgment. We implicate the lateral orbital and medial prefrontal cortex in this function, situating these areas more broadly within a hierarchical integration process that takes place throughout the cortex for the ultimate purpose of valuing options to guide decisions.
Collapse
Affiliation(s)
- John P O'Doherty
- Division of Humanities and Social Sciences, California Institute of Technology, Pasadena, CA 91125
| | - Ueli Rutishauser
- Department of Neurosurgery, Cedars-Sinai Medical Center, Los Angeles, CA 90048
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA 91125
| | - Kiyohito Iigaya
- Division of Humanities and Social Sciences, California Institute of Technology, Pasadena, CA 91125
| |
Collapse
|
30
|
Maisson DJN, Cash-Padgett TV, Wang MZ, Hayden BY, Heilbronner SR, Zimmermann J. Choice-relevant information transformation along a ventrodorsal axis in the medial prefrontal cortex. Nat Commun 2021; 12:4830. [PMID: 34376663 PMCID: PMC8355277 DOI: 10.1038/s41467-021-25219-w] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2020] [Accepted: 07/27/2021] [Indexed: 11/17/2022] Open
Abstract
Choice-relevant brain regions in prefrontal cortex may progressively transform information about options into choices. Here, we examine responses of neurons in four regions of the medial prefrontal cortex as macaques performed two-option risky choices. All four regions encode economic variables in similar proportions and show similar putative signatures of key choice-related computations. We provide evidence to support a gradient of function that proceeds from areas 14 to 25 to 32 to 24. Specifically, we show that decodability of twelve distinct task variables increases along that path, consistent with the idea that regions that are higher in the anatomical hierarchy make choice-relevant variables more separable. We also show progressively longer intrinsic timescales in the same series. Together these results highlight the importance of the medial wall in choice, endorse a specific gradient-based organization, and argue against a modular functional neuroanatomy of choice.
Collapse
Affiliation(s)
- David J-N Maisson
- Department of Neuroscience, University of Minnesota, Minneapolis, MN, USA.
- Center for Magnetic Resonance Research, University of Minnesota, Minneapolis, MN, USA.
- Center for Neuroengineering, University of Minnesota, Minneapolis, MN, USA.
- Department of Biomedical Engineering, University of Minnesota, Minneapolis, MN, USA.
| | - Tyler V Cash-Padgett
- Department of Neuroscience, University of Minnesota, Minneapolis, MN, USA
- Center for Magnetic Resonance Research, University of Minnesota, Minneapolis, MN, USA
- Center for Neuroengineering, University of Minnesota, Minneapolis, MN, USA
- Department of Biomedical Engineering, University of Minnesota, Minneapolis, MN, USA
| | - Maya Z Wang
- Department of Neuroscience, University of Minnesota, Minneapolis, MN, USA
- Center for Magnetic Resonance Research, University of Minnesota, Minneapolis, MN, USA
- Center for Neuroengineering, University of Minnesota, Minneapolis, MN, USA
- Department of Biomedical Engineering, University of Minnesota, Minneapolis, MN, USA
| | - Benjamin Y Hayden
- Department of Neuroscience, University of Minnesota, Minneapolis, MN, USA
- Center for Magnetic Resonance Research, University of Minnesota, Minneapolis, MN, USA
- Center for Neuroengineering, University of Minnesota, Minneapolis, MN, USA
- Department of Biomedical Engineering, University of Minnesota, Minneapolis, MN, USA
| | - Sarah R Heilbronner
- Department of Neuroscience, University of Minnesota, Minneapolis, MN, USA
- Center for Magnetic Resonance Research, University of Minnesota, Minneapolis, MN, USA
- Center for Neuroengineering, University of Minnesota, Minneapolis, MN, USA
- Department of Biomedical Engineering, University of Minnesota, Minneapolis, MN, USA
| | - Jan Zimmermann
- Department of Neuroscience, University of Minnesota, Minneapolis, MN, USA
- Center for Magnetic Resonance Research, University of Minnesota, Minneapolis, MN, USA
- Center for Neuroengineering, University of Minnesota, Minneapolis, MN, USA
- Department of Biomedical Engineering, University of Minnesota, Minneapolis, MN, USA
| |
Collapse
|
31
|
Azab H, Hayden BY. Partial integration of the components of value in anterior cingulate cortex. Behav Neurosci 2021; 134:296-308. [PMID: 32658523 DOI: 10.1037/bne0000382] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
Evaluation often involves integrating multiple determinants of value, such as the different possible outcomes in risky choice. A brain region can be placed either before or after a presumed evaluation stage by measuring how responses of its neurons depend on multiple determinants of value. A brain region could also, in principle, show partial integration, which would indicate that it occupies a middle position between (preevaluative) nonintegration and (postevaluative) full integration. Existing mathematical techniques cannot distinguish full from partial integration and therefore risk misidentifying regional function. Here we use a new Bayesian regression-based approach to analyze responses of neurons in dorsal anterior cingulate cortex (dACC) to risky offers. We find that dACC neurons only partially integrate across outcome dimensions, indicating that dACC cannot be assigned to either a pre- or postevaluative position. Neurons in dACC also show putative signatures of value comparison, thereby demonstrating that comparison does not require complete evaluation before proceeding. (PsycInfo Database Record (c) 2020 APA, all rights reserved).
Collapse
Affiliation(s)
- Habiba Azab
- Department of Neuroscience, Center for Magnetic Resonance Research (CMRR), University of Minnesota, Twin Cities
| | - Benjamin Y Hayden
- Department of Neuroscience, Center for Magnetic Resonance Research (CMRR), University of Minnesota, Twin Cities
| |
Collapse
|
32
|
Ferrari-Toniolo S, Bujold PM, Grabenhorst F, Báez-Mendoza R, Schultz W. Nonhuman Primates Satisfy Utility Maximization in Compliance with the Continuity Axiom of Expected Utility Theory. J Neurosci 2021; 41:2964-2979. [PMID: 33542082 PMCID: PMC8018892 DOI: 10.1523/jneurosci.0955-20.2020] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2020] [Revised: 11/13/2020] [Accepted: 11/19/2020] [Indexed: 11/21/2022] Open
Abstract
Expected Utility Theory (EUT), the first axiomatic theory of risky choice, describes choices as a utility maximization process: decision makers assign a subjective value (utility) to each choice option and choose the one with the highest utility. The continuity axiom, central to Expected Utility Theory and its modifications, is a necessary and sufficient condition for the definition of numerical utilities. The axiom requires decision makers to be indifferent between a gamble and a specific probabilistic combination of a more preferred and a less preferred gamble. While previous studies demonstrated that monkeys choose according to combinations of objective reward magnitude and probability, a concept-driven experimental approach for assessing the axiomatically defined conditions for maximizing utility by animals is missing. We experimentally tested the continuity axiom for a broad class of gamble types in 4 male rhesus macaque monkeys, showing that their choice behavior complied with the existence of a numerical utility measure as defined by the economic theory. We used the numerical quantity specified in the continuity axiom to characterize subjective preferences in a magnitude-probability space. This mapping highlighted a trade-off relation between reward magnitudes and probabilities, compatible with the existence of a utility function underlying subjective value computation. These results support the existence of a numerical utility function able to describe choices, allowing for the investigation of the neuronal substrates responsible for coding such rigorously defined quantity.SIGNIFICANCE STATEMENT A common assumption of several economic choice theories is that decisions result from the comparison of subjectively assigned values (utilities). This study demonstrated the compliance of monkey behavior with the continuity axiom of Expected Utility Theory, implying a subjective magnitude-probability trade-off relation, which supports the existence of numerical utility directly linked to the theoretical economic framework. We determined a numerical utility measure able to describe choices, which can serve as a correlate for the neuronal activity in the quest for brain structures and mechanisms guiding decisions.
Collapse
Affiliation(s)
- Simone Ferrari-Toniolo
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge, CB2 3DY, United Kingdom
| | - Philipe M Bujold
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge, CB2 3DY, United Kingdom
| | - Fabian Grabenhorst
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge, CB2 3DY, United Kingdom
| | - Raymundo Báez-Mendoza
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge, CB2 3DY, United Kingdom
- Department of Neurosurgery, Massachusetts General Hospital, Harvard Medical School, Boston, Massachusetts 02114
| | - Wolfram Schultz
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge, CB2 3DY, United Kingdom
| |
Collapse
|
33
|
Garcia B, Cerrotti F, Palminteri S. The description-experience gap: a challenge for the neuroeconomics of decision-making under uncertainty. Philos Trans R Soc Lond B Biol Sci 2021; 376:20190665. [PMID: 33423626 PMCID: PMC7815421 DOI: 10.1098/rstb.2019.0665] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/30/2020] [Indexed: 01/10/2023] Open
Abstract
The experimental investigation of decision-making in humans relies on two distinct types of paradigms, involving either description- or experience-based choices. In description-based paradigms, decision variables (i.e. payoffs and probabilities) are explicitly communicated by means of symbols. In experience-based paradigms decision variables are learnt from trial-by-trial feedback. In the decision-making literature, 'description-experience gap' refers to the fact that different biases are observed in the two experimental paradigms. Remarkably, well-documented biases of description-based choices, such as under-weighting of rare events and loss aversion, do not apply to experience-based decisions. Here, we argue that the description-experience gap represents a major challenge, not only to current decision theories, but also to the neuroeconomics research framework, which relies heavily on the translation of neurophysiological findings between human and non-human primate research. In fact, most non-human primate neurophysiological research relies on behavioural designs that share features of both description- and experience-based choices. As a consequence, it is unclear whether the neural mechanisms built from non-human primate electrophysiology should be linked to description-based or experience-based decision-making processes. The picture is further complicated by additional methodological gaps between human and non-human primate neuroscience research. After analysing these methodological challenges, we conclude proposing new lines of research to address them. This article is part of the theme issue 'Existence and prevalence of economic behaviours among non-human primates'.
Collapse
Affiliation(s)
| | | | - Stefano Palminteri
- Laboratoire de Neurosciences Cognitives et Computationnelles, Ecole Normale Supérieure, Institut National de la Santé et Recherche Médicale, Université de Recherche Paris Sciences et Lettres, Paris, France
| |
Collapse
|
34
|
Kaiser LF, Gruendler TOJ, Speck O, Luettgau L, Jocham G. Dissociable roles of cortical excitation-inhibition balance during patch-leaving versus value-guided decisions. Nat Commun 2021; 12:904. [PMID: 33568654 PMCID: PMC7875994 DOI: 10.1038/s41467-020-20875-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2019] [Accepted: 12/11/2020] [Indexed: 01/30/2023] Open
Abstract
In a dynamic world, it is essential to decide when to leave an exploited resource. Such patch-leaving decisions involve balancing the cost of moving against the gain expected from the alternative patch. This contrasts with value-guided decisions that typically involve maximizing reward by selecting the current best option. Patterns of neuronal activity pertaining to patch-leaving decisions have been reported in dorsal anterior cingulate cortex (dACC), whereas competition via mutual inhibition in ventromedial prefrontal cortex (vmPFC) is thought to underlie value-guided choice. Here, we show that the balance between cortical excitation and inhibition (E/I balance), measured by the ratio of GABA and glutamate concentrations, plays a dissociable role for the two kinds of decisions. Patch-leaving decision behaviour relates to E/I balance in dACC. In contrast, value-guided decision-making relates to E/I balance in vmPFC. These results support mechanistic accounts of value-guided choice and provide evidence for a role of dACC E/I balance in patch-leaving decisions.
Collapse
Affiliation(s)
- Luca F. Kaiser
- grid.411327.20000 0001 2176 9917Biological Psychology of Decision Making, Institute of Experimental Psychology, Heinrich Heine University, Düsseldorf, Germany ,grid.5807.a0000 0001 1018 4307Center for Behavioral Brain Sciences, Otto von Guericke University, Magdeburg, Germany
| | - Theo O. J. Gruendler
- grid.5807.a0000 0001 1018 4307Center for Behavioral Brain Sciences, Otto von Guericke University, Magdeburg, Germany ,Center for Military Mental Health, Military Hospital Berlin, Berlin, Germany
| | - Oliver Speck
- grid.5807.a0000 0001 1018 4307Center for Behavioral Brain Sciences, Otto von Guericke University, Magdeburg, Germany ,grid.418723.b0000 0001 2109 6265Leibniz Institute for Neurobiology, Magdeburg, Germany ,grid.424247.30000 0004 0438 0426German Center for Neurodegenerative Diseases (DZNE), Magdeburg, Germany ,grid.5807.a0000 0001 1018 4307Department of Biomedical Magnetic Resonance, Institute for Physics, Otto von Guericke University, Magdeburg, Germany
| | - Lennart Luettgau
- grid.411327.20000 0001 2176 9917Biological Psychology of Decision Making, Institute of Experimental Psychology, Heinrich Heine University, Düsseldorf, Germany ,grid.5807.a0000 0001 1018 4307Center for Behavioral Brain Sciences, Otto von Guericke University, Magdeburg, Germany
| | - Gerhard Jocham
- grid.411327.20000 0001 2176 9917Biological Psychology of Decision Making, Institute of Experimental Psychology, Heinrich Heine University, Düsseldorf, Germany ,grid.5807.a0000 0001 1018 4307Center for Behavioral Brain Sciences, Otto von Guericke University, Magdeburg, Germany
| |
Collapse
|
35
|
Romain A, Broihanne MH, De Marco A, Ngoubangoye B, Call J, Rebout N, Dufour V. Non-human primates use combined rules when deciding under ambiguity. Philos Trans R Soc Lond B Biol Sci 2021; 376:20190672. [PMID: 33423632 DOI: 10.1098/rstb.2019.0672] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023] Open
Abstract
Decision outcomes in unpredictable environments may not have exact known probabilities. Yet the predictability level of outcomes matters in decisions, and animals, including humans, generally avoid ambiguous options. Managing ambiguity may be more challenging and requires stronger cognitive skills than decision-making under risk, where decisions involve known probabilities. Here we compare decision-making in capuchins, macaques, orangutans, gorillas, chimpanzees and bonobos in risky and ambiguous contexts. Subjects were shown lotteries (a tray of potential rewards, some large, some small) and could gamble a medium-sized food item to obtain one of the displayed rewards. The odds of winning and losing varied and were accessible in the risky context (all rewards were visible) or partially available in the ambiguous context (some rewards were covered). In the latter case, the level of information varied from fully ambiguous (individuals could not guess what was under the covers) to predictable (individuals could guess). None of the species avoided gambling in ambiguous lotteries and gambling rates were high if at least two large rewards were visible. Capuchins and bonobos ignored the covered items and gorillas and macaques took the presence of potential rewards into account, but only chimpanzees and orangutans could consistently build correct expectations about the size of the covered rewards. Chimpanzees and orangutans combined decision rules according to the number of large visible rewards and the level of predictability, a process resembling conditional probabilities assessment in humans. Despite a low sample size, this is the first evidence in non-human primates that a combination of several rules can underlie choices made in an unpredictable environment. Our finding that non-human primates can deal with the uncertainty of an outcome when exchanging one food item for another is a key element to the understanding of the evolutionary origins of economic behaviour. This article is part of the theme issue 'Existence and prevalence of economic behaviours among non-human primates'.
Collapse
Affiliation(s)
- A Romain
- Université de Strasbourg, Strasbourg, France
| | - M-H Broihanne
- Laboratoire de Recherche en Gestion et Economie, EM Strasbourg Business School, Université de Strasbourg, Strasbourg, France
| | - A De Marco
- Fondazione Ethoikos, Radicondoli, Italy.,Parco Faunistico di Piano dell'Abatino, Poggio San Lorenzo, Italy
| | | | - J Call
- School of Psychology and Neuroscience, University of St Andrews, St Andrews, UK.,Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
| | - N Rebout
- PRC, UMR 7247, Cognitive and social ethology team, INRAE-CNRS-IFCE, University of Tours, Tours, France
| | - V Dufour
- PRC, UMR 7247, Cognitive and social ethology team, INRAE-CNRS-IFCE, University of Tours, Tours, France
| |
Collapse
|
36
|
Millroth P, Collsiöö A, Juslin P. Cognitiva Speciebus: Towards a Linnaean Approach to Cognition. Trends Cogn Sci 2020; 25:173-176. [PMID: 33386248 DOI: 10.1016/j.tics.2020.12.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2020] [Revised: 11/25/2020] [Accepted: 12/04/2020] [Indexed: 10/22/2022]
Abstract
Research points to the limitations of approaches to decision-making, that rest on general 'Newtonian principles' derived from unitary a priori conceptions of rationality. To understand how the mind exploits environments, we instead propose a process of more open-ended discovery and systematization in the mold of Linnaeus's famous taxonomy of plants.
Collapse
Affiliation(s)
- Philip Millroth
- Department of Psychology, Uppsala University, Uppsala, Sweden.
| | - August Collsiöö
- Department of Psychology, Uppsala University, Uppsala, Sweden
| | - Peter Juslin
- Department of Psychology, Uppsala University, Uppsala, Sweden
| |
Collapse
|
37
|
Blain B, Rutledge RB. Momentary subjective well-being depends on learning and not reward. eLife 2020; 9:57977. [PMID: 33200989 PMCID: PMC7755387 DOI: 10.7554/elife.57977] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2020] [Accepted: 11/16/2020] [Indexed: 01/20/2023] Open
Abstract
Subjective well-being or happiness is often associated with wealth. Recent studies suggest that momentary happiness is associated with reward prediction error, the difference between experienced and predicted reward, a key component of adaptive behaviour. We tested subjects in a reinforcement learning task in which reward size and probability were uncorrelated, allowing us to dissociate between the contributions of reward and learning to happiness. Using computational modelling, we found convergent evidence across stable and volatile learning tasks that happiness, like behaviour, is sensitive to learning-relevant variables (i.e. probability prediction error). Unlike behaviour, happiness is not sensitive to learning-irrelevant variables (i.e. reward prediction error). Increasing volatility reduces how many past trials influence behaviour but not happiness. Finally, depressive symptoms reduce happiness more in volatile than stable environments. Our results suggest that how we learn about our world may be more important for how we feel than the rewards we actually receive. Many people believe they would be happier if only they had more money. And events such as winning the lottery or receiving a large pay rise do make people happy, at least temporarily. But recent studies suggest that the main factor driving happiness on such occasions is not the size of the reward received. Instead, it is how well that reward matches up with expectations. Receiving a 10% pay rise when you were expecting 1% will make you feel happier than receiving 10% when you had been expecting 20%. This difference between an expected and an actual reward is referred to as a reward prediction error. Reward prediction errors have a key role in learning. They motivate people to repeat behaviours that led to unexpectedly large rewards. But they also enable people to update their beliefs about the world, which is rewarding in itself. Could it be that reward prediction errors are associated with happiness mainly because they help us understand the world a little better than before? To test this idea, Blain and Rutledge designed a task in which the likelihood of receiving a reward was unrelated to the size of the reward. This study design makes it possible to separate out the contributions of learning versus reward to moment-by-moment happiness. In the task, volunteers had to decide which of two cars would win a race. In the ‘stable’ condition, one of the cars always had an 80% chance of winning. In the ‘volatile’ condition, one car had an 80% chance of winning for the first 20 trials. The other car then had an 80% chance of winning for the next 20 trials. The volunteers were not told these probabilities in advance, but had to work them out by playing the game. However, on every trial, the volunteers were shown the reward they would receive if they chose either of the cars and that car went on to win. The size of the rewards varied at random and was unrelated to the likelihood of a car winning. Every few trials, the volunteers were asked to indicate their current level of happiness on a scale. The results showed that volunteers were happier after winning than after losing. On average they were also happier in the stable condition than in the volatile condition. This was especially true for volunteers with pre-existing symptoms of depression. Moreover, happiness after wins did not depend on how large the reward they got was, but instead simply on how surprised they were to win. These results suggest that how we learn about the world around us can be more important for how we feel than rewards we receive directly. Measuring happiness in various types of environment could help us understand factors affecting mental health. The current results suggest, for example, that uncertain environments may be especially unpleasant for people with depression. Further research is needed to understand why this might be the case. In the real world, rewards are often uncertain and infrequent, but learning may nevertheless have the potential to boost happiness.
Collapse
Affiliation(s)
- Bastien Blain
- Max Planck UCL Centre for Computational Psychiatry and Ageing Research, University College London, London, United Kingdom
| | - Robb B Rutledge
- Max Planck UCL Centre for Computational Psychiatry and Ageing Research, University College London, London, United Kingdom.,Wellcome Centre for Human Neuroimaging, University College London, London, United Kingdom.,Department of Psychology, Yale University, New Haven, United States
| |
Collapse
|
38
|
Monosov IE. How Outcome Uncertainty Mediates Attention, Learning, and Decision-Making. Trends Neurosci 2020; 43:795-809. [PMID: 32736849 PMCID: PMC8153236 DOI: 10.1016/j.tins.2020.06.009] [Citation(s) in RCA: 31] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2020] [Revised: 06/16/2020] [Accepted: 06/24/2020] [Indexed: 01/24/2023]
Abstract
Animals and humans evolved sophisticated nervous systems that endowed them with the ability to form internal-models or beliefs and make predictions about the future to survive and flourish in a world in which future outcomes are often uncertain. Crucial to this capacity is the ability to adjust behavioral and learning policies in response to the level of uncertainty. Until recently, the neuronal mechanisms that could underlie such uncertainty-guided control have been largely unknown. In this review, I discuss newly discovered neuronal circuits in primates that represent uncertainty about future rewards and propose how they guide information-seeking, attention, decision-making, and learning to help us survive in an uncertain world. Lastly, I discuss the possible relevance of these findings to learning in artificial systems.
Collapse
Affiliation(s)
- Ilya E Monosov
- Department of Neuroscience and Neurosurgery, Washington University School of Medicine in St. Louis, MO, USA; Department of Biomedical Engineering, Washington University School of Medicine in St. Louis, MO, USA; Washington University Pain Center, Washington University School of Medicine in St. Louis, MO, USA.
| |
Collapse
|
39
|
Farashahi S, Xu J, Wu SW, Soltani A. Learning arbitrary stimulus-reward associations for naturalistic stimuli involves transition from learning about features to learning about objects. Cognition 2020; 205:104425. [PMID: 32958287 DOI: 10.1016/j.cognition.2020.104425] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2019] [Revised: 07/29/2020] [Accepted: 07/31/2020] [Indexed: 10/23/2022]
Abstract
Most cognitive processes are studied using abstract or synthetic stimuli with specific features to fully control what is presented to subjects. However, recent studies have revealed enhancements of cognitive capacities (such as working memory) when processing naturalistic versus abstract stimuli. Using abstract stimuli constructed from distinct visual features (e.g., color and shape), we have recently shown that human subjects can learn multidimensional stimulus-reward associations via initially estimating reward value of individual features (feature-based learning) before gradually switching to learning about reward value of individual stimuli (object-based learning). Here, we examined whether similar strategies are adopted during learning about naturalistic stimuli that are clearly perceived as objects (instead of a combination of features) and contain both task-relevant and irrelevant features. We found that similar to learning about abstract stimuli, subjects initially adopted feature-based learning more strongly before transitioning to object-based learning. However, there were three key differences between learning about naturalistic and abstract stimuli. First, compared with abstract stimuli, the initial learning strategy was less feature-based for naturalistic stimuli. Second, subjects transitioned to object-based learning faster for naturalistic stimuli. Third, unexpectedly, subjects were more likely to adopt feature-based learning for naturalistic stimuli, both at the steady state and overall. These results suggest that despite the stronger tendency to perceive naturalistic stimuli as objects, which leads to greater likelihood of using object-based learning as the initial strategy and a faster transition to object-based learning, the influence of individual features on learning is stronger for these stimuli such that ultimately the object-based strategy is adopted less. Overall, our findings suggest that feature-based learning is a general initial strategy for learning about reward value of all types of multi-dimensional stimuli.
Collapse
Affiliation(s)
- Shiva Farashahi
- Department of Psychological and Brain Sciences, Dartmouth College, NH 03755, United States of America; Flatiron Institute, Simons Foundation, New York, NY 10010, United States of America
| | - Jane Xu
- Department of Psychological and Brain Sciences, Dartmouth College, NH 03755, United States of America
| | - Shih-Wei Wu
- Institute of Neuroscience, National Yang-Ming University, Taipei, Taiwan; Brain Research Center, National Yang-Ming University, Taipei, Taiwan
| | - Alireza Soltani
- Department of Psychological and Brain Sciences, Dartmouth College, NH 03755, United States of America.
| |
Collapse
|
40
|
Multiple timescales of neural dynamics and integration of task-relevant signals across cortex. Proc Natl Acad Sci U S A 2020; 117:22522-22531. [PMID: 32839338 DOI: 10.1073/pnas.2005993117] [Citation(s) in RCA: 47] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023] Open
Abstract
A long-lasting challenge in neuroscience has been to find a set of principles that could be used to organize the brain into distinct areas with specific functions. Recent studies have proposed the orderly progression in the time constants of neural dynamics as an organizational principle of cortical computations. However, relationships between these timescales and their dependence on response properties of individual neurons are unknown, making it impossible to determine how mechanisms underlying such a computational principle are related to other aspects of neural processing. Here, we developed a comprehensive method to simultaneously estimate multiple timescales in neuronal dynamics and integration of task-relevant signals along with selectivity to those signals. By applying our method to neural and behavioral data during a dynamic decision-making task, we found that most neurons exhibited multiple timescales in their response, which consistently increased from parietal to prefrontal and cingulate cortex. While predicting rates of behavioral adjustments, these timescales were not correlated across individual neurons in any cortical area, resulting in independent parallel hierarchies of timescales. Additionally, none of these timescales depended on selectivity to task-relevant signals. Our results not only suggest the existence of multiple canonical mechanisms for increasing timescales of neural dynamics across cortex but also point to additional mechanisms that allow decorrelation of these timescales to enable more flexibility.
Collapse
|
41
|
Zhang L, Gläscher J. A brain network supporting social influences in human decision-making. SCIENCE ADVANCES 2020; 6:eabb4159. [PMID: 32875112 PMCID: PMC7438106 DOI: 10.1126/sciadv.abb4159] [Citation(s) in RCA: 52] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/26/2020] [Accepted: 07/07/2020] [Indexed: 05/20/2023]
Abstract
Humans learn from their own trial-and-error experience and observing others. However, it remains unknown how brain circuits compute expected values when direct learning and social learning coexist in uncertain environments. Using a multiplayer reward learning paradigm with 185 participants (39 being scanned) in real time, we observed that individuals succumbed to the group when confronted with dissenting information but observing confirming information increased their confidence. Leveraging computational modeling and functional magnetic resonance imaging, we tracked direct valuation through experience and vicarious valuation through observation and their dissociable, but interacting neural representations in the ventromedial prefrontal cortex and the anterior cingulate cortex, respectively. Their functional coupling with the right temporoparietal junction representing instantaneous social information instantiated a hitherto uncharacterized social prediction error, rather than a reward prediction error, in the putamen. These findings suggest that an integrated network involving the brain's reward hub and social hub supports social influence in human decision-making.
Collapse
Affiliation(s)
- Lei Zhang
- Institute of Systems Neuroscience, University Medical Center Hamburg-Eppendorf, 20246 Hamburg, Germany
- Neuropsychopharmacology and Biopsychology Unit, Department of Cognition, Emotion, and Methods in Psychology, Faculty of Psychology, University of Vienna, 1010 Vienna, Austria
| | - Jan Gläscher
- Institute of Systems Neuroscience, University Medical Center Hamburg-Eppendorf, 20246 Hamburg, Germany
| |
Collapse
|
42
|
Chau BKH, Law CK, Lopez-Persem A, Klein-Flügge MC, Rushworth MFS. Consistent patterns of distractor effects during decision making. eLife 2020; 9:e53850. [PMID: 32628109 PMCID: PMC7371422 DOI: 10.7554/elife.53850] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2019] [Accepted: 07/06/2020] [Indexed: 01/24/2023] Open
Abstract
The value of a third potential option or distractor can alter the way in which decisions are made between two other options. Two hypotheses have received empirical support: that a high value distractor improves the accuracy with which decisions between two other options are made and that it impairs accuracy. Recently, however, it has been argued that neither observation is replicable. Inspired by neuroimaging data showing that high value distractors have different impacts on prefrontal and parietal regions, we designed a dual route decision-making model that mimics the neural signals of these regions. Here we show in the dual route model and empirical data that both enhancement and impairment effects are robust phenomena but predominate in different parts of the decision space defined by the options' and the distractor's values. However, beyond these constraints, both effects co-exist under similar conditions. Moreover, both effects are robust and observable in six experiments.
Collapse
Affiliation(s)
- Bolton KH Chau
- Department of Rehabilitation Sciences, The Hong Kong Polytechnic UniversityHong KongHong Kong
- University Research Facility in Behavioral and Systems Neuroscience, The Hong Kong Polytechnic UniversityHong KongHong Kong
| | - Chun-Kit Law
- Department of Rehabilitation Sciences, The Hong Kong Polytechnic UniversityHong KongHong Kong
| | - Alizée Lopez-Persem
- Wellcome Centre for Integrative Neuroimaging (WIN), Department of Experimental Psychology, University of OxfordOxfordUnited Kingdom
- FrontLab, Paris Brain Institute (ICM), Inserm U 1127, CNRS UMR 7225, Sorbonne UniversitéParisFrance
| | - Miriam C Klein-Flügge
- Wellcome Centre for Integrative Neuroimaging (WIN), Department of Experimental Psychology, University of OxfordOxfordUnited Kingdom
| | - Matthew FS Rushworth
- Wellcome Centre for Integrative Neuroimaging (WIN), Department of Experimental Psychology, University of OxfordOxfordUnited Kingdom
| |
Collapse
|
43
|
Mysore SP, Kothari NB. Mechanisms of competitive selection: A canonical neural circuit framework. eLife 2020; 9:e51473. [PMID: 32431293 PMCID: PMC7239658 DOI: 10.7554/elife.51473] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2019] [Accepted: 04/02/2020] [Indexed: 01/25/2023] Open
Abstract
Competitive selection, the transformation of multiple competing sensory inputs and internal states into a unitary choice, is a fundamental component of animal behavior. Selection behaviors have been studied under several intersecting umbrellas including decision-making, action selection, perceptual categorization, and attentional selection. Neural correlates of these behaviors and computational models have been investigated extensively. However, specific, identifiable neural circuit mechanisms underlying the implementation of selection remain elusive. Here, we employ a first principles approach to map competitive selection explicitly onto neural circuit elements. We decompose selection into six computational primitives, identify demands that their execution places on neural circuit design, and propose a canonical neural circuit framework. The resulting framework has several links to neural literature, indicating its biological feasibility, and has several common elements with prominent computational models, suggesting its generality. We propose that this framework can help catalyze experimental discovery of the neural circuit underpinnings of competitive selection.
Collapse
Affiliation(s)
- Shreesh P Mysore
- Department of Psychological and Brain Sciences, Johns Hopkins UniversityBaltimoreUnited States
- The Solomon H. Snyder Department of Neuroscience, Johns Hopkins UniversityBaltimoreUnited States
| | - Ninad B Kothari
- Department of Psychological and Brain Sciences, Johns Hopkins UniversityBaltimoreUnited States
| |
Collapse
|
44
|
Schoemann M, Scherbaum S. From high- to one-dimensional dynamics of decision making: testing simplifications in attractor models. Cogn Process 2020; 21:303-313. [PMID: 32016686 PMCID: PMC7203584 DOI: 10.1007/s10339-020-00953-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2019] [Accepted: 01/22/2020] [Indexed: 11/14/2022]
Abstract
Computational models introduce simplifications that need to be understood and validated. For attractor models of decision making, the main simplification is the high-level representation of different sub-processes of the complex decision system in one dynamic description of the overall process dynamics. This simplification implies that the overall process dynamics of the decision system are independent from specific values handled in different sub-processes. Here, we test the validity of this simplification empirically by investigating choice perseveration in a nonverbal, value-based decision task. Specifically, we tested whether choice perseveration occurred irrespectively of the attribute dimension as suggested by a simulation of the computational model. We find evidence supporting the validity of the simplification. We conclude that the simplification might capture mechanistic aspects of decision-making processes, and that the summation of the overall process dynamics of decision systems into one single variable is a valid approach in computational modeling. Supplement materials such as empirical data, analysis scripts, and the computational model are publicly available at the Open Science Framework (osf.io/7fb5q).
Collapse
Affiliation(s)
- Martin Schoemann
- Department of Psychology, Technische Universität Dresden, Zellescher Weg 17, 01069 Dresden, Germany
- Department of Management/MAPP, Aarhus University, Fuglesangs Allé 4, 8210 Aarhus V, Denmark
| | - Stefan Scherbaum
- Department of Psychology, Technische Universität Dresden, Zellescher Weg 17, 01069 Dresden, Germany
| |
Collapse
|
45
|
Balasubramani PP, Pesce MC, Hayden BY. Activity in orbitofrontal neuronal ensembles reflects inhibitory control. Eur J Neurosci 2019; 51:2033-2051. [PMID: 31803972 DOI: 10.1111/ejn.14638] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2018] [Revised: 10/28/2019] [Accepted: 11/28/2019] [Indexed: 11/27/2022]
Abstract
Stopping, or inhibition, is a form of self-control that is a core element of flexible and adaptive behavior. Its neural origins remain unclear. Some views hold that inhibition decisions reflect the aggregation of widespread and diverse pieces of information, including information arising in ostensible core reward regions (i.e., outside the canonical executive system). We recorded activity of single neurons in the orbitofrontal cortex (OFC) of macaques, a region associated with economic decisions, and whose role in inhibition is debated. Subjects performed a classic inhibition task known as the stop signal task. Ensemble decoding analyses reveal a clear firing rate pattern that distinguishes successful from failed inhibition and that begins after the stop signal and before the stop signal reaction time (SSRT). We also found a different and orthogonal ensemble pattern that distinguishes successful from failed stopping before the beginning of the trial. These signals were distinct from, and orthogonal to, value encoding, which was also observed in these neurons. The timing of the early and late signals was, respectively, consistent with the idea that neuronal activity in OFC encodes inhibition both proactively and reactively.
Collapse
Affiliation(s)
| | | | - Benjamin Y Hayden
- Department of Neuroscience, Center for Magnetic Resonance Research, and Center for Neuroengineering, University of Minnesota, Minneapolis, MN, USA
| |
Collapse
|
46
|
Koechlin E. Human Decision-Making beyond the Rational Decision Theory. Trends Cogn Sci 2019; 24:4-6. [PMID: 31767211 DOI: 10.1016/j.tics.2019.11.001] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2019] [Accepted: 11/01/2019] [Indexed: 11/30/2022]
Abstract
Two recent studies (Farashahi et al. and Rouault et al.) provide compelling evidence refuting the Subjective Expected Utility (SEU) hypothesis as a ground model describing human decision-making. Together, these studies pave the way towards a new model that subsumes the notion of decision-making and adaptive behavior into a single account.
Collapse
Affiliation(s)
- Etienne Koechlin
- Ecole Normale Superieure, PSL Research University, Université Pierre et Marie Curie, Institut National de la Santé et de la Recherche Biomédicale, Paris, France.
| |
Collapse
|
47
|
Eisenreich BR, Hayden BY, Zimmermann J. Macaques are risk-averse in a freely moving foraging task. Sci Rep 2019; 9:15091. [PMID: 31636348 PMCID: PMC6803699 DOI: 10.1038/s41598-019-51442-z] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2019] [Accepted: 09/26/2019] [Indexed: 11/09/2022] Open
Abstract
Rhesus macaques (Macaca mulatta) appear to be robustly risk-seeking in computerized gambling tasks typically used for electrophysiology. This behavior distinguishes them from many other animals, which are risk-averse, albeit measured in more naturalistic contexts. We wondered whether macaques' risk preferences reflect their evolutionary history or derive from the less naturalistic elements of task design associated with the demands of physiological recording. We assessed macaques' risk attitudes in a task that is somewhat more naturalistic than many that have previously been used: subjects foraged at four feeding stations in a large enclosure. Patches (i.e., stations), provided either stochastically or non-stochastically depleting rewards. Subjects' patch residence times were longer at safe than at risky stations, indicating a preference for safe options. This preference was not attributable to a win-stay-lose-shift heuristic and reversed as the environmental richness increased. These findings highlight the lability of risk attitudes in macaques and support the hypothesis that the ecological validity of a task can influence the expression of risk preference.
Collapse
Affiliation(s)
- Benjamin R Eisenreich
- Department of Neuroscience, Center for Magnetic Resonance Research, and Center for Neuroengineering University of Minnesota, Minneapolis, MN, 55455, USA.
| | - Benjamin Y Hayden
- Department of Neuroscience, Center for Magnetic Resonance Research, and Center for Neuroengineering University of Minnesota, Minneapolis, MN, 55455, USA
| | - Jan Zimmermann
- Department of Neuroscience, Center for Magnetic Resonance Research, and Center for Neuroengineering University of Minnesota, Minneapolis, MN, 55455, USA
| |
Collapse
|
48
|
Spitmaan M, Horno O, Chu E, Soltani A. Combinations of low-level and high-level neural processes account for distinct patterns of context-dependent choice. PLoS Comput Biol 2019; 15:e1007427. [PMID: 31609970 PMCID: PMC6812848 DOI: 10.1371/journal.pcbi.1007427] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2019] [Revised: 10/24/2019] [Accepted: 09/20/2019] [Indexed: 11/18/2022] Open
Abstract
Context effects have been explained by either low-level neural adjustments or high-level cognitive processes but not their combination. It is currently unclear how these processes interact to shape individuals’ responses to context. Here, we used a large cohort of human subjects in experiments involving choice between two or three gambles in order to study the dependence of context effects on neural adaptation and individuals’ risk attitudes. Our experiments did not provide any evidence that neural adaptation on long timescales (~100 trials) contributes to context effects. Using post-hoc analyses we identified two groups of subjects with distinct patterns of responses to decoys, both of which depended on individuals’ risk aversion. Subjects in the first group exhibited strong, consistent decoy effects and became more risk averse due to decoy presentation. In contrast, subjects in the second group did not show consistent decoy effects and became more risk seeking. The degree of change in risk aversion due to decoy presentation was positively correlated with the original degrees of risk aversion. To explain these results and reveal underlying neural mechanisms, we developed new models incorporating both low- and high-level processes and used these models to fit individuals’ choice behavior. We found that observed distinct patterns of decoy effects can be explained by a combination of adjustments in neural representations and competitive weighting of reward attributes, both of which depend on risk aversion but in opposite directions. Altogether, our results demonstrate how a combination of low- and high-level processes shapes choice behavior in more naturalistic settings, modulates overall risk preference, and explains distinct behavioral phenotypes. A large body of experimental work has illustrated that the introduction of a new, and often irrelevant, option can influence preference among the existing options, a phenomenon referred to as context or decoy effects. For example, introducing a new option that is worse than one of the two existing options in all its attributes but better than the alternative option in some attributes (and thus should not ever be selected) can increase the preference for the former option. Context effects have been explained by high-level cognitive processes—such as comparisons and competitions between attributes—or low-level adjustments of neural representations. However, it is unclear how these processes interact to shape individuals’ responses to context. Here, we show that both high-level cognitive processes and low-level neural adjustments shift risk preference during choice between multiple risky options but in opposite directions. Moreover, we demonstrate that combinations of these processes can account for distinct patterns of context effects in human subjects.
Collapse
Affiliation(s)
- Mehran Spitmaan
- Department of Psychological and Brain Sciences, Dartmouth College, Hanover, New Hamphire, United States of America
| | - Oihane Horno
- Champalimaud Research, Champalimaud Centre for the Unknown, Lisbon, Portugal
| | - Emily Chu
- Department of Psychological and Brain Sciences, Dartmouth College, Hanover, New Hamphire, United States of America
| | - Alireza Soltani
- Department of Psychological and Brain Sciences, Dartmouth College, Hanover, New Hamphire, United States of America
- * E-mail:
| |
Collapse
|