1
|
Inkeller J, Knakker B, Kovács P, Lendvai B, Hernádi I. Intrinsic anticipatory motives in non-human primate food consumption behavior. iScience 2024; 27:109459. [PMID: 38558930 PMCID: PMC10981109 DOI: 10.1016/j.isci.2024.109459] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2023] [Revised: 01/18/2024] [Accepted: 03/07/2024] [Indexed: 04/04/2024] Open
Abstract
Future-oriented behavior is regarded as a cornerstone of human cognition. One key phenomenon through which future orientation can be studied is the delay of gratification, when consumption of an immediate reward is withstood to achieve a larger reward later. The delays used in animal delay of gratification paradigms are rather short to be considered relevant for studying human-like future orientation. Here, for the first time, we show that rhesus macaques exhibit human-relevant future orientation downregulating their operant food consumption in anticipation of a nutritionally equivalent but more palatable food with an unprecedentedly long delay of approximately 2.5 h. Importantly, this behavior is not a result of conditioning but intrinsic to the animals. Our results show that the cognitive time horizon of primates, when tested in ecologically valid foraging-like experiments, extends much further into the future than previously considered, opening up new avenues for translational biomedical research.
Collapse
Affiliation(s)
- Judit Inkeller
- Grastyán E. Translational Research Centre, University of Pécs, Pécs, Hungary
| | - Balázs Knakker
- Grastyán E. Translational Research Centre, University of Pécs, Pécs, Hungary
| | - Péter Kovács
- Department of Pharmacology and Drug Safety Research, Gedeon Richter Plc., Budapest, Hungary
| | - Balázs Lendvai
- Department of Pharmacology and Drug Safety Research, Gedeon Richter Plc., Budapest, Hungary
- Richter Department, Semmelweis University, Budapest, Hungary
| | - István Hernádi
- Grastyán E. Translational Research Centre, University of Pécs, Pécs, Hungary
- Department of Neurobiology, Institute of Biology, Faculty of Sciences, University of Pécs, Pécs, Hungary
- Institute of Physiology, Medical School, University of Pécs, Pécs, Hungary
| |
Collapse
|
2
|
Bustamante LA, Oshinowo T, Lee JR, Tong E, Burton AR, Shenhav A, Cohen JD, Daw ND. Effort Foraging Task reveals positive correlation between individual differences in the cost of cognitive and physical effort in humans. Proc Natl Acad Sci U S A 2023; 120:e2221510120. [PMID: 38064507 PMCID: PMC10723129 DOI: 10.1073/pnas.2221510120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2023] [Accepted: 10/26/2023] [Indexed: 12/17/2023] Open
Abstract
Effort-based decisions, in which people weigh potential future rewards against effort costs required to achieve those rewards involve both cognitive and physical effort, though the mechanistic relationship between them is not yet understood. Here, we use an individual differences approach to isolate and measure the computational processes underlying effort-based decisions and test the association between cognitive and physical domains. Patch foraging is an ecologically valid reward rate maximization problem with well-developed theoretical tools. We developed the Effort Foraging Task, which embedded cognitive or physical effort into patch foraging, to quantify the cost of both cognitive and physical effort indirectly, by their effects on foraging choices. Participants chose between harvesting a depleting patch, or traveling to a new patch that was costly in time and effort. Participants' exit thresholds (reflecting the reward they expected to receive by harvesting when they chose to travel to a new patch) were sensitive to cognitive and physical effort demands, allowing us to quantify the perceived effort cost in monetary terms. The indirect sequential choice style revealed effort-seeking behavior in a minority of participants (preferring high over low effort) that has apparently been missed by many previous approaches. Individual differences in cognitive and physical effort costs were positively correlated, suggesting that these are perceived and processed in common. We used canonical correlation analysis to probe the relationship of task measures to self-reported affect and motivation, and found correlations of cognitive effort with anxiety, cognitive function, behavioral activation, and self-efficacy, but no similar correlations with physical effort.
Collapse
Affiliation(s)
- Laura A. Bustamante
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ08544
- Department of Psychological and Brain Sciences, Washington University in Saint Louis, Saint Louis, MO63130
| | - Temitope Oshinowo
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ08544
| | - Jeremy R. Lee
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ08544
| | - Elizabeth Tong
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ08544
| | - Allison R. Burton
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ08544
| | - Amitai Shenhav
- Department of Cognitive, Linguistic, and Psychological Sciences, Brown University, Providence, RI02912
- Carney Institute for Brain Science, Brown University, Providence, RI02906
| | - Jonathan D. Cohen
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ08544
| | - Nathaniel D. Daw
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ08544
| |
Collapse
|
3
|
Garcia M, Gupta S, Wikenheiser AM. Sex differences in patch-leaving foraging decisions in rats. OXFORD OPEN NEUROSCIENCE 2023; 2:kvad011. [PMID: 38596244 PMCID: PMC11003400 DOI: 10.1093/oons/kvad011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/16/2023] [Revised: 10/11/2023] [Accepted: 10/12/2023] [Indexed: 04/11/2024]
Abstract
The ubiquity, importance, and sophistication of foraging behavior makes it an ideal platform for studying naturalistic decision making in animals. We developed a spatial patch-foraging task for rats, in which subjects chose how long to remain in one foraging patch as the rate of food earnings steadily decreased. The cost of seeking out a new location was varied across sessions. The behavioral task was designed to mimic the structure of natural foraging problems, where distinct spatial locations are associated with different reward statistics, and decisions require navigation and movement through space. Male and female Long-Evans rats generally followed the predictions of theoretical models of foraging, albeit with a consistent tendency to persist with patches for too long compared to behavioral strategies that maximize food intake rate. The tendency to choose overly-long patch residence times was stronger in male rats. We also observed sex differences in locomotion as rats performed the task, but these differences in movement only partially accounted for the differences in patch residence durations observed between male and female rats. Together, these results suggest a nuanced relationship between movement, sex, and foraging decisions.
Collapse
Affiliation(s)
- Marissa Garcia
- Department of Psychology, University of California, Los Angeles, Los Angeles, CA 90095, USA
| | - Sukriti Gupta
- Department of Psychology, University of California, Los Angeles, Los Angeles, CA 90095, USA
| | - Andrew M Wikenheiser
- Department of Psychology, University of California, Los Angeles, Los Angeles, CA 90095, USA
- Brain Research Institute, University of California, Los Angeles, Los Angeles, CA 90095, USA
| |
Collapse
|
4
|
Garcia M, Gupta S, Wikenheiser AM. Sex differences in patch-leaving foraging decisions in rats. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.02.19.529135. [PMID: 36824852 PMCID: PMC9949151 DOI: 10.1101/2023.02.19.529135] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/21/2023]
Abstract
The ubiquity, importance, and sophistication of foraging behavior makes it an ideal platform for studying naturalistic decision making in animals. We developed a spatial patch-foraging task for rats, in which subjects chose how long to remain in one foraging patch as the rate of food earnings steadily decreased. The cost of seeking out a new location was varied across sessions. The behavioral task was designed to mimic the structure of natural foraging problems, where distinct spatial locations are associated with different reward statistics, and decisions require navigation and movement through space. Male and female Long-Evans rats generally followed the predictions of theoretical models of foraging, albeit with a consistent tendency to persist with patches for too long compared to behavioral strategies that maximize food intake rate. The tendency to choose overly-long patch residence times was stronger in male rats. We also observed sex differences in locomotion as rats performed the task, but these differences in movement only partially accounted for the differences in patch residence durations observed between male and female rats. Together, these results suggest a nuanced relationship between movement, sex, and foraging decisions.
Collapse
Affiliation(s)
- Marissa Garcia
- Department of Psychology, University of California, Los Angeles, Los Angeles, California 90095
- Current address: Neurosciences Graduate Program, University of California, San Diego, San Diego, CA 92093
| | - Sukriti Gupta
- Department of Psychology, University of California, Los Angeles, Los Angeles, California 90095
| | - Andrew M. Wikenheiser
- Department of Psychology, University of California, Los Angeles, Los Angeles, California 90095
- Brain Research Institute, University of California, Los Angeles, Los Angeles, California 90095
| |
Collapse
|
5
|
Gancarz AM, Mitchell SH, George AM, Martin CD, Turk MC, Bool HM, Aktar F, Kwarteng F, Palmer AA, Meyer PJ, Richards JB, Dietz DM, Ishiwari K. Reward maximization assessed using a sequential patch depletion task in a large sample of heterogeneous stock rats. Sci Rep 2023; 13:7027. [PMID: 37120610 PMCID: PMC10148848 DOI: 10.1038/s41598-023-34179-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2023] [Accepted: 04/25/2023] [Indexed: 05/01/2023] Open
Abstract
Choice behavior requires animals to evaluate both short- and long-term advantages and disadvantages of all potential alternatives. Impulsive choice is traditionally measured in laboratory tasks by utilizing delay discounting (DD), a paradigm that offers a choice between a smaller immediate reward, or a larger more delayed reward. This study tested a large sample of Heterogeneous Stock (HS) male (n = 896) and female (n = 898) rats, part of a larger genetic study, to investigate whether measures of reward maximization overlapped with traditional models of delay discounting via the patch depletion model using a Sequential Patch Depletion procedure. In this task, rats were offered a concurrent choice between two water "patches" and could elect to "stay" in the current patch or "leave" for an alternative patch. Staying in the current patch resulted in decreasing subsequent reward magnitudes, whereas the choice to leave a patch was followed by a delay and a resetting to the maximum reward magnitude. Based on the delay in a given session, different visit durations were necessary to obtain the maximum number of rewards. Visit duration may be analogous to an indifference point in traditional DD tasks. Males and females did not significantly differ on traditional measures of DD (e.g. delay gradient; AUC). When examining measures of patch utilization, females made fewer patch changes at all delays and spent more time in the patch before leaving for the alternative patch compared to males. Consistent with this, there was some evidence that females deviated from reward maximization more than males. However, when controlling for body weight, females had a higher normalized rate of reinforcement than males. Measures of reward maximization were only weakly associated with traditional DD measures and may represent distinctive underlying processes. Taken together, females performance differed from males with regard to reward maximization that were not observed utilizing traditional measures of DD, suggesting that the patch depletion model was more sensitive to modest sex differences when compared to traditional DD measures in a large sample of HS rats.
Collapse
Affiliation(s)
- Amy M Gancarz
- Department of Psychology, California State University, Bakersfield, Bakersfield, CA, 93311, USA.
| | - Suzanne H Mitchell
- Department of Behavioral Neuroscience, Oregon Health & Science University, Portland, OR, 97239, USA
- Department of Psychiatry, Oregon Health & Science University, Portland, OR, 97239, USA
- Oregon Institute for Occupational Health Sciences, Oregon Health & Science University, Portland, OR, 97239, USA
| | - Anthony M George
- Clinical and Research Institute on Addictions, University at Buffalo, Buffalo, NY, 14203, USA
| | - Connor D Martin
- Clinical and Research Institute on Addictions, University at Buffalo, Buffalo, NY, 14203, USA
- Department of Pharmacology and Toxicology, Jacobs School of Medicine and Biomedical Sciences, University at Buffalo, Buffalo, NY, 14203, USA
| | - Marisa C Turk
- Clinical and Research Institute on Addictions, University at Buffalo, Buffalo, NY, 14203, USA
- Department of Pharmacology and Toxicology, Jacobs School of Medicine and Biomedical Sciences, University at Buffalo, Buffalo, NY, 14203, USA
| | - Heather M Bool
- Clinical and Research Institute on Addictions, University at Buffalo, Buffalo, NY, 14203, USA
- Department of Pharmacology and Toxicology, Jacobs School of Medicine and Biomedical Sciences, University at Buffalo, Buffalo, NY, 14203, USA
| | - Fahmida Aktar
- Clinical and Research Institute on Addictions, University at Buffalo, Buffalo, NY, 14203, USA
- Department of Pharmacology and Toxicology, Jacobs School of Medicine and Biomedical Sciences, University at Buffalo, Buffalo, NY, 14203, USA
| | - Francis Kwarteng
- Clinical and Research Institute on Addictions, University at Buffalo, Buffalo, NY, 14203, USA
- Department of Pharmacology and Toxicology, Jacobs School of Medicine and Biomedical Sciences, University at Buffalo, Buffalo, NY, 14203, USA
| | - Abraham A Palmer
- Department of Psychiatry, University of California San Diego, La Jolla, CA, 92093, USA
- Institute for Genomic Medicine, University of California San Diego, La Jolla, CA, 92093, USA
| | - Paul J Meyer
- Department of Psychology, University at Buffalo, Buffalo, NY, 14260, USA
| | - Jerry B Richards
- Department of Pharmacology and Toxicology, Jacobs School of Medicine and Biomedical Sciences, University at Buffalo, Buffalo, NY, 14203, USA
| | - David M Dietz
- Clinical and Research Institute on Addictions, University at Buffalo, Buffalo, NY, 14203, USA
- Department of Pharmacology and Toxicology, Jacobs School of Medicine and Biomedical Sciences, University at Buffalo, Buffalo, NY, 14203, USA
| | - Keita Ishiwari
- Clinical and Research Institute on Addictions, University at Buffalo, Buffalo, NY, 14203, USA.
- Department of Pharmacology and Toxicology, Jacobs School of Medicine and Biomedical Sciences, University at Buffalo, Buffalo, NY, 14203, USA.
| |
Collapse
|
6
|
Harhen NC, Bornstein AM. Overharvesting in human patch foraging reflects rational structure learning and adaptive planning. Proc Natl Acad Sci U S A 2023; 120:e2216524120. [PMID: 36961923 PMCID: PMC10068834 DOI: 10.1073/pnas.2216524120] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2022] [Accepted: 02/11/2023] [Indexed: 03/26/2023] Open
Abstract
Patch foraging presents a sequential decision-making problem widely studied across organisms-stay with a current option or leave it in search of a better alternative? Behavioral ecology has identified an optimal strategy for these decisions, but, across species, foragers systematically deviate from it, staying too long with an option or "overharvesting" relative to this optimum. Despite the ubiquity of this behavior, the mechanism underlying it remains unclear and an object of extensive investigation. Here, we address this gap by approaching foraging as both a decision-making and learning problem. Specifically, we propose a model in which foragers 1) rationally infer the structure of their environment and 2) use their uncertainty over the inferred structure representation to adaptively discount future rewards. We find that overharvesting can emerge from this rational statistical inference and uncertainty adaptation process. In a patch-leaving task, we show that human participants adapt their foraging to the richness and dynamics of the environment in ways consistent with our model. These findings suggest that definitions of optimal foraging could be extended by considering how foragers reduce and adapt to uncertainty over representations of their environment.
Collapse
Affiliation(s)
- Nora C. Harhen
- Department of Cognitive Sciences, University of California, Irvine, CA92697
| | - Aaron M. Bornstein
- Department of Cognitive Sciences, University of California, Irvine, CA92697
- Center for the Neurobiology of Learning and Memory, University of California, Irvine, CA92697
| |
Collapse
|
7
|
Gancarz AM, Mitchell SH, George AM, Martin CD, Turk MC, Bool HM, Aktar F, Kwarteng F, Palmer AA, Meyer PJ, Richards JB, Dietz DM, Isiwari K. Reward Maximization Assessed Using a Sequential Patch Depletion Task in a Large Sample of Heterogeneous Stock Rats. RESEARCH SQUARE 2023:rs.3.rs-2525080. [PMID: 36778344 PMCID: PMC9915773 DOI: 10.21203/rs.3.rs-2525080/v1] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
Choice behavior requires animals to evaluate both short- and long-term advantages and disadvantages of all potential alternatives. Impulsive choice is traditionally measured in laboratory tasks by utilizing delay discounting (DD), a paradigm that offers a choice between a smaller immediate reward, or a larger more delayed reward. This study tested a large sample of Heterogeneous Stock (HS) male (n = 896) and female (n = 898) rats, part of a larger genetic study, to investigate whether measures of reward maximization overlapped with traditional models of delay discounting via the patch depletion model using a Sequential Patch Depletion procedure. In this task, rats were offered a concurrent choice between two water "patches" and could elect to "stay" in the current patch or "leave" for an alternative patch. Staying in the current patch resulted in decreasing subsequent reward magnitudes, whereas the choice to leave a patch was followed by a delay and a resetting to the maximum reward magnitude. Based on the delay in a given session, different visit durations were necessary to obtain the maximum number of rewards. Visit duration may be analogous to an indifference point in traditional DD tasks. While differences in traditional DD measures (e.g., delay gradient) have been detected between males and females, these effects were small and inconsistent. However, when examining measures of reward maximization, females made fewer patch changes at all delays and spent more time in the patch before leaving for the alternative patch compared to males. This pattern of choice resulted in males having a higher rate of reinforcement than females. Consistent with this, there was some evidence that females deviated from the optimal more, leading to less reward. Measures of reward maximization were only weakly associated with traditional DD measures and may represent distinctive underlying processes. Taken together, females performance differed from males with regard to reward maximization that were not observed utilizing traditional measures of DD, suggesting that the patch depletion model was more sensitive to modest sex differences when compared to traditional DD measures in a large sample of HS rats.
Collapse
|
8
|
Lind EB, Sweis BM, Asp AJ, Esguerra M, Silvis KA, David Redish A, Thomas MJ. A quadruple dissociation of reward-related behaviour in mice across excitatory inputs to the nucleus accumbens shell. Commun Biol 2023; 6:119. [PMID: 36717646 PMCID: PMC9886947 DOI: 10.1038/s42003-023-04429-6] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2021] [Accepted: 01/05/2023] [Indexed: 02/01/2023] Open
Abstract
The nucleus accumbens shell (NAcSh) is critically important for reward valuations, yet it remains unclear how valuation information is integrated in this region to drive behaviour during reinforcement learning. Using an optogenetic spatial self-stimulation task in mice, here we show that contingent activation of different excitatory inputs to the NAcSh change expression of different reward-related behaviours. Our data indicate that medial prefrontal inputs support place preference via repeated actions, ventral hippocampal inputs consistently promote place preferences, basolateral amygdala inputs produce modest place preferences but as a byproduct of increased sensitivity to time investments, and paraventricular inputs reduce place preferences yet do not produce full avoidance behaviour. These findings suggest that each excitatory input provides distinct information to the NAcSh, and we propose that this reflects the reinforcement of different credit assignment functions. Our finding of a quadruple dissociation of NAcSh input-specific behaviours provides insights into how types of information carried by distinct inputs to the NAcSh could be integrated to help drive reinforcement learning and situationally appropriate behavioural responses.
Collapse
Affiliation(s)
- Erin B Lind
- Department of Neuroscience, University of Minnesota, 6-145 Jackson Hall, 321 Church St SE, Minneapolis, MN, 55455, USA
- Medical Discovery Team on Addiction, University of Minnesota, 3-432 McGuire Translational Research Facility, 2001 6th St SE, Minneapolis, MN, 55455, USA
| | - Brian M Sweis
- Department of Neuroscience, University of Minnesota, 6-145 Jackson Hall, 321 Church St SE, Minneapolis, MN, 55455, USA
- Medical Discovery Team on Addiction, University of Minnesota, 3-432 McGuire Translational Research Facility, 2001 6th St SE, Minneapolis, MN, 55455, USA
- Department of Psychiatry, Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, 1 Gustave L. Levy Pl, New York, NY, 10029, USA
| | - Anders J Asp
- Department of Neuroscience, University of Minnesota, 6-145 Jackson Hall, 321 Church St SE, Minneapolis, MN, 55455, USA
- Rehabilitation Medicine Research Center, Department of Physical Medicine and Rehabilitation, Mayo Clinic, 200 First St SW, Rochester, MN, 55905, USA
| | - Manuel Esguerra
- Department of Neuroscience, University of Minnesota, 6-145 Jackson Hall, 321 Church St SE, Minneapolis, MN, 55455, USA
- Medical Discovery Team on Addiction, University of Minnesota, 3-432 McGuire Translational Research Facility, 2001 6th St SE, Minneapolis, MN, 55455, USA
| | - Keelia A Silvis
- Department of Neuroscience, University of Minnesota, 6-145 Jackson Hall, 321 Church St SE, Minneapolis, MN, 55455, USA
- Medical Discovery Team on Addiction, University of Minnesota, 3-432 McGuire Translational Research Facility, 2001 6th St SE, Minneapolis, MN, 55455, USA
| | - A David Redish
- Department of Neuroscience, University of Minnesota, 6-145 Jackson Hall, 321 Church St SE, Minneapolis, MN, 55455, USA
- Medical Discovery Team on Addiction, University of Minnesota, 3-432 McGuire Translational Research Facility, 2001 6th St SE, Minneapolis, MN, 55455, USA
| | - Mark J Thomas
- Department of Neuroscience, University of Minnesota, 6-145 Jackson Hall, 321 Church St SE, Minneapolis, MN, 55455, USA.
- Medical Discovery Team on Addiction, University of Minnesota, 3-432 McGuire Translational Research Facility, 2001 6th St SE, Minneapolis, MN, 55455, USA.
| |
Collapse
|
9
|
Traner MR, Bromberg-Martin ES, Monosov IE. How the value of the environment controls persistence in visual search. PLoS Comput Biol 2021; 17:e1009662. [PMID: 34905548 PMCID: PMC8714092 DOI: 10.1371/journal.pcbi.1009662] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2021] [Revised: 12/28/2021] [Accepted: 11/21/2021] [Indexed: 11/18/2022] Open
Abstract
Classic foraging theory predicts that humans and animals aim to gain maximum reward per unit time. However, in standard instrumental conditioning tasks individuals adopt an apparently suboptimal strategy: they respond slowly when the expected value is low. This reward-related bias is often explained as reduced motivation in response to low rewards. Here we present evidence this behavior is associated with a complementary increased motivation to search the environment for alternatives. We trained monkeys to search for reward-related visual targets in environments with different values. We found that the reward-related bias scaled with environment value, was consistent with persistent searching after the target was already found, and was associated with increased exploratory gaze to objects in the environment. A novel computational model of foraging suggests that this search strategy could be adaptive in naturalistic settings where both environments and the objects within them provide partial information about hidden, uncertain rewards.
Collapse
Affiliation(s)
- Michael R. Traner
- Department of Biomedical Engineering, Washington University, St. Louis, Missouri, United States of America
| | - Ethan S. Bromberg-Martin
- Department of Neuroscience, Washington University School of Medicine, St. Louis, Missouri, United States of America
| | - Ilya E. Monosov
- Department of Biomedical Engineering, Washington University, St. Louis, Missouri, United States of America
- Department of Neuroscience, Washington University School of Medicine, St. Louis, Missouri, United States of America
- Department of Neurosurgery, Washington University, St. Louis, Missouri, United States of America
- Pain Center, Washington University, St. Louis, Missouri, United States of America
- Department of Electrical Engineering, Washington University, St. Louis, Missouri, United States of America
- * E-mail:
| |
Collapse
|
10
|
Abstract
Humans and other animals evolved to make decisions that extend over time with continuous and ever-changing options. Nonetheless, the academic study of decision-making is mostly limited to the simple case of choice between two options. Here, we advocate that the study of choice should expand to include continuous decisions. Continuous decisions, by our definition, involve a continuum of possible responses and take place over an extended period of time during which the response is continuously subject to modification. In most continuous decisions, the range of options can fluctuate and is affected by recent responses, making consideration of reciprocal feedback between choices and the environment essential. The study of continuous decisions raises new questions, such as how abstract processes of valuation and comparison are co-implemented with action planning and execution, how we simulate the large number of possible futures our choices lead to, and how our brains employ hierarchical structure to make choices more efficiently. While microeconomic theory has proven invaluable for discrete decisions, we propose that engineering control theory may serve as a better foundation for continuous ones. And while the concept of value has proven foundational for discrete decisions, goal states and policies may prove more useful for continuous ones. This article is part of the theme issue 'Existence and prevalence of economic behaviours among non-human primates'.
Collapse
Affiliation(s)
- Seng Bum Michael Yoo
- Center for Neuroscience Imaging Research, Institute for Basic Science, Suwon, Republic of Korea, 16419
| | - Benjamin Yost Hayden
- Department of Neuroscience, Center for Neuroengineering, Center for Magnetic Resonance Research, University of Minnesota, Minneapolis, MN 55455, USA
| | - John M. Pearson
- Department of Biostatistics and Bioinformatics, Center for Cognitive Neuroscience, Department of Neurobiology, Department of Electrical and Computer Engineering, Duke University, Durham, NC 27708, USA
| |
Collapse
|
11
|
Cash-Padgett T, Hayden B. Behavioural variability contributes to over-staying in patchy foraging. Biol Lett 2020; 16:20190915. [PMID: 32156171 DOI: 10.1098/rsbl.2019.0915] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Foragers often systematically deviate from rate-maximizing choices in two ways: accuracy and precision. That is, they use suboptimal threshold values and also show variability in their application of those thresholds. We hypothesized that these biases are related and, more specifically, that foragers' widely known accuracy bias--over-staying--could be explained, at least in part, by their imprecision. To test this hypothesis, we analysed choices made by three rhesus macaques in a computerized patch foraging task. Confirming previously observed findings, we found high levels of variability. We then showed, through simulations, that this variability changed optimal thresholds, meaning that a forager aware of its own variability should increase its leaving threshold (i.e. over-stay) to increase performance. All subjects showed thresholds that were biased in the predicted direction. These results indicate that over-staying in patches may reflect, in part, an adaptation to behavioural variability.
Collapse
Affiliation(s)
- Tyler Cash-Padgett
- Department of Neuroscience, Center for Magnetic Resonance Research, University of Minnesota, Minneapolis, MN 55455, USA.,Center for Neuroengineering, University of Minnesota, Minneapolis, MN 55455, USA
| | - Benjamin Hayden
- Department of Neuroscience, Center for Magnetic Resonance Research, University of Minnesota, Minneapolis, MN 55455, USA.,Center for Neuroengineering, University of Minnesota, Minneapolis, MN 55455, USA
| |
Collapse
|
12
|
Yoo SBM, Tu JC, Piantadosi ST, Hayden BY. The neural basis of predictive pursuit. Nat Neurosci 2020; 23:252-259. [PMID: 31907436 PMCID: PMC7007341 DOI: 10.1038/s41593-019-0561-6] [Citation(s) in RCA: 39] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2018] [Accepted: 11/20/2019] [Indexed: 12/16/2022]
Abstract
It remains unclear whether and, if so, how nonhuman animals make on-the-fly predictions during pursuit. Here we used a novel laboratory pursuit task that incentivizes the prediction of future prey positions. We trained three macaques to perform a joystick-controlled pursuit task in which prey follow intelligent escape algorithms. Subjects aimed toward the likely future positions of the prey, which indicated that they generate internal predictions and use these to guide behavior. We then developed a generative model that explains real-time pursuit trajectories and showed that our subjects use prey position, velocity and acceleration to make predictions. We identified neurons in the dorsal anterior cingulate cortex whose responses track these three variables. These neurons multiplexed prediction-related variables with a distinct and explicit representation of the future position of the prey. Our results provide a clear demonstration that the brain can explicitly represent future predictions and highlight the critical role of anterior cingulate cortex for future-oriented cognition.
Collapse
Affiliation(s)
- Seng Bum Michael Yoo
- Department of Neuroscience, Center for Magnetic Resonance Research, Minneapolis, MN, USA.
| | - Jiaxin Cindy Tu
- Department of Neuroscience, Center for Magnetic Resonance Research, Minneapolis, MN, USA.,Department of Neuroscience, Washington University in St Louis, St Louis, MO, USA
| | - Steven T Piantadosi
- Department of Psychology, University of California Berkeley, Berkeley, CA, USA
| | - Benjamin Yost Hayden
- Department of Neuroscience, Center for Magnetic Resonance Research, Minneapolis, MN, USA.,Center for Neuroengineering, University of Minnesota, Minneapolis, MN, USA
| |
Collapse
|
13
|
Eisenreich BR, Hayden BY, Zimmermann J. Macaques are risk-averse in a freely moving foraging task. Sci Rep 2019; 9:15091. [PMID: 31636348 PMCID: PMC6803699 DOI: 10.1038/s41598-019-51442-z] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2019] [Accepted: 09/26/2019] [Indexed: 11/09/2022] Open
Abstract
Rhesus macaques (Macaca mulatta) appear to be robustly risk-seeking in computerized gambling tasks typically used for electrophysiology. This behavior distinguishes them from many other animals, which are risk-averse, albeit measured in more naturalistic contexts. We wondered whether macaques' risk preferences reflect their evolutionary history or derive from the less naturalistic elements of task design associated with the demands of physiological recording. We assessed macaques' risk attitudes in a task that is somewhat more naturalistic than many that have previously been used: subjects foraged at four feeding stations in a large enclosure. Patches (i.e., stations), provided either stochastically or non-stochastically depleting rewards. Subjects' patch residence times were longer at safe than at risky stations, indicating a preference for safe options. This preference was not attributable to a win-stay-lose-shift heuristic and reversed as the environmental richness increased. These findings highlight the lability of risk attitudes in macaques and support the hypothesis that the ecological validity of a task can influence the expression of risk preference.
Collapse
Affiliation(s)
- Benjamin R Eisenreich
- Department of Neuroscience, Center for Magnetic Resonance Research, and Center for Neuroengineering University of Minnesota, Minneapolis, MN, 55455, USA.
| | - Benjamin Y Hayden
- Department of Neuroscience, Center for Magnetic Resonance Research, and Center for Neuroengineering University of Minnesota, Minneapolis, MN, 55455, USA
| | - Jan Zimmermann
- Department of Neuroscience, Center for Magnetic Resonance Research, and Center for Neuroengineering University of Minnesota, Minneapolis, MN, 55455, USA
| |
Collapse
|
14
|
Kane GA, Bornstein AM, Shenhav A, Wilson RC, Daw ND, Cohen JD. Rats exhibit similar biases in foraging and intertemporal choice tasks. eLife 2019; 8:48429. [PMID: 31532391 PMCID: PMC6794087 DOI: 10.7554/elife.48429] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2019] [Accepted: 09/17/2019] [Indexed: 12/05/2022] Open
Abstract
Animals, including humans, consistently exhibit myopia in two different contexts: foraging, in which they harvest locally beyond what is predicted by optimal foraging theory, and intertemporal choice, in which they exhibit a preference for immediate vs. delayed rewards beyond what is predicted by rational (exponential) discounting. Despite the similarity in behavior between these two contexts, previous efforts to reconcile these observations in terms of a consistent pattern of time preferences have failed. Here, via extensive behavioral testing and quantitative modeling, we show that rats exhibit similar time preferences in both contexts: they prefer immediate vs. delayed rewards and they are sensitive to opportunity costs of delays to future decisions. Further, a quasi-hyperbolic discounting model, a form of hyperbolic discounting with separate components for short- and long-term rewards, explains individual rats’ time preferences across both contexts, providing evidence for a common mechanism for myopic behavior in foraging and intertemporal choice. Often decisions have to be made on whether to stick with a resource or leave it behind to search for a better alternative. Should you book that hotel room or continue looking at others? Is it time to start searching for a new job, or even for a new partner? Animals face similar 'stick or twist' decisions when foraging for food. Knowing how to maximize the amount of food you obtain is key to survival. Studies have shown that most animals tend to stick with a food source for a little too long, a phenomenon known as 'overharvesting'. To find out why, Kane et al. designed carefully controlled experiments to compare foraging behavior in rats to another form of decision-making, known as intertemporal choice. The latter involves choosing between a small reward now versus a larger reward later. Given this choice, most rats opt to receive a smaller reward now rather than wait for the larger reward. This suggests that rats value rewards available in the future less than rewards they can get immediately. Kane et al. showed that this preference for short-term rewards can also explain why rats overharvest in foraging scenarios. By leaving one food source to go in search of another, rats must put up with a delay before they can access the new food supply. This delay, due to the time required to travel and search, reduces the value of the future reward. As a result, rats are more likely to stick with their current food source, even though leaving it would yield a greater reward in the long run. These findings in rats raise important questions about the mechanisms that lead to biases in thinking, and how factors like changes in the environment or specific disease states can influence these biases.
Collapse
Affiliation(s)
- Gary A Kane
- Department of Psychology, Princeton Neuroscience Institute, Princeton University, Princeton, United States.,Rowland Institute at Harvard, Harvard University, Cambridge, United States
| | - Aaron M Bornstein
- Department of Psychology, Princeton Neuroscience Institute, Princeton University, Princeton, United States.,Department of Cognitive Sciences, Center for the Neurobiology of Learning and Memory, University of California, Irvine, Irvine, United States
| | - Amitai Shenhav
- Department of Cognitive, Linguistic and Psychological Sciences, Carney Institute for Brain Science, Brown University, Providence, United States
| | - Robert C Wilson
- Department of Psychology, Cognitive Science Program, University of Arizona, Tucson, United States
| | - Nathaniel D Daw
- Department of Psychology, Princeton Neuroscience Institute, Princeton University, Princeton, United States
| | - Jonathan D Cohen
- Department of Psychology, Princeton Neuroscience Institute, Princeton University, Princeton, United States
| |
Collapse
|
15
|
Langdon AJ, Song M, Niv Y. Uncovering the 'state': Tracing the hidden state representations that structure learning and decision-making. Behav Processes 2019; 167:103891. [PMID: 31381985 DOI: 10.1016/j.beproc.2019.103891] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2019] [Revised: 05/23/2019] [Accepted: 06/21/2019] [Indexed: 02/02/2023]
Abstract
We review the abstract concept of a 'state' - an internal representation posited by reinforcement learning theories to be used by an agent, whether animal, human or artificial, to summarize the features of the external and internal environment that are relevant for future behavior on a particular task. Armed with this summary representation, an agent can make decisions and perform actions to interact effectively with the world. Here, we review recent findings from the neurobiological and behavioral literature to ask: 'what is a state?' with respect to the internal representations that organize learning and decision making across a range of tasks. We find that state representations include information beyond a straightforward summary of the immediate cues in the environment, providing timing or contextual information from the recent or more distant past, which allows these additional factors to influence decision making and other goal-directed behaviors in complex and perhaps unexpected ways.
Collapse
Affiliation(s)
- Angela J Langdon
- Princeton Neuroscience Institute and Department of Psychology, Princeton University, Princeton, NJ, 08544, United States.
| | - Mingyu Song
- Princeton Neuroscience Institute and Department of Psychology, Princeton University, Princeton, NJ, 08544, United States
| | - Yael Niv
- Princeton Neuroscience Institute and Department of Psychology, Princeton University, Princeton, NJ, 08544, United States.
| |
Collapse
|
16
|
Mehta PS, Tu JC, LoConte GA, Pesce MC, Hayden BY. Ventromedial Prefrontal Cortex Tracks Multiple Environmental Variables during Search. J Neurosci 2019; 39:5336-5350. [PMID: 31028117 PMCID: PMC6607750 DOI: 10.1523/jneurosci.2365-18.2019] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2018] [Revised: 04/16/2019] [Accepted: 04/18/2019] [Indexed: 11/21/2022] Open
Abstract
To make efficient foraging decisions, we must combine information about the values of available options with nonvalue information. Some accounts of ventromedial PFC (vmPFC) suggest that it has a narrow role limited to evaluating immediately available options. We examined responses of neurons in area 14 (a putative macaque homolog of human vmPFC) as 2 male macaques performed a novel foraging search task. Although many neurons encoded the values of immediately available offers, they also independently encoded several other variables that influence choice, but that are conceptually distinct from offer value. These variables include average reward rate, number of offers viewed per trial, previous offer values, previous outcome sizes, and the locations of the currently attended offer. We conclude that, rather than serving as specialized economic value center, vmPFC plays a broad role in integrating relevant environmental information to drive foraging decisions.SIGNIFICANCE STATEMENT Decision makers must often choose whether to take an immediately available option or continue to search for a better one. We hypothesized that this process, which is integral to foraging theory, leaves neural signatures in the brain region ventromedial PFC. Subjects performed a novel foraging task in which they searched through differently valued options and attempted to balance their reward threshold with various time costs. We found that neurons not only encode the values of immediately available offers, but multiplexed these with environmental variables, including reward rate, number of offers viewed, previous offer values, and spatial information. We conclude that vmPFC plays a rich role in encoding and integrating multiple foraging-related variables during economic decisions.
Collapse
Affiliation(s)
- Priyanka S Mehta
- Department of Neuroscience and Center for Magnetic Resonance Research, University of Minnesota, Minneapolis, Minnesota 55455, and
| | - Jiaxin Cindy Tu
- Department of Neuroscience and Center for Magnetic Resonance Research, University of Minnesota, Minneapolis, Minnesota 55455, and
| | - Giuliana A LoConte
- Department of Neuroscience and Center for Magnetic Resonance Research, University of Minnesota, Minneapolis, Minnesota 55455, and
| | - Meghan C Pesce
- Department of Brain and Cognitive Sciences University of Rochester, Rochester, New York 14611
| | - Benjamin Y Hayden
- Department of Neuroscience and Center for Magnetic Resonance Research, University of Minnesota, Minneapolis, Minnesota 55455, and
| |
Collapse
|
17
|
Complex social ecology needs complex machineries of foraging. Behav Brain Sci 2019; 42:e45. [PMID: 30940259 DOI: 10.1017/s0140525x18002078] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
Uncertainty is caused not only by environmental changes, but also by social interference resulting from competition over food resources. Actually, foraging effort is socially facilitated, which, however, does not require incentive control by the dopamine system; Zajonc's "drive" theory is thus questionable. Instead, social adjustments may be pre-embedded in the limbic network responsible for decisions of appropriate effort-cost investment.
Collapse
|
18
|
Monkeys are curious about counterfactual outcomes. Cognition 2019; 189:1-10. [PMID: 30889493 DOI: 10.1016/j.cognition.2019.03.009] [Citation(s) in RCA: 34] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2018] [Revised: 03/11/2019] [Accepted: 03/13/2019] [Indexed: 11/22/2022]
Abstract
Many non-human animals show exploratory behaviors. It remains unclear whether any possess human-like curiosity. We previously proposed three criteria for applying the term curiosity to animal behavior: (1) the subject is willing to sacrifice reward to obtain information, (2) the information provides no immediate instrumental or strategic benefit, and (3) the amount the subject is willing to pay depends systematically on the amount of information available. In previous work on information-seeking in animals, information generally predicts upcoming rewards, and animals' decisions may therefore be a byproduct of reinforcement processes. Here we get around this potential confound by taking advantage of macaques' ability to reason counterfactually (that is, about outcomes that could have occurred had the subject chosen differently). Specifically, macaques sacrificed fluid reward to obtain information about counterfactual outcomes. Moreover, their willingness to pay scaled with the information (Shannon entropy) offered by the counterfactual option. These results demonstrate the existence of human-like curiosity in non-human primates according to our criteria, which circumvent several confounds associated with less stringent criteria.
Collapse
|
19
|
Hayden BY. Why has evolution not selected for perfect self-control? Philos Trans R Soc Lond B Biol Sci 2019; 374:20180139. [PMID: 30966922 PMCID: PMC6335460 DOI: 10.1098/rstb.2018.0139] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/04/2018] [Indexed: 12/13/2022] Open
Abstract
Self-control refers to the ability to deliberately reject tempting options and instead select ones that produce greater long-term benefits. Although some apparent failures of self-control are, on closer inspection, reward maximizing, at least some self-control failures are clearly disadvantageous and non-strategic. The existence of poor self-control presents an important evolutionary puzzle because there is no obvious reason why good self-control should be more costly than poor self-control. After all, a rock is infinitely patient. I propose that self-control failures result from cases in which well-learned (and thus routinized) decision-making strategies yield suboptimal choices. These mappings persist in the decision-makers' repertoire because they result from learning processes that are adaptive in the broader context, either on the timescale of learning or of evolution. Self-control, then, is a form of cognitive control and the subjective feeling of effort likely reflects the true costs of cognitive control. Poor self-control, in this view, is ultimately a result of bounded optimality. This article is part of the theme issue 'Risk taking and impulsive behaviour: fundamental discoveries, theoretical perspectives and clinical implications.
Collapse
Affiliation(s)
- Benjamin Y. Hayden
- Department of Neuroscience, Center for Magnetic Resonance Research, Center for Neuroengineering, University of Minnesota, Minneapolis, MN 55455, USA
| |
Collapse
|
20
|
Abstract
The National Institute of Mental Health launched the Research Domain Criteria (RDoC) initiative to better understand dimensions of behavior and identify targets for treatment. Examining dimensions across psychiatric illnesses has proven challenging, as reliable behavioral paradigms that are known to engage specific neural circuits and translate across diagnostic populations are scarce. Delay discounting paradigms seem to be an exception: they are useful for understanding links between neural systems and behavior in healthy individuals, with potential for assessing how these mechanisms go awry in psychiatric illnesses. This article reviews relevant literature on delay discounting (or the rate at which the value of a reward decreases as the delay to receipt increases) in humans, including methods for examining it, its putative neural mechanisms, and its application in psychiatric research. There exist rigorous and reproducible paradigms to evaluate delay discounting, standard methods for calculating discount rate, and known neural systems probed by these paradigms. Abnormalities in discounting have been associated with psychopathology ranging from addiction (with steep discount rates indicating relative preference for immediate rewards) to anorexia nervosa (with shallow discount rates indicating preference for future rewards). The latest research suggests that delay discounting can be manipulated in the laboratory. Extensively studied in cognitive neuroscience, delay discounting assesses a dimension of behavior that is important for decision-making and is linked to neural substrates and to psychopathology. The question now is whether manipulating delay discounting can yield clinically significant changes in behavior that promote health. If so, then delay discounting could deliver on the RDoC promise.
Collapse
Affiliation(s)
- Karolina M Lempert
- Department of Psychology,University of Pennsylvania,Philadelphia, PA,USA
| | - Joanna E Steinglass
- Department of Psychiatry,Columbia University Medical Center,New York, NY,USA
| | - Anthony Pinto
- Department of Psychiatry,Columbia University Medical Center,New York, NY,USA
| | - Joseph W Kable
- Department of Psychology,University of Pennsylvania,Philadelphia, PA,USA
| | - Helen Blair Simpson
- Department of Psychiatry,Columbia University Medical Center,New York, NY,USA
| |
Collapse
|
21
|
Ramakrishnan A, Hayden BY, Platt ML. Local field potentials in dorsal anterior cingulate sulcus reflect rewards but not travel time costs during foraging. Brain Neurosci Adv 2019; 3:2398212818817932. [PMID: 32166176 PMCID: PMC7058217 DOI: 10.1177/2398212818817932] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2018] [Accepted: 11/12/2018] [Indexed: 11/16/2022] Open
Abstract
To maximise long-term reward rates, foragers deciding when to leave a patch must compute a decision variable that reflects both the immediately available reward and the time costs associated with travelling to the next patch. Identifying the mechanisms that mediate this computation is central to understanding how brains implement foraging decisions. We previously showed that firing rates of dorsal anterior cingulate sulcus neurons incorporate both variables. This result does not provide information about whether integration of information reflected in dorsal anterior cingulate sulcus spiking activity arises locally or whether it is inherited from upstream structures. Here, we examined local field potentials gathered simultaneously with our earlier recordings. In the majority of recording sites, local field potential spectral bands - specifically theta, beta, and gamma frequency ranges - encoded immediately available rewards but not time costs. The disjunction between information contained in spiking and local field potentials can constrain models of foraging-related processing. In particular, given the proposed link between local field potentials and inputs to a brain area, it raises the possibility that local processing within dorsal anterior cingulate sulcus serves to more fully bind immediate reward and time costs into a single decision variable.
Collapse
Affiliation(s)
- Arjun Ramakrishnan
- Department of Neuroscience, University of Pennsylvania, Philadelphia, PA, USA
| | - Benjamin Y. Hayden
- Department of Neuroscience, University of Minnesota, Minneapolis, MN, USA
| | - Michael L. Platt
- Department of Neuroscience, University of Pennsylvania, Philadelphia, PA, USA
- Department of Psychology, University of Pennsylvania, Philadelphia, PA, USA
- Department of Marketing, University of Pennsylvania, Philadelphia, PA, USA
| |
Collapse
|
22
|
|
23
|
Abstract
This paper characterizes impulsive behavior using a patch-leaving paradigm and active inference-a framework for describing Bayes optimal behavior. This paradigm comprises different environments (patches) with limited resources that decline over time at different rates. The challenge is to decide when to leave the current patch for another to maximize reward. We chose this task because it offers an operational characterization of impulsive behavior, namely, maximizing proximal reward at the expense of future gain. We use a Markov decision process formulation of active inference to simulate behavioral and electrophysiological responses under different models and prior beliefs. Our main finding is that there are at least three distinct causes of impulsive behavior, which we demonstrate by manipulating three different components of the Markov decision process model. These components comprise (i) the depth of planning, (ii) the capacity to maintain and process information, and (iii) the perceived value of immediate (relative to delayed) rewards. We show how these manipulations change beliefs and subsequent choices through variational message passing. Furthermore, we appeal to the process theories associated with this message passing to simulate neuronal correlates. In future work, we will use this scheme to identify the prior beliefs that underlie different sorts of impulsive behavior-and ask whether different causes of impulsivity can be inferred from the electrophysiological correlates of choice behavior.
Collapse
|
24
|
Cash-Padgett T, Azab H, Yoo SBM, Hayden BY. Opposing pupil responses to offered and anticipated reward values. Anim Cogn 2018; 21:671-684. [PMID: 29971595 PMCID: PMC6232855 DOI: 10.1007/s10071-018-1202-2] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2018] [Revised: 06/04/2018] [Accepted: 06/27/2018] [Indexed: 01/01/2023]
Abstract
Previous studies have shown that the pupils dilate more in anticipation of larger rewards. This finding raises the possibility of a more general association between reward amount and pupil size. We tested this idea by characterizing macaque pupil responses to offered rewards during evaluation and comparison in a binary choice task. To control attention, we made use of a design in which offers occurred in sequence. By looking at pupil responses after choice but before reward, we confirmed the previously observed positive association between pupil size and anticipated reward values. Surprisingly, however, we find that pupil size is negatively correlated with the value of offered gambles before choice, during both evaluation and comparison stages of the task. These results demonstrate a functional distinction between offered and anticipated rewards and present evidence against a narrow version of the simulation hypothesis; the idea that we represent offers by reactivating states associated with anticipating them. They also suggest that pupil size is correlated with relative, not absolute, values of offers, suggestive of an accept-reject model of comparison.
Collapse
Affiliation(s)
- Tyler Cash-Padgett
- Department of Neuroscience and Center for Magnetic Resonance Research, University of Minnesota, Minneapolis, MN, 55455, USA.
| | - Habiba Azab
- Department of Brain and Cognitive Sciences and Center for Visual Sciences, Center for the Origins of Cognition, University of Rochester, Rochester, NY, USA
| | - Seng Bum Michael Yoo
- Department of Brain and Cognitive Sciences and Center for Visual Sciences, Center for the Origins of Cognition, University of Rochester, Rochester, NY, USA
| | - Benjamin Y Hayden
- Department of Neuroscience and Center for Magnetic Resonance Research, University of Minnesota, Minneapolis, MN, 55455, USA
| |
Collapse
|
25
|
Wendt S, Czaczkes TJ. Individual ant workers show self-control. Biol Lett 2018; 13:rsbl.2017.0450. [PMID: 29021315 DOI: 10.1098/rsbl.2017.0450] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2017] [Accepted: 09/18/2017] [Indexed: 11/12/2022] Open
Abstract
Often, the first option is not the best. Self-control can allow humans and animals to improve resource intake under such conditions. Self-control in animals is often investigated using intertemporal choice tasks-choosing a smaller reward immediately or a larger reward after a delay. However, intertemporal choice tasks may underestimate self-control, as test subjects may not fully understand the task. Vertebrates show much greater apparent self-control in more natural foraging contexts and spatial discounting tasks than in intertemporal choice tasks. However, little is still known about self-control in invertebrates. Here, we investigate self-control in the black garden ant Lasius niger We confront individual workers with a spatial discounting task, offering a high-quality reward far from the nest and a poor-quality reward closer to the nest. Most ants (69%) successfully ignored the closer, poorer reward in favour of the further, better one. However, when both the far and the close rewards were of the same quality, most ants (83%) chose the closer feeder, indicating that the ants were indeed exercising self-control, as opposed to a fixation on an already known food source.
Collapse
Affiliation(s)
- Stephanie Wendt
- Animal Comparative Economics Laboratory, Department of Zoology and Evolutionary Biology, University of Regensburg, D-93053 Regensburg, Germany
| | - Tomer J Czaczkes
- Animal Comparative Economics Laboratory, Department of Zoology and Evolutionary Biology, University of Regensburg, D-93053 Regensburg, Germany
| |
Collapse
|
26
|
Cromwell HC, Tremblay L, Schultz W. Neural encoding of choice during a delayed response task in primate striatum and orbitofrontal cortex. Exp Brain Res 2018; 236:1679-1688. [PMID: 29610950 DOI: 10.1007/s00221-018-5253-z] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2017] [Accepted: 03/30/2018] [Indexed: 12/12/2022]
Abstract
Reward outcomes are available in many diverse situations and all involve choice. If there are multiple outcomes each rewarding, then decisions regarding relative value lead to choosing one over another. Important factors related to choice context should be encoded and utilized for this form of adaptive choosing. These factors can include the number of alternatives, the pacing of choice behavior and the possibility to reverse one's choice. An essential step in understanding if the context of choice is encoded is to directly compare choice with a context in which choice is absent. Neural activity in orbitofrontal cortex and striatum encodes potential value parameters related to reward quality and quantity as well as relative preference. We examined how neural activations in these brain regions are sensitive to choice situations and potentially involved in a prediction for the upcoming outcome selection. Neural activity was recorded and compared between a two-choice spatial delayed response task and an imperative 'one-option' task. Neural activity was obtained that extended from the instruction cue to the movement similar to previous work utilizing the identical imperative task. Orbitofrontal and striatal neural responses depended upon the decision about the choice of which reward to collect. Moreover, signals to predictive instruction cues that precede choice were selective for the choice situation. These neural responses could reflect chosen value with greater information on relative value of individual options as well as encode choice context itself embedded in the task as a part of the post-decision variable.
Collapse
Affiliation(s)
- Howard C Cromwell
- Department of Psychology, JP Scott Center for Neuroscience, Mind and Behavior, Bowling Green State University, Bowling Green, OH, 43403, USA.
| | - Leon Tremblay
- Centre de Neuroscience Cognitive, UMR-5229 CNRS, Bron, Cedex, France
- Université Claude-Bernard Lyon 1, 69100, Villeurbanne, France
| | - Wolfram Schultz
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge, CB2 3DY, UK
| |
Collapse
|
27
|
On the Flexibility of Basic Risk Attitudes in Monkeys. J Neurosci 2018; 38:4383-4398. [PMID: 29626169 DOI: 10.1523/jneurosci.2260-17.2018] [Citation(s) in RCA: 45] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2017] [Revised: 03/19/2018] [Accepted: 03/23/2018] [Indexed: 11/21/2022] Open
Abstract
Monkeys and other animals appear to share with humans two risk attitudes predicted by prospect theory: an inverse-S-shaped probability-weighting (PW) function and a steeper utility curve for losses than for gains. These findings suggest that such preferences are stable traits with common neural substrates. We hypothesized instead that animals tailor their preferences to subtle changes in task contexts, making risk attitudes flexible. Previous studies used a limited number of outcomes, trial types, and contexts. To gain a broader perspective, we examined two large datasets of male macaques' risky choices: one from a task with real (juice) gains and another from a token task with gains and losses. In contrast to previous findings, monkeys were risk seeking for both gains and losses (i.e., lacked a reflection effect) and showed steeper gain than loss curves (loss seeking). Utility curves for gains were substantially different in the two tasks. Monkeys showed nearly linear PWs in one task and S-shaped ones in the other; neither task produced a consistent inverse-S-shaped curve. To account for these observations, we developed and tested various computational models of the processes involved in the construction of reward value. We found that adaptive differential weighting of prospective gamble outcomes could partially account for the observed differences in the utility functions across the two experiments and thus provide a plausible mechanism underlying flexible risk attitudes. Together, our results support the idea that risky choices are constructed flexibly at the time of elicitation and place important constraints on neural models of economic choice.SIGNIFICANCE STATEMENT We respond in reliable ways to risk, but are our risk preferences stable traits or ephemeral states? Using various computational models, we examined two large datasets of macaque risky choices in two different tasks. We observed several deviations from "classic" risk preferences seen in humans and monkeys: no reflection effect, loss seeking as opposed to loss aversion, and linear and S-shaped, as opposed to inverse-S-shaped, probability distortion. These results challenge the idea that our risk attitudes are evolved traits shared with the last common ancestor of macaques and humans, suggesting instead that behavioral flexibility is the hallmark of risky choice in primates. We show how this flexibility can emerge partly as a result of interactions between attentional and reward systems.
Collapse
|
28
|
Blanchard TC, Piantadosi ST, Hayden BY. Robust mixture modeling reveals category-free selectivity in reward region neuronal ensembles. J Neurophysiol 2018; 119:1305-1318. [PMID: 29212924 PMCID: PMC5966738 DOI: 10.1152/jn.00808.2017] [Citation(s) in RCA: 25] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2017] [Revised: 11/30/2017] [Accepted: 12/01/2017] [Indexed: 12/29/2022] Open
Abstract
Classification of neurons into clusters based on their response properties is an important tool for gaining insight into neural computations. However, it remains unclear to what extent neurons fall naturally into discrete functional categories. We developed a Bayesian method that models the tuning properties of neural populations as a mixture of multiple types of task-relevant response patterns. We applied this method to data from several cortical and striatal regions in economic choice tasks. In all cases, neurons fell into only two clusters: one multiple-selectivity cluster containing all cells driven by task variables of interest and another of no selectivity for those variables. The single cluster of task-sensitive cells argues against robust categorical tuning in these areas. The no-selectivity cluster was unanticipated and raises important questions about what distinguishes these neurons and what role they play. Moreover, the ability to formally identify these nonselective cells allows for more accurate measurement of ensemble effects by excluding or appropriately down-weighting them in analysis. Our findings provide a valuable tool for analysis of neural data, challenge simple categorization schemes previously proposed for these regions, and place useful constraints on neurocomputational models of economic choice and control. NEW & NOTEWORTHY We present a Bayesian method for formally detecting whether a population of neurons can be naturally classified into clusters based on their response tuning properties. We then examine several data sets of reward system neurons for variables and find in all cases that neurons can be classified into only two categories: a functional class and a non-task-driven class. These results provide important constraints for neural models of the reward system.
Collapse
Affiliation(s)
- Tommy C Blanchard
- Department of Brain and Cognitive Sciences, Center for Visual Science, and Center for the Origins of Cognition, University of Rochester , Rochester, New York
| | - Steven T Piantadosi
- Department of Brain and Cognitive Sciences, Center for Visual Science, and Center for the Origins of Cognition, University of Rochester , Rochester, New York
| | - Benjamin Y Hayden
- Department of Neuroscience and Center for Magnetic Resonance Research, University of Minnesota , Minneapolis, Minnesota
| |
Collapse
|
29
|
Azab H, Hayden BY. Correlates of economic decisions in the dorsal and subgenual anterior cingulate cortices. Eur J Neurosci 2018; 47:979-993. [PMID: 29431892 PMCID: PMC5902660 DOI: 10.1111/ejn.13865] [Citation(s) in RCA: 37] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2017] [Revised: 02/03/2018] [Accepted: 02/05/2018] [Indexed: 11/26/2022]
Abstract
The anterior cingulate cortex can be divided into distinct ventral (subgenual, sgACC) and dorsal (dACC), portions. The role of dACC in value-based decision-making is hotly debated, while the role of sgACC is poorly understood. We recorded neuronal activity in both regions in rhesus macaques performing a token-gambling task. We find that both encode many of the same variables; including integrated offered values of gambles, primary as well as secondary reward outcomes, number of current tokens and anticipated rewards. Both regions exhibit memory traces for offer values and putative value comparison signals. Both regions use a consistent scheme to encode the value of the attended option. This result suggests that neurones do not appear to be specialized for specific offers (that is, neurones use an attentional as opposed to labelled line coding scheme). We also observed some differences between the two regions: (i) coding strengths in dACC were consistently greater than those in sgACC, (ii) neurones in sgACC responded especially to losses and in anticipation of primary rewards, while those in dACC showed more balanced responding and (iii) responses to the first offer were slightly faster in sgACC. These results indicate that sgACC and dACC have some functional overlap in economic choice, and are consistent with the idea, inspired by neuroanatomy, which sgACC may serve as input to dACC.
Collapse
Affiliation(s)
- Habiba Azab
- Department of Brain and Cognitive Sciences and Center for Visual Sciences, University of Rochester, Rochester, NY 14618, USA
| | - Benjamin Y. Hayden
- Department of Brain and Cognitive Sciences and Center for Visual Sciences, University of Rochester, Rochester, NY 14618, USA
- Department of Neuroscience and Center for Magnetic Resonance Research, University of Minnesota, Minneapolis MN 55455, USA
| |
Collapse
|
30
|
Balasubramani PP, Moreno-Bote R, Hayden BY. Using a Simple Neural Network to Delineate Some Principles of Distributed Economic Choice. Front Comput Neurosci 2018; 12:22. [PMID: 29643773 PMCID: PMC5882864 DOI: 10.3389/fncom.2018.00022] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2017] [Accepted: 03/12/2018] [Indexed: 01/03/2023] Open
Abstract
The brain uses a mixture of distributed and modular organization to perform computations and generate appropriate actions. While the principles under which the brain might perform computations using modular systems have been more amenable to modeling, the principles by which the brain might make choices using distributed principles have not been explored. Our goal in this perspective is to delineate some of those distributed principles using a neural network method and use its results as a lens through which to reconsider some previously published neurophysiological data. To allow for direct comparison with our own data, we trained the neural network to perform binary risky choices. We find that value correlates are ubiquitous and are always accompanied by non-value information, including spatial information (i.e., no pure value signals). Evaluation, comparison, and selection were not distinct processes; indeed, value signals even in the earliest stages contributed directly, albeit weakly, to action selection. There was no place, other than at the level of action selection, at which dimensions were fully integrated. No units were specialized for specific offers; rather, all units encoded the values of both offers in an anti-correlated format, thus contributing to comparison. Individual network layers corresponded to stages in a continuous rotation from input to output space rather than to functionally distinct modules. While our network is likely to not be a direct reflection of brain processes, we propose that these principles should serve as hypotheses to be tested and evaluated for future studies.
Collapse
Affiliation(s)
- Pragathi P. Balasubramani
- Brain and Cognitive Sciences, Center for Visual Science, Center for the Origins of Cognition, University of Rochester, Rochester, NY, United States
| | - Rubén Moreno-Bote
- Department of Information and Communications Technologies, Center for Brain and Cognition, University Pompeu Fabra, Barcelona, Spain
- Serra Húnter Fellow Programme, University Pompeu Fabra, Barcelona, Spain
| | - Benjamin Y. Hayden
- Neuroscience and Center for Magnetic Resonance Research, University of Minnesota, Minnesota, MN, United States
| |
Collapse
|
31
|
Yoo SBM, Sleezer BJ, Hayden BY. Robust Encoding of Spatial Information in Orbitofrontal Cortex and Striatum. J Cogn Neurosci 2018; 30:898-913. [PMID: 29561237 DOI: 10.1162/jocn_a_01259] [Citation(s) in RCA: 25] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]
Abstract
Knowing whether core reward regions carry information about the positions of relevant objects is crucial for adjudicating between choice models. One limitation of previous studies, including our own, is that spatial positions can be consistently differentially associated with rewards, and thus position can be confounded with attention, motor plans, or target identity. We circumvented these problems by using a task in which value-and thus choices-was determined solely by a frequently changing rule, which was randomized relative to spatial position on each trial. We presented offers asynchronously, which allowed us to control for reward expectation, spatial attention, and motor plans in our analyses. We find robust encoding of the spatial position of both offers and choices in two core reward regions, orbitofrontal Area 13 and ventral striatum, as well as in dorsal striatum of macaques. The trial-by-trial correlation in noise in encoding of position was associated with variation in choice, an effect known as choice probability correlation, suggesting that the spatial encoding is associated with choice and is not incidental to it. Spatial information and reward information are not carried by separate sets of neurons, although the two forms of information are temporally dissociable. These results highlight the ubiquity of multiplexed information in association cortex and argue against the idea that these ostensible reward regions serve as part of a pure value domain.
Collapse
|
32
|
Eisenreich BR, Hayden BY. Cognitive Science: Persistent Apes Are Intelligent Apes. Curr Biol 2018; 28:R160-R162. [PMID: 29462583 PMCID: PMC5843479 DOI: 10.1016/j.cub.2018.01.020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Abstract
In humans, self-control is correlated with general intelligence; a new study finds that this correlation extends to chimpanzees as well. The new results highlight the cognitive bases of self-control and suggest a common evolutionary history for human and primate self-control.
Collapse
Affiliation(s)
| | - Benjamin Y Hayden
- Department of Neuroscience, University of Minnesota, Minneapolis, MN 55455, USA.
| |
Collapse
|
33
|
Hayden BY, Moreno-Bote R. A neuronal theory of sequential economic choice. Brain Neurosci Adv 2018; 2:2398212818766675. [PMID: 32166137 PMCID: PMC7058205 DOI: 10.1177/2398212818766675] [Citation(s) in RCA: 32] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2018] [Accepted: 02/27/2018] [Indexed: 11/16/2022] Open
Abstract
Results of recent studies point towards a new framework for the neural bases of economic choice. The principles of this framework include the idea that evaluation is limited to a single option within the focus of attention and that we accept or reject that option relative to the entire set of alternatives. Rejection leads attention to a new option, although it can later switch back to a previously rejected one. The option to which a neuron's firing rate refers is determined dynamically by attention and not stably by labelled lines. Value is always computed relative to the value of rejection. Comparison results not from explicit competition between discrete populations of neurons, but indirectly, as in a horse race, from the fact that the first option whose value crosses a threshold is selected. Consequently, comparison can occur within a single pool of neurons rather than by competition between two or more neuronal populations. The computations that constitute comparison thus occur at multiple levels, including premotor levels, simultaneously (i.e. the brain uses a distributed consensus), and not in discrete stages. This framework suggests a solution to a set of otherwise unresolved neuronal binding problems that result from the need to link options to values, comparisons to actions, and choices to outcomes.
Collapse
Affiliation(s)
- Benjamin Y. Hayden
- Department of Neuroscience and Center for Magnetic Resonance Research, University of Minnesota, Minneapolis, MN, USA
| | - Rubén Moreno-Bote
- Department of Information and Communications Technologies, Pompeu Fabra University, Barcelona, Spain
- Center for Brain and Cognition, Pompeu Fabra University, Barcelona, Spain
- Serra Húnter Fellow Programme, Pompeu Fabra University, Barcelona, Spain
| |
Collapse
|
34
|
Dorsal anterior cingulate and ventromedial prefrontal cortex have inverse roles in both foraging and economic choice. COGNITIVE AFFECTIVE & BEHAVIORAL NEUROSCIENCE 2017; 16:1127-1139. [PMID: 27580609 DOI: 10.3758/s13415-016-0458-8] [Citation(s) in RCA: 39] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Recent research has highlighted a distinction between sequential foraging choices and traditional economic choices between simultaneously presented options. This was partly motivated by observations in Kolling, Behrens, Mars, and Rushworth, Science, 336(6077), 95-98 (2012) (hereafter, KBMR) that these choice types are subserved by different circuits, with dorsal anterior cingulate (dACC) preferentially involved in foraging and ventromedial prefrontal cortex (vmPFC) preferentially involved in economic choice. To support this account, KBMR used fMRI to scan human subjects making either a foraging choice (between exploiting a current offer or swapping for potentially better rewards) or an economic choice (between two reward-probability pairs). This study found that dACC better tracked values pertaining to foraging, whereas vmPFC better tracked values pertaining to economic choice. We recently showed that dACC's role in these foraging choices is better described by the difficulty of choosing than by foraging value, when correcting for choice biases and testing a sufficiently broad set of foraging values (Shenhav, Straccia, Cohen, & Botvinick Nature Neuroscience, 17(9), 1249-1254, 2014). Here, we extend these findings in 3 ways. First, we replicate our original finding with a larger sample and a task modified to address remaining methodological gaps between our previous experiments and that of KBMR. Second, we show that dACC activity is best accounted for by choice difficulty alone (rather than in combination with foraging value) during both foraging and economic choices. Third, we show that patterns of vmPFC activity, inverted relative to dACC, also suggest a common function across both choice types. Overall, we conclude that both regions are similarly engaged by foraging-like and economic choice.
Collapse
|
35
|
Kolling N, Akam T. (Reinforcement?) Learning to forage optimally. Curr Opin Neurobiol 2017; 46:162-169. [PMID: 28918312 DOI: 10.1016/j.conb.2017.08.008] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2017] [Revised: 08/06/2017] [Accepted: 08/17/2017] [Indexed: 11/24/2022]
Abstract
Foraging effectively is critical to the survival of all animals and this imperative is thought to have profoundly shaped brain evolution. Decisions made by foraging animals often approximate optimal strategies, but the learning and decision mechanisms generating these choices remain poorly understood. Recent work with laboratory foraging tasks in humans suggest their behaviour is poorly explained by model-free reinforcement learning, with simple heuristic strategies better describing behaviour in some tasks, and in others evidence of prospective prediction of the future state of the environment. We suggest that model-based average reward reinforcement learning may provide a common framework for understanding these apparently divergent foraging strategies.
Collapse
Affiliation(s)
- Nils Kolling
- Department of Experimental Psychology, University of Oxford, United Kingdom
| | - Thomas Akam
- Department of Experimental Psychology, University of Oxford, United Kingdom; Champalimaud Neuroscience Program, Champalimaud Center for the Unknown, Portugal.
| |
Collapse
|
36
|
Deciphering Decision Making: Variation in Animal Models of Effort- and Uncertainty-Based Choice Reveals Distinct Neural Circuitries Underlying Core Cognitive Processes. J Neurosci 2017; 36:12069-12079. [PMID: 27903717 DOI: 10.1523/jneurosci.1713-16.2016] [Citation(s) in RCA: 77] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2016] [Revised: 10/06/2016] [Accepted: 10/19/2016] [Indexed: 12/17/2022] Open
Abstract
Maladaptive decision-making is increasingly recognized to play a significant role in numerous psychiatric disorders, such that therapeutics capable of ameliorating core impairments in judgment may be beneficial in a range of patient populations. The field of "decision neuroscience" is therefore in its ascendancy, with researchers from diverse fields bringing their expertise to bear on this complex and fascinating problem. In addition to the advances in neuroimaging and computational neuroscience that contribute enormously to this area, an increase in the complexity and sophistication of behavioral paradigms designed for nonhuman laboratory animals has also had a significant impact on researchers' ability to test the causal nature of hypotheses pertaining to the neural circuitry underlying the choice process. Multiple such decision-making assays have been developed to investigate the neural and neurochemical bases of different types of cost/benefit decisions. However, what may seem like relatively trivial variation in behavioral methodologies can actually result in recruitment of distinct cognitive mechanisms, and alter the neurobiological processes that regulate choice. Here we focus on two areas of particular interest, namely, decisions that involve an assessment of uncertainty or effort, and compare some of the most prominent behavioral paradigms that have been used to investigate these processes in laboratory rodents. We illustrate how an appreciation of the diversity in the nature of these tasks can lead to important insights into the circumstances under which different neural regions make critical contributions to decision making.
Collapse
|
37
|
|
38
|
McGraw JJ, Zona LC, Cromwell HC. The effects of ethanol on diverse components of choice in the rat: reward discrimination, preference and relative valuation. Eur J Neurosci 2017. [PMID: 28639261 DOI: 10.1111/ejn.13627] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Alcohol consumption impairs judgment and choice. How alcohol alters these crucial processes is primarily unknown. Choice can be fractionated into different components including reward discrimination, preference and relative valuation that can function together or in isolation depending upon diverse factors including choice context. We examined the diverse components and contextual effects by analyzing the effects of alcohol drinking on choice behavior in a task with a reduced level of temporal and spatial constraints. Rats were trained to drink 10% ethanol during 6 weeks of behavior testing using a combined sucrose-fade and two-bottle free-choice procedure. Two different sucrose pellet outcomes (e.g., constant vs. variable) were presented each week to examine the impact of voluntary drinking on reward-based decision-making. Behavioral contexts of single option, free choice and extinction were examined for each outcome set. Comparisons were made between alcohol and control groups and within the alcohol group over time to inspect choice profiles. Between-group results showed alcohol drinking animals expressed altered place preference and modified sucrose reward approach latencies. The within-group profile showed that alcohol drinking animals can express adequate reward discrimination, preference and incentive contrast during free choice. All of these components were significantly reduced during the context of extinction. Control animals were also impacted by extinction but not as severely. The findings point to a need for a greater focus on the context and the diverse components of choice when examining external and internal factors influencing decision-making during alcohol or other substance of abuse exposure.
Collapse
Affiliation(s)
- Justin J McGraw
- J.P. Scott Center for Neuroscience, Mind and Behavior and Department of Psychology, Bowling Green State University, Bowling Green, OH, USA
| | - Luke C Zona
- J.P. Scott Center for Neuroscience, Mind and Behavior and Department of Psychology, Bowling Green State University, Bowling Green, OH, USA
| | - Howard C Cromwell
- J.P. Scott Center for Neuroscience, Mind and Behavior and Department of Psychology, Bowling Green State University, Bowling Green, OH, USA
| |
Collapse
|
39
|
Nigg JT. Annual Research Review: On the relations among self-regulation, self-control, executive functioning, effortful control, cognitive control, impulsivity, risk-taking, and inhibition for developmental psychopathology. J Child Psychol Psychiatry 2017; 58:361-383. [PMID: 28035675 PMCID: PMC5367959 DOI: 10.1111/jcpp.12675] [Citation(s) in RCA: 723] [Impact Index Per Article: 103.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 10/14/2016] [Indexed: 01/08/2023]
Abstract
BACKGROUND Self-regulation (SR) is central to developmental psychopathology, but progress has been impeded by varying terminology and meanings across fields and literatures. METHODS The present review attempts to move that discussion forward by noting key sources of prior confusion such as measurement-concept confounding, and then arguing the following major points. RESULTS First, the field needs a domain-general construct of SR that encompasses SR of action, emotion, and cognition and involves both top-down and bottom-up regulatory processes. This does not assume a shared core process across emotion, action, and cognition, but is intended to provide clarity on the extent of various claims about kinds of SR. Second, top-down aspects of SR need to be integrated. These include (a) basic processes that develop early and address immediate conflict signals, such as cognitive control and effortful control (EC), and (b) complex cognition and strategies for addressing future conflict, represented by the regulatory application of complex aspects of executive functioning. Executive function (EF) and cognitive control are not identical to SR because they can be used for other activities, but account for top-down aspects of SR at the cognitive level. Third, impulsivity, risk-taking, and disinhibition are distinct although overlapping; a taxonomy of the kinds of breakdowns of SR associated with psychopathology requires their differentiation. Fourth, different aspects of the SR universe can be organized hierarchically in relation to granularity, development, and time. Low-level components assemble into high-level components. This hierarchical perspective is consistent across literatures. CONCLUSIONS It is hoped that the framework outlined here will facilitate integration and cross-talk among investigators working from different perspectives, and facilitate individual differences research on how SR relates to developmental psychopathology.
Collapse
Affiliation(s)
- Joel T Nigg
- Department of Psychiatry, Oregon Health and Science University, Portland, OR, USA
| |
Collapse
|
40
|
Pirrone A, Azab H, Hayden BY, Stafford T, Marshall JAR. Evidence for the speed-value trade-off: human and monkey decision making is magnitude sensitive. ACTA ACUST UNITED AC 2017; 5:129-142. [PMID: 29682592 DOI: 10.1037/dec0000075] [Citation(s) in RCA: 54] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Complex natural systems from brains to bee swarms have evolved to make adaptive multifactorial decisions. Recent theoretical and empirical work suggests that many evolved systems may take advantage of common motifs across multiple domains. We are particularly interested in value sensitivity (i.e., sensitivity to the magnitude or intensity of the stimuli or reward under consideration) as a mechanism to resolve deadlocks adaptively. This mechanism favours long-term reward maximization over accuracy in a simple manner, because it avoids costly delays associated with ambivalence between similar options; speed-value trade-offs have been proposed to be evolutionarily advantageous for many kinds of decision. A key prediction of the value-sensitivity hypothesis is that choices between equally-valued options will proceed faster when the options have a high value than when they have a low value. However, value-sensitivity is not part of idealised choice models such as diffusion to bound. Here we examine two different choice behaviours in two different species, perceptual decisions in humans and economic choices in rhesus monkeys, to test this hypothesis. We observe the same value sensitivity in both human perceptual decisions and monkey value-based decisions. These results endorse the idea that neural decision systems make use of the same basic principle of value-sensitivity in order to resolve costly deadlocks and thus improve long-term reward intake.
Collapse
Affiliation(s)
- Angelo Pirrone
- Department of Psychology & Department of Computer Science, The University of Sheffield, UK
| | - Habiba Azab
- Department of Brain and Cognitive Sciences and Center for Visual Science, University of Rochester, USA
| | - Benjamin Y Hayden
- Department of Brain and Cognitive Sciences and Center for Visual Science, University of Rochester, USA
| | - Tom Stafford
- Department of Psychology, The University of Sheffield, UK
| | | |
Collapse
|
41
|
Carter EC, Redish AD. Rats value time differently on equivalent foraging and delay-discounting tasks. J Exp Psychol Gen 2016; 145:1093-101. [PMID: 27359127 DOI: 10.1037/xge0000196] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
All organisms have to consider consequences that vary through time. Theories explaining how animals handle intertemporal choice include delay-discounting models, in which the value of future rewards is discounted by the delay until receipt, and foraging models, which predict that decision-makers maximize rate of reward. We measured the behavior of rats on a 2-option delay-discounting task and a stay/go foraging task that were equivalent for rate of reward and physical demand. Despite the highly shared features of the tasks, rats were willing to wait much longer on the foraging task than on the delay-discounting task. Moreover, choice performance by rats was less optimal in terms of total reward received on the foraging task compared to the delay-discounting task. We applied a suite of intertemporal choice models to the data but found that we needed a novel model incorporating interactions of decision-making systems to successfully explain behavior. Our findings (a) highlight the importance of factors that historically have been seen as irrelevant and (b) indicate the inadequacy of current general theories of intertemporal choice. (PsycINFO Database Record
Collapse
Affiliation(s)
- Evan C Carter
- Department of Ecology, Evolution and Behavior, University of Minnesota
| | | |
Collapse
|
42
|
Beran MJ, Perdue BM, Rossettie MS, James BT, Whitham W, Walker B, Futch SE, Parrish AE. Self-control assessments of capuchin monkeys with the rotating tray task and the accumulation task. Behav Processes 2016; 129:68-79. [PMID: 27298233 DOI: 10.1016/j.beproc.2016.06.007] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2016] [Revised: 06/09/2016] [Accepted: 06/09/2016] [Indexed: 10/21/2022]
Abstract
Recent studies of delay of gratification in capuchin monkeys using a rotating tray (RT) task have shown improved self-control performance in these animals in comparison to the accumulation (AC) task. In this study, we investigated whether this improvement resulted from the difference in methods between the rotating tray task and previous tests, or whether it was the result of greater overall experience with delay of gratification tasks. Experiment 1 produced similar performance levels by capuchins monkeys in the RT and AC tasks when identical reward and temporal parameters were used. Experiment 2 demonstrated a similar result using reward amounts that were more similar to previous AC experiments with these monkeys. In Experiment 3, monkeys performed multiple versions of the AC task with varied reward and temporal parameters. Their self-control behavior was found to be dependent on the overall delay to reward consumption, rather than the overall reward amount ultimately consumed. These findings indicate that these capuchin monkeys' self-control capacities were more likely to have improved across studies because of the greater experience they had with delay of gratification tasks. Experiment 4 and Experiment 5 tested new, task-naïve monkeys on both tasks, finding more limited evidence of self-control, and no evidence that one task was more beneficial than the other in promoting self-control. The results of this study suggest that future testing of this kind should focus on temporal parameters and reward magnitude parameters to establish accurate measures of delay of gratification capacity and development in this species and perhaps others.
Collapse
Affiliation(s)
- Michael J Beran
- Department of Psychology and Language Research Center, Georgia State University, United States.
| | - Bonnie M Perdue
- Department of Psychology, Agnes Scott College, United States
| | | | - Brielle T James
- Department of Psychology and Language Research Center, Georgia State University, United States
| | - Will Whitham
- Department of Psychology and Language Research Center, Georgia State University, United States
| | - Bradlyn Walker
- Department of Psychology and Language Research Center, Georgia State University, United States
| | - Sara E Futch
- Department of Psychology, Wofford College, United States
| | - Audrey E Parrish
- Department of Psychology and Language Research Center, Georgia State University, United States
| |
Collapse
|
43
|
Ricker JM, Hatch JD, Powers DD, Cromwell HC. Fractionating choice: A study on reward discrimination, preference, and relative valuation in the rat (Rattus norvegicus). ACTA ACUST UNITED AC 2016; 130:174-86. [PMID: 27078079 DOI: 10.1037/com0000034] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023]
Abstract
Choice behavior combines discrimination between distinctive outcomes, preference for specific outcomes and relative valuation of comparable outcomes. Previous work has focused on 1 component (i.e., preference) disregarding other influential processes that might provide a more complete understanding. Animal models of choice have been explored primarily utilizing extensive training, limited freedom for multiple decisions and sparse behavioral measures constrained to a single phase of motivated action. The present study used a paradigm that combines different elements of previous methods with the goal to distinguish among components of choice and explore how well components match predictions based on risk-sensitive foraging strategies. In order to analyze discrimination and relative valuation, it was necessary to have an option that shifted and an option that remained constant. Shifting outcomes among weeks included a change in single-option outcome (0 to 1 to 2 pellets) or a change in mixed-option outcome (0 or 5 to 0 or 3 to 0 or 1 pellets). Constant outcomes among weeks were also mixed-option (0 or 3 pellets) or single-option (1 pellet). Shifting single-option outcomes among weeks led to better discrimination, more robust preference and significant incentive contrast effects for the alternative outcome. Shifting multioptions altered choice components and led to dissociations among discrimination, preference, and reduced contrast effects. During extinction, all components were impacted with the greatest deficits during the shifting mixed-option outcome sessions. Results suggest choice behavior can be optimized for 1 component but suboptimal for others depending upon the complexity of alterations in outcome value between options. (PsycINFO Database Record
Collapse
Affiliation(s)
- Joshua M Ricker
- J.P. Scott Center for Neuroscience, Mind and Behavior, Bowling Green State University
| | - Justin D Hatch
- J.P. Scott Center for Neuroscience, Mind and Behavior, Bowling Green State University
| | - Daniel D Powers
- J.P. Scott Center for Neuroscience, Mind and Behavior, Bowling Green State University
| | - Howard Casey Cromwell
- J.P. Scott Center for Neuroscience, Mind and Behavior, Bowling Green State University
| |
Collapse
|
44
|
|
45
|
Blanchard TC, Strait CE, Hayden BY. Ramping ensemble activity in dorsal anterior cingulate neurons during persistent commitment to a decision. J Neurophysiol 2015; 114:2439-49. [PMID: 26334016 DOI: 10.1152/jn.00711.2015] [Citation(s) in RCA: 71] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2015] [Accepted: 08/27/2015] [Indexed: 11/22/2022] Open
Abstract
We frequently need to commit to a choice to achieve our goals; however, the neural processes that keep us motivated in pursuit of delayed goals remain obscure. We examined ensemble responses of neurons in macaque dorsal anterior cingulate cortex (dACC), an area previously implicated in self-control and persistence, in a task that requires commitment to a choice to obtain a reward. After reward receipt, dACC neurons signaled reward amount with characteristic ensemble firing rate patterns; during the delay in anticipation of the reward, ensemble activity smoothly and gradually came to resemble the postreward pattern. On the subset of risky trials, in which a reward was anticipated with 50% certainty, ramping ensemble activity evolved to the pattern associated with the anticipated reward (and not with the anticipated loss) and then, on loss trials, took on an inverted form anticorrelated with the form associated with a win. These findings enrich our knowledge of reward processing in dACC and may have broader implications for our understanding of persistence and self-control.
Collapse
Affiliation(s)
- Tommy C Blanchard
- Department of Brain and Cognitive Sciences and Center for Visual Science, University of Rochester, Rochester, New York
| | - Caleb E Strait
- Department of Brain and Cognitive Sciences and Center for Visual Science, University of Rochester, Rochester, New York
| | - Benjamin Y Hayden
- Department of Brain and Cognitive Sciences and Center for Visual Science, University of Rochester, Rochester, New York
| |
Collapse
|
46
|
Wheatley R, Angilletta MJ, Niehaus AC, Wilson RS. How Fast Should an Animal Run When Escaping? An Optimality Model Based on the Trade-Off Between Speed and Accuracy. Integr Comp Biol 2015; 55:1166-75. [PMID: 26254873 DOI: 10.1093/icb/icv091] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
How fast should animals move when trying to survive? Although many studies have examined how fast animals can move, the fastest speed is not always best. For example, an individual escaping from a predator must run fast enough to escape, but not so fast that it slips and falls. To explore this idea, we developed a simple mathematical model that predicts the optimal speed for an individual running from a predator along a straight beam. A beam was used as a proxy for straight-line running with severe consequences for missteps. We assumed that success, defined as reaching the end of the beam, had two broad requirements: (1) running fast enough to escape a predator, and (2) minimizing the probability of making a mistake that would compromise speed. Our model can be tailored to different systems by revising the predator's maximal speed, the prey's stride length and motor coordination, and the dimensions of the beam. Our model predicts that animals should run slower when the beam is narrower or when coordination is worse.
Collapse
Affiliation(s)
- Rebecca Wheatley
- *School of Biological Sciences, University of Queensland, St Lucia, Queensland 4072, Australia;
| | | | - Amanda C Niehaus
- *School of Biological Sciences, University of Queensland, St Lucia, Queensland 4072, Australia
| | - Robbie S Wilson
- *School of Biological Sciences, University of Queensland, St Lucia, Queensland 4072, Australia
| |
Collapse
|
47
|
Piantadosi ST, Hayden BY. Utility-free heuristic models of two-option choice can mimic predictions of utility-stage models under many conditions. Front Neurosci 2015; 9:105. [PMID: 25914613 PMCID: PMC4391032 DOI: 10.3389/fnins.2015.00105] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2014] [Accepted: 03/12/2015] [Indexed: 11/13/2022] Open
Abstract
Economists often model choices as if decision-makers assign each option a scalar value variable, known as utility, and then select the option with the highest utility. It remains unclear whether as-if utility models describe real mental and neural steps in choice. Although choices alone cannot prove the existence of a utility stage, utility transformations are often taken to provide the most parsimonious or psychologically plausible explanation for choice data. Here, we show that it is possible to mathematically transform a large set of common utility-stage two-option choice models (specifically ones in which dimensions are can be decomposed into additive functions) into a heuristic model (specifically, a dimensional prioritization heuristic) that has no utility computation stage. We then show that under a range of plausible assumptions, both classes of model predict similar neural responses. These results highlight the difficulties in using neuroeconomic data to infer the existence of a value stage in choice.
Collapse
Affiliation(s)
- Steven T Piantadosi
- Department of Brain and Cognitive Sciences, University of Rochester Rochester, NY, USA
| | - Benjamin Y Hayden
- Department of Brain and Cognitive Sciences, University of Rochester Rochester, NY, USA ; Center for Visual Science, University of Rochester Rochester, NY, USA
| |
Collapse
|
48
|
Guan S, Cheng L, Fan Y, Li X. Myopic decisions under negative emotions correlate with altered time perception. Front Psychol 2015; 6:468. [PMID: 25941508 PMCID: PMC4400848 DOI: 10.3389/fpsyg.2015.00468] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2015] [Accepted: 03/31/2015] [Indexed: 11/13/2022] Open
Abstract
Previous studies have obtained inconsistent findings about emotional influence on inter-temporal choice (IC). In the present study, we first examined the effect of temporary emotional priming induced by affective pictures in a trial-to-trial paradigm on IC. The results showed that negative priming resulted in much higher percentages of trials during which smaller-but-sooner reward (SS%) were chosen compared with positive and neutral priming. Next, we attempted to explore the possible mechanisms underlying such emotional effects. When participants performed a time reproduction task, mean reaction times in negative priming condition were significantly shorter than those in the other two emotional contexts, which indicated that negative emotional priming led to overestimation of time. Moreover, such overestimation was negatively correlated with performance in the IC task. In contrast, temporary changes of emotional contexts did not alter performances in a Go/NoGo task (including commission errors and omission errors). In sum, our present findings suggested that myopic decisions under negative emotions were associated with altered time perception but not response inhibition.
Collapse
Affiliation(s)
- Shuchen Guan
- Key Laboratory of Brain Functional Genomics, Ministry of Education, Shanghai Key Laboratory of Brain Functional Genomics, School of Psychology and Cognitive Science, East China Normal University , Shanghai, China
| | - Lu Cheng
- Key Laboratory of Brain Functional Genomics, Ministry of Education, Shanghai Key Laboratory of Brain Functional Genomics, School of Psychology and Cognitive Science, East China Normal University , Shanghai, China
| | - Ying Fan
- Key Laboratory of Brain Functional Genomics, Ministry of Education, Shanghai Key Laboratory of Brain Functional Genomics, School of Psychology and Cognitive Science, East China Normal University , Shanghai, China
| | - Xianchun Li
- Key Laboratory of Brain Functional Genomics, Ministry of Education, Shanghai Key Laboratory of Brain Functional Genomics, School of Psychology and Cognitive Science, East China Normal University , Shanghai, China
| |
Collapse
|