1
Feng YY, Bromberg-Martin ES, Monosov IE. Dorsal raphe neurons integrate the values of reward amount, delay, and uncertainty in multi-attribute decision-making. Cell Rep 2024; 43:114341. [PMID: 38878290] [DOI: 10.1016/j.celrep.2024.114341]
Abstract
The dorsal raphe nucleus (DRN) is implicated in psychiatric disorders that feature impaired sensitivity to reward amount, impulsivity when facing reward delays, and risk-seeking when confronting reward uncertainty. However, it has been unclear whether and how DRN neurons signal reward amount, reward delay, and reward uncertainty during multi-attribute value-based decision-making, where subjects consider these attributes to make a choice. We recorded DRN neurons as monkeys chose between offers whose attributes, namely expected reward amount, reward delay, and reward uncertainty, varied independently. Many DRN neurons signaled offer attributes, and this population tended to integrate the attributes in a manner that reflected monkeys' preferences for amount, delay, and uncertainty. After decision-making, in response to post-decision feedback, these same neurons signaled signed reward prediction errors, suggesting a broader role in tracking value across task epochs and behavioral contexts. Our data illustrate how the DRN participates in value computations, guiding theories about the role of the DRN in decision-making and psychiatric disease.
Affiliation(s)
- Yang-Yang Feng
- Department of Neuroscience, Washington University School of Medicine, St. Louis, MO, USA; Department of Biomedical Engineering, Washington University, St. Louis, MO, USA
- Ilya E Monosov
- Department of Neuroscience, Washington University School of Medicine, St. Louis, MO, USA; Department of Biomedical Engineering, Washington University, St. Louis, MO, USA; Washington University Pain Center, Washington University, St. Louis, MO, USA; Department of Neurosurgery, Washington University, St. Louis, MO, USA; Department of Electrical Engineering, Washington University, St. Louis, MO, USA.
2
Schultz W. A dopamine mechanism for reward maximization. Proc Natl Acad Sci U S A 2024; 121:e2316658121. [PMID: 38717856] [PMCID: PMC11098095] [DOI: 10.1073/pnas.2316658121]
Abstract
Individual survival and evolutionary selection require biological organisms to maximize reward. Economic choice theories define the necessary and sufficient conditions, and neuronal signals of decision variables provide mechanistic explanations. Reinforcement learning (RL) formalisms use predictions, actions, and policies to maximize reward. Midbrain dopamine neurons code reward prediction errors (RPE) of subjective reward value suitable for RL. Electrical and optogenetic self-stimulation experiments demonstrate that monkeys and rodents repeat behaviors that result in dopamine excitation. Dopamine excitations reflect positive RPEs that increase reward predictions via RL; against these increased predictions, obtaining a similar dopamine RPE signal again requires better rewards than before. The positive RPEs drive predictions higher again and thus advance a recursive reward-RPE-prediction iteration toward better and better rewards. Agents also avoid dopamine inhibitions that lower reward predictions via RL, which allows smaller rewards than before to elicit positive dopamine RPE signals and resume the iteration toward better rewards. In this way, dopamine RPE signals serve as a causal mechanism that attracts agents via RL to the best rewards. The mechanism improves daily life and benefits evolutionary selection but may also induce restlessness and greed.
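The recursive reward-RPE-prediction iteration described in this abstract can be sketched with a standard prediction-error update (an illustrative Rescorla-Wagner-style sketch, not the paper's model; the learning rate and reward values are arbitrary):

```python
def rpe_step(prediction, reward, alpha=0.5):
    """One learning step: RPE = reward - prediction; prediction moves toward reward."""
    rpe = reward - prediction
    return prediction + alpha * rpe, rpe

v = 0.0
v, rpe1 = rpe_step(v, reward=1.0)   # novel reward: large positive RPE
v, rpe2 = rpe_step(v, reward=1.0)   # same reward, higher prediction: smaller RPE
v, rpe3 = rpe_step(v, reward=2.0)   # only a better reward restores a large RPE
```

Against a rising prediction, the same reward yields a shrinking RPE, so an agent chasing positive RPEs is driven toward ever-better rewards.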
Affiliation(s)
- Wolfram Schultz
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge CB2 3DY, United Kingdom
3
Hill DF, Hickman RW, Al-Mohammad A, Stasiak A, Schultz W. Dopamine neurons encode trial-by-trial subjective reward value in an auction-like task. bioRxiv 2024:2023.01.20.524896. [PMID: 36711724] [PMCID: PMC9882283] [DOI: 10.1101/2023.01.20.524896]
Abstract
The dopamine reward prediction error signal is known to be subjective but has so far only been assessed in aggregate choices. However, personal choices fluctuate across trials and thus reflect instantaneous subjective reward value. In the well-established Becker-DeGroot-Marschak (BDM) auction-like mechanism, participants are encouraged to place bids that accurately reveal their instantaneous subjective reward value; inaccurate bidding results in suboptimal reward ('incentive compatibility'). In our experiment, male rhesus monkeys gained experience over several years in placing accurate BDM bids for juice rewards without specific external constraints. Their bids for physically identical rewards varied trial by trial and increased overall for larger rewards. In these highly experienced animals, responses of midbrain dopamine neurons followed the trial-by-trial variations of bids despite constant, explicitly predicted reward amounts. Conversely, dopamine responses were similar when bids were similar for different physical reward amounts. Support vector regression accurately predicted the animals' bids from as few as twenty dopamine neurons. Thus, the phasic dopamine reward signal reflects instantaneous subjective reward value.
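The incentive compatibility of the BDM mechanism mentioned in this abstract can be verified in closed form; the sketch below assumes (purely for illustration) a competing price drawn uniformly from [0, 1], under which bidding one's true value maximizes expected utility:

```python
def bdm_expected_utility(bid, value):
    """Expected utility under BDM with price ~ Uniform(0, 1).

    The bidder wins whenever price <= bid and pays the price, so
    E[u] = integral from 0 to bid of (value - p) dp = value*bid - bid**2 / 2,
    which is maximized exactly at bid == value.
    """
    return value * bid - bid ** 2 / 2

value = 0.6
truthful = bdm_expected_utility(0.6, value)   # bid the true value
underbid = bdm_expected_utility(0.4, value)   # risks losing good deals
overbid = bdm_expected_utility(0.8, value)    # risks overpaying
```

Both deviations from truthful bidding lower expected utility, which is why BDM bids can be read as revealed subjective values.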
Affiliation(s)
- Daniel F Hill
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge CB2 3DY, United Kingdom
- Robert W Hickman
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge CB2 3DY, United Kingdom
- Alaa Al-Mohammad
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge CB2 3DY, United Kingdom
- Arkadiusz Stasiak
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge CB2 3DY, United Kingdom
- Wolfram Schultz
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge CB2 3DY, United Kingdom
4
Hughes NC, Qian H, Zargari M, Zhao Z, Singh B, Wang Z, Fulton JN, Johnson GW, Li R, Dawant BM, Englot DJ, Constantinidis C, Roberson SW, Bick SK. Reward Circuit Local Field Potential Modulations Precede Risk Taking. bioRxiv 2024:2024.04.10.588629. [PMID: 38645237] [PMCID: PMC11030333] [DOI: 10.1101/2024.04.10.588629]
Abstract
Risk-taking behavior is a symptom of multiple neuropsychiatric disorders and often lacks effective treatments. Neuroimaging studies have implicated reward circuitry regions, including the amygdala, orbitofrontal cortex, insula, and anterior cingulate, in risk-taking, but the electrophysiological activity associated with risk-taking in these regions is not well understood in humans. Further characterizing the neural signaling that underlies risk-taking may provide therapeutic insight into the associated disorders. Eleven patients with pharmacoresistant epilepsy who underwent stereotactic electroencephalography with electrodes in the amygdala, orbitofrontal cortex, insula, and/or anterior cingulate participated in a gambling task, wagering $5 or $20 that a visible playing card would be higher than a hidden card while local field potentials were recorded from the implanted electrodes. We used cluster-based permutation testing to identify reward prediction error signals by comparing oscillatory power following unexpected and expected rewards. We also used cluster-based permutation testing to compare power preceding high and low bets in high-risk (<50% chance of winning) trials, and a two-way ANOVA with bet and risk level to identify signals associated with risky, risk-averse, and optimized decisions. We used linear mixed-effects models to evaluate the relationship between reward prediction error and risky decision signals across trials, and a linear regression model to test associations between risky decision signal power and each patient's Barratt Impulsiveness Scale score. Reward prediction error signals were identified in the amygdala (p = 0.0066), anterior cingulate (p = 0.0092), and orbitofrontal cortex (p = 6.0E-4, p = 4.0E-4). Risky decisions were predicted by increased high-gamma power during card presentation in the orbitofrontal cortex (p = 0.0022), and by increased power following bet cue presentation in the theta-to-beta range in the orbitofrontal cortex (p = 0.0022), high-gamma in the anterior cingulate (p = 0.0004), and high-gamma in the insula (p = 0.0014). Risk-averse decisions were predicted by decreased orbitofrontal cortex gamma power (p = 2.0E-4). Optimized decisions that maximized earnings were preceded by power decreases in the theta-to-beta range in the orbitofrontal cortex (p = 2.0E-4), broad frequencies in the amygdala (p = 2.0E-4), and theta to low-gamma in the insula (p = 4.0E-4). Insula risky decision power was associated with the orbitofrontal cortex high-gamma reward prediction error signal (p = 0.0048) and with patient impulsivity (p = 0.00478). Our findings identify and help characterize reward circuitry activity predictive of risk-taking in humans. These findings may serve as potential biomarkers to inform the development of novel treatment strategies, such as closed-loop neuromodulation, for disorders of risk-taking.
5
Sasaki R, Ohta Y, Onoe H, Yamaguchi R, Miyamoto T, Tokuda T, Tamaki Y, Isa K, Takahashi J, Kobayashi K, Ohta J, Isa T. Balancing risk-return decisions by manipulating the mesofrontal circuits in primates. Science 2024; 383:55-61. [PMID: 38175903] [DOI: 10.1126/science.adj6645]
Abstract
Decision-making is always coupled with some level of risk, with more pathological forms of risk-taking decisions manifesting as gambling disorders. In macaque monkeys trained in a high risk-high return (HH) versus low risk-low return (LL) choice task, we found that the reversible pharmacological inactivation of ventral Brodmann area 6 (area 6V) impaired the risk dependency of decision-making. Selective optogenetic activation of the mesofrontal pathway from the ventral tegmental area (VTA) to the ventral aspect of 6V resulted in stronger preference for HH, whereas activation of the pathway from the VTA to the dorsal aspect of 6V led to LL preference. Finally, computational decoding captured the modulations of behavioral preference. Our results suggest that VTA inputs to area 6V determine the decision balance between HH and LL.
Affiliation(s)
- Ryo Sasaki
- Division of Physiology and Neurobiology, Department of Neuroscience, Graduate School of Medicine, Kyoto University, Kyoto-shi, Kyoto 606-8501, Japan
- Yasumi Ohta
- Division of Materials Science, Graduate School of Science and Technology, Nara Institute of Science and Technology, Ikoma-shi, Nara 630-0192, Japan
- Hirotaka Onoe
- Human Brain Research Center, Graduate School of Medicine, Kyoto University, Kyoto-shi, Kyoto 606-8507, Japan
- Reona Yamaguchi
- Institute for the Advanced Study of Human Biology (WPI-ASHBi), Kyoto University, Kyoto-shi, Kyoto 606-8501, Japan
- Takeshi Miyamoto
- Division of Physiology and Neurobiology, Department of Neuroscience, Graduate School of Medicine, Kyoto University, Kyoto-shi, Kyoto 606-8501, Japan
- Japan Society for the Promotion of Science, Chiyoda-ku, Tokyo 102-0083, Japan
- Takashi Tokuda
- Institute of Innovative Research, Tokyo Institute of Technology, Meguro-ku, Tokyo 152-8550, Japan
- Yuki Tamaki
- Division of Physiology and Neurobiology, Department of Neuroscience, Graduate School of Medicine, Kyoto University, Kyoto-shi, Kyoto 606-8501, Japan
- Kaoru Isa
- Division of Physiology and Neurobiology, Department of Neuroscience, Graduate School of Medicine, Kyoto University, Kyoto-shi, Kyoto 606-8501, Japan
- Jun Takahashi
- Department of Clinical Application, Center for iPS Cell Research and Application, Kyoto University, Kyoto-shi, Kyoto 606-8507, Japan
- Kenta Kobayashi
- Section of Viral Vector Development, National Institute for Physiological Sciences, Okazaki-shi, Aichi 444-8585, Japan
- Jun Ohta
- Division of Materials Science, Graduate School of Science and Technology, Nara Institute of Science and Technology, Ikoma-shi, Nara 630-0192, Japan
- Tadashi Isa
- Division of Physiology and Neurobiology, Department of Neuroscience, Graduate School of Medicine, Kyoto University, Kyoto-shi, Kyoto 606-8501, Japan
- Human Brain Research Center, Graduate School of Medicine, Kyoto University, Kyoto-shi, Kyoto 606-8507, Japan
- Institute for the Advanced Study of Human Biology (WPI-ASHBi), Kyoto University, Kyoto-shi, Kyoto 606-8501, Japan
6
Chan HK, Toyoizumi T. A multi-stage anticipated surprise model with dynamic expectation for economic decision-making. Sci Rep 2024; 14:657. [PMID: 38182692] [PMCID: PMC10770108] [DOI: 10.1038/s41598-023-50529-y]
Abstract
Many modeling works aim to explain behaviors that violate classical economic theories. However, these models often do not take full account of the multi-stage nature of real-life problems and people's tendency to solve complicated problems sequentially. In this work, we propose a descriptive decision-making model for multi-stage problems with perceived post-decision information. In the model, decisions are chosen based on a quantity we call the 'anticipated surprise'. The reference point is determined by the expected value of the possible outcomes, which we assume to change dynamically during the mental simulation of a sequence of events. We illustrate how our formalism can help us understand prominent economic paradoxes and gambling behaviors that involve multi-stage or sequential planning. We also discuss how neuroscience findings, such as prediction error signals and introspective neuronal replay, as well as psychological theories such as affective forecasting, relate to features of our model. This provides hints for future experiments to investigate the role of these quantities in decision-making.
Affiliation(s)
- Ho Ka Chan
- Laboratory for Neural Computation and Adaptation, RIKEN Center for Brain Science, Wako, Japan.
- Taro Toyoizumi
- Laboratory for Neural Computation and Adaptation, RIKEN Center for Brain Science, Wako, Japan.
- Department of Mathematical Informatics, Graduate School of Information Science and Technology, The University of Tokyo, Tokyo, Japan.
7
Konova AB, Ceceli AO, Horga G, Moeller SJ, Alia-Klein N, Goldstein RZ. Reduced neural encoding of utility prediction errors in cocaine addiction. Neuron 2023; 111:4058-4070.e6. [PMID: 37883973] [PMCID: PMC10880133] [DOI: 10.1016/j.neuron.2023.09.015]
Abstract
Influential accounts of addiction posit alterations in adaptive behavior driven by deficient dopaminergic prediction errors (PEs), which signal the discrepancy between actual and expected reward. Dopamine neurons encode these error signals in subjective terms, calibrated by individual risk preferences, as "utility" PEs. It remains unclear, however, whether people with drug addiction have PE deficits and, if so, what their computational source is. Here, using a task analogous to prior single-unit studies with known expectancies, we show that fMRI-measured PEs similarly reflect utility PEs. Relative to control participants, people with chronic cocaine addiction demonstrate reduced utility PEs in the dopaminoceptive ventral striatum, with similar trends in the orbitofrontal cortex. Dissecting this PE signal into its subcomponent terms attributed these reductions to weaker striatal responses to received reward/utility, whereas the suppression of activity with reward expectation was unchanged. These findings support the view that addiction may fundamentally disrupt PE signaling and reveal an underappreciated role for perceived reward value in this mechanism.
Affiliation(s)
- Anna B Konova
- Department of Psychiatry, University Behavioral Health Care & the Brain Health Institute, Rutgers University-New Brunswick, Piscataway, NJ 08855, USA
- Ahmet O Ceceli
- Departments of Psychiatry & Neuroscience, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Guillermo Horga
- Department of Psychiatry, Columbia University, New York, NY 10024, USA
- Scott J Moeller
- Department of Psychiatry, Renaissance School of Medicine at Stony Brook University, Stony Brook, NY 11794, USA
- Nelly Alia-Klein
- Departments of Psychiatry & Neuroscience, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Rita Z Goldstein
- Departments of Psychiatry & Neuroscience, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
8
Pinto SR, Uchida N. Tonic dopamine and biases in value learning linked through a biologically inspired reinforcement learning model. bioRxiv 2023:2023.11.10.566580. [PMID: 38014087] [PMCID: PMC10680794] [DOI: 10.1101/2023.11.10.566580]
Abstract
A hallmark of various psychiatric disorders is biased future predictions. Here we examined the mechanisms of biased value learning using reinforcement learning models that incorporate recent findings on synaptic plasticity and opponent circuit mechanisms in the basal ganglia. We show that variations in tonic dopamine can alter the balance between learning from positive and negative reward prediction errors, leading to biased value predictions. This bias arises from the sigmoidal shapes of the dose-occupancy curves and the distinct affinities of D1- and D2-type dopamine receptors: changes in tonic dopamine differentially alter the slopes of these receptors' dose-occupancy curves, and thus their sensitivities, at baseline dopamine concentrations. We show that this mechanism can explain biased value learning in both mice and humans and may also contribute to symptoms observed in psychiatric disorders. Our model provides a foundation for understanding the basal ganglia circuit and underscores the significance of tonic dopamine in modulating learning processes.
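The dose-occupancy argument in this abstract can be illustrated with the Hill-Langmuir equation; the dissociation constants below are placeholder orders of magnitude reflecting only that D2-like receptors bind dopamine with much higher affinity than D1-like receptors, not the paper's fitted values:

```python
def occupancy(c, kd):
    """Hill-Langmuir fractional receptor occupancy at ligand concentration c."""
    return c / (c + kd)

def sensitivity(c, kd):
    """Slope of the occupancy curve at c, i.e. how strongly a phasic
    dopamine transient changes occupancy around that baseline."""
    return kd / (c + kd) ** 2

KD_D2 = 10.0     # high-affinity D2-like receptor (illustrative value, nM)
KD_D1 = 1000.0   # low-affinity D1-like receptor (illustrative value, nM)

low, high = 20.0, 200.0  # two tonic dopamine baselines (nM)
# Raising tonic dopamine pushes D2 receptors toward saturation, collapsing
# their sensitivity, while D1 receptors stay on the shallow rising part of
# their curve; the asymmetry shifts the balance of learning from positive
# versus negative prediction errors.
d2_drop = sensitivity(low, KD_D2) / sensitivity(high, KD_D2)
d1_drop = sensitivity(low, KD_D1) / sensitivity(high, KD_D1)
```

With these placeholder affinities, the same baseline shift costs the D2 curve far more sensitivity than the D1 curve, which is the asymmetry the model exploits.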
Affiliation(s)
- Sandra Romero Pinto
- Department of Molecular and Cellular Biology, Center for Brain Science, Harvard University, Cambridge, MA 02138, USA
- Program in Speech and Hearing Bioscience and Technology, Division of Medical Sciences, Harvard Medical School, Boston, MA 02115, USA
- Naoshige Uchida
- Department of Molecular and Cellular Biology, Center for Brain Science, Harvard University, Cambridge, MA 02138, USA
9
Ferrari-Toniolo S, Schultz W. Reliable population code for subjective economic value from heterogeneous neuronal signals in primate orbitofrontal cortex. Neuron 2023; 111:3683-3696.e7. [PMID: 37678250] [DOI: 10.1016/j.neuron.2023.08.009]
Abstract
Behavior-related neuronal signals often vary between neurons, which might reflect the unreliability of individual neurons or a truly heterogeneous code. This notion may also apply to economic ("value-based") choices and the underlying reward signals. Reward value is subjective and can be described by a nonlinearly weighted magnitude (utility) and probability. Defining subjective values relies on the continuity axiom, whose testing involves structured variations of a wide range of reward magnitudes and probabilities. Axiom compliance demonstrates understanding of the stimuli and the meaningful character of choices. Using these tests, we investigated the encoding of subjective economic value by neurons in a key economic-decision structure of the monkey brain, the orbitofrontal cortex (OFC). We found that individual neurons carry heterogeneous neuronal value signals that largely fail to match the animal's choices. However, neuronal population signals matched the animal's choices well, suggesting accurate subjective economic value encoding by a heterogeneous population of unreliable neurons.
Affiliation(s)
- Simone Ferrari-Toniolo
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge, UK.
- Wolfram Schultz
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge, UK
10
Deng Y, Song D, Ni J, Qing H, Quan Z. Reward prediction error in learning-related behaviors. Front Neurosci 2023; 17:1171612. [PMID: 37662112] [PMCID: PMC10471312] [DOI: 10.3389/fnins.2023.1171612]
Abstract
Learning is a complex process during which our opinions and decisions are easily changed by unexpected information, but the neural mechanism underlying revision and correction during learning remains unclear. For decades, prediction error has been regarded as the core of changes to perception in learning, even driving learning itself. In this article, we reviewed the concept of reward prediction error, the encoding mechanisms of dopaminergic neurons, and the related neural circuits. We also discussed the relationship between reward prediction error and learning-related behaviors, including reversal learning. We then summarized evidence of reward prediction error signals in several diseases, including Parkinson's disease and addiction. These observations may help to better understand the regulatory role of reward prediction error in learning-related behaviors.
Affiliation(s)
- Yujun Deng
- Key Laboratory of Molecular Medicine and Biotherapy, School of Life Science, Beijing Institute of Technology, Beijing, China
- Da Song
- Key Laboratory of Molecular Medicine and Biotherapy, School of Life Science, Beijing Institute of Technology, Beijing, China
- Junjun Ni
- Key Laboratory of Molecular Medicine and Biotherapy, School of Life Science, Beijing Institute of Technology, Beijing, China
- Hong Qing
- Key Laboratory of Molecular Medicine and Biotherapy, School of Life Science, Beijing Institute of Technology, Beijing, China
- Department of Biology, Shenzhen MSU-BIT University, Shenzhen, China
- Zhenzhen Quan
- Key Laboratory of Molecular Medicine and Biotherapy, School of Life Science, Beijing Institute of Technology, Beijing, China
11
Pastor-Bernier A, Volkmann K, Chi U Seak L, Stasiak A, Plott CR, Schultz W. Studying neural responses for multi-component economic choices in human and non-human primates using concept-based behavioral choice experiments. STAR Protoc 2023; 4:102296. [PMID: 37294630] [PMCID: PMC10323126] [DOI: 10.1016/j.xpro.2023.102296]
Abstract
Realistic, everyday rewards contain multiple components, such as taste and size. However, our reward valuations and the associated neural reward signals are single-dimensional (a vector-to-scalar transformation). Here, we present a protocol to identify these single-dimensional neural responses to multi-component choice options in humans and monkeys using concept-based behavioral choice experiments. We describe the use of stringent economic concepts to develop and implement behavioral tasks. We detail regional neuroimaging in humans and fine-grained neurophysiology in monkeys and describe approaches for data analysis. For complete details on the use and execution of this protocol, please refer to our work in humans (Seak et al.; Pastor-Bernier et al.) and monkeys (Pastor-Bernier et al.).
Affiliation(s)
- Alexandre Pastor-Bernier
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge CB2 3DY, UK
- Konstantin Volkmann
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge CB2 3DY, UK
- Leo Chi U Seak
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge CB2 3DY, UK
- Arkadiusz Stasiak
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge CB2 3DY, UK
- Charles R Plott
- Division of Humanities and Social Sciences, California Institute of Technology, Pasadena, CA 91125, USA
- Wolfram Schultz
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge CB2 3DY, UK
12
Odland AU, Sandahl R, Andreasen JT. Chronic corticosterone improves perseverative behavior in mice during sequential reversal learning. Behav Brain Res 2023; 450:114479. [PMID: 37169127] [DOI: 10.1016/j.bbr.2023.114479]
Abstract
BACKGROUND: Stressful life events can both trigger the development of psychiatric disorders and promote positive behavioral changes in response to adversity. The relationship between stress and cognitive flexibility is complex, and conflicting effects of stress manifest in both humans and laboratory animals.
OBJECTIVE: To mirror the clinical situation in which stressful life events impair mental health or promote behavioral change, we examined the post-exposure effects of stress on cognitive flexibility in mice.
METHODS: We tested female C57BL/6JOlaHsd mice in the touchscreen-based sequential reversal learning test. Corticosterone (CORT) was used as a model of stress and was administered in the drinking water for two weeks before reversal learning; control animals received drinking water without CORT. Supplementary behavioral tests were included to exclude non-specific confounding effects of CORT and improve interpretation of the results.
RESULTS: CORT-treated mice were similar to controls on all touchscreen parameters before reversal. During the low-accuracy phase of reversal learning, CORT reduced the perseveration index, a measure of perseverative responding, but did not affect acquisition of the new reward contingency. This effect was not related to non-specific deficits in chamber activity. CORT increased anxiety-like behavior in the elevated zero maze test and repetitive digging in the marble burying test, and reduced locomotor activity, but did not affect spontaneous alternation behavior.
CONCLUSION: CORT improved cognitive flexibility in the reversal learning test by extinguishing prepotent responses that were no longer rewarded, an effect possibly related to a stress-mediated increase in sensitivity to negative feedback that should be confirmed in a larger study.
Affiliation(s)
- Anna U Odland
- Department of Drug Design and Pharmacology, University of Copenhagen, Universitetsparken 2, DK-2100, Copenhagen, Denmark
- Rune Sandahl
- Department of Drug Design and Pharmacology, University of Copenhagen, Universitetsparken 2, DK-2100, Copenhagen, Denmark
- Jesper T Andreasen
- Department of Drug Design and Pharmacology, University of Copenhagen, Universitetsparken 2, DK-2100, Copenhagen, Denmark
13
Hong T, Stauffer WR. Computational complexity drives sustained deliberation. Nat Neurosci 2023; 26:850-857. [PMID: 37095398] [PMCID: PMC10166852] [DOI: 10.1038/s41593-023-01307-6]
Abstract
Economic deliberations are slow, effortful, and intentional searches for solutions to difficult economic problems. Although such deliberations are critical for making sound decisions, the underlying reasoning strategies and neurobiological substrates remain poorly understood. Here, two nonhuman primates performed a combinatorial optimization task to identify valuable subsets and satisfy predefined constraints. Their behavior revealed evidence of combinatorial reasoning: when low-complexity algorithms that consider items one at a time provided optimal solutions, the animals adopted low-complexity reasoning strategies; when greater computational resources were required, the animals approximated high-complexity algorithms that search for optimal combinations. Deliberation times reflected the demands created by computational complexity: high-complexity algorithms require more operations and, concomitantly, the animals deliberated for longer durations. Recurrent neural networks that mimicked the low- and high-complexity algorithms also reproduced the behavioral deliberation times and were used to reveal algorithm-specific computations that support economic deliberation. These findings provide evidence for algorithm-based reasoning and establish a paradigm for studying the neurophysiological basis of sustained deliberation.
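The low- versus high-complexity contrast in this abstract can be made concrete with a toy knapsack-style subset problem (the items and capacity below are invented for illustration, not the task used in the study): a greedy rule that considers items one at a time is cheap but can miss the optimal combination that exhaustive search over all subsets finds.

```python
from itertools import combinations

ITEMS = [(10, 10), (7, 5), (7, 5)]  # (value, weight) pairs, illustrative only
CAPACITY = 10

def greedy(items, capacity):
    """Low-complexity strategy: take items one at a time, best value first."""
    total_v = total_w = 0
    for v, w in sorted(items, reverse=True):
        if total_w + w <= capacity:
            total_v += v
            total_w += w
    return total_v

def exhaustive(items, capacity):
    """High-complexity strategy: check all 2^n subsets for the optimum."""
    best = 0
    for r in range(len(items) + 1):
        for combo in combinations(items, r):
            if sum(w for _, w in combo) <= capacity:
                best = max(best, sum(v for v, _ in combo))
    return best
```

Here greedy grabs the single high-value item and fills the capacity, while exhaustive search finds the better two-item combination, at the cost of exponentially more operations, which is the kind of demand that scales deliberation time.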
Affiliation(s)
- Tao Hong
- Department of Neurobiology, University of Pittsburgh, Pittsburgh, PA, USA
- Program in Neural Computation, Carnegie Mellon University, Pittsburgh, PA, USA
- Center for the Neural Basis of Cognition, Pittsburgh, PA, USA
- William R Stauffer
- Department of Neurobiology, University of Pittsburgh, Pittsburgh, PA, USA
- Program in Neural Computation, Carnegie Mellon University, Pittsburgh, PA, USA
- Center for the Neural Basis of Cognition, Pittsburgh, PA, USA
14
Huang FY, Grabenhorst F. Nutrient-Sensitive Reinforcement Learning in Monkeys. J Neurosci 2023; 43:1714-1730. [PMID: 36669886] [PMCID: PMC10010454] [DOI: 10.1523/jneurosci.0752-22.2022]
Abstract
In reinforcement learning (RL), animals choose by assigning values to options and learn by updating these values from reward outcomes. This framework has been instrumental in identifying fundamental learning variables and their neuronal implementations. However, canonical RL models do not explain how reward values are constructed from biologically critical intrinsic reward components, such as nutrients. From an ecological perspective, animals should adapt their foraging choices in dynamic environments to acquire nutrients that are essential for survival. Here, to advance the biological and ecological validity of RL models, we investigated how (male) monkeys adapt their choices to obtain preferred nutrient rewards under varying reward probabilities. We found that the nutrient composition of rewards strongly influenced learning and choices. The animals' preferences for specific nutrients (sugar, fat) affected how they adapted to changing reward probabilities; the history of recent rewards influenced the monkeys' choices more strongly if these rewards contained their preferred nutrients (nutrient-specific reward history). The monkeys also chose preferred nutrients even when they were associated with lower reward probability. A nutrient-sensitive RL model captured these processes; it updated the values of individual sugar and fat components of expected rewards based on experience and integrated them into subjective values that explained the monkeys' choices. Nutrient-specific reward prediction errors guided this value-updating process. Our results identify nutrients as important reward components that guide learning and choice by influencing the subjective value of choice options. Extending RL models with nutrient-value functions may enhance their biological validity and uncover nutrient-specific learning and decision variables.
SIGNIFICANCE STATEMENT RL is an influential framework that formalizes how animals learn from experienced rewards. Although reward is a foundational concept in RL theory, canonical RL models cannot explain how learning depends on specific reward properties, such as nutrients. Intuitively, learning should be sensitive to the nutrient components of the reward to benefit health and survival. Here, we show that the nutrient (fat, sugar) composition of rewards affects how monkeys choose and learn in an RL paradigm, and that key learning variables, including reward history and reward prediction error, should be modified with nutrient-specific components to account for the observed choice behavior. By incorporating biologically critical nutrient rewards into the RL framework, our findings help advance the ecological validity of RL models.
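The nutrient-sensitive update the abstract describes can be sketched as follows. This is our illustration, not the authors' model code: separate sugar and fat value components, each updated by its own nutrient-specific prediction error, are combined into one subjective value. The learning rate and preference weights are made-up parameters.

```python
# Hypothetical sketch of nutrient-sensitive RL (not the authors' code).
# Each nutrient component has its own value estimate, updated by its own
# prediction error; components are integrated via preference weights.
ALPHA = 0.3                          # learning rate (assumed)
PREF = {"sugar": 0.7, "fat": 0.3}    # nutrient preference weights (assumed)

def update(values, outcome):
    """Update per-nutrient values from one reward outcome (dicts keyed by nutrient)."""
    for nutrient, delivered in outcome.items():
        rpe = delivered - values[nutrient]     # nutrient-specific prediction error
        values[nutrient] += ALPHA * rpe
    return values

def subjective_value(values):
    """Integrate nutrient components into one scalar subjective value."""
    return sum(PREF[n] * v for n, v in values.items())

v = {"sugar": 0.0, "fat": 0.0}
for _ in range(50):                            # repeated sugar-rich outcomes
    update(v, {"sugar": 1.0, "fat": 0.2})
print(round(subjective_value(v), 3))
```

After many identical outcomes the component values converge to the delivered amounts, so the integrated value settles near 0.7 * 1.0 + 0.3 * 0.2 = 0.76.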
Affiliation(s)
- Fei-Yang Huang
- Department of Experimental Psychology, University of Oxford, Oxford OX1 3TA, United Kingdom
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge CB2 3DY, United Kingdom
- Fabian Grabenhorst
- Department of Experimental Psychology, University of Oxford, Oxford OX1 3TA, United Kingdom
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge CB2 3DY, United Kingdom
15
Seak LCU, Ferrari-Toniolo S, Jain R, Nielsen K, Schultz W. Systematic comparison of risky choices in humans and monkeys. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.02.07.527517. [PMID: 36798272 PMCID: PMC9934584 DOI: 10.1101/2023.02.07.527517] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/18/2023]
Abstract
The past decades have seen tremendous progress in fundamental studies of economic choice in humans. However, elucidating the underlying neuronal processes requires invasive neurophysiological studies that are difficult to perform in humans. Monkeys, as our closest evolutionary relatives, offer a solution. The animals display sophisticated and well-controllable behavior that makes it possible to implement key constructs of proven economic choice theories. However, the similarity of economic choice between the two species has never been systematically investigated. We investigated compliance with the independence axiom (IA) of expected utility theory, one of the most demanding choice tests, and compared IA violations between humans and monkeys. Using generalized linear modeling and cumulative prospect theory (CPT), we found that humans and monkeys made comparable risky choices, although their subjective values (utilities) differed. These results suggest similar fundamental choice mechanisms across these primate species and encourage the study of their underlying neurophysiological mechanisms.
Affiliation(s)
- Leo Chi U Seak
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge CB2 3DY, United Kingdom
- Simone Ferrari-Toniolo
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge CB2 3DY, United Kingdom
- Ritesh Jain
- Management School, University of Liverpool, Liverpool L69 7ZY, United Kingdom
- Kirby Nielsen
- Division of the Humanities and Social Sciences, California Institute of Technology, Pasadena CA 91125, USA
- Wolfram Schultz
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge CB2 3DY, United Kingdom
16
Mkrtchian A, Valton V, Roiser JP. Reliability of Decision-Making and Reinforcement Learning Computational Parameters. COMPUTATIONAL PSYCHIATRY (CAMBRIDGE, MASS.) 2023; 7:30-46. [PMID: 38774643 PMCID: PMC11104400 DOI: 10.5334/cpsy.86] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/15/2021] [Accepted: 01/23/2023] [Indexed: 02/11/2023]
Abstract
Computational models can offer mechanistic insight into cognition and therefore have the potential to transform our understanding of psychiatric disorders and their treatment. For translational efforts to be successful, it is imperative that computational measures capture individual characteristics reliably. Here we examine the reliability of reinforcement learning and economic models derived from two commonly used tasks. Healthy individuals (N = 50) completed a restless four-armed bandit and a calibrated gambling task twice, two weeks apart. Reward and punishment learning rates from the reinforcement learning model showed good reliability, and reward and punishment sensitivity from the same model had fair reliability, while risk aversion and loss aversion parameters from a prospect theory model exhibited good and excellent reliability, respectively. Both models were further able to predict future behaviour above chance within individuals. This prediction was better when based on participants' own model parameters than on other participants' parameter estimates. These results suggest that reinforcement learning, and particularly prospect theory, parameters, as derived from a restless four-armed bandit and a calibrated gambling task, can be measured reliably to assess learning and decision-making mechanisms. Overall, these findings indicate the translational potential of clinically relevant computational parameters for precision psychiatry.
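A test-retest reliability check of the kind described above can be sketched as follows. The paper reports intraclass-correlation-based reliability; here a plain Pearson correlation between two sessions is shown as a simpler stand-in, and the parameter values are made up.

```python
# Illustrative test-retest reliability of fitted model parameters (our sketch;
# a Pearson correlation stands in for the paper's ICC-based analysis).
import math

def pearson(x, y):
    """Pearson correlation between two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

# Hypothetical learning-rate estimates for 6 participants, two weeks apart.
session1 = [0.10, 0.25, 0.40, 0.55, 0.70, 0.85]
session2 = [0.12, 0.22, 0.45, 0.50, 0.75, 0.80]

r = pearson(session1, session2)
print(round(r, 3))
```

A parameter is useful for precision psychiatry only if this between-session correlation stays high; with the made-up values above it is close to 1.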
Affiliation(s)
- Anahit Mkrtchian
- Neuroscience and Mental Health Group, Institute of Cognitive Neuroscience, University College London, London, United Kingdom
- Applied Computational Psychiatry Lab, Mental Health Neuroscience Department, Division of Psychiatry and Max Planck Centre for Computational Psychiatry and Ageing Research, Queen Square Institute of Neurology, University College London, London, United Kingdom
- Vincent Valton
- Neuroscience and Mental Health Group, Institute of Cognitive Neuroscience, University College London, London, United Kingdom
- Jonathan P. Roiser
- Neuroscience and Mental Health Group, Institute of Cognitive Neuroscience, University College London, London, United Kingdom
17
A neuronal prospect theory model in the brain reward circuitry. Nat Commun 2022; 13:5855. [PMID: 36195765 PMCID: PMC9532451 DOI: 10.1038/s41467-022-33579-0] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2022] [Accepted: 09/22/2022] [Indexed: 11/23/2022] Open
Abstract
Prospect theory, arguably the most prominent theory of choice, is an obvious candidate for neural valuation models. How the activity of individual neurons, a possible computational unit, obeys prospect theory remains unknown. Here, we show, with theoretical accuracy equivalent to that of human neuroimaging studies, that single-neuron activity in four core reward-related cortical and subcortical regions represents the subjective valuation of risky gambles in monkeys. The activity of individual neurons in monkeys passively viewing a lottery reflects the desirability of probabilistic rewards parameterized as a multiplicative combination of utility and probability weighting functions, as in the prospect theory framework. The diverse patterns of valuation signals were not localized but distributed throughout most parts of the reward circuitry. A network model aggregating these signals reconstructed the risk preferences and subjective probability weighting revealed by the animals’ choices. Thus, distributed neural coding explains the computation of subjective valuations under risk. It is unclear how the activity of individual neurons conforms to prospect theory. Here, the authors demonstrate that the activity of single neurons in various reward-related regions in the monkey brain can be described as encoding a multiplicative combination of utility and probability weighting, and that this subjective valuation process is achieved via a distributed coding scheme.
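The valuation scheme in this abstract, a multiplicative combination of utility and probability weighting, can be sketched as follows. The power utility and one-parameter Prelec weighting below are common textbook forms with made-up parameters, not the paper's fitted functions.

```python
# Prospect-theory-style valuation sketch (textbook functional forms, assumed
# parameters): subjective value = probability weight * utility of magnitude.
import math

def utility(x, rho=0.8):
    """Concave power utility over reward magnitude."""
    return x ** rho

def weight(p, gamma=0.6):
    """Prelec probability weighting: overweights small p, underweights large p."""
    return math.exp(-(-math.log(p)) ** gamma)

def value(x, p):
    """Multiplicative combination, as in the prospect theory framework."""
    return weight(p) * utility(x)

# The weighting function distorts objective probabilities:
print(weight(0.05) > 0.05)   # True: small probabilities are overweighted
print(weight(0.95) < 0.95)   # True: large probabilities are underweighted
```

In the paper's framework, single-neuron activity is described as tracking such a value signal; here the point is only the shape of the computation.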
18
Karin O, Alon U. The dopamine circuit as a reward-taxis navigation system. PLoS Comput Biol 2022; 18:e1010340. [PMID: 35877694 PMCID: PMC9352198 DOI: 10.1371/journal.pcbi.1010340] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2022] [Revised: 08/04/2022] [Accepted: 06/29/2022] [Indexed: 01/29/2023] Open
Abstract
Studying the brain circuits that control behavior is challenging, since in addition to their structural complexity there are continuous feedback interactions between actions and sensed inputs from the environment. It is therefore important to identify mathematical principles that can be used to develop testable hypotheses. In this study, we use ideas and concepts from systems biology to study the dopamine system, which controls learning, motivation, and movement. Using data from neuronal recordings in behavioral experiments, we developed a mathematical model for dopamine responses and the effect of dopamine on movement. We show that the dopamine system shares core functional analogies with bacterial chemotaxis. Just as chemotaxis robustly climbs chemical attractant gradients, the dopamine circuit performs ‘reward-taxis’ where the attractant is the expected value of reward. The reward-taxis mechanism provides a simple explanation for scale-invariant dopaminergic responses and for matching in free operant settings, and makes testable quantitative predictions. We propose that reward-taxis is a simple and robust navigation strategy that complements other, more goal-directed navigation mechanisms.
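The scale invariance at the heart of the reward-taxis analogy can be sketched as follows. This is our illustration of the property, under the assumption (consistent with the abstract's framing) that the dopamine-like response tracks the temporal derivative of the log of expected reward; the reward trajectory is made up.

```python
# Sketch of scale-invariant 'reward-taxis' responses (our illustration).
# If the response is d/dt log(expected reward), multiplying all rewards by a
# constant leaves the response unchanged, analogous to fold-change detection
# in bacterial chemotaxis.
import math

def log_derivative_response(rewards, dt=1.0):
    """Discrete-time derivative of log expected reward."""
    return [(math.log(b) - math.log(a)) / dt for a, b in zip(rewards, rewards[1:])]

trajectory = [1.0, 2.0, 4.0, 8.0]        # expected reward climbing a gradient
scaled = [5.0 * r for r in trajectory]   # same gradient, 5x richer environment

print(log_derivative_response(trajectory))
print(log_derivative_response(scaled))   # same values (up to floating point)
```

Because the response depends only on relative changes, the same navigation strategy works across rich and poor environments, which is the robustness property the abstract emphasizes.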
Affiliation(s)
- Omer Karin
- Dept. of Molecular Cell Biology, Weizmann Institute of Science, Rehovot, Israel
- Dept. of Applied Mathematics and Theoretical Physics, Centre for Mathematical Sciences, University of Cambridge, Cambridge, United Kingdom
- Wellcome Trust/Cancer Research UK Gurdon Institute, University of Cambridge, Cambridge, United Kingdom
- Uri Alon
- Dept. of Molecular Cell Biology, Weizmann Institute of Science, Rehovot, Israel
19
Ferrari-Toniolo S, Seak LCU, Schultz W. Risky choice: Probability weighting explains independence axiom violations in monkeys. JOURNAL OF RISK AND UNCERTAINTY 2022; 65:319-351. [PMID: 36654986 PMCID: PMC9840594 DOI: 10.1007/s11166-022-09388-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Accepted: 06/02/2022] [Indexed: 06/17/2023]
Abstract
Expected Utility Theory (EUT) provides axioms for maximizing utility in risky choice. The Independence Axiom (IA) is its most demanding axiom: preferences between two options should not change when altering both options equally by mixing them with a common gamble. We tested common consequence (CC) and common ratio (CR) violations of the IA over several months in thousands of stochastic choices using a large variety of binary option sets. Three monkeys showed consistently few outright Preference Reversals (8%) but substantial graded Preference Changes (46%) between the initial preferred gamble and the corresponding altered gamble. Linear Discriminant Analysis (LDA) indicated that gamble probabilities predicted most Preference Changes in CC (72%) and CR (88%) tests. The Akaike Information Criterion indicated that probability weighting within Cumulative Prospect Theory (CPT) explained choices better than models using Expected Value (EV) or EUT. Fitting by utility and probability weighting functions of CPT resulted in nonlinear and non-parallel indifference curves (IC) in the Marschak-Machina triangle and suggested IA non-compliance of models using EV or EUT. Indeed, CPT models predicted Preference Changes better than EV and EUT models. Indifference points in out-of-sample tests were closer to CPT-estimated ICs than EV and EUT ICs. Finally, while the few outright Preference Reversals may reflect the long experience of our monkeys, their more graded Preference Changes corresponded to those reported for humans. Benefitting from the wide testing possibilities in monkeys, our stringent axiomatic tests contribute critical information about risky decision-making and serve as a basis for investigating neuronal decision mechanisms. Supplementary information: The online version contains supplementary material available at 10.1007/s11166-022-09388-7.
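A common-ratio (CR) test of the kind described above can be sketched numerically. This is our illustration with made-up parameters, not the paper's fits: under EUT, scaling both gambles' probabilities by a common ratio cannot reverse a preference, but a CPT-style nonlinear probability weighting can produce exactly such a reversal.

```python
# Illustrative common-ratio IA violation via probability weighting (assumed
# Prelec weighting and linear utility; parameters are made up).
import math

def w(p, gamma=0.5):
    """Prelec probability weighting; certain outcomes keep weight 1."""
    if p >= 1.0:
        return 1.0
    return math.exp(-(-math.log(p)) ** gamma)

def cpt_value(x, p):
    """Linear utility isolates the effect of probability weighting."""
    return w(p) * x

# Original pair: safe (3 for sure) vs risky (4 with p = 0.8).
safe, risky = cpt_value(3, 1.0), cpt_value(4, 0.8)
# Common-ratio pair: both probabilities scaled by 0.25.
safe_cr, risky_cr = cpt_value(3, 0.25), cpt_value(4, 0.2)

print(safe > risky)        # True: prefers the safe option...
print(safe_cr < risky_cr)  # True: ...but reverses after common-ratio scaling
```

The reversal arises because the weighting function compresses the difference between 0.25 and 0.2 much more than the difference between 1.0 and 0.8, which is the mechanism the abstract identifies behind CR violations.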
Affiliation(s)
- Simone Ferrari-Toniolo
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge, UK
- Leo Chi U Seak
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge, UK
- Wolfram Schultz
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge, UK
20
Louie K. Asymmetric and adaptive reward coding via normalized reinforcement learning. PLoS Comput Biol 2022; 18:e1010350. [PMID: 35862443 PMCID: PMC9345478 DOI: 10.1371/journal.pcbi.1010350] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2022] [Revised: 08/02/2022] [Accepted: 07/01/2022] [Indexed: 11/18/2022] Open
Abstract
Learning is widely modeled in psychology, neuroscience, and computer science by prediction error-guided reinforcement learning (RL) algorithms. While standard RL assumes linear reward functions, reward-related neural activity is a saturating, nonlinear function of reward; however, the computational and behavioral implications of nonlinear RL are unknown. Here, we show that nonlinear RL incorporating the canonical divisive normalization computation introduces an intrinsic and tunable asymmetry in prediction error coding. At the behavioral level, this asymmetry explains empirical variability in risk preferences typically attributed to asymmetric learning rates. At the neural level, diversity in asymmetries provides a computational mechanism for recently proposed theories of distributional RL, allowing the brain to learn the full probability distribution of future rewards. This behavioral and computational flexibility argues for an incorporation of biologically valid value functions in computational models of learning and decision-making. Reinforcement learning models are widely used to characterize reward-driven learning in biological and computational agents. Standard reinforcement learning models use linear value functions, despite strong empirical evidence that biological value representations are nonlinear functions of external rewards. Here, we examine the properties of a biologically-based nonlinear reinforcement learning algorithm employing the canonical divisive normalization function, a neural computation commonly found in sensory, cognitive, and reward coding. We show that this normalized reinforcement learning algorithm implements a simple but powerful control of how reward learning reflects relative gains and losses. This property explains diverse behavioral and neural phenomena, and suggests the importance of using biologically valid value functions in computational models of learning and decision-making.
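The core mechanism of normalized reinforcement learning described above can be sketched as follows. This is our reading of the abstract, with made-up parameters: the reward enters the prediction error through a divisively normalized, saturating value function, whose curvature makes equal-sized gains and losses produce unequal prediction errors.

```python
# Sketch of normalized RL (our illustration, assumed parameters): a divisively
# normalized value function u(r) = r / (sigma + r) creates an intrinsic,
# sigma-tunable asymmetry in prediction-error coding.
SIGMA = 1.0   # normalization constant (assumed)
ALPHA = 0.5   # learning rate (assumed)

def u(r):
    """Divisive normalization: saturating, nonlinear value of reward r >= 0."""
    return r / (SIGMA + r)

def nrl_update(v, reward):
    """Standard delta-rule update, but on the normalized reward value."""
    return v + ALPHA * (u(reward) - v)

# Around a reference reward of 1.0, a +0.5 gain moves u less than a -0.5 loss:
gain = u(1.5) - u(1.0)
loss = u(1.0) - u(0.5)
print(gain < loss)   # True: asymmetric coding of equal-sized gains and losses
```

Varying SIGMA tunes this asymmetry, which is the behavioral and computational flexibility the abstract links to risk preferences and distributional RL.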
Affiliation(s)
- Kenway Louie
- Center for Neural Science, New York University, New York, United States of America
- Neuroscience Institute, New York University Grossman School of Medicine, New York, United States of America
21
Efficient coding of cognitive variables underlies dopamine response and choice behavior. Nat Neurosci 2022; 25:738-748. [PMID: 35668173 DOI: 10.1038/s41593-022-01085-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2020] [Accepted: 04/26/2022] [Indexed: 11/26/2022]
Abstract
Reward expectations based on internal knowledge of the external environment are a core component of adaptive behavior. However, internal knowledge may be inaccurate or incomplete due to errors in sensory measurements. Some features of the environment may also be encoded inaccurately to minimize representational costs associated with their processing. In this study, we investigated how reward expectations are affected by features of internal representations by studying behavior and dopaminergic activity while mice make time-based decisions. We show that several possible representations allow a reinforcement learning agent to model animals' overall performance during the task. However, only a small subset of highly compressed representations simultaneously reproduced the co-variability in animals' choice behavior and dopaminergic activity. Strikingly, these representations predict an unusual distribution of response times that closely match animals' behavior. These results inform how constraints of representational efficiency may be expressed in encoding representations of dynamic cognitive variables used for reward-based computations.
22
Bujold PM, Seak LCU, Schultz W, Ferrari-Toniolo S. Comparing utility functions between risky and riskless choice in rhesus monkeys. Anim Cogn 2022; 25:385-399. [PMID: 34568979 PMCID: PMC8940808 DOI: 10.1007/s10071-021-01560-x] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2021] [Revised: 08/10/2021] [Accepted: 09/14/2021] [Indexed: 11/29/2022]
Abstract
Decisions can be risky or riskless, depending on the outcomes of the choice. Expected utility theory describes risky choices as a utility maximization process: we choose the option with the highest subjective value (utility), which we compute considering both the option's value and its associated risk. According to the random utility maximization framework, riskless choices could also be based on a utility measure. Neuronal mechanisms of utility-based choice may thus be common to both risky and riskless choices. This assumption would require the existence of a utility function that accounts for both risky and riskless decisions. Here, we investigated whether the choice behavior of two macaque monkeys in risky and riskless decisions could be described by a common underlying utility function. We found that the utility functions elicited in the two choice scenarios were different from each other, even after taking into account the contribution of subjective probability weighting. Our results suggest that distinct utility representations exist for risky and riskless choices, which could reflect distinct neuronal representations of the utility quantities, or distinct brain mechanisms for risky and riskless choices. The different utility functions should be taken into account in neuronal investigations of utility-based choice.
Affiliation(s)
- Philipe M. Bujold
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge, CB2 3DY UK
- Leo Chi U. Seak
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge, CB2 3DY UK
- Wolfram Schultz
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge, CB2 3DY UK
- Simone Ferrari-Toniolo
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge, CB2 3DY UK
23
Soutschek A, Jetter A, Tobler PN. Towards a Unifying Account of Dopamine’s Role in Cost-Benefit Decision Making. BIOLOGICAL PSYCHIATRY GLOBAL OPEN SCIENCE 2022; 3:179-186. [PMID: 37124350 PMCID: PMC10140448 DOI: 10.1016/j.bpsgos.2022.02.010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2022] [Revised: 02/25/2022] [Accepted: 02/25/2022] [Indexed: 10/18/2022] Open
Abstract
Dopamine is thought to play a crucial role in cost-benefit decision making, but so far there is no consensus on the precise role of dopamine in decision making. Here, we review the literature on dopaminergic manipulations of cost-benefit decision making in humans and evaluate how well different theoretical accounts explain the existing body of evidence. Reduced D2 stimulation tends to increase the willingness to bear delay and risk costs (i.e., wait for later rewards, take riskier options), while increased D1 and D2 receptor stimulation increases willingness to bear effort costs. We argue that the empirical findings can best be explained by combining the strengths of two theoretical accounts: in cost-benefit decision making, dopamine may play a dual role both in promoting the pursuit of psychologically close options (e.g., sooner and safer rewards) and in computing which costs are acceptable for a reward at stake. Moreover, we identify several limiting factors in the study designs of previous investigations that prevented a fuller understanding of dopamine's role in value-based choice. Together, the proposed theoretical framework and the methodological suggestions for future studies may bring us closer to a unifying account of dopamine in healthy and impaired cost-benefit decision making.
24
Al-Mohammad A, Schultz W. Reward Value Revealed by Auction in Rhesus Monkeys. J Neurosci 2022; 42:1510-1528. [PMID: 34937703 PMCID: PMC8883853 DOI: 10.1523/jneurosci.1275-21.2021] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2021] [Revised: 11/25/2021] [Accepted: 12/01/2021] [Indexed: 11/21/2022] Open
Abstract
Economic choice is thought to involve the elicitation of the subjective values of the choice options. Thus far, value estimation in animals has relied on stochastic choices between multiple options presented in repeated trials and expressed as averages over dozens of trials. However, subjective reward valuations are made moment-to-moment and do not always require alternative options; their consequences are usually felt immediately. Here, we describe a Becker-DeGroot-Marschak (BDM) auction-like mechanism that provides more direct and simple valuations with immediate consequences. The BDM encourages agents to reveal their true subjective value in individual choices ("incentive compatibility"). Male monkeys reliably placed well-ranked BDM bids for up to five juice volumes while paying from a water budget. The bids closely approximated the average subjective values estimated with conventional binary choices (BCs), thus demonstrating procedural invariance and aligning with the wealth of knowledge acquired with these less direct estimation methods. The feasibility of BDM bidding in monkeys paves the way for an analysis of subjective neuronal value signals in single trials rather than from averages; it also bridges the gap to the increasingly used BDM method in human neuroeconomics.
SIGNIFICANCE STATEMENT The subjective economic value of rewards cannot be measured directly but must be inferred from observable behavior. Until now, the estimation method in animals was rather complex and required comparison between several choice options during repeated choices; thus, such methods did not respect the imminence of the outcome from individual choices. However, human economic research has developed a simple auction-like procedure that can reveal, in a direct and immediate manner, the true subjective value in individual choices [the Becker-DeGroot-Marschak (BDM) mechanism]. The current study implemented this mechanism in rhesus monkeys and demonstrates its usefulness for eliciting meaningful value estimates of liquid rewards. The mechanism allows future neurophysiological assessment of subjective reward value signals in single trials of controlled animal tasks.
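The incentive compatibility of the BDM mechanism can be sketched with a small simulation. This is the textbook argument, not the authors' code, and all numbers are made up: a random price is drawn, and if the bid exceeds the price the bidder pays the price (not the bid) and receives the good; expected surplus is then maximized by bidding the true subjective value.

```python
# Monte Carlo sketch of BDM incentive compatibility (textbook mechanism,
# made-up values): truthful bidding maximizes expected surplus.
import random

def expected_surplus(bid, true_value, n=200_000, seed=0):
    """Average surplus when a uniform random price decides win and payment."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(n):
        price = rng.uniform(0.0, 1.0)   # random price draw
        if bid >= price:
            total += true_value - price # win: pay the price, gain the value
    return total / n

TRUE_VALUE = 0.6   # hypothetical subjective value of the reward
truthful = expected_surplus(TRUE_VALUE, TRUE_VALUE)
overbid = expected_surplus(0.9, TRUE_VALUE)
underbid = expected_surplus(0.3, TRUE_VALUE)
print(truthful >= overbid and truthful >= underbid)   # True: honesty is optimal
```

Overbidding risks paying prices above the reward's value, while underbidding forgoes profitable wins; this is why the bids themselves can serve as direct single-trial value estimates.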
Affiliation(s)
- Alaa Al-Mohammad
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge CB2 3DY, United Kingdom
- Wolfram Schultz
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge CB2 3DY, United Kingdom
25
Seeking motivation and reward: roles of dopamine, hippocampus and supramammillo-septal pathway. Prog Neurobiol 2022; 212:102252. [PMID: 35227866 PMCID: PMC8961455 DOI: 10.1016/j.pneurobio.2022.102252] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2021] [Revised: 02/09/2022] [Accepted: 02/23/2022] [Indexed: 01/07/2023]
Abstract
Reinforcement learning and goal-seeking behavior are thought to be mediated by midbrain dopamine neurons. However, little is known about neural substrates of curiosity and exploratory behavior, which occur in the absence of clear goal or reward. This is despite behavioral scientists having long suggested that curiosity and exploratory behaviors are regulated by an innate drive. We refer to such behavior as information-seeking behavior and propose 1) key neural substrates and 2) the concept of environment prediction error as a framework to understand information-seeking processes. The cognitive aspect of information-seeking behavior, including the perception of salience and uncertainty, involves, in part, the pathways from the posterior hypothalamic supramammillary region to the hippocampal formation. The vigor of such behavior is modulated by the following: supramammillary glutamatergic neurons; their projections to medial septal glutamatergic neurons; and the projections of medial septal glutamatergic neurons to ventral tegmental dopaminergic neurons. Phasic responses of dopaminergic neurons are characterized as signaling potentially important stimuli rather than rewards. This paper describes how novel stimuli and uncertainty trigger seeking motivation and how these neural substrates modulate information-seeking behavior.
26
He J, Kleyman M, Chen J, Alikaya A, Rothenhoefer KM, Ozturk BE, Wirthlin M, Bostan AC, Fish K, Byrne LC, Pfenning AR, Stauffer WR. Transcriptional and anatomical diversity of medium spiny neurons in the primate striatum. Curr Biol 2021; 31:5473-5486.e6. [PMID: 34727523 PMCID: PMC9359438 DOI: 10.1016/j.cub.2021.10.015] [Citation(s) in RCA: 27] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2021] [Revised: 09/17/2021] [Accepted: 10/06/2021] [Indexed: 10/20/2022]
Abstract
Medium spiny neurons (MSNs) constitute the vast majority of striatal neurons and the principal interface between dopamine reward signals and functionally diverse cortico-basal ganglia circuits. Information processing in these circuits is dependent on distinct MSN types: cell types that are traditionally defined according to their projection targets or dopamine receptor expression. Single-cell transcriptional studies have revealed greater MSN heterogeneity than predicted by traditional circuit models, but the transcriptional landscape in the primate striatum remains unknown. Here, we set out to establish molecular definitions for MSN subtypes in Rhesus monkeys and to explore the relationships between transcriptionally defined subtypes and anatomical subdivisions of the striatum. Our results suggest at least nine MSN subtypes, including dorsal striatum subtypes associated with striosome and matrix compartments, ventral striatum subtypes associated with the nucleus accumbens shell and olfactory tubercle, and an MSN-like cell type restricted to μ-opioid receptor rich islands in the ventral striatum. Although each subtype was demarcated by discontinuities in gene expression, continuous variation within subtypes defined gradients corresponding to anatomical locations and, potentially, functional specializations. These results lay the foundation for achieving cell-type-specific transgenesis in the primate striatum and provide a blueprint for investigating circuit-specific information processing.
Affiliation(s)
- Jing He
- Department of Neurobiology, Systems Neuroscience Center, Brain Institute, Center for Neuroscience, Center for the Neural Basis of Cognition, University of Pittsburgh, 3501 Fifth Avenue, Pittsburgh, PA 15213, USA
- Michael Kleyman
- Department of Computational Biology, School of Computer Science, Neuroscience Institute, Center for the Neural Basis of Cognition, Carnegie Mellon University, 5000 Forbes Avenue, Pittsburgh, PA 15213, USA
- Jianjiao Chen
- Department of Neurobiology, Systems Neuroscience Center, Brain Institute, Center for Neuroscience, Center for the Neural Basis of Cognition, University of Pittsburgh, 3501 Fifth Avenue, Pittsburgh, PA 15213, USA
- Aydin Alikaya
- Department of Neurobiology, Systems Neuroscience Center, Brain Institute, Center for Neuroscience, Center for the Neural Basis of Cognition, University of Pittsburgh, 3501 Fifth Avenue, Pittsburgh, PA 15213, USA
- Kathryn M Rothenhoefer
- Department of Neurobiology, Systems Neuroscience Center, Brain Institute, Center for Neuroscience, Center for the Neural Basis of Cognition, University of Pittsburgh, 3501 Fifth Avenue, Pittsburgh, PA 15213, USA
- Bilge Esin Ozturk
- Department of Ophthalmology, Brain Institute, Center for Neuroscience, Center for the Neural Basis of Cognition, University of Pittsburgh, Pittsburgh, PA 15213, USA
| | - Morgan Wirthlin
- Department of Computational Biology, School of Computer Science, Neuroscience Institute, Center for the Neural Basis of Cognition, Carnegie Mellon University, 5000 Forbes Avenue, Pittsburgh, PA 15213, USA
| | - Andreea C Bostan
- Department of Neurobiology, Systems Neuroscience Center, Brain Institute, Center for Neuroscience, Center for the Neural Basis of Cognition, University of Pittsburgh, 3501 Fifth Avenue, Pittsburgh, PA 15213, USA
| | - Kenneth Fish
- Department of Psychiatry, University of Pittsburgh, Pittsburgh, PA 15213, USA
| | - Leah C Byrne
- Department of Ophthalmology, Brain Institute, Center for Neuroscience, Center for the Neural Basis of Cognition, University of Pittsburgh, Pittsburgh, PA 15213, USA
| | - Andreas R Pfenning
- Department of Computational Biology, School of Computer Science, Neuroscience Institute, Center for the Neural Basis of Cognition, Carnegie Mellon University, 5000 Forbes Avenue, Pittsburgh, PA 15213, USA.
| | - William R Stauffer
- Department of Neurobiology, Systems Neuroscience Center, Brain Institute, Center for Neuroscience, Center for the Neural Basis of Cognition, University of Pittsburgh, 3501 Fifth Avenue, Pittsburgh, PA 15213, USA.
| |
|
27
|
Abstract
We confirm that rats can act as rational economic agents, making choices about how much work to do to obtain a reward in a way that optimally trades off the value of the reward against the cost of the effort. Contrary to the notion that bigger rewards are more motivating, rats worked harder in economies where rewards were small, ensuring a sufficient minimum income of water. But they chose to earn and consume more water per day when water was “cheap” (available for little work). We present a mathematical model explaining why rats work when they do (surprisingly, not just when they are thirsty) and suggesting where in the brain animals might compute the current value of working for water. In the laboratory, animals’ motivation to work tends to be positively correlated with reward magnitude. But in nature, rewards earned by work are essential to survival (e.g., working to find water), and the payoff of that work can vary on long timescales (e.g., seasonally). Under these constraints, the strategy of working less when rewards are small could be fatal. We found that instead, rats in a closed economy did more work for water rewards when the rewards were stably smaller, a phenomenon also observed in human labor supply curves. Like human consumers, rats showed elasticity of demand, consuming far more water per day when its price in effort was lower. The neural mechanisms underlying such “rational” market behaviors remain largely unexplored. We propose a dynamic utility maximization model that can account for the dependence of rat labor supply (trials/day) on the wage rate (milliliter/trial) and also predict the temporal dynamics of when rats work. Based on data from mice, we hypothesize that glutamatergic neurons in the subfornical organ in lamina terminalis continuously compute the instantaneous marginal utility of voluntary work for water reward and causally determine the amount and timing of work.
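The labor-supply logic of this abstract can be illustrated with a toy dynamic-utility model. The saturating utility, effort cost, and parameter values below are our own assumptions, not the authors' fitted model; the sketch only shows how a satiation point makes the simulated rat work more trials when the wage (ml/trial) is low, while still consuming more water per day when water is cheap.

```python
import math

def optimal_trials(wage_ml_per_trial, effort_cost=0.001, target_ml=20.0,
                   max_trials=5000):
    """Trials/day maximizing a saturating utility of water minus effort cost."""
    def net_utility(n):
        water = wage_ml_per_trial * n
        # Marginal value of water falls as daily intake nears satiation.
        return target_ml * (1.0 - math.exp(-water / target_ml)) - effort_cost * n
    return max(range(1, max_trials + 1), key=net_utility)

# A "poor" economy (small rewards per trial) versus a "rich" one.
n_poor = optimal_trials(wage_ml_per_trial=0.02)
n_rich = optimal_trials(wage_ml_per_trial=0.08)
water_poor = 0.02 * n_poor   # total daily intake, ml
water_rich = 0.08 * n_rich
```

In this sketch more trials are worked in the poor economy (income maintenance) even though less water is consumed there per day (elastic demand), matching the qualitative pattern described above.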
|
28
|
Báez-Mendoza R, Vázquez Y, Mastrobattista EP, Williams ZM. Neuronal Circuits for Social Decision-Making and Their Clinical Implications. Front Neurosci 2021; 15:720294. [PMID: 34658766 PMCID: PMC8517320 DOI: 10.3389/fnins.2021.720294] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2021] [Accepted: 09/09/2021] [Indexed: 11/13/2022] Open
Abstract
Social living facilitates individual access to rewards, cognitive resources, and objects that would not otherwise be accessible. There are, however, some drawbacks to social living, particularly when competing for scarce resources. Furthermore, variability in our ability to make social decisions can be associated with neuropsychiatric disorders. The neuronal mechanisms underlying social decision-making are beginning to be understood. Momentum to study this phenomenon has been partly carried over from the study of economic decision-making. Yet, because of the similarities between these types of decision-making, it remains unclear what constitutes a social decision. Here, we propose a definition of social decision-making as choices made in a context where one or more conspecifics are involved in the decision or its consequences. Social decisions can be conceptualized as complex economic decisions because they are based on subjective preferences between different goods. During social decisions, individuals choose based on their internal value estimates of the different alternatives. These are complex decisions, given that conspecifics' beliefs or actions can modify the subject's internal valuations at every choice. Here, we first review recent developments in our collective understanding of the neuronal mechanisms and circuits of social decision-making in primates. We then review literature characterizing populations with neuropsychiatric disorders that show deficits in social decision-making and the neuronal circuitry associated with these deficits.
Affiliation(s)
- Raymundo Báez-Mendoza
- Department of Neurosurgery, Massachusetts General Hospital and Harvard Medical School, Boston, MA, United States
| | - Yuriria Vázquez
- Laboratory of Neural Systems, The Rockefeller University, New York, NY, United States
| | - Emma P. Mastrobattista
- Department of Neurosurgery, Massachusetts General Hospital and Harvard Medical School, Boston, MA, United States
| | - Ziv M. Williams
- Department of Neurosurgery, Massachusetts General Hospital and Harvard Medical School, Boston, MA, United States
| |
|
29
|
Tanaka S, Taylor JE, Sakagami M. The effect of effort on reward prediction error signals in midbrain dopamine neurons. Curr Opin Behav Sci 2021. [DOI: 10.1016/j.cobeha.2021.07.004] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
|
30
|
Schultz W, Stauffer WR, Lak A, Pastor-Bernier A. Smarter than humans: rationality reflected in primate neuronal reward signals. Curr Opin Behav Sci 2021. [DOI: 10.1016/j.cobeha.2021.03.021] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
31
|
Bujold PM, Ferrari-Toniolo S, Schultz W. Adaptation of utility functions to reward distribution in rhesus monkeys. Cognition 2021; 214:104764. [PMID: 34000666 PMCID: PMC8346953 DOI: 10.1016/j.cognition.2021.104764] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2020] [Revised: 04/26/2021] [Accepted: 05/04/2021] [Indexed: 10/25/2022]
Abstract
This study investigated how the experience of different reward distributions shapes the utility functions that can be inferred from economic choice. Despite the generally accepted notion that utility functions are sensitive to external references, exactly how such changes take place remains largely unknown. Here we benefitted from the capacity to engage in thorough and prolonged empirical tests of economic choice by one of our evolutionary cousins, the rhesus macaque. We analyzed data from thousands of binary choices and found that the animals' preferences changed depending on the statistics of rewards experienced in the past (up to weeks earlier) and that these changes could reflect monkeys adapting their expectations of reward. The utility functions we elicited from their choices stretched and shifted over several months of sequential changes in the mean and range of rewards that the macaques experienced. However, this adaptation was usually incomplete, suggesting that, even after months, past experiences held weight when monkeys assigned value to future rewards. Rather than displaying the stable, fixed preferences assumed by normative economic models, rhesus macaques flexibly shape their preferences around the past and present statistics of their environment. That is, rather than relying on a single reference point, reference-dependent preferences likely capture a monkey's range of expectations.
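The incomplete adaptation described here can be pictured as a reference point that tracks experienced rewards as an exponential moving average with a small learning rate; the rate and reward values below are illustrative assumptions rather than fitted quantities.

```python
def update_reference(reference, reward, rate=0.05):
    """Shift the reference point a small step toward each experienced reward."""
    return reference + rate * (reward - reference)

ref = 0.2            # expectation (ml) carried over from an earlier block
for _ in range(60):
    ref = update_reference(ref, reward=0.5)   # new block of larger rewards
# After 60 trials the reference has moved most, but not all, of the way
# to 0.5 ml, so past experience still weighs on how rewards are valued.
```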
Affiliation(s)
- Philipe M Bujold
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge CB2 3DY, United Kingdom.
| | - Simone Ferrari-Toniolo
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge CB2 3DY, United Kingdom
| | - Wolfram Schultz
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge CB2 3DY, United Kingdom.
| |
|
32
|
Pastor-Bernier A, Stasiak A, Schultz W. Reward-specific satiety affects subjective value signals in orbitofrontal cortex during multicomponent economic choice. Proc Natl Acad Sci U S A 2021; 118:e2022650118. [PMID: 34285071 PMCID: PMC8325167 DOI: 10.1073/pnas.2022650118] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/04/2022] Open
Abstract
Sensitivity to satiety constitutes a basic requirement for neuronal coding of subjective reward value. Satiety from natural ongoing consumption affects reward functions in learning and approach behavior. More specifically, satiety reduces the subjective economic value of individual rewards during choice between options that typically contain multiple reward components. The unconfounded assessment of economic reward value requires tests at choice indifference between two options, which is difficult to achieve with sated rewards. By conceptualizing choices between options with multiple reward components ("bundles"), Revealed Preference Theory may offer a solution. Despite satiety, choices against an unaltered reference bundle may remain indifferent when the reduced value of a sated bundle reward is compensated by larger amounts of an unsated reward of the same bundle, and then the value loss of the sated reward is indicated by the amount of the added unsated reward. Here, we show psychophysically titrated choice indifference in monkeys between bundles of differently sated rewards. Neuronal chosen value signals in the orbitofrontal cortex (OFC) followed closely the subjective value change within recording periods of individual neurons. A neuronal classifier distinguishing the bundles and predicting choice substantiated the subjective value change. The choice between conventional single rewards confirmed the neuronal changes seen with two-reward bundles. Thus, reward-specific satiety reduces subjective reward value signals in OFC. With satiety being an important factor of subjective reward value, these results extend the notion of subjective economic reward value coding in OFC neurons.
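The compensation logic, reading a sated reward's value loss off the amount of unsated reward needed to restore bundle indifference, can be sketched numerically. The square-root utilities and the multiplicative satiety factor are hypothetical choices for illustration, not the quantities estimated in the study.

```python
def utility_a(ml, satiety=0.0):
    """Utility of bundle component A; satiety scales its value down."""
    return (1.0 - satiety) * ml ** 0.5

def utility_b(ml):
    """Utility of the unsated bundle component B."""
    return ml ** 0.5

def compensation(a_ml, b_ml, satiety, step=1e-4):
    """Extra amount of B restoring indifference with the pre-satiety bundle."""
    target = utility_a(a_ml) + utility_b(b_ml)   # unaltered reference bundle
    extra = 0.0
    while utility_a(a_ml, satiety) + utility_b(b_ml + extra) < target:
        extra += step
    return extra

# The added amount of B measures the value lost by the sated reward A.
value_loss_in_b = compensation(a_ml=0.4, b_ml=0.2, satiety=0.5)
```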
Affiliation(s)
- Alexandre Pastor-Bernier
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge CB2 3DY, United Kingdom
| | - Arkadiusz Stasiak
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge CB2 3DY, United Kingdom
| | - Wolfram Schultz
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge CB2 3DY, United Kingdom
| |
|
33
|
Wang G, Li J, Zhu C, Wang S, Jiang S. How Do Reference Points Influence the Representation of the N200 for Consumer Preference? Front Psychol 2021; 12:645775. [PMID: 34248744 PMCID: PMC8266263 DOI: 10.3389/fpsyg.2021.645775] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2021] [Accepted: 05/10/2021] [Indexed: 11/20/2022] Open
Abstract
Recent studies have suggested that event-related brain potentials (ERPs) can represent consumer preference, and there is consensus that the N200 is the best indicator of consumer preference. Measurement of reference-dependent consumer preference, in turn, requires a reference point, but it remains largely unknown how reference points modulate the preference-related N200. We designed an experiment to investigate how reference points affect the N200 based on classical paradigms. In the single-reference condition, one product was displayed in each trial; in the conjoined-reference condition, a pair of products was displayed simultaneously. Our results showed that in the single-reference condition, low-preference products elicited a more negative N200 than high-preference products, replicating previous results, but the N200 could not distinguish between low- and high-preference products when participants viewed two options of similar subjective value in the conjoined-reference condition. These findings suggest that reference points modulate how the N200 represents consumer preference. When viewing only one product, participants make a value judgment based on their expectations. However, when viewing two products simultaneously, both their expectations and the alternative product can serve as reference points, and whether the N200 can represent consumer preference depends on which reference point is dominant. In future research, reference points must be controlled when the N200 is used to explore value-related decision-making.
Affiliation(s)
- Guangrong Wang
- Neural Decision Science Laboratory, School of Economics and Management, Weifang University, Weifang, China.,Institute for Study of Brain-Like Economics, School of Economics, Shandong University, Jinan, China
| | - Jianbiao Li
- Institute for Study of Brain-Like Economics, School of Economics, Shandong University, Jinan, China.,Department of Economics and Management, Nankai University Binhai College, Tianjin, China
| | - Chengkang Zhu
- Institute for Study of Brain-Like Economics, School of Economics, Shandong University, Jinan, China
| | - Shenru Wang
- School of Mechanical Engineering and Automation, Beihang University, Beijing, China
| | - Shenzhou Jiang
- School of Business Administration, Guangxi University of Finance and Economics, Nanning, China
| |
|
34
|
Oleson EB, Hamilton LR, Gomez DM. Cannabinoid Modulation of Dopamine Release During Motivation, Periodic Reinforcement, Exploratory Behavior, Habit Formation, and Attention. Front Synaptic Neurosci 2021; 13:660218. [PMID: 34177546 PMCID: PMC8222827 DOI: 10.3389/fnsyn.2021.660218] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2021] [Accepted: 05/05/2021] [Indexed: 12/12/2022] Open
Abstract
Motivational and attentional processes energize action sequences to facilitate evolutionary competition and promote behavioral fitness. Decades of neuropharmacology, electrophysiology, and electrochemistry research indicate that the mesocorticolimbic DA pathway modulates both motivation and attention. More recently, it was realized that mesocorticolimbic DA function is tightly regulated by the brain's endocannabinoid system and greatly influenced by exogenous cannabinoids, which have been harnessed by humanity for medicinal, ritualistic, and recreational uses for 12,000 years. Exogenous cannabinoids, like the primary psychoactive component of cannabis, delta-9-tetrahydrocannabinol, produce their effects by acting at binding sites for naturally occurring endocannabinoids. The brain's endocannabinoid system consists of two G-protein-coupled receptors, endogenous lipid ligands for these receptor targets, and several synthetic and metabolic enzymes involved in their production and degradation. Emerging evidence indicates that the endocannabinoid 2-arachidonoylglycerol is necessary to observe concurrent increases in DA release and motivated behavior, and the historical pharmacology literature indicates a role for cannabinoid signaling in both motivational and attentional processes. While both types of behaviors have been scrutinized under manipulation by either DA or cannabinoid agents, there is considerably less insight into prospective interactions between these two important signaling systems. This review summarizes the relevance of cannabinoid modulation of DA release during operant tasks designed to investigate either motivational or attentional control of behavior. We first describe how cannabinoids influence DA release and goal-directed action under a variety of reinforcement contingencies. Then we consider the role that endocannabinoids might play in switching an animal's motivation from a goal-directed action to the search for an alternative outcome, as well as in the formation of long-term habits. Finally, dissociable features of attentional behavior, measured with the 5-choice serial reaction time task and the attentional set-shifting task, are discussed along with their distinct modulation by DA and cannabinoids. We end by discussing potential targets for further research on DA-cannabinoid interactions within key substrates involved in motivation and attention.
Affiliation(s)
- Erik B. Oleson
- Department of Psychology, University of Colorado Denver, Denver, CO, United States
| | - Lindsey R. Hamilton
- Department of Psychology, University of Colorado Denver, Denver, CO, United States
| | - Devan M. Gomez
- Department of Biomedical Sciences, Marquette University, Milwaukee, WI, United States
| |
|
35
|
Ghazizadeh A, Hikosaka O. Common coding of expected value and value uncertainty memories in the prefrontal cortex and basal ganglia output. SCIENCE ADVANCES 2021; 7:eabe0693. [PMID: 33980480 PMCID: PMC8115923 DOI: 10.1126/sciadv.abe0693] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/08/2020] [Accepted: 03/23/2021] [Indexed: 05/12/2023]
Abstract
Recent evidence implicates both the basal ganglia and the ventrolateral prefrontal cortex (vlPFC) in encoding value memories. However, the comparative roles of cortical and basal ganglia nodes in value memory are not well understood. Here, single-unit recordings in vlPFC and substantia nigra pars reticulata (SNr) in macaque monkeys revealed a larger value signal in SNr that was nevertheless correlated with, and had a comparable onset to, the vlPFC value signal. The value signal was maintained for many objects (>90) many weeks after reward learning and was resistant to extinction in both regions and to repetition suppression in vlPFC. Both regions showed comparable granularity in encoding expected value and value uncertainty, which was paralleled by enhanced gaze bias during free viewing. The value signal dynamics in SNr could be predicted by combining the responses of vlPFC neurons according to their value preferences, consistent with a scheme in which cortical signals reach SNr via direct and indirect pathways.
Affiliation(s)
- Ali Ghazizadeh
- Bio-intelligence Research Unit, Electrical Engineering Department, Sharif University of Technology, Tehran 11365-11155, Iran.
- School of Cognitive Sciences, Institute for Research in Fundamental Sciences, Tehran 19395-5746, Iran
| | - Okihide Hikosaka
- Laboratory of Sensorimotor Research, National Eye Institute, NIH, Bethesda, MD 20892, USA
- National Institute on Drug Abuse, NIH, Baltimore, MD 21224, USA
| |
|
36
|
Morville T, Madsen KH, Siebner HR, Hulme OJ. Reward signalling in brainstem nuclei under fluctuating blood glucose. PLoS One 2021; 16:e0243899. [PMID: 33826633 PMCID: PMC8026025 DOI: 10.1371/journal.pone.0243899] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2020] [Accepted: 12/01/2020] [Indexed: 11/18/2022] Open
Abstract
Phasic dopamine release from midbrain dopaminergic neurons is thought to signal reward prediction errors (RPEs). If reward maximisation is to maintain homeostasis, then the value of primary rewards should be coupled to the homeostatic errors they remediate. This leads to the prediction that RPE signals should be configured as a function of homeostatic state and thus diminish as homeostatic error is attenuated. To test this hypothesis, we collected a large volume of functional MRI data from five human volunteers on four separate days. After fasting for 12 hours, subjects consumed preloads that differed in glucose concentration. Participants then underwent a Pavlovian cue-conditioning paradigm in which the colour of a fixation cross was stochastically associated with the delivery of water or glucose via a gustometer. This design afforded computation of RPEs separately for better- and worse-than-expected outcomes during ascending and descending trajectories of serum glucose fluctuations. In the parabrachial nuclei, regional activity coding positive RPEs scaled positively with serum glucose for both ascending and descending glucose levels. The ventral tegmental area and substantia nigra became more sensitive to negative RPEs when glucose levels were ascending. Together, the results suggest that RPE signals in key brainstem structures are modulated by homeostatic trajectories of naturally occurring glycaemic flux, revealing a tight interplay between homeostatic state and the neural encoding of primary reward in the human brain.
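The paper's opening prediction, that a primary reward's value should be coupled to the homeostatic error it remediates, can be stated in a few lines. The glucose set point and numbers below are arbitrary illustrative values, not quantities from the study.

```python
def homeostatic_error(glucose, set_point=90.0):
    """Deficit relative to the set point (zero when at or above it)."""
    return max(set_point - glucose, 0.0)

def reward_value(glucose_delivered, glucose_state, set_point=90.0):
    """Value = how much the reward reduces the homeostatic error."""
    before = homeostatic_error(glucose_state, set_point)
    after = homeostatic_error(glucose_state + glucose_delivered, set_point)
    return before - after

def rpe(value, expected=0.0):
    """Reward prediction error for an unexpected delivery."""
    return value - expected

# The same 10-unit glucose reward is worth more when fasted (glucose 70)
# than near the set point (glucose 88), so its RPE is larger too.
v_fasted = reward_value(10.0, 70.0)
v_sated = reward_value(10.0, 88.0)
```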
Affiliation(s)
- Tobias Morville
- Danish Research Centre for Magnetic Resonance, Centre for Functional and Diagnostic Imaging and Research, Copenhagen University Hospital Hvidovre, Hvidovre, Denmark
| | - Kristoffer H. Madsen
- DTU Compute, Department of Informatics and Mathematical Modelling, Technical University of Denmark, Copenhagen, Denmark
| | - Hartwig R. Siebner
- Danish Research Centre for Magnetic Resonance, Centre for Functional and Diagnostic Imaging and Research, Copenhagen University Hospital Hvidovre, Hvidovre, Denmark
- Department of Neurology, Copenhagen University Hospital Bispebjerg, Copenhagen, Denmark
| | - Oliver J. Hulme
- Danish Research Centre for Magnetic Resonance, Centre for Functional and Diagnostic Imaging and Research, Copenhagen University Hospital Hvidovre, Hvidovre, Denmark
| |
|
37
|
Rothenhoefer KM, Hong T, Alikaya A, Stauffer WR. Rare rewards amplify dopamine responses. Nat Neurosci 2021; 24:465-469. [PMID: 33686298 PMCID: PMC9373731 DOI: 10.1038/s41593-021-00807-7] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2019] [Accepted: 01/20/2021] [Indexed: 01/02/2023]
Abstract
Dopamine prediction error responses are essential components of universal learning mechanisms. However, it is unknown whether individual dopamine neurons reflect the shape of reward distributions. Here, we used symmetrical distributions with differently weighted tails to investigate how the frequency of rewards and reward prediction errors influence dopamine signals. Rare rewards amplified dopamine responses, even when conventional prediction errors were identical, indicating a mechanism for learning the complexities of real-world incentives.
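A minimal sketch of the contrast the study exploits: two outcomes can carry identical conventional prediction errors while differing in frequency. The inverse-probability gain below is one illustrative way a response could scale with rarity, not the mechanism established by the paper.

```python
def conventional_rpe(reward, expected_value):
    """Standard prediction error: outcome minus expectation."""
    return reward - expected_value

def rarity_amplified_rpe(reward, expected_value, outcome_probability):
    # Scale the error by how infrequent (surprising) the outcome is.
    return (reward - expected_value) / outcome_probability

ev = 0.5
# The same reward yields the same conventional RPE in both distributions...
delta = conventional_rpe(1.0, ev)
# ...but the amplified response is larger when that outcome sits in a
# lightly weighted tail (p = 0.1) than when it is frequent (p = 0.4).
amp_rare = rarity_amplified_rpe(1.0, ev, 0.1)
amp_frequent = rarity_amplified_rpe(1.0, ev, 0.4)
```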
Affiliation(s)
- Kathryn M Rothenhoefer
- Center for Neuroscience, University of Pittsburgh, Pittsburgh, PA, USA
- Center for the Neural Basis of Cognition, University of Pittsburgh, Pittsburgh, PA, USA
- Systems Neuroscience Center, University of Pittsburgh, Pittsburgh, PA, USA
- The Brain Institute, University of Pittsburgh, Pittsburgh, PA, USA
| | - Tao Hong
- Center for the Neural Basis of Cognition, University of Pittsburgh, Pittsburgh, PA, USA
- Systems Neuroscience Center, University of Pittsburgh, Pittsburgh, PA, USA
- The Brain Institute, University of Pittsburgh, Pittsburgh, PA, USA
- Program in Neural Computation, Neuroscience Institute, Carnegie Mellon University, Pittsburgh, PA, USA
| | - Aydin Alikaya
- Center for Neuroscience, University of Pittsburgh, Pittsburgh, PA, USA
- Center for the Neural Basis of Cognition, University of Pittsburgh, Pittsburgh, PA, USA
- Systems Neuroscience Center, University of Pittsburgh, Pittsburgh, PA, USA
- The Brain Institute, University of Pittsburgh, Pittsburgh, PA, USA
| | - William R Stauffer
- Center for Neuroscience, University of Pittsburgh, Pittsburgh, PA, USA.
- Center for the Neural Basis of Cognition, University of Pittsburgh, Pittsburgh, PA, USA.
- Systems Neuroscience Center, University of Pittsburgh, Pittsburgh, PA, USA.
- The Brain Institute, University of Pittsburgh, Pittsburgh, PA, USA.
| |
|
38
|
Ferrari-Toniolo S, Bujold PM, Grabenhorst F, Báez-Mendoza R, Schultz W. Nonhuman Primates Satisfy Utility Maximization in Compliance with the Continuity Axiom of Expected Utility Theory. J Neurosci 2021; 41:2964-2979. [PMID: 33542082 PMCID: PMC8018892 DOI: 10.1523/jneurosci.0955-20.2020] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2020] [Revised: 11/13/2020] [Accepted: 11/19/2020] [Indexed: 11/21/2022] Open
Abstract
Expected Utility Theory (EUT), the first axiomatic theory of risky choice, describes choices as a utility-maximization process: decision makers assign a subjective value (utility) to each choice option and choose the one with the highest utility. The continuity axiom, central to EUT and its modifications, is a necessary and sufficient condition for the definition of numerical utilities. The axiom requires decision makers to be indifferent between a gamble and a specific probabilistic combination of a more preferred and a less preferred gamble. While previous studies demonstrated that monkeys choose according to combinations of objective reward magnitude and probability, a concept-driven experimental approach for assessing the axiomatically defined conditions for utility maximization by animals has been missing. We experimentally tested the continuity axiom for a broad class of gamble types in four male rhesus macaque monkeys, showing that their choice behavior complied with the existence of a numerical utility measure as defined by economic theory. We used the numerical quantity specified in the continuity axiom to characterize subjective preferences in a magnitude-probability space. This mapping highlighted a trade-off between reward magnitudes and probabilities, compatible with the existence of a utility function underlying subjective value computation. These results support the existence of a numerical utility function able to describe choices, allowing for investigation of the neuronal substrates responsible for coding such a rigorously defined quantity. SIGNIFICANCE STATEMENT: A common assumption of several economic choice theories is that decisions result from the comparison of subjectively assigned values (utilities). This study demonstrated the compliance of monkey behavior with the continuity axiom of Expected Utility Theory, implying a subjective magnitude-probability trade-off that supports the existence of numerical utility directly linked to the theoretical economic framework. We determined a numerical utility measure able to describe choices, which can serve as a correlate for neuronal activity in the quest for the brain structures and mechanisms that guide decisions.
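The indifference construction at the heart of the continuity axiom can be written out directly. The power-law utility and reward magnitudes below are illustrative assumptions; the point is only that, for gambles A > B > C, the indifference probability itself serves as the middle gamble's utility once u(A) and u(C) are anchored at 1 and 0.

```python
def utility(ml, rho=0.6):
    """Hypothetical concave utility over reward magnitude."""
    return ml ** rho

def indifference_p(a_ml, b_ml, c_ml):
    """p such that the agent is indifferent between B and p*A + (1-p)*C."""
    # Expected utility of the mixture: p*u(A) + (1-p)*u(C) = u(B)
    return (utility(b_ml) - utility(c_ml)) / (utility(a_ml) - utility(c_ml))

# On a scale where u(C)=0 and u(A)=1, u(B) equals the indifference point.
p = indifference_p(a_ml=1.0, b_ml=0.5, c_ml=0.1)
```

With a concave utility, p exceeds the value a risk-neutral chooser would give, which is how the indifference point reveals the curvature of the underlying utility function.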
Affiliation(s)
- Simone Ferrari-Toniolo
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge, CB2 3DY, United Kingdom
| | - Philipe M Bujold
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge, CB2 3DY, United Kingdom
| | - Fabian Grabenhorst
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge, CB2 3DY, United Kingdom
| | - Raymundo Báez-Mendoza
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge, CB2 3DY, United Kingdom
- Department of Neurosurgery, Massachusetts General Hospital, Harvard Medical School, Boston, Massachusetts 02114
| | - Wolfram Schultz
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge, CB2 3DY, United Kingdom
| |
|
39
|
Fung BJ, Sutlief E, Hussain Shuler MG. Dopamine and the interdependency of time perception and reward. Neurosci Biobehav Rev 2021; 125:380-391. [PMID: 33652021 DOI: 10.1016/j.neubiorev.2021.02.030] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2020] [Revised: 02/16/2021] [Accepted: 02/19/2021] [Indexed: 01/14/2023]
Abstract
Time is a fundamental dimension of our perception of the world and is therefore of critical importance to the organization of human behavior. A corpus of work, including recent optogenetic evidence, implicates striatal dopamine as a crucial factor influencing the perception of time. Another stream of literature implicates dopamine in reward and motivation processes. However, these two domains of research have remained largely separate, despite their neurobiological overlap and the apothegmatic notion that "time flies when you're having fun". This article reviews the literature linking time perception and reward, including neurobiological and behavioral studies. Together, these provide compelling support for the idea that time perception and reward processing interact via a common dopaminergic mechanism.
Affiliation(s)
- Bowen J Fung
- The Behavioural Insights Team, Suite 3, Level 13/9 Hunter St, Sydney NSW 2000, Australia.
| | - Elissa Sutlief
- The Solomon H. Snyder Department of Neuroscience, The Johns Hopkins University School of Medicine, Woods Basic Science Building Rm914, 725 N. Wolfe Street, Baltimore, MD 21205, USA
| | - Marshall G Hussain Shuler
- The Solomon H. Snyder Department of Neuroscience, The Johns Hopkins University School of Medicine, Woods Basic Science Building Rm914, 725 N. Wolfe Street, Baltimore, MD 21205, USA; Kavli Neuroscience Discovery Institute, The Johns Hopkins University School of Medicine, 725 N Wolfe Street, Baltimore, MD 21205, USA.
| |
|
40
|
Hesp C, Smith R, Parr T, Allen M, Friston KJ, Ramstead MJD. Deeply Felt Affect: The Emergence of Valence in Deep Active Inference. Neural Comput 2021; 33:398-446. [PMID: 33253028 PMCID: PMC8594962 DOI: 10.1162/neco_a_01341] [Citation(s) in RCA: 74] [Impact Index Per Article: 24.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2020] [Accepted: 08/17/2020] [Indexed: 01/20/2023]
Abstract
The positive-negative axis of emotional valence has long been recognized as fundamental to adaptive behavior, but its origin and underlying function have largely eluded formal theorizing and computational modeling. Using deep active inference, a hierarchical inference scheme that rests on inverting a model of how sensory data are generated, we develop a principled Bayesian model of emotional valence. This formulation asserts that agents infer their valence state based on the expected precision of their action model, an internal estimate of overall model fitness ("subjective fitness"). This index of subjective fitness can be estimated within any environment and exploits the domain generality of second-order beliefs (beliefs about beliefs). We show how maintaining internal valence representations allows the ensuing affective agent to optimize confidence in action selection preemptively. Valence representations can in turn be optimized by leveraging the (Bayes-optimal) updating term for subjective fitness, which we label affective charge (AC). AC tracks changes in fitness estimates and lends a sign to otherwise unsigned divergences between predictions and outcomes. We simulate the resulting affective inference by subjecting an in silico affective agent to a T-maze paradigm requiring context learning, followed by context reversal. This formulation of affective inference offers a principled account of the link between affect, (mental) action, and implicit metacognition. It characterizes how a deep biological system can infer its affective state and reduce uncertainty about such inferences through internal action (i.e., top-down modulation of priors that underwrite confidence). Thus, we demonstrate the potential of active inference to provide a formal and computationally tractable account of affect. Our demonstration of the face validity and potential utility of this formulation represents the first step within a larger research program. Next, this model can be leveraged to test the hypothesized role of valence by fitting the model to behavioral and neuronal responses.
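The affective-charge update described in the abstract can be sketched in a few lines: AC is the shift in policy beliefs (prior to posterior), projected onto negative expected free energy. The sketch below is a hypothetical toy, not the authors' implementation; the softmax policy prior, the evidence vector, and the scalar precision `gamma` are illustrative assumptions.

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

def affective_charge(G, evidence, gamma):
    """Toy sketch of affective charge (AC): the change in policy beliefs,
    projected onto negative expected free energy (-G). AC is positive when
    outcomes favour policies the agent already expected to be good
    (confidence rises), and negative otherwise."""
    prior = softmax(-gamma * G)                 # policy beliefs before outcomes
    posterior = softmax(-gamma * G + evidence)  # policy beliefs after outcomes
    return (posterior - prior) @ (-G)

G = np.array([1.0, 3.0])  # expected free energy per policy (lower is better)
# Evidence confirming the preferred policy yields AC > 0 ...
assert affective_charge(G, np.array([2.0, 0.0]), gamma=1.0) > 0
# ... and disconfirming evidence yields AC < 0.
assert affective_charge(G, np.array([0.0, 2.0]), gamma=1.0) < 0
```

Lending a sign to otherwise unsigned prediction-outcome divergences in this way is what lets AC serve as a candidate valence signal.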
Collapse
Affiliation(s)
- Casper Hesp
- Department of Psychology and Amsterdam Brain and Cognition Centre, University of Amsterdam, 1098 XH Amsterdam, Netherlands; Institute for Advanced Study, University of Amsterdam, 1012 GC Amsterdam, Netherlands; and Wellcome Centre for Human Neuroimaging, University College London, London WC1N 3BG, U.K.
| | - Ryan Smith
- Laureate Institute for Brain Research, Tulsa, OK 74136, U.S.A.
| | - Thomas Parr
- Wellcome Centre for Human Neuroimaging, University College London, London WC1N 3BG, U.K.
| | - Micah Allen
- Aarhus Institute of Advanced Studies, Aarhus University, Aarhus 8000, Denmark; Centre of Functionally Integrative Neuroscience, Aarhus University Hospital, Aarhus 8200, Denmark; and Cambridge Psychiatry, Cambridge University, Cambridge CB2 8AH, U.K.
| | - Karl J Friston
- Wellcome Centre for Human Neuroimaging, University College London, London WC1N 3BG, U.K.
| | - Maxwell J D Ramstead
- Wellcome Centre for Human Neuroimaging, University College London, London WC1N 3BG, U.K.; Division of Social and Transcultural Psychiatry, Department of Psychiatry and Culture, Mind, and Brain Program, McGill University, Montreal H3A 0G4, QC, Canada
| |
Collapse
|
41
|
Verstynen T, Dunovan K, Walsh C, Kuan CH, Manuck SB, Gianaros PJ. Adiposity covaries with signatures of asymmetric feedback learning during adaptive decisions. Soc Cogn Affect Neurosci 2020; 15:1145-1156. [PMID: 32608485 PMCID: PMC7657458 DOI: 10.1093/scan/nsaa088] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2019] [Revised: 06/03/2020] [Accepted: 06/15/2020] [Indexed: 12/19/2022] Open
Abstract
Unhealthy weight gain relates, in part, to how people make decisions based on prior experience. Here we conducted post hoc analysis on an archival data set to evaluate whether individual differences in adiposity, an anthropometric construct encompassing a spectrum of body types, from lean to obese, associate with signatures of asymmetric feedback learning during value-based decision-making. In a sample of neurologically healthy adults (N = 433), ventral striatal responses to rewards, measured using fMRI, were not directly associated with adiposity, but rather moderated its relationship with feedback-driven learning in the Iowa gambling task, tested outside the scanner. Using a biologically inspired model of basal ganglia-dependent decision processes, we found this moderating effect of reward reactivity to be explained by an asymmetrical use of feedback to drive learning; that is, with more plasticity for gains than for losses, stronger reward reactivity leads to decisions that minimize exploration for maximizing long-term outcomes. Follow-up analysis confirmed that individual differences in adiposity correlated with signatures of asymmetric use of feedback cues during learning, suggesting that reward reactivity may especially relate to adiposity, and possibly obesity risk, when gains impact future decisions more than losses.
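The "asymmetric use of feedback" at the core of this result can be illustrated with a minimal delta-rule learner that applies separate learning rates to gains and losses. This is a generic sketch under assumed parameters, not the biologically inspired basal-ganglia model the authors fit.

```python
import random

def asymmetric_update(q, outcome, alpha_gain, alpha_loss):
    """One prediction-error update with separate learning rates for
    positive and negative errors (toy sketch of asymmetric plasticity)."""
    pe = outcome - q
    alpha = alpha_gain if pe > 0 else alpha_loss
    return q + alpha * pe

# With more plasticity for gains than losses, the learned value of a
# zero-mean gamble settles above its true mean: an optimistic estimate
# that discourages further exploration.
random.seed(0)
q, history = 0.0, []
for _ in range(2000):
    outcome = random.choice([+1.0, -1.0])  # 50/50 win/loss, true mean 0
    q = asymmetric_update(q, outcome, alpha_gain=0.2, alpha_loss=0.05)
    history.append(q)
avg = sum(history[1000:]) / 1000
assert avg > 0.3  # long-run estimate sits well above the true mean of 0
```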
Collapse
Affiliation(s)
- Timothy Verstynen
- Department of Psychology, Carnegie Mellon University, Pittsburgh, PA 15213, USA; Carnegie Mellon Neuroscience Institute, Carnegie Mellon University, Pittsburgh, PA 15213, USA; Center for the Neural Basis of Cognition, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Kyle Dunovan
- Department of Psychology, Carnegie Mellon University, Pittsburgh, PA 15213, USA; Center for the Neural Basis of Cognition, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Catherine Walsh
- Department of Psychology, University of Pittsburgh, Pittsburgh, PA 15260, USA; Center for the Neural Basis of Cognition, University of Pittsburgh, Pittsburgh, PA 15260, USA
| | - Chieh-Hsin Kuan
- Department of Psychology, University of Pittsburgh, Pittsburgh, PA 15260, USA; Center for the Neural Basis of Cognition, University of Pittsburgh, Pittsburgh, PA 15260, USA
| | - Stephen B Manuck
- Department of Psychology, University of Pittsburgh, Pittsburgh, PA 15260, USA
| | - Peter J Gianaros
- Department of Psychology, University of Pittsburgh, Pittsburgh, PA 15260, USA; Center for the Neural Basis of Cognition, University of Pittsburgh, Pittsburgh, PA 15260, USA
| |
Collapse
|
42
|
Mendoza JA, Lafferty CK, Yang AK, Britt JP. Cue-Evoked Dopamine Neuron Activity Helps Maintain but Does Not Encode Expected Value. Cell Rep 2020; 29:1429-1437.e3. [PMID: 31693885 DOI: 10.1016/j.celrep.2019.09.077] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2019] [Revised: 08/21/2019] [Accepted: 09/26/2019] [Indexed: 11/16/2022] Open
Abstract
Cue-evoked midbrain dopamine (DA) neuron activity reflects expected value, but its influence on reward assessment is unclear. In mice performing a trial-based operant task, we test if bidirectional manipulations of cue or operant-associated DA neuron activity drive learning as a result of under- or overexpectation of reward value. We target optogenetic manipulations to different components of forced trials, when only one lever is presented, and assess lever biases on choice trials in the absence of photomanipulation. Although lever biases are demonstrated to be flexible and sensitive to changes in expected value, augmentation of cue or operant-associated DA signaling does not significantly alter choice behavior, and blunting DA signaling during any component of the forced trials reduces choice trial responses on the associated lever. These data suggest cue-evoked DA helps maintain cue-value associations but does not encode expected value so as to set the benchmark against which received reward is judged.
Collapse
Affiliation(s)
- Jesse A Mendoza
- Department of Psychology, McGill University, Montreal, QC H3A 1B1, Canada; Center for Studies in Behavioral Neurobiology, Concordia University, Montreal, QC H4B 1R6, Canada
| | - Christopher K Lafferty
- Department of Psychology, McGill University, Montreal, QC H3A 1B1, Canada; Center for Studies in Behavioral Neurobiology, Concordia University, Montreal, QC H4B 1R6, Canada
| | - Angela K Yang
- Integrated Program in Neuroscience, McGill University, Montreal, QC H3A 2B4, Canada; Center for Studies in Behavioral Neurobiology, Concordia University, Montreal, QC H4B 1R6, Canada
| | - Jonathan P Britt
- Department of Psychology, McGill University, Montreal, QC H3A 1B1, Canada; Integrated Program in Neuroscience, McGill University, Montreal, QC H3A 2B4, Canada; Center for Studies in Behavioral Neurobiology, Concordia University, Montreal, QC H4B 1R6, Canada.
| |
Collapse
|
43
|
Neuser MP, Kühnel A, Svaldi J, Kroemer NB. Beyond the average: The role of variable reward sensitivity in eating disorders. Physiol Behav 2020; 223:112971. [DOI: 10.1016/j.physbeh.2020.112971] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2019] [Revised: 04/30/2020] [Accepted: 05/13/2020] [Indexed: 01/13/2023]
|
44
|
Emberly E, Seamans JK. Abrupt, Asynchronous Changes in Action Representations by Anterior Cingulate Cortex Neurons during Trial and Error Learning. Cereb Cortex 2020; 30:4336-4345. [PMID: 32239139 DOI: 10.1093/cercor/bhaa019] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2019] [Revised: 01/09/2020] [Accepted: 01/12/2020] [Indexed: 11/13/2022] Open
Abstract
The ability to act on knowledge about the value of stimuli or actions factors into simple foraging behaviors as well as complex forms of decision-making. In striatal regions, action representations are thought to acquire value through a gradual (reinforcement-learning based) process. It is unclear whether this is also true for anterior cingulate cortex (ACC) where neuronal representations tend to change abruptly. We recorded from ensembles of ACC neurons as rats deduced which of 3 levers was rewarded each day. The rat's lever preferences changed gradually throughout the sessions as they eventually came to focus on the rewarded lever. Most individual neurons changed their responses to both rewarded and nonrewarded lever presses abruptly (<2 trials). These transitions occurred asynchronously across the population but peaked near the point where the rats began to focus on the rewarded lever. Because the individual transitions were asynchronous, the overall change at the population level appeared gradual. Abrupt transitions in action representations of ACC neurons may be part of a mechanism that alters choice strategies as new information is acquired.
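The key statistical point, that abrupt but asynchronous single-neuron transitions sum to a gradual population change, is easy to reproduce in a toy simulation (the neuron count and switch-trial range below are illustrative assumptions, not the recorded data):

```python
import numpy as np

rng = np.random.default_rng(1)
n_neurons, n_trials = 200, 100

# Each simulated neuron switches its response abruptly (a step function),
# but the switch trial differs across neurons (asynchronous transitions).
switch_trials = rng.integers(20, 80, size=n_neurons)
trials = np.arange(n_trials)
responses = (trials[None, :] >= switch_trials[:, None]).astype(float)

# Every individual neuron changes within a single trial ...
assert np.all(np.abs(np.diff(responses, axis=1)).sum(axis=1) == 1.0)

# ... yet the population average ramps up gradually across ~60 trials.
pop = responses.mean(axis=0)
assert pop[20] < 0.1 and 0.3 < pop[50] < 0.7 and pop[80] > 0.9
```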
Collapse
Affiliation(s)
- Eldon Emberly
- Department of Physics, Simon Fraser University, Burnaby, BC V5A 1S6, Canada
| | - Jeremy K Seamans
- Department of Psychiatry, Centre for Brain Health, University of British Columbia, Vancouver, BC V6T 1Z4, Canada
| |
Collapse
|
45
|
Yoon T, Jaleel A, Ahmed AA, Shadmehr R. Saccade vigor and the subjective economic value of visual stimuli. J Neurophysiol 2020; 123:2161-2172. [PMID: 32374201 DOI: 10.1152/jn.00700.2019] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
Decisions are made based on the subjective value that the brain assigns to options. However, subjective value is a mathematical construct that cannot be measured directly, but rather is inferred from choices. Recent results have demonstrated that reaction time, amplitude, and velocity of movements are modulated by reward, raising the possibility that there is a link between how the brain evaluates an option and how it controls movements toward that option. Here, we asked people to choose among risky options represented by abstract stimuli, some associated with gain (points in a game), and others with loss. From their choices we estimated the subjective value that they assigned to each stimulus. In probe trials, a single stimulus appeared at center, instructing subjects to make a saccade to a peripheral target. We found that the reaction time, peak velocity, and amplitude of the peripherally directed saccade varied roughly linearly with the subjective value that the participant had assigned to the central stimulus: reaction time was shorter, velocity was higher, and amplitude was larger for stimuli that the participant valued more. Naturally, participants differed in how much they valued a given stimulus. Remarkably, those who valued a stimulus more, as evidenced by their choices in decision trials, tended to move with shorter reaction time and greater velocity in response to that stimulus in probe trials. Overall, the reaction time of the saccade in response to a stimulus partly predicted the subjective value that the brain assigned to that stimulus. NEW & NOTEWORTHY Behavioral economics relies on subjective evaluation, an abstract quantity that cannot be measured directly but must be inferred by fitting decision models to the choice patterns. Here, we present a new approach to estimate subjective value: with nothing to fit, we show that it is possible to estimate subjective value based on movement kinematics, providing a modest ability to predict a participant's preferences without prior measurement of their choice patterns.
Collapse
Affiliation(s)
- Tehrim Yoon
- Laboratory for Computational Motor Control, Department of Biomedical Engineering, Johns Hopkins School of Medicine, Baltimore, Maryland
| | - Afareen Jaleel
- Laboratory for Computational Motor Control, Department of Biomedical Engineering, Johns Hopkins School of Medicine, Baltimore, Maryland
| | - Alaa A Ahmed
- Departments of Integrative Physiology and Mechanical Engineering, University of Colorado, Boulder, Colorado
| | - Reza Shadmehr
- Laboratory for Computational Motor Control, Department of Biomedical Engineering, Johns Hopkins School of Medicine, Baltimore, Maryland
| |
Collapse
|
46
|
van Swieten MMH, Bogacz R. Modeling the effects of motivation on choice and learning in the basal ganglia. PLoS Comput Biol 2020; 16:e1007465. [PMID: 32453725 PMCID: PMC7274475 DOI: 10.1371/journal.pcbi.1007465] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2019] [Revised: 06/05/2020] [Accepted: 04/03/2020] [Indexed: 01/08/2023] Open
Abstract
Decision making relies on adequately evaluating the consequences of actions on the basis of past experience and the current physiological state. A key role in this process is played by the basal ganglia, where neural activity and plasticity are modulated by dopaminergic input from the midbrain. Internal physiological factors, such as hunger, scale signals encoded by dopaminergic neurons and thus they alter the motivation for taking actions and learning. However, to our knowledge, no formal mathematical formulation exists for how a physiological state affects learning and action selection in the basal ganglia. We developed a framework for modelling the effect of motivation on choice and learning. The framework defines the motivation to obtain a particular resource as the difference between the desired and the current level of this resource, and proposes how the utility of reinforcements depends on the motivation. To account for dopaminergic activity previously recorded in different physiological states, the paper argues that the prediction error encoded in the dopaminergic activity needs to be redefined as the difference between utility and expected utility, which depends on both the objective reinforcement and the motivation. We also demonstrate a possible mechanism by which the evaluation and learning of utility of actions can be implemented in the basal ganglia network. The presented theory brings together models of learning in the basal ganglia with the incentive salience theory in a single simple framework, and it provides a mechanistic insight into how decision processes and learning in the basal ganglia are modulated by the motivation. Moreover, this theory is also consistent with data on neural underpinnings of overeating and obesity, and makes further experimental predictions.
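The framework's central definitions translate directly into a few lines of code: motivation as the gap between desired and current resource level, utility as motivation-scaled reinforcement, and prediction error as utility minus expected utility. The linear utility mapping below is an illustrative assumption; the paper develops the full mapping and its basal ganglia implementation.

```python
def motivation(desired, current):
    """Motivation for a resource: desired level minus current level."""
    return desired - current

def utility(reinforcement, m):
    """Illustrative assumption: subjective utility scales linearly with
    motivation, so the same objective reward is worth more when deprived."""
    return m * reinforcement

def prediction_error(reinforcement, expected_utility, m):
    """Redefined dopaminergic prediction error: utility minus expected
    utility, which depends on both the reinforcement and the motivation."""
    return utility(reinforcement, m) - expected_utility

# The same 1 unit of food yields a larger teaching signal when the agent
# is far from satiety (motivation 3) than when nearly sated (motivation 1).
hungry = prediction_error(1.0, expected_utility=0.5, m=motivation(4, 1))
sated = prediction_error(1.0, expected_utility=0.5, m=motivation(4, 3))
assert hungry > sated
```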
Collapse
Affiliation(s)
| | - Rafal Bogacz
- MRC Brain Network Dynamics Unit, University of Oxford, Oxford, United Kingdom
| |
Collapse
|
47
|
Rationalization of emotion is also rational. Behav Brain Sci 2020; 43:e43. [PMID: 32292159 DOI: 10.1017/s0140525x19002292] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
Cushman seeks to explain rationalization in terms of fundamental mental processes, and he hypothesizes a selected-for function: information exchange between "rational" and "non-rational" processes in the brain. While this is plausible, his account overlooks the importance - and information value - of rationalizing the emotions of ourselves and others. Incorporating such rationalization would help explain the effectiveness of rationalization and its connection with valuation, as well as raise a challenge to his way of bifurcating "rational" and "non-rational" processes.
Collapse
|
48
|
Bayer J, Rusch T, Zhang L, Gläscher J, Sommer T. Dose-dependent effects of estrogen on prediction error related neural activity in the nucleus accumbens of healthy young women. Psychopharmacology (Berl) 2020; 237:745-755. [PMID: 31773208 DOI: 10.1007/s00213-019-05409-7] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/26/2019] [Accepted: 11/18/2019] [Indexed: 12/24/2022]
Abstract
RATIONALE Whereas the effect of the sex steroid 17-beta-estradiol (E2) on dopaminergic (DA) transmission in the nucleus accumbens (NAc) is well evidenced in female rats, studies in humans are inconsistent. Moreover, linear and inverted U-shaped dose-response curves have been observed for E2's effects on hippocampal plasticity, but the shape of dose-response curves for E2's effects on the NAc is much less characterized. OBJECTIVES Investigation of dose-response curves for E2's effects on DA-related neural activity in the human NAc. METHODS Placebo or E2 valerate in doses of 2, 4, 6 or 12 mg was orally administered to 125 naturally cycling young women during the low-hormone menstruation phase on two consecutive days using a randomized, double-blinded design. The E2 treatment regimen induced a wide range of E2 levels, from physiological (2- and 4-mg groups; equivalent to cycle peak) to supraphysiological levels (6- and 12-mg groups; equivalent to early pregnancy). This made it possible to study different dose-response functions for E2's effects on NAc activity. During the E2 peak, participants performed a well-established reversal learning paradigm. We used trial-wise prediction errors (PEs) estimated via a computational reinforcement learning model as a proxy for dopaminergic activity. Linear and quadratic regression analyses predicting PE-related NAc activity from salivary E2 levels were calculated. RESULTS There was a positive linear relationship between PE-associated NAc activity and salivary E2 increases. CONCLUSIONS The randomized, placebo-controlled elevation of E2 levels stimulates NAc activity in the human brain, likely mediated by dopaminergic processes.
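The dose-response model comparison reduces to fitting linear and quadratic regressions of PE-related activity on E2 level. The sketch below uses simulated numbers (hypothetical, not study data) with a linear ground truth, which is recovered as a reliable slope and a near-zero quadratic term:

```python
import numpy as np

rng = np.random.default_rng(2)

# Simulated salivary E2 levels and PE-related NAc responses: a purely
# linear dose-response relationship (slope 0.4) plus Gaussian noise.
e2 = rng.uniform(0, 10, size=200)
nac = 0.4 * e2 + rng.normal(0, 0.5, size=200)

# Fit the two candidate dose-response curves the study compared.
lin = np.polyfit(e2, nac, 1)   # [slope, intercept]
quad = np.polyfit(e2, nac, 2)  # [quadratic, linear, intercept]

# Under a linear ground truth, the linear slope is recovered and the
# fitted quadratic coefficient is near zero (no inverted-U shape).
assert abs(lin[0] - 0.4) < 0.05
assert abs(quad[0]) < 0.05
```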
Collapse
Affiliation(s)
- Janine Bayer
- Department of Systems Neuroscience, University Medical Center Hamburg-Eppendorf, Martinistr. 52, 20246, Hamburg, Germany.
| | - Tessa Rusch
- Department of Systems Neuroscience, University Medical Center Hamburg-Eppendorf, Martinistr. 52, 20246, Hamburg, Germany
| | - Lei Zhang
- Department of Systems Neuroscience, University Medical Center Hamburg-Eppendorf, Martinistr. 52, 20246, Hamburg, Germany.,Department of Basic Psychological Research and Research Methods, University of Vienna, Liebiggasse 5, 1010, Vienna, Austria
| | - Jan Gläscher
- Department of Systems Neuroscience, University Medical Center Hamburg-Eppendorf, Martinistr. 52, 20246, Hamburg, Germany
| | - Tobias Sommer
- Department of Systems Neuroscience, University Medical Center Hamburg-Eppendorf, Martinistr. 52, 20246, Hamburg, Germany
| |
Collapse
|
49
|
A distributional code for value in dopamine-based reinforcement learning. Nature 2020; 577:671-675. [PMID: 31942076 DOI: 10.1038/s41586-019-1924-6] [Citation(s) in RCA: 170] [Impact Index Per Article: 42.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2019] [Accepted: 11/19/2019] [Indexed: 12/12/2022]
Abstract
Since its introduction, the reward prediction error theory of dopamine has explained a wealth of empirical phenomena, providing a unifying framework for understanding the representation of reward and value in the brain [1-3]. According to the now canonical theory, reward predictions are represented as a single scalar quantity, which supports learning about the expectation, or mean, of stochastic outcomes. Here we propose an account of dopamine-based reinforcement learning inspired by recent artificial intelligence research on distributional reinforcement learning [4-6]. We hypothesized that the brain represents possible future rewards not as a single mean, but instead as a probability distribution, effectively representing multiple future outcomes simultaneously and in parallel. This idea implies a set of empirical predictions, which we tested using single-unit recordings from mouse ventral tegmental area. Our findings provide strong evidence for a neural realization of distributional reinforcement learning.
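The core computational idea, that units scaling positive and negative prediction errors asymmetrically come to tile the reward distribution, can be sketched as a simple expectile learner (toy parameters, not the paper's analysis code):

```python
def learn_expectile(rewards, tau, alpha=0.05, n_sweeps=200):
    """Asymmetric prediction-error updates, as in distributional RL: a unit
    that scales positive errors by tau and negative errors by (1 - tau)
    converges to the tau-expectile of the reward distribution."""
    v = 0.0
    for _ in range(n_sweeps):
        for r in rewards:
            pe = r - v
            v += alpha * (tau if pe > 0 else 1 - tau) * pe
    return v

rewards = [0.0, 1.0]  # 50/50 gamble
# A balanced unit (tau = 0.5) learns the mean ...
assert abs(learn_expectile(rewards, tau=0.5) - 0.5) < 0.05
# ... an "optimistic" unit (tau = 0.9) settles above it,
# and a "pessimistic" unit (tau = 0.1) settles below it.
assert learn_expectile(rewards, tau=0.9) > 0.6
assert learn_expectile(rewards, tau=0.1) < 0.4
```

A population of such units with diverse tau values jointly encodes the full distribution of future reward rather than only its mean.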
Collapse
|
50
|
Do domain-general executive resources play a role in linguistic prediction? Re-evaluation of the evidence and a path forward. Neuropsychologia 2020; 136:107258. [DOI: 10.1016/j.neuropsychologia.2019.107258] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2019] [Revised: 11/07/2019] [Accepted: 11/07/2019] [Indexed: 12/13/2022]
|