Reference Citation Analysis

For: Millner AJ, Gershman SJ, Nock MK, den Ouden HEM. Pavlovian Control of Escape and Avoidance. J Cogn Neurosci 2017;30:1379-1390. [PMID: 29244641 DOI: 10.1162/jocn_a_01224] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.0] [Indexed: 02/01/2023]

Cited by Other Article(s)

Colas JT, O’Doherty JP, Grafton ST. Active reinforcement learning versus action bias and hysteresis: control with a mixture of experts and nonexperts. PLoS Comput Biol 2024;20:e1011950. [PMID: 38552190 PMCID: PMC10980507 DOI: 10.1371/journal.pcbi.1011950] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/13/2023] [Accepted: 02/26/2024] [Indexed: 04/01/2024] Open

Abstract

Active reinforcement learning enables dynamic prediction and control, where one should not only maximize rewards but also minimize costs such as of inference, decisions, actions, and time. For an embodied agent such as a human, decisions are also shaped by physical aspects of actions. Beyond the effects of reward outcomes on learning processes, to what extent can modeling of behavior in a reinforcement-learning task be complicated by other sources of variance in sequential action choices? What of the effects of action bias (for actions per se) and action hysteresis determined by the history of actions chosen previously? The present study addressed these questions with incremental assembly of models for the sequential choice data from a task with hierarchical structure for additional complexity in learning. With systematic comparison and falsification of computational models, human choices were tested for signatures of parallel modules representing not only an enhanced form of generalized reinforcement learning but also action bias and hysteresis. We found evidence for substantial differences in bias and hysteresis across participants, even comparable in magnitude to the individual differences in learning. Individuals who did not learn well revealed the greatest biases, but those who did learn accurately were also significantly biased. The direction of hysteresis varied among individuals as repetition or, more commonly, alternation biases persisting from multiple previous actions. Considering that these actions were button presses with trivial motor demands, the idiosyncratic forces biasing sequences of action choices were robust enough to suggest ubiquity across individuals and across tasks requiring various actions. In light of how bias and hysteresis function as a heuristic for efficient control that adapts to uncertainty or low motivation by minimizing the cost of effort, these phenomena broaden the consilient theory of a mixture of experts to encompass a mixture of expert and nonexpert controllers of behavior.

Fahey MP, Yee DM, Leng X, Tarlow M, Shenhav A. Motivational context determines the impact of aversive outcomes on mental effort allocation. bioRxiv 2023:2023.10.27.564461. [PMID: 37961466 PMCID: PMC10634922 DOI: 10.1101/2023.10.27.564461] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Indexed: 11/15/2023]

Fang S, Law SF, Ji X, Liu Q, Zhang P, Zhong R, Li H, Wang X, Yao S, Wang X. Potential neuropsychological mechanism involved in the transition from suicide ideation to action - a resting-state fMRI study implicating the insula. Eur Psychiatry 2023;66:e69. [PMID: 37694389 PMCID: PMC10594382 DOI: 10.1192/j.eurpsy.2023.2444] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 04/20/2023] [Revised: 08/06/2023] [Accepted: 08/08/2023] [Indexed: 09/12/2023] Open

Abstract

BACKGROUND

Understanding the neural mechanism underlying the transition from suicidal ideation to action is crucial, but this mechanism remains unclear. To explore it, we combined resting-state functional connectivity (rsFC) and computational modeling to investigate differences between those who attempted suicide (SA) and those who hold only high levels of suicidal ideation (HSI).

METHODS

A total of 120 MDD patients were categorized into an SA group (n=47) and an HSI group (n=73). All participants completed a resting-state functional MRI scan, with three subregions of the insula and the dorsal anterior cingulate cortex (dACC) chosen as the regions of interest (ROIs) in seed-to-voxel analyses. Additionally, 86 participants completed the balloon analogue risk task (BART), and a five-parameter Bayesian model of the BART was estimated.

RESULTS

In the SA group, the FC between the ventral anterior insula (vAI) and the superior/middle frontal gyrus (vAI-SFG, vAI-MFG), as well as the FC between the posterior insula (pI) and the MFG (pI-MFG), were lower than those in the HSI group. The correlation analysis showed a negative correlation between the FC of vAI-SFG and psychological pain avoidance in the SA group, but a positive correlation in the HSI group. Furthermore, the FC of vAI-MFG displayed a negative correlation with loss aversion in the SA group, while a positive correlation was found with psychological pain avoidance in the HSI group.

CONCLUSION

In the current study, two distinct neural mechanisms in the insula were identified as being involved in the progression from suicidal ideation to action. Dysfunction in vAI FCs may gradually stabilize as individuals experience heightened psychological pain, and a shift from positive to negative correlation patterns of vAI-MFC may indicate a transition from state to trait impairment. Additionally, the dysfunction in pI FC may lead to a lowered threshold for suicide by blunting the perception of physical harm.

Kim H, Hur JK, Kwon M, Kim S, Zoh Y, Ahn WY. Causal role of the dorsolateral prefrontal cortex in modulating the balance between Pavlovian and instrumental systems in the punishment domain. PLoS One 2023;18:e0286632. [PMID: 37267307 PMCID: PMC10237433 DOI: 10.1371/journal.pone.0286632] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/02/2022] [Accepted: 05/19/2023] [Indexed: 06/04/2023] Open

Abstract

Previous literature suggests that a balance between Pavlovian and instrumental decision-making systems is critical for optimal decision-making. Pavlovian bias (i.e., approach toward reward-predictive stimuli and avoidance of punishment-predictive stimuli) often contrasts with the instrumental response. Although recent neuroimaging studies have identified brain regions that may be related to Pavlovian bias, including the dorsolateral prefrontal cortex (dlPFC), it is unclear whether a causal relationship exists. Therefore, we investigated whether upregulation of the dlPFC using transcranial direct current stimulation (tDCS) would reduce Pavlovian bias. In this double-blind study, participants were assigned to the anodal or the sham group; they received stimulation over the right dlPFC for 3 successive days. On the last day, participants performed a reinforcement learning task known as the orthogonalized go/no-go task; this was used to assess each participant's degree of Pavlovian bias in reward and punishment domains. We used computational modeling and hierarchical Bayesian analysis to estimate model parameters reflecting latent cognitive processes, including Pavlovian bias, go bias, and choice randomness. Several computational models were compared; the model with separate Pavlovian bias parameters for reward and punishment domains demonstrated the best model fit. When using a behavioral index of Pavlovian bias, the anodal group showed significantly lower Pavlovian bias in the punishment domain, but not in the reward domain, compared with the sham group. In addition, computational modeling showed that the Pavlovian bias parameter in the punishment domain was lower in the anodal group than in the sham group, which is consistent with the behavioral findings. The anodal group also showed a lower go bias and choice randomness, compared with the sham group. These findings suggest that anodal tDCS may lead to behavioral suppression or change in Pavlovian bias in the punishment domain, which will help to improve comprehension of the causal neural mechanism.

Yamamori Y, Robinson OJ. Computational perspectives on human fear and anxiety. Neurosci Biobehav Rev 2023;144:104959. [PMID: 36375584 PMCID: PMC10564627 DOI: 10.1016/j.neubiorev.2022.104959] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/13/2022] [Revised: 10/25/2022] [Accepted: 11/09/2022] [Indexed: 11/12/2022]

Myers CE, Interian A, Moustafa AA. A practical introduction to using the drift diffusion model of decision-making in cognitive psychology, neuroscience, and health sciences. Front Psychol 2022;13:1039172. [PMID: 36571016 PMCID: PMC9784241 DOI: 10.3389/fpsyg.2022.1039172] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Received: 09/07/2022] [Accepted: 10/27/2022] [Indexed: 12/14/2022] Open

Liu Q, Zhong R, Ji X, Law S, Xiao F, Wei Y, Fang S, Kong X, Zhang X, Yao S, Wang X. Decision-making biases in suicide attempters with major depressive disorder: A computational modeling study using the balloon analog risk task (BART). Depress Anxiety 2022;39:845-857. [PMID: 36329675 DOI: 10.1002/da.23291] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Received: 07/07/2022] [Revised: 09/30/2022] [Accepted: 10/22/2022] [Indexed: 11/06/2022] Open

Affiliation(s)

Qinyu Liu, Medical Psychological Center, The Second Xiangya Hospital, Central South University, Changsha, Hunan, China; China National Clinical Research Center on Mental Disorders (Xiangya), Changsha, Hunan, China
Runqing Zhong, Medical Psychological Center, The Second Xiangya Hospital, Central South University, Changsha, Hunan, China; China National Clinical Research Center on Mental Disorders (Xiangya), Changsha, Hunan, China
Xinlei Ji, Medical Psychological Center, The Second Xiangya Hospital, Central South University, Changsha, Hunan, China; China National Clinical Research Center on Mental Disorders (Xiangya), Changsha, Hunan, China
Samuel Law, Department of Psychiatry, University of Toronto, Toronto, Ontario, Canada
Fan Xiao, Medical Psychological Center, The Second Xiangya Hospital, Central South University, Changsha, Hunan, China; China National Clinical Research Center on Mental Disorders (Xiangya), Changsha, Hunan, China
Yiming Wei, Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai, China
Shulin Fang, Medical Psychological Center, The Second Xiangya Hospital, Central South University, Changsha, Hunan, China; China National Clinical Research Center on Mental Disorders (Xiangya), Changsha, Hunan, China
Xinyuan Kong, Medical Psychological Center, The Second Xiangya Hospital, Central South University, Changsha, Hunan, China; China National Clinical Research Center on Mental Disorders (Xiangya), Changsha, Hunan, China
Xiaocui Zhang, Medical Psychological Center, The Second Xiangya Hospital, Central South University, Changsha, Hunan, China; China National Clinical Research Center on Mental Disorders (Xiangya), Changsha, Hunan, China
Shuqiao Yao, Medical Psychological Center, The Second Xiangya Hospital, Central South University, Changsha, Hunan, China; China National Clinical Research Center on Mental Disorders (Xiangya), Changsha, Hunan, China
Xiang Wang, Medical Psychological Center, The Second Xiangya Hospital, Central South University, Changsha, Hunan, China; China National Clinical Research Center on Mental Disorders (Xiangya), Changsha, Hunan, China

Weber I, Zorowitz S, Niv Y, Bennett D. The effects of induced positive and negative affect on Pavlovian-instrumental interactions. Cogn Emot 2022;36:1343-1360. [PMID: 35929878 PMCID: PMC9852069 DOI: 10.1080/02699931.2022.2109600] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Received: 02/11/2022] [Revised: 07/19/2022] [Accepted: 07/26/2022] [Indexed: 01/22/2023]

Colas JT, Dundon NM, Gerraty RT, Saragosa‐Harris NM, Szymula KP, Tanwisuth K, Tyszka JM, van Geen C, Ju H, Toga AW, Gold JI, Bassett DS, Hartley CA, Shohamy D, Grafton ST, O'Doherty JP. Reinforcement learning with associative or discriminative generalization across states and actions: fMRI at 3 T and 7 T. Hum Brain Mapp 2022;43:4750-4790. [PMID: 35860954 PMCID: PMC9491297 DOI: 10.1002/hbm.25988] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 01/19/2022] [Revised: 05/20/2022] [Accepted: 06/10/2022] [Indexed: 11/12/2022] Open

Affiliation(s)

Jaron T. Colas, Department of Psychological and Brain Sciences, University of California, Santa Barbara, California, USA; Division of the Humanities and Social Sciences, California Institute of Technology, Pasadena, California, USA; Computation and Neural Systems Program, California Institute of Technology, Pasadena, California, USA
Neil M. Dundon, Department of Psychological and Brain Sciences, University of California, Santa Barbara, California, USA; Department of Child and Adolescent Psychiatry, Psychotherapy, and Psychosomatics, University of Freiburg, Freiburg im Breisgau, Germany
Raphael T. Gerraty, Department of Psychology, Columbia University, New York, New York, USA; Zuckerman Mind Brain Behavior Institute, Columbia University, New York, New York, USA; Center for Science and Society, Columbia University, New York, New York, USA
Natalie M. Saragosa‐Harris, Department of Psychology, New York University, New York, New York, USA; Department of Psychology, University of California, Los Angeles, California, USA
Karol P. Szymula, Department of Bioengineering, University of Pennsylvania, Philadelphia, Pennsylvania, USA
Koranis Tanwisuth, Division of the Humanities and Social Sciences, California Institute of Technology, Pasadena, California, USA; Department of Psychology, University of California, Berkeley, California, USA
J. Michael Tyszka, Division of the Humanities and Social Sciences, California Institute of Technology, Pasadena, California, USA
Camilla van Geen, Zuckerman Mind Brain Behavior Institute, Columbia University, New York, New York, USA; Department of Psychology, University of Pennsylvania, Philadelphia, Pennsylvania, USA
Harang Ju, Neuroscience Graduate Group, University of Pennsylvania, Philadelphia, Pennsylvania, USA
Arthur W. Toga, Laboratory of Neuro Imaging, USC Stevens Neuroimaging and Informatics Institute, Keck School of Medicine of USC, University of Southern California, Los Angeles, California, USA
Joshua I. Gold, Department of Neuroscience, University of Pennsylvania, Philadelphia, Pennsylvania, USA
Dani S. Bassett, Department of Bioengineering, University of Pennsylvania, Philadelphia, Pennsylvania, USA; Department of Electrical and Systems Engineering, University of Pennsylvania, Philadelphia, Pennsylvania, USA; Department of Neurology, University of Pennsylvania, Philadelphia, Pennsylvania, USA; Department of Psychiatry, University of Pennsylvania, Philadelphia, Pennsylvania, USA; Department of Physics and Astronomy, University of Pennsylvania, Philadelphia, Pennsylvania, USA; Santa Fe Institute, Santa Fe, New Mexico, USA
Catherine A. Hartley, Department of Psychology, New York University, New York, New York, USA; Center for Neural Science, New York University, New York, New York, USA
Daphna Shohamy, Department of Psychology, Columbia University, New York, New York, USA; Zuckerman Mind Brain Behavior Institute, Columbia University, New York, New York, USA; Kavli Institute for Brain Science, Columbia University, New York, New York, USA
Scott T. Grafton, Department of Psychological and Brain Sciences, University of California, Santa Barbara, California, USA
John P. O'Doherty, Division of the Humanities and Social Sciences, California Institute of Technology, Pasadena, California, USA; Computation and Neural Systems Program, California Institute of Technology, Pasadena, California, USA

Karvelis P, Diaconescu AO. A Computational Model of Hopelessness and Active-Escape Bias in Suicidality. Comput Psychiatr 2022;6:34-59. [PMID: 38774778 PMCID: PMC11104346 DOI: 10.5334/cpsy.80] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 09/07/2021] [Accepted: 02/15/2022] [Indexed: 12/27/2022]

Abstract

Currently, psychiatric practice lacks reliable predictive tools and a sufficiently detailed mechanistic understanding of suicidal thoughts and behaviors (STB) to provide timely and personalized interventions. Developing computational models of STB that integrate across behavioral, cognitive and neural levels of analysis could help better understand STB vulnerabilities and guide personalized interventions. To that end, we present a computational model based on the active inference framework. With this model, we show that several STB risk markers - hopelessness, Pavlovian bias and active-escape bias - are interrelated via the drive to maximize one's model evidence. We propose four ways in which these effects can arise: (1) increased learning from aversive outcomes, (2) reduced belief decay in response to unexpected outcomes, (3) increased stress sensitivity and (4) reduced sense of stressor controllability. These proposals stem from considering the neurocircuits implicated in STB: how the locus coeruleus - norepinephrine (LC-NE) system together with the amygdala (Amy), the dorsal prefrontal cortex (dPFC) and the anterior cingulate cortex (ACC) mediate learning in response to acute stress and volatility as well as how the dorsal raphe nucleus - serotonin (DRN-5-HT) system together with the ventromedial prefrontal cortex (vmPFC) mediate stress reactivity based on perceived stressor controllability. We validate the model by simulating performance in an Avoid/Escape Go/No-Go task replicating recent behavioral findings. This serves as a proof of concept and provides a computational hypothesis space that can be tested empirically and be used to distinguish planful versus impulsive STB subtypes. We discuss the relevance of the proposed model for treatment response prediction, including pharmacotherapy and psychotherapy, as well as sex differences as it relates to stress reactivity and suicide risk.

Yee DM, Leng X, Shenhav A, Braver TS. Aversive motivation and cognitive control. Neurosci Biobehav Rev 2022;133:104493. [PMID: 34910931 PMCID: PMC8792354 DOI: 10.1016/j.neubiorev.2021.12.016] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Received: 05/15/2021] [Revised: 11/12/2021] [Accepted: 12/09/2021] [Indexed: 02/03/2023]

Paulus MP, Thompson WK. Computational approaches and machine learning for individual-level treatment predictions. Psychopharmacology (Berl) 2021;238:1231-1239. [PMID: 31134293 PMCID: PMC6879811 DOI: 10.1007/s00213-019-05282-4] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Received: 03/15/2019] [Accepted: 05/17/2019] [Indexed: 12/24/2022]

Binti Affandi AH, Pike AC, Robinson OJ. Threat of shock promotes passive avoidance, but not active avoidance. Eur J Neurosci 2021;55:2571-2580. [PMID: 33714211 DOI: 10.1111/ejn.15184] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 10/30/2020] [Revised: 01/29/2021] [Accepted: 03/09/2021] [Indexed: 11/28/2022]

Miletić S, Boag RJ, Trutti AC, Stevenson N, Forstmann BU, Heathcote A. A new model of decision processing in instrumental learning tasks. eLife 2021;10:e63055. [PMID: 33501916 PMCID: PMC7880686 DOI: 10.7554/elife.63055] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Received: 09/15/2020] [Accepted: 01/26/2021] [Indexed: 01/12/2023] Open

Huys QJM, Browning M, Paulus MP, Frank MJ. Advances in the computational understanding of mental illness. Neuropsychopharmacology 2021;46:3-19. [PMID: 32620005 PMCID: PMC7688938 DOI: 10.1038/s41386-020-0746-4] [Citation(s) in RCA: 50] [Impact Index Per Article: 16.7] [Received: 05/04/2020] [Revised: 06/11/2020] [Accepted: 06/15/2020] [Indexed: 12/11/2022]

Millner AJ, Robinaugh DJ, Nock MK. Advancing the Understanding of Suicide: The Need for Formal Theory and Rigorous Descriptive Research. Trends Cogn Sci 2020;24:704-716. [PMID: 32680678 PMCID: PMC7429350 DOI: 10.1016/j.tics.2020.06.007] [Citation(s) in RCA: 63] [Impact Index Per Article: 15.8] [Received: 04/06/2020] [Revised: 06/10/2020] [Accepted: 06/17/2020] [Indexed: 01/05/2023]

Shinn M, Lam NH, Murray JD. A flexible framework for simulating and fitting generalized drift-diffusion models. eLife 2020;9:e56938. [PMID: 32749218 PMCID: PMC7462609 DOI: 10.7554/elife.56938] [Citation(s) in RCA: 31] [Impact Index Per Article: 7.8] [Received: 03/16/2020] [Accepted: 08/03/2020] [Indexed: 01/10/2023] Open

Peterburs J, Frieling A, Bellebaum C. Asymmetric coupling of action and outcome valence in active and observational feedback learning. Psychol Res 2020;85:1553-1566. [PMID: 32322967 PMCID: PMC8211594 DOI: 10.1007/s00426-020-01340-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Received: 12/03/2019] [Accepted: 04/07/2020] [Indexed: 11/23/2022]

Abstract

Learning to execute a response to obtain a reward or to inhibit a response to avoid punishment is much easier than learning the reverse, which has been referred to as “Pavlovian” biases. Despite a growing body of research into similarities and differences between active and observational learning, it is as yet unclear if Pavlovian learning biases are specific for active task performance, i.e., learning from feedback provided for one’s own actions, or if they persist also when learning by observing another person’s actions and subsequent outcomes. The present study, therefore, investigated the influence of action and outcome valence in active and observational feedback learning. Healthy adult volunteers completed a go/nogo task that decoupled outcome valence (win/loss) and action (execution/inhibition) either actively or by observing a virtual co-player’s responses and subsequent feedback. Moreover, in a more naturalistic follow-up experiment, pairs of subjects were tested with the same task, with one subject as active learner and the other as observational learner. The results revealed Pavlovian learning biases both in active and in observational learning, with learning of go responses facilitated in the context of reward obtainment, and learning of nogo responses facilitated in the context of loss avoidance. Although the neural correlates of active and observational feedback learning have been shown to differ to some extent, these findings suggest similar mechanisms to underlie both types of learning with respect to the influence of Pavlovian biases. Moreover, performance levels and result patterns were similar in those observational learners who had observed a virtual co-player and those who had completed the task together with an active learner, suggesting that inclusion of a virtual co-player in a computerized task provides an effective manipulation of agency.

Paulus MP. Driven by Pain, Not Gain: Computational Approaches to Aversion-Related Decision Making in Psychiatry. Biol Psychiatry 2020;87:359-367. [PMID: 31653478 PMCID: PMC7012695 DOI: 10.1016/j.biopsych.2019.08.025] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Received: 04/25/2019] [Revised: 08/02/2019] [Accepted: 08/28/2019] [Indexed: 12/21/2022]

Miletić S, Boag RJ, Forstmann BU. Mutual benefits: Combining reinforcement learning with sequential sampling models. Neuropsychologia 2019;136:107261. [PMID: 31733237 DOI: 10.1016/j.neuropsychologia.2019.107261] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Received: 08/20/2019] [Revised: 10/21/2019] [Accepted: 11/10/2019] [Indexed: 12/21/2022]

Seymour B. Pain: A Precision Signal for Reinforcement Learning and Control. Neuron 2019;101:1029-1041. [PMID: 30897355 DOI: 10.1016/j.neuron.2019.01.055] [Citation(s) in RCA: 56] [Impact Index Per Article: 11.2] [Received: 11/08/2018] [Revised: 01/18/2019] [Accepted: 01/27/2019] [Indexed: 12/18/2022]

Shahar N, Hauser TU, Moutoussis M, Moran R, Keramati M, Dolan RJ. Improving the reliability of model-based decision-making estimates in the two-stage decision task with reaction-times and drift-diffusion modeling. PLoS Comput Biol 2019;15:e1006803. [PMID: 30759077 PMCID: PMC6391008 DOI: 10.1371/journal.pcbi.1006803] [Citation(s) in RCA: 64] [Impact Index Per Article: 12.8] [Received: 06/15/2018] [Revised: 02/26/2019] [Accepted: 01/17/2019] [Indexed: 01/10/2023] Open

Dorsal striatal dopamine D1 receptor availability predicts an instrumental bias in action learning. Proc Natl Acad Sci U S A 2018;116:261-270. [PMID: 30563856 PMCID: PMC6320523 DOI: 10.1073/pnas.1816704116] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Indexed: 01/01/2023] Open

Abstract

The brain’s dopaminergic pathways are crucially important for adaptive behavior. They are thought to enable us to approach rewards and stay away from punishments. During learning, dopaminergic reward prediction errors are thought to reinforce previously rewarded actions, so they become easier to repeat. This dopaminergic activity could lead to a systematic bias by which rewarded actions are more readily learned than rewarded inactions. We present two findings. First, dopamine receptors in cortex, dorsal striatum, and nucleus accumbens provide distinct sources of variance in the human brain. Second, the boost in an individual’s learning rate from previously rewarded actions is dependent on the dopamine receptor density in dorsal striatum, a central structure in the dopaminergic circuit.

Learning to act to obtain reward and inhibit to avoid punishment is easier compared with learning the opposite contingencies. This coupling of action and valence is often thought of as a Pavlovian bias, although recent research has shown it may also emerge through instrumental mechanisms. We measured this learning bias with a rewarded go/no-go task in 60 adults of different ages. Using computational modeling, we characterized the bias as being instrumental. To assess the role of endogenous dopamine (DA) in the expression of this bias, we quantified DA D1 receptor availability using positron emission tomography (PET) with the radioligand [¹¹C]SCH23390. Using principal-component analysis on the binding potentials in a number of cortical and striatal regions of interest, we demonstrated that cortical, dorsal striatal, and ventral striatal areas provide independent sources of variance in DA D1 receptor availability. Interindividual variation in the dorsal striatal component was related to the strength of the instrumental bias during learning. These data suggest at least three anatomical sources of variance in DA D1 receptor availability separable using PET in humans, and we provide evidence that human dorsal striatal DA D1 receptors are involved in the modulation of instrumental learning biases.

Gershman SJ. Uncertainty and Exploration. Decision (Wash D C) 2018;6:277-286. [PMID: 33768122 DOI: 10.1037/dec0000101] [Citation(s) in RCA: 38] [Impact Index Per Article: 6.3] [Indexed: 11/08/2022]