1
|
Ullsperger M. Beyond peaks and troughs: Multiplexed performance monitoring signals in the EEG. Psychophysiology 2024; 61:e14553. [PMID: 38415791 DOI: 10.1111/psyp.14553] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2023] [Revised: 02/08/2024] [Accepted: 02/10/2024] [Indexed: 02/29/2024]
Abstract
With the discovery of event-related potentials elicited by errors more than 30 years ago, a new avenue of research on performance monitoring, cognitive control, and decision making emerged. Since then, the field has developed and expanded fulminantly. After a brief overview on the EEG correlates of performance monitoring, this article reviews recent advancements based on single-trial analyses using independent component analysis, multiple regression, and multivariate pattern classification. Given the close interconnection between performance monitoring and reinforcement learning, computational modeling and model-based EEG analyses have made a particularly strong impact. The reviewed findings demonstrate that error- and feedback-related EEG dynamics represent variables reflecting how performance-monitoring signals are weighted and transformed into an adaptation signal that guides future decisions and actions. The model-based single-trial analysis approach goes far beyond conventional peak-and-trough analyses of event-related potentials and enables testing mechanistic theories of performance monitoring, cognitive control, and decision making.
Collapse
Affiliation(s)
- Markus Ullsperger
- Department of Neuropsychology, Otto-von-Guericke University Magdeburg, Magdeburg, Germany
- Center for Behavioral Brain Sciences, Magdeburg, Germany
- German Center for Mental Health (DZPG), partner site Halle-Jena-Magdeburg, Magdeburg, Germany
- Center for Intervention and Research on adaptive and maladaptive brain Circuits underlying mental health (C-I-R-C), Halle-Jena-Magdeburg, Germany
| |
Collapse
|
2
|
Krikova K, Klein S, Kampa M, Walter B, Stark R, Klucken T. Appetitive conditioning with pornographic stimuli elicits stronger activation in reward regions than monetary and gaming-related stimuli. Hum Brain Mapp 2024; 45:e26711. [PMID: 38798103 PMCID: PMC11128778 DOI: 10.1002/hbm.26711] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2023] [Revised: 04/24/2024] [Accepted: 05/02/2024] [Indexed: 05/29/2024] Open
Abstract
Appetitive conditioning plays an important role in the development and maintenance of pornography-use and gaming disorders. It is assumed that primary and secondary reinforcers are involved in these processes. Despite the common use of pornography and gaming in the general population appetitive conditioning processes in this context are still not well studied. This study aims to compare appetitive conditioning processes using primary (pornographic) and secondary (monetary and gaming-related) rewards as unconditioned stimuli (UCS) in the general population. Additionally, it investigates the conditioning processes with gaming-related stimuli as this type of UCS was not used in previous studies. Thirty-one subjects participated in a differential conditioning procedure in which four geometric symbols were paired with either pornographic, monetary, or gaming-related rewards or with nothing to become conditioned stimuli (CS + porn, CS + game, CS + money, and CS-) in an functional magnetic resonance imaging study. We observed elevated arousal and valence ratings as well as skin conductance responses for each CS+ condition compared to the CS-. On the neural level, we found activations during the presentation of the CS + porn in the bilateral nucleus accumbens, right medial orbitofrontal cortex, and the right ventral anterior cingulate cortex compared to the CS-, but no significant activations during CS + money and CS + game compared to the CS-. These results indicate that different processes emerge depending on whether primary and secondary rewards are presented separately or together in the same experimental paradigm. Additionally, monetary and gaming-related stimuli seem to have a lower appetitive value than pornographic rewards.
Collapse
Affiliation(s)
- Kseniya Krikova
- Clinical Psychology and PsychotherapyUniversity of SiegenSiegenGermany
- Department of Psychotherapy and Systems NeuroscienceJustus Liebig University GiessenGiessenGermany
- Bender Institute for Neuroimaging (BION)Justus Liebig University GiessenGiessenGermany
| | - Sanja Klein
- Department of Psychotherapy and Systems NeuroscienceJustus Liebig University GiessenGiessenGermany
- Bender Institute for Neuroimaging (BION)Justus Liebig University GiessenGiessenGermany
| | - Miriam Kampa
- Department of Psychotherapy and Systems NeuroscienceJustus Liebig University GiessenGiessenGermany
- Bender Institute for Neuroimaging (BION)Justus Liebig University GiessenGiessenGermany
| | - Bertram Walter
- Department of Psychotherapy and Systems NeuroscienceJustus Liebig University GiessenGiessenGermany
- Bender Institute for Neuroimaging (BION)Justus Liebig University GiessenGiessenGermany
| | - Rudolf Stark
- Department of Psychotherapy and Systems NeuroscienceJustus Liebig University GiessenGiessenGermany
- Bender Institute for Neuroimaging (BION)Justus Liebig University GiessenGiessenGermany
- Center for Mind, Brain and BehaviorUniversities of Marburg and GießenMarburgGermany
| | - Tim Klucken
- Clinical Psychology and PsychotherapyUniversity of SiegenSiegenGermany
| |
Collapse
|
3
|
Bo K, Kraynak TE, Kwon M, Sun M, Gianaros PJ, Wager TD. A systems identification approach using Bayes factors to deconstruct the brain bases of emotion regulation. Nat Neurosci 2024; 27:975-987. [PMID: 38519748 DOI: 10.1038/s41593-024-01605-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2023] [Accepted: 02/15/2024] [Indexed: 03/25/2024]
Abstract
Cognitive reappraisal is fundamental to cognitive therapies and everyday emotion regulation. Analyses using Bayes factors and an axiomatic systems identification approach identified four reappraisal-related components encompassing distributed neural activity patterns across two independent functional magnetic resonance imaging (fMRI) studies (n = 182 and n = 176): (1) an anterior prefrontal system selectively involved in cognitive reappraisal; (2) a fronto-parietal-insular system engaged by both reappraisal and emotion generation, demonstrating a general role in appraisal; (3) a largely subcortical system activated during negative emotion generation but unaffected by reappraisal, including amygdala, hypothalamus and periaqueductal gray; and (4) a posterior cortical system of negative emotion-related regions downregulated by reappraisal. These systems covaried with individual differences in reappraisal success and were differentially related to neurotransmitter binding maps, implicating cannabinoid and serotonin systems in reappraisal. These findings challenge 'limbic'-centric models of reappraisal and provide new systems-level targets for assessing and enhancing emotion regulation.
Collapse
Affiliation(s)
- Ke Bo
- Department of Psychological and Brain Sciences, Dartmouth College, Hanover, NH, USA
| | - Thomas E Kraynak
- Department of Epidemiology, University of Pittsburgh, Pittsburgh, PA, USA
| | - Mijin Kwon
- Department of Psychological and Brain Sciences, Dartmouth College, Hanover, NH, USA
| | - Michael Sun
- Department of Psychological and Brain Sciences, Dartmouth College, Hanover, NH, USA
| | - Peter J Gianaros
- Department of Psychology, University of Pittsburgh, Pittsburgh, PA, USA.
| | - Tor D Wager
- Department of Psychological and Brain Sciences, Dartmouth College, Hanover, NH, USA.
| |
Collapse
|
4
|
Jaiswal S, Chakravarthula LNC, Padmala S. Additive Effects of Monetary Loss and Positive Emotion in the Human Brain. eNeuro 2024; 11:ENEURO.0374-23.2024. [PMID: 38565297 PMCID: PMC11026344 DOI: 10.1523/eneuro.0374-23.2024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2023] [Revised: 02/26/2024] [Accepted: 03/22/2024] [Indexed: 04/04/2024] Open
Abstract
In many real-life scenarios, our decisions could lead to multiple outcomes that conflict with value. Hence, an appropriate neural representation of the net experienced value of conflicting outcomes, which play a crucial role in guiding future decisions, is critical for adaptive behavior. As some recent functional neuroimaging work has primarily focused on the concurrent processing of monetary gains and aversive information, very little is known regarding the integration of conflicting value signals involving monetary losses and appetitive information in the human brain. To address this critical gap, we conducted a functional MRI study involving healthy human male participants to examine the nature of integrating positive emotion and monetary losses. We employed a novel experimental design where the valence (positive or neutral) of an emotional stimulus indicated the type of outcome (loss or no loss) in a choice task. Specifically, we probed two plausible integration patterns while processing conflicting value signals involving positive emotion and monetary losses: interactive versus additive. We found overlapping main effects of positive (vs neutral) emotion and loss (vs no loss) in multiple brain regions, including the ventromedial prefrontal cortex, striatum, and amygdala, notably with a lack of evidence for interaction. Thus, our findings revealed the additive integration pattern of monetary loss and positive emotion outcomes, suggesting that the experienced value of the monetary loss was not modulated by the valence of the image signaling those outcomes. These findings contribute to our limited understanding of the nature of integrating conflicting outcomes in the healthy human brain with potential clinical relevance.
Collapse
Affiliation(s)
- Sagarika Jaiswal
- Centre for Neuroscience, Indian Institute of Science, Bangalore, Karnataka 560012, India
| | | | - Srikanth Padmala
- Centre for Neuroscience, Indian Institute of Science, Bangalore, Karnataka 560012, India
| |
Collapse
|
5
|
Singletary NM, Gottlieb J, Horga G. The parieto-occipital cortex is a candidate neural substrate for the human ability to approximate Bayesian inference. Commun Biol 2024; 7:165. [PMID: 38337012 PMCID: PMC10858241 DOI: 10.1038/s42003-024-05821-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2022] [Accepted: 01/15/2024] [Indexed: 02/12/2024] Open
Abstract
Adaptive decision-making often requires one to infer unobservable states based on incomplete information. Bayesian logic prescribes that individuals should do so by estimating the posterior probability by integrating the prior probability with new information, but the neural basis of this integration is incompletely understood. We record fMRI during a task in which participants infer the posterior probability of a hidden state while we independently modulate the prior probability and likelihood of evidence regarding the state; the task incentivizes participants to make accurate inferences and dissociates expected value from posterior probability. Here we show that activation in a region of left parieto-occipital cortex independently tracks the subjective posterior probability, combining its subcomponents of prior probability and evidence likelihood, and reflecting the individual participants' systematic deviations from objective probabilities. The parieto-occipital cortex is thus a candidate neural substrate for humans' ability to approximate Bayesian inference by integrating prior beliefs with new information.
Collapse
Affiliation(s)
- Nicholas M Singletary
- Doctoral Program in Neurobiology and Behavior, Columbia University, New York, NY, USA.
- Department of Neuroscience, Columbia University, New York, NY, USA.
- Zuckerman Mind Brain Behavior Institute, Columbia University, New York, NY, USA.
- New York State Psychiatric Institute, New York, NY, USA.
| | - Jacqueline Gottlieb
- Department of Neuroscience, Columbia University, New York, NY, USA.
- Zuckerman Mind Brain Behavior Institute, Columbia University, New York, NY, USA.
- Kavli Institute for Brain Science, Columbia University, New York, NY, USA.
| | - Guillermo Horga
- New York State Psychiatric Institute, New York, NY, USA.
- Department of Psychiatry, Columbia University, New York, NY, USA.
| |
Collapse
|
6
|
Blanco-Pozo M, Akam T, Walton ME. Dopamine-independent effect of rewards on choices through hidden-state inference. Nat Neurosci 2024; 27:286-297. [PMID: 38216649 PMCID: PMC10849965 DOI: 10.1038/s41593-023-01542-x] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2021] [Accepted: 12/01/2023] [Indexed: 01/14/2024]
Abstract
Dopamine is implicated in adaptive behavior through reward prediction error (RPE) signals that update value estimates. There is also accumulating evidence that animals in structured environments can use inference processes to facilitate behavioral flexibility. However, it is unclear how these two accounts of reward-guided decision-making should be integrated. Using a two-step task for mice, we show that dopamine reports RPEs using value information inferred from task structure knowledge, alongside information about reward rate and movement. Nonetheless, although rewards strongly influenced choices and dopamine activity, neither activating nor inhibiting dopamine neurons at trial outcome affected future choice. These data were recapitulated by a neural network model where cortex learned to track hidden task states by predicting observations, while basal ganglia learned values and actions via RPEs. This shows that the influence of rewards on choices can stem from dopamine-independent information they convey about the world's state, not the dopaminergic RPEs they produce.
Collapse
Affiliation(s)
- Marta Blanco-Pozo
- Department of Experimental Psychology, Oxford University, Oxford, UK.
- Wellcome Centre for Integrative Neuroimaging, Oxford University, Oxford, UK.
| | - Thomas Akam
- Department of Experimental Psychology, Oxford University, Oxford, UK.
- Wellcome Centre for Integrative Neuroimaging, Oxford University, Oxford, UK.
| | - Mark E Walton
- Department of Experimental Psychology, Oxford University, Oxford, UK.
- Wellcome Centre for Integrative Neuroimaging, Oxford University, Oxford, UK.
| |
Collapse
|
7
|
Kirschner H, Nassar MR, Fischer AG, Frodl T, Meyer-Lotz G, Froböse S, Seidenbecher S, Klein TA, Ullsperger M. Transdiagnostic inflexible learning dynamics explain deficits in depression and schizophrenia. Brain 2024; 147:201-214. [PMID: 38058203 PMCID: PMC10766268 DOI: 10.1093/brain/awad362] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2023] [Revised: 09/25/2023] [Accepted: 10/10/2023] [Indexed: 12/08/2023] Open
Abstract
Deficits in reward learning are core symptoms across many mental disorders. Recent work suggests that such learning impairments arise by a diminished ability to use reward history to guide behaviour, but the neuro-computational mechanisms through which these impairments emerge remain unclear. Moreover, limited work has taken a transdiagnostic approach to investigate whether the psychological and neural mechanisms that give rise to learning deficits are shared across forms of psychopathology. To provide insight into this issue, we explored probabilistic reward learning in patients diagnosed with major depressive disorder (n = 33) or schizophrenia (n = 24) and 33 matched healthy controls by combining computational modelling and single-trial EEG regression. In our task, participants had to integrate the reward history of a stimulus to decide whether it is worthwhile to gamble on it. Adaptive learning in this task is achieved through dynamic learning rates that are maximal on the first encounters with a given stimulus and decay with increasing stimulus repetitions. Hence, over the course of learning, choice preferences would ideally stabilize and be less susceptible to misleading information. We show evidence of reduced learning dynamics, whereby both patient groups demonstrated hypersensitive learning (i.e. less decaying learning rates), rendering their choices more susceptible to misleading feedback. Moreover, there was a schizophrenia-specific approach bias and a depression-specific heightened sensitivity to disconfirmational feedback (factual losses and counterfactual wins). The inflexible learning in both patient groups was accompanied by altered neural processing, including no tracking of expected values in either patient group. Taken together, our results thus provide evidence that reduced trial-by-trial learning dynamics reflect a convergent deficit across depression and schizophrenia. Moreover, we identified disorder distinct learning deficits.
Collapse
Affiliation(s)
- Hans Kirschner
- Institute of Psychology, Otto-von-Guericke University, D-39106 Magdeburg, Germany
| | - Matthew R Nassar
- Robert J. and Nancy D. Carney Institute for Brain Science, Brown University, Providence, RI 02912-1821, USA
- Department of Neuroscience, Brown University, Providence, RI 02912-1821, USA
| | - Adrian G Fischer
- Department of Education and Psychology, Freie Universität Berlin, D-14195 Berlin, Germany
| | - Thomas Frodl
- Department of Psychiatry and Psychotherapy, Otto-von-Guericke University, D-39106 Magdeburg, Germany
- Department of Psychiatry, Psychotherapy and Psychosomatics, RWTH Aachen University, Aachen 52074, Germany
- German Center for Mental Health (DZPG), D-39106 Magdeburg, Germany
- Center for Intervention and Research on adaptive and maladaptive brain Circuits underlying mental health (C-I-R-C), Jena-Magdeburg-Halle, D-39106 Magdeburg, Germany
| | - Gabriela Meyer-Lotz
- Department of Psychiatry and Psychotherapy, Otto-von-Guericke University, D-39106 Magdeburg, Germany
| | - Sören Froböse
- Department of Psychiatry and Psychotherapy, Otto-von-Guericke University, D-39106 Magdeburg, Germany
| | - Stephanie Seidenbecher
- Department of Psychiatry and Psychotherapy, Otto-von-Guericke University, D-39106 Magdeburg, Germany
| | - Tilmann A Klein
- Institute of Psychology, Otto-von-Guericke University, D-39106 Magdeburg, Germany
- Center for Behavioral Brain Sciences, D-39106 Magdeburg, Germany
| | - Markus Ullsperger
- Institute of Psychology, Otto-von-Guericke University, D-39106 Magdeburg, Germany
- German Center for Mental Health (DZPG), D-39106 Magdeburg, Germany
- Center for Intervention and Research on adaptive and maladaptive brain Circuits underlying mental health (C-I-R-C), Jena-Magdeburg-Halle, D-39106 Magdeburg, Germany
- Center for Behavioral Brain Sciences, D-39106 Magdeburg, Germany
| |
Collapse
|
8
|
Carvalheiro J, Philiastides MG. Distinct spatiotemporal brainstem pathways of outcome valence during reward- and punishment-based learning. Cell Rep 2023; 42:113589. [PMID: 38100353 DOI: 10.1016/j.celrep.2023.113589] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2023] [Revised: 10/05/2023] [Accepted: 11/30/2023] [Indexed: 12/17/2023] Open
Abstract
Learning to seek rewards and avoid punishments, based on positive and negative choice outcomes, is essential for human survival. Yet, the neural underpinnings of outcome valence in the human brainstem and the extent to which they differ in reward and punishment learning contexts remain largely elusive. Here, using simultaneously acquired electroencephalography and functional magnetic resonance imaging data, we show that during reward learning the substantia nigra (SN)/ventral tegmental area (VTA) and locus coeruleus are initially activated following negative outcomes, while the VTA subsequently re-engages exhibiting greater responses for positive than negative outcomes, consistent with an early arousal/avoidance response and a later value-updating process, respectively. During punishment learning, we show that distinct raphe nucleus and SN subregions are activated only by negative outcomes with a sustained post-outcome activity across time, supporting the involvement of these brainstem subregions in avoidance behavior. Finally, we demonstrate that the coupling of these brainstem structures with other subcortical and cortical areas helps to shape participants' serial choice behavior in each context.
Collapse
Affiliation(s)
- Joana Carvalheiro
- School of Psychology and Neuroscience, University of Glasgow, Glasgow G12 8QB, UK; Centre for Cognitive Neuroimaging, University of Glasgow, Glasgow G12 8QB, UK.
| | - Marios G Philiastides
- School of Psychology and Neuroscience, University of Glasgow, Glasgow G12 8QB, UK; Centre for Cognitive Neuroimaging, University of Glasgow, Glasgow G12 8QB, UK.
| |
Collapse
|
9
|
Konova AB, Ceceli AO, Horga G, Moeller SJ, Alia-Klein N, Goldstein RZ. Reduced neural encoding of utility prediction errors in cocaine addiction. Neuron 2023; 111:4058-4070.e6. [PMID: 37883973 PMCID: PMC10880133 DOI: 10.1016/j.neuron.2023.09.015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2022] [Revised: 07/18/2023] [Accepted: 09/13/2023] [Indexed: 10/28/2023]
Abstract
Influential accounts of addiction posit alterations in adaptive behavior driven by deficient dopaminergic prediction errors (PEs), signaling the discrepancy between actual and expected reward. Dopamine neurons encode these error signals in subjective terms, calibrated by individual risk preferences, as "utility" PEs. It remains unclear, however, whether people with drug addiction have PE deficits or their computational source. Here, using an analogous task to prior single-unit studies with known expectancies, we show that fMRI-measured PEs similarly reflect utility PEs. Relative to control participants, people with chronic cocaine addiction demonstrate reduced utility PEs in the dopaminoceptive ventral striatum, with similar trends in orbitofrontal cortex. Dissecting this PE signal into its subcomponent terms attributed these reductions to weaker striatal responses to received reward/utility, whereas suppression of activity with reward expectation was unchanged. These findings support that addiction may fundamentally disrupt PE signaling and reveal an underappreciated role for perceived reward value in this mechanism.
Collapse
Affiliation(s)
- Anna B Konova
- Department of Psychiatry, University Behavioral Health Care & the Brain Health Institute, Rutgers University-New Brunswick, Piscataway, NJ 08855, USA.
| | - Ahmet O Ceceli
- Departments of Psychiatry & Neuroscience, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Guillermo Horga
- Department of Psychiatry, Columbia University, New York, NY 10024, USA
| | - Scott J Moeller
- Department of Psychiatry, Renaissance School of Medicine at Stony Brook University, Stony Brook, NY 11794, USA
| | - Nelly Alia-Klein
- Departments of Psychiatry & Neuroscience, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Rita Z Goldstein
- Departments of Psychiatry & Neuroscience, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA.
| |
Collapse
|
10
|
Chakravarthula LN, Padmala S. Negative emotion reduces the discriminability of reward outcomes in the ventromedial prefrontal cortex. Soc Cogn Affect Neurosci 2023; 18:nsad067. [PMID: 37978320 PMCID: PMC10661064 DOI: 10.1093/scan/nsad067] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2023] [Revised: 09/11/2023] [Accepted: 11/17/2023] [Indexed: 11/19/2023] Open
Abstract
Reward and emotion are tightly intertwined, so there is a growing interest in mapping their interactions. However, our knowledge of these interactions in the human brain, especially during the consummatory phase of reward is limited. To address this critical gap, we conducted a functional magnetic resonance imaging study to investigate the effects of negative emotion on reward outcome processing. We employed a novel design where emotional valence (negative or neutral) indicated the type of outcome (reward or no-reward) in a choice task. We focused our functional magnetic resonance imaging analysis on the ventro-medial prefrontal cortex (vmPFC), ventral striatum and amygdala, which were frequently implicated in reward outcome processing. In these regions of interest, we performed multi-voxel pattern analysis to specifically probe how negative emotion modulates reward outcome processing. In vmPFC, using decoding analysis, we found evidence consistent with the reduced discriminability of multi-variate activity patterns of reward vs no-reward outcomes when signaled by a negative relative to a neutral image, suggesting an emotional modulation of reward processing along the plausible common value/valence dimension. These findings advance our limited understanding of the basic brain mechanisms underlying the influence of negative emotion on consummatory reward processing, with potential implications for mental disorders, particularly anxiety and depression.
Collapse
Affiliation(s)
| | - Srikanth Padmala
- Centre for Neuroscience, Indian Institute of Science, Bangalore, KA 560012, India
| |
Collapse
|
11
|
Gold BP, Pearce MT, McIntosh AR, Chang C, Dagher A, Zatorre RJ. Auditory and reward structures reflect the pleasure of musical expectancies during naturalistic listening. Front Neurosci 2023; 17:1209398. [PMID: 37928727 PMCID: PMC10625409 DOI: 10.3389/fnins.2023.1209398] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2023] [Accepted: 10/05/2023] [Indexed: 11/07/2023] Open
Abstract
Enjoying music consistently engages key structures of the neural auditory and reward systems such as the right superior temporal gyrus (R STG) and ventral striatum (VS). Expectations seem to play a central role in this effect, as preferences reliably vary according to listeners' uncertainty about the musical future and surprise about the musical past. Accordingly, VS activity reflects the pleasure of musical surprise, and exhibits stronger correlations with R STG activity as pleasure grows. Yet the reward value of musical surprise - and thus the reason for these surprises engaging the reward system - remains an open question. Recent models of predictive neural processing and learning suggest that forming, testing, and updating hypotheses about one's environment may be intrinsically rewarding, and that the constantly evolving structure of musical patterns could provide ample opportunity for this procedure. Consistent with these accounts, our group previously found that listeners tend to prefer melodic excerpts taken from real music when it either validates their uncertain melodic predictions (i.e., is high in uncertainty and low in surprise) or when it challenges their highly confident ones (i.e., is low in uncertainty and high in surprise). An independent research group (Cheung et al., 2019) replicated these results with musical chord sequences, and identified their fMRI correlates in the STG, amygdala, and hippocampus but not the VS, raising new questions about the neural mechanisms of musical pleasure that the present study seeks to address. Here, we assessed concurrent liking ratings and hemodynamic fMRI signals as 24 participants listened to 50 naturalistic, real-world musical excerpts that varied across wide spectra of computationally modeled uncertainty and surprise. As in previous studies, liking ratings exhibited an interaction between uncertainty and surprise, with the strongest preferences for high uncertainty/low surprise and low uncertainty/high surprise. FMRI results also replicated previous findings, with music liking effects in the R STG and VS. Furthermore, we identify interactions between uncertainty and surprise on the one hand, and liking and surprise on the other, in VS activity. Altogether, these results provide important support for the hypothesized role of the VS in deriving pleasure from learning about musical structure.
Collapse
Affiliation(s)
- Benjamin P. Gold
- Department of Electrical and Computer Engineering, Vanderbilt University, Nashville, TN, United States
- Vanderbilt University Institute of Imaging Science, Vanderbilt University Medical Center, Nashville, TN, United States
- Montreal Neurological Institute, McGill University, Montreal, QC, Canada
- International Laboratory for Brain, Music and Sound Research (BRAMS), Montreal, QC, Canada
- Centre for Research on Brain, Language and Music (CRBLM), Montreal, QC, Canada
- Centre for Interdisciplinary Research in Music, Media, and Technology (CIRMMT), Montreal, QC, Canada
| | - Marcus T. Pearce
- Cognitive Science Research Group, School of Electronic Engineering & Computer Science, Queen Mary University of London, London, United Kingdom
- Department of Clinical Medicine, Aarhus University, Aarhus, Denmark
| | - Anthony R. McIntosh
- Baycrest Centre, Rotman Research Institute, Toronto, ON, Canada
- Department of Psychology, University of Toronto, Toronto, ON, Canada
| | - Catie Chang
- Department of Electrical and Computer Engineering, Vanderbilt University, Nashville, TN, United States
- Vanderbilt University Institute of Imaging Science, Vanderbilt University Medical Center, Nashville, TN, United States
- Department of Biomedical Engineering, Vanderbilt University, Nashville, TN, United States
- Department of Computer Science, Vanderbilt University, Nashville, TN, United States
| | - Alain Dagher
- Montreal Neurological Institute, McGill University, Montreal, QC, Canada
| | - Robert J. Zatorre
- Montreal Neurological Institute, McGill University, Montreal, QC, Canada
- International Laboratory for Brain, Music and Sound Research (BRAMS), Montreal, QC, Canada
- Centre for Research on Brain, Language and Music (CRBLM), Montreal, QC, Canada
- Centre for Interdisciplinary Research in Music, Media, and Technology (CIRMMT), Montreal, QC, Canada
| |
Collapse
|
12
|
Hird EJ, Diederen K, Leucht S, Jensen KB, McGuire P. The Placebo Effect in Psychosis: Why It Matters and How to Measure It. BIOLOGICAL PSYCHIATRY GLOBAL OPEN SCIENCE 2023; 3:605-613. [PMID: 37881581 PMCID: PMC10593894 DOI: 10.1016/j.bpsgos.2023.02.008] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2022] [Revised: 12/04/2022] [Accepted: 02/20/2023] [Indexed: 03/07/2023] Open
Abstract
Psychosis is characterized by unusual percepts and beliefs in the form of hallucinations and delusions. Antipsychotic medication, the primary treatment for psychosis, is often ineffective and accompanied by severe side effects, but research has not identified an effective alternative in several decades. One reason that clinical trials fail is that patients with psychosis tend to show a significant therapeutic response to inert control treatments, known as the placebo effect, which makes it difficult to distinguish drug effects from placebo effects. Conversely, in clinical practice, a strong placebo effect may be useful because it could enhance the overall treatment response. Identifying factors that predict large placebo effects could improve the future outlook of psychosis treatment. Biomarkers of the placebo effect have already been suggested in pain and depression, but not in psychosis. Quantifying markers of the placebo effect would have the potential to predict placebo effects in psychosis clinical trials. Furthermore, the placebo effect and psychosis may represent a shared neurocognitive mechanism in which prior beliefs are weighted against new sensory information to make inferences about reality. Examining this overlap could reveal new insights into the mechanisms underlying psychosis and indicate novel treatment targets. We provide a narrative review of the importance of the placebo effect in psychosis and propose a novel method to assess it.
Collapse
Affiliation(s)
- Emily J. Hird
- Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, England
| | - Kelly Diederen
- Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, England
| | - Stefan Leucht
- Department of Psychiatry and Psychotherapy, Technical University of Munich, Munich, Germany
| | - Karin B. Jensen
- Department of Clinical Neuroscience, Karolinska Institute, Stockholm, Sweden
| | - Philip McGuire
- Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, England
| |
Collapse
|
13
|
Newton-Fenner A, Hewitt D, Henderson J, Roberts H, Mari T, Gu Y, Gorelkina O, Giesbrecht T, Fallon N, Roberts C, Stancak A. Economic value in the Brain: A meta-analysis of willingness-to-pay using the Becker-DeGroot-Marschak auction. PLoS One 2023; 18:e0286969. [PMID: 37428744 DOI: 10.1371/journal.pone.0286969] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2022] [Accepted: 05/29/2023] [Indexed: 07/12/2023] Open
Abstract
Forming and comparing subjective values (SVs) of choice options is a critical stage of decision-making. Previous studies have highlighted a complex network of brain regions involved in this process by utilising a diverse range of tasks and stimuli, varying in economic, hedonic and sensory qualities. However, the heterogeneity of tasks and sensory modalities may systematically confound the set of regions mediating the SVs of goods. To identify and delineate the core brain valuation system involved in processing SV, we utilised the Becker-DeGroot-Marschak (BDM) auction, an incentivised demand-revealing mechanism which quantifies SV through the economic metric of willingness-to-pay (WTP). A coordinate-based activation likelihood estimation meta-analysis analysed twenty-four fMRI studies employing a BDM task (731 participants; 190 foci). Using an additional contrast analysis, we also investigated whether this encoding of SV would be invariant to the concurrency of auction task and fMRI recordings. A fail-safe number analysis was conducted to explore potential publication bias. WTP positively correlated with fMRI-BOLD activations in the left ventromedial prefrontal cortex with a sub-cluster extending into anterior cingulate cortex, bilateral ventral striatum, right dorsolateral prefrontal cortex, right inferior frontal gyrus, and right anterior insula. Contrast analysis identified preferential engagement of the mentalizing-related structures in response to concurrent scanning. Together, our findings offer succinct empirical support for the core structures participating in the formation of SV, separate from the hedonic aspects of reward and evaluated in terms of WTP using BDM, and show the selective involvement of inhibition-related brain structures during active valuation.
Collapse
Affiliation(s)
- Alice Newton-Fenner
- Department of Psychology, University of Liverpool, Liverpool, United Kingdom
- Institute of Risk and Uncertainty, University of Liverpool, Liverpool, United Kingdom
| | - Danielle Hewitt
- Department of Psychology, University of Liverpool, Liverpool, United Kingdom
- Wellcome Centre for Integrative Neuroimaging, University of Oxford, Oxford, United Kingdom
| | - Jessica Henderson
- Department of Psychology, University of Liverpool, Liverpool, United Kingdom
| | - Hannah Roberts
- Department of Psychology, University of Liverpool, Liverpool, United Kingdom
| | - Tyler Mari
- Department of Psychology, University of Liverpool, Liverpool, United Kingdom
| | - Yiquan Gu
- Henley Business School, University of Reading, Reading, United Kingdom
| | - Olga Gorelkina
- Management School, University of Liverpool, Liverpool, United Kingdom
| | - Timo Giesbrecht
- Unilever, Research and Development, Port Sunlight, United Kingdom
| | - Nicolas Fallon
- Department of Psychology, University of Liverpool, Liverpool, United Kingdom
| | - Carl Roberts
- Department of Psychology, University of Liverpool, Liverpool, United Kingdom
| | - Andrej Stancak
- Department of Psychology, University of Liverpool, Liverpool, United Kingdom
- Institute of Risk and Uncertainty, University of Liverpool, Liverpool, United Kingdom
| |
Collapse
|
14
|
Nair A, Niyogi RK, Shang F, Tabrizi SJ, Rees G, Rutledge RB. Opportunity cost determines free-operant action initiation latency and predicts apathy. Psychol Med 2023; 53:1850-1859. [PMID: 37310334 PMCID: PMC10106307 DOI: 10.1017/s0033291721003469] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/22/2020] [Revised: 06/20/2021] [Accepted: 08/03/2021] [Indexed: 10/20/2022]
Abstract
BACKGROUND Apathy, a disabling and poorly understood neuropsychiatric symptom, is characterised by impaired self-initiated behaviour. It has been hypothesised that the opportunity cost of time (OCT) may be a key computational variable linking self-initiated behaviour with motivational status. OCT represents the amount of reward which is foregone per second if no action is taken. Using a novel behavioural task and computational modelling, we investigated the relationship between OCT, self-initiation and apathy. We predicted that higher OCT would engender shorter action latencies, and that individuals with greater sensitivity to OCT would have higher behavioural apathy. METHODS We modulated the OCT in a novel task called the 'Fisherman Game', Participants freely chose when to self-initiate actions to either collect rewards, or on occasion, to complete non-rewarding actions. We measured the relationship between action latencies, OCT and apathy for each participant across two independent non-clinical studies, one under laboratory conditions (n = 21) and one online (n = 90). 'Average-reward' reinforcement learning was used to model our data. We replicated our findings across both studies. RESULTS We show that the latency of self-initiation is driven by changes in the OCT. Furthermore, we demonstrate, for the first time, that participants with higher apathy showed greater sensitivity to changes in OCT in younger adults. Our model shows that apathetic individuals experienced greatest change in subjective OCT during our task as a consequence of being more sensitive to rewards. CONCLUSIONS Our results suggest that OCT is an important variable for determining free-operant action initiation and understanding apathy.
Collapse
Affiliation(s)
- Akshay Nair
- Huntington's Disease Centre, UCL Queen Square Institute of Neurology, University College London, Russell Square House, 10-12 Russell Square, London, WC1B 5EH, UK
- Max Planck UCL Centre for Computational Psychiatry and Ageing Research, UCL Queen Square Institute of Neurology, University College London, Russell Square House, 10-12 Russell Square, London, WC1B 5EH, UK
| | - Ritwik K. Niyogi
- Max Planck UCL Centre for Computational Psychiatry and Ageing Research, UCL Queen Square Institute of Neurology, University College London, Russell Square House, 10-12 Russell Square, London, WC1B 5EH, UK
| | - Fei Shang
- Max Planck UCL Centre for Computational Psychiatry and Ageing Research, UCL Queen Square Institute of Neurology, University College London, Russell Square House, 10-12 Russell Square, London, WC1B 5EH, UK
- Department of Psychiatry, Yale University, New Haven, CT 06510, USA
| | - Sarah J. Tabrizi
- Huntington's Disease Centre, UCL Queen Square Institute of Neurology, University College London, Russell Square House, 10-12 Russell Square, London, WC1B 5EH, UK
- Wellcome Centre for Human Neuroimaging, UCL Queen Square Institute of Neurology, University College London, 12 Queen Square, London WC1N 3AR, UK
| | - Geraint Rees
- Wellcome Centre for Human Neuroimaging, UCL Queen Square Institute of Neurology, University College London, 12 Queen Square, London WC1N 3AR, UK
- UCL Institute of Cognitive Neuroscience, UCL Queen Square Institute of Neurology, University College London, 17-19 Queen Square, London, WC1N 3AZ, UK
| | - Robb B. Rutledge
- Max Planck UCL Centre for Computational Psychiatry and Ageing Research, UCL Queen Square Institute of Neurology, University College London, Russell Square House, 10-12 Russell Square, London, WC1B 5EH, UK
- Wellcome Centre for Human Neuroimaging, UCL Queen Square Institute of Neurology, University College London, 12 Queen Square, London WC1N 3AR, UK
- Department of Psychology, Yale University, New Haven, CT 06511, USA
| |
Collapse
|
15
|
Speer SPH, Keysers C, Barrios JC, Teurlings CJS, Smidts A, Boksem MAS, Wager TD, Gazzola V. A multivariate brain signature for reward. Neuroimage 2023; 271:119990. [PMID: 36878456 DOI: 10.1016/j.neuroimage.2023.119990] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2022] [Revised: 02/20/2023] [Accepted: 02/25/2023] [Indexed: 03/07/2023] Open
Abstract
The processing of reinforcers and punishers is crucial to adapt to an ever changing environment and its dysregulation is prevalent in mental health and substance use disorders. While many human brain measures related to reward have been based on activity in individual brain regions, recent studies indicate that many affective and motivational processes are encoded in distributed systems that span multiple regions. Consequently, decoding these processes using individual regions yields small effect sizes and limited reliability, whereas predictive models based on distributed patterns yield larger effect sizes and excellent reliability. To create such a predictive model for the processes of rewards and losses, termed the Brain Reward Signature (BRS), we trained a model to predict the signed magnitude of monetary rewards on the Monetary Incentive Delay task (MID; N = 39) and achieved a highly significant decoding performance (92% for decoding rewards versus losses). We subsequently demonstrate the generalizability of our signature on another version of the MID in a different sample (92% decoding accuracy; N = 12) and on a gambling task from a large sample (73% decoding accuracy, N = 1084). We further provided preliminary data to characterize the specificity of the signature by illustrating that the signature map generates estimates that significantly differ between rewarding and negative feedback (92% decoding accuracy) but do not differ for conditions that differ in disgust rather than reward in a novel Disgust-Delay Task (N = 39). Finally, we show that passively viewing positive and negatively valenced facial expressions loads positively on our signature, in line with previous studies on morbid curiosity. We thus created a BRS that can accurately predict brain responses to rewards and losses in active decision making tasks, and that possibly relates to information seeking in passive observational tasks.
Collapse
Affiliation(s)
- Sebastian P H Speer
- Social Brain Lab, Netherlands Institute for Neuroscience, Amsterdam, The Netherlands; Princeton Neuroscience Institute, Princeton University, Princeton, NJ 08544, USA
| | - Christian Keysers
- Social Brain Lab, Netherlands Institute for Neuroscience, Amsterdam, The Netherlands; Brain and Cognition, Department of Psychology, University of Amsterdam, The Netherlands
| | | | - Cas J S Teurlings
- Social Brain Lab, Netherlands Institute for Neuroscience, Amsterdam, The Netherlands
| | - Ale Smidts
- Rotterdam School of Management, Erasmus University, 3062 PA Rotterdam, The Netherlands
| | - Maarten A S Boksem
- Rotterdam School of Management, Erasmus University, 3062 PA Rotterdam, The Netherlands
| | - Tor D Wager
- Department of Psychological and Brain Sciences, Dartmouth College, Hanover, NH 03755, USA
| | - Valeria Gazzola
- Social Brain Lab, Netherlands Institute for Neuroscience, Amsterdam, The Netherlands.
| |
Collapse
|
16
|
Suzuki S, Zhang X, Dezfouli A, Braganza L, Fulcher BD, Parkes L, Fontenelle LF, Harrison BJ, Murawski C, Yücel M, Suo C. Individuals with problem gambling and obsessive-compulsive disorder learn through distinct reinforcement mechanisms. PLoS Biol 2023; 21:e3002031. [PMID: 36917567 PMCID: PMC10013903 DOI: 10.1371/journal.pbio.3002031] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2022] [Accepted: 02/08/2023] [Indexed: 03/16/2023] Open
Abstract
Obsessive-compulsive disorder (OCD) and pathological gambling (PG) are accompanied by deficits in behavioural flexibility. In reinforcement learning, this inflexibility can reflect asymmetric learning from outcomes above and below expectations. In alternative frameworks, it reflects perseveration independent of learning. Here, we examine evidence for asymmetric reward-learning in OCD and PG by leveraging model-based functional magnetic resonance imaging (fMRI). Compared with healthy controls (HC), OCD patients exhibited a lower learning rate for worse-than-expected outcomes, which was associated with the attenuated encoding of negative reward prediction errors in the dorsomedial prefrontal cortex and the dorsal striatum. PG patients showed higher and lower learning rates for better- and worse-than-expected outcomes, respectively, accompanied by higher encoding of positive reward prediction errors in the anterior insula than HC. Perseveration did not differ considerably between the patient groups and HC. These findings elucidate the neural computations of reward-learning that are altered in OCD and PG, providing a potential account of behavioural inflexibility in those mental disorders.
Collapse
Affiliation(s)
- Shinsuke Suzuki
- Centre for Brain, Mind and Markets, The University of Melbourne, Carlton, Australia
- Center for the Promotion of Social Data Science Education and Research, Hitotsubashi University, Tokyo, Japan
- * E-mail:
| | - Xiaoliu Zhang
- BrainPark, Turner Institute for Brain and Mental Health, School of Psychological Sciences, and Monash Biomedical Imaging Facility, Monash University, Clayton, Australia
| | - Amir Dezfouli
- Data61, Commonwealth Scientific and Industrial Research Organisation (CSIRO), Sydney, Australia
| | - Leah Braganza
- BrainPark, Turner Institute for Brain and Mental Health, School of Psychological Sciences, and Monash Biomedical Imaging Facility, Monash University, Clayton, Australia
| | - Ben D. Fulcher
- School of Physics, The University of Sydney, Sydney, Australia
| | - Linden Parkes
- BrainPark, Turner Institute for Brain and Mental Health, School of Psychological Sciences, and Monash Biomedical Imaging Facility, Monash University, Clayton, Australia
- Department of Bioengineering, School of Engineering & Applied Science, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
| | - Leonardo F. Fontenelle
- BrainPark, Turner Institute for Brain and Mental Health, School of Psychological Sciences, and Monash Biomedical Imaging Facility, Monash University, Clayton, Australia
| | - Ben J. Harrison
- Melbourne Neuropsychiatry Centre, Department of Psychiatry, The University of Melbourne, Carlton, Australia
| | - Carsten Murawski
- Centre for Brain, Mind and Markets, The University of Melbourne, Carlton, Australia
| | - Murat Yücel
- BrainPark, Turner Institute for Brain and Mental Health, School of Psychological Sciences, and Monash Biomedical Imaging Facility, Monash University, Clayton, Australia
| | - Chao Suo
- BrainPark, Turner Institute for Brain and Mental Health, School of Psychological Sciences, and Monash Biomedical Imaging Facility, Monash University, Clayton, Australia
| |
Collapse
|
17
|
Müller-Pinzler L, Czekalla N, Mayer AV, Schröder A, Stolz DS, Paulus FM, Krach S. Neurocomputational mechanisms of affected beliefs. Commun Biol 2022; 5:1241. [PMCID: PMC9663730 DOI: 10.1038/s42003-022-04165-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2022] [Accepted: 10/25/2022] [Indexed: 11/16/2022] Open
Abstract
AbstractThe feedback people receive on their behavior shapes the process of belief formation and self-efficacy in mastering a particular task. However, the neural and computational mechanisms of how the subjective value of self-efficacy beliefs, and the corresponding affect, influence the learning process remain unclear. We investigated these mechanisms during self-efficacy belief formation using fMRI, pupillometry, and computational modeling, and by analyzing individual differences in affective experience. Biases in the formation of self-efficacy beliefs were associated with affect, pupil dilation, and neural activity within the anterior insula, amygdala, ventral tegmental area/ substantia nigra, and mPFC. Specifically, neural and pupil responses mapped the valence of the prediction errors in correspondence with individuals’ experienced affective states and learning biases during self-efficacy belief formation. Together with the functional connectivity dynamics of the anterior insula within this network, our results provide evidence for neural and computational mechanisms of how we arrive at affected beliefs.
Collapse
|
18
|
Aberg KC, Paz R. Average reward rates enable motivational transfer across independent reinforcement learning tasks. Front Behav Neurosci 2022; 16:1041566. [PMID: 36439970 PMCID: PMC9682033 DOI: 10.3389/fnbeh.2022.1041566] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2022] [Accepted: 10/26/2022] [Indexed: 08/26/2023] Open
Abstract
Outcomes and feedbacks on performance may influence behavior beyond the context in which it was received, yet it remains unclear what neurobehavioral mechanisms may account for such lingering influences on behavior. The average reward rate (ARR) has been suggested to regulate motivated behavior, and was found to interact with dopamine-sensitive cognitive processes, such as vigilance and associative memory encoding. The ARR could therefore provide a bridge between independent tasks when these are performed in temporal proximity, such that the reward rate obtained in one task could influence performance in a second subsequent task. Reinforcement learning depends on the coding of prediction error signals by dopamine neurons and their downstream targets, in particular the nucleus accumbens. Because these brain regions also respond to changes in ARR, reinforcement learning may be vulnerable to changes in ARR. To test this hypothesis, we designed a novel paradigm in which participants (n = 245) performed two probabilistic reinforcement learning tasks presented in interleaved trials. The ARR was controlled by an "induction" task which provided feedback with a low (p = 0.58), a medium (p = 0.75), or a high probability of reward (p = 0.92), while the impact of ARR on reinforcement learning was tested by a second "reference" task with a constant reward probability (p = 0.75). We find that performance was significantly lower in the reference task when the induction task provided low reward probabilities (i.e., during low levels of ARR), as compared to the medium and high ARR conditions. Behavioral modeling further revealed that the influence of ARR is best described by models which accumulates average rewards (rather than average prediction errors), and where the ARR directly modulates the prediction error signal (rather than affecting learning rates or exploration). Our results demonstrate how affective information in one domain may transfer and affect motivated behavior in other domains. These findings are particularly relevant for understanding mood disorders, but may also inform abnormal behaviors attributed to dopamine dysfunction.
Collapse
Affiliation(s)
- Kristoffer C. Aberg
- Department of Brain Sciences, Weizmann Institute of Science, Rehovot, Israel
| | | |
Collapse
|
19
|
Ojala KE, Tzovara A, Poser BA, Lutti A, Bach DR. Asymmetric representation of aversive prediction errors in Pavlovian threat conditioning. Neuroimage 2022; 263:119579. [PMID: 35995374 DOI: 10.1016/j.neuroimage.2022.119579] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2022] [Revised: 08/16/2022] [Accepted: 08/17/2022] [Indexed: 11/24/2022] Open
Abstract
Survival in biological environments requires learning associations between predictive sensory cues and threatening outcomes. Such aversive learning may be implemented through reinforcement learning algorithms that are driven by the signed difference between expected and encountered outcomes, termed prediction errors (PEs). While PE-based learning is well established for reward learning, the role of putative PE signals in aversive learning is less clear. Here, we used functional magnetic resonance imaging in humans (21 healthy men and women) to investigate the neural representation of PEs during maintenance of learned aversive associations. Four visual cues, each with a different probability (0, 33, 66, 100%) of being followed by an aversive outcome (electric shock), were repeatedly presented to participants. We found that neural activity at omission (US-) but not occurrence of the aversive outcome (US+) encoded PEs in the medial prefrontal cortex. More expected omission of aversive outcome was associated with lower neural activity. No neural signals fulfilled axiomatic criteria, which specify necessary and sufficient components of PE signals, for signed PE representation in a whole-brain search or in a-priori regions of interest. Our results might suggest that, different from reward learning, aversive learning does not involve signed PE signals that are represented within the same brain region for all conditions.
Collapse
Affiliation(s)
- Karita E Ojala
- Computational Psychiatry Research, Department of Psychiatry, Psychotherapy and Psychosomatics, Psychiatric Hospital, University of Zurich, Lenggstrasse 31, Zurich 8032, Switzerland; Neuroscience Centre Zurich, University of Zurich, Winterthurerstrasse 190, Zürich 8057, Switzerland.
| | - Athina Tzovara
- Computational Psychiatry Research, Department of Psychiatry, Psychotherapy and Psychosomatics, Psychiatric Hospital, University of Zurich, Lenggstrasse 31, Zurich 8032, Switzerland; Neuroscience Centre Zurich, University of Zurich, Winterthurerstrasse 190, Zürich 8057, Switzerland; Institute of Computer Science, University of Bern, Neubrückstrasse 10, Bern 3012, Switzerland
| | - Benedikt A Poser
- Department of Cognitive Neuroscience, Faculty of Psychology and Neuroscience, Maastricht University, Oxfordlaan 55 EV 6299, Maastricht, the Netherlands
| | - Antoine Lutti
- Laboratory for Research in Neuroimaging, Department of Clinical Neuroscience, Lausanne University Hospital and University of Lausanne, Chemin de Mont-Paisible 16, Lausanne 1011, Switzerland
| | - Dominik R Bach
- Computational Psychiatry Research, Department of Psychiatry, Psychotherapy and Psychosomatics, Psychiatric Hospital, University of Zurich, Lenggstrasse 31, Zurich 8032, Switzerland; Neuroscience Centre Zurich, University of Zurich, Winterthurerstrasse 190, Zürich 8057, Switzerland; Wellcome Centre for Human Neuroimaging and Max-Planck UCL Centre for Computational Psychiatry and Ageing Research, University College London, 10-12 Russell Square, London WC1B 5EH, United Kingdom.
| |
Collapse
|
20
|
Yu LQ, Dana J, Kable JW. Individuals with ventromedial frontal damage display unstable but transitive preferences during decision making. Nat Commun 2022; 13:4758. [PMID: 35963856 PMCID: PMC9376076 DOI: 10.1038/s41467-022-32511-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2021] [Accepted: 08/03/2022] [Indexed: 11/17/2022] Open
Abstract
The ventromedial frontal lobe (VMF) is important for decision-making, but the precise causal role of the VMF in the decision process has not been fully established. Previous studies have suggested that individuals with VMF damage violate transitivity, a hallmark axiom of rational decisions. However, these prior studies cannot properly distinguish whether individuals with VMF damage are truly prone to choosing irrationally from whether their preferences are simply more variable. We had individuals with focal VMF damage, individuals with other frontal damage, and healthy controls make repeated choices across three categories-artworks, chocolate bar brands, and gambles. Using proper tests of transitivity, we find that, in our study, individuals with VMF damage make rational decisions consistent with transitive preferences, even though they exhibit greater variability in their preferences. That is, the VMF is necessary for having strong and reliable preferences, but not for being a rational decision maker. VMF damage affects the variability with which value is assessed, but not the consistency with which value is sought.
Collapse
Affiliation(s)
- Linda Q Yu
- Department of Psychology, University of Pennsylvania, Philadelphia, PA, 19104, USA.
- Department of Neuroscience, Brown University, Providence, RI, 02912, USA.
| | - Jason Dana
- Yale School of Management, Yale University, New Haven, CT, 06520, USA
| | - Joseph W Kable
- Department of Psychology, University of Pennsylvania, Philadelphia, PA, 19104, USA
| |
Collapse
|
21
|
Jepma M, Roy M, Ramlakhan K, van Velzen M, Dahan A. Different brain systems support learning from received and avoided pain during human pain-avoidance learning. eLife 2022; 11:74149. [PMID: 35731646 PMCID: PMC9217130 DOI: 10.7554/elife.74149] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2021] [Accepted: 06/07/2022] [Indexed: 12/14/2022] Open
Abstract
Both unexpected pain and unexpected pain absence can drive avoidance learning, but whether they do so via shared or separate neural and neurochemical systems is largely unknown. To address this issue, we combined an instrumental pain-avoidance learning task with computational modeling, functional magnetic resonance imaging (fMRI), and pharmacological manipulations of the dopaminergic (100 mg levodopa) and opioidergic (50 mg naltrexone) systems (N = 83). Computational modeling provided evidence that untreated participants learned more from received than avoided pain. Our dopamine and opioid manipulations negated this learning asymmetry by selectively increasing learning rates for avoided pain. Furthermore, our fMRI analyses revealed that pain prediction errors were encoded in subcortical and limbic brain regions, whereas no-pain prediction errors were encoded in frontal and parietal cortical regions. However, we found no effects of our pharmacological manipulations on the neural encoding of prediction errors. Together, our results suggest that human pain-avoidance learning is supported by separate threat- and safety-learning systems, and that dopamine and endogenous opioids specifically regulate learning from successfully avoided pain.
Collapse
Affiliation(s)
- Marieke Jepma
- Department of Psychology, University of Amsterdam, Amsterdam, Netherlands.,Department of Psychology, Leiden University, Leiden, Netherlands.,Leiden Institute for Brain and Cognition, Leiden, Netherlands
| | - Mathieu Roy
- Department of Psychology, McGill University, Montreal, Canada.,Alan Edwards Centre for Research on Pain, McGill University, Montreal, Canada
| | - Kiran Ramlakhan
- Department of Psychology, Leiden University, Leiden, Netherlands.,Department of Research and Statistics, Municipality of Amsterdam, Amsterdam, Netherlands
| | - Monique van Velzen
- Department of Anesthesiology, Leiden University Medical Center, Leiden, Netherlands
| | - Albert Dahan
- Department of Anesthesiology, Leiden University Medical Center, Leiden, Netherlands
| |
Collapse
|
22
|
van Elzelingen W, Goedhoop J, Warnaar P, Denys D, Arbab T, Willuhn I. A unidirectional but not uniform striatal landscape of dopamine signaling for motivational stimuli. Proc Natl Acad Sci U S A 2022; 119:e2117270119. [PMID: 35594399 PMCID: PMC9171911 DOI: 10.1073/pnas.2117270119] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2021] [Accepted: 04/04/2022] [Indexed: 11/18/2022] Open
Abstract
Dopamine signals in the striatum are critical for motivated behavior. However, their regional specificity and precise information content are actively debated. Dopaminergic projections to the striatum are topographically organized. Thus, we quantified dopamine release in response to motivational stimuli and associated predictive cues in six principal striatal regions of unrestrained, behaving rats. Absolute signal size and its modulation by stimulus value and by subjective state of the animal were interregionally heterogeneous on a medial to lateral gradient. In contrast, dopamine-concentration direction of change was homogeneous across all regions: appetitive stimuli increased and aversive stimuli decreased dopamine concentration. Although cues predictive of such motivational stimuli acquired the same influence over dopamine homogeneously across all regions, dopamine-mediated prediction-error signals were restricted to the ventromedial, limbic striatum. Together, our findings demonstrate a nuanced striatal landscape of unidirectional but not uniform dopamine signals, topographically encoding distinct aspects of motivational stimuli and their prediction.
Collapse
Affiliation(s)
- Wouter van Elzelingen
- Netherlands Institute for Neuroscience, Royal Netherlands Academy of Arts and Sciences, 1105 BA Amsterdam, The Netherlands
- Department of Psychiatry, Amsterdam University Medical Centers, University of Amsterdam, 1105 AZ Amsterdam, The Netherlands
| | - Jessica Goedhoop
- Netherlands Institute for Neuroscience, Royal Netherlands Academy of Arts and Sciences, 1105 BA Amsterdam, The Netherlands
- Department of Psychiatry, Amsterdam University Medical Centers, University of Amsterdam, 1105 AZ Amsterdam, The Netherlands
| | - Pascal Warnaar
- Netherlands Institute for Neuroscience, Royal Netherlands Academy of Arts and Sciences, 1105 BA Amsterdam, The Netherlands
- Department of Psychiatry, Amsterdam University Medical Centers, University of Amsterdam, 1105 AZ Amsterdam, The Netherlands
| | - Damiaan Denys
- Netherlands Institute for Neuroscience, Royal Netherlands Academy of Arts and Sciences, 1105 BA Amsterdam, The Netherlands
- Department of Psychiatry, Amsterdam University Medical Centers, University of Amsterdam, 1105 AZ Amsterdam, The Netherlands
| | - Tara Arbab
- Netherlands Institute for Neuroscience, Royal Netherlands Academy of Arts and Sciences, 1105 BA Amsterdam, The Netherlands
- Department of Psychiatry, Amsterdam University Medical Centers, University of Amsterdam, 1105 AZ Amsterdam, The Netherlands
| | - Ingo Willuhn
- Netherlands Institute for Neuroscience, Royal Netherlands Academy of Arts and Sciences, 1105 BA Amsterdam, The Netherlands
- Department of Psychiatry, Amsterdam University Medical Centers, University of Amsterdam, 1105 AZ Amsterdam, The Netherlands
| |
Collapse
|
23
|
Dennison JB, Sazhin D, Smith DV. Decision neuroscience and neuroeconomics: Recent progress and ongoing challenges. WILEY INTERDISCIPLINARY REVIEWS. COGNITIVE SCIENCE 2022; 13:e1589. [PMID: 35137549 PMCID: PMC9124684 DOI: 10.1002/wcs.1589] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/05/2020] [Revised: 11/28/2021] [Accepted: 12/21/2021] [Indexed: 01/10/2023]
Abstract
In the past decade, decision neuroscience and neuroeconomics have developed many new insights in the study of decision making. This review provides an overarching update on how the field has advanced in this time period. Although our initial review a decade ago outlined several theoretical, conceptual, methodological, empirical, and practical challenges, there has only been limited progress in resolving these challenges. We summarize significant trends in decision neuroscience through the lens of the challenges outlined for the field and review examples where the field has had significant, direct, and applicable impacts across economics and psychology. First, we review progress on topics including reward learning, explore-exploit decisions, risk and ambiguity, intertemporal choice, and valuation. Next, we assess the impacts of emotion, social rewards, and social context on decision making. Then, we follow up with how individual differences impact choices and new exciting developments in the prediction and neuroforecasting of future decisions. Finally, we consider how trends in decision-neuroscience research reflect progress toward resolving past challenges, discuss new and exciting applications of recent research, and identify new challenges for the field. This article is categorized under: Psychology > Reasoning and Decision Making Psychology > Emotion and Motivation.
Collapse
Affiliation(s)
- Jeffrey B Dennison
- Department of Psychology, Temple University, Philadelphia, Pennsylvania, USA
| | - Daniel Sazhin
- Department of Psychology, Temple University, Philadelphia, Pennsylvania, USA
| | - David V Smith
- Department of Psychology, Temple University, Philadelphia, Pennsylvania, USA
| |
Collapse
|
24
|
Chen Y, Huang Y, Yen N. Role of anterior midcingulate cortex in self-reward representation and reward allocation judgments within social context. Hum Brain Mapp 2022; 43:2377-2390. [PMID: 35103356 PMCID: PMC8996356 DOI: 10.1002/hbm.25793] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2021] [Revised: 11/10/2021] [Accepted: 01/12/2022] [Indexed: 11/07/2022] Open
Abstract
Evaluating rewards for the self and others is essential for social interactions. Previous research has probed the neural substrates signaling rewards in social decision-making tasks as well as the differentiation between self- and other-reward representations. However, studies with different designs have yielded mixed results. After analyzing and comparing previous designs, we differentiated three components in this study: task (reward representation vs. social judgment of reward allocation), agency (self vs. other), and social context (without vs. within). Participants were asked to imagine various share sizes as a proposer in a dictator game during fMRI, and then rated their willingness and preference for these offers in a post-scan behavioral task. To differentiate the regions involved in processing rewards without and within context, we presented the reward to each agent in two sequential frames. Parametric analyses showed that, in the second frame (i.e., within social context), the anterior midcingulate cortex (aMCC) signaled self-reward and preferences for the offer, whereas the right insula tracked the likelihood of proposing the offer. Belief in a just world is positively associated with aMCC responses to self-reward. These results shed light on the role of the aMCC in coding self-reward within the social context to guide social behaviors.
Collapse
Affiliation(s)
- Ying‐Chun Chen
- Department of PsychologyNational Chengchi UniversityTaipei CityTaiwan
| | - Yun‐Hsin Huang
- Department of PsychologyNational Chengchi UniversityTaipei CityTaiwan
| | - Nai‐Shing Yen
- Department of PsychologyNational Chengchi UniversityTaipei CityTaiwan
- Research Center for Mind, Brain, and LearningNational Chengchi UniversityTaipei CityTaiwan
| |
Collapse
|
25
|
Horing B, Büchel C. The human insula processes both modality-independent and pain-selective learning signals. PLoS Biol 2022; 20:e3001540. [PMID: 35522696 PMCID: PMC9116652 DOI: 10.1371/journal.pbio.3001540] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2021] [Revised: 05/18/2022] [Accepted: 04/15/2022] [Indexed: 12/02/2022] Open
Abstract
Prediction errors (PEs) are generated when there are differences between an expected and an actual event or sensory input. The insula is a key brain region involved in pain processing, and studies have shown that the insula encodes the magnitude of an unexpected outcome (unsigned PEs). In addition to signaling this general magnitude information, PEs can give specific information on the direction of this deviation-i.e., whether an event is better or worse than expected. It is unclear whether the unsigned PE responses in the insula are selective for pain or reflective of a more general processing of aversive events irrespective of modality. It is also unknown whether the insula can process signed PEs at all. Understanding these specific mechanisms has implications for understanding how pain is processed in the brain in both health and in chronic pain conditions. In this study, 47 participants learned associations between 2 conditioned stimuli (CS) with 4 unconditioned stimuli (US; painful heat or loud sound, of one low and one high intensity each) while undergoing functional magnetic resonance imaging (fMRI) and skin conductance response (SCR) measurements. We demonstrate that activation in the anterior insula correlated with unsigned intensity PEs, irrespective of modality, indicating an unspecific aversive surprise signal. Conversely, signed intensity PE signals were modality specific, with signed PEs following pain but not sound located in the dorsal posterior insula, an area implicated in pain intensity processing. Previous studies have identified abnormal insula function and abnormal learning as potential causes of pain chronification. Our findings link these results and suggest that a misrepresentation of learning relevant PEs in the insular cortex may serve as an underlying factor in chronic pain.
Collapse
Affiliation(s)
- Björn Horing
- Department of Systems Neuroscience, University Medical Center Hamburg-Eppendorf, Hamburg, Germany
| | - Christian Büchel
- Department of Systems Neuroscience, University Medical Center Hamburg-Eppendorf, Hamburg, Germany
| |
Collapse
|
26
|
Acute stress blunts prediction error signals in the dorsal striatum during reinforcement learning. Neurobiol Stress 2021; 15:100412. [PMID: 34761081 PMCID: PMC8566898 DOI: 10.1016/j.ynstr.2021.100412] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2021] [Revised: 10/12/2021] [Accepted: 10/24/2021] [Indexed: 11/20/2022] Open
Abstract
Acute stress is pervasive in everyday modern life and is thought to affect how people make choices and learn from them. Reinforcement learning, which implicates learning from the unexpected rewarding and punishing outcomes of our choices (i.e., prediction errors), is critical for adjusted behaviour and seems to be affected by acute stress. However, the neural mechanisms by which acute stress disrupts this type of learning are still poorly understood. Here, we investigate whether and how acute stress blunts neural signalling of prediction errors during reinforcement learning using model-based functional magnetic resonance imaging. Male participants completed a well-established reinforcement-learning task involving monetary gains and losses whilst under stress and control conditions. Acute stress impaired participants’ (n = 23) behavioural performance towards obtaining monetary gains (p < 0.001), but not towards avoiding losses (p = 0.57). Importantly, acute stress blunted signalling of prediction errors during gain and loss trials in the dorsal striatum (p = 0.040) — with subsidiary analyses suggesting that acute stress preferentially blunted signalling of positive prediction errors. Our results thus reveal a neurocomputational mechanism by which acute stress may impair reward learning.
Collapse
|
27
|
Tivadar RI, Knight RT, Tzovara A. Automatic Sensory Predictions: A Review of Predictive Mechanisms in the Brain and Their Link to Conscious Processing. Front Hum Neurosci 2021; 15:702520. [PMID: 34489663 PMCID: PMC8416526 DOI: 10.3389/fnhum.2021.702520] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2021] [Accepted: 07/12/2021] [Indexed: 01/22/2023] Open
Abstract
The human brain has the astonishing capacity of integrating streams of sensory information from the environment and forming predictions about future events in an automatic way. Despite being initially developed for visual processing, the bulk of predictive coding research has subsequently focused on auditory processing, with the famous mismatch negativity signal as possibly the most studied signature of a surprise or prediction error (PE) signal. Auditory PEs are present during various consciousness states. Intriguingly, their presence and characteristics have been linked with residual levels of consciousness and return of awareness. In this review we first give an overview of the neural substrates of predictive processes in the auditory modality and their relation to consciousness. Then, we focus on different states of consciousness - wakefulness, sleep, anesthesia, coma, meditation, and hypnosis - and on what mysteries predictive processing has been able to disclose about brain functioning in such states. We review studies investigating how the neural signatures of auditory predictions are modulated by states of reduced or lacking consciousness. As a future outlook, we propose the combination of electrophysiological and computational techniques that will allow investigation of which facets of sensory predictive processes are maintained when consciousness fades away.
Collapse
Affiliation(s)
| | - Robert T. Knight
- Helen Wills Neuroscience Institute, University of California, Berkeley, Berkeley, CA, United States
- Department of Psychology, University of California, Berkeley, Berkeley, CA, United States
| | - Athina Tzovara
- Institute of Computer Science, University of Bern, Bern, Switzerland
- Helen Wills Neuroscience Institute, University of California, Berkeley, Berkeley, CA, United States
- Sleep-Wake Epilepsy Center | NeuroTec, Department of Neurology, Inselspital, Bern University Hospital, University of Bern, Bern, Switzerland
| |
Collapse
|
28
|
Meta-analytic clustering dissociates brain activity and behavior profiles across reward processing paradigms. COGNITIVE AFFECTIVE & BEHAVIORAL NEUROSCIENCE 2021; 20:215-235. [PMID: 31872334 DOI: 10.3758/s13415-019-00763-7] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Reward learning is a ubiquitous cognitive mechanism guiding adaptive choices and behaviors, and when impaired, can lead to considerable mental health consequences. Reward-related functional neuroimaging studies have begun to implicate networks of brain regions essential for processing various peripheral influences (e.g., risk, subjective preference, delay, social context) involved in the multifaceted reward processing construct. To provide a more complete neurocognitive perspective on reward processing that synthesizes findings across the literature while also appreciating these peripheral influences, we used emerging meta-analytic techniques to elucidate brain regions, and in turn networks, consistently engaged in distinct aspects of reward processing. Using a data-driven, meta-analytic, k-means clustering approach, we dissociated seven meta-analytic groupings (MAGs) of neuroimaging results (i.e., brain activity maps) from 749 experimental contrasts across 176 reward processing studies involving 13,358 healthy participants. We then performed an exploratory functional decoding approach to gain insight into the putative functions associated with each MAG. We identified a seven-MAG clustering solution that represented dissociable patterns of convergent brain activity across reward processing tasks. Additionally, our functional decoding analyses revealed that each of these MAGs mapped onto discrete behavior profiles that suggested specialized roles in predicting value (MAG-1 & MAG-2) and processing a variety of emotional (MAG-3), external (MAG-4 & MAG-5), and internal (MAG-6 & MAG-7) influences across reward processing paradigms. These findings support and extend aspects of well-accepted reward learning theories and highlight large-scale brain network activity associated with distinct aspects of reward processing.
Collapse
|
29
|
Anatomical dissociation of intracerebral signals for reward and punishment prediction errors in humans. Nat Commun 2021; 12:3344. [PMID: 34099678 PMCID: PMC8184756 DOI: 10.1038/s41467-021-23704-w] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2020] [Accepted: 05/06/2021] [Indexed: 11/17/2022] Open
Abstract
Whether maximizing rewards and minimizing punishments rely on distinct brain systems remains debated, given inconsistent results coming from human neuroimaging and animal electrophysiology studies. Bridging the gap across techniques, we recorded intracerebral activity from twenty participants while they performed an instrumental learning task. We found that both reward and punishment prediction errors (PE), estimated from computational modeling of choice behavior, correlate positively with broadband gamma activity (BGA) in several brain regions. In all cases, BGA scaled positively with the outcome (reward or punishment versus nothing) and negatively with the expectation (predictability of reward or punishment). However, reward PE were better signaled in some regions (such as the ventromedial prefrontal and lateral orbitofrontal cortex), and punishment PE in other regions (such as the anterior insula and dorsolateral prefrontal cortex). These regions might therefore belong to brain systems that differentially contribute to the repetition of rewarded choices and the avoidance of punished choices. Whether maximizing rewards and minimizing punishments rely on distinct brain learning systems remains debated. Here, using intracerebral recordings in humans, the authors provide evidence for brain regions differentially engaged in signaling reward and punishment prediction errors that prescribe repetition versus avoidance of past choices.
Collapse
|
30
|
Mollick JA, Chang LJ, Krishnan A, Hazy TE, Krueger KA, Frank GKW, Wager TD, O'Reilly RC. The Neural Correlates of Cued Reward Omission. Front Hum Neurosci 2021; 15:615313. [PMID: 33679345 PMCID: PMC7928384 DOI: 10.3389/fnhum.2021.615313] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2020] [Accepted: 01/19/2021] [Indexed: 11/13/2022] Open
Abstract
Compared to our understanding of positive prediction error signals occurring due to unexpected reward outcomes, less is known about the neural circuitry in humans that drives negative prediction errors during omission of expected rewards. While classical learning theories such as Rescorla-Wagner or temporal difference learning suggest that both types of prediction errors result from a simple subtraction, there has been recent evidence suggesting that different brain regions provide input to dopamine neurons which contributes to specific components of this prediction error computation. Here, we focus on the brain regions responding to negative prediction error signals, which has been well-established in animal studies to involve a distinct pathway through the lateral habenula. We examine the activity of this pathway in humans, using a conditioned inhibition paradigm with high-resolution functional MRI. First, participants learned to associate a sensory stimulus with reward delivery. Then, reward delivery was omitted whenever this stimulus was presented simultaneously with a different sensory stimulus, the conditioned inhibitor (CI). Both reward presentation and the reward-predictive cue activated midbrain dopamine regions, insula and orbitofrontal cortex. While we found significant activity at an uncorrected threshold for the CI in the habenula, consistent with our predictions, it did not survive correction for multiple comparisons and awaits further replication. Additionally, the pallidum and putamen regions of the basal ganglia showed modulations of activity for the inhibitor that did not survive the corrected threshold.
Collapse
Affiliation(s)
- Jessica A Mollick
- Department of Psychiatry, Yale University, New Haven, CT, United States
| | - Luke J Chang
- Department of Psychological and Brain Sciences, Dartmouth College, Hanover, NH, United States
| | - Anjali Krishnan
- Department of Psychology, Brooklyn College, City University of New York, Brooklyn, NY, United States
| | | | | | - Guido K W Frank
- UCSD Eating Disorder Center for Treatment and Research, University of California, San Diego, San Diego, CA, United States
| | - Tor D Wager
- Department of Psychological and Brain Sciences, Dartmouth College, Hanover, NH, United States
| | - Randall C O'Reilly
- Department of Psychology and Computer Science Center for Neuroscience, University of California, Davis, Davis, CA, United States
| |
Collapse
|
31
|
Reward Learning Through the Lens of RDoC: a Review of Theory, Assessment, and Empirical Findings in the Eating Disorders. Curr Psychiatry Rep 2021; 23:2. [PMID: 33386514 DOI: 10.1007/s11920-020-01213-9] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 11/17/2020] [Indexed: 01/19/2023]
Abstract
PURPOSE OF REVIEW Reward-related processes may represent important transdiagnostic factors underlying eating pathology. Using the NIMH Research Domain Criteria as a guide, the current article reviews theories, behavioral and self-report assessments, and empirical findings related to reward learning in the eating disorders. RECENT FINDINGS Data from behavioral tasks suggest deficits in reinforcement learning, which may become more pronounced with increasing disorder severity and duration. Self-report data strongly implicate positive eating and thinness/restriction expectancies (an element of reward prediction error) in the onset and maintenance of eating pathology. Finally, self-report measures of habit strength demonstrate relationships with eating pathology and illness duration; however, behavioral task data do not support relationships between eating pathology and a propensity towards general habit development. Existing studies are limited, but provide preliminary support for the presence of abnormal reward learning in eating disorders. Continued research is needed to address identified gaps in the literature.
Collapse
|
32
|
Gabitov E, Lungu O, Albouy G, Doyon J. Movement errors during skilled motor performance engage distinct prediction error mechanisms. Commun Biol 2020; 3:763. [PMID: 33311566 PMCID: PMC7732826 DOI: 10.1038/s42003-020-01465-4] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2020] [Accepted: 11/06/2020] [Indexed: 12/23/2022] Open
Abstract
The brain detects deviations from intended behaviors by estimating the mismatch between predicted and actual outcomes. Axiomatic to these computations are salience and valence prediction error signals, which alert the brain to the occurrence and value of unexpected events. Despite the theoretical assertion of these prediction error signals, it is unknown whether and how brain mechanisms underlying their computations support error processing during skilled motor behavior. Here we demonstrate, with functional magnetic resonance imaging, that internal detection, i.e., without externally-provided feedback, of self-generated movement errors evokes instantaneous activity increases within the salience network and delayed lingering decreases within the nucleus accumbens - a key structure in the reward valuation pathway. A widespread suppression within the sensorimotor network was also observed. Our findings suggest that neural computations of salience and valence prediction errors during skilled motor behaviors operate on different time-scales and, therefore, may contribute differentially to immediate and longer-term adaptive processes.
Collapse
Affiliation(s)
- Ella Gabitov
- McConnell Brain Imaging Center, Montreal Neurological Institute, Montreal, QC, H3A 2B4, Canada. .,Department of Neurology and Neurosurgery, Montreal Neurological Institute, McGill University, Montreal, QC, H3A 2B4, Canada.
| | - Ovidiu Lungu
- McConnell Brain Imaging Center, Montreal Neurological Institute, Montreal, QC, H3A 2B4, Canada.,Department of Neurology and Neurosurgery, Montreal Neurological Institute, McGill University, Montreal, QC, H3A 2B4, Canada
| | - Geneviève Albouy
- Movement Control and Neuroplasticity Research Group, Department of Movement Sciences, KU Leuven, 3000, Leuven, Belgium
| | - Julien Doyon
- McConnell Brain Imaging Center, Montreal Neurological Institute, Montreal, QC, H3A 2B4, Canada. .,Department of Neurology and Neurosurgery, Montreal Neurological Institute, McGill University, Montreal, QC, H3A 2B4, Canada.
| |
Collapse
|
33
|
Behavioral, Physiological, and Neural Signatures of Surprise during Naturalistic Sports Viewing. Neuron 2020; 109:377-390.e7. [PMID: 33242421 DOI: 10.1016/j.neuron.2020.10.029] [Citation(s) in RCA: 50] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2020] [Revised: 08/07/2020] [Accepted: 10/22/2020] [Indexed: 12/13/2022]
Abstract
Surprise signals a discrepancy between past and current beliefs. It is theorized to be linked to affective experiences, the creation of particularly resilient memories, and segmentation of the flow of experience into discrete perceived events. However, the ability to precisely measure naturalistic surprise has remained elusive. We used advanced basketball analytics to derive a quantitative measure of surprise and characterized its behavioral, physiological, and neural correlates in human subjects observing basketball games. We found that surprise was associated with segmentation of ongoing experiences, as reflected by subjectively perceived event boundaries and shifts in neocortical patterns underlying belief states. Interestingly, these effects differed by whether surprising moments contradicted or bolstered current predominant beliefs. Surprise also positively correlated with pupil dilation, activation in subcortical regions associated with dopamine, game enjoyment, and long-term memory. These investigations support key predictions from event segmentation theory and extend theoretical conceptualizations of surprise to real-world contexts.
Collapse
|
34
|
Tschida JE, Yerys BE. A Systematic Review of the Positive Valence System in Autism Spectrum Disorder. Neuropsychol Rev 2020; 31:58-88. [PMID: 33174110 DOI: 10.1007/s11065-020-09459-z] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2019] [Accepted: 09/23/2020] [Indexed: 01/04/2023]
Abstract
This review synthesized current literature of behavioral and cognitive studies targeting reward processing in autism spectrum disorder (ASD). The National Institute of Mental Health's Research Domain Criteria (RDoC) Positive Valence System (PVS) domain was used as an overarching framework. The objectives were to determine which component operations of reward processing may be atypical in ASD and consequently postulate a heuristic model of reward processing in ASD that could be evaluated with future research. 34 studies were identified from the Embase, PubMed, PsycINFO, and Web of Science databases and included in the review using guidelines from the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (also known as PRISMA guidelines). The extant literature suggested potential relationships between social symptoms of ASD and PVS sub-constructs of reward anticipation, probabilistic and reinforcement learning, reward prediction error, reward (probability), delay, and effort as well as between restricted and repetitive behaviors and interests (RRBIs) and PVS-sub constructs of initial response to reward, reward anticipation, reward (probability), delay, and effort. However, these findings are limited by a sparse and mixed literature for some sub-constructs. We put forward a developmentally informed heuristic model that posits how these component reward processes may be implicated in early ASD behaviors as well as later emerging and more intransigent symptoms. Future research is needed to comprehensively evaluate the proposed model.
Collapse
Affiliation(s)
- Jessica E Tschida
- Children's Hospital of Philadelphia, Roberts Center for Pediatric Research Building, Center for Autism Research, 2716 South Street, 5th Floor, Philadelphia, PA, 19146, USA.
| | - Benjamin E Yerys
- Children's Hospital of Philadelphia, Roberts Center for Pediatric Research Building, Center for Autism Research, 2716 South Street, 5th Floor, Philadelphia, PA, 19146, USA.,Perelman School of Medicine, Department of Psychiatry, University of Pennsylvania, Philadelphia, PA, USA
| |
Collapse
|
35
|
Grohn J, Schüffelgen U, Neubert FX, Bongioanni A, Verhagen L, Sallet J, Kolling N, Rushworth MFS. Multiple systems in macaques for tracking prediction errors and other types of surprise. PLoS Biol 2020; 18:e3000899. [PMID: 33125367 PMCID: PMC7657565 DOI: 10.1371/journal.pbio.3000899] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2019] [Revised: 11/11/2020] [Accepted: 09/18/2020] [Indexed: 11/21/2022] Open
Abstract
Animals learn from the past to make predictions. These predictions are adjusted after prediction errors, i.e., after surprising events. Generally, most reward prediction errors models learn the average expected amount of reward. However, here we demonstrate the existence of distinct mechanisms for detecting other types of surprising events. Six macaques learned to respond to visual stimuli to receive varying amounts of juice rewards. Most trials ended with the delivery of either 1 or 3 juice drops so that animals learned to expect 2 juice drops on average even though instances of precisely 2 drops were rare. To encourage learning, we also included sessions during which the ratio between 1 and 3 drops changed. Additionally, in all sessions, the stimulus sometimes appeared in an unexpected location. Thus, 3 types of surprising events could occur: reward amount surprise (i.e., a scalar reward prediction error), rare reward surprise, and visuospatial surprise. Importantly, we can dissociate scalar reward prediction errors—rewards that deviated from the average reward amount expected—and rare reward events—rewards that accorded with the average reward expectation but that rarely occurred. We linked each type of surprise to a distinct pattern of neural activity using functional magnetic resonance imaging. Activity in the vicinity of the dopaminergic midbrain only reflected surprise about the amount of reward. Lateral prefrontal cortex had a more general role in detecting surprising events. Posterior lateral orbitofrontal cortex specifically detected rare reward events regardless of whether they followed average reward amount expectations, but only in learnable reward environments. Animals learn from the past to make predictions, and predictions are adjusted whenever surprising outcomes violate those predictions. This study shows that macaques use multiple systems to detect different types of rare reward events; while activity in the vicinity of the dopaminergic midbrain reflected scalar reward prediction errors (i.e. reward amount), activity in the orbitofrontal cortex reflected a rare reward signal.
Collapse
Affiliation(s)
- Jan Grohn
- Wellcome Integrative Neuroimaging (WIN), Department of Experimental Psychology, University of Oxford, Oxford, United Kingdom
| | - Urs Schüffelgen
- Wellcome Integrative Neuroimaging (WIN), Department of Experimental Psychology, University of Oxford, Oxford, United Kingdom
| | - Franz-Xaver Neubert
- Wellcome Integrative Neuroimaging (WIN), Department of Experimental Psychology, University of Oxford, Oxford, United Kingdom
| | - Alessandro Bongioanni
- Wellcome Integrative Neuroimaging (WIN), Department of Experimental Psychology, University of Oxford, Oxford, United Kingdom
| | - Lennart Verhagen
- Wellcome Integrative Neuroimaging (WIN), Department of Experimental Psychology, University of Oxford, Oxford, United Kingdom
- Donders Institute for Brain, Cognition and Behaviour, Radboud University, Nijmegen, the Netherlands
| | - Jerome Sallet
- Wellcome Integrative Neuroimaging (WIN), Department of Experimental Psychology, University of Oxford, Oxford, United Kingdom
- Univ Lyon, Université Lyon 1, Inserm, Stem Cell and Brain Research Institute U1208, Bron, France
| | - Nils Kolling
- Wellcome Integrative Neuroimaging (WIN), Department of Experimental Psychology, University of Oxford, Oxford, United Kingdom
- Wellcome Integrative Neuroimaging (WIN), Oxford Centre for Human Brain Activity (OHBA), University of Oxford, Oxford, United Kingdom
| | - Matthew F S Rushworth
- Wellcome Integrative Neuroimaging (WIN), Department of Experimental Psychology, University of Oxford, Oxford, United Kingdom
| |
Collapse
|
36
|
Cai LX, Pizano K, Gundersen GW, Hayes CL, Fleming WT, Holt S, Cox JM, Witten IB. Distinct signals in medial and lateral VTA dopamine neurons modulate fear extinction at different times. eLife 2020; 9:54936. [PMID: 32519951 PMCID: PMC7363446 DOI: 10.7554/elife.54936] [Citation(s) in RCA: 47] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2020] [Accepted: 06/05/2020] [Indexed: 12/18/2022] Open
Abstract
Dopamine (DA) neurons are thought to encode reward prediction error (RPE), in addition to other signals, such as salience. While RPE is known to support learning, the role of salience in learning remains less clear. To address this, we recorded and manipulated VTA DA neurons in mice during fear extinction. We applied deep learning to classify mouse freezing behavior, eliminating the need for human scoring. Our fiber photometry recordings showed DA neurons in medial and lateral VTA have distinct activity profiles during fear extinction: medial VTA activity more closely reflected RPE, while lateral VTA activity more closely reflected a salience-like signal. Optogenetic inhibition of DA neurons in either region slowed fear extinction, with the relevant time period for inhibition differing across regions. Our results indicate salience-like signals can have similar downstream consequences to RPE-like signals, although with different temporal dependencies.
Collapse
Affiliation(s)
- Lili X Cai
- Princeton Neuroscience Institute, Princeton University, Princeton, United States
| | - Katherine Pizano
- Princeton Neuroscience Institute, Princeton University, Princeton, United States
| | - Gregory W Gundersen
- Department of Computer Science, Princeton University, Princeton, United States
| | - Cameron L Hayes
- Princeton Neuroscience Institute, Princeton University, Princeton, United States
| | - Weston T Fleming
- Princeton Neuroscience Institute, Princeton University, Princeton, United States
| | - Sebastian Holt
- Princeton Neuroscience Institute, Princeton University, Princeton, United States
| | - Julia M Cox
- Princeton Neuroscience Institute, Princeton University, Princeton, United States
| | - Ilana B Witten
- Princeton Neuroscience Institute, Princeton University, Princeton, United States.,Department of Psychology, Princeton University, Princeton, United States
| |
Collapse
|
37
|
Kiverstein J, Miller M, Rietveld E. How mood tunes prediction: a neurophenomenological account of mood and its disturbance in major depression. Neurosci Conscious 2020; 2020:niaa003. [PMID: 32818063 PMCID: PMC7425816 DOI: 10.1093/nc/niaa003] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2019] [Revised: 02/21/2020] [Accepted: 03/07/2020] [Indexed: 12/12/2022] Open
Abstract
In this article, we propose a neurophenomenological account of what moods are, and how they work. We draw upon phenomenology to show how mood attunes a person to a space of significant possibilities. Mood structures a person's lived experience by fixing the kinds of significance the world can have for them in a given situation. We employ Karl Friston's free-energy principle to show how this phenomenological concept of mood can be smoothly integrated with cognitive neuroscience. We will argue that mood is a consequence of acting in the world with the aim of minimizing expected free energy-a measure of uncertainty about the future consequences of actions. Moods summarize how the organism is faring overall in its predictive engagements, tuning the organism's expectations about how it is likely to fare in the future. Agents that act to minimize expected free energy will have a feeling of how well or badly they are doing at maintaining grip on the multiple possibilities that matter to them. They will have what we will call a 'feeling of grip' that structures the possibilities they are ready to engage with over long time-scales, just as moods do.
Collapse
Affiliation(s)
- Julian Kiverstein
- Department of Psychiatry, Amsterdam University Medical Centre, Meibergdreef 9, 1105AZ Amsterdam South East, The Netherlands
| | - Mark Miller
- Department of Informatics, University of Sussex, Sussex House, Falmer, Brighton, BN1 9RH, UK
| | - Erik Rietveld
- Department of Psychiatry, Amsterdam University Medical Centre, Meibergdreef 9, 1105AZ Amsterdam South East, The Netherlands
- Department of Philosophy/ILLC, University of Amsterdam, Oude Turfmarkt 141-3, 1012GC Amsterdam, The Netherlands
- Department of Philosophy, University of Twente, 7500AE Enschede, The Netherlands
| |
Collapse
|
38
|
Loued-Khenissi L, Pfeuffer A, Einhäuser W, Preuschoff K. Anterior insula reflects surprise in value-based decision-making and perception. Neuroimage 2020; 210:116549. [DOI: 10.1016/j.neuroimage.2020.116549] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2019] [Revised: 12/20/2019] [Accepted: 01/14/2020] [Indexed: 11/30/2022] Open
|
39
|
Amphetamine disrupts haemodynamic correlates of prediction errors in nucleus accumbens and orbitofrontal cortex. Neuropsychopharmacology 2020; 45:793-803. [PMID: 31703234 PMCID: PMC7075902 DOI: 10.1038/s41386-019-0564-8] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/29/2019] [Revised: 10/02/2019] [Accepted: 10/29/2019] [Indexed: 11/08/2022]
Abstract
In an uncertain world, the ability to predict and update the relationships between environmental cues and outcomes is a fundamental element of adaptive behaviour. This type of learning is typically thought to depend on prediction error, the difference between expected and experienced events and in the reward domain that has been closely linked to mesolimbic dopamine. There is also increasing behavioural and neuroimaging evidence that disruption to this process may be a cross-diagnostic feature of several neuropsychiatric and neurological disorders in which dopamine is dysregulated. However, the precise relationship between haemodynamic measures, dopamine and reward-guided learning remains unclear. To help address this issue, we used a translational technique, oxygen amperometry, to record haemodynamic signals in the nucleus accumbens (NAc) and orbitofrontal cortex (OFC), while freely moving rats performed a probabilistic Pavlovian learning task. Using a model-based analysis approach to account for individual variations in learning, we found that the oxygen signal in the NAc correlated with a reward prediction error, whereas in the OFC it correlated with an unsigned prediction error or salience signal. Furthermore, an acute dose of amphetamine, creating a hyperdopaminergic state, disrupted rats' ability to discriminate between cues associated with either a high or a low probability of reward and concomitantly corrupted prediction error signalling. These results demonstrate parallel but distinct prediction error signals in NAc and OFC during learning, both of which are affected by psychostimulant administration. Furthermore, they establish the viability of tracking and manipulating haemodynamic signatures of reward-guided learning observed in human fMRI studies by using a proxy signal for BOLD in a freely behaving rodent.
Collapse
|
40
|
Viviani R, Dommes L, Bosch J, Steffens M, Paul A, Schneider KL, Stingl JC, Beschoner P. Signals of anticipation of reward and of mean reward rates in the human brain. Sci Rep 2020; 10:4287. [PMID: 32152378 PMCID: PMC7062891 DOI: 10.1038/s41598-020-61257-y] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2019] [Accepted: 02/23/2020] [Indexed: 01/14/2023] Open
Abstract
Theoretical models of dopamine function stemming from reinforcement learning theory have emphasized the importance of prediction errors, which signal changes in the expectation of impending rewards. Much less is known about the effects of mean reward rates, which may be of motivational significance due to their role in computing the optimal effort put into exploiting reward opportunities. Here, we used a reinforcement learning model to design three functional neuroimaging studies and disentangle the effects of changes in reward expectations and mean reward rates, showing recruitment of specific regions in the brainstem regardless of prediction errors. While changes in reward expectations activated ventral striatal areas as in previous studies, mean reward rates preferentially modulated the substantia nigra/ventral tegmental area, deep layers of the superior colliculi, and a posterior pontomesencephalic region. These brainstem structures may work together to set motivation and attentional efforts levels according to perceived reward opportunities.
Collapse
Affiliation(s)
- Roberto Viviani
- Institute of Psychology, University of Innsbruck, 6020, Innsbruck, Austria.
- Department of Psychiatry and Psychotherapy III, University of Ulm, 89075, Ulm, Germany.
| | - Lisa Dommes
- Department of Psychiatry and Psychotherapy III, University of Ulm, 89075, Ulm, Germany
| | - Julia Bosch
- Department of Psychiatry and Psychotherapy III, University of Ulm, 89075, Ulm, Germany
| | - Michael Steffens
- Research Division, Federal Institute for Drugs and Medical Devices (BfArM), 53175, Bonn, Germany
| | - Anna Paul
- Research Division, Federal Institute for Drugs and Medical Devices (BfArM), 53175, Bonn, Germany
| | - Katharina L Schneider
- Research Division, Federal Institute for Drugs and Medical Devices (BfArM), 53175, Bonn, Germany
| | - Julia C Stingl
- Institute of Clinical Pharmacology, University Hospital of RWTH Aachen, Aachen, Germany
| | - Petra Beschoner
- Department of Psychosomatic Medicine and Psychotherapy, University of Ulm, 89075, Ulm, Germany
| |
Collapse
|
41
|
Suzuki S, O'Doherty JP. Breaking human social decision making into multiple components and then putting them together again. Cortex 2020; 127:221-230. [PMID: 32224320 DOI: 10.1016/j.cortex.2020.02.014] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2019] [Revised: 01/23/2020] [Accepted: 02/28/2020] [Indexed: 10/24/2022]
Abstract
Most of our waking time as human beings is spent interacting with other individuals. In order to make good decisions in this social milieu, it is often necessary to make inferences about the internal states, traits and intentions of others. Recently, some progress has been made toward uncovering the neural computations underlying human social decision-making by combining functional magnetic resonance neuroimaging (fMRI) with computational modeling of behavior. Modeling of behavioral data allows us to identify the key computations necessary for social decision-making and to determine how these computations are integrated. Furthermore, by correlating these variables against neuroimaging data, it has become possible to elucidate where in the brain various computations are implemented. Here we review the current state of knowledge in the domain of social computational neuroscience. Findings to date have emphasized that social decisions are driven by multiple computations conducted in parallel, and implemented in distinct brain regions. We suggest that further progress is going to depend on identifying how and where such variables get integrated in order to yield a coherent behavioral output.
Collapse
Affiliation(s)
- Shinsuke Suzuki
- Brain, Mind and Markets Laboratory, Department of Finance, Faculty of Business and Economics, The University of Melbourne, Parkville, Australia; Frontier Research Institute for Interdisciplinary Sciences, Tohoku University, Sendai, Japan.
| | - John P O'Doherty
- Division of the Humanities and Social Sciences, California Institute of Technology, Pasadena, USA; Computation and Neural Systems, California Institute of Technology, Pasadena, USA
| |
Collapse
|
42
|
Schüller T, Fischer AG, Gruendler TOJ, Baldermann JC, Huys D, Ullsperger M, Kuhn J. Decreased transfer of value to action in Tourette syndrome. Cortex 2020; 126:39-48. [PMID: 32062469 DOI: 10.1016/j.cortex.2019.12.027] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2019] [Revised: 09/13/2019] [Accepted: 12/26/2019] [Indexed: 11/30/2022]
Abstract
OBJECTIVE Tourette syndrome is a neurodevelopmental disorder putatively associated with a hyperdopaminergic state. Therefore, it seems plausible that excessive dopamine transmission in Tourette syndrome alters the ability to learn based on rewards and punishments. We tested whether Tourette syndrome patients exhibited altered reinforcement learning and corresponding feedback-related EEG deflections. METHODS We used a reinforcement learning task providing factual and counterfactual feedback in a sample of 15 Tourette syndrome patients and matched healthy controls whilst recording EEG. The paradigm presented various reward probabilities to enforce adaptive adjustments. We employed a computational model to derive estimates of the prediction error, which we used for single-trial regression analysis of the EEG data. RESULTS We found that Tourette syndrome patients showed increased choice stochasticity compared to controls. The feedback-related negativity represented an axiomatic prediction error for factual feedback and did not differ between groups. We observed attenuated P3a modulation specifically for factual feedback in Tourette syndrome patients, representing impaired coding of attention allocation. CONCLUSION Our findings indicate that cortical prediction error coding is unaffected by Tourette syndrome. Nonetheless, the transfer of learned values into choice formation is degraded, in line with a hyperdopaminergic state.
Collapse
Affiliation(s)
- Thomas Schüller
- University of Cologne, Faculty of Medicine and University Hospital Cologne, Department of Psychiatry and Psychotherapy, Cologne, Germany.
| | - Adrian G Fischer
- Otto von Guericke University, Center for Behavioral Brain Sciences, Magdeburg, Germany; Freie Universität Berlin, Center for Cognitive Neuroscience, Berlin, Germany
| | - Theo O J Gruendler
- Otto von Guericke University, Center for Behavioral Brain Sciences, Magdeburg, Germany
| | - Juan Carlos Baldermann
- University of Cologne, Faculty of Medicine and University Hospital Cologne, Department of Psychiatry and Psychotherapy, Cologne, Germany
| | - Daniel Huys
- University of Cologne, Faculty of Medicine and University Hospital Cologne, Department of Psychiatry and Psychotherapy, Cologne, Germany
| | - Markus Ullsperger
- Otto von Guericke University, Center for Behavioral Brain Sciences, Magdeburg, Germany; Otto von Guericke University, Institute of Psychology, Magdeburg, Germany
| | - Jens Kuhn
- University of Cologne, Faculty of Medicine and University Hospital Cologne, Department of Psychiatry and Psychotherapy, Cologne, Germany; Johanniter Hospital Oberhausen, Department of Psychiatry, Psychotherapy and Psychosomatic, Oberhausen, Germany
| |
Collapse
|
43
|
Ballard I, Miller EM, Piantadosi ST, Goodman ND, McClure SM. Beyond Reward Prediction Errors: Human Striatum Updates Rule Values During Learning. Cereb Cortex 2019; 28:3965-3975. [PMID: 29040494 DOI: 10.1093/cercor/bhx259] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2017] [Accepted: 09/13/2017] [Indexed: 11/13/2022] Open
Abstract
Humans naturally group the world into coherent categories defined by membership rules. Rules can be learned implicitly by building stimulus-response associations using reinforcement learning or by using explicit reasoning. We tested if the striatum, in which activation reliably scales with reward prediction error, would track prediction errors in a task that required explicit rule generation. Using functional magnetic resonance imaging during a categorization task, we show that striatal responses to feedback scale with a "surprise" signal derived from a Bayesian rule-learning model and are inconsistent with RL prediction error. We also find that striatum and caudal inferior frontal sulcus (cIFS) are involved in updating the likelihood of discriminative rules. We conclude that the striatum, in cooperation with the cIFS, is involved in updating the values assigned to categorization rules when people learn using explicit reasoning.
Collapse
Affiliation(s)
- Ian Ballard
- Stanford Neurosciences Graduate Training Program, Stanford University, Stanford, CA, USA
| | - Eric M Miller
- Department of Psychology, Stanford University, Stanford, CA, USA
| | - Steven T Piantadosi
- Department of Brain and Cognitive Sciences, University of Rochester, Rochester, NY, USA
| | - Noah D Goodman
- Department of Psychology, Stanford University, Stanford, CA, USA
| | - Samuel M McClure
- Department of Psychology, Arizona State University, Tempe, AZ, USA
| |
Collapse
|
44
|
Dissociating neural learning signals in human sign- and goal-trackers. Nat Hum Behav 2019; 4:201-214. [DOI: 10.1038/s41562-019-0765-5] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2017] [Accepted: 09/25/2019] [Indexed: 01/20/2023]
|
45
|
Klein-Flügge MC, Wittmann MK, Shpektor A, Jensen DEA, Rushworth MFS. Multiple associative structures created by reinforcement and incidental statistical learning mechanisms. Nat Commun 2019; 10:4835. [PMID: 31645545 PMCID: PMC6811627 DOI: 10.1038/s41467-019-12557-z] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2018] [Accepted: 09/16/2019] [Indexed: 01/07/2023] Open
Abstract
Learning the structure of the world can be driven by reinforcement but also occurs incidentally through experience. Reinforcement learning theory has provided insight into how prediction errors drive updates in beliefs but less attention has been paid to the knowledge resulting from such learning. Here we contrast associative structures formed through reinforcement and experience of task statistics. BOLD neuroimaging in human volunteers demonstrates rigid representations of rewarded sequences in temporal pole and posterior orbito-frontal cortex, which are constructed backwards from reward. By contrast, medial prefrontal cortex and a hippocampal-amygdala border region carry reward-related knowledge but also flexible statistical knowledge of the currently relevant task model. Intriguingly, ventral striatum encodes prediction error responses but not the full RL- or statistically derived task knowledge. In summary, representations of task knowledge are derived via multiple learning processes operating at different time scales that are associated with partially overlapping and partially specialized anatomical regions. Associative learning occurs through reinforcement mechanisms as well as incidentally through experience of statistical relationships. Here, the authors report that these two learning processes are associated with specialized anatomical regions that operate at different time scales.
Collapse
Affiliation(s)
- Miriam C Klein-Flügge
- Department of Experimental Psychology, University of Oxford, Tinsley Building, Mansfield Road, Oxford, OX1 3TA, UK. .,Wellcome Centre for Integrative Neuroimaging (WIN), University of Oxford, Nuffield Department of Clinical Neurosciences, Level 6, West Wing, John Radcliffe Hospital, Oxford, OX3 9DU, UK.
| | - Marco K Wittmann
- Department of Experimental Psychology, University of Oxford, Tinsley Building, Mansfield Road, Oxford, OX1 3TA, UK.,Wellcome Centre for Integrative Neuroimaging (WIN), University of Oxford, Nuffield Department of Clinical Neurosciences, Level 6, West Wing, John Radcliffe Hospital, Oxford, OX3 9DU, UK
| | - Anna Shpektor
- Wellcome Centre for Integrative Neuroimaging (WIN), University of Oxford, Nuffield Department of Clinical Neurosciences, Level 6, West Wing, John Radcliffe Hospital, Oxford, OX3 9DU, UK
| | - Daria E A Jensen
- Department of Experimental Psychology, University of Oxford, Tinsley Building, Mansfield Road, Oxford, OX1 3TA, UK.,Wellcome Centre for Integrative Neuroimaging (WIN), University of Oxford, Nuffield Department of Clinical Neurosciences, Level 6, West Wing, John Radcliffe Hospital, Oxford, OX3 9DU, UK.,Department of Psychiatry, University of Oxford, Warneford Hospital, Oxford, OX3 7JX, UK
| | - Matthew F S Rushworth
- Department of Experimental Psychology, University of Oxford, Tinsley Building, Mansfield Road, Oxford, OX1 3TA, UK.,Wellcome Centre for Integrative Neuroimaging (WIN), University of Oxford, Nuffield Department of Clinical Neurosciences, Level 6, West Wing, John Radcliffe Hospital, Oxford, OX3 9DU, UK
| |
Collapse
|
46
|
Michely J, Rigoli F, Rutledge RB, Hauser TU, Dolan RJ. Distinct Processing of Aversive Experience in Amygdala Subregions. BIOLOGICAL PSYCHIATRY: COGNITIVE NEUROSCIENCE AND NEUROIMAGING 2019; 5:291-300. [PMID: 31542358 PMCID: PMC7059109 DOI: 10.1016/j.bpsc.2019.07.008] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/10/2019] [Revised: 07/22/2019] [Accepted: 07/22/2019] [Indexed: 11/21/2022]
Abstract
Background The amygdala is an anatomically complex medial temporal brain structure whose subregions are considered to serve distinct functions. However, their precise role in mediating human aversive experience remains ill understood. Methods We used functional magnetic resonance imaging in 39 healthy volunteers with varying levels of trait anxiety to assess distinct contributions of the basolateral amygdala (BLA) and centromedial amygdala to anticipation and experience of aversive events. Additionally, we examined the relationship between any identified functional subspecialization and measures of subjective reported aversion and trait anxiety. Results Our results show that the centromedial amygdala is responsive to aversive outcomes but insensitive to predictive aversive cues. In contrast, the BLA encodes an aversive prediction error that quantifies whether cues and outcomes are worse than expected. A neural representation within the BLA for distinct threat levels was mirrored in self-reported subjective anxiety across individuals. Furthermore, high trait-anxious individuals were characterized by indiscriminately heightened BLA activity in response to aversive cues, regardless of actual threat level. Conclusions Our results demonstrate that amygdala subregions are distinctly engaged in processing of aversive experience, with elevated and undifferentiated BLA responses to threat emerging as a potential neurobiological mediator of vulnerability to anxiety disorders.
Collapse
Affiliation(s)
- Jochen Michely
- Wellcome Centre for Human Neuroimaging, University College London, London, United Kingdom; Max Planck UCL Centre for Computational Psychiatry and Ageing Research, University College London, London, United Kingdom.
| | - Francesco Rigoli
- Wellcome Centre for Human Neuroimaging, University College London, London, United Kingdom; Department of Psychology, University of London, London, United Kingdom
| | - Robb B Rutledge
- Wellcome Centre for Human Neuroimaging, University College London, London, United Kingdom; Max Planck UCL Centre for Computational Psychiatry and Ageing Research, University College London, London, United Kingdom
| | - Tobias U Hauser
- Wellcome Centre for Human Neuroimaging, University College London, London, United Kingdom; Max Planck UCL Centre for Computational Psychiatry and Ageing Research, University College London, London, United Kingdom
| | - Raymond J Dolan
- Wellcome Centre for Human Neuroimaging, University College London, London, United Kingdom; Max Planck UCL Centre for Computational Psychiatry and Ageing Research, University College London, London, United Kingdom
| |
Collapse
|
47
|
Kirk U, Pagnoni G, Hétu S, Montague R. Short-term mindfulness practice attenuates reward prediction errors signals in the brain. Sci Rep 2019; 9:6964. [PMID: 31061515 PMCID: PMC6502850 DOI: 10.1038/s41598-019-43474-2] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2018] [Accepted: 04/23/2019] [Indexed: 12/29/2022] Open
Abstract
Activity changes in dopaminergic neurons encode the ongoing discrepancy between expected and actual value of a stimulus, providing a teaching signal for a reward prediction process. Previous work comparing a cohort of long-term Zen meditators to controls demonstrated an attenuation of reward prediction signals to appetitive reward in the striatum. Using a cross-commodity design encompassing primary- and secondary-reward conditioning experiments, the present study asks the question of whether reward prediction signals are causally altered by mindfulness training in naïve subjects. Volunteers were randomly assigned to 8 weeks of mindfulness training (MT), active control training (CT), or a one-time mindfulness induction group (MI). We observed a decreased response to positive prediction errors in the putamen in the MT group compared to CT using both a primary and a secondary-reward experiment. Furthermore, the posterior insula showed greater activation to primary rewards, independently of their predictability, in the MT group, relative to CT and MI group. These results support the notion that increased attention to the present moment and its interoceptive features - a core component of mindfulness practice - may reduce predictability effects in reward processing, without dampening (in fact, enhancing) the response to the actual delivery of the stimulus.
Collapse
Affiliation(s)
- Ulrich Kirk
- Department of Psychology, University of Southern Denmark, Odense, Denmark. .,The Warburg Institute, University of London, London, United Kingdom.
| | - Giuseppe Pagnoni
- Department of Biomedical, Metabolic and Neural Sciences, University of Modena and Reggio Emilia, Modena, Italy.,Center for Neuroscience and Neurotechnology, University of Modena and Reggio Emilia, Modena, Italy
| | - Sébastien Hétu
- Department of Psychology, Université de Montréal, Montréal, Canada
| | - Read Montague
- Human Neuroimaging Laboratory, Fralin Biomedical Research Institute at VTC, Roanoke, United States.,Wellcome Trust Centre for Neuroimaging, University College London, London, United Kingdom
| |
Collapse
|
48
|
Hauser TU, Will GJ, Dubois M, Dolan RJ. Annual Research Review: Developmental computational psychiatry. J Child Psychol Psychiatry 2019; 60:412-426. [PMID: 30252127 DOI: 10.1111/jcpp.12964] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 07/03/2018] [Indexed: 11/29/2022]
Abstract
Most psychiatric disorders emerge during childhood and adolescence. This is also a period that coincides with the brain undergoing substantial growth and reorganisation. However, it remains unclear how a heightened vulnerability to psychiatric disorder relates to this brain maturation. Here, we propose 'developmental computational psychiatry' as a framework for linking brain maturation to cognitive development. We argue that through modelling some of the brain's fundamental cognitive computations, and relating them to brain development, we can bridge the gap between brain and cognitive development. This in turn can lead to a richer understanding of the ontogeny of psychiatric disorders. We illustrate this perspective with examples from reinforcement learning and dopamine function. Specifically, we show how computational modelling deepens an understanding of how cognitive processes, such as reward learning, effort learning, and social learning might go awry in psychiatric disorders. Finally, we sketch the promises and limitations of a developmental computational psychiatry.
Collapse
Affiliation(s)
- Tobias U Hauser
- Max Planck University College London Centre for Computational Psychiatry and Ageing Research, London, UK.,Wellcome Centre for Human Neuroimaging, University College London, London, UK
| | - Geert-Jan Will
- Max Planck University College London Centre for Computational Psychiatry and Ageing Research, London, UK.,Institute of Psychology, Leiden University, Leiden, The Netherlands
| | - Magda Dubois
- Max Planck University College London Centre for Computational Psychiatry and Ageing Research, London, UK
| | - Raymond J Dolan
- Max Planck University College London Centre for Computational Psychiatry and Ageing Research, London, UK.,Wellcome Centre for Human Neuroimaging, University College London, London, UK
| |
Collapse
|
49
|
Musical reward prediction errors engage the nucleus accumbens and motivate learning. Proc Natl Acad Sci U S A 2019; 116:3310-3315. [PMID: 30728301 DOI: 10.1073/pnas.1809855116] [Citation(s) in RCA: 66] [Impact Index Per Article: 13.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Enjoying music reliably ranks among life's greatest pleasures. Like many hedonic experiences, it engages several reward-related brain areas, with activity in the nucleus accumbens (NAc) most consistently reflecting the listener's subjective response. Converging evidence suggests that this activity arises from musical "reward prediction errors" (RPEs) that signal the difference between expected and perceived musical events, but this hypothesis has not been directly tested. In the present fMRI experiment, we assessed whether music could elicit formally modeled RPEs in the NAc by applying a well-established decision-making protocol designed and validated for studying RPEs. In the scanner, participants chose between arbitrary cues that probabilistically led to dissonant or consonant music, and learned to make choices associated with the consonance, which they preferred. We modeled regressors of trial-by-trial RPEs, finding that NAc activity tracked musically elicited RPEs, to an extent that explained variance in the individual learning rates. These results demonstrate that music can act as a reward, driving learning and eliciting RPEs in the NAc, a hub of reward- and music enjoyment-related activity.
Collapse
|
50
|
Abstract
Habits form a crucial component of behavior. In recent years, key computational models have conceptualized habits as arising from model-free reinforcement learning mechanisms, which typically select between available actions based on the future value expected to result from each. Traditionally, however, habits have been understood as behaviors that can be triggered directly by a stimulus, without requiring the animal to evaluate expected outcomes. Here, we develop a computational model instantiating this traditional view, in which habits develop through the direct strengthening of recently taken actions rather than through the encoding of outcomes. We demonstrate that this model accounts for key behavioral manifestations of habits, including insensitivity to outcome devaluation and contingency degradation, as well as the effects of reinforcement schedule on the rate of habit formation. The model also explains the prevalent observation of perseveration in repeated-choice tasks as an additional behavioral manifestation of the habit system. We suggest that mapping habitual behaviors onto value-free mechanisms provides a parsimonious account of existing behavioral and neural data. This mapping may provide a new foundation for building robust and comprehensive models of the interaction of habits with other, more goal-directed types of behaviors and help to better guide research into the neural mechanisms underlying control of instrumental behavior more generally. (PsycINFO Database Record (c) 2019 APA, all rights reserved).
Collapse
Affiliation(s)
| | - Amitai Shenhav
- Department of Cognitive, Linguistic, and Psychological Sciences, Brown Institute for Brain Science, Brown University
| | | |
Collapse
|