1
|
Heimer O, Kron A, Hertz U. Temporal dynamics of the semantic versus affective representations of valence during reversal learning. Cognition 2023; 236:105423. [PMID: 36933517 DOI: 10.1016/j.cognition.2023.105423] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2022] [Revised: 02/25/2023] [Accepted: 02/27/2023] [Indexed: 03/18/2023]
Abstract
Valence, the representation of a stimulus in terms of good or bad, plays a central role in models of affect, value-based learning theories, and value-based decision-making models. Previous work used Unconditioned Stimulus (US) to support a theoretical division between two different types of valence representations for a stimulus: the semantic representation of valence, i.e., stored accumulated knowledge about the value of the stimulus, and the affective representation of valence, i.e., the valence of the affective response to this stimulus. The current work extended past research by using a neutral Conditioned Stimulus (CS) in the context of reversal learning, a type of associative learning. The impact of expected uncertainty (the variability of rewards) and unexpected uncertainty (reversal) on the evolving temporal dynamics of the two types of valence representations of the CS was tested in two experiments. Results show that in an environment presenting the two types of uncertainty, the adaptation process (learning rate) of the choices and of the semantic valence representation is slower than the adaptation of the affective valence representation. In contrast, in environments with only unexpected uncertainty (i.e., fixed rewards), there is no difference in the temporal dynamics of the two types of valence representations. Implications for models of affect, value-based learning theories, and value-based decision-making models are discussed.
Collapse
Affiliation(s)
- Orit Heimer
- Department of Psychology, University of Haifa, Haifa, Israel.
| | - Assaf Kron
- Department of Psychology, University of Haifa, Haifa, Israel
| | - Uri Hertz
- Department of Cognitive Sciences, University of Haifa, Haifa, Israel
| |
Collapse
|
2
|
Bertram T, Hoffmann Ayala D, Huber M, Brandl F, Starke G, Sorg C, Mulej Bratec S. Human threat circuits: Threats of pain, aggressive conspecific, and predator elicit distinct BOLD activations in the amygdala and hypothalamus. Front Psychiatry 2023; 13:1063238. [PMID: 36733415 PMCID: PMC9887727 DOI: 10.3389/fpsyt.2022.1063238] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/06/2022] [Accepted: 12/16/2022] [Indexed: 01/12/2023] Open
Abstract
Introduction Threat processing, enabled by threat circuits, is supported by a remarkably conserved neural architecture across mammals. Threatening stimuli relevant for most species include the threat of being attacked by a predator or an aggressive conspecific and the threat of pain. Extensive studies in rodents have associated the threats of pain, predator attack and aggressive conspecific attack with distinct neural circuits in subregions of the amygdala, the hypothalamus and the periaqueductal gray. Bearing in mind the considerable conservation of both the anatomy of these regions and defensive behaviors across mammalian species, we hypothesized that distinct brain activity corresponding to the threats of pain, predator attack and aggressive conspecific attack would also exist in human subcortical brain regions. Methods Forty healthy female subjects underwent fMRI scanning during aversive classical conditioning. In close analogy to rodent studies, threat stimuli consisted of painful electric shocks, a short video clip of an attacking bear and a short video clip of an attacking man. Threat processing was conceptualized as the expectation of the aversive stimulus during the presentation of the conditioned stimulus. Results Our results demonstrate differential brain activations in the left and right amygdala as well as in the left hypothalamus for the threats of pain, predator attack and aggressive conspecific attack, for the first time showing distinct threat-related brain activity within the human subcortical brain. Specifically, the threat of pain showed an increase of activity in the left and right amygdala and the left hypothalamus compared to the threat of conspecific attack (pain > conspecific), and increased activity in the left amygdala compared to the threat of predator attack (pain > predator). Threat of conspecific attack revealed heightened activity in the right amygdala, both in comparison to threat of pain (conspecific > pain) and threat of predator attack (conspecific > predator). Finally, for the condition threat of predator attack we found increased activity in the bilateral amygdala and the hypothalamus when compared to threat of conspecific attack (predator > conspecific). No significant clusters were found for the contrast predator attack > pain. Conclusion Results suggest that threat type-specific circuits identified in rodents might be conserved in the human brain.
Collapse
Affiliation(s)
- Teresa Bertram
- Department of Neuroradiology, Klinikum Rechts der Isar, Technical University of Munich, Munich, Germany
- TUM-NIC Neuroimaging Center, Klinikum Rechts der Isar, Technical University of Munich, Munich, Germany
- Department of Psychiatry and Psychotherapy, Klinikum Rechts der Isar, Technical University of Munich, Munich, Germany
| | - Daniel Hoffmann Ayala
- Department of Neuroradiology, Klinikum Rechts der Isar, Technical University of Munich, Munich, Germany
- TUM-NIC Neuroimaging Center, Klinikum Rechts der Isar, Technical University of Munich, Munich, Germany
- Department of Neurosurgery, Klinikum Großhadern, Ludwig-Maximilians-University, Munich, Germany
| | - Maria Huber
- Department of Neuroradiology, Klinikum Rechts der Isar, Technical University of Munich, Munich, Germany
- TUM-NIC Neuroimaging Center, Klinikum Rechts der Isar, Technical University of Munich, Munich, Germany
| | - Felix Brandl
- Department of Neuroradiology, Klinikum Rechts der Isar, Technical University of Munich, Munich, Germany
- TUM-NIC Neuroimaging Center, Klinikum Rechts der Isar, Technical University of Munich, Munich, Germany
- Department of Psychiatry and Psychotherapy, Klinikum Rechts der Isar, Technical University of Munich, Munich, Germany
| | - Georg Starke
- Department of Neuroradiology, Klinikum Rechts der Isar, Technical University of Munich, Munich, Germany
- TUM-NIC Neuroimaging Center, Klinikum Rechts der Isar, Technical University of Munich, Munich, Germany
- College of Humanities, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
| | - Christian Sorg
- Department of Neuroradiology, Klinikum Rechts der Isar, Technical University of Munich, Munich, Germany
- TUM-NIC Neuroimaging Center, Klinikum Rechts der Isar, Technical University of Munich, Munich, Germany
- Department of Psychiatry and Psychotherapy, Klinikum Rechts der Isar, Technical University of Munich, Munich, Germany
| | - Satja Mulej Bratec
- Department of Neuroradiology, Klinikum Rechts der Isar, Technical University of Munich, Munich, Germany
- TUM-NIC Neuroimaging Center, Klinikum Rechts der Isar, Technical University of Munich, Munich, Germany
- Department of Psychology, Faculty of Arts, University of Maribor, Maribor, Slovenia
| |
Collapse
|
3
|
Human inference reflects a normative balance of complexity and accuracy. Nat Hum Behav 2022; 6:1153-1168. [PMID: 35637296 PMCID: PMC9446026 DOI: 10.1038/s41562-022-01357-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2020] [Accepted: 04/20/2022] [Indexed: 02/03/2023]
Abstract
We must often infer latent properties of the world from noisy and changing observations. Complex, probabilistic approaches to this challenge such as Bayesian inference are accurate but cognitively demanding, relying on extensive working memory and adaptive processing. Simple heuristics are easy to implement but may be less accurate. What is the appropriate balance between complexity and accuracy? Here we model a hierarchy of strategies of variable complexity and find a power law of diminishing returns: increasing complexity gives progressively smaller gains in accuracy. The rate of diminishing returns depends systematically on the statistical uncertainty in the world, such that complex strategies do not provide substantial benefits over simple ones when uncertainty is either too high or too low. In between, there is a complexity dividend. In two psychophysical experiments, we confirm specific model predictions about how working memory and adaptivity should be modulated by uncertainty.
Collapse
|
4
|
Wang S, Feng SF, Bornstein AM. Mixing memory and desire: How memory reactivation supports deliberative decision-making. WILEY INTERDISCIPLINARY REVIEWS. COGNITIVE SCIENCE 2021; 13:e1581. [PMID: 34665529 DOI: 10.1002/wcs.1581] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/07/2020] [Revised: 08/24/2021] [Accepted: 09/16/2021] [Indexed: 11/09/2022]
Abstract
Memories affect nearly every aspect of our mental life. They allow us to both resolve uncertainty in the present and to construct plans for the future. Recently, renewed interest in the role memory plays in adaptive behavior has led to new theoretical advances and empirical observations. We review key findings, with particular emphasis on how the retrieval of many kinds of memories affects deliberative action selection. These results are interpreted in a sequential inference framework, in which reinstatements from memory serve as "samples" of potential action outcomes. The resulting model suggests a central role for the dynamics of memory reactivation in determining the influence of different kinds of memory in decisions. We propose that representation-specific dynamics can implement a bottom-up "product of experts" rule that integrates multiple sets of action-outcome predictions weighted based on their uncertainty. We close by reviewing related findings and identifying areas for further research. This article is categorized under: Psychology > Reasoning and Decision Making Neuroscience > Cognition Neuroscience > Computation.
Collapse
Affiliation(s)
- Shaoming Wang
- Department of Psychology, New York University, New York, New York, USA
| | - Samuel F Feng
- Department of Mathematics, Khalifa University of Science and Technology, Abu Dhabi, UAE.,Khalifa University Centre for Biotechnology, Khalifa University of Science and Technology, Abu Dhabi, UAE
| | - Aaron M Bornstein
- Department of Cognitive Sciences, University of California-Irvine, Irvine, California, USA.,Center for the Neurobiology of Learning & Memory, University of California-Irvine, Irvine, California, USA.,Institute for Mathematical Behavioral Sciences, University of California-Irvine, Irvine, California, USA
| |
Collapse
|
5
|
Reward and fictive prediction error signals in ventral striatum: asymmetry between factual and counterfactual processing. Brain Struct Funct 2021; 226:1553-1569. [PMID: 33839955 DOI: 10.1007/s00429-021-02270-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2020] [Accepted: 03/27/2021] [Indexed: 10/21/2022]
Abstract
Reward prediction error, the difference between the expected and obtained reward, is known to act as a reinforcement learning neural signal. In the current study, we propose a model fitting approach that combines behavioral and neural data to fit computational models of reinforcement learning. Briefly, we penalized subject-specific fitted parameters that moved away too far from the group median, except when that deviation led to an improvement in the model's fit to neural responses. By means of a probabilistic monetary learning task and fMRI, we compared our approach with standard model fitting methods. Q-learning outperformed actor-critic at both behavioral and neural level, although the inclusion of neuroimaging data into model fitting improved the fit of actor-critic models. We observed both action-value and state-value prediction error signals in the striatum, while standard model fitting approaches failed to capture state-value signals. Finally, left ventral striatum correlated with reward prediction error while right ventral striatum with fictive prediction error, suggesting a functional hemispheric asymmetry regarding prediction-error driven learning.
Collapse
|
6
|
Klein-Flügge MC, Wittmann MK, Shpektor A, Jensen DEA, Rushworth MFS. Multiple associative structures created by reinforcement and incidental statistical learning mechanisms. Nat Commun 2019; 10:4835. [PMID: 31645545 PMCID: PMC6811627 DOI: 10.1038/s41467-019-12557-z] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2018] [Accepted: 09/16/2019] [Indexed: 01/07/2023] Open
Abstract
Learning the structure of the world can be driven by reinforcement but also occurs incidentally through experience. Reinforcement learning theory has provided insight into how prediction errors drive updates in beliefs but less attention has been paid to the knowledge resulting from such learning. Here we contrast associative structures formed through reinforcement and experience of task statistics. BOLD neuroimaging in human volunteers demonstrates rigid representations of rewarded sequences in temporal pole and posterior orbito-frontal cortex, which are constructed backwards from reward. By contrast, medial prefrontal cortex and a hippocampal-amygdala border region carry reward-related knowledge but also flexible statistical knowledge of the currently relevant task model. Intriguingly, ventral striatum encodes prediction error responses but not the full RL- or statistically derived task knowledge. In summary, representations of task knowledge are derived via multiple learning processes operating at different time scales that are associated with partially overlapping and partially specialized anatomical regions. Associative learning occurs through reinforcement mechanisms as well as incidentally through experience of statistical relationships. Here, the authors report that these two learning processes are associated with specialized anatomical regions that operate at different time scales.
Collapse
Affiliation(s)
- Miriam C Klein-Flügge
- Department of Experimental Psychology, University of Oxford, Tinsley Building, Mansfield Road, Oxford, OX1 3TA, UK. .,Wellcome Centre for Integrative Neuroimaging (WIN), University of Oxford, Nuffield Department of Clinical Neurosciences, Level 6, West Wing, John Radcliffe Hospital, Oxford, OX3 9DU, UK.
| | - Marco K Wittmann
- Department of Experimental Psychology, University of Oxford, Tinsley Building, Mansfield Road, Oxford, OX1 3TA, UK.,Wellcome Centre for Integrative Neuroimaging (WIN), University of Oxford, Nuffield Department of Clinical Neurosciences, Level 6, West Wing, John Radcliffe Hospital, Oxford, OX3 9DU, UK
| | - Anna Shpektor
- Wellcome Centre for Integrative Neuroimaging (WIN), University of Oxford, Nuffield Department of Clinical Neurosciences, Level 6, West Wing, John Radcliffe Hospital, Oxford, OX3 9DU, UK
| | - Daria E A Jensen
- Department of Experimental Psychology, University of Oxford, Tinsley Building, Mansfield Road, Oxford, OX1 3TA, UK.,Wellcome Centre for Integrative Neuroimaging (WIN), University of Oxford, Nuffield Department of Clinical Neurosciences, Level 6, West Wing, John Radcliffe Hospital, Oxford, OX3 9DU, UK.,Department of Psychiatry, University of Oxford, Warneford Hospital, Oxford, OX3 7JX, UK
| | - Matthew F S Rushworth
- Department of Experimental Psychology, University of Oxford, Tinsley Building, Mansfield Road, Oxford, OX1 3TA, UK.,Wellcome Centre for Integrative Neuroimaging (WIN), University of Oxford, Nuffield Department of Clinical Neurosciences, Level 6, West Wing, John Radcliffe Hospital, Oxford, OX3 9DU, UK
| |
Collapse
|
7
|
Buch VP, Richardson AG, Brandon C, Stiso J, Khattak MN, Bassett DS, Lucas TH. Network Brain-Computer Interface (nBCI): An Alternative Approach for Cognitive Prosthetics. Front Neurosci 2018; 12:790. [PMID: 30443203 PMCID: PMC6221897 DOI: 10.3389/fnins.2018.00790] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2018] [Accepted: 10/12/2018] [Indexed: 11/13/2022] Open
Abstract
Brain computer interfaces (BCIs) have been applied to sensorimotor systems for many years. However, BCI technology has broad potential beyond sensorimotor systems. The emerging field of cognitive prosthetics, for example, promises to improve learning and memory for patients with cognitive impairment. Unfortunately, our understanding of the neural mechanisms underlying these cognitive processes remains limited in part due to the extensive individual variability in neural coding and circuit function. As a consequence, the development of methods to ascertain optimal control signals for cognitive decoding and restoration remains an active area of inquiry. To advance the field, robust tools are required to quantify time-varying and task-dependent brain states predictive of cognitive performance. Here, we suggest that network science is a natural language in which to formulate and apply such tools. In support of our argument, we offer a simple demonstration of the feasibility of a network approach to BCI control signals, which we refer to as network BCI (nBCI). Finally, in a single subject example, we show that nBCI can reliably predict online cognitive performance and is superior to certain common spectral approaches currently used in BCIs. Our review of the literature and preliminary findings support the notion that nBCI could provide a powerful approach for future applications in cognitive prosthetics.
Collapse
Affiliation(s)
- Vivek P Buch
- Department of Neurosurgery, Hospital of the University of Pennsylvania, Philadelphia, PA, United States
| | - Andrew G Richardson
- Department of Neurosurgery, Hospital of the University of Pennsylvania, Philadelphia, PA, United States
| | - Cameron Brandon
- Department of Neurosurgery, Hospital of the University of Pennsylvania, Philadelphia, PA, United States
| | - Jennifer Stiso
- Department of Neuroscience, University of Pennsylvania, Philadelphia, PA, United States
| | - Monica N Khattak
- Department of Neurosurgery, Hospital of the University of Pennsylvania, Philadelphia, PA, United States
| | - Danielle S Bassett
- Department of Bioengineering, University of Pennsylvania, Philadelphia, PA, United States.,Department of Physics and Astronomy, University of Pennsylvania, Philadelphia, PA, United States.,Department of Neurology, University of Pennsylvania, Philadelphia, PA, United States.,Department of Electrical and Systems Engineering, University of Pennsylvania, Philadelphia, PA, United States
| | - Timothy H Lucas
- Department of Neurosurgery, Hospital of the University of Pennsylvania, Philadelphia, PA, United States.,Department of Neuroscience, University of Pennsylvania, Philadelphia, PA, United States
| |
Collapse
|
8
|
Meder D, Kolling N, Verhagen L, Wittmann MK, Scholl J, Madsen KH, Hulme OJ, Behrens TEJ, Rushworth MFS. Simultaneous representation of a spectrum of dynamically changing value estimates during decision making. Nat Commun 2017; 8:1942. [PMID: 29208968 PMCID: PMC5717172 DOI: 10.1038/s41467-017-02169-w] [Citation(s) in RCA: 49] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2017] [Accepted: 11/07/2017] [Indexed: 01/26/2023] Open
Abstract
Decisions are based on value expectations derived from experience. We show that dorsal anterior cingulate cortex and three other brain regions hold multiple representations of choice value based on different timescales of experience organized in terms of systematic gradients across the cortex. Some parts of each area represent value estimates based on recent reward experience while others represent value estimates based on experience over the longer term. The value estimates within these areas interact with one another according to their temporal scaling. Some aspects of the representations change dynamically as the environment changes. The spectrum of value estimates may act as a flexible selection mechanism for combining experience-derived value information with other aspects of value to allow flexible and adaptive decisions in changing environments.
Collapse
Affiliation(s)
- David Meder
- Wellcome Centre for Integrative Neuroimaging (WIN), Department of Experimental Psychology, University of Oxford, South Parks Road, Oxford, OX1 3UD, UK. .,Danish Research Centre for Magnetic Resonance, Centre for Functional and Diagnostic Imaging and Research, Copenhagen University Hospital Hvidovre, Hvidovre, 2650, Denmark.
| | - Nils Kolling
- Wellcome Centre for Integrative Neuroimaging (WIN), Department of Experimental Psychology, University of Oxford, South Parks Road, Oxford, OX1 3UD, UK.,Wellcome Centre for Integrative Neuroimaging (WIN), Centre for Functional MRI of the Brain (FMRIB), Nuffield Department of Clinical Neurosciences, John Radcliffe Hospital, University of Oxford, Oxford, OX3 9DU, UK
| | - Lennart Verhagen
- Wellcome Centre for Integrative Neuroimaging (WIN), Department of Experimental Psychology, University of Oxford, South Parks Road, Oxford, OX1 3UD, UK
| | - Marco K Wittmann
- Wellcome Centre for Integrative Neuroimaging (WIN), Department of Experimental Psychology, University of Oxford, South Parks Road, Oxford, OX1 3UD, UK.,Wellcome Centre for Integrative Neuroimaging (WIN), Centre for Functional MRI of the Brain (FMRIB), Nuffield Department of Clinical Neurosciences, John Radcliffe Hospital, University of Oxford, Oxford, OX3 9DU, UK
| | - Jacqueline Scholl
- Wellcome Centre for Integrative Neuroimaging (WIN), Department of Experimental Psychology, University of Oxford, South Parks Road, Oxford, OX1 3UD, UK
| | - Kristoffer H Madsen
- Danish Research Centre for Magnetic Resonance, Centre for Functional and Diagnostic Imaging and Research, Copenhagen University Hospital Hvidovre, Hvidovre, 2650, Denmark
| | - Oliver J Hulme
- Danish Research Centre for Magnetic Resonance, Centre for Functional and Diagnostic Imaging and Research, Copenhagen University Hospital Hvidovre, Hvidovre, 2650, Denmark
| | - Timothy E J Behrens
- Wellcome Centre for Integrative Neuroimaging (WIN), Centre for Functional MRI of the Brain (FMRIB), Nuffield Department of Clinical Neurosciences, John Radcliffe Hospital, University of Oxford, Oxford, OX3 9DU, UK
| | - Matthew F S Rushworth
- Wellcome Centre for Integrative Neuroimaging (WIN), Department of Experimental Psychology, University of Oxford, South Parks Road, Oxford, OX1 3UD, UK.,Wellcome Centre for Integrative Neuroimaging (WIN), Centre for Functional MRI of the Brain (FMRIB), Nuffield Department of Clinical Neurosciences, John Radcliffe Hospital, University of Oxford, Oxford, OX3 9DU, UK
| |
Collapse
|
9
|
Parthasarathi T, McConnell MH, Luery J, Kable JW. The Vivid Present: Visualization Abilities Are Associated with Steep Discounting of Future Rewards. Front Psychol 2017; 8:289. [PMID: 28321198 PMCID: PMC5337487 DOI: 10.3389/fpsyg.2017.00289] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2016] [Accepted: 02/15/2017] [Indexed: 11/14/2022] Open
Abstract
Humans and other animals discount the value of future rewards, a phenomenon known as delay discounting. Individuals vary widely in the extent to which they discount future rewards, and these tendencies have been associated with important life outcomes. Recent studies have demonstrated that imagining the future reduces subsequent discounting behavior, but no research to date has examined whether a similar principle applies at the trait level, and whether training visualization changes discounting. The current study examined if individual differences in visualization abilities are linked to individual differences in discounting and whether practicing visualization can change discounting behaviors in a lasting way. Participants (n = 48) completed the Vividness of Visual Imagery Questionnaire (VVIQ) and delay discounting task and then underwent a 4-week intervention consisting of visualization training (intervention) or relaxation training (control). Contrary to our hypotheses, participants who reported greater visualization abilities (lower scores) on the VVIQ were higher discounters. To further examine this relationship, an additional 106 participants completed the VVIQ and delay discounting task. In the total sample (n = 154), there was a significant negative correlation between VVIQ scores and discount rates, showing that individuals who are better visualizers are also higher discounters. Consistent with this relationship but again to our surprise, visualization training tended, albeit weakly, to increase discount rates, and those whose VVIQ decreased the most were those whose discount rates increased the most. These results suggest a novel association between visualization abilities and delay discounting.
Collapse
Affiliation(s)
- Trishala Parthasarathi
- Neuroscience Graduate Group, Department of Neuroscience, University of Pennsylvania, PhiladelphiaPA, USA
| | | | - Jeffrey Luery
- Department of Psychology, University of Pennsylvania, PhiladelphiaPA, USA
| | - Joseph W. Kable
- Neuroscience Graduate Group, Department of Neuroscience, University of Pennsylvania, PhiladelphiaPA, USA
- Department of Psychology, University of Pennsylvania, PhiladelphiaPA, USA
| |
Collapse
|
10
|
Hasson U. The neurobiology of uncertainty: implications for statistical learning. Philos Trans R Soc Lond B Biol Sci 2017; 372:20160048. [PMID: 27872367 PMCID: PMC5124074 DOI: 10.1098/rstb.2016.0048] [Citation(s) in RCA: 40] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/01/2016] [Indexed: 11/12/2022] Open
Abstract
The capacity for assessing the degree of uncertainty in the environment relies on estimating statistics of temporally unfolding inputs. This, in turn, allows calibration of predictive and bottom-up processing, and signalling changes in temporally unfolding environmental features. In the last decade, several studies have examined how the brain codes for and responds to input uncertainty. Initial neurobiological experiments implicated frontoparietal and hippocampal systems, based largely on paradigms that manipulated distributional features of visual stimuli. However, later work in the auditory domain pointed to different systems, whose activation profiles have interesting implications for computational and neurobiological models of statistical learning (SL). This review begins by briefly recapping the historical development of ideas pertaining to the sensitivity to uncertainty in temporally unfolding inputs. It then discusses several issues at the interface of studies of uncertainty and SL. Following, it presents several current treatments of the neurobiology of uncertainty and reviews recent findings that point to principles that serve as important constraints on future neurobiological theories of uncertainty, and relatedly, SL. This review suggests it may be useful to establish closer links between neurobiological research on uncertainty and SL, considering particularly mechanisms sensitive to local and global structure in inputs, the degree of input uncertainty, the complexity of the system generating the input, learning mechanisms that operate on different temporal scales and the use of learnt information for online prediction.This article is part of the themed issue 'New frontiers for statistical learning in the cognitive sciences'.
Collapse
Affiliation(s)
- Uri Hasson
- Center for Mind/Brain Sciences, The University of Trento, via delle Regole 101, Mattarello, TN 38123, Italy
| |
Collapse
|
11
|
Mulej Bratec S, Xie X, Wang Y, Schilbach L, Zimmer C, Wohlschläger AM, Riedl V, Sorg C. Cognitive emotion regulation modulates the balance of competing influences on ventral striatal aversive prediction error signals. Neuroimage 2016; 147:650-657. [PMID: 28040541 DOI: 10.1016/j.neuroimage.2016.12.078] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2016] [Revised: 12/02/2016] [Accepted: 12/28/2016] [Indexed: 11/25/2022] Open
Abstract
Cognitive emotion regulation (CER) is a critical human ability to face aversive emotional stimuli in a flexible way, via recruitment of specific prefrontal brain circuits. Animal research reveals a central role of ventral striatum in emotional behavior, for both aversive conditioning, with striatum signaling aversive prediction errors (aPE), and for integrating competing influences of distinct striatal inputs from regions such as the prefrontal cortex (PFC), amygdala, hippocampus and ventral tegmental area (VTA). Translating these ventral striatal findings from animal research to human CER, we hypothesized that successful CER would affect the balance of competing influences of striatal afferents on striatal aPE signals, in a way favoring PFC as opposed to 'subcortical' (i.e., non-isocortical) striatal inputs. Using aversive Pavlovian conditioning with and without CER during fMRI, we found that during CER, superior regulators indeed reduced the modulatory impact of 'subcortical' striatal afferents (hippocampus, amygdala and VTA) on ventral striatal aPE signals, while keeping the PFC impact intact. In contrast, inferior regulators showed an opposite pattern. Our results demonstrate that ventral striatal aPE signals and associated competing modulatory inputs are critical mechanisms underlying successful cognitive regulation of aversive emotions in humans.
Collapse
Affiliation(s)
- Satja Mulej Bratec
- Klinikum rechts der Isar, Technische Universität München, Department of Neuroradiology, Munich 81675, Germany; Klinikum rechts der Isar, Technische Universität München, TUM-NIC Neuroimaging Center, Munich 81675, Germany; Ludwig-Maximilians-Universität München, Graduate School of Systemic Neurosciences, Planegg-Martinsried 82152, Germany
| | - Xiyao Xie
- Klinikum rechts der Isar, Technische Universität München, Department of Neuroradiology, Munich 81675, Germany; Klinikum rechts der Isar, Technische Universität München, TUM-NIC Neuroimaging Center, Munich 81675, Germany; Ludwig-Maximilians-Universität München, Department of Psychology, Munich 80802, Germany
| | - Yijun Wang
- Klinikum rechts der Isar, Technische Universität München, TUM-NIC Neuroimaging Center, Munich 81675, Germany; Duke-NUS Graduate Medical School Singapore, Singapore 169857, Singapore
| | - Leonhard Schilbach
- Max Planck Institute of Psychiatry, Independent Max Planck Research Group Social Neuroscience, Munich 80804, Germany; University Hospital of Cologne, Department of Psychiatry, Cologne 50924, Germany
| | - Claus Zimmer
- Klinikum rechts der Isar, Technische Universität München, Department of Neuroradiology, Munich 81675, Germany
| | - Afra M Wohlschläger
- Klinikum rechts der Isar, Technische Universität München, Department of Neuroradiology, Munich 81675, Germany; Klinikum rechts der Isar, Technische Universität München, TUM-NIC Neuroimaging Center, Munich 81675, Germany
| | - Valentin Riedl
- Klinikum rechts der Isar, Technische Universität München, Department of Neuroradiology, Munich 81675, Germany; Klinikum rechts der Isar, Technische Universität München, TUM-NIC Neuroimaging Center, Munich 81675, Germany; Klinikum rechts der Isar, Technische Universität München, Department of Nuclear Medicine, Munich 81675, Germany
| | - Christian Sorg
- Klinikum rechts der Isar, Technische Universität München, Department of Neuroradiology, Munich 81675, Germany; Klinikum rechts der Isar, Technische Universität München, TUM-NIC Neuroimaging Center, Munich 81675, Germany; Klinikum rechts der Isar, Technische Universität München, Department of Psychiatry, Munich 81675, Germany.
| |
Collapse
|
12
|
Reinforcement learning models and their neural correlates: An activation likelihood estimation meta-analysis. COGNITIVE AFFECTIVE & BEHAVIORAL NEUROSCIENCE 2016; 15:435-59. [PMID: 25665667 DOI: 10.3758/s13415-015-0338-7] [Citation(s) in RCA: 114] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Reinforcement learning describes motivated behavior in terms of two abstract signals. The representation of discrepancies between expected and actual rewards/punishments-prediction error-is thought to update the expected value of actions and predictive stimuli. Electrophysiological and lesion studies have suggested that mesostriatal prediction error signals control behavior through synaptic modification of cortico-striato-thalamic networks. Signals in the ventromedial prefrontal and orbitofrontal cortex are implicated in representing expected value. To obtain unbiased maps of these representations in the human brain, we performed a meta-analysis of functional magnetic resonance imaging studies that had employed algorithmic reinforcement learning models across a variety of experimental paradigms. We found that the ventral striatum (medial and lateral) and midbrain/thalamus represented reward prediction errors, consistent with animal studies. Prediction error signals were also seen in the frontal operculum/insula, particularly for social rewards. In Pavlovian studies, striatal prediction error signals extended into the amygdala, whereas instrumental tasks engaged the caudate. Prediction error maps were sensitive to the model-fitting procedure (fixed or individually estimated) and to the extent of spatial smoothing. A correlate of expected value was found in a posterior region of the ventromedial prefrontal cortex, caudal and medial to the orbitofrontal regions identified in animal studies. These findings highlight a reproducible motif of reinforcement learning in the cortico-striatal loops and identify methodological dimensions that may influence the reproducibility of activation patterns across studies.
Collapse
|
13
|
Mulej Bratec S, Xie X, Schmid G, Doll A, Schilbach L, Zimmer C, Wohlschläger A, Riedl V, Sorg C. Cognitive emotion regulation enhances aversive prediction error activity while reducing emotional responses. Neuroimage 2015; 123:138-48. [PMID: 26306990 DOI: 10.1016/j.neuroimage.2015.08.038] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2015] [Revised: 08/12/2015] [Accepted: 08/14/2015] [Indexed: 11/24/2022] Open
Abstract
Cognitive emotion regulation is a powerful way of modulating emotional responses. However, despite the vital role of emotions in learning, it is unknown whether the effect of cognitive emotion regulation also extends to the modulation of learning. Computational models indicate prediction error activity, typically observed in the striatum and ventral tegmental area, as a critical neural mechanism involved in associative learning. We used model-based fMRI during aversive conditioning with and without cognitive emotion regulation to test the hypothesis that emotion regulation would affect prediction error-related neural activity in the striatum and ventral tegmental area, reflecting an emotion regulation-related modulation of learning. Our results show that cognitive emotion regulation reduced emotion-related brain activity, but increased prediction error-related activity in a network involving ventral tegmental area, hippocampus, insula and ventral striatum. While the reduction of response activity was related to behavioral measures of emotion regulation success, the enhancement of prediction error-related neural activity was related to learning performance. Furthermore, functional connectivity between the ventral tegmental area and ventrolateral prefrontal cortex, an area involved in regulation, was specifically increased during emotion regulation and likewise related to learning performance. Our data, therefore, provide first-time evidence that beyond reducing emotional responses, cognitive emotion regulation affects learning by enhancing prediction error-related activity, potentially via tegmental dopaminergic pathways.
Collapse
Affiliation(s)
- Satja Mulej Bratec
- Department of Neuroradiology, Klinikum rechts der Isar, Technische Universität München, 81675 Munich, Germany; TUM-NIC Neuroimaging Center, Klinikum rechts der Isar, Technische Universität München, 81675 Munich, Germany; Graduate School of Systemic Neurosciences, Ludwig-Maximilians-Universität München, 82152 Planegg-Martinsried, Germany.
| | - Xiyao Xie
- Department of Neuroradiology, Klinikum rechts der Isar, Technische Universität München, 81675 Munich, Germany; TUM-NIC Neuroimaging Center, Klinikum rechts der Isar, Technische Universität München, 81675 Munich, Germany; Department of Psychology, Ludwig-Maximilians-Universität München, 80802 Munich, Germany.
| | - Gabriele Schmid
- Department of Psychosomatics and Psychotherapy, Klinikum rechts der Isar, Technische Universität München, 81675 Munich, Germany.
| | - Anselm Doll
- Department of Neuroradiology, Klinikum rechts der Isar, Technische Universität München, 81675 Munich, Germany; TUM-NIC Neuroimaging Center, Klinikum rechts der Isar, Technische Universität München, 81675 Munich, Germany; Graduate School of Systemic Neurosciences, Ludwig-Maximilians-Universität München, 82152 Planegg-Martinsried, Germany.
| | - Leonhard Schilbach
- Department of Psychiatry, University Hospital Cologne, Cologne, Germany.
| | - Claus Zimmer
- Department of Neuroradiology, Klinikum rechts der Isar, Technische Universität München, 81675 Munich, Germany.
| | - Afra Wohlschläger
- Department of Neuroradiology, Klinikum rechts der Isar, Technische Universität München, 81675 Munich, Germany; TUM-NIC Neuroimaging Center, Klinikum rechts der Isar, Technische Universität München, 81675 Munich, Germany.
| | - Valentin Riedl
- Department of Neuroradiology, Klinikum rechts der Isar, Technische Universität München, 81675 Munich, Germany; TUM-NIC Neuroimaging Center, Klinikum rechts der Isar, Technische Universität München, 81675 Munich, Germany; Department of Nuclear Medicine, Klinikum rechts der Isar, Technische Universität München, 81675 Munich, Germany.
| | - Christian Sorg
- Department of Neuroradiology, Klinikum rechts der Isar, Technische Universität München, 81675 Munich, Germany; TUM-NIC Neuroimaging Center, Klinikum rechts der Isar, Technische Universität München, 81675 Munich, Germany; Department of Psychiatry, Klinikum rechts der Isar, Technische Universität München, 81675 Munich, Germany.
| |
Collapse
|
14
|
Biskup CS, Gaber T, Helmbold K, Bubenzer-Busch S, Zepf FD. Amino acid challenge and depletion techniques in human functional neuroimaging studies: an overview. Amino Acids 2015; 47:651-83. [DOI: 10.1007/s00726-015-1919-z] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2014] [Accepted: 01/09/2015] [Indexed: 01/16/2023]
|
15
|
van Holst RJ, Clark L, Veltman DJ, van den Brink W, Goudriaan AE. Enhanced striatal responses during expectancy coding in alcohol dependence. Drug Alcohol Depend 2014; 142:204-8. [PMID: 25012896 DOI: 10.1016/j.drugalcdep.2014.06.019] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/15/2013] [Revised: 06/13/2014] [Accepted: 06/14/2014] [Indexed: 02/07/2023]
Abstract
BACKGROUND Individuals with alcohol dependence are known to make disadvantageous decisions, possibly caused by alterations in either reward or punishment sensitivity, which lead to persistent alcohol use despite its adverse consequences. Previous studies in alcohol dependence have mainly focused on reward anticipation processing and results from these studies are mixed. To clarify the nature of the motivational deficit that underlies disadvantageous choice in alcohol dependence, the current study sought to characterize the neural representation of expected value in individuals with alcohol dependence, separating expectancy-related processing of gains and losses, as a function of outcome magnitude and outcome probability. METHOD Functional MRI was used to examine brain responses during the expectation of gains and losses in patients with alcohol dependence (n=19) and healthy controls (n=19). The task manipulated outcome magnitude (€1 and €5) and outcome probability (30% and 70%). RESULTS Compared to healthy controls, patients with alcohol dependence were more responsive to the expectancy of large wins, in the caudate and putamen. This effect was driven by a higher caudate activity in the contrast comparing €5 vs. €1 trials in patients with alcohol dependence. There were no group differences in the responses to the expectancy for loss. The patient group reported lower expectancies of winning in the trial-by-trial ratings. CONCLUSIONS Patients with alcohol dependence showed caudate hyperactivity when expecting wins. The result contrasts with past work using the monetary incentive delay task, showing caudate hypoactivity; the passive nature of our task contrasts with an active response requirement in the MIDT studies.
Collapse
Affiliation(s)
- Ruth J van Holst
- Donders Institute for Cognition, Brain and Behaviour, Radboud University, Kapittelweg 29, 6525 EN Nijmegen, The Netherlands.
| | - Luke Clark
- Department of Psychology, University of Cambridge, Downing Street, CB2 3EB Cambridge, United Kingdom
| | - Dick J Veltman
- Department of Psychiatry, VU University Medical Center, AJ Ernststraat 1187, 1081 HL Amsterdam, The Netherlands
| | - Wim van den Brink
- Department of Psychiatry, Amsterdam Institute for Addiction Research, Academic Medical Center, Meibergdreef 5, 1100 DD Amsterdam, The Netherlands
| | - Anna E Goudriaan
- Department of Psychiatry, Amsterdam Institute for Addiction Research, Academic Medical Center, Meibergdreef 5, 1100 DD Amsterdam, The Netherlands; Arkin Mental Health Institute, Klaprozenweg 111, 1033 NN Amsterdam, The Netherlands
| |
Collapse
|
16
|
Abstract
The neural mechanisms that produce hallucinations and other psychotic symptoms remain unclear. Previous research suggests that deficits in predictive signals for learning, such as prediction error signals, may underlie psychotic symptoms, but the mechanism by which such deficits produce psychotic symptoms remains to be established. We used model-based fMRI to study sensory prediction errors in human patients with schizophrenia who report daily auditory verbal hallucinations (AVHs) and sociodemographically matched healthy control subjects. We manipulated participants' expectations for hearing speech at different periods within a speech decision-making task. Patients activated a voice-sensitive region of the auditory cortex while they experienced AVHs in the scanner and displayed a concomitant deficit in prediction error signals in a similar portion of auditory cortex. This prediction error deficit correlated strongly with increased activity during silence and with reduced volumes of the auditory cortex, two established neural phenotypes of AVHs. Furthermore, patients with more severe AVHs had more deficient prediction error signals and greater activity during silence within the region of auditory cortex where groups differed, regardless of the severity of psychotic symptoms other than AVHs. Our findings suggest that deficient predictive coding accounts for the resting hyperactivity in sensory cortex that leads to hallucinations.
Collapse
|
17
|
Meffert H, Brislin SJ, White SF, Blair JR. Prediction errors to emotional expressions: the roles of the amygdala in social referencing. Soc Cogn Affect Neurosci 2014; 10:537-44. [PMID: 24939872 DOI: 10.1093/scan/nsu085] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2013] [Accepted: 06/13/2014] [Indexed: 11/13/2022] Open
Abstract
Social referencing paradigms in humans and observational learning paradigms in animals suggest that emotional expressions are important for communicating valence. It has been proposed that these expressions initiate stimulus-reinforcement learning. Relatively little is known about the role of emotional expressions in reinforcement learning, particularly in the context of social referencing. In this study, we examined object valence learning in the context of a social referencing paradigm. Participants viewed objects and faces that turned toward the objects and displayed a fearful, happy or neutral reaction to them, while judging the gender of these faces. Notably, amygdala activation was larger when the expressions following an object were less expected. Moreover, when asked, participants were both more likely to want to approach, and showed stronger amygdala responses to, objects associated with happy relative to objects associated with fearful expressions. This suggests that the amygdala plays two roles in social referencing: (i) initiating learning regarding the valence of an object as a function of prediction errors to expressions displayed toward this object and (ii) orchestrating an emotional response to the object when value judgments are being made regarding this object.
Collapse
Affiliation(s)
- Harma Meffert
- Section of Affective and Cognitive Neuroscience, National Institutes of Health, Bethesda, MD 20892, and Clinical Psychology Program, Florida State University, Tallahassee, FL 32306
| | - Sarah J Brislin
- Section of Affective and Cognitive Neuroscience, National Institutes of Health, Bethesda, MD 20892, and Clinical Psychology Program, Florida State University, Tallahassee, FL 32306
| | - Stuart F White
- Section of Affective and Cognitive Neuroscience, National Institutes of Health, Bethesda, MD 20892, and Clinical Psychology Program, Florida State University, Tallahassee, FL 32306
| | - James R Blair
- Section of Affective and Cognitive Neuroscience, National Institutes of Health, Bethesda, MD 20892, and Clinical Psychology Program, Florida State University, Tallahassee, FL 32306
| |
Collapse
|
18
|
Gariépy JF, Watson KK, Du E, Xie DL, Erb J, Amasino D, Platt ML. Social learning in humans and other animals. Front Neurosci 2014; 8:58. [PMID: 24765063 PMCID: PMC3982061 DOI: 10.3389/fnins.2014.00058] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2013] [Accepted: 03/13/2014] [Indexed: 01/25/2023] Open
Abstract
Decisions made by individuals can be influenced by what others think and do. Social learning includes a wide array of behaviors such as imitation, observational learning of novel foraging techniques, peer or parental influences on individual preferences, as well as outright teaching. These processes are believed to underlie an important part of cultural variation among human populations and may also explain intraspecific variation in behavior between geographically distinct populations of animals. Recent neurobiological studies have begun to uncover the neural basis of social learning. Here we review experimental evidence from the past few decades showing that social learning is a widespread set of skills present in multiple animal species. In mammals, the temporoparietal junction, the dorsomedial, and dorsolateral prefrontal cortex, as well as the anterior cingulate gyrus, appear to play critical roles in social learning. Birds, fish, and insects also learn from others, but the underlying neural mechanisms remain poorly understood. We discuss the evolutionary implications of these findings and highlight the importance of emerging animal models that permit precise modification of neural circuit function for elucidating the neural basis of social learning.
Collapse
Affiliation(s)
- Jean-François Gariépy
- Department of Neurobiology, Center for Cognitive Neuroscience and Duke Institute for Brain Sciences, Duke University Durham, NC, USA
| | - Karli K Watson
- Department of Neurobiology, Center for Cognitive Neuroscience and Duke Institute for Brain Sciences, Duke University Durham, NC, USA
| | - Emily Du
- Department of Neurobiology, Center for Cognitive Neuroscience and Duke Institute for Brain Sciences, Duke University Durham, NC, USA
| | - Diana L Xie
- Department of Neurobiology, Center for Cognitive Neuroscience and Duke Institute for Brain Sciences, Duke University Durham, NC, USA
| | - Joshua Erb
- Department of Neurobiology, Center for Cognitive Neuroscience and Duke Institute for Brain Sciences, Duke University Durham, NC, USA
| | - Dianna Amasino
- Department of Neurobiology, Center for Cognitive Neuroscience and Duke Institute for Brain Sciences, Duke University Durham, NC, USA
| | - Michael L Platt
- Department of Neurobiology, Center for Cognitive Neuroscience and Duke Institute for Brain Sciences, Duke University Durham, NC, USA ; Department of Biological Anthropology, Duke University Durham, NC, USA
| |
Collapse
|
19
|
Seger CA, Peterson EJ. Categorization = decision making + generalization. Neurosci Biobehav Rev 2013; 37:1187-200. [PMID: 23548891 PMCID: PMC3739997 DOI: 10.1016/j.neubiorev.2013.03.015] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2012] [Revised: 03/21/2013] [Accepted: 03/22/2013] [Indexed: 11/22/2022]
Abstract
We rarely, if ever, repeatedly encounter exactly the same situation. This makes generalization crucial for real world decision making. We argue that categorization, the study of generalizable representations, is a type of decision making, and that categorization learning research would benefit from approaches developed to study the neuroscience of decision making. Similarly, methods developed to examine generalization and learning within the field of categorization may enhance decision making research. We first discuss perceptual information processing and integration, with an emphasis on accumulator models. We then examine learning the value of different decision making choices via experience, emphasizing reinforcement learning modeling approaches. Next we discuss how value is combined with other factors in decision making, emphasizing the effects of uncertainty. Finally, we describe how a final decision is selected via thresholding processes implemented by the basal ganglia and related regions. We also consider how memory related functions in the hippocampus may be integrated with decision making mechanisms and contribute to categorization.
Collapse
Affiliation(s)
- Carol A Seger
- Department of Psychology, Colorado State University Fort Collins, CO 80523, USA.
| | | |
Collapse
|
20
|
Wilson RC, Nassar MR, Gold JI. A mixture of delta-rules approximation to bayesian inference in change-point problems. PLoS Comput Biol 2013; 9:e1003150. [PMID: 23935472 PMCID: PMC3723502 DOI: 10.1371/journal.pcbi.1003150] [Citation(s) in RCA: 75] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2012] [Accepted: 06/06/2013] [Indexed: 11/19/2022] Open
Abstract
Error-driven learning rules have received considerable attention because of their close relationships to both optimal theory and neurobiological mechanisms. However, basic forms of these rules are effective under only a restricted set of conditions in which the environment is stable. Recent studies have defined optimal solutions to learning problems in more general, potentially unstable, environments, but the relevance of these complex mathematical solutions to how the brain solves these problems remains unclear. Here, we show that one such Bayesian solution can be approximated by a computationally straightforward mixture of simple error-driven 'Delta' rules. This simpler model can make effective inferences in a dynamic environment and matches human performance on a predictive-inference task using a mixture of a small number of Delta rules. This model represents an important conceptual advance in our understanding of how the brain can use relatively simple computations to make nearly optimal inferences in a dynamic world.
Collapse
Affiliation(s)
- Robert C Wilson
- Princeton Neuroscience Institute, Princeton University, Princeton, New Jersey, United States of America.
| | | | | |
Collapse
|
21
|
Gluth S, Rieskamp J, Büchel C. Neural evidence for adaptive strategy selection in value-based decision-making. Cereb Cortex 2013; 24:2009-21. [PMID: 23476024 DOI: 10.1093/cercor/bht049] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
In everyday life, humans often encounter complex environments in which multiple sources of information can influence their decisions. We propose that in such situations, people select and apply different strategies representing different cognitive models of the decision problem. Learning advances by evaluating the success of using a strategy and eventually by switching between strategies. To test our strategy selection model, we investigated how humans solve a dynamic learning task with complex auditory and visual information, and assessed the underlying neural mechanisms with functional magnetic resonance imaging. Using the model, we were able to capture participants' choices and to successfully attribute expected values and reward prediction errors to activations in the dopaminoceptive system (e.g., ventral striatum [VS]) as well as decision conflict to signals in the anterior cingulate cortex. The model outperformed an alternative approach that did not update decision strategies, but the relevance of information itself. Activation of sensory areas depended on whether the selected strategy made use of the respective source of information. Selection of a strategy also determined how value-related information influenced effective connectivity between sensory systems and the VS. Our results suggest that humans can structure their search for and use of relevant information by adaptively selecting between decision strategies.
Collapse
Affiliation(s)
- Sebastian Gluth
- Department of Systems Neuroscience, University Medical Center Hamburg-Eppendorf, Hamburg D-20246, Germany and
| | - Jörg Rieskamp
- Department of Psychology, University of Basel, Basel CH-4055, Switzerland
| | - Christian Büchel
- Department of Systems Neuroscience, University Medical Center Hamburg-Eppendorf, Hamburg D-20246, Germany and
| |
Collapse
|
22
|
Bornstein AM, Daw ND. Dissociating hippocampal and striatal contributions to sequential prediction learning. Eur J Neurosci 2013; 35:1011-23. [PMID: 22487032 DOI: 10.1111/j.1460-9568.2011.07920.x] [Citation(s) in RCA: 91] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Behavior may be generated on the basis of many different kinds of learned contingencies. For instance, responses could be guided by the direct association between a stimulus and response, or by sequential stimulus-stimulus relationships (as in model-based reinforcement learning or goal-directed actions). However, the neural architecture underlying sequential predictive learning is not well understood, in part because it is difficult to isolate its effect on choice behavior. To track such learning more directly, we examined reaction times (RTs) in a probabilistic sequential picture identification task in healthy individuals. We used computational learning models to isolate trial-by-trial effects of two distinct learning processes in behavior, and used these as signatures to analyse the separate neural substrates of each process. RTs were best explained via the combination of two delta rule learning processes with different learning rates. To examine neural manifestations of these learning processes, we used functional magnetic resonance imaging to seek correlates of time-series related to expectancy or surprise. We observed such correlates in two regions, hippocampus and striatum. By estimating the learning rates best explaining each signal, we verified that they were uniquely associated with one of the two distinct processes identified behaviorally. These differential correlates suggest that complementary anticipatory functions drive each region's effect on behavior. Our results provide novel insights as to the quantitative computational distinctions between medial temporal and basal ganglia learning networks and enable experiments that exploit trial-by-trial measurement of the unique contributions of both hippocampus and striatum to response behavior.
Collapse
Affiliation(s)
- Aaron M Bornstein
- Department of Psychology, New York University, 4 Washington Pl. Suite 888, New York, NY 10003, USA.
| | | |
Collapse
|
23
|
Boll S, Gamer M, Gluth S, Finsterbusch J, Büchel C. Separate amygdala subregions signal surprise and predictiveness during associative fear learning in humans. Eur J Neurosci 2012; 37:758-67. [PMID: 23278978 DOI: 10.1111/ejn.12094] [Citation(s) in RCA: 73] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2012] [Revised: 10/31/2012] [Accepted: 11/16/2012] [Indexed: 11/30/2022]
Abstract
It has recently been suggested that learning signals in the amygdala might be best characterized by attentional theories of associative learning [such as Pearce-Hall (PH)] and more recent hybrid variants that combine Rescorla-Wagner and PH learning models. In these models, unsigned prediction errors (PEs) determine the associability of a cue, which is used in turn to control learning of outcome expectations dynamically and reflects a function of the reliability of prior outcome predictions. Here, we employed an aversive Pavlovian reversal-learning task to investigate computational signals derived from such a hybrid model. Unlike previous accounts, our paradigm allowed for the separate assessment of associability at the time of cue presentation and PEs at the time of outcome. We combined this approach with high-resolution functional magnetic resonance imaging to understand how different subregions of the human amygdala contribute to associative learning. Signal changes in the corticomedial amygdala and in the midbrain represented unsigned PEs at the time of outcome showing increased responses irrespective of whether a shock was unexpectedly administered or omitted. In contrast, activity in basolateral amygdala regions correlated negatively with associability at the time of cue presentation. Thus, whereas the corticomedial amygdala and the midbrain reflected immediate surprise, the basolateral amygdala represented predictiveness and displayed increased responses when outcome predictions became more reliable. These results extend previous findings on PH-like mechanisms in the amygdala and provide unique insights into human amygdala circuits during associative learning.
Collapse
Affiliation(s)
- Sabrina Boll
- Department of Systems Neuroscience, University Medical Center Hamburg-Eppendorf, Martinistrasse 52, Building W34, D-20246, Hamburg, Germany.
| | | | | | | | | |
Collapse
|
24
|
Eippert F, Gamer M, Büchel C. Neurobiological mechanisms underlying the blocking effect in aversive learning. J Neurosci 2012; 32:13164-76. [PMID: 22993433 PMCID: PMC6621462 DOI: 10.1523/jneurosci.1210-12.2012] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2012] [Revised: 07/19/2012] [Accepted: 07/30/2012] [Indexed: 11/21/2022] Open
Abstract
Current theories of classical conditioning assume that learning depends on the predictive relationship between events, not just on their temporal contiguity. Here we employ the classic experiment substantiating this reasoning-the blocking paradigm-in combination with functional magnetic resonance imaging (fMRI) to investigate whether human amygdala responses in aversive learning conform to these assumptions. In accordance with blocking, we demonstrate that significantly stronger behavioral and amygdala responses are evoked by conditioned stimuli that are predictive of the unconditioned stimulus than by conditioned stimuli that have received the same pairing with the unconditioned stimulus, yet have no predictive value. When studying the development of this effect, we not only observed that it was related to the strength of previous conditioned responses, but also that predictive compared with nonpredictive conditioned stimuli received more overt attention, as measured by fMRI-concurrent eye tracking, and that this went along with enhanced amygdala responses. We furthermore observed that prefrontal regions play a role in the development of the blocking effect: ventromedial prefrontal cortex (subgenual anterior cingulate) only exhibited responses when conditioned stimuli had to be established as nonpredictive for an outcome, whereas dorsolateral prefrontal cortex also showed responses when conditioned stimuli had to be established as predictive. Most importantly, dorsolateral prefrontal cortex connectivity to amygdala flexibly switched between positive and negative coupling, depending on the requirements posed by predictive relationships. Together, our findings highlight the role of predictive value in explaining amygdala responses and identify mechanisms that shape these responses in human fear conditioning.
Collapse
Affiliation(s)
- Falk Eippert
- Department of Systems Neuroscience, University Medical Center Hamburg-Eppendorf, 20246 Hamburg, Germany.
| | | | | |
Collapse
|
25
|
Hindi Attar C, Finckh B, Büchel C. The influence of serotonin on fear learning. PLoS One 2012; 7:e42397. [PMID: 22879964 PMCID: PMC3411733 DOI: 10.1371/journal.pone.0042397] [Citation(s) in RCA: 40] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2012] [Accepted: 07/04/2012] [Indexed: 11/18/2022] Open
Abstract
Learning of associations between aversive stimuli and predictive cues is the basis of Pavlovian fear conditioning and is driven by a mismatch between expectation and outcome. To investigate whether serotonin modulates the formation of such aversive cue-outcome associations, we used functional magnetic resonance imaging (fMRI) and dietary tryptophan depletion to reduce brain serotonin (5-HT) levels in healthy human subjects. In a Pavlovian fear conditioning paradigm, 5-HT depleted subjects compared to a non-depleted control group exhibited attenuated autonomic responses to cues indicating the upcoming of an aversive event. These results were closely paralleled by reduced aversive learning signals in the amygdala and the orbitofrontal cortex, two prominent structures of the neural fear circuit. In agreement with current theories of serotonin as a motivational opponent system to dopamine in fear learning, our data provide first empirical evidence for a role of serotonin in representing formally derived learning signals for aversive events.
Collapse
Affiliation(s)
- Catherine Hindi Attar
- Department of Systems Neuroscience, University Medical Center Hamburg-Eppendorf, Hamburg, Germany.
| | | | | |
Collapse
|
26
|
van Holst RJ, Veltman DJ, Büchel C, van den Brink W, Goudriaan AE. Distorted expectancy coding in problem gambling: is the addictive in the anticipation? Biol Psychiatry 2012; 71:741-8. [PMID: 22342105 DOI: 10.1016/j.biopsych.2011.12.030] [Citation(s) in RCA: 111] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/28/2011] [Revised: 12/24/2011] [Accepted: 12/28/2011] [Indexed: 11/26/2022]
Abstract
BACKGROUND Pathologic gamblers are known to have abnormal neural responses associated with experiencing monetary wins and losses. However, neural responsiveness during reward and loss expectations in pathologic gamblers has not yet been investigated. METHODS We used a functional magnetic resonance imaging paradigm that allowed us to investigate the dissociable reward- and loss-related expectancies with various probabilities of winning or losing different amounts of money in 15 patients with problem gambling (PRGs) and 16 healthy control subjects (HCs). RESULTS Compared with HCs, PRGs showed stronger activation in the bilateral ventral striatum to 5 euro than to 1 euro trials. PRGs also showed more activation of the bilateral ventral striatum and left orbitofrontal cortex associated with gain-related expected value than HCs. In addition, regression analyses indicated a highly significant negative correlation between gambling severity scores and right amygdala activation associated with gain-related expected value coding. There were no group differences in brain activation for loss-related expected value. CONCLUSIONS PRGs show higher activity in the reward system during reward expectation than HCs, whereas we observed no difference between PRGs and HC in the loss value system. Furthermore, the negative relation between gambling severity and amygdala activation in gain expected value coding suggests that more severe PRGs are less likely to be risk aversive during gambling. Our study provides evidence that PRGs are characterized by abnormally increased reward expectancy coding, which may render them overoptimistic with regard to gambling outcomes.
Collapse
Affiliation(s)
- Ruth J van Holst
- Academic Medical Center, University of Amsterdam, Amsterdam, the Netherlands.
| | | | | | | | | |
Collapse
|
27
|
Abstract
Pavlovian fear conditioning is highly conserved across species, providing a powerful model of aversive learning. In rodents, fear memory is stored and reactivated under the influence of the amygdala. There is no evidence for an equivalent mechanism in primates, and an opposite mechanism is proposed whereby primate amygdala contributes only to an initial phase of aversive learning, subsequently ceding fear memory to extra-amygdalar regions. Here, we reexamine this question by exploiting human high-resolution functional magnetic resonance imaging in conjunction with multivariate methods. By assuming a sparse neural coding, we show it is possible, at an individual subject level, to discriminate responses to conditioned (CS+ and CS-) stimuli in both basolateral and centro-cortical amygdala nuclei. The strength of this discrimination increased over time and was tightly coupled to the behavioral expression of fear, consistent with an expression of a stable fear memory trace. These data highlight that the human basolateral and centro-cortical amygdala support initial learning as well more enduring fear memory storage. A sparse neuronal representation for fear, here revealed by multivariate pattern classification, resolves why an enduring memory trace has proven elusive in previous human studies.
Collapse
|
28
|
Shackman AJ, Salomons TV, Slagter HA, Fox AS, Winter JJ, Davidson RJ. The integration of negative affect, pain and cognitive control in the cingulate cortex. Nat Rev Neurosci 2011; 12:154-67. [PMID: 21331082 DOI: 10.1038/nrn2994] [Citation(s) in RCA: 1404] [Impact Index Per Article: 108.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
It has been argued that emotion, pain and cognitive control are functionally segregated in distinct subdivisions of the cingulate cortex. However, recent observations encourage a fundamentally different view. Imaging studies demonstrate that negative affect, pain and cognitive control activate an overlapping region of the dorsal cingulate--the anterior midcingulate cortex (aMCC). Anatomical studies reveal that the aMCC constitutes a hub where information about reinforcers can be linked to motor centres responsible for expressing affect and executing goal-directed behaviour. Computational modelling and other kinds of evidence suggest that this intimacy reflects control processes that are common to all three domains. These observations compel a reconsideration of the dorsal cingulate's contribution to negative affect and pain.
Collapse
Affiliation(s)
- Alexander J Shackman
- Department of Psychology, University of Wisconsin, Madison, Wisconsin, WI 53706, USA.
| | | | | | | | | | | |
Collapse
|
29
|
Iidaka T, Saito DN, Komeda H, Mano Y, Kanayama N, Osumi T, Ozaki N, Sadato N. Transient neural activation in human amygdala involved in aversive conditioning of face and voice. J Cogn Neurosci 2011; 22:2074-85. [PMID: 19803681 DOI: 10.1162/jocn.2009.21347] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]
Abstract
Elucidating the neural mechanisms involved in aversive conditioning helps find effective treatments for psychiatric disorders such as anxiety disorder and phobia. Previous studies using fMRI and human subjects have reported that the amygdala plays a role in this phenomenon. However, the noxious stimuli that were used as unconditioned stimuli in previous studies (e.g., electric shock) might have been ecologically invalid because we seldom encounter such stimuli in daily life. Therefore, we investigated whether a face stimulus could be conditioned by using a voice that had negative emotional valence and was collected from a real-life environment. A skin conductance response showed that healthy subjects were conditioned by using these stimuli. In an fMRI study, there was greater amygdala activation in response to the faces that had been paired with the voice than to those that had not. The right amygdala showed transient activity in the early stage of acquisition. A psychophysiological interaction analysis indicated that the subcortical pathway from the medial geniculate body to the amygdala played a role in conditioning. Modulation of the subcortical pathway by voice stimuli preceded the transient activity in the amygdala. The finding that an ecologically valid stimulus elicited the conditioning and amygdala response suggests that our brain is automatically processing unpleasant stimuli in daily life.
Collapse
Affiliation(s)
- Tetsuya Iidaka
- Department of Psychiatry, Graduate School of Medicine, Nagoya University, Nagoya, Japan.
| | | | | | | | | | | | | | | |
Collapse
|
30
|
Nakahara H, Kaveri S. Internal-time temporal difference model for neural value-based decision making. Neural Comput 2010; 22:3062-106. [PMID: 20858126 DOI: 10.1162/neco_a_00049] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]
Abstract
The temporal difference (TD) learning framework is a major paradigm for understanding value-based decision making and related neural activities (e.g., dopamine activity). The representation of time in neural processes modeled by a TD framework, however, is poorly understood. To address this issue, we propose a TD formulation that separates the time of the operator (neural valuation processes), which we refer to as internal time, from the time of the observer (experiment), which we refer to as conventional time. We provide the formulation and theoretical characteristics of this TD model based on internal time, called internal-time TD, and explore the possible consequences of the use of this model in neural value-based decision making. Due to the separation of the two times, internal-time TD computations, such as TD error, are expressed differently, depending on both the time frame and time unit. We examine this operator-observer problem in relation to the time representation used in previous TD models. An internal time TD value function exhibits the co-appearance of exponential and hyperbolic discounting at different delays in intertemporal choice tasks. We further examine the effects of internal time noise on TD error, the dynamic construction of internal time, and the modulation of internal time with the internal time hypothesis of serotonin function. We also relate the internal TD formulation to research on interval timing and subjective time.
Collapse
Affiliation(s)
- Hiroyuki Nakahara
- Laboratory for Integrated Theoretical Neuroscience, RIKEN Brain Science Institute,Wako, Saitama, 351-0198 Japan.
| | | |
Collapse
|
31
|
Neural correlates of aversive conditioning: development of a functional imaging paradigm for the investigation of anxiety disorders. Eur Arch Psychiatry Clin Neurosci 2010; 260:443-53. [PMID: 20148332 DOI: 10.1007/s00406-010-0099-9] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/27/2009] [Accepted: 01/11/2010] [Indexed: 10/19/2022]
Abstract
The purpose of the present study was to establish a short paradigm for the examination of classical aversive conditioning processes for application in patients with anxiety disorders. We measured behavioral, autonomic and neural correlates of the paradigm in healthy subjects, applying functional magnetic resonance imaging (fMRI) and measurement of skin conductance. Therefore, neutral visual stimuli were paired with an unpleasant white noise as unconditioned stimulus. Twenty healthy subjects performed three experimental phases of learning: familiarization, acquisition and extinction. Subjective ratings of valence and arousal after each phase of conditioning as well as skin conductance measurement indicated successful conditioning. During acquisition, fMRI results showed increased activation for the conditioned stimulus (CS+(unpaired)) when compared with the non-conditioned stimulus (CS-) in the right amygdala, the insulae, the anterior cingulate cortex and the parahippocampal gyrus, all regions known to be involved in emotional processing. In addition, a linearly decreasing activation in the right amygdala/hippocampus for the CS- across the acquisition phase was found. There were no significant differences between CS+ and CS- during extinction. In conclusion, the applicability of this paradigm for the evaluation of neural correlates in conditioning and extinction processes has been proven. Thus, we present a promising paradigm for the examination of the fear-circuit in patients with anxiety disorders and additionally effects of cognitive-behavioral interventions.
Collapse
|
32
|
Bromberg-Martin ES, Matsumoto M, Nakahara H, Hikosaka O. Multiple timescales of memory in lateral habenula and dopamine neurons. Neuron 2010; 67:499-510. [PMID: 20696385 DOI: 10.1016/j.neuron.2010.06.031] [Citation(s) in RCA: 63] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/08/2010] [Indexed: 01/10/2023]
Abstract
Midbrain dopamine neurons are thought to signal predictions about future rewards based on the memory of past rewarding experience. Little is known about the source of their reward memory and the factors that control its timescale. Here we recorded from dopamine neurons, as well as one of their sources of input, the lateral habenula, while animals predicted upcoming rewards based on the past reward history. We found that lateral habenula and dopamine neurons accessed two distinct reward memories: a short-timescale memory expressed at the start of the task and a near-optimal long-timescale memory expressed when a future reward outcome was revealed. The short- and long-timescale memories were expressed in different forms of reward-oriented eye movements. Our data show that the habenula-dopamine pathway contains multiple timescales of memory and provide evidence for their role in motivated behavior.
Collapse
|
33
|
Separate mechanisms for placebo and opiate analgesia? Pain 2010; 150:8-9. [DOI: 10.1016/j.pain.2010.03.010] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2010] [Accepted: 03/02/2010] [Indexed: 11/21/2022]
|
34
|
Bach DR, Daunizeau J, Friston KJ, Dolan RJ. Dynamic causal modelling of anticipatory skin conductance responses. Biol Psychol 2010; 85:163-70. [PMID: 20599582 PMCID: PMC2923733 DOI: 10.1016/j.biopsycho.2010.06.007] [Citation(s) in RCA: 66] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2010] [Revised: 06/15/2010] [Accepted: 06/17/2010] [Indexed: 11/09/2022]
Abstract
Anticipatory skin conductance responses [SCRs] are a widely used measure of aversive conditioning in humans. Here, we describe a dynamic causal model [DCM] of how anticipatory, evoked, and spontaneous skin conductance changes are generated by sudomotor nerve activity. Inversion of this model, using variational Bayes, provides a means of inferring the most likely sympathetic nerve activity, given observed skin conductance responses. In two fear conditioning experiments, we demonstrate the predictive validity of the DCM by showing it has greater sensitivity to the effects of conditioning, relative to alternative (conventional) response estimates. Furthermore, we establish face validity by showing that trial-by-trial estimates of anticipatory sudomotor activity are better predicted by formal learning models, relative to response estimates from peak-scoring approaches. The model furnishes a potentially powerful approach to characterising SCR that exploits knowledge about how these signals are generated.
Collapse
Affiliation(s)
- Dominik R Bach
- Wellcome Trust Centre for Neuroimaging, University College London, 12 Queen Square, London, United Kingdom.
| | | | | | | |
Collapse
|
35
|
Cohen MX. Individual differences and the neural representations of reward expectation and reward prediction error. Soc Cogn Affect Neurosci 2010; 2:20-30. [PMID: 17710118 PMCID: PMC1945222 DOI: 10.1093/scan/nsl021] [Citation(s) in RCA: 52] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2006] [Accepted: 08/08/2006] [Indexed: 11/14/2022] Open
Abstract
Reward expectation and reward prediction errors are thought to be critical for dynamic adjustments in decision-making and reward-seeking behavior, but little is known about their representation in the brain during uncertainty and risk-taking. Furthermore, little is known about what role individual differences might play in such reinforcement processes. In this study, it is shown behavioral and neural responses during a decision-making task can be characterized by a computational reinforcement learning model and that individual differences in learning parameters in the model are critical for elucidating these processes. In the fMRI experiment, subjects chose between high- and low-risk rewards. A computational reinforcement learning model computed expected values and prediction errors that each subject might experience on each trial. These outputs predicted subjects' trial-to-trial choice strategies and neural activity in several limbic and prefrontal regions during the task. Individual differences in estimated reinforcement learning parameters proved critical for characterizing these processes, because models that incorporated individual learning parameters explained significantly more variance in the fMRI data than did a model using fixed learning parameters. These findings suggest that the brain engages a reinforcement learning process during risk-taking and that individual differences play a crucial role in modeling this process.
Collapse
Affiliation(s)
- Michael X Cohen
- Department of Epilepsy, University of Bonn, Sigmund-Freud-Strasse 25, Bonn, Germany.
| |
Collapse
|
36
|
Schiller D, Delgado MR. Overlapping neural systems mediating extinction, reversal and regulation of fear. Trends Cogn Sci 2010; 14:268-76. [PMID: 20493762 DOI: 10.1016/j.tics.2010.04.002] [Citation(s) in RCA: 210] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2009] [Revised: 04/09/2010] [Accepted: 04/11/2010] [Indexed: 10/19/2022]
Abstract
Learned fear is a process allowing quick detection of associations between cues in the environment and prediction of imminent threat. Adaptive function in a changing environment, however, requires organisms to quickly update this learning and have the ability to hinder fear responses when predictions are no longer correct. Here we focus on three strategies that can modify conditioned fear, namely extinction, reversal and regulation of fear, and review their underlying neural mechanisms. By directly comparing neuroimaging data from three separate studies that employ each strategy, we highlight overlapping brain structures that comprise a general circuitry in the human brain. This circuitry potentially enables the flexible control of fear, regardless of the particular task demands.
Collapse
Affiliation(s)
- Daniela Schiller
- Center for Neural Science, New York University, New York, NY 10003, USA; Department of Psychology, New York University, New York, NY 10003, USA.
| | | |
Collapse
|
37
|
Gläscher JP, O'Doherty JP. Model‐based approaches to neuroimaging: combining reinforcement learning theory with fMRI data. WILEY INTERDISCIPLINARY REVIEWS. COGNITIVE SCIENCE 2010; 1:501-510. [DOI: 10.1002/wcs.57] [Citation(s) in RCA: 66] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Affiliation(s)
- Jan P. Gläscher
- California Institute of Technology, Division of the Humanities and Social Sciences, Pasadena, CA 91125, USA
| | - John P. O'Doherty
- California Institute of Technology, Division of the Humanities and Social Sciences, Pasadena, CA 91125, USA
- Trinity College, Institute of Neuroscience, Dublin, Ireland
| |
Collapse
|
38
|
Rodriguez PF. Using conditional maximization to determine hyperparameters in model-based fMRI. Neuroimage 2009; 50:472-8. [PMID: 20026226 DOI: 10.1016/j.neuroimage.2009.12.026] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2009] [Accepted: 12/04/2009] [Indexed: 10/20/2022] Open
Abstract
In model-based analysis of fMRI data, a neural or cognitive mathematical model of behavior is used to predict changes in fMRI activity. The model predictions are often applied as a parametric modulation of the main stimulus effect within the context of the general linear model (GLM). Using a mathematical model has become an important method for connecting fMRI signals to behavior because the model represents how stimulus processing leads to behavior, and the parametric modulation represents a specific test about the profile of stimulus-related fMRI activity (for review and discussion, see O'Doherty et al., 2007). However, in some cases the parameters of the mathematical model may be under-determined because there is a range of values that equally well account for behavior, or perhaps an exploratory analysis is desired. Thus, in order to fully gauge the applicability of some mathematical model it would be useful to understand how fMRI analysis depends on those parameters. Here, a conditional maximization procedure is developed to search for parameter values in the mathematical model as hyperparameters in the GLM. Simulations and analysis with real fMRI data show that conditional maximization is an effective and simple procedure for estimating hyperparameters. General recommendations and caveats for using hyperparameters in model-based fMRI analysis are also presented.
Collapse
Affiliation(s)
- Paul F Rodriguez
- Department of Radiology, University of San Diego, California, La Jolla, CA 92037, USA.
| |
Collapse
|
39
|
Markman AB, Beer JS, Grimm LR, Rein JR, Maddox WT. The optimal level of fuzz: Case studies in a methodology for psychological research. J EXP THEOR ARTIF IN 2009; 21:197-215. [PMID: 19756251 PMCID: PMC2743110 DOI: 10.1080/09528130903065380] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
Abstract
Cognitive Science research is hard to conduct, because researchers must take phenomena from the world and turn them into laboratory tasks for which a reasonable level of experimental control can be achieved. Consequently, research necessarily makes tradeoffs between internal validity (experimental control) and external validity (the degree to which a task represents behavior outside of the lab). Researchers are thus seeking the best possible tradeoff between these constraints, which we refer to as the optimal level of fuzz. We present two principles for finding the optimal level of fuzz, in research, and then illustrate these principles using research from motivation, individual differences, and cognitive neuroscience.
Collapse
|
40
|
Rodriguez PF. Stimulus-outcome learnability differentially activates anterior cingulate and hippocampus at feedback processing. Learn Mem 2009; 16:324-31. [DOI: 10.1101/lm.1191609] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
41
|
Diekhof EK, Falkai P, Gruber O. Functional neuroimaging of reward processing and decision-making: A review of aberrant motivational and affective processing in addiction and mood disorders. ACTA ACUST UNITED AC 2008; 59:164-84. [DOI: 10.1016/j.brainresrev.2008.07.004] [Citation(s) in RCA: 134] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2007] [Revised: 07/10/2008] [Accepted: 07/11/2008] [Indexed: 11/28/2022]
|
42
|
Boksem MA, Tops M, Kostermans E, De Cremer D. Sensitivity to punishment and reward omission: Evidence from error-related ERP components. Biol Psychol 2008; 79:185-92. [DOI: 10.1016/j.biopsycho.2008.04.010] [Citation(s) in RCA: 76] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2007] [Revised: 04/24/2008] [Accepted: 04/24/2008] [Indexed: 12/28/2022]
|
43
|
den Ouden HEM, Friston KJ, Daw ND, McIntosh AR, Stephan KE. A dual role for prediction error in associative learning. ACTA ACUST UNITED AC 2008; 19:1175-85. [PMID: 18820290 PMCID: PMC2665159 DOI: 10.1093/cercor/bhn161] [Citation(s) in RCA: 216] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Confronted with a rich sensory environment, the brain must learn statistical regularities across sensory domains to construct causal models of the world. Here, we used functional magnetic resonance imaging and dynamic causal modeling (DCM) to furnish neurophysiological evidence that statistical associations are learnt, even when task-irrelevant. Subjects performed an audio-visual target-detection task while being exposed to distractor stimuli. Unknown to them, auditory distractors predicted the presence or absence of subsequent visual distractors. We modeled incidental learning of these associations using a Rescorla–Wagner (RW) model. Activity in primary visual cortex and putamen reflected learning-dependent surprise: these areas responded progressively more to unpredicted, and progressively less to predicted visual stimuli. Critically, this prediction-error response was observed even when the absence of a visual stimulus was surprising. We investigated the underlying mechanism by embedding the RW model into a DCM to show that auditory to visual connectivity changed significantly over time as a function of prediction error. Thus, consistent with predictive coding models of perception, associative learning is mediated by prediction-error dependent changes in connectivity. These results posit a dual role for prediction-error in encoding surprise and driving associative plasticity.
Collapse
Affiliation(s)
- Hanneke E M den Ouden
- Wellcome Trust Centre for Neuroimaging, Institute of Neurology, University College London, 12 Queen Square, London WC1N 3BG, UK.
| | | | | | | | | |
Collapse
|
44
|
Abstract
Emotion plays a critical role in many contemporary accounts of decision making, but exactly what underlies its influence and how this is mediated in the brain remain far from clear. Here, we review behavioral studies that suggest that Pavlovian processes can exert an important influence over choice and may account for many effects that have traditionally been attributed to emotion. We illustrate how recent experiments cast light on the underlying structure of Pavlovian control and argue that generally this influence makes good computational sense. Corresponding neuroscientific data from both animals and humans implicate a central role for the amygdala through interactions with other brain areas. This yields a neurobiological account of emotion in which it may operate, often covertly, to optimize rather than corrupt economic choice.
Collapse
Affiliation(s)
- Ben Seymour
- Wellcome Trust Centre for Imaging Neuroscience, University College London, 12 Queen Square, London WC1N3BG, UK.
| | | |
Collapse
|
45
|
Blockade of endogenous opioid neurotransmission enhances acquisition of conditioned fear in humans. J Neurosci 2008; 28:5465-72. [PMID: 18495880 DOI: 10.1523/jneurosci.5336-07.2008] [Citation(s) in RCA: 71] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023] Open
Abstract
The endogenous opioid system is involved in fear learning in rodents, as opioid agonists attenuate and opioid antagonists facilitate the acquisition of conditioned fear. It has been suggested that an opioidergic signal, which is engaged through conditioning and acts inhibitory on unconditioned stimulus input, is the source of these effects. To clarify whether blockade of endogenous opioid neurotransmission enhances acquisition of conditioned fear in humans, and to elucidate the neural underpinnings of such an effect, we used functional magnetic resonance imaging in combination with behavioral recordings and a double-blind pharmacological intervention. All subjects underwent the same classical fear-conditioning paradigm, but subjects in the experimental group received the opioid antagonist naloxone before and during the experiment, in contrast to subjects in the control group, who received saline. Blocking endogenous opioid neurotransmission with naloxone led to more sustained responses to the unconditioned stimulus across trials, evident in both behavioral and blood oxygen level-dependent responses in pain responsive cortical regions. This effect was likely caused by naloxone blocking conditioned responses in a pain-inhibitory circuit involving opioid-rich areas such as the rostral anterior cingulate cortex, amygdala, and periaqueductal gray. Most importantly, naloxone enhanced the acquisition of fear on the behavioral level and changed the activation profile of the amygdala: whereas the control group showed rapidly decaying conditioned responses across trials, the naloxone group showed sustained conditioned responses in the amygdala. Together, these results demonstrate that in humans the endogenous opioid system has an inhibitory role in the acquisition of fear.
Collapse
|
46
|
Li J, McClure SM, King-Casas B, Read Montague P. Policy adjustment in a dynamic economic game. PLoS One 2006; 1:e103. [PMID: 17183636 PMCID: PMC1762366 DOI: 10.1371/journal.pone.0000103] [Citation(s) in RCA: 52] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2006] [Accepted: 11/18/2006] [Indexed: 11/19/2022] Open
Abstract
Making sequential decisions to harvest rewards is a notoriously difficult problem. One difficulty is that the real world is not stationary and the reward expected from a contemplated action may depend in complex ways on the history of an animal's choices. Previous functional neuroimaging work combined with principled models has detected brain responses that correlate with computations thought to guide simple learning and action choice. Those works generally employed instrumental conditioning tasks with fixed action-reward contingencies. For real-world learning problems, the history of reward-harvesting choices can change the likelihood of rewards collected by the same choices in the near-term future. We used functional MRI to probe brain and behavioral responses in a continuous decision-making task where reward contingency is a function of both a subject's immediate choice and his choice history. In these more complex tasks, we demonstrated that a simple actor-critic model can account for both the subjects' behavioral and brain responses, and identified a reward prediction error signal in ventral striatal structures active during these non-stationary decision tasks. However, a sudden introduction of new reward structures engages more complex control circuitry in the prefrontal cortex (inferior frontal gyrus and anterior insula) and is not captured by a simple actor-critic model. Taken together, these results extend our knowledge of reward-learning signals into more complex, history-dependent choice tasks. They also highlight the important interplay between striatum and prefrontal cortex as decision-makers respond to the strategic demands imposed by non-stationary reward environments more reminiscent of real-world tasks.
Collapse
Affiliation(s)
- Jian Li
- Human Neuroimaging Laboratory, Center for Theoretical Neuroscience, Department of Neuroscience, Baylor College of Medicine, Houston, Texas, United States of America
| | - Samuel M. McClure
- Human Neuroimaging Laboratory, Center for Theoretical Neuroscience, Department of Neuroscience, Baylor College of Medicine, Houston, Texas, United States of America
| | - Brooks King-Casas
- Human Neuroimaging Laboratory, Center for Theoretical Neuroscience, Department of Neuroscience, Baylor College of Medicine, Houston, Texas, United States of America
| | - P. Read Montague
- Human Neuroimaging Laboratory, Center for Theoretical Neuroscience, Department of Neuroscience, Baylor College of Medicine, Houston, Texas, United States of America
- Menninger Department of Psychiatry and Behavioral Sciences, Baylor College of Medicine, Houston, Texas, United States of America
- * To whom correspondence should be addressed. E-mail:
| |
Collapse
|
47
|
Yacubian J, Gläscher J, Schroeder K, Sommer T, Braus DF, Büchel C. Dissociable systems for gain- and loss-related value predictions and errors of prediction in the human brain. J Neurosci 2006; 26:9530-7. [PMID: 16971537 PMCID: PMC6674602 DOI: 10.1523/jneurosci.2915-06.2006] [Citation(s) in RCA: 345] [Impact Index Per Article: 19.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
Midbrain dopaminergic neurons projecting to the ventral striatum code for reward magnitude and probability during reward anticipation and then indicate the difference between actual and predicted outcome. It has been questioned whether such a common system for the prediction and evaluation of reward exists in humans. Using functional magnetic resonance imaging and a guessing task in two large cohorts, we are able to confirm ventral striatal responses coding both reward probability and magnitude during anticipation, permitting the local computation of expected value (EV). However, the ventral striatum only represented the gain-related part of EV (EV+). At reward delivery, the same area shows a reward probability and magnitude-dependent prediction error signal, best modeled as the difference between actual outcome and EV+. In contrast, loss-related expected value (EV-) and the associated prediction error was represented in the amygdala. Thus, the ventral striatum and the amygdala distinctively process the value of a prediction and subsequently compute a prediction error for gains and losses, respectively. Therefore, a homeostatic balance of both systems might be important for generating adequate expectations under uncertainty. Prevalence of either part might render expectations more positive or negative, which could contribute to the pathophysiology of mood disorders like major depression.
Collapse
Affiliation(s)
| | | | - Katrin Schroeder
- Psychiatry, NeuroImage Nord, University Medical Center Hamburg-Eppendorf, D-20246 Hamburg, Germany
| | | | - Dieter F. Braus
- Psychiatry, NeuroImage Nord, University Medical Center Hamburg-Eppendorf, D-20246 Hamburg, Germany
| | | |
Collapse
|
48
|
Hampton AN, Bossaerts P, O’Doherty JP. The role of the ventromedial prefrontal cortex in abstract state-based inference during decision making in humans. J Neurosci 2006; 26:8360-7. [PMID: 16899731 PMCID: PMC6673813 DOI: 10.1523/jneurosci.1010-06.2006] [Citation(s) in RCA: 350] [Impact Index Per Article: 19.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
Many real-life decision-making problems incorporate higher-order structure, involving interdependencies between different stimuli, actions, and subsequent rewards. It is not known whether brain regions implicated in decision making, such as the ventromedial prefrontal cortex (vmPFC), use a stored model of the task structure to guide choice (model-based decision making) or merely learn action or state values without assuming higher-order structure as in standard reinforcement learning. To discriminate between these possibilities, we scanned human subjects with functional magnetic resonance imaging while they performed a simple decision-making task with higher-order structure, probabilistic reversal learning. We found that neural activity in a key decision-making region, the vmPFC, was more consistent with a computational model that exploits higher-order structure than with simple reinforcement learning. These results suggest that brain regions, such as the vmPFC, use an abstract model of task structure to guide behavioral choice, computations that may underlie the human capacity for complex social interactions and abstract strategizing.
Collapse
|
49
|
Seymour B, O'Doherty JP, Koltzenburg M, Wiech K, Frackowiak R, Friston K, Dolan R. Opponent appetitive-aversive neural processes underlie predictive learning of pain relief. Nat Neurosci 2005; 8:1234-40. [PMID: 16116445 DOI: 10.1038/nn1527] [Citation(s) in RCA: 301] [Impact Index Per Article: 15.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2005] [Accepted: 08/02/2005] [Indexed: 11/08/2022]
Abstract
Termination of a painful or unpleasant event can be rewarding. However, whether the brain treats relief in a similar way as it treats natural reward is unclear, and the neural processes that underlie its representation as a motivational goal remain poorly understood. We used fMRI (functional magnetic resonance imaging) to investigate how humans learn to generate expectations of pain relief. Using a pavlovian conditioning procedure, we show that subjects experiencing prolonged experimentally induced pain can be conditioned to predict pain relief. This proceeds in a manner consistent with contemporary reward-learning theory (average reward/loss reinforcement learning), reflected by neural activity in the amygdala and midbrain. Furthermore, these reward-like learning signals are mirrored by opposite aversion-like signals in lateral orbitofrontal cortex and anterior cingulate cortex. This dual coding has parallels to 'opponent process' theories in psychology and promotes a formal account of prediction and expectation during pain.
Collapse
Affiliation(s)
- Ben Seymour
- Wellcome Department of Imaging Neuroscience, 12 Queen Square, London WC1N 3BG, UK.
| | | | | | | | | | | | | |
Collapse
|