1
|
Hart G, Burton TJ, Nolan CR, Balleine BW. Striatal dopamine release tracks the relationship between actions and their consequences. Cell Rep 2024; 43:113828. [PMID: 38386550 DOI: 10.1016/j.celrep.2024.113828] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2023] [Revised: 12/05/2023] [Accepted: 02/03/2024] [Indexed: 02/24/2024] Open
Abstract
The acquisition and performance of goal-directed actions has long been argued to depend on the integration of glutamatergic inputs to the posterior dorsomedial striatum (pDMS) under the modulatory influence of dopamine. Nevertheless, relatively little is known about the dynamics of striatal dopamine during goal-directed actions. To investigate this, we chronically recorded dopamine release in the pDMS as rats acquired two actions for distinct outcomes as these action-outcome associations were incremented and then subsequently degraded or reversed. We found that bilateral dopamine release scaled with action value, whereas the lateralized dopamine signal, i.e., the difference in dopamine release ipsilaterally and contralaterally to the direction of the goal-directed action, reflected the strength of the action-outcome association independently of changes in movement. Our results establish, therefore, that striatal dopamine activity during goal-directed action reflects both bilateral moment-to-moment changes in action value and the long-term action-outcome association.
Collapse
Affiliation(s)
- G Hart
- Decision Neuroscience Laboratory, School of Psychology, UNSW Sydney, Sydney, NSW, Australia
| | - T J Burton
- Decision Neuroscience Laboratory, School of Psychology, UNSW Sydney, Sydney, NSW, Australia
| | - C R Nolan
- Decision Neuroscience Laboratory, School of Psychology, UNSW Sydney, Sydney, NSW, Australia
| | - B W Balleine
- Decision Neuroscience Laboratory, School of Psychology, UNSW Sydney, Sydney, NSW, Australia.
| |
Collapse
|
2
|
Seabrook LT, Peterson CS, Noble D, Sobey M, Tayyab T, Kenney T, Judge AK, Armstrong M, Lin S, Borgland SL. Short- and Long-Term High-Fat Diet Exposure Differentially Alters Phasic and Tonic GABAergic Signaling onto Lateral Orbitofrontal Pyramidal Neurons. J Neurosci 2023; 43:8582-8595. [PMID: 37793910 PMCID: PMC10727176 DOI: 10.1523/jneurosci.0831-23.2023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2023] [Revised: 08/26/2023] [Accepted: 09/21/2023] [Indexed: 10/06/2023] Open
Abstract
The chronic consumption of caloric dense high-fat foods is a major contributor to increased body weight, obesity, and other chronic health conditions. The orbitofrontal cortex (OFC) is critical in guiding decisions about food intake and is altered with diet-induced obesity. Obese rodents have altered morphologic and synaptic electrophysiological properties in the lateral orbitofrontal cortex (lOFC). Yet the time course by which exposure to a high-fat diet (HFD) induces these changes is poorly understood. Here, male mice are exposed to either short-term (7 d) or long-term (90 d) HFD. Long-term HFD exposure increases body weight, and glucose signaling compared with short-term HFD or a standard control diet (SCD). Both short and long-term HFD exposure increased the excitability of lOFC pyramidal neurons. However, phasic and tonic GABAergic signaling was differentially altered depending on HFD exposure length, such that tonic GABAergic signaling was decreased with early exposure to the HFD and phasic signaling was changed with long-term diet exposure. Furthermore, alterations in the short-term diet exposure were transient, as removal of the diet restored electrophysiological characteristics similar to mice fed SCD, whereas long-term HFD electrophysiological changes were persistent and remained after HFD removal. Finally, we demonstrate that changes in reward devaluation occur early with diet exposure. Together, these results suggest that the duration of HFD exposure differentially alters lOFC function and provides mechanistic insights into the susceptibility of the OFC to impairments in outcome devaluation.SIGNIFICANCE STATEMENT This study provides mechanistic insight on the impact of short-term and long-term high-fat diet (HFD) exposure on GABAergic function in the lateral orbitofrontal cortex (lOFC), a region known to guide decision-making. We find short-term HFD exposure induces transient changes in firing and tonic GABA action on lOFC pyramidal neurons, whereas long-term HFD induces obesity and has lasting changes on firing, tonic GABA and inhibitory synaptic transmission onto lOFC neurons. Given that GABAergic signaling in the lOFC can influence decision-making around food, these results have important implications in present society as palatable energy dense foods are abundantly available.
Collapse
Affiliation(s)
- Lauren T Seabrook
- Department of Physiology and Pharmacology, Hotchkiss Brain Institute, The University of Calgary, Calgary, Alberta T2N 4N1, Canada
| | - Colleen S Peterson
- Department of Physiology and Pharmacology, Hotchkiss Brain Institute, The University of Calgary, Calgary, Alberta T2N 4N1, Canada
| | - Duncan Noble
- Department of Physiology and Pharmacology, Hotchkiss Brain Institute, The University of Calgary, Calgary, Alberta T2N 4N1, Canada
| | - Marissa Sobey
- Department of Physiology and Pharmacology, Hotchkiss Brain Institute, The University of Calgary, Calgary, Alberta T2N 4N1, Canada
| | - Temoor Tayyab
- Department of Physiology and Pharmacology, Hotchkiss Brain Institute, The University of Calgary, Calgary, Alberta T2N 4N1, Canada
| | - Tyra Kenney
- Department of Physiology and Pharmacology, Hotchkiss Brain Institute, The University of Calgary, Calgary, Alberta T2N 4N1, Canada
| | - Allap K Judge
- Department of Physiology and Pharmacology, Hotchkiss Brain Institute, The University of Calgary, Calgary, Alberta T2N 4N1, Canada
| | - Mataea Armstrong
- Department of Physiology and Pharmacology, Hotchkiss Brain Institute, The University of Calgary, Calgary, Alberta T2N 4N1, Canada
| | - Shihao Lin
- Department of Physiology and Pharmacology, Hotchkiss Brain Institute, The University of Calgary, Calgary, Alberta T2N 4N1, Canada
| | - Stephanie L Borgland
- Department of Physiology and Pharmacology, Hotchkiss Brain Institute, The University of Calgary, Calgary, Alberta T2N 4N1, Canada
| |
Collapse
|
3
|
van Timmeren T, de Wit S. Instant habits versus flexible tenacity: Do implementation intentions accelerate habit formation? Q J Exp Psychol (Hove) 2023; 76:2479-2492. [PMID: 36476147 PMCID: PMC10585941 DOI: 10.1177/17470218221147024] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2022] [Revised: 11/23/2022] [Accepted: 12/07/2022] [Indexed: 10/19/2023]
Abstract
Implementation intentions (strategic "if-then" plans) have been shown to support behaviour change. This may be achieved by mentally forming stimulus-response associations, thereby promoting habit formation. Does this deliberate attempt to instal "strategic automaticity" only offer advantages, or does it also come at the cost of reduced flexibility that characterises learnt habits? To investigate this, we tested healthy, young participants on a computerised instrumental learning task. Critically, we introduced implementation intentions ("if I see stimulus X, then I will respond") versus goal intentions ("for outcome Z, I will respond)" during instrumental acquisition, and subsequently assessed behavioural flexibility in an outcome-revaluation test. In Experiment 1, we conducted a between-subjects manipulation of strategic planning, and in Experiment 2, a within-subject manipulation. We hypothesised that implementation intentions would lead to strong stimulus-response associations and consequently impair performance when the signalled outcome value changed and therefore required a different response, while benefitting performance when the outcome value (and required response) remained the same. We found that implementation intentions supported instrumental learning, but impaired test performance overall (most robustly in Experiment 2), irrespective of whether the signalled outcome value had changed. We argue that this general detrimental effect of implementation intentions on test performance is likely a consequence of their negative effect on stimulus-outcome learning. Our findings warrant caution when applying if-then plans to situations where the agent does not already possess perfect knowledge of behavioural contingencies.While implementation intentions may support efficient and fast behavioural execution, this may come at the expense of behavioural flexibility.
Collapse
Affiliation(s)
- Tim van Timmeren
- Habit Lab, Department of Clinical Psychology, University of Amsterdam, Amsterdam, The Netherlands
- Amsterdam Brain and Cognition, University of Amsterdam, Amsterdam, The Netherlands
- Department of Social, Health and Organisational Psychology, Utrecht University, Utrecht, The Netherlands
| | - Sanne de Wit
- Habit Lab, Department of Clinical Psychology, University of Amsterdam, Amsterdam, The Netherlands
- Amsterdam Brain and Cognition, University of Amsterdam, Amsterdam, The Netherlands
| |
Collapse
|
4
|
Sood A, Richard JM. Sex-biased effects of outcome devaluation by sensory-specific satiety on Pavlovian-conditioned behavior. Front Behav Neurosci 2023; 17:1259003. [PMID: 37860163 PMCID: PMC10582633 DOI: 10.3389/fnbeh.2023.1259003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2023] [Accepted: 09/12/2023] [Indexed: 10/21/2023] Open
Abstract
Goal-directed behavior relies on accurate mental representations of the value of expected outcomes. Disruptions to this process are a central feature of several neuropsychiatric disorders, including addiction. Goal-directed behavior is most frequently studied using instrumental paradigms paired with outcome devaluation, but cue-evoked behaviors in Pavlovian settings can also be goal-directed and therefore sensitive to changes in outcome value. Emerging literature suggests that male and female rats may differ in the degree to which their Pavlovian-conditioned responses are goal-directed, but interpretation of these findings is complicated by the tendency of female and male rats to engage in distinct types of Pavlovian responses when trained with localizable cues. Here, we used outcome devaluation via sensory-specific satiety to assess the behavioral responses in male and female Long Evans rats trained to respond to an auditory CS (conditioned stimulus) in a Pavlovian-conditioning paradigm. We found that satiety-induced devaluation led to a decrease in behavioral responding to the reward-predictive CS, with males showing an effect on both port entry latency and probability and females showing an effect only on port entry probability. Overall, our results suggest that outcome devaluation affects Pavlovian-conditioned responses in both male and female rats, but that females may be less sensitive to outcome devaluation.
Collapse
Affiliation(s)
- Ankit Sood
- Department of Neuroscience, University of Minnesota, Minneapolis, MN, United States
- Medical Discovery Team on Addiction, University of Minnesota, Minneapolis, MN, United States
| | - Jocelyn M. Richard
- Department of Neuroscience, University of Minnesota, Minneapolis, MN, United States
- Medical Discovery Team on Addiction, University of Minnesota, Minneapolis, MN, United States
| |
Collapse
|
5
|
Sood A, Richard JM. Outcome devaluation by sensory-specific satiety alters Pavlovian-conditioned behavior in male and female rats. bioRxiv 2023:2023.07.05.547810. [PMID: 37461584 PMCID: PMC10349988 DOI: 10.1101/2023.07.05.547810] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/21/2023]
Abstract
Goal-directed behavior relies on accurate mental representations of the value of expected outcomes. Disruptions to this process are a central feature of several neuropsychiatric disorders, including addiction. Goal-directed behavior is most frequently studied using instrumental paradigms paired with outcome devaluation, but cue-evoked behaviors in Pavlovian settings can also be goal-directed and therefore sensitive to changes in outcome value. Emerging literature suggests that male and female rats may differ in the degree to which their Pavlovian-conditioned responses are goal-directed, but interpretation of these findings is complicated by the tendency of female and male rats to engage in distinct types of Pavlovian responses when trained with localizable cues. Here, we used outcome devaluation via sensory-specific satiety to assess the behavioral responses in male and female Long Evans rats trained to respond to an auditory CS (conditioned stimulus) in a Pavlovian-conditioning paradigm. We found that satiety-induced devaluation led to a decrease in behavioral responding to the reward-predictive CS, with males showing an effect on both port entry latency and probability and females showing an effect only on port entry probability. Overall, our results suggest that outcome devaluation affects Pavlovian-conditioned responses in both male and female rats, but that females may be less sensitive to outcome devaluation.
Collapse
Affiliation(s)
- Ankit Sood
- Department of Neuroscience, University of Minnesota, Minneapolis, MN, 55455
- Medical Discovery Team on Addiction, University of Minnesota, Minneapolis, MN, 55455
| | - Jocelyn M. Richard
- Department of Neuroscience, University of Minnesota, Minneapolis, MN, 55455
- Medical Discovery Team on Addiction, University of Minnesota, Minneapolis, MN, 55455
| |
Collapse
|
6
|
Bertran-Gonzalez J, Dinale C, Matamales M. Restoring the youthful state of striatal plasticity in aged mice re-enables cognitive control of action. Curr Biol 2023; 33:1997-2007.e5. [PMID: 37141886 DOI: 10.1016/j.cub.2023.04.020] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2022] [Revised: 03/21/2023] [Accepted: 04/12/2023] [Indexed: 05/06/2023]
Abstract
Multidisciplinary evidence suggests that the control of voluntary action arbitrates between two major forms of behavioral processing: cognitively guided (or goal directed) and autonomously guided (or habitual). Brain-state irregularities affecting the striatum-such as aging-commonly shift control toward the latter, although the responsible neural mechanisms remain unknown. Combining instrumental conditioning with cell-specific mapping and chemogenetics in striatal neurons, we explored strategies that invigorate goal-directed capacity in aged mice. We found that, under conditions favoring goal-directed control, aged animals resiliently expressed autonomously guided behavior, a response that was underpinned by a characteristic one-to-one functional engagement of the two main neuronal populations in the striatum-D1- and D2-dopamine receptor-expressing spiny projection neurons (SPNs). Chemogenetically induced desensitization of D2-SPN signaling in aged transgenic mice recapitulated the striatal plasticity state observed in young mice, an effect that shifted behavior toward vigorous, goal-directed action. Our findings contribute to the understanding of the neural bases of behavioral control and propose neural system interventions that enhance cognitive functioning in habit-prone brains.
Collapse
Affiliation(s)
- Jesus Bertran-Gonzalez
- Decision Neuroscience Laboratory, School of Psychology, University of New South Wales, Sydney, NSW 2052, Australia.
| | - Caroline Dinale
- Decision Neuroscience Laboratory, School of Psychology, University of New South Wales, Sydney, NSW 2052, Australia
| | - Miriam Matamales
- Decision Neuroscience Laboratory, School of Psychology, University of New South Wales, Sydney, NSW 2052, Australia.
| |
Collapse
|
7
|
de Carvalho G, Khoja S, Haile MT, Chen LY. Early life adversity impaired dorsal striatal synaptic transmission and behavioral adaptability to appropriate action selection in a sex-dependent manner. Front Synaptic Neurosci 2023; 15:1128640. [PMID: 37091877 PMCID: PMC10116150 DOI: 10.3389/fnsyn.2023.1128640] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2022] [Accepted: 02/10/2023] [Indexed: 04/25/2023] Open
Abstract
Early life adversity (ELA) is a major health burden in the United States, with 62% of adults reporting at least one adverse childhood experience. These experiences during critical stages of brain development can perturb the development of neural circuits that mediate sensory cue processing and behavioral regulation. Recent studies have reported that ELA impaired the maturation of dendritic spines on neurons in the dorsolateral striatum (DLS) but not in the dorsomedial striatum (DMS). The DMS and DLS are part of two distinct corticostriatal circuits that have been extensively implicated in behavioral flexibility by regulating and integrating action selection with the reward value of those actions. To date, no studies have investigated the multifaceted effects of ELA on aspects of behavioral flexibility that require alternating between different action selection strategies or higher-order cognitive processes, and the underlying synaptic transmission in corticostriatal circuitries. To address this, we employed whole-cell patch-clamp electrophysiology to assess the effects of ELA on synaptic transmission in the DMS and DLS. We also investigated the effects of ELA on the ability to update action control in response to outcome devaluation in an instrumental learning paradigm and reversal of action-outcome contingency in a water T-maze paradigm. At the circuit level, ELA decreased corticostriatal glutamate transmission in male but not in female mice. Interestingly, in DMS, glutamate transmission is decreased in male ELA mice, but increased in female ELA mice. ELA impaired the ability to update action control in response to reward devaluation in a context that promotes goal-directedness in male mice and induced deficits in reversal learning. Overall, our findings demonstrate the sex- and region-dependent effects of ELA on behavioral flexibility and underlying corticostriatal glutamate transmission. By establishing a link between ELA and circuit mechanisms underlying behavioral flexibility, our findings will begin to identify novel molecular mechanisms that can represent strategies for treating behavioral inflexibility in individuals who experienced early life traumatic incidents.
Collapse
Affiliation(s)
- Gregory de Carvalho
- Department of Anatomy & Neurobiology, School of Medicine, University of California, Irvine, Irvine, CA, United States
| | - Sheraz Khoja
- Department of Anatomy & Neurobiology, School of Medicine, University of California, Irvine, Irvine, CA, United States
| | - Mulatwa T Haile
- Department of Anatomy & Neurobiology, School of Medicine, University of California, Irvine, Irvine, CA, United States
| | - Lulu Y Chen
- Department of Anatomy & Neurobiology, School of Medicine, University of California, Irvine, Irvine, CA, United States
- UCI-Conte Center, UCI-NIMH, University of California, Irvine, Irvine, CA, United States
| |
Collapse
|
8
|
Keefer SE, Kochli DE, Calu DJ. Inactivation of the Basolateral Amygdala to Insular Cortex Pathway Makes Sign-Tracking Sensitive to Outcome Devaluation. eNeuro 2022; 9:ENEURO. [PMID: 36127135 DOI: 10.1523/ENEURO.0156-22.2022] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2022] [Revised: 08/26/2022] [Accepted: 09/11/2022] [Indexed: 12/15/2022] Open
Abstract
Goal-tracking (GT) rats are sensitive to Pavlovian outcome devaluation while sign-tracking (ST) rats are devaluation insensitive. During outcome devaluation, GT rats flexibly modify responding to cues based on the current value of the associated outcome. However, ST rats rigidly respond to cues regardless of the current outcome value. Prior work demonstrated disconnection of the basolateral amygdala (BLA) and anterior insular cortex (aIC) decreased both GT and ST behaviors. Given the role of these regions in appetitive motivation and behavioral flexibility, we predicted that disrupting BLA to aIC pathway during outcome devaluation would reduce flexibility in GT rats and reduce rigid appetitive motivation in ST rats. We inhibited the BLA to aIC pathway by infusing inhibitory DREADDs (hM4Di-mcherry) or control (mCherry) virus into the BLA and implanted cannulae into the aIC to inhibit BLA terminals using intracranial injections of clozapine N-oxide (CNO). After training, we used a within-subject satiety-induced outcome devaluation procedure in which we sated rats on training pellets (devalued condition) or homecage chow (valued condition). All rats received bilateral CNO infusions into the aIC before brief nonreinforced test sessions. Contrary to our hypothesis, BLA-IC inhibition did not interfere with devaluation sensitivity in GT rats but did make ST behaviors sensitive to devaluation. Intermediate rats showed the opposite effect, showing rigid responding to cues with BLA-aIC pathway inactivation. Together, these results demonstrate BLA-IC projections mediate tracking-specific Pavlovian devaluation sensitivity and highlights the importance of considering individual differences in Pavlovian approach when evaluating circuitry contributions to behavioral flexibility.
Collapse
|
9
|
Sommer S, Münster A, Fehrentz JA, Hauber W. Effects of Motivational Downshifts on Specific Pavlovian-Instrumental Transfer in Rats. Int J Neuropsychopharmacol 2022; 25:173-184. [PMID: 35043951 PMCID: PMC8929758 DOI: 10.1093/ijnp/pyab075] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/07/2021] [Revised: 10/19/2021] [Accepted: 11/05/2022] [Indexed: 12/15/2022] Open
Abstract
BACKGROUND Pavlovian stimuli predictive of appetitive outcomes can exert a powerful influence on the selection and initiation of action, a phenomenon termed outcome-selective Pavlovian-instrumental transfer (sPIT). Rodent studies suggest that sPIT is insensitive to motivational downshift induced by outcome devaluation, an effect that is, however, relatively underexplored. METHODS Here we examined in detail the effects of distinct shifts in motivation from hunger to a state of relative satiety on sPIT in rats. RESULTS A motivational downshift by outcome-specific devaluation immediately prior to testing markedly reduced overall lever responding and magazine entries but left intact the sPIT effect. A motivational downshift prior testing by (1) giving ad libitum rather than restricted access to maintenance diet in the home cage for 24 hours or by (2) a systemic blockade of hormone secretagogue receptor subtype 1A receptors to inhibit orexigenic actions of ghrelin both reduced overall lever responding and magazine entries. Moreover, these latter motivational downshifts reduced the sPIT effect; however, the sizes of the sPIT effects were still large. CONCLUSIONS Collectively, our rodent findings indicate that major effects of various motivational downshifts are overall inhibition of lever pressing and magazine approach, possibly reflecting reduced general motivation. The observed effects of motivational downshifts on sPIT have implications with regard to the role of general motivating effects in sPIT and to the contribution of Pavlovian-instrumental interactions to excessive food seeking as well as obesity in humans.
Collapse
Affiliation(s)
- Susanne Sommer
- Department of Neurobiology, University of Stuttgart, Stuttgart, Germany
| | - Alexandra Münster
- Systems Neurobiology Research Unit, University of Stuttgart, Stuttgart, Germany
| | - Jean-Alain Fehrentz
- IBMM, University Montpellier, CNRS, ENSCM, Faculty of Pharmacy, Montpellier, France
| | - Wolfgang Hauber
- Systems Neurobiology Research Unit, University of Stuttgart, Stuttgart, Germany,Correspondence: Wolfgang Hauber Prof. Dr., University of Stuttgart, Systems Neurobiology Research Unit, Pfaffenwaldring 57, D-70550 Stuttgart, Germany ()
| |
Collapse
|
10
|
Keefer SE, Bacharach SZ, Kochli DE, Chabot JM, Calu DJ. Effects of Limited and Extended Pavlovian Training on Devaluation Sensitivity of Sign- and Goal-Tracking Rats. Front Behav Neurosci 2020; 14:3. [PMID: 32116587 PMCID: PMC7010919 DOI: 10.3389/fnbeh.2020.00003] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2019] [Accepted: 01/07/2020] [Indexed: 12/30/2022] Open
Abstract
Individual differences in Pavlovian approach predict differences in devaluation sensitivity. Recent studies indicate goal-tracking (GT) rats are sensitive to outcome devaluation while sign-tracking (ST) rats are not. With extended training in Pavlovian lever autoshaping (PLA), GT rats display more lever-directed behavior, typical of ST rats, suggesting they may become insensitive to devaluation with more Pavlovian training experience. Here, we use a within-subject satiety-induced outcome devaluation procedure to test devaluation sensitivity after limited and extended PLA training in GT and ST rats. We trained rats in PLA to determine GT and ST groups. Then, we sated rats on either the training pellets (devalued condition) or homecage chow (valued condition) prior to brief non-reinforced test sessions after limited (sessions 5/6) and extended (sessions 17/18) PLA training. GT rats decreased conditioned responding under devalued relative to valued conditions after both limited and extended training, demonstrating they are sensitive to satiety devaluation regardless of the amount of PLA training. While ST rats were insensitive to satiety devaluation after limited training, their lever directed behavior became devaluation sensitive after extended training. To determine whether sign-tracking rats also displayed sensitivity to illness-induced outcome devaluation after extended training, we trained a separate cohort of rats in extended PLA and devalued the outcome with lithium chloride injections after pellet consumption in the homecage. ST rats failed to decrease conditioned responding after illness-induced outcome devaluation, while Non-ST rats (GT and intermediates) were sensitive to illness-induced outcome devaluation after extended training. Together, our results confirm devaluation sensitivity is stable in GT rats across training and devaluation approaches. Extended training unmasks devaluation sensitivity in ST rats after satiety, but not illness-induced devaluation, suggesting ST rats respond appropriately by decreasing responding to cues during state-dependent but not inference-based devaluation. The differences in behavioral flexibility across tracking groups and devaluation paradigms have translational relevance for the understanding state- vs. inference-based reward devaluation as it pertains to drug addiction vulnerability.
Collapse
Affiliation(s)
- Sara E Keefer
- Department of Anatomy and Neurobiology, University of Maryland School of Medicine, Baltimore, MD, United States
| | - Sam Z Bacharach
- Department of Anatomy and Neurobiology, University of Maryland School of Medicine, Baltimore, MD, United States.,Program in Neuroscience, University of Maryland School of Medicine, Baltimore, MD, United States
| | - Daniel E Kochli
- Department of Anatomy and Neurobiology, University of Maryland School of Medicine, Baltimore, MD, United States
| | - Jules M Chabot
- Neuroscience and Behavior Program, Wesleyan University, Middletown, CT, United States
| | - Donna J Calu
- Department of Anatomy and Neurobiology, University of Maryland School of Medicine, Baltimore, MD, United States.,Program in Neuroscience, University of Maryland School of Medicine, Baltimore, MD, United States
| |
Collapse
|
11
|
Schreiner DC, Renteria R, Gremel CM. Fractionating the all-or-nothing definition of goal-directed and habitual decision-making. J Neurosci Res 2019; 98:998-1006. [PMID: 31642551 DOI: 10.1002/jnr.24545] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2019] [Revised: 09/30/2019] [Accepted: 10/09/2019] [Indexed: 12/16/2022]
Abstract
Goal-directed and habitual decision-making are fundamental processes that support the ongoing adaptive behavior. There is a growing interest in examining their disruption in psychiatric disease, often with a focus on a disease shifting control from one process to the other, usually a shift from goal-directed to habitual control. However, several different experimental procedures can be used to probe whether decision-making is under goal-directed or habitual control, including outcome devaluation and contingency degradation. These different experimental procedures may recruit diverse behavioral and neural processes. Thus, there are potentially many opportunities for these disease phenotypes to manifest as alterations to both goal-directed and habitual controls. In this review, we highlight the examples of behavioral and neural circuit divergence and similarity, and suggest that interpretation based on behavioral processes recruited during testing may leave more room for goal-directed and habitual decision-making to coexist. Furthermore, this may improve our understanding of precisely what the involved neural mechanisms underlying aspects of goal-directed and habitual behavior are, as well as how disease affects behavior and these circuits.
Collapse
Affiliation(s)
- Drew C Schreiner
- Department of Psychology, University of California, San Diego, La Jolla, CA, USA
| | - Rafael Renteria
- Department of Psychology, University of California, San Diego, La Jolla, CA, USA
| | - Christina M Gremel
- Department of Psychology, University of California, San Diego, La Jolla, CA, USA.,Neurosciences Graduate Program, University of California, San Diego, La Jolla, CA, USA
| |
Collapse
|
12
|
Hinojosa-Aguayo I, González F. Affect-driven impulsivity impairs human action control and selection, as measured through Pavlovian instrumental transfer and outcome devaluation. Q J Exp Psychol (Hove) 2019; 73:537-554. [PMID: 31590606 DOI: 10.1177/1747021819883963] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Two experiments were designed to study the role of emotional impulsiveness in action control and selection, involving healthy young women participants. In Experiment 1, the effects of both outcome devaluation and Pavlovian-to-instrumental transfer (PIT) were assessed on instrumental responding. In Experiment 2, we further explored the effect of outcome devaluation on outcome-specific PIT. The role of emotional impulsivity, specifically negative urgency (NU), was also evaluated in both experiments using a self-reported measure (UPPS-P scale, Spanish short version). Experiment 1 showed both outcome devaluation and outcome-specific PIT effects, which were positively inter-correlated and negatively correlated with scores in NU. Experiment 2 found an effect of outcome devaluation on outcome-specific PIT, which was negatively correlated with scores on NU. These results highlight the relevance of considering individual differences in affect-driven impulsivity, specifically NU, when addressing failures in action control and selection (proneness to habit). Moreover, these findings suggest that, at least with the procedure used in these experiments, outcome-specific PIT may be based on a goal-directed process that is under the participant's control.
Collapse
Affiliation(s)
- Irene Hinojosa-Aguayo
- Department of Experimental Psychology, Universidad de Granada, Granada, Spain.,Mind, Brain, and Behaviour Research Centre (CIMCYC), Universidad de Granada, Granada, Spain
| | - Felisa González
- Department of Experimental Psychology, Universidad de Granada, Granada, Spain.,Mind, Brain, and Behaviour Research Centre (CIMCYC), Universidad de Granada, Granada, Spain
| |
Collapse
|
13
|
Abstract
The effect of stress on the balance between goal-directed behavior and stimulus-response habits has been demonstrated in a number of studies, but the extent to which stressful events that occur during development affect the balance between these systems later in life is less clear. Here, we examined whether individuals with a history of early-life stress (ELS) show a bias toward avoidance habits on an instrumental learning task as adults. Participants (N = 189 in Experiment 1 and N = 112 in Experiment 2) were undergraduate students at the University of California, Los Angeles. In Experiment 1, we hypothesized that a history of ELS and a longer training phase would be associated with greater avoidance habits. Participants learned to make button-press responses to visual stimuli in order to avoid aversive auditory outcomes. Following a training phase involving extensive practice of the responses, participants were tested for habitual responding using outcome devaluation. After completing the instrumental learning task, participants provided retrospective reports of stressful events they experienced during their first 16 years of life. We did not observe evidence for an effect of the length of training, but we did observe an effect of ELS, with greater stress predicting greater odds of performing the avoidance habit. In Experiment 2, we sought to replicate the effect of ELS observed in Experiment 1, and we also tested whether the presence of distraction during training would increase avoidance habit performance. We replicated the effect of ELS but we did not observe evidence of an effect of distraction. Taken together, these data lend support to the hypothesis that stress occurring during development can have lasting effects on the balance between goal-directed behavior and stimulus-response habits in humans. Enhancement of avoidance habits may help explain the higher levels of negative health outcomes such as heart and liver disease that have been observed in individuals with a history of ELS. Some of the negative health behaviors that contribute to these negative health outcomes, e.g., overeating and substance use, may be performed initially to avoid feelings of distress and then transition to being performed habitually.
Collapse
Affiliation(s)
- Tara K Patterson
- Department of Psychology, University of California, Los Angeles, Los Angeles, CA, United States
| | - Michelle G Craske
- Department of Psychology, University of California, Los Angeles, Los Angeles, CA, United States
| | - Barbara J Knowlton
- Department of Psychology, University of California, Los Angeles, Los Angeles, CA, United States
| |
Collapse
|
14
|
Abstract
A hallmark of habitual actions is that, once they are established, they become insensitive to changes in the values of action outcomes. In this article, we review empirical research that examined effects of posttraining changes in outcome values in outcome-selective Pavlovian-to-instrumental transfer (PIT) tasks. This review suggests that cue-instigated action tendencies in these tasks are not affected by weak and/or incomplete revaluation procedures (e.g., selective satiety) and substantially disrupted by a strong and complete devaluation of reinforcers. In a second part, we discuss two alternative models of a motivational control of habitual action: a default-interventionist framework and expected value of control theory. It is argued that the default-interventionist framework cannot solve the problem of an infinite regress (i.e., what controls the controller?). In contrast, expected value of control can explain control of habitual actions with local computations and feedback loops without (implicit) references to control homunculi. It is argued that insensitivity to changes in action outcomes is not an intrinsic design feature of habits but, rather, a function of the cognitive system that controls habitual action tendencies.
Collapse
Affiliation(s)
- Andreas B. Eder
- Department of Psychology, Julius Maximilian University of Würzburg, Würzburg, Germany
| | - David Dignath
- Department of Psychology, University of Freiburg, Freiburg, Germany
| |
Collapse
|
15
|
Abstract
This paper presents the Homeo-Heterostatic Value Gradients (HHVG) algorithm as a formal account on the constructive interplay between boredom and curiosity which gives rise to effective exploration and superior forward model learning. We offer an instrumental view of action selection, in which an action serves to disclose outcomes that have intrinsic meaningfulness to an agent itself. This motivated two central algorithmic ingredients: devaluation and devaluation progress, both underpin agent's cognition concerning intrinsically generated rewards. The two serve as an instantiation of homeostatic and heterostatic intrinsic motivation. A key insight from our algorithm is that the two seemingly opposite motivations can be reconciled-without which exploration and information-gathering cannot be effectively carried out. We supported this claim with empirical evidence, showing that boredom-enabled agents consistently outperformed other curious or explorative agent variants in model building benchmarks based on self-assisted experience accumulation.
Collapse
Affiliation(s)
- Yen Yu
- Araya, Inc., Tokyo, Japan
| | | | | |
Collapse
|
16
|
Panayi MC, Killcross S. Functional heterogeneity within the rodent lateral orbitofrontal cortex dissociates outcome devaluation and reversal learning deficits. eLife 2018; 7:37357. [PMID: 30044220 PMCID: PMC6101941 DOI: 10.7554/elife.37357] [Citation(s) in RCA: 39] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2018] [Accepted: 07/24/2018] [Indexed: 01/12/2023] Open
Abstract
The orbitofrontal cortex (OFC) is critical for updating reward-directed behaviours flexibly when outcomes are devalued or when task contingencies are reversed. Failure to update behaviour in outcome devaluation and reversal learning procedures are considered canonical deficits following OFC lesions in non-human primates and rodents. We examined the generality of these findings in rodents using lesions of the rodent lateral OFC (LO) in instrumental action-outcome and Pavlovian cue-outcome devaluation procedures. LO lesions disrupted outcome devaluation in Pavlovian but not instrumental procedures. Furthermore, although both anterior and posterior LO lesions disrupted Pavlovian outcome devaluation, only posterior LO lesions were found to disrupt reversal learning. Posterior but not anterior LO lesions were also found to disrupt the attribution of motivational value to Pavlovian cues in sign-tracking. These novel dissociable task- and subregion-specific effects suggest a way to reconcile contradictory findings between rodent and non-human primate OFC research.
Collapse
Affiliation(s)
- Marios C Panayi
- School of Psychology, The University of New South Wales, Kensington, Australia.,Department of Experimental Psychology, University of Oxford, Oxford, United Kingdom
| | - Simon Killcross
- School of Psychology, The University of New South Wales, Kensington, Australia
| |
Collapse
|
17
|
Hart G, Bradfield LA, Fok SY, Chieng B, Balleine BW. The Bilateral Prefronto-striatal Pathway Is Necessary for Learning New Goal-Directed Actions. Curr Biol 2018; 28:2218-2229.e7. [PMID: 30056856 DOI: 10.1016/j.cub.2018.05.028] [Citation(s) in RCA: 60] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2018] [Revised: 05/07/2018] [Accepted: 05/11/2018] [Indexed: 11/22/2022]
Abstract
The acquisition of new goal-directed actions requires the encoding of action-outcome associations. At a neural level, this encoding has been hypothesized to involve a prefronto-striatal circuit extending between the prelimbic cortex (PL) and the posterior dorsomedial striatum (pDMS); however, no research identifying this pathway with any precision has been reported. We started by mapping the prelimbic input to the dorsal and ventral striatum using a combination of retrograde and anterograde tracing with CLARITY and established that PL-pDMS projections share some overlap with projections to the nucleus accumbens core (NAc) in rats. We then tested whether each of these pathways were functionally required for goal-directed learning; we used a pathway-specific dual-virus chemogenetic approach to selectively silence pDMS-projecting or NAc-projecting PL neurons during instrumental training and tested rats for goal-directed action. We found that silencing PL-pDMS projections abolished goal-directed learning, whereas silencing PL-NAc projections left goal-directed learning intact. Finally, we used a three-virus approach to silence bilateral and contralateral pDMS-projecting PL neurons and again blocked goal-directed learning. These results establish that the acquisition of new goal-directed actions depends on the bilateral PL-pDMS pathway driven by intratelencephalic cortical neurons.
Collapse
|
18
|
Hart G, Bradfield LA, Balleine BW. Prefrontal Corticostriatal Disconnection Blocks the Acquisition of Goal-Directed Action. J Neurosci 2018; 38:1311-22. [PMID: 29301872 DOI: 10.1523/JNEUROSCI.2850-17.2017] [Citation(s) in RCA: 65] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2017] [Revised: 11/23/2017] [Accepted: 12/15/2017] [Indexed: 11/21/2022] Open
Abstract
The acquisition of goal-directed action requires encoding of the association between an action and its specific consequences or outcome. At a neural level, this encoding has been hypothesized to involve a prefrontal corticostriatal circuit involving the projection from the prelimbic cortex (PL) to the posterior dorsomedial striatum (pDMS); however, no direct evidence for this claim has been reported. In a series of experiments, we performed functional disconnection of this pathway using targeted lesions of the anterior corpus callosum to disrupt contralateral corticostriatal projections with asymmetrical lesions of the PL and/or pDMS to block plasticity in this circuit in rats. We first demonstrated that unilaterally blocking the PL input to the pDMS prevented the phosphorylation of extracellular signal-related kinase/mitogen activated protein kinase (pERK/pMAPK) induced by instrumental training. Next, we used a full bilateral disconnection of the PL from the pDMS and assessed goal-directed action using an outcome-devaluation test. Importantly, we found evidence that rats maintaining an ipsilateral and/or contralateral connection between the PL and the pDMS were able to acquire goal-directed actions. In contrast, bilateral PL-pDMS disconnection abolished the acquisition of goal-directed actions. Finally, we used a temporary pharmacological disconnection to disrupt PL inputs to the pDMS by infusing the NMDA antagonist dl-2-amino-5-phosphonopentanoic acid into the pDMS during instrumental training and found that this manipulation also disrupted goal-directed learning. These results establish that, in rats, the acquisition of new goal-directed actions depends on a prefrontal-corticostriatal circuit involving a connection between the PL and the pDMS.SIGNIFICANCE STATEMENT It has been hypothesized that the prelimbic cortex (PL) and posterior dorsomedial striatum (pDMS) in rodents interact in a corticostriatal circuit to mediate goal-directed learning. However, no direct evidence supporting this claim has been reported. Using targeted lesions, we performed functional disconnection of the PL-pDMS pathway to assess its role in goal-directed learning. In the first experiment, we demonstrated that PL input to the pDMS is necessary for instrumental training-induced neuronal activity. Next, we disrupted ipsilateral, contralateral, or bilateral PL-pDMS connections and found that only bilateral PL-pDMS disconnection disrupted the acquisition of goal-directed actions, a finding we replicated in our final study using a pharmacological disconnection procedure.
Collapse
|
19
|
Bradfield LA, Balleine BW. Thalamic Control of Dorsomedial Striatum Regulates Internal State to Guide Goal-Directed Action Selection. J Neurosci 2017; 37:3721-33. [PMID: 28242795 DOI: 10.1523/JNEUROSCI.3860-16.2017] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2016] [Revised: 02/02/2017] [Accepted: 02/13/2017] [Indexed: 11/21/2022] Open
Abstract
We (Bradfield et al., 2013) have demonstrated previously that parafascicular thalamic nucleus (PF)-controlled neurons in the posterior dorsomedial striatum (pDMS) are critical for interlacing new and existing action-outcome contingencies to control goal-directed action. Based on these findings, it was suggested that animals with a dysfunctional PF-pDMS pathway might suffer a deficit in creating or retrieving internal contexts or "states" on which such information could become conditional. To assess this hypothesis more directly, rats were given a disconnection treatment using contralateral cytotoxic lesions of the PF and pDMS (Group CONTRA) or ipsilateral control lesions (Group IPSI) and trained to press a right and left lever for sucrose and pellet outcomes, after which these contingencies were reversed. The rats were then given an outcome devaluation test (all experiments) and a test of outcome-specific reinstatement (Experiments 1 and 3). We found that devaluation performance was intact for both groups after training of initial contingencies, but impaired for Group CONTRA after reversal. However, performance was restored by additional reversal training. Furthermore, when tested a second time after reversal training, rats in both groups demonstrated responding in accordance with the original contingencies, providing direct evidence of modulation of action selection by state. Finally, we found that external context could substitute for internal state and so could rescue responding in Group CONTRA, but only in the reinstatement test. Together, these findings suggest that animals use internal state information to guide action selection and that this information is modulated by the PF-pDMS pathway.SIGNIFICANCE STATEMENT Individuals with Parkinson's disease dementia often suffer a characteristic deficit in "cognitive flexibility." It has been suggested that neurodegeneration in the pathway between the centromedian/parafascicular thalalmic nucleus (PF) and striatum might underlie such deficits (Smith et al., 2014). In rats, we have similarly observed that a functional disconnection of the PF-posterior dorsomedial striatal pathway produces a specific impairment in the ability to alter goal-directed actions (Bradfield et al., 2013). It was suggested that this impairment could be a result of a deficit in state modulation. Here, we present four experiments that provide evidence for this hypothesis and suggest several ways (e.g., extended practice, providing external cues) in which this state modulation can be rescued.
Collapse
|
20
|
Halbout B, Liu AT, Ostlund SB. A Closer Look at the Effects of Repeated Cocaine Exposure on Adaptive Decision-Making under Conditions That Promote Goal-Directed Control. Front Psychiatry 2016; 7:44. [PMID: 27047400 PMCID: PMC4800177 DOI: 10.3389/fpsyt.2016.00044] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/25/2015] [Accepted: 03/07/2016] [Indexed: 11/13/2022] Open
Abstract
It has been proposed that compulsive drug seeking reflects an underlying dysregulation in adaptive behavior that favors habitual (automatic and inflexible) over goal-directed (deliberative and highly flexible) action selection. Rodent studies have established that repeated exposure to cocaine or amphetamine facilitates the development of habits, producing behavior that becomes unusually insensitive to a reduction in the value of its outcome. The current study more directly investigated the effects of cocaine pre-exposure on goal-directed learning and action selection using an approach that discourages habitual performance. After undergoing a 15-day series of cocaine (15 or 30 mg/kg, i.p.) or saline injections and a drug withdrawal period, rats were trained to perform two different lever-press actions for distinct reward options. During a subsequent outcome devaluation test, both cocaine- and saline-treated rats showed a robust bias in their choice between the two actions, preferring whichever action had been trained with the reward that retained its value. Thus, it appears that the tendency for repeated cocaine exposure to promote habit formation does not extend to a more complex behavioral scenario that encourages goal-directed control. To further explore this issue, we assessed how prior cocaine treatment would affect the rats' ability to learn about a selective reduction in the predictive relationship between one of the two actions and its outcome, which is another fundamental feature of goal-directed behavior. Interestingly, we found that cocaine-treated rats showed enhanced, rather than diminished, sensitivity to this action-outcome contingency degradation manipulation. Given their mutual dependence on striatal dopamine signaling, we suggest that cocaine's effects on habit formation and contingency learning may stem from a common adaptation in this neurochemical system.
Collapse
Affiliation(s)
- Briac Halbout
- Department of Anesthesiology and Perioperative Care, School of Medicine, University of California Irvine, Irvine, CA, USA; UC Irvine Center for Addiction Neuroscience, Irvine, CA, USA
| | - Angela T Liu
- Department of Anesthesiology and Perioperative Care, School of Medicine, University of California Irvine, Irvine, CA, USA; UC Irvine Center for Addiction Neuroscience, Irvine, CA, USA
| | - Sean B Ostlund
- Department of Anesthesiology and Perioperative Care, School of Medicine, University of California Irvine, Irvine, CA, USA; UC Irvine Center for Addiction Neuroscience, Irvine, CA, USA
| |
Collapse
|
21
|
Abstract
Previous work has demonstrated that goal-directed control of alcohol-seeking and other drug-related behaviors is reduced following extended self-administration and drug exposure. Here, we examined how the magnitude of stimulus influences on responding changes across similar training and drug exposure. Rats self-administered alcohol or sucrose for 2 or 8 weeks. Previous work has shown that 8 weeks, but not 2 weeks of self-administration produces habitual alcohol seeking. Next, all animals received equivalent Pavlovian conditioning sessions where a discrete stimulus predicted the delivery of alcohol or sucrose. Finally, the impact of the stimuli on ongoing instrumental responding was examined in a Pavlovian-instrumental transfer (PIT) test. While a significant PIT effect was observed following 2 weeks of either alcohol or sucrose self-administration, the magnitude of this effect was greater following 8 weeks of training. The specificity of the PIT effect appeared unchanged by extended training. While it is well established that evaluation of the outcome of responding contributes less to behavioral control following extended training and/or drug exposure, our data indicate that reward-predictive stimuli have a stronger contribution to responding after extended training. Together, these findings provide insight into the factors that control behavior after extended drug use, which will be important for developing effective methods for controlling and ideally reducing these behaviors.
Collapse
Affiliation(s)
- Laura H Corbit
- School of Psychology, The University of Sydney , Sydney, NSW , Australia
| | - Patricia H Janak
- Department of Psychological and Brain Sciences, Krieger School of Arts and Sciences, Johns Hopkins University, Baltimore, MD, USA; Department of Neuroscience, Johns Hopkins School of Medicine, Johns Hopkins University, Baltimore, MD, USA
| |
Collapse
|
22
|
Nasser HM, Chen YW, Fiscella K, Calu DJ. Individual variability in behavioral flexibility predicts sign-tracking tendency. Front Behav Neurosci 2015; 9:289. [PMID: 26578917 PMCID: PMC4630296 DOI: 10.3389/fnbeh.2015.00289] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2015] [Accepted: 10/12/2015] [Indexed: 11/13/2022] Open
Abstract
Sign-tracking rats show heightened sensitivity to food- and drug-associated cues, which serve as strong incentives for driving reward seeking. We hypothesized that this enhanced incentive drive is accompanied by an inflexibility when incentive value changes. To examine this we tested rats in Pavlovian outcome devaluation or second-order conditioning prior to the assessment of sign-tracking tendency. To assess behavioral flexibility we trained rats to associate a light with a food outcome. After the food was devalued by pairing with illness, we measured conditioned responding (CR) to the light during an outcome devaluation probe test. The level of CR during outcome devaluation probe test correlated with the rats' subsequent tracking tendency, with sign-tracking rats failing to suppress CR to the light after outcome devaluation. To assess Pavlovian incentive learning, we trained rats on first-order (CS+, CS-) and second-order (SOCS+, SOCS-) discriminations. After second-order conditioning, we measured CR to the second-order cues during a probe test. Second-order conditioning was observed across all rats regardless of tracking tendency. The behavioral inflexibility of sign-trackers has potential relevance for understanding individual variation in vulnerability to drug addiction.
Collapse
Affiliation(s)
- Helen M Nasser
- Behavioral Neuroscience Research Branch, Intramural Research Program, National Institute on Drug Abuse, National Institutes of Health, Department of Health and Human Services Baltimore, MD, USA ; Anatomy and Neurobiology, University of Maryland School of Medicine Baltimore, MD, USA
| | - Yu-Wei Chen
- Behavioral Neuroscience Research Branch, Intramural Research Program, National Institute on Drug Abuse, National Institutes of Health, Department of Health and Human Services Baltimore, MD, USA
| | - Kimberly Fiscella
- Behavioral Neuroscience Research Branch, Intramural Research Program, National Institute on Drug Abuse, National Institutes of Health, Department of Health and Human Services Baltimore, MD, USA
| | - Donna J Calu
- Behavioral Neuroscience Research Branch, Intramural Research Program, National Institute on Drug Abuse, National Institutes of Health, Department of Health and Human Services Baltimore, MD, USA ; Anatomy and Neurobiology, University of Maryland School of Medicine Baltimore, MD, USA
| |
Collapse
|
23
|
Parkes SL, Bradfield LA, Balleine BW. Interaction of insular cortex and ventral striatum mediates the effect of incentive memory on choice between goal-directed actions. J Neurosci 2015; 35:6464-71. [PMID: 25904797 DOI: 10.1523/JNEUROSCI.4153-14.2015] [Citation(s) in RCA: 60] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
The anterior insular cortex (IC) and the nucleus accumbens (NAc) core have been separately implicated in the selection and performance of actions based on the incentive value of the instrumental outcome. Here, we examined the role of connections between the IC and the NAc core in the performance of goal-directed actions. Rats were trained on two actions for distinct outcomes, after which one of the two outcomes was devalued by specific satiety immediately before a choice extinction test. We first confirmed the projection from the IC to the NAc core and then disconnected these structures via asymmetrical excitotoxic lesions before training. Contralateral, but not ipsilateral, disconnection of the IC and NAc core disrupted outcome devaluation. We hypothesized that communication between the IC and NAc core is necessary for the retrieval of incentive value at test. To test this, we infused the GABAA agonist muscimol into the IC and the μ-opioid receptor antagonist CTAP into the contralateral NAc before the choice extinction test. As expected, inactivation of the IC in one hemisphere and blocking μ-opioid receptors in the contralateral NAc core abolished outcome-selective devaluation. These results suggest that the IC and NAc core form part of a circuit mediating the retrieval of outcome values and the subsequent choice between goal-directed actions based on those values.
Collapse
|
24
|
Furlong TM, Jayaweera HK, Balleine BW, Corbit LH. Binge-like consumption of a palatable food accelerates habitual control of behavior and is dependent on activation of the dorsolateral striatum. J Neurosci 2014; 34:5012-22. [PMID: 24695718 DOI: 10.1523/JNEUROSCI.3707-13.2014] [Citation(s) in RCA: 126] [Impact Index Per Article: 12.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023] Open
Abstract
Access to highly palatable and calorically dense foods contributes to increasing rates of obesity worldwide. Some have made the controversial argument that consumption of such foods can lead to "food addiction," yet little is known about how long-term access to highly palatable foods might alter goal-directed learning and decision making. In the following experiments, rats were given 5 weeks of continuous or restricted daily access to sweetened condensed milk (SCM) before instrumental training for food reward. Subsequently we examined whether goal-directed performance was impaired in these groups using the outcome-devaluation task. Control rats reduced responding following devaluation of the earned outcome as did those with previous continuous access to SCM. Of interest, rats with previous restricted access to SCM responded similarly under the devalued and nondevalued conditions, indicating loss of goal-directed control of responding. To identify whether the loss of goal-directed control was accompanied by differences in neuronal activity, we used c-Fos immunohistochemistry to examine the patterns of activation during devaluation testing. We observed greater c-Fos immunoreactivity in the dorsolateral striatum (DLS) and associated cortical regions in the group that received previous restricted access to SCM and demonstrated a lack of sensitivity to outcome devaluation. Infusion of the AMPA-receptor antagonist CNQX or dopamine D1-receptor antagonist SCH-23390 into the DLS before testing restored goal-directed performance in the restricted SCM group, confirming that this region is essential for habit-based performance. These results indicate that previous diet can alter subsequent learning and activity in the neural circuits that support performance.
Collapse
|
25
|
Corbit LH, Ostlund SB, Balleine BW. Sensitivity to instrumental contingency degradation is mediated by the entorhinal cortex and its efferents via the dorsal hippocampus. J Neurosci 2002; 22:10976-84. [PMID: 12486193 PMCID: PMC6758438] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/28/2023] Open
Abstract
Previous studies have shown that electrolytic lesions of the dorsal hippocampus render the instrumental performance of rats insensitive to selective degradation of the action-outcome contingency (Corbit and Balleine, 2000). In the present experiments, we sought to replicate this finding and to examine the effects of excitotoxic lesions. In the first three experiments, rats with either electrolytic or NMDA lesions of the dorsal hippocampus and sham-lesioned controls were trained to press two levers, each of which delivered a unique food outcome, before their sensitivity to outcome devaluation and degradation of the instrumental contingency was assessed. Although we were able to replicate our original finding that electrolytic lesions of the dorsal hippocampus render rats insensitive to selective degradation of the instrumental contingency, NMDA lesions of the dorsal hippocampus had no effect. Neither lesion had any detectable effect on sensitivity to outcome devaluation. In experiment 4, we assessed the possibility that the effect of the electrolytic lesion resulted from damage to fibers originating in the retrohippocampal region (including both entorhinal cortex and subiculum) by examining the impact of bilateral NMDA-induced lesions of the retrohippocampus on the same tasks. Importantly, this lesion produced a deficit similar to that observed after electrolytic hippocampal lesions. The final experiment used a disconnection procedure to assess more directly whether damage to efferents from the retrohippocampal region, rather than the dorsal hippocampus itself, can account for the observed deficit. The data from these tests suggest that the deficits observed previously after electrolytic hippocampal lesions were the result of damage to entorhinal efferents.
Collapse
Affiliation(s)
- Laura H Corbit
- Department of Psychology University of California, Los Angeles, Los Angeles, California 90095, USA.
| | | | | |
Collapse
|
26
|
Corbit LH, Balleine BW. The role of the hippocampus in instrumental conditioning. J Neurosci 2000; 20:4233-9. [PMID: 10818159 PMCID: PMC6772620] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/16/2023] Open
Abstract
Considerable evidence suggests that, in instrumental conditioning, rats can encode both the specific action-outcome associations to which they are exposed and the degree to which an action is causal in producing its associated outcome. Three experiments assessed the involvement of the hippocampus in encoding these aspects of instrumental learning. In each study, rats with electrolytic lesions of the dorsal hippocampus and sham-lesioned controls were trained while hungry to press two levers, each of which delivered a unique food outcome. Experiments 1A and 1B used an outcome devaluation procedure to assess the effects of the lesion on encoding the action-outcome relationship. After training, one of the two outcomes was devalued using a specific satiety procedure, after which performance on the two levers was assessed in a choice extinction test. The lesion had no detectable effect on either the acquisition of instrumental performance or on the rats' sensitivity to outcome devaluation; lesion and sham groups both reduced responding on the lever associated with the devalued outcome compared with the other lever. In experiment 2, the sensitivity of hippocampal rats to the causal efficacy of their actions was assessed by selectively degrading the contingency between one of the actions and its associated outcome. Whereas sham rats selectively reduced performance on the lever for which the action-outcome contingency had been degraded, hippocampal rats did not. These results suggest that, in instrumental conditioning, lesions of the dorsal hippocampus selectively impair the ability of rats to represent the causal relationship between an action and its consequences.
Collapse
Affiliation(s)
- L H Corbit
- Department of Psychology, University of California, Los Angeles, Los Angeles, California 90095, USA.
| | | |
Collapse
|