1
|
Hart G, Burton TJ, Balleine BW. What Role Does Striatal Dopamine Play in Goal-directed Action? Neuroscience 2024; 546:20-32. [PMID: 38521480 DOI: 10.1016/j.neuroscience.2024.03.020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2023] [Revised: 02/15/2024] [Accepted: 03/18/2024] [Indexed: 03/25/2024]
Abstract
Evidence suggests that dopamine activity provides a US-related prediction error for Pavlovian conditioning and the reinforcement signal supporting the acquisition of habits. However, its role in goal-directed action is less clear. There are currently few studies that have assessed dopamine release as animals acquire and perform self-paced instrumental actions. Here we briefly review the literature documenting the psychological, behavioral and neural bases of goal-directed actions in rats and mice, before turning to describe recent studies investigating the role of dopamine in instrumental learning and performance. Plasticity in dorsomedial striatum, a central node in the network supporting goal-directed action, clearly requires dopamine release, the timing of which, relative to cortical and thalamic inputs, determines the degree and form of that plasticity. Beyond this, bilateral release appears to reflect reward prediction errors as animals experience the consequences of an action. Such signals feedforward to update the value of the specific action associated with that outcome during subsequent performance, with dopamine release at the time of action reflecting the updated predicted action value. More recently, evidence has also emerged for a hemispherically lateralised signal associated with the action; dopamine release is greater in the hemisphere contralateral to the spatial target of the action. This effect emerges over the course of acquisition and appears to reflect the strength of the action-outcome association. Thus, during goal-directed action, dopamine release signals the action, the outcome and their association to shape the learning and performance processes necessary to support this form of behavioral control.
Collapse
Affiliation(s)
- Genevra Hart
- Decision Neuroscience Lab, UNSW Sydney, Australia
| | | | | |
Collapse
|
2
|
Hart G, Burton TJ, Nolan CR, Balleine BW. Striatal dopamine release tracks the relationship between actions and their consequences. Cell Rep 2024; 43:113828. [PMID: 38386550 DOI: 10.1016/j.celrep.2024.113828] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2023] [Revised: 12/05/2023] [Accepted: 02/03/2024] [Indexed: 02/24/2024] Open
Abstract
The acquisition and performance of goal-directed actions has long been argued to depend on the integration of glutamatergic inputs to the posterior dorsomedial striatum (pDMS) under the modulatory influence of dopamine. Nevertheless, relatively little is known about the dynamics of striatal dopamine during goal-directed actions. To investigate this, we chronically recorded dopamine release in the pDMS as rats acquired two actions for distinct outcomes as these action-outcome associations were incremented and then subsequently degraded or reversed. We found that bilateral dopamine release scaled with action value, whereas the lateralized dopamine signal, i.e., the difference in dopamine release ipsilaterally and contralaterally to the direction of the goal-directed action, reflected the strength of the action-outcome association independently of changes in movement. Our results establish, therefore, that striatal dopamine activity during goal-directed action reflects both bilateral moment-to-moment changes in action value and the long-term action-outcome association.
Collapse
Affiliation(s)
- G Hart
- Decision Neuroscience Laboratory, School of Psychology, UNSW Sydney, Sydney, NSW, Australia
| | - T J Burton
- Decision Neuroscience Laboratory, School of Psychology, UNSW Sydney, Sydney, NSW, Australia
| | - C R Nolan
- Decision Neuroscience Laboratory, School of Psychology, UNSW Sydney, Sydney, NSW, Australia
| | - B W Balleine
- Decision Neuroscience Laboratory, School of Psychology, UNSW Sydney, Sydney, NSW, Australia.
| |
Collapse
|
3
|
Thrailkill EA, Daniels CW. The temporal structure of goal-directed and habitual operant behavior. J Exp Anal Behav 2024; 121:38-51. [PMID: 38131488 PMCID: PMC10872308 DOI: 10.1002/jeab.896] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2023] [Accepted: 12/01/2023] [Indexed: 12/23/2023]
Abstract
Operant behavior can reflect the influence of goal-directed and habitual processes. These can be distinguished by changes to response rate following devaluation of the reinforcing outcome. Whether a response is goal directed or habitual depends on whether devaluation affects response rate. Response rate can be decomposed into frequencies of bouts and pauses by analyzing the distribution of interresponse times. This study sought to characterize goal-directed and habitual behaviors in terms of bout-initiation rate, within-bout response rate, bout length, and bout duration. Data were taken from three published studies that compared sensitivity to devaluation following brief and extended training with variable-interval schedules. Analyses focused on goal-directed and habitual responding, a comparison of a habitual response to a similarly trained response that had been converted back to goal-directed status after a surprising event, and a demonstration of contextual control of habit and goal direction in the same subjects. Across experiments and despite responses being clearly distinguished as goal directed and habitual by total response rate, analyses of bout-initiation rate, within-bout rate, bout length, and bout duration did not reveal a pattern that distinguished goal-directed from habitual responding.
Collapse
|
4
|
van Timmeren T, de Wit S. Instant habits versus flexible tenacity: Do implementation intentions accelerate habit formation? Q J Exp Psychol (Hove) 2023; 76:2479-2492. [PMID: 36476147 PMCID: PMC10585941 DOI: 10.1177/17470218221147024] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2022] [Revised: 11/23/2022] [Accepted: 12/07/2022] [Indexed: 10/19/2023]
Abstract
Implementation intentions (strategic "if-then" plans) have been shown to support behaviour change. This may be achieved by mentally forming stimulus-response associations, thereby promoting habit formation. Does this deliberate attempt to instal "strategic automaticity" only offer advantages, or does it also come at the cost of reduced flexibility that characterises learnt habits? To investigate this, we tested healthy, young participants on a computerised instrumental learning task. Critically, we introduced implementation intentions ("if I see stimulus X, then I will respond") versus goal intentions ("for outcome Z, I will respond)" during instrumental acquisition, and subsequently assessed behavioural flexibility in an outcome-revaluation test. In Experiment 1, we conducted a between-subjects manipulation of strategic planning, and in Experiment 2, a within-subject manipulation. We hypothesised that implementation intentions would lead to strong stimulus-response associations and consequently impair performance when the signalled outcome value changed and therefore required a different response, while benefitting performance when the outcome value (and required response) remained the same. We found that implementation intentions supported instrumental learning, but impaired test performance overall (most robustly in Experiment 2), irrespective of whether the signalled outcome value had changed. We argue that this general detrimental effect of implementation intentions on test performance is likely a consequence of their negative effect on stimulus-outcome learning. Our findings warrant caution when applying if-then plans to situations where the agent does not already possess perfect knowledge of behavioural contingencies.While implementation intentions may support efficient and fast behavioural execution, this may come at the expense of behavioural flexibility.
Collapse
Affiliation(s)
- Tim van Timmeren
- Habit Lab, Department of Clinical Psychology, University of Amsterdam, Amsterdam, The Netherlands
- Amsterdam Brain and Cognition, University of Amsterdam, Amsterdam, The Netherlands
- Department of Social, Health and Organisational Psychology, Utrecht University, Utrecht, The Netherlands
| | - Sanne de Wit
- Habit Lab, Department of Clinical Psychology, University of Amsterdam, Amsterdam, The Netherlands
- Amsterdam Brain and Cognition, University of Amsterdam, Amsterdam, The Netherlands
| |
Collapse
|
5
|
Abdollahipour R, Valtr L, Banátová K, Bizovská L, Klein T, Svoboda Z, Steenbergen B, Wilson PH. Motor-cognitive coupling is impaired in children with mild or severe forms of developmental coordination disorder. Front Hum Neurosci 2023; 17:1252852. [PMID: 37941568 PMCID: PMC10628065 DOI: 10.3389/fnhum.2023.1252852] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2023] [Accepted: 10/09/2023] [Indexed: 11/10/2023] Open
Abstract
Children with developmental coordination disorder (DCD) show deficits in motor-cognitive coupling. However, it remains unclear whether such deficits depend on the severity of DCD. The aim of this study was to examine cognitive-motor coupling under different levels of inhibitory control in children with severe (s-DCD) or moderate DCD (m-DCD), compared with typically-developing children (TDC). The performance of 29 primary-school children aged 6-12 years with s-DCD (Mage = 9.12 ± 1.56 years), 53 m-DCD (Mage = 8.78 ± 1.67 years), and 201 TDC (Mage = 9.20 ± 1.50 years) was compared on a double jump reaching task (DJRT) paradigm, presented on a large 42-inch touchscreen. The task display had a circular home-base, centred at the bottom of the display, and three target locations at radials of -20°, 0°, and 20°, 40 cm above the home-base circle. For the standard double-jump reaching task (DJRT), children moved their index finger from home-base circle to touch the target stimulus as fast as possible; 20% were jump trials where the target shifted left or right at lift-off. For the anti-jump reaching task (AJRT), 20% of trials required an anti-jump movement, touching the contralateral target location. While no group differences were shown on the DJRT, the DCD group were slower to complete reaching movements than the TDC group on AJRT; on the latter, the two DCD sub-groups were not shown to differ. Results confirm the presence of motor inhibition deficits in DCD which may not be dependent on the motor severity of the disorder.
Collapse
Affiliation(s)
- Reza Abdollahipour
- Department of Natural Sciences in Kinanthropology, Faculty of Physical Culture, Palacký University Olomouc, Olomouc, Czechia
| | - Ludvík Valtr
- Department of Natural Sciences in Kinanthropology, Faculty of Physical Culture, Palacký University Olomouc, Olomouc, Czechia
| | - Kamila Banátová
- Department of Natural Sciences in Kinanthropology, Faculty of Physical Culture, Palacký University Olomouc, Olomouc, Czechia
| | - Lucia Bizovská
- Department of Natural Sciences in Kinanthropology, Faculty of Physical Culture, Palacký University Olomouc, Olomouc, Czechia
| | - Tomáš Klein
- Department of Natural Sciences in Kinanthropology, Faculty of Physical Culture, Palacký University Olomouc, Olomouc, Czechia
| | - Zdeněk Svoboda
- Department of Natural Sciences in Kinanthropology, Faculty of Physical Culture, Palacký University Olomouc, Olomouc, Czechia
| | - Bert Steenbergen
- Department of Pedagogical and Educational Sciences, Behavioural Science Institute, Radboud University, Nijmegen, Netherlands
| | - Peter Henry Wilson
- Department of Natural Sciences in Kinanthropology, Faculty of Physical Culture, Palacký University Olomouc, Olomouc, Czechia
- Healthy Brain and Mind Research Centre, School of Behavioural and Health Sciences, Faculty of Health Sciences, Australian Catholic University, Melbourne, VIC, Australia
| |
Collapse
|
6
|
Tomonaga M, Imura T. Attention to inattention: effect of the other's attentional state on visual search performance in chimpanzees. Perception 2023; 52:266-294. [PMID: 36879506 DOI: 10.1177/03010066231158255] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/08/2023]
Abstract
In the present study, we examined the effects of the other's triadic attention to objects on visual search performances in chimpanzees. We found the search-asymmetry-like effect of the other's attentional state; the chimpanzees searched a target object not attended by the other individual more efficiently than that attended (Experiment 1). Additional experiments explored the possibility that the other individual "holding an object but not looking at it" led to expectancy violation (Experiment 2) or the role of nonsocial cues such as the proximity relation between the head and the object (Experiment 3). Still, these accounts alone did not explain this effect. It was also shown that the other's attentional state affected the chimpanzees' performances more readily as the interference effect than the facilitation effect (Experiment 4). Furthermore, the same effect was observed in the visual search for the gaze (head direction) of others (Experiment 5). We obtained the same results using photographs of chimpanzees (Experiment 6). Contrary to the chimpanzees, humans detected the object to which attention was directed more efficiently than vice versa (Experiment 7). The present results may reflect species differences between chimpanzees and humans in processing triadic social attention.
Collapse
|
7
|
Dhungana A, Becchi S, Leake J, Morris G, Avgan N, Balleine BW, Vissel B, Bradfield LA. Goal-Directed Action Is Initially Impaired in a hAPP-J20 Mouse Model of Alzheimer's Disease. eNeuro 2023; 10:ENEURO.0363-22.2023. [PMID: 36650070 PMCID: PMC9927544 DOI: 10.1523/eneuro.0363-22.2023] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2022] [Revised: 12/12/2022] [Accepted: 01/03/2023] [Indexed: 01/19/2023] Open
Abstract
Cognitive-behavioral testing in preclinical models of Alzheimer's disease has failed to capture deficits in goal-directed action control. Here, we provide the first comprehensive investigation of goal-directed action in a transgenic mouse model of Alzheimer's disease. Specifically, we tested outcome devaluation performance in male and female human amyloid precursor protein (hAPP)-J20 mice. Mice were first trained to press left and right levers for pellet and sucrose outcomes, respectively (counterbalanced), over 4 d. On test, mice were prefed one of the outcomes to satiety and given a choice between levers. Devaluation performance was intact for 36-week-old wild-types of both sexes, who responded more on the valued relative to the devalued lever (Valued > Devalued). By contrast, devaluation was impaired (Valued = Devalued) for J20 mice of both sexes, and for 52-week-old male mice regardless of genotype. After additional lever press training (i.e., 8-d lever pressing in total), devaluation was intact for all mice, demonstrating that the initial deficits were not a result of a nonspecific impairment in reward processing, depression, or locomotor activity in J20 or aging mice. Follow-up analyses revealed that microglial expression in the dorsal CA1 region of the hippocampus was associated with poorer outcome devaluation performance on initial, but not later tests. Together, these data demonstrate that goal-directed action is initially impaired in J20 mice of both sexes and in aging male mice regardless of genotype, and that this impairment is related to neuroinflammation in the dorsal CA1 hippocampal region.
Collapse
Affiliation(s)
- Amolika Dhungana
- School of Life Sciences, Faculty of Science, University of Technology Sydney, Sydney, New South Wales 2007, Australia
- Centre for Neuroscience and Regenerative Medicine, St. Vincent's Centre for Applied Medical Research, St. Vincent's Health Network, Sydney, New South Wales 2010, Australia
| | - Serena Becchi
- School of Psychology, University of New South Wales Sydney, Sydney, New South Wales 2052, Australia
| | - Jessica Leake
- School of Life Sciences, Faculty of Science, University of Technology Sydney, Sydney, New South Wales 2007, Australia
- Centre for Neuroscience and Regenerative Medicine, St. Vincent's Centre for Applied Medical Research, St. Vincent's Health Network, Sydney, New South Wales 2010, Australia
- School of Psychology, University of New South Wales Sydney, Sydney, New South Wales 2052, Australia
| | - Gary Morris
- School of Life Sciences, Faculty of Science, University of Technology Sydney, Sydney, New South Wales 2007, Australia
- Centre for Neuroscience and Regenerative Medicine, St. Vincent's Centre for Applied Medical Research, St. Vincent's Health Network, Sydney, New South Wales 2010, Australia
- Tasmanian School of Medicine, College of Health and Medicine, University of Tasmania, Hobart, Tasmania 7000, Australia
| | - Nesli Avgan
- Centre for Neuroscience and Regenerative Medicine, St. Vincent's Centre for Applied Medical Research, St. Vincent's Health Network, Sydney, New South Wales 2010, Australia
| | - Bernard W Balleine
- Decision Neuroscience Laboratory, School of Psychology, University of New South Wales Sydney, Sydney, New South Wales 2052, Australia
| | - Bryce Vissel
- Centre for Neuroscience and Regenerative Medicine, St. Vincent's Centre for Applied Medical Research, St. Vincent's Health Network, Sydney, New South Wales 2010, Australia
- School of Clinical Medicine, University of New South Wales Medicine & Health, St Vincent's Healthcare Clinical Campus, Faculty of Medicine and Health, University of New South Wales Sydney, Sydney, New South Wales 2052, Australia
| | - Laura A Bradfield
- School of Life Sciences, Faculty of Science, University of Technology Sydney, Sydney, New South Wales 2007, Australia
- Centre for Neuroscience and Regenerative Medicine, St. Vincent's Centre for Applied Medical Research, St. Vincent's Health Network, Sydney, New South Wales 2010, Australia
| |
Collapse
|
8
|
Morris RW, Dezfouli A, Griffiths KR, Le Pelley ME, Balleine BW. The Neural Bases of Action-Outcome Learning in Humans. J Neurosci 2022; 42:3636-3647. [PMID: 35296548 PMCID: PMC9053851 DOI: 10.1523/jneurosci.1079-21.2022] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2021] [Revised: 02/17/2022] [Accepted: 02/22/2022] [Indexed: 11/21/2022] Open
Abstract
From an associative perspective the acquisition of new goal-directed actions requires the encoding of specific action-outcome (AO) associations and, therefore, sensitivity to the validity of an action as a predictor of a specific outcome relative to other events. Although competitive architectures have been proposed within associative learning theory to achieve this kind of identity-based selection, whether and how these architectures are implemented by the brain is still a matter of conjecture. To investigate this issue, we trained human participants to encode various AO associations while undergoing functional neuroimaging (fMRI). We then degraded one AO contingency by increasing the probability of the outcome in the absence of its associated action while keeping other AO contingencies intact. We found that this treatment selectively reduced performance of the degraded action. Furthermore, when a signal predicted the unpaired outcome, performance of the action was restored, suggesting that the degradation effect reflects competition between the action and the context for prediction of the specific outcome. We used a Kalman filter to model the contribution of different causal variables to AO learning and found that activity in the medial prefrontal cortex (mPFC) and the dorsal anterior cingulate cortex (dACC) tracked changes in the association of the action and context, respectively, with regard to the specific outcome. Furthermore, we found the mPFC participated in a network with the striatum and posterior parietal cortex to segregate the influence of the various competing predictors to establish specific AO associations.SIGNIFICANCE STATEMENT Humans and other animals learn the consequences of their actions, allowing them to control their environment in a goal-directed manner. Nevertheless, it is unknown how we parse environmental causes from the effects of our own actions to establish these specific action-outcome (AO) relationships. Here, we show that the brain learns the causal structure of the environment by segregating the unique influence of actions from other causes in the medial prefrontal and anterior cingulate cortices and, through a network of structures, including the caudate nucleus and posterior parietal cortex, establishes the distinct causal relationships from which specific AO associations are formed.
Collapse
Affiliation(s)
- Richard W Morris
- Centre for Translational Data Science, University of Sydney, Sydney, NSW 2006, Australia
| | - Amir Dezfouli
- Data61, Commonwealth Scientific and Industrial Research Organisation, Sydney, NSW 2015, Australia
| | - Kristi R Griffiths
- Brain Dynamics Centre, Westmead Institute for Medical Research, University of Sydney, Sydney, NSW 2145, Australia
| | - Mike E Le Pelley
- School of Psychology, University of New South Wales Sydney, Sydney, NSW 2052, Australia
| | - Bernard W Balleine
- School of Psychology, University of New South Wales Sydney, Sydney, NSW 2052, Australia
| |
Collapse
|
9
|
Bouton ME, Allan SM, Tavakkoli A, Steinfeld MR, Thrailkill EA. Effect of context on the instrumental reinforcer devaluation effect produced by taste-aversion learning. J Exp Psychol Anim Learn Cogn 2021; 47:476-489. [PMID: 34516195 PMCID: PMC8713511 DOI: 10.1037/xan0000295] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Four experiments manipulated the context in which taste-aversion conditioning occurred when the reinforcer was devalued after instrumental learning. In all experiments, rats learned to lever press in an operant conditioning chamber and then had an aversion to the food-pellet reinforcer conditioned by pairing it with lithium chloride (LiCl) in either that context or a different context. Lever pressing was then tested in extinction to assess its status as a goal-directed action. In Experiment 1, aversion conditioning in the operant conditioning chamber suppressed lever-pressing during the test, but aversion conditioning in the home cage did not. Exposure to the averted pellet in the operant conditioning chamber after conditioning in the home cage did not change this effect (Experiment 2). The same pattern was observed when the different context was a second operant-style chamber (counterbalanced), exposure to the contexts was controlled, and pellets were presented in them in the same manner (Experiment 3). The greater effect of aversion conditioning in the instrumental context was not merely due to potentiated contextual conditioning (Experiment 4). Importantly, consumption tests revealed that the aversion conditioned in the different context had transferred to the test context. Thus, when reinforcer devaluation occurred in a different context, the rats lever pressed in extinction for a reinforcer they would otherwise reject. The results suggest that animals encode contextual information about the reinforcer during instrumental learning and suggest caution in making inferences about action versus habit learning when the reinforcer is devalued in a different context. (PsycInfo Database Record (c) 2021 APA, all rights reserved).
Collapse
|
10
|
Takahashi N, Moberg S, Zolnik TA, Catanese J, Sachdev RNS, Larkum ME, Jaeger D. Thalamic input to motor cortex facilitates goal-directed action initiation. Curr Biol 2021; 31:4148-4155.e4. [PMID: 34302741 PMCID: PMC8478854 DOI: 10.1016/j.cub.2021.06.089] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2020] [Revised: 05/20/2021] [Accepted: 06/29/2021] [Indexed: 11/16/2022]
Abstract
Prompt execution of planned motor action is essential for survival. The interactions between frontal cortical circuits and the basal ganglia are central to goal-oriented action selection and initiation.1-4 In rodents, the ventromedial thalamic nucleus (VM) is one of the critical nodes that conveys the output of the basal ganglia to the frontal cortical areas including the anterior lateral motor cortex (ALM).5-9 Recent studies showed the critical role of ALM and its interplay with the motor thalamus in preparing sensory-cued rewarded movements, specifically licking.10-12 Work in primates suggests that the basal ganglia output to the motor thalamus transmits an urgency or vigor signal,13-15 which leads to shortened reaction times and faster movement initiation. As yet, little is known about what signals are transmitted from the motor thalamus to the cortex during cued movements and how these signals contribute to movement initiation. In the present study, we employed a tactile-cued licking task in mice while monitoring reaction times of the initial lick. We found that inactivation of ALM delayed the initiation of cued licking. Two-photon Ca2+ imaging of VM axons revealed that the majority of the axon terminals in ALM were transiently active during licking. Their activity was predictive of the time of the first lick. Chemogenetic and optogenetic manipulation of VM axons in ALM indicated that VM inputs facilitate the initiation of cue-triggered and impulsive licking in trained mice. Our results suggest that VM thalamocortical inputs increase the probability and vigor of initiating planned motor responses.
Collapse
Affiliation(s)
- Naoya Takahashi
- Institute for Biology, Humboldt University of Berlin, 10117 Berlin, Germany.
| | - Sara Moberg
- Einstein Center for Neurosciences Berlin, 10117 Berlin, Germany
| | - Timothy A Zolnik
- Institute for Biology, Humboldt University of Berlin, 10117 Berlin, Germany
| | - Julien Catanese
- Department of Biology, Emory University, Atlanta, GA 30322, USA
| | - Robert N S Sachdev
- Institute for Biology, Humboldt University of Berlin, 10117 Berlin, Germany
| | - Matthew E Larkum
- Institute for Biology, Humboldt University of Berlin, 10117 Berlin, Germany
| | - Dieter Jaeger
- Department of Biology, Emory University, Atlanta, GA 30322, USA.
| |
Collapse
|
11
|
Stayte S, Dhungana A, Vissel B, Bradfield LA. Parafascicular Thalamic and Orbitofrontal Cortical Inputs to Striatum Represent States for Goal-Directed Action Selection. Front Behav Neurosci 2021; 15:655029. [PMID: 33841111 PMCID: PMC8029974 DOI: 10.3389/fnbeh.2021.655029] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2021] [Accepted: 03/01/2021] [Indexed: 12/02/2022] Open
Abstract
Several lines of evidence accrued over the last 5–10 years have converged to suggest that the parafascicular nucleus of the thalamus and the lateral orbitofrontal cortex each represent or contribute to internal state/context representations that guide action selection in partially observable task situations. In rodents, inactivations of each structure have been found to selectively impair performance in paradigms testing goal-directed action selection, but only when that action selection relies on state representations. Electrophysiological evidence has suggested that each structure achieves this function via inputs onto cholinergic interneurons (CINs) in the dorsomedial striatum. Here, we briefly review these studies, then point to anatomical evidence regarding the afferents of each structure and what they suggest about the specific features that each contribute to internal state representations. Finally, we speculate as to whether this role might be achieved interdependently through direct PF→OFC projections, or through the convergence of independent direct orbitofrontal cortex (OFC) and parafascicular nucleus of the thalamus (PF) inputs onto striatal targets.
Collapse
Affiliation(s)
- Sandy Stayte
- Centre for Neuroscience and Regenerative Medicine, University of Technology Sydney, Sydney, NSW, Australia.,St. Vincent's Centre for Applied Medical Research, St. Vincent's Hospital Sydney, Sydney, NSW, Australia
| | - Amolika Dhungana
- Centre for Neuroscience and Regenerative Medicine, University of Technology Sydney, Sydney, NSW, Australia.,St. Vincent's Centre for Applied Medical Research, St. Vincent's Hospital Sydney, Sydney, NSW, Australia
| | - Bryce Vissel
- Centre for Neuroscience and Regenerative Medicine, University of Technology Sydney, Sydney, NSW, Australia.,St. Vincent's Centre for Applied Medical Research, St. Vincent's Hospital Sydney, Sydney, NSW, Australia
| | - Laura A Bradfield
- Centre for Neuroscience and Regenerative Medicine, University of Technology Sydney, Sydney, NSW, Australia.,St. Vincent's Centre for Applied Medical Research, St. Vincent's Hospital Sydney, Sydney, NSW, Australia
| |
Collapse
|
12
|
Nakashima R, Kumada T. Explicit Sense of Agency in an Automatic Control Situation: Effects of Goal-Directed Action and the Gradual Emergence of Outcome. Front Psychol 2020; 11:2062. [PMID: 32982855 PMCID: PMC7488175 DOI: 10.3389/fpsyg.2020.02062] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2020] [Accepted: 07/27/2020] [Indexed: 12/02/2022] Open
Abstract
Sense of agency (SoA), or the subjective feeling that “I am the agent controlling the object,” is essential for learning and enjoying object manipulation. Recently developed automatic control systems, such as the cruise control systems in autonomous vehicles, require less manual control from the manipulators. It has to date been impossible to completely relieve operators of the need for manual control in many automatic control systems developed for tool-using situations. Therefore, it is important to examine how to maintain SoA (illusorily) during an automatic control situation. We investigated the effects of two typical characteristics of everyday tool-use situations on SoA when braking a moving object with a keypress. These characteristics included the presence of a goal (e.g., in driving situations, the driver steps on the brake pedal to stop the car at an expected position) and the gradual emergence of the outcome (e.g., the driver steps on the brake pedal and the car usually slows down first and then stops). We conducted an experiment in which participants stopped a moving object and then rated their SoA for stopping the object. Participants were explicitly informed that the object would sometimes stop independently of their keypress. Results showed that both characteristics decreased SoA in the manual control situation but increased SoA in the automatic control situation. Thus, these characteristics could be useful for maintaining a sense of agency in automatic control situations.
Collapse
Affiliation(s)
- Ryoichi Nakashima
- Department of Psychology, The University of Tokyo, Bunkyo, Japan.,RIKEN CBS-TOYOTA Collaboration Center, RIKEN, Wako, Japan
| | - Takatsune Kumada
- RIKEN CBS-TOYOTA Collaboration Center, RIKEN, Wako, Japan.,Graduate School of Informatics, Kyoto University, Kyoto, Japan
| |
Collapse
|
13
|
Hezemans FH, Wolpe N, Rowe JB. Apathy is associated with reduced precision of prior beliefs about action outcomes. J Exp Psychol Gen 2020; 149:1767-1777. [PMID: 32039624 PMCID: PMC7397861 DOI: 10.1037/xge0000739] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2019] [Revised: 12/11/2019] [Accepted: 12/23/2019] [Indexed: 01/06/2023]
Abstract
Apathy is a debilitating syndrome that is associated with reduced goal-directed behavior. Although apathy is common and detrimental to prognosis in many neuropsychiatric diseases, its underlying mechanisms remain controversial. We propose a new model of apathy, in the context of Bayesian theories of brain function, whereby actions require predictions of their outcomes to be held with sufficient precision for "explaining away" differences in sensory inputs. In the active inference model, apathy results from reduced precision of prior beliefs about action outcomes. We tested this hypothesis using a visuomotor task in healthy adults (N = 47), with experimental manipulation of physical effort and financial reward. Bayesian modeling of performance and participants' perception of their performance was used to infer the precision of their priors. We confirmed that the perception of performance was biased toward the target, which was accounted for by relatively precise prior beliefs about action outcomes. These priors were consistently more precise than the corresponding performance distribution, and were scaled to effort and reward. Crucially, prior precision was negatively associated with trait apathy, suggesting that apathetic individuals had less precise prior beliefs about action outcomes. The results support a Bayesian account of apathy that could inform future studies of clinical populations. (PsycInfo Database Record (c) 2020 APA, all rights reserved).
Collapse
Affiliation(s)
- Frank H Hezemans
- Medical Research Council (MRC) Cognition, University of Cambridge
| | - Noham Wolpe
- Medical Research Council (MRC) Cognition, University of Cambridge
| | - James B Rowe
- Medical Research Council (MRC) Cognition, University of Cambridge
| |
Collapse
|
14
|
Hilo-Merkovich R, Yuval-Greenberg S. The coordinate system of endogenous spatial attention during smooth pursuit. J Vis 2020; 20:26. [PMID: 32720972 PMCID: PMC7424112 DOI: 10.1167/jov.20.7.26] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2019] [Accepted: 03/18/2020] [Indexed: 11/24/2022] Open
Abstract
A central question in vision is whether spatial attention is represented in an eye-centered (retinotopic) or world-centered (spatiotopic) reference-frame. Most previous studies on this question focused on how coordinates are modulated across saccades. In the present study, we investigated the reference-frame of attention across smooth pursuit eye-movements using a goal-directed saccade task. In two experiments, participants were asked to pursue a moving target while attending to one or two grating stimuli. On each trial, one stimulus was constant in its retinal position and the other was constant in its spatial position. Upon detection of a slight change in stimulus orientation, participants were asked to stop pursuing and perform a fast saccade toward the modified stimulus. In the focused attention condition, they attended one, predefined, stimulus, and in the divided attention condition they attended both. In Experiment 1 the angle of the orientation change marking the target event was constant across participants and conditions. In Experiment 2, the angle was individually adapted to equate performance across participants and conditions. Findings of the two experiments were consistent and showed that the enhancement of mean visual sensitivity in the focused relative to the divided attention condition was similar in magnitude for both retinotopic and spatiotopic targets. This indicates that during smooth pursuit, endogenous attention was proportionally divided between targets in retinotopic and spatiotopic frames of reference.
Collapse
Affiliation(s)
| | - Shlomit Yuval-Greenberg
- School of Psychological Sciences, Tel-Aviv University, Tel-Aviv, Israel
- Sagol School of Neuroscience, Tel-Aviv University, Tel-Aviv, Israel
| |
Collapse
|
15
|
Juvrud J, Gredebäck G. The teleological stance: Past, present, and future. Dev Sci 2020; 23:e12970. [PMID: 32304172 DOI: 10.1111/desc.12970] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2018] [Revised: 11/05/2018] [Accepted: 11/27/2018] [Indexed: 11/27/2022]
Abstract
We review the support for, and criticisms of, the teleological stance theory, often described as a foundation for goal-directed action understanding early in life. A major point of contention in the literature has been how teleological processes and assumptions of rationality are represented and understood in infancy, and this debate has been largely centered on three paradigms. Visual habituation studies assess infant's abilities to retrospectively assess teleological processes; the presence of such processes is supported by the literature. Rational imitation is a phenomenon that has been questioned both theoretically and empirically, and there is currently little support for this concept in the literature. The involvement of teleological processes in action prediction is unclear. To date, the ontology of teleological processes remains unspecified. To remedy this, we present a new action-based theory of teleological processes (here referred to as the embodied account of teleological processes), based on the development of goal-directed reaching with its origin during the fetal period and continuous development over the first few months of life.
Collapse
Affiliation(s)
- Joshua Juvrud
- Department of Psychology, Uppsala Child and Baby Lab, Uppsala University, Uppsala, Sweden
| | - Gustaf Gredebäck
- Department of Psychology, Uppsala Child and Baby Lab, Uppsala University, Uppsala, Sweden
| |
Collapse
|
16
|
Luis Del Campo V, Pajuelo Miranda JM, Morenas Martín J. Training With Direct Versus Indirect Spatial Stimulus-Response Compatibility in Combat Sports. Percept Mot Skills 2020; 127:722-741. [PMID: 32356486 DOI: 10.1177/0031512520917806] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
The study examined effects of spatial stimulus-response compatibility on response time and response accuracy in 20 novice combat sport athletes. Two equivalent groups, based on initial reaction time measures, were required to perceive and move quickly and accurately in response to an unspecific visual stimulus presented on a large screen during the two types of perceptual training in eight laboratory sessions. One group reacted by moving the fist toward the stimulus location on the target (direct compatibility condition). Another group was required to move the fist away from target in the opposite direction (indirect compatibility condition). Specifically, the indirect compatibility group achieved faster reaction times than the direct compatibility group during the two posttests containing video-projected attacks of the opponents, and in one of the two posttests containing real opponents' attacks. Results seem to reveal higher combat performance against real opponents when athletes trained with an indirect stimulus-response compatibility condition.
Collapse
Affiliation(s)
- Vicente Luis Del Campo
- Laboratory of Motor Control and Learning, Faculty of Sport Sciences, University of Extremadura Spain
| | | | - Jesús Morenas Martín
- Laboratory of Motor Control and Learning, Faculty of Sport Sciences, University of Extremadura Spain
| |
Collapse
|
17
|
Abstract
The ability to establish behaviorally what psychological capacity an animal is deploying-to discern accurately what an animal is doing-is key to functional analyses of the brain. Our current understanding of these capacities suggests, however, that this task is complex; there is evidence that multiple capacities are engaged simultaneously and contribute independently to the control of behavior. As such, establishing the contribution of a cell, circuit, or neural system to any one function requires careful dissection of that role from its influence on other functions and, therefore, the careful selection and design of behavioral tasks fit for that purpose. Here I describe recent research that has sought to utilize behavioral tools to investigate the neural bases of instrumental conditioning, particularly the circuits and systems supporting the capacity for goal-directed action, as opposed to conditioned reflexes and habits, and how these sources of action control interact to generate adaptive behavior.
Collapse
|
18
|
Cuevas Rivera D, Strobel A, Goschke T, Kiebel SJ. Modeling Dynamic Allocation of Effort in a Sequential Task Using Discounting Models. Front Neurosci 2020; 14:242. [PMID: 32269509 PMCID: PMC7109442 DOI: 10.3389/fnins.2020.00242] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2019] [Accepted: 03/04/2020] [Indexed: 11/13/2022] Open
Abstract
Most rewards in our lives require effort to obtain them. It is known that effort is seen by humans as carrying an intrinsic disutility which devalues the obtainable reward. Established models for effort discounting account for this by using participant-specific discounting parameters inferred from experiments. These parameters offer only a static glance into the bigger picture of effort exertion. The mechanism underlying the dynamic changes in a participant's willingness to exert effort is still unclear and an active topic of research. Here, we modeled dynamic effort exertion as a consequence of effort- and probability-discounting mechanisms during goal reaching, sequential behavior. To do this, we developed a novel sequential decision-making task in which participants made binary choices to reach a minimum number of points. Importantly, the time points and circumstances of effort allocation were decided by participants according to their own preferences and not imposed directly by the task. Using the computational model to analyze participants' choices, we show that the dynamics of effort exertion arise from a combination of changing task needs and forward planning. In other words, the interplay between a participant's inferred discounting parameters is sufficient to explain the dynamic allocation of effort during goal reaching. Using formal model comparison, we also inferred the forward-planning strategy used by participants. The model allowed us to characterize a participant's effort exertion in terms of only a few parameters. Moreover, the model can be adapted to a number of tasks used in establishing the neural underpinnings of forward-planning behavior and meta-control, allowing for the characterization of behavior in terms of model parameters.
Collapse
Affiliation(s)
- Darío Cuevas Rivera
- Chair of Neuroimaging, Faculty of Psychology, Technische Universität Dresden, Dresden, Germany
| | - Alexander Strobel
- Chair of Differential and Personality Psychology, Faculty of Psychology, Technische Universität Dresden, Dresden, Germany
| | - Thomas Goschke
- Chair of General Psychology, Faculty of Psychology, Technische Universität Dresden, Dresden, Germany
| | - Stefan J. Kiebel
- Chair of Neuroimaging, Faculty of Psychology, Technische Universität Dresden, Dresden, Germany
| |
Collapse
|
19
|
Goldwater MB, Gershman SJ, Moul C, Ludowici C, Burton A, Killer B, Kuhnert RL, Ridgway K. Children's understanding of habitual behaviour. Dev Sci 2020; 23:e12951. [PMID: 32058673 DOI: 10.1111/desc.12951] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2019] [Revised: 09/24/2019] [Accepted: 01/14/2020] [Indexed: 11/28/2022]
Abstract
Research into the development of Theory of Mind (ToM) has shown how children from a very early age infer other people's goals. However, human behaviour is sometimes driven not by plans to achieve goals, but by habits, which are formed over long periods of reinforcement. Habitual and goal-directed behaviours are often aligned with one another but can diverge when the optimal behavioural policy changes without being directly reinforced (thus specifically hobbling the habitual learning strategy). Unlike the flexibility of goal-directed behaviour, rigid habits can cause agents to persist in behaviour that is no longer adaptive. In the current study, all children predict agents will tend to behave consistently with their goals, but between the ages of 5 and 10, children showed an increasing understanding of how habits can cause agents to persistently take suboptimal actions. These findings stand out from the typical way the development of social reasoning is examined, which instead focuses on children's increasing appreciation of how others' beliefs or expectations affect how they will act in service of their goals. The current findings show that children also learn that under certain circumstances, people's actions are suboptimal despite potentially 'knowing better.'
Collapse
Affiliation(s)
| | | | | | | | - Amy Burton
- University of Sydney, Camperdown, NSW, Australia
| | | | | | - Kate Ridgway
- University of Sydney, Camperdown, NSW, Australia
| |
Collapse
|
20
|
Crego ACG, Štoček F, Marchuk AG, Carmichael JE, van der Meer MAA, Smith KS. Complementary Control over Habits and Behavioral Vigor by Phasic Activity in the Dorsolateral Striatum. J Neurosci 2020; 40:2139-53. [PMID: 31969469 DOI: 10.1523/JNEUROSCI.1313-19.2019] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2019] [Revised: 12/18/2019] [Accepted: 12/19/2019] [Indexed: 12/13/2022] Open
Abstract
Despite clear evidence linking the basal ganglia to the control of outcome insensitivity (i.e., habit) and behavioral vigor (i.e., its behavioral speed/fluidity), it remains unclear whether or how these functions relate to one another. Here, using male Long-Evans rats in response-based and cue-based maze-running tasks, we demonstrate that phasic dorsolateral striatum (DLS) activity occurring at the onset of a learned behavior regulates how vigorous and habitual it is. In a response-based task, brief optogenetic excitation at the onset of runs decreased run duration and the occurrence of deliberative behaviors, whereas midrun stimulation carried little effect. Outcome devaluation showed these runs to be habitual. DLS inhibition at run start did not produce robust effects on behavior until after outcome devaluation. At that time, when the DLS was plausibly most critically required for performance (i.e., habitual), inhibition reduced performance vigor measures and caused a dramatic loss of habitual responding (i.e., animals quit the task). In a second cue-based "beacon" task requiring behavior initiation at the start of the run and again in the middle of the run, DLS excitation at both time points could improve the vigor of runs. Postdevaluation testing showed behavior on the beacon task to be habitual as well. This pattern of results suggests that one role for phasic DLS activity at behavior initiation is to promote the execution of the behavior in a vigorous and habitual fashion by a diverse set of measures.SIGNIFICANCE STATEMENT Our research expands the literature twofold. First, we find that features of a habitual behavior that are typically studied separately (i.e., maze response performance, deliberation movements, running vigor, and outcome insensitivity) are quite closely linked together. Second, efforts have been made to understand "what" the dorsolateral striatum (DLS) does for habitual behavior, and our research provides a key set of results showing "when" it is important (i.e., at behavior initiation). By showing such dramatic control over habits by DLS activity in a phasic time window, plausible real-world applications could involve more informed DLS perturbations to curb intractably problematic habits.
Collapse
|
21
|
Abstract
It is now commonly accepted that instrumental actions can reflect goal-directed control; i.e., they can show sensitivity to changes in the relationship to and the value of their consequences. With overtraining, stress, neurodegeneration, psychiatric conditions, or after exposure to various drugs of abuse, goal-directed control declines and instrumental actions are performed independently of their consequences. Although this latter insensitivity has been argued to reflect the development of habitual control, the lack of a positive definition of habits has rendered this conclusion controversial. Here we consider various alternative definitions of habit, including recent suggestions they reflect chunked action sequences, to derive criteria with which to categorize responses as habitual. We consider various theories regarding the interaction between goal-directed and habitual controllers and propose a collaborative model based on their hierarchical integration. We argue that this model is consistent with the available data, can be instantiated both at an associative level and computationally and generates interesting predictions regarding the influence of this collaborative integration on behavior.
Collapse
Affiliation(s)
- Bernard W Balleine
- Decision Neuroscience Laboratory, School of Psychology, University of New South Wales Sydney, Sydney, NSW, Australia
| | - Amir Dezfouli
- Data 61, Commonwealth Scientific and Industrial Research Organisation, Sydney, NSW, Australia
| |
Collapse
|
22
|
Abstract
The effect of stress on the balance between goal-directed behavior and stimulus-response habits has been demonstrated in a number of studies, but the extent to which stressful events that occur during development affect the balance between these systems later in life is less clear. Here, we examined whether individuals with a history of early-life stress (ELS) show a bias toward avoidance habits on an instrumental learning task as adults. Participants (N = 189 in Experiment 1 and N = 112 in Experiment 2) were undergraduate students at the University of California, Los Angeles. In Experiment 1, we hypothesized that a history of ELS and a longer training phase would be associated with greater avoidance habits. Participants learned to make button-press responses to visual stimuli in order to avoid aversive auditory outcomes. Following a training phase involving extensive practice of the responses, participants were tested for habitual responding using outcome devaluation. After completing the instrumental learning task, participants provided retrospective reports of stressful events they experienced during their first 16 years of life. We did not observe evidence for an effect of the length of training, but we did observe an effect of ELS, with greater stress predicting greater odds of performing the avoidance habit. In Experiment 2, we sought to replicate the effect of ELS observed in Experiment 1, and we also tested whether the presence of distraction during training would increase avoidance habit performance. We replicated the effect of ELS but we did not observe evidence of an effect of distraction. Taken together, these data lend support to the hypothesis that stress occurring during development can have lasting effects on the balance between goal-directed behavior and stimulus-response habits in humans. Enhancement of avoidance habits may help explain the higher levels of negative health outcomes such as heart and liver disease that have been observed in individuals with a history of ELS. Some of the negative health behaviors that contribute to these negative health outcomes, e.g., overeating and substance use, may be performed initially to avoid feelings of distress and then transition to being performed habitually.
Collapse
Affiliation(s)
- Tara K Patterson
- Department of Psychology, University of California, Los Angeles, Los Angeles, CA, United States
| | - Michelle G Craske
- Department of Psychology, University of California, Los Angeles, Los Angeles, CA, United States
| | - Barbara J Knowlton
- Department of Psychology, University of California, Los Angeles, Los Angeles, CA, United States
| |
Collapse
|
23
|
Abstract
The mechanism of action selection is a widely shared fundamental process required by animals to interact with the environment and adapt to it. A key step in this process is the filtering of the "distracting" sensory inputs that may disturb action selection. Because it has been suggested that, in principle, action selection may also be processed by shared circuits in vertebrate and invertebrates, we wondered whether invertebrates show the ability to filter out "distracting" stimuli during a goal-directed action, as seen in vertebrates. In this experiment, action selection was studied in wild-type Drosophila melanogaster by investigating their reaction to the abrupt appearance of a visual distractor during an ongoing locomotor action directed to a visual target. We found that when the distractor was present, flies tended to shift the original trajectory toward it, thus acknowledging its presence, but they did not fully commit to it, suggesting that an inhibition process took place to continue the unfolding of the planned goal-directed action. To some extent flies appeared to take into account and represent motorically the distractor, but they did not engage in a complete change of their initial motor program in favor of the distractor. These results provide interesting insights into the selection-for-action mechanism, in a context requiring action-centered attention, that might have appeared rather early in the course of evolution. NEW & NOTEWORTHY Action selection and maintenance of a goal-directed action require animals to ignore irrelevant "distracting" stimuli that might elicit alternative motor programs. In this study we observed, in Drosophila melanogaster, a top-down mechanism inhibiting the response toward salient stimuli, to accomplish a goal-directed action. These data highlight, for the first time in an invertebrate organism, that the action-based attention shown by higher organisms, such as humans and nonhuman primates, might have an ancestral origin.
Collapse
Affiliation(s)
- Giovanni Frighetto
- Department of General Psychology, University of Padova , Padua , Italy.,Department of Biomedical Sciences, University of Padova , Padua , Italy
| | - Mauro A Zordan
- Department of Biology, University of Padova , Padua , Italy.,Padova Neuroscience Center, University of Padova , Padua , Italy
| | - Umberto Castiello
- Department of General Psychology, University of Padova , Padua , Italy
| | - Aram Megighian
- Department of Biomedical Sciences, University of Padova , Padua , Italy.,Padova Neuroscience Center, University of Padova , Padua , Italy
| |
Collapse
|
24
|
Abstract
Maintaining intentions over time is fundamental to goal-directed action, and previous research demonstrated that intentions are encoded and maintained in a fronto-parietal network including e.g., the dlPFC and IPS. Yet, intention maintenance is highly challenging in the constantly changing environments we experience every day. While we might have formed an intention under specific conditions, this context can change rapidly and unexpectedly. Some suggested that intentions representations in the fronto-parietal cortex change flexibly when external demands change (context-dependent coding). Others suggested that these representations are encoded in an abstract format that is not affected by changes in external demands (context-invariant coding). Here, I will first outline an analysis approach using multivariate pattern analysis of fMRI data to comprehensively assess the context-dependence / invariance of intention representations in the fronto-parietal cortex. I will then highlight some research following the proposed analysis strategy. Results to date are mixed, showing context-dependence in some, but context-invariance in other cases. In an attempt to synthesize these somewhat divergent results, I will argue that depending on characteristics of the intentions as well as the environment, intentions can either be encoded in a context-dependent or a context-invariant format. This enables us to achieve both stability and flexibility of behavior under constantly changing external demands.
Collapse
Affiliation(s)
- David Wisniewski
- Department of Experimental Psychology, Ghent University, Ghent, Belgium
| |
Collapse
|
25
|
Watson P, van Wingen G, de Wit S. Conflicted between Goal-Directed and Habitual Control, an fMRI Investigation. eNeuro 2018; 5:ENEURO. [PMID: 30310863 DOI: 10.1523/ENEURO.0240-18.2018] [Citation(s) in RCA: 33] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2018] [Accepted: 06/27/2018] [Indexed: 11/21/2022] Open
Abstract
“Slips of action” occur in everyday life when we momentarily lose sight of a goal (for example, when in a rush or distracted). Associative models propose that these habitual responses can be activated via a direct stimulus-response (S-R) mechanism, regardless of the current hedonic value of the outcome. The slips-of-action task (SOAT) has been extensively used in both healthy and pathological populations to measure habit tendencies, the likelihood of making erroneous responses for devalued outcomes. Inspection of behavioral performance does not reveal, however, whether the impairments were due to impaired goal-directed control or aberrantly strong habit formation. In the current study, we used functional MRI while human participants performed both the instrumental training and SOAT test phases, to elucidate the relative contributions of these mechanisms to performance on the SOAT. On trials in which conflict arises between competing goal-directed and habitual responses, we observed increased activation across areas including the anterior cingulate cortex, paracingulate gyrus, lateral orbitofrontal cortex (OFC), insula, and inferior frontal gyrus (IFG). Responding for devalued outcomes was related to increased activation in the premotor cortex and cerebellum, implicating these regions in habitual responding. Increased activation in the caudate, dorsolateral prefrontal cortex (dlPFC), and frontal pole during training was associated with better performance during the test phase, indicative of goal-directed action control. These results endorse interpretation of the SOAT in terms of competing goal-directed and habitual mechanisms and highlight that cognitive control processes present an additional bottleneck for successful performance on this task.
Collapse
|
26
|
Thrailkill EA, Trask S, Vidal P, Alcalá JA, Bouton ME. Stimulus control of actions and habits: A role for reinforcer predictability and attention in the development of habitual behavior. J Exp Psychol Anim Learn Cogn 2018; 44:370-384. [PMID: 30407063 PMCID: PMC6233324 DOI: 10.1037/xan0000188] [Citation(s) in RCA: 32] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Goal-directed actions are instrumental behaviors whose performance depends on the organism's knowledge of the reinforcing outcome's value. In contrast, habits are instrumental behaviors that are insensitive to the outcome's current value. Although habits in everyday life are typically controlled by stimuli that occasion them, most research has studied habits using free-operant procedures in which no discrete stimuli are present to occasion the response. We therefore studied habit learning when rats were reinforced for lever pressing on a random-interval 30-s schedule in the presence of a discriminative stimulus (S) but not in its absence. In Experiment 1, devaluing the reinforcer with taste aversion conditioning weakened instrumental responding in a 30-s S after 4, 22, and 66 sessions of instrumental training. Even extensive practice thus produced goal-directed action, not habit. Experiments 2 and 3 contrastingly found habit when the duration of S was increased from 30 s to 8 min. Experiment 4 then found habit with the 30-s S when it always contained a reinforcer; goal-directed action was maintained when reinforcers were earned at the same rate but occurred in only 50% of Ss (as in the previous experiments). The results challenge the view that habits are an inevitable consequence of repeated reinforcement (as in the law of effect) and instead suggest that discriminated habits develop when the reinforcer becomes predictable. Under those conditions, organisms may pay less attention to their behavior, much as they pay less attention to signals associated with predicted reinforcers in Pavlovian conditioning. (PsycINFO Database Record (c) 2018 APA, all rights reserved).
Collapse
Affiliation(s)
| | - Sydney Trask
- Department of Psychological Science, University of Vermont
| | - Pedro Vidal
- Department of Psychological Science, University of Vermont
| | - José A Alcalá
- Department of Psychological Science, University of Vermont
| | - Mark E Bouton
- Department of Psychological Science, University of Vermont
| |
Collapse
|
27
|
Marshall SK, Stainton T, Wall JM, Zhu M, Murray J, Wu S, El Bouhali A, Parada F, Zaidman-Zait A, Young RA. Transition to Adulthood as a Joint Parent-Youth Project for Young Persons With Intellectual and Developmental Disabilities. Intellect Dev Disabil 2018; 56:263-277. [PMID: 30024849 DOI: 10.1352/1934-9556-56.5.263] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
Eight dyads ( N = 16) residing in Western Canada participated in this investigation of how young adults with intellectual and developmental disabilities (IDD) and their parents jointly construct, articulate, and act on goals pertinent to the young adults' transition to adulthood. Using the action-project method to collect and analyze conversations and video recall data, cases were grouped representing the ways goal-directed projects brought relationship ( n = 4), planning ( n = 3) or both ( n = 1) to the foreground as joint projects. Resources internal to the dyad such as emotional resources, and external to the dyad, facilitated formulation and pursuit of projects. Lack of external supports and limited parental knowledge about IDD hindered joint project formulation.
Collapse
Affiliation(s)
- Sheila K Marshall
- Sheila K. Marshall, Tim Stainton, Jessie M. Wall, Ma Zhu, and John Murray, University of British Columbia
| | - Tim Stainton
- Sheila K. Marshall, Tim Stainton, Jessie M. Wall, Ma Zhu, and John Murray, University of British Columbia
| | - Jessie M Wall
- Sheila K. Marshall, Tim Stainton, Jessie M. Wall, Ma Zhu, and John Murray, University of British Columbia
| | - Ma Zhu
- Sheila K. Marshall, Tim Stainton, Jessie M. Wall, Ma Zhu, and John Murray, University of British Columbia
| | - John Murray
- Sheila K. Marshall, Tim Stainton, Jessie M. Wall, Ma Zhu, and John Murray, University of British Columbia
| | - Siwei Wu
- Siwei Wu, Hengyang Normal University
| | | | | | - Anat Zaidman-Zait
- Anat Zaidman-Zait, University of Tel Aviv; and Richard A. Young, University of British Columbia
| | - Richard A Young
- Anat Zaidman-Zait, University of Tel Aviv; and Richard A. Young, University of British Columbia
| |
Collapse
|
28
|
Peak J, Hart G, Balleine BW. From learning to action: the integration of dorsal striatal input and output pathways in instrumental conditioning. Eur J Neurosci 2018; 49:658-671. [PMID: 29791051 DOI: 10.1111/ejn.13964] [Citation(s) in RCA: 41] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2018] [Revised: 04/18/2018] [Accepted: 05/16/2018] [Indexed: 02/04/2023]
Abstract
Considerable evidence suggests that the learning and performance of instrumental actions depend on activity in basal ganglia circuitry; however, these two functions have generally been considered independently. Whereas research investigating the associative mechanisms underlying instrumental conditioning has identified critical cortical and limbic input pathways to the dorsal striatum, the performance of instrumental actions has largely been attributed to activity in the dorsal striatal output pathways, with direct and indirect pathway projection neurons mediating action initiation, perseveration and cessation. Here, we discuss evidence that the dorsal striatal input and basal ganglia output pathways mediate the learning and performance of instrumental actions, respectively, with the dorsal striatum functioning as a transition point. From this perspective, the issue of how multiple striatal inputs are integrated at the level of the dorsal striatum and converted into relatively restricted outputs becomes one of critical significance for understanding how learning is translated into action. So too does the question of how learning signals are modulated by recent experience. We propose that this occurs through recurrent corticostriatothalamic feedback circuits that serve to integrate performance signals by updating ongoing action-related learning.
Collapse
Affiliation(s)
- James Peak
- Decision Neuroscience Lab, School of Psychology, UNSW Sydney, Randwick, NSW, 2031, Australia
| | - Genevra Hart
- Decision Neuroscience Lab, School of Psychology, UNSW Sydney, Randwick, NSW, 2031, Australia
| | - Bernard W Balleine
- Decision Neuroscience Lab, School of Psychology, UNSW Sydney, Randwick, NSW, 2031, Australia
| |
Collapse
|
29
|
de Wit S, Kindt M, Knot SL, Verhoeven AAC, Robbins TW, Gasull-Camos J, Evans M, Mirza H, Gillan CM. Shifting the balance between goals and habits: Five failures in experimental habit induction. J Exp Psychol Gen 2018; 147:1043-1065. [PMID: 29975092 PMCID: PMC6033090 DOI: 10.1037/xge0000402] [Citation(s) in RCA: 100] [Impact Index Per Article: 16.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2017] [Revised: 09/20/2017] [Accepted: 11/30/2017] [Indexed: 12/28/2022]
Abstract
Habits are repetitive behaviors that become ingrained with practice, routine, and repetition. The more we repeat an action, the stronger our habits become. Behavioral and clinical neuroscientists have become increasingly interested in this topic because habits may contribute to aspects of maladaptive human behavior, such as compulsive behavior in psychiatry. Numerous studies have demonstrated that habits can be induced in otherwise healthy rats by simply overtraining stimulus-response behaviors. However, despite growing interest in this topic and its application to psychiatry, a similar body of work in humans is absent. Only a single study has been published in humans that shows the effect of extensive training on habit expression. Here, we report five failed attempts to demonstrate that overtraining instrumental behavior leads to the development of inflexible habits in humans, using variants of four previously published outcome devaluation paradigms. Extensive training did not lead to greater habits in two versions of an avoidance learning task, in an appetitive slips-of-action task, or in two independent attempts to replicate the original demonstration. The finding that these outcome devaluation procedures may be insensitive to duration of stimulus-response training in humans has implications for prior work in psychiatric populations. Specifically, it converges with the suggestion that the failures in outcome devaluation in compulsive individuals reflect dysfunction in goal-directed control, rather than overactive habit learning. We discuss why habits are difficult to experimentally induce in healthy humans, and the implications of this for future research in healthy and disordered populations. (PsycINFO Database Record
Collapse
Affiliation(s)
| | | | | | | | | | - Julia Gasull-Camos
- Department of Neurochemistry and Neuropharmacology, Institute of Biomedical Research Barcelona
| | | | - Hira Mirza
- Department of Psychology, New York University
| | - Claire M Gillan
- School of Psychology and Global Brain Health Institute, Trinity College Dublin
| |
Collapse
|
30
|
Hart G, Bradfield LA, Fok SY, Chieng B, Balleine BW. The Bilateral Prefronto-striatal Pathway Is Necessary for Learning New Goal-Directed Actions. Curr Biol 2018; 28:2218-2229.e7. [PMID: 30056856 DOI: 10.1016/j.cub.2018.05.028] [Citation(s) in RCA: 60] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2018] [Revised: 05/07/2018] [Accepted: 05/11/2018] [Indexed: 11/22/2022]
Abstract
The acquisition of new goal-directed actions requires the encoding of action-outcome associations. At a neural level, this encoding has been hypothesized to involve a prefronto-striatal circuit extending between the prelimbic cortex (PL) and the posterior dorsomedial striatum (pDMS); however, no research identifying this pathway with any precision has been reported. We started by mapping the prelimbic input to the dorsal and ventral striatum using a combination of retrograde and anterograde tracing with CLARITY and established that PL-pDMS projections share some overlap with projections to the nucleus accumbens core (NAc) in rats. We then tested whether each of these pathways were functionally required for goal-directed learning; we used a pathway-specific dual-virus chemogenetic approach to selectively silence pDMS-projecting or NAc-projecting PL neurons during instrumental training and tested rats for goal-directed action. We found that silencing PL-pDMS projections abolished goal-directed learning, whereas silencing PL-NAc projections left goal-directed learning intact. Finally, we used a three-virus approach to silence bilateral and contralateral pDMS-projecting PL neurons and again blocked goal-directed learning. These results establish that the acquisition of new goal-directed actions depends on the bilateral PL-pDMS pathway driven by intratelencephalic cortical neurons.
Collapse
|
31
|
Kosaki Y, Pearce JM, McGregor A. The response strategy and the place strategy in a plus-maze have different sensitivities to devaluation of expected outcome. Hippocampus 2018; 28:484-496. [PMID: 29637657 PMCID: PMC6055719 DOI: 10.1002/hipo.22847] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2016] [Revised: 03/03/2018] [Accepted: 04/04/2018] [Indexed: 11/17/2022]
Abstract
Previous studies have suggested that spatial navigation can be achieved with at least two distinct learning processes, involving either cognitive map‐like representations of the local environment, referred to as the “place strategy”, or simple stimulus‐response (S‐R) associations, the “response strategy”. A similar distinction between cognitive/behavioral processes has been made in the context of non‐spatial, instrumental conditioning, with the definition of two processes concerning the sensitivity of a given behavior to the expected value of its outcome as well as to the response‐outcome contingency (“goal‐directed action” and “S‐R habit”). Here we investigated whether these two versions of dichotomist definitions of learned behavior, one spatial and the other non‐spatial, correspond to each other in a formal way. Specifically, we assessed the goal‐directed nature of two navigational strategies, using a combination of an outcome devaluation procedure and a spatial probe trial frequently used to dissociate the two navigational strategies. In Experiment 1, rats trained in a dual‐solution T‐maze task were subjected to an extinction probe trial from the opposite start arm, with or without prefeeding‐induced devaluation of the expected outcome. We found that a non‐significant preference for the place strategy in the non‐devalued condition was completely reversed after devaluation, such that significantly more animals displayed the use of the response strategy. The result suggests that the place strategy is sensitive to the expected value of the outcome, while the response strategy is not. In Experiment 2, rats with hippocampal lesions showed significant reliance on the response strategy, regardless of whether the expected outcome was devalued or not. The result thus offers further evidence that the response strategy conforms to the definition of an outcome‐insensitive, habitual form of instrumental behavior. These results together attest a formal correspondence between two types of dual‐process accounts of animal learning and behavior.
Collapse
Affiliation(s)
- Yutaka Kosaki
- Department of Psychology, Durham University, Durham DH1 3LE, United Kingdom.,School of Psychology, Cardiff University, Cardiff CF10 3AT, United Kingdom.,Department of Psychology, Waseda University, Shinjuku, Tokyo 162-8644, Japan
| | - John M Pearce
- School of Psychology, Cardiff University, Cardiff CF10 3AT, United Kingdom
| | - Anthony McGregor
- Department of Psychology, Durham University, Durham DH1 3LE, United Kingdom
| |
Collapse
|
32
|
Abstract
Action is widely characterized as possessing a teleological dimension. The dominant way of describing goal-directed action and agency is in terms of exploitation, i.e., pursuing pre-specified goals using existing strategies. Recent theoretical developments emphasize the place of exploration, i.e., discovering new goals or acquiring new strategies. The exploitation-exploration distinction poses questions with regard to goals and agency: Should exploration, as some authors have suggested, be regarded as acting without a goal? We argue that recognizing the hierarchical nature of goals is crucial in distinguishing the two kinds of activity, because this recognition prevents the claim that exploration is goal-free, while allowing for a homogeneous account of both exploitative and explorative actions. An action typically causes relatively low-level/proximal (i.e., sensorimotor, immediate) and relatively high-level/distal (i.e., in the environment, at a wider timescale) outcomes. In exploitation, one relies on existing associations between low- and high-level states, whereas in exploration one does not have the ability or intention to control high-level/distal states. We argue that explorative action entails the capacity to exercise control within the low-level/proximal states, which enables the pursuit of indeterminate goals at the higher levels of a goal hierarchy, and the possibility of acquiring new goals and reorganization of goal hierarchies. We consider how the dominant models of agency might accommodate this capacity for explorative action.
Collapse
Affiliation(s)
- Davood G Gozli
- Department of Psychology, Faculty of Social Sciences, University of Macau, Macau, China
| | - Nevia Dolcini
- Philosophy and Religious Studies Programme, Faculty of Arts and Humanities, University of Macau, Macau, China
| |
Collapse
|
33
|
Inuggi A, Campus C, Vastano R, Saunier G, Keuroghlanian A, Pozzo T. Observation of Point-Light-Walker Locomotion Induces Motor Resonance When Explicitly Represented; An EEG Source Analysis Study. Front Psychol 2018; 9:303. [PMID: 29593607 PMCID: PMC5857608 DOI: 10.3389/fpsyg.2018.00303] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2017] [Accepted: 02/23/2018] [Indexed: 11/13/2022] Open
Abstract
Understanding human motion, to infer the goal of others' actions, is thought to involve the observer's motor repertoire. One prominent class of actions, the human locomotion, has been object of several studies, all focused on manipulating the shape of degraded human figures like point-light walker (PLW) stimuli, represented as walking on the spot. Nevertheless, since the main goal of the locomotor function is to displace the whole body from one position to the other, these stimuli might not fully represent a goal-directed action and thus might not be able to induce the same motor resonance mechanism expected when observing a natural locomotion. To explore this hypothesis, we recorded the event-related potentials (ERP) of canonical/scrambled and translating/centered PLWs decoding. We individuated a novel ERP component (N2c) over central electrodes, around 435 ms after stimulus onset, for translating compared to centered PLW, only when the canonical shape was preserved. Consistently with our hypothesis, sources analysis associated this component to the activation of trunk and lower legs primary sensory-motor and supplementary motor areas. These results confirm the role of own motor repertoire in processing human action and suggest that ERP can detect the associated motor resonance only when the human figure is explicitly involved in performing a meaningful action.
Collapse
Affiliation(s)
- Alberto Inuggi
- Unit of Robotics, Brain and Cognitive, Istituto Italiano di Tecnologia, Center for Human Technologies, Genova, Italy.,Sciences, Istituto Italiano di Tecnologia, Center for Human Technologies, Genova, Italy
| | - Claudio Campus
- Unit for Visually Impaired People, Istituto Italiano di Tecnologia, Center for Human Technologies, Genova, Italy
| | - Roberta Vastano
- Unit of Robotics, Brain and Cognitive, Istituto Italiano di Tecnologia, Center for Human Technologies, Genova, Italy.,Sciences, Istituto Italiano di Tecnologia, Center for Human Technologies, Genova, Italy
| | - Ghislain Saunier
- Laboratório de Cognição Motora, Departamento de Anatomia, Universidade Federal do Pará, Belém, Brazil
| | - Alejo Keuroghlanian
- Unit of Robotics, Brain and Cognitive, Istituto Italiano di Tecnologia, Center for Human Technologies, Genova, Italy.,Sciences, Istituto Italiano di Tecnologia, Center for Human Technologies, Genova, Italy
| | - Thierry Pozzo
- Institut National de la Santé et de la Recherche Médical, Cognition-Action-Plasticité Sensorimotrice, Université Bourgogne Franche-Comté, Dijon, France.,Centro di Neurofisiologia Traslazionale, Istituto Italiano di Tecnologia, Ferrara, Italy
| |
Collapse
|
34
|
Hart G, Bradfield LA, Balleine BW. Prefrontal Corticostriatal Disconnection Blocks the Acquisition of Goal-Directed Action. J Neurosci 2018; 38:1311-22. [PMID: 29301872 DOI: 10.1523/JNEUROSCI.2850-17.2017] [Citation(s) in RCA: 65] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2017] [Revised: 11/23/2017] [Accepted: 12/15/2017] [Indexed: 11/21/2022] Open
Abstract
The acquisition of goal-directed action requires encoding of the association between an action and its specific consequences or outcome. At a neural level, this encoding has been hypothesized to involve a prefrontal corticostriatal circuit involving the projection from the prelimbic cortex (PL) to the posterior dorsomedial striatum (pDMS); however, no direct evidence for this claim has been reported. In a series of experiments, we performed functional disconnection of this pathway using targeted lesions of the anterior corpus callosum to disrupt contralateral corticostriatal projections with asymmetrical lesions of the PL and/or pDMS to block plasticity in this circuit in rats. We first demonstrated that unilaterally blocking the PL input to the pDMS prevented the phosphorylation of extracellular signal-related kinase/mitogen activated protein kinase (pERK/pMAPK) induced by instrumental training. Next, we used a full bilateral disconnection of the PL from the pDMS and assessed goal-directed action using an outcome-devaluation test. Importantly, we found evidence that rats maintaining an ipsilateral and/or contralateral connection between the PL and the pDMS were able to acquire goal-directed actions. In contrast, bilateral PL-pDMS disconnection abolished the acquisition of goal-directed actions. Finally, we used a temporary pharmacological disconnection to disrupt PL inputs to the pDMS by infusing the NMDA antagonist dl-2-amino-5-phosphonopentanoic acid into the pDMS during instrumental training and found that this manipulation also disrupted goal-directed learning. These results establish that, in rats, the acquisition of new goal-directed actions depends on a prefrontal-corticostriatal circuit involving a connection between the PL and the pDMS.SIGNIFICANCE STATEMENT It has been hypothesized that the prelimbic cortex (PL) and posterior dorsomedial striatum (pDMS) in rodents interact in a corticostriatal circuit to mediate goal-directed learning. However, no direct evidence supporting this claim has been reported. Using targeted lesions, we performed functional disconnection of the PL-pDMS pathway to assess its role in goal-directed learning. In the first experiment, we demonstrated that PL input to the pDMS is necessary for instrumental training-induced neuronal activity. Next, we disrupted ipsilateral, contralateral, or bilateral PL-pDMS connections and found that only bilateral PL-pDMS disconnection disrupted the acquisition of goal-directed actions, a finding we replicated in our final study using a pharmacological disconnection procedure.
Collapse
|
35
|
Chen J, Liang J, Lin X, Zhang Y, Zhang Y, Lu L, Shi J. Sleep Deprivation Promotes Habitual Control over Goal-Directed Control: Behavioral and Neuroimaging Evidence. J Neurosci 2017; 37:11979-92. [PMID: 29109237 DOI: 10.1523/JNEUROSCI.1612-17.2017] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2017] [Revised: 10/10/2017] [Accepted: 10/31/2017] [Indexed: 11/21/2022] Open
Abstract
Sleep is one of the most fundamental processes of life, playing an important role in the regulation of brain function. The long-term lack of sleep can cause memory impairments, declines in learning ability, and executive dysfunction. In the present study, we evaluated the effects of sleep deprivation on instrumental learning behavior, particularly goal-directed and habitual actions in humans, and investigated the underlying neural mechanisms. Healthy college students of either gender were enrolled and randomly divided into sleep deprivation group and sleep control group. fMRI data were collected. We found that one night of sleep deprivation led to greater responsiveness to stimuli that were associated with devalued outcomes in the slips-of-action test, indicating a deficit in the formation of goal-directed control and an overreliance on habits. Furthermore, sleep deprivation had no effect on the expression of acquired goal-directed action. The level of goal-directed action after sleep deprivation was positively correlated with baseline working memory capacity. The neuroimaging data indicated that goal-directed learning mainly recruited the ventromedial PFC (vmPFC), the activation of which was less pronounced during goal-directed learning after sleep deprivation. Activation of the vmPFC during goal-directed learning during training was positively correlated with the level of goal-directed action performance. The present study suggests that people rely predominantly on habits at the expense of goal-directed control after sleep deprivation, and this process involves the vmPFC. These results contribute to a better understanding of the effects of sleep loss on decision-making.SIGNIFICANCE STATEMENT Understanding the cognitive consequences of sleep deprivation has become extremely important over the past half century, given the continued decline in sleep duration in industrialized societies. Our results provide novel evidence that goal-directed action may be particularly vulnerable to sleep loss, and the brain mechanism underlying this effect was explored. Elucidation of the effects of sleep deprivation on decision-making will deepen our understanding of the function of sleep, emphasizing the role of sleep in cognitive impairments and mental health.
Collapse
|
36
|
Young RA, Marshall SK, Stainton T, Wall JM, Curle D, Zhu M, Munro D, Murray J, El Bouhali A, Parada F, Zaidman-Zait A. The transition to adulthood of young adults with IDD: Parents' joint projects. J Appl Res Intellect Disabil 2017; 31 Suppl 2:224-233. [PMID: 28834118 DOI: 10.1111/jar.12395] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/25/2017] [Indexed: 11/29/2022]
Abstract
INTRODUCTION Parents have found the transition to adulthood for their sons or daughters with intellectual and/or developmental disabilities (IDD) particularly challenging. The literature has not examined how parents work together and with others in face of this transition nor has it highlighted parental goals in this process. This study used a perspective based on joint, goal-direct action to describe the projects that Canadian parents engaged in together and with others relative to this transition. METHODS Using the qualitative action-project method, joint projects between parents and with others were identified from their conversations and followed for 6 months. FINDINGS Three groups of projects were described: equipping the young adult for adult life, connecting for personal support and managing day-to-day while planning for the future. CONCLUSIONS Parents act together and with others relative to the transition to adulthood of their young adult children with IDD. These projects are complex and differ in goals, steps, resources and emotional regulation and motivation.
Collapse
Affiliation(s)
| | | | - Tim Stainton
- University of British Columbia, Vancouver, BC, Canada
| | - Jessie M Wall
- University of British Columbia, Vancouver, BC, Canada
| | - Deirdre Curle
- University of British Columbia, Vancouver, BC, Canada
| | - Ma Zhu
- University of British Columbia, Vancouver, BC, Canada
| | - David Munro
- University of British Columbia, Vancouver, BC, Canada
| | - John Murray
- University of British Columbia, Vancouver, BC, Canada
| | | | | | | |
Collapse
|
37
|
Hart G, Balleine BW. Consolidation of Goal-Directed Action Depends on MAPK/ERK Signaling in Rodent Prelimbic Cortex. J Neurosci 2016; 36:11974-86. [PMID: 27881782 DOI: 10.1523/JNEUROSCI.1772-16.2016] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2016] [Revised: 09/19/2016] [Accepted: 10/03/2016] [Indexed: 02/02/2023] Open
Abstract
The prelimbic prefrontal cortex (PL) has consistently been found to be necessary for the acquisition of goal-directed actions in rodents. Nevertheless, the specific cellular processes underlying this learning remain unknown. We assessed changes in learning-related expression of mitogen-activated protein kinase/extracellular signal-related kinase (MAPK/ERK1/2) phosphorylation (pERK) in layers 2-3 and 5-6 of the anterior and posterior PL and in the population of neurons projecting to posterior dorsomedial striatum (pDMS), also implicated in goal-directed learning. Rats were given either a single session of training to press a lever for a pellet reward or yoked reward deliveries without instrumental training and assessed 5 or 60 min after training for pERK expression. Relative to yoked training, instrumental training produced an increase in pERK expression in all regions of the PL both at 5 and 60 min, and this was accompanied by an increase in nuclear pERK expression in the posterior PL in rats given instrumental training. pDMS-projecting neurons showed a transient increase in pERK expression in posterior layer 5-6 projection neurons after 5 min, and a delayed increase in anterior layer 2-3 neurons after 60 min, suggesting that ERK expression in the PL is necessary for the consolidation of goal-directed learning. Consistent with this claim, we found that rats trained on two lever press actions for distinct outcomes and then infused with the MEK inhibitor PD98059 into the PL immediately after training failed to acquire specific action-outcome associations, suggesting that persistent pERK signaling in the PL is necessary for goal-directed learning. SIGNIFICANCE STATEMENT The prelimbic cortex is implicated in goal-directed learning in rodents; however, it is unclear whether it is involved in the consolidation of this learning, and what cellular processes are involved. We used pERK as a marker of activity-related synaptic plasticity to assess learning-induced changes in distinct layers and neuronal populations of the prelimbic prefrontal cortex (PL). Training produced long-lasting upregulation of pERK throughout the PL and specifically within neurons that project to the pDMS, another region critical for goal-directed learning. Next, we demonstrated that pERK signaling in the PL was necessary for the consolidation of goal-directed learning. Together, these results indicate that instrumental training induces ERK signaling in distinct layers and populations in the PL and this signaling underlies the consolidation of goal-directed learning.
Collapse
|
38
|
Santamaria V, Rachwani J, Manselle W, Saavedra SL, Woollacott M. The Impact of Segmental Trunk Support on Posture and Reaching While Sitting in Healthy Adults. J Mot Behav 2017; 50:51-64. [PMID: 28350227 DOI: 10.1080/00222895.2017.1283289] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
Abstract
The authors investigated postural and arm control in seated reaches while providing trunk support at midribs and pelvic levels in adults. Kinematics and electromyography of the arm and ipsiliateral and contralateral paraspinal muscles were examined before and during reaching. Kinematics remained constant across conditions, but changes were observed in neuromuscular control. With midribs support, the ipsilateral cervical muscle showed either increased anticipatory activity or earlier compensatory muscle responses, suggesting its major role in head stabilization. The baseline activity of bilateral lumbar muscles was enhanced with midribs support, whereas with pelvic support, the activation frequency of paraspinal muscles increased during reaching. The results suggest that segmental trunk support in healthy adults modulates ipsilateral or contralateral paraspinal activity while overall kinematic outputs remain invariant.
Collapse
Affiliation(s)
- Victor Santamaria
- a Motor Control and Cognition Lab, Human Physiology Department & Institute of Neuroscience , University of Oregon , Eugene , Oregon
| | - Jaya Rachwani
- a Motor Control and Cognition Lab, Human Physiology Department & Institute of Neuroscience , University of Oregon , Eugene , Oregon
| | - Wayne Manselle
- a Motor Control and Cognition Lab, Human Physiology Department & Institute of Neuroscience , University of Oregon , Eugene , Oregon
| | - Sandra L Saavedra
- b Department of Rehabilitation Sciences , University of Hartford , West Hartford , Connecticut
| | - Marjorie Woollacott
- a Motor Control and Cognition Lab, Human Physiology Department & Institute of Neuroscience , University of Oregon , Eugene , Oregon
| |
Collapse
|
39
|
Bradfield LA, Balleine BW. Thalamic Control of Dorsomedial Striatum Regulates Internal State to Guide Goal-Directed Action Selection. J Neurosci 2017; 37:3721-33. [PMID: 28242795 DOI: 10.1523/JNEUROSCI.3860-16.2017] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2016] [Revised: 02/02/2017] [Accepted: 02/13/2017] [Indexed: 11/21/2022] Open
Abstract
We (Bradfield et al., 2013) have demonstrated previously that parafascicular thalamic nucleus (PF)-controlled neurons in the posterior dorsomedial striatum (pDMS) are critical for interlacing new and existing action-outcome contingencies to control goal-directed action. Based on these findings, it was suggested that animals with a dysfunctional PF-pDMS pathway might suffer a deficit in creating or retrieving internal contexts or "states" on which such information could become conditional. To assess this hypothesis more directly, rats were given a disconnection treatment using contralateral cytotoxic lesions of the PF and pDMS (Group CONTRA) or ipsilateral control lesions (Group IPSI) and trained to press a right and left lever for sucrose and pellet outcomes, after which these contingencies were reversed. The rats were then given an outcome devaluation test (all experiments) and a test of outcome-specific reinstatement (Experiments 1 and 3). We found that devaluation performance was intact for both groups after training of initial contingencies, but impaired for Group CONTRA after reversal. However, performance was restored by additional reversal training. Furthermore, when tested a second time after reversal training, rats in both groups demonstrated responding in accordance with the original contingencies, providing direct evidence of modulation of action selection by state. Finally, we found that external context could substitute for internal state and so could rescue responding in Group CONTRA, but only in the reinstatement test. Together, these findings suggest that animals use internal state information to guide action selection and that this information is modulated by the PF-pDMS pathway.SIGNIFICANCE STATEMENT Individuals with Parkinson's disease dementia often suffer a characteristic deficit in "cognitive flexibility." It has been suggested that neurodegeneration in the pathway between the centromedian/parafascicular thalalmic nucleus (PF) and striatum might underlie such deficits (Smith et al., 2014). In rats, we have similarly observed that a functional disconnection of the PF-posterior dorsomedial striatal pathway produces a specific impairment in the ability to alter goal-directed actions (Bradfield et al., 2013). It was suggested that this impairment could be a result of a deficit in state modulation. Here, we present four experiments that provide evidence for this hypothesis and suggest several ways (e.g., extended practice, providing external cues) in which this state modulation can be rescued.
Collapse
|
40
|
Furlong TM, Supit AS, Corbit LH, Killcross S, Balleine BW. Pulling habits out of rats: adenosine 2A receptor antagonism in dorsomedial striatum rescues meth-amphetamine-induced deficits in goal-directed action. Addict Biol 2017; 22:172-183. [PMID: 26515740 DOI: 10.1111/adb.12316] [Citation(s) in RCA: 43] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2015] [Revised: 08/06/2015] [Accepted: 09/02/2015] [Indexed: 11/29/2022]
Abstract
Addiction is characterized by a persistent loss of behavioral control resulting in insensitivity to negative feedback and abnormal decision-making. Here, we investigated the influence of methamphetamine (METH)-paired contextual cues on decision-making in rats. Choice between goal-directed actions was sensitive to outcome devaluation in a saline-paired context but was impaired in the METH-paired context, a deficit that was also found when negative feedback was provided. Reductions in c-Fos-related immunoreactivity were found in dorsomedial striatum (DMS) but not dorsolateral striatum after exposure to the METH context suggesting this effect reflected a loss specifically in goal-directed control in the METH context. This reduction in c-Fos was localized to non-enkephalin-expressing neurons in the DMS, likely dopamine D1-expressing direct pathway neurons, suggesting a relative change in control by the D1-direct versus D2-indirect pathways originating in the DMS may have been induced by METH-context exposure. To test this suggestion, we infused the adenosine 2A receptor antagonist ZM241385 into the DMS prior to test to reduce activity in D2 neurons relative to D1 neurons in the hope of reducing the inhibitory output from this region of the striatum. We found that this treatment fully restored sensitivity to negative feedback in a test conducted in the METH-paired context. These results suggest that drug exposure alters decision-making by downregulation of the circuitry mediating goal-directed action, an effect that can be ameliorated by acute A2A receptor inhibition in this circuit.
Collapse
Affiliation(s)
- Teri M. Furlong
- Brain & Mind Research Institute; University of Sydney; Australia
| | - Alva S.A. Supit
- Brain & Mind Research Institute; University of Sydney; Australia
| | | | | | | |
Collapse
|
41
|
Webber ES, Mankin DE, Cromwell HC. Striatal Activity and Reward Relativity: Neural Signals Encoding Dynamic Outcome Valuation. eNeuro 2016; 3:ENEURO. [PMID: 27822506 DOI: 10.1523/ENEURO.0022-16.2016] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2016] [Revised: 10/06/2016] [Accepted: 10/07/2016] [Indexed: 11/21/2022] Open
Abstract
The striatum is a key brain region involved in reward processing. Striatal activity has been linked to encoding reward magnitude and integrating diverse reward outcome information. Recent work has supported the involvement of striatum in the valuation of outcomes. The present work extends this idea by examining striatal activity during dynamic shifts in value that include different levels and directions of magnitude disparity. A novel task was used to produce diverse relative reward effects on a chain of instrumental action. Rats (Rattus norvegicus) were trained to respond to cues associated with specific outcomes varying by food pellet magnitude. Animals were exposed to single-outcome sessions followed by mixed-outcome sessions, and neural activity was compared among identical outcome trials from the different behavioral contexts. Results recording striatal activity show that neural responses to different task elements reflect incentive contrast as well as other relative effects that involve generalization between outcomes or possible influences of outcome variety. The activity that was most prevalent was linked to food consumption and post-food consumption periods. Relative encoding was sensitive to magnitude disparity. A within-session analysis showed strong contrast effects that were dependent upon the outcome received in the immediately preceding trial. Significantly higher numbers of responses were found in ventral striatum linked to relative outcome effects. Our results support the idea that relative value can incorporate diverse relationships, including comparisons from specific individual outcomes to general behavioral contexts. The striatum contains these diverse relative processes, possibly enabling both a higher information yield concerning value shifts and a greater behavioral flexibility.
Collapse
|
42
|
Jackson SAW, Horst NK, Pears A, Robbins TW, Roberts AC. Role of the Perigenual Anterior Cingulate and Orbitofrontal Cortex in Contingency Learning in the Marmoset. Cereb Cortex 2016; 26:3273-84. [PMID: 27130662 PMCID: PMC4898677 DOI: 10.1093/cercor/bhw067] [Citation(s) in RCA: 35] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
Two learning mechanisms contribute to decision-making: goal-directed actions and the "habit" system, by which action-outcome and stimulus-response associations are formed, respectively. Rodent lesion studies and human neuroimaging have implicated both the medial prefrontal cortex (mPFC) and the orbitofrontal cortex (OFC) in the neural basis of contingency learning, a critical component of goal-directed actions, though some published findings are conflicting. We sought to reconcile the existing literature by comparing the effects of excitotoxic lesions of the perigenual anterior cingulate cortex (pgACC), a region of the mPFC, and OFC on contingency learning in the marmoset monkey using a touchscreen-based paradigm, in which the contingent relationship between one of a pair of actions and its outcome was degraded selectively. Both the pgACC and OFC lesion groups were insensitive to the contingency degradation, whereas the control group demonstrated selectively higher performance of the nondegraded action when compared with the degraded action. These findings suggest the pgACC and OFC are both necessary for normal contingency learning and therefore goal-directed behavior.
Collapse
Affiliation(s)
- Stacey A. W. Jackson
- Department of Psychology
- Behavioural and Clinical Neuroscience Institute, University of Cambridge, Cambridge CB2 3EB, UK
| | - Nicole K. Horst
- Department of Psychology
- Behavioural and Clinical Neuroscience Institute, University of Cambridge, Cambridge CB2 3EB, UK
| | - Andrew Pears
- Behavioural and Clinical Neuroscience Institute, University of Cambridge, Cambridge CB2 3EB, UK
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge CB2 3DY, UK
| | - Trevor W. Robbins
- Department of Psychology
- Behavioural and Clinical Neuroscience Institute, University of Cambridge, Cambridge CB2 3EB, UK
| | - Angela C. Roberts
- Behavioural and Clinical Neuroscience Institute, University of Cambridge, Cambridge CB2 3EB, UK
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge CB2 3DY, UK
| |
Collapse
|
43
|
Parkes SL, Bradfield LA, Balleine BW. Interaction of insular cortex and ventral striatum mediates the effect of incentive memory on choice between goal-directed actions. J Neurosci 2015; 35:6464-71. [PMID: 25904797 DOI: 10.1523/JNEUROSCI.4153-14.2015] [Citation(s) in RCA: 60] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
The anterior insular cortex (IC) and the nucleus accumbens (NAc) core have been separately implicated in the selection and performance of actions based on the incentive value of the instrumental outcome. Here, we examined the role of connections between the IC and the NAc core in the performance of goal-directed actions. Rats were trained on two actions for distinct outcomes, after which one of the two outcomes was devalued by specific satiety immediately before a choice extinction test. We first confirmed the projection from the IC to the NAc core and then disconnected these structures via asymmetrical excitotoxic lesions before training. Contralateral, but not ipsilateral, disconnection of the IC and NAc core disrupted outcome devaluation. We hypothesized that communication between the IC and NAc core is necessary for the retrieval of incentive value at test. To test this, we infused the GABAA agonist muscimol into the IC and the μ-opioid receptor antagonist CTAP into the contralateral NAc before the choice extinction test. As expected, inactivation of the IC in one hemisphere and blocking μ-opioid receptors in the contralateral NAc core abolished outcome-selective devaluation. These results suggest that the IC and NAc core form part of a circuit mediating the retrieval of outcome values and the subsequent choice between goal-directed actions based on those values.
Collapse
|
44
|
Leung BK, Balleine BW. Ventral pallidal projections to mediodorsal thalamus and ventral tegmental area play distinct roles in outcome-specific Pavlovian-instrumental transfer. J Neurosci 2015; 35:4953-64. [PMID: 25810525 DOI: 10.1523/JNEUROSCI.4837-14.2015] [Citation(s) in RCA: 47] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
Outcome-specific Pavlovian-instrumental transfer (PIT) demonstrates the way that reward-related cues influence choice between instrumental actions. The nucleus accumbens shell (NAc-S) contributes critically to this effect, particularly through its output to the rostral medial ventral pallidum (VP-m). Using rats, we investigated in two experiments the role in the PIT effect of the two major outputs of this VP-m region innervated by the NAc-S, the mediodorsal thalamus (MD) and the ventral tegmental area (VTA). First, two retrograde tracers were injected into the MD and VTA to compare the neuronal activity of the two populations of projection neurons in the VP-m during PIT relative to controls. Second, the functional role of the connection between the VP-m and the MD or VTA was assessed using asymmetrical pharmacological manipulations before a PIT test. It was found that, whereas neurons in the VP-m projecting to the MD showed significantly more neuronal activation during PIT than those projecting to the VTA, neuronal activation of these latter neurons correlated with the size of the PIT effect. Disconnection of the two pathways during PIT also revealed different deficits in performance: disrupting the VP-m to MD pathway removed the response biasing effects of reward-related cues, whereas disrupting the VP-m to VTA pathway preserved the response bias but altered the overall rate of responding. The current results therefore suggest that the VP-m exerts distinct effects on the VTA and MD and that these latter structures mediate the motivational and cognitive components of specific PIT, respectively.
Collapse
|
45
|
Ruge H, Wolfensteller U. Distinct fronto-striatal couplings reveal the double-faced nature of response-outcome relations in instruction-based learning. Cogn Affect Behav Neurosci 2015; 15:349-64. [PMID: 25361755 PMCID: PMC4436102 DOI: 10.3758/s13415-014-0325-4] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Higher species commonly learn novel behaviors by evaluating retrospectively whether actions have yielded desirable outcomes. By relying on explicit behavioral instructions, only humans can use an acquisition shortcut that prospectively specifies how to yield intended outcomes under the appropriate stimulus conditions. A recent and largely unexplored hypothesis suggests that striatal areas interact with lateral prefrontal cortex (LPFC) when novel behaviors are learned via explicit instruction, and that regional subspecialization exists for the integration of differential response-outcome contingencies into the current task model. Behaviorally, outcome integration during instruction-based learning has been linked to functionally distinct performance indices. This includes (1) compatibility effects, measured in a postlearning test procedure probing the encoding strength of outcome-response (O-R) associations, and (2) increasing response slowing across learning, putatively indicating active usage of O-R associations for the online control of goal-directed action. In the present fMRI study, we examined correlations between these behavioral indices and the dynamics of fronto-striatal couplings in order to mutually constrain and refine the interpretation of neural and behavioral measures in terms of separable subprocesses during outcome integration. We found that O-R encoding strength correlated with LPFC-putamen coupling, suggesting that the putamen is relevant for the formation of both S-R habits and habit-like O-R associations. By contrast, response slowing as a putative index of active usage of O-R associations correlated with LPFC-caudate coupling. This finding highlights the relevance of the caudate for the online control of goal-directed action also under instruction-based learning conditions, and in turn clarifies the functional relevance of the behavioral slowing effect.
Collapse
Affiliation(s)
- Hannes Ruge
- Department of Psychology, Technische Universität Dresden, Dresden, Germany,
| | | |
Collapse
|
46
|
Shan Q, Ge M, Christie MJ, Balleine BW. The acquisition of goal-directed actions generates opposing plasticity in direct and indirect pathways in dorsomedial striatum. J Neurosci 2014; 34:9196-201. [PMID: 25009253 DOI: 10.1523/JNEUROSCI.0313-14.2014] [Citation(s) in RCA: 91] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
A cortical-basal ganglia network involving, particularly, the posterior region of dorsomedial striatum (DMS) has been implicated in the acquisition of goal-directed actions; however, no direct evidence of learning-related plasticity in this striatal region has been reported, nor is it known whether, or which, specific cell types are involved in this learning process. The striatum is primarily composed of two classes of spiny projection neurons (SPNs): the striatonigral and striatopallidal SPNs, which express dopamine D1 and D2 receptors, respectively. Here we establish that, in mice, the acquisition of goal-directed actions induced plasticity in both D1- and D2-SPNs specifically in the DMS and, importantly, that these changes were in opposing directions; after learning, AMPA/NMDA ratios were increased in D1-SPNs and reduced in the D2-SPNs in the DMS. Such opposing plasticity could provide the basis for rapidly rebiasing the control of task-specific actions, and its dysregulation could underlie disorders associated with striatal function.
Collapse
|
47
|
Laurent V, Wong FL, Balleine BW. δ-Opioid receptors in the accumbens shell mediate the influence of both excitatory and inhibitory predictions on choice. Br J Pharmacol 2014; 172:562-70. [PMID: 24758591 DOI: 10.1111/bph.12731] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2013] [Revised: 02/05/2014] [Accepted: 03/18/2014] [Indexed: 12/25/2022] Open
Abstract
BACKGROUND AND PURPOSE Stimuli that predict rewarding events can control choice between future actions, and this control could be mediated by δ-opioid receptors in the nucleus accumbens shell (NAc-S). Stimuli predicting the absence of important events can also guide choice, although it remains unknown whether they do so via changes in an accumbal δ-opioid receptor-related process. EXPERIMENTAL APPROACH δ-opioid receptor-eGFP mice were trained to perform two instrumental actions that delivered different food outcomes. Choice between the two actions was then tested in the presence of stimuli paired with either the delivery or the non-delivery of each of the two outcomes. Bilateral infusions of the δ-opioid receptor antagonist naltrindole into the NAc-S were used to determine the role of these receptors at the time of choice and δ-opioid receptor expression in the NAc-S used to assess functional activity. KEY RESULTS A stimulus predicting a specific outcome biased choice performance towards the action previously earning that same outcome. In contrast, a stimulus signalling the absence of that outcome biased performance away from the action that delivered that outcome towards actions associated with the absence of that outcome. Both effects were associated with increased δ-opioid receptor expression on the membrane of cholinergic interneurons within the NAc-S. Furthermore, both effects were blocked by naltrindole infused into the NAc-S. CONCLUSIONS AND IMPLICATIONS These findings suggest that δ-opioid receptors in the NAc-S were involved in the effects of predictive learning on choice between actions, whether those predictions involve the presence or absence of specific rewarding events. LINKED ARTICLES This article is part of a themed section on Opioids: New Pathways to Functional Selectivity. To view the other articles in this section visit http://dx.doi.org/10.1111/bph.2015.172.issue-2.
Collapse
Affiliation(s)
- Vincent Laurent
- Behavioural Neuroscience Laboratory, Brain and Mind Research Institute, The University of Sydney, Camperdown, NSW, 2006, Australia
| | | | | |
Collapse
|
48
|
Wolpe N, Wolpert DM, Rowe JB. Seeing what you want to see: priors for one's own actions represent exaggerated expectations of success. Front Behav Neurosci 2014; 8:232. [PMID: 25018710 PMCID: PMC4073091 DOI: 10.3389/fnbeh.2014.00232] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2014] [Accepted: 06/09/2014] [Indexed: 02/02/2023] Open
Abstract
People perceive the consequences of their own actions differently to how they perceive other sensory events. A large body of psychology research has shown that people also consistently overrate their own performance relative to others, yet little is known about how these "illusions of superiority" are normally maintained. Here we examined the visual perception of the sensory consequences of self-generated and observed goal-directed actions. Across a series of visuomotor tasks, we found that the perception of the sensory consequences of one's own actions is more biased toward success relative to the perception of observed actions. Using Bayesian models, we show that this bias could be explained by priors that represent exaggerated predictions of success. The degree of exaggeration of priors was unaffected by learning, but was correlated with individual differences in trait optimism. In contrast, when observing these actions, priors represented more accurate predictions of the actual performance. The results suggest that the brain internally represents optimistic predictions for one's own actions. Such exaggerated predictions bind the sensory consequences of our own actions with our intended goal, explaining how it is that when acting we tend to see what we want to see.
Collapse
Affiliation(s)
- Noham Wolpe
- Department of Clinical Neurosciences, University of CambridgeCambridge, UK,Medical Research Council Cognition and Brain Sciences UnitCambridge, UK,*Correspondence: Noham Wolpe, Department of Clinical Neurosciences, University of Cambridge, Herchel Smith Building, Cambridge CB2 0SZ, UK e-mail:
| | - Daniel M. Wolpert
- Computational and Biological Learning Laboratory, Department of Engineering, University of CambridgeCambridge, UK
| | - James B. Rowe
- Department of Clinical Neurosciences, University of CambridgeCambridge, UK,Medical Research Council Cognition and Brain Sciences UnitCambridge, UK
| |
Collapse
|
49
|
Griffiths KR, Morris RW, Balleine BW. Translational studies of goal-directed action as a framework for classifying deficits across psychiatric disorders. Front Syst Neurosci 2014; 8:101. [PMID: 24904322 PMCID: PMC4033402 DOI: 10.3389/fnsys.2014.00101] [Citation(s) in RCA: 83] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2014] [Accepted: 05/09/2014] [Indexed: 11/13/2022] Open
Abstract
The ability to learn contingencies between actions and outcomes in a dynamic environment is critical for flexible, adaptive behavior. Goal-directed actions adapt to changes in action-outcome contingencies as well as to changes in the reward-value of the outcome. When networks involved in reward processing and contingency learning are maladaptive, this fundamental ability can be lost, with detrimental consequences for decision-making. Impaired decision-making is a core feature in a number of psychiatric disorders, ranging from depression to schizophrenia. The argument can be developed, therefore, that seemingly disparate symptoms across psychiatric disorders can be explained by dysfunction within common decision-making circuitry. From this perspective, gaining a better understanding of the neural processes involved in goal-directed action, will allow a comparison of deficits observed across traditional diagnostic boundaries within a unified theoretical framework. This review describes the key processes and neural circuits involved in goal-directed decision-making using evidence from animal studies and human neuroimaging. Select studies are discussed to outline what we currently know about causal judgments regarding actions and their consequences, action-related reward evaluation, and, most importantly, how these processes are integrated in goal-directed learning and performance. Finally, we look at how adaptive decision-making is impaired across a range of psychiatric disorders and how deepening our understanding of this circuitry may offer insights into phenotypes and more targeted interventions.
Collapse
Affiliation(s)
- Kristi R Griffiths
- Behavioural Neuroscience Laboratory, Brain and Mind Research Institute, University of Sydney Camperdown, Sydney, NSW, Australia
| | - Richard W Morris
- Behavioural Neuroscience Laboratory, Brain and Mind Research Institute, University of Sydney Camperdown, Sydney, NSW, Australia
| | - Bernard W Balleine
- Behavioural Neuroscience Laboratory, Brain and Mind Research Institute, University of Sydney Camperdown, Sydney, NSW, Australia
| |
Collapse
|
50
|
Abstract
Our recent target article on the allocation of attention to action (herein called the AAA model; Franz, 2012) considered implicated subcortical processes and networks in people with intact corpus callosum (CC) and people without a CC due to commissurotomy or callosotomy. However, a small error in print—namely that the term “commissurotomy” was printed in place of “callosotomy” in some instances—led us to further explore whether any key functional roles have been attributed to the two primary cortical commissures (the anterior and posterior commissures) which remain intact in people with callosotomy, and if so, whether those would be relevant to our current AAA framework. Although existing evidence is sparse, here we consider the hypothesis that the anterior commissure (AC) is a remnant fiber tract which has been largely replaced with evolution of the CC (and we do not herein discuss the posterior commissure further). Indeed, a dearth of studies is available on the AC, calling the need for further research. Herein, we briefly review literature on the AC in humans and then propose a method that might be worthwhile to pursue in future studies.
Collapse
Affiliation(s)
- Taylor J Winter
- Action, Brain, and Cognition Laboratory and fMRIOtago, Division of Science, Department of Psychology, University of Otago Dunedin, New Zealand
| | - Elizabeth A Franz
- Action, Brain, and Cognition Laboratory and fMRIOtago, Division of Science, Department of Psychology, University of Otago Dunedin, New Zealand
| |
Collapse
|