1
|
Cheng Y, Magnard R, Langdon AJ, Lee D, Janak PH. Chronic Ethanol Exposure Produces Persistent Impairment in Cognitive Flexibility and Decision Signals in the Striatum. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.03.10.584332. [PMID: 38585868 PMCID: PMC10996555 DOI: 10.1101/2024.03.10.584332] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/09/2024]
Abstract
Lack of cognitive flexibility is a hallmark of substance use disorders and has been associated with drug-induced synaptic plasticity in the dorsomedial striatum (DMS). Yet the possible impact of altered plasticity on real-time striatal neural dynamics during decision-making is unclear. Here, we identified persistent impairments induced by chronic ethanol (EtOH) exposure on cognitive flexibility and striatal decision signals. After a substantial withdrawal period from prior EtOH vapor exposure, male, but not female, rats exhibited reduced adaptability and exploratory behavior during a dynamic decision-making task. Reinforcement learning models showed that prior EtOH exposure enhanced learning from rewards over omissions. Notably, neural signals in the DMS related to the decision outcome were enhanced, while those related to choice and choice-outcome conjunction were reduced, in EtOH-treated rats compared to the controls. These findings highlight the profound impact of chronic EtOH exposure on adaptive decision-making, pinpointing specific changes in striatal representations of actions and outcomes as underlying mechanisms for cognitive deficits.
Collapse
Affiliation(s)
- Yifeng Cheng
- Department Psychological and Brain Sciences, Krieger School of Arts and Sciences, Johns Hopkins University, Baltimore, MD
| | - Robin Magnard
- Department Psychological and Brain Sciences, Krieger School of Arts and Sciences, Johns Hopkins University, Baltimore, MD
| | - Angela J. Langdon
- Intramural Research Program, National Institute of Mental Health, National Institutes of Health, Bethesda, MD
| | - Daeyeol Lee
- Department Psychological and Brain Sciences, Krieger School of Arts and Sciences, Johns Hopkins University, Baltimore, MD
- Zanvyl Krieger Mind/Brain Institute, Krieger School of Arts and Sciences, Johns Hopkins University, Baltimore, MD
- Department of Neuroscience, Johns Hopkins University School of Medicine, Baltimore, MD
- Kavli Neuroscience Discovery Institute, Johns Hopkins University, Baltimore, MD
| | - Patricia H. Janak
- Department Psychological and Brain Sciences, Krieger School of Arts and Sciences, Johns Hopkins University, Baltimore, MD
- Department of Neuroscience, Johns Hopkins University School of Medicine, Baltimore, MD
- Kavli Neuroscience Discovery Institute, Johns Hopkins University, Baltimore, MD
| |
Collapse
|
2
|
Brown LS, Cho JR, Bolkan SS, Nieh EH, Schottdorf M, Tank DW, Brody CD, Witten IB, Goldman MS. Neural circuit models for evidence accumulation through choice-selective sequences. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.09.01.555612. [PMID: 38234715 PMCID: PMC10793437 DOI: 10.1101/2023.09.01.555612] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/19/2024]
Abstract
Decision making is traditionally thought to be mediated by populations of neurons whose firing rates persistently accumulate evidence across time. However, recent decision-making experiments in rodents have observed neurons across the brain that fire sequentially as a function of spatial position or time, rather than persistently, with the subset of neurons in the sequence depending on the animal's choice. We develop two new candidate circuit models, in which evidence is encoded either in the relative firing rates of two competing chains of neurons or in the network location of a stereotyped pattern ("bump") of neural activity. Encoded evidence is then faithfully transferred between neuronal populations representing different positions or times. Neural recordings from four different brain regions during a decision-making task showed that, during the evidence accumulation period, different brain regions displayed tuning curves consistent with different candidate models for evidence accumulation. This work provides mechanistic models and potential neural substrates for how graded-value information may be precisely accumulated within and transferred between neural populations, a set of computations fundamental to many cognitive operations.
Collapse
|
3
|
Parker NF, Baidya A, Cox J, Haetzel LM, Zhukovskaya A, Murugan M, Engelhard B, Goldman MS, Witten IB. Choice-selective sequences dominate in cortical relative to thalamic inputs to NAc to support reinforcement learning. Cell Rep 2022; 39:110756. [PMID: 35584665 PMCID: PMC9218875 DOI: 10.1016/j.celrep.2022.110756] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2019] [Revised: 02/18/2022] [Accepted: 04/07/2022] [Indexed: 11/25/2022] Open
Abstract
How are actions linked with subsequent outcomes to guide choices? The nucleus accumbens, which is implicated in this process, receives glutamatergic inputs from the prelimbic cortex and midline regions of the thalamus. However, little is known about whether and how representations differ across these input pathways. By comparing these inputs during a reinforcement learning task in mice, we discovered that prelimbic cortical inputs preferentially represent actions and choices, whereas midline thalamic inputs preferentially represent cues. Choice-selective activity in the prelimbic cortical inputs is organized in sequences that persist beyond the outcome. Through computational modeling, we demonstrate that these sequences can support the neural implementation of reinforcement-learning algorithms, in both a circuit model based on synaptic plasticity and one based on neural dynamics. Finally, we test and confirm a prediction of our circuit models by direct manipulation of nucleus accumbens input neurons.
Collapse
Affiliation(s)
- Nathan F Parker
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ 08544, USA
| | - Avinash Baidya
- Center for Neuroscience, University of California, Davis, Davis, CA 95616, USA; Department of Physics and Astronomy, University of California, Davis, Davis, CA 95616, USA
| | - Julia Cox
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ 08544, USA; Department of Neuroscience, Feinberg School of Medicine, Northwestern University, Chicago, IL 60611, USA
| | - Laura M Haetzel
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ 08544, USA
| | - Anna Zhukovskaya
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ 08544, USA
| | - Malavika Murugan
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ 08544, USA
| | - Ben Engelhard
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ 08544, USA
| | - Mark S Goldman
- Center for Neuroscience, University of California, Davis, Davis, CA 95616, USA; Department of Neurobiology, Physiology and Behavior, University of California, Davis, Davis, CA 95616, USA; Department of Ophthalmology and Vision Science, University of California, Davis, Davis, CA 95616, USA.
| | - Ilana B Witten
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ 08544, USA; Department of Psychology, Princeton University, Princeton, NJ 08544, USA.
| |
Collapse
|
4
|
K Namboodiri VM, Stuber GD. The learning of prospective and retrospective cognitive maps within neural circuits. Neuron 2021; 109:3552-3575. [PMID: 34678148 PMCID: PMC8809184 DOI: 10.1016/j.neuron.2021.09.034] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2021] [Revised: 08/26/2021] [Accepted: 09/16/2021] [Indexed: 11/18/2022]
Abstract
Brain circuits are thought to form a "cognitive map" to process and store statistical relationships in the environment. A cognitive map is commonly defined as a mental representation that describes environmental states (i.e., variables or events) and the relationship between these states. This process is commonly conceptualized as a prospective process, as it is based on the relationships between states in chronological order (e.g., does reward follow a given state?). In this perspective, we expand this concept on the basis of recent findings to postulate that in addition to a prospective map, the brain forms and uses a retrospective cognitive map (e.g., does a given state precede reward?). In doing so, we demonstrate that many neural signals and behaviors (e.g., habits) that seem inflexible and non-cognitive can result from retrospective cognitive maps. Together, we present a significant conceptual reframing of the neurobiological study of associative learning, memory, and decision making.
Collapse
Affiliation(s)
- Vijay Mohan K Namboodiri
- Department of Neurology, Center for Integrative Neuroscience, Kavli Institute for Fundamental Neuroscience, Neuroscience Graduate Program, University of California, San Francisco, San Francisco, CA 94158, USA.
| | - Garret D Stuber
- Center for the Neurobiology of Addiction, Pain, and Emotion, Department of Anesthesiology and Pain Medicine, Department of Pharmacology, Neuroscience Graduate Program, University of Washington, Seattle, WA 98195, USA.
| |
Collapse
|
5
|
Cunningham PJ, Regier PS, Redish AD. Dorsolateral Striatal Task-initiation Bursts Represent Past Experiences More than Future Action Plans. J Neurosci 2021; 41:8051-8064. [PMID: 34376584 PMCID: PMC8460149 DOI: 10.1523/jneurosci.3080-20.2021] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2020] [Revised: 07/08/2021] [Accepted: 08/04/2021] [Indexed: 11/21/2022] Open
Abstract
The dorsolateral striatum (DLS) is involved in learning and executing procedural actions. Cell ensembles in the DLS, but not the dorsomedial striatum (DMS), exhibit a burst of firing at the start of a well-learned action sequence ("task-bracketing"). However, it is currently unclear what information is contained in these bursts. Some theories suggest that these bursts should represent the procedural action sequence itself (that they should be about future action chains), whereas others suggest that they should contain representations of the current state of the world, taking into account primarily past information. In addition, the DLS local field potential shows transient bursts of power in the 50 Hz range (γ50) around the time a learned action sequence is initiated. However, it is currently unknown how bursts of activity in DLS cell ensembles and bursts of γ50 power in the DLS local field potential are related to each other. We found that DLS bursts at lap initiation in rats represented recently experienced reward locations more than future procedural actions, indicating that task-initiation DLS bursts contain primarily retrospective, rather than prospective, information to guide procedural actions. Furthermore, representations of past reward locations increased during periods of increased γ50 power in the DLS. There was no evidence of task-initiation bursts, increased γ50 power, or retrospective reward location information in the neighboring dorsomedial striatum. These data support a role for the DLS in model-free theories of procedural decision-making over planned action-chain theories, suggesting that procedural actions derive from representations of the current and recent past.SIGNIFICANCE STATEMENT While it is well-established that the dorsolateral striatum (DLS) plays a critical role in procedural decision-making, open questions remain about the kinds of representations contained in DLS ensemble activity that guide procedural actions. We found that DLS, but not DMS, cell ensembles contained nonlocal representations of past reward locations that appear moments before task-initiation DLS bursts. These retrospective representations were temporally linked to a rise in γ50 power that also preceded the characteristic DLS burst at task-initiation. These results support models of procedural decision-making based on associations between available actions and the current state of the world over models based on planning over action-chains.
Collapse
Affiliation(s)
- Paul J Cunningham
- Department of Neuroscience, University of Minnesota, Minneapolis MN 55455
| | - Paul S Regier
- Department of Psychiatry, University of Pennsylvania Philadelphia PA 19104
| | - A David Redish
- Department of Neuroscience, University of Minnesota, Minneapolis MN 55455
| |
Collapse
|
6
|
Yu X, Chen S, Shan Q. Depression in the Direct Pathway of the Dorsomedial Striatum Permits the Formation of Habitual Action. Cereb Cortex 2021; 31:3551-3564. [PMID: 33774666 DOI: 10.1093/cercor/bhab031] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2020] [Revised: 01/16/2021] [Accepted: 01/26/2021] [Indexed: 02/05/2023] Open
Abstract
In order to achieve optimal outcomes in an ever-changing environment, humans and animals generally manage their action control via either goal-directed action or habitual action. These two action strategies are thought to be encoded in distinct parallel circuits in the dorsal striatum, specifically, the posterior dorsomedial striatum (DMS) and the dorsolateral striatum (DLS), respectively. The striatum is primarily composed of two subtypes of medium spiny neurons (MSNs): the direct-pathway striatonigral and the indirect-pathway striatopallidal MSNs. MSN-subtype-specific synaptic plasticity in the DMS and the DLS has been revealed to underlie goal-directed action and habitual action, respectively. However, whether any MSN-subtype-specific synaptic plasticity in the DMS is associated with habitual action, and if so, whether the synaptic plasticity affects the formation of habitual action, are not known. This study demonstrates that postsynaptic depression in the excitatory synapses of the direct-pathway striatonigral MSNs in the DMS is formed after habit learning. Moreover, chemogenetically rescuing this depression compromises the acquisition, but not the expression, of habitual action. These findings reveal that an MSN-subtype-specific synaptic plasticity in the DMS affects habitual action and suggest that plasticity in the DMS as well as in the DLS contributes to the formation of habitual action.
Collapse
Affiliation(s)
- Xiaoxuan Yu
- Laboratory for Synaptic Plasticity, Shantou University Medical College, Shantou, Guangdong 515041, China
| | - Shijie Chen
- Laboratory for Synaptic Plasticity, Shantou University Medical College, Shantou, Guangdong 515041, China
| | - Qiang Shan
- Laboratory for Synaptic Plasticity, Shantou University Medical College, Shantou, Guangdong 515041, China
| |
Collapse
|
7
|
Shin EJ, Jang Y, Kim S, Kim H, Cai X, Lee H, Sul JH, Lee SH, Chung Y, Lee D, Jung MW. Robust and distributed neural representation of action values. eLife 2021; 10:53045. [PMID: 33876728 PMCID: PMC8104958 DOI: 10.7554/elife.53045] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2019] [Accepted: 04/19/2021] [Indexed: 11/13/2022] Open
Abstract
Studies in rats, monkeys, and humans have found action-value signals in multiple regions of the brain. These findings suggest that action-value signals encoded in these brain structures bias choices toward higher expected rewards. However, previous estimates of action-value signals might have been inflated by serial correlations in neural activity and also by activity related to other decision variables. Here, we applied several statistical tests based on permutation and surrogate data to analyze neural activity recorded from the striatum, frontal cortex, and hippocampus. The results show that previously identified action-value signals in these brain areas cannot be entirely accounted for by concurrent serial correlations in neural activity and action value. We also found that neural activity related to action value is intermixed with signals related to other decision variables. Our findings provide strong evidence for broadly distributed neural signals related to action value throughout the brain.
Collapse
Affiliation(s)
- Eun Ju Shin
- Center for Synaptic Brain Dysfunctions, Institute for Basic Science, Daejeon, Republic of Korea.,Department of Biological Sciences, Korea Advanced Institute of Science and Technology, Daejeon, Republic of Korea
| | - Yunsil Jang
- Center for Synaptic Brain Dysfunctions, Institute for Basic Science, Daejeon, Republic of Korea.,Department of Biological Sciences, Korea Advanced Institute of Science and Technology, Daejeon, Republic of Korea
| | - Soyoun Kim
- Center for Neuroscience Imaging Research, Institute for Basic Science, Suwon, Republic of Korea
| | - Hoseok Kim
- Department of Neuroscience, Biomedicum, Karolinska Institutet, Stockholm, Sweden
| | - Xinying Cai
- New York University Shanghai, NYU-ECNU Institute of Brain and Cognitive Science at NYU Shanghai, and Shanghai Key Laboratory of Brain Functional Genomics (Ministry of Education), School of Psychology and Cognitive Science, East China Normal University, Shanghai, China
| | - Hyunjung Lee
- Department of Anatomy, Kyungpook National University School of Medicine, Daegu, Republic of Korea
| | - Jung Hoon Sul
- Center for Synaptic Brain Dysfunctions, Institute for Basic Science, Daejeon, Republic of Korea
| | - Sung-Hyun Lee
- Neuroscience Graduate Program, Ajou University School of Medicine, Suwon, Republic of Korea
| | - Yeonseung Chung
- Department of Mathematical Sciences, Korea Advanced Institute of Science and Technology, Daejeon, Republic of Korea
| | - Daeyeol Lee
- The Zanvyl Krieger Mind/Brain Institute, Kavli Neuroscience Discovery Institute, Department of Neuroscience, and Department of Psychological and Brain Sciences, Johns Hopkins University, Baltimore, United States
| | - Min Whan Jung
- Center for Synaptic Brain Dysfunctions, Institute for Basic Science, Daejeon, Republic of Korea.,Department of Biological Sciences, Korea Advanced Institute of Science and Technology, Daejeon, Republic of Korea
| |
Collapse
|
8
|
Bari BA, Cohen JY. Dynamic decision making and value computations in medial frontal cortex. INTERNATIONAL REVIEW OF NEUROBIOLOGY 2021; 158:83-113. [PMID: 33785157 DOI: 10.1016/bs.irn.2020.12.001] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
Dynamic decision making requires an intact medial frontal cortex. Recent work has combined theory and single-neuron measurements in frontal cortex to advance models of decision making. We review behavioral tasks that have been used to study dynamic decision making and algorithmic models of these tasks using reinforcement learning theory. We discuss studies linking neurophysiology and quantitative decision variables. We conclude with hypotheses about the role of other cortical and subcortical structures in dynamic decision making, including ascending neuromodulatory systems.
Collapse
Affiliation(s)
- Bilal A Bari
- The Solomon H. Snyder Department of Neuroscience, Brain Science Institute, Kavli Neuroscience Discovery Institute, Johns Hopkins University, Baltimore, MD, United States
| | - Jeremiah Y Cohen
- The Solomon H. Snyder Department of Neuroscience, Brain Science Institute, Kavli Neuroscience Discovery Institute, Johns Hopkins University, Baltimore, MD, United States.
| |
Collapse
|
9
|
Active maintenance of eligibility trace in rodent prefrontal cortex. Sci Rep 2020; 10:18860. [PMID: 33139778 PMCID: PMC7608665 DOI: 10.1038/s41598-020-75820-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2020] [Accepted: 09/29/2020] [Indexed: 12/05/2022] Open
Abstract
Even though persistent neural activity has been proposed as a mechanism for maintaining eligibility trace, direct empirical evidence for active maintenance of eligibility trace has been lacking. We recorded neuronal activity in the medial prefrontal cortex (mPFC) in rats performing a dynamic foraging task in which a choice must be remembered until its outcome on the timescale of seconds for correct credit assignment. We found that mPFC neurons maintain significant choice signals during the time period between action selection and choice outcome. We also found that neural signals for choice, outcome, and action value converge in the mPFC when choice outcome was revealed. Our results indicate that the mPFC maintains choice signals necessary for temporal credit assignment in the form of persistent neural activity in our task. They also suggest that the mPFC might update action value by combining actively maintained eligibility trace with action value and outcome signals.
Collapse
|
10
|
Complex Movement Control in a Rat Model of Parkinsonian Falls: Bidirectional Control by Striatal Cholinergic Interneurons. J Neurosci 2020; 40:6049-6067. [PMID: 32554512 DOI: 10.1523/jneurosci.0220-20.2020] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2020] [Revised: 05/10/2020] [Accepted: 05/15/2020] [Indexed: 01/18/2023] Open
Abstract
Older persons and, more severely, persons with Parkinson's disease (PD) exhibit gait dysfunction, postural instability and a propensity for falls. These dopamine (DA) replacement-resistant symptoms are associated with losses of basal forebrain and striatal cholinergic neurons, suggesting that falls reflect disruption of the corticostriatal transfer of movement-related cues and their striatal integration with movement sequencing. To advance a rodent model of the complex movement deficits of Parkinsonian fallers, here we first demonstrated that male and female rats with dual cortical cholinergic and striatal DA losses (DL rats) exhibit cued turning deficits, modeling the turning deficits seen in these patients. As striatal cholinergic interneurons (ChIs) are positioned to integrate movement cues with gait, and as ChI loss has been associated with falls in PD, we next used this task, as well as a previously established task used to reveal heightened fall rates in DL rats, to broadly test the role of ChIs. Chemogenetic inhibition of ChIs in otherwise intact male and female rats caused cued turning deficits and elevated fall rates. Spontaneous turning was unaffected. Furthermore, chemogenetic stimulation of ChIs in DL rats reduced fall rates and restored cued turning performance. Stimulation of ChIs was relatively more effective in rats with viral transfection spaces situated lateral to the DA depletion areas in the dorsomedial striatum. These results indicate that striatal ChIs are essential for the control of complex movements, and they suggest a therapeutic potential of stimulation of ChIs to restore gait and balance, and to prevent falls in PD.SIGNIFICANCE STATEMENT In persons with Parkinson's disease, gait dysfunction and the associated risk for falls do not benefit from dopamine replacement therapy and often result in long-term hospitalization and nursing home placement. Here, we first validated a new task to demonstrate impairments in cued turning behavior in rodents modeling the cholinergic-dopaminergic losses observed in Parkinsonian fallers. We then demonstrated the essential role of striatal cholinergic interneurons for turning behavior as well as for traversing dynamic surfaces and avoiding falls. Stimulation of these interneurons in the rat model rescued turning performance and reduced fall rates. Our findings indicate the feasibility of investigating the neuronal circuitry underling complex movement control in rodents, and that striatal cholinergic interneurons are an essential node of such circuitry.
Collapse
|
11
|
Gong WK, Ni J, Yu LF, Wang L, Huang ZL. Temporal dynamics of Arc/Arg3.1 expression in the dorsal striatum during acquisition and consolidation of a motor skill in mice. Neurobiol Learn Mem 2020; 168:107156. [PMID: 31904548 DOI: 10.1016/j.nlm.2019.107156] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2019] [Revised: 12/15/2019] [Accepted: 12/31/2019] [Indexed: 12/20/2022]
Abstract
Region- and pathway-specific plasticity within striatal circuits is critically involved in the acquisition and long-term retention of a new motor skill as it becomes automatized. However, the molecular substrates contributing to this plasticity remain unclear. Here, we examined the expression of the activity-regulated cytoskeleton-associated protein (Arc) in the associative or dorsomedial striatum (DMS) and the sensorimotor or dorsolateral striatum (DLS), as well as in striatonigral and striatopallidal neurons, during different skill learning phases in the accelerating rotarod task. We found that Arc was mainly expressed in the DMS during early motor learning and progressively increased in the DLS during gradual motor skill consolidation. Moreover, Arc was preferentially expressed in striatopallidal neurons early in training and gradually increased in striatonigral neurons later in training. These data demonstrate that in the dorsal striatum, the expression of Arc exhibits a region- and cell-specific transfer during the learning of a motor skill, suggesting a link between striatal Arc expression and motor skill learning in mice.
Collapse
Affiliation(s)
- Wan-Kun Gong
- State Key Laboratory of Medical Neurobiology and MOE Frontiers Center for Brain Science, Institutes of Brain Science, and Department of Pharmacology, School of Basic Medical Sciences, Fudan University, Shanghai 200032, China
| | - Jian Ni
- State Key Laboratory of Medical Neurobiology and MOE Frontiers Center for Brain Science, Institutes of Brain Science, and Department of Pharmacology, School of Basic Medical Sciences, Fudan University, Shanghai 200032, China
| | - Lan-Fang Yu
- State Key Laboratory of Medical Neurobiology and MOE Frontiers Center for Brain Science, Institutes of Brain Science, and Department of Pharmacology, School of Basic Medical Sciences, Fudan University, Shanghai 200032, China
| | - Lu Wang
- State Key Laboratory of Medical Neurobiology and MOE Frontiers Center for Brain Science, Institutes of Brain Science, and Department of Pharmacology, School of Basic Medical Sciences, Fudan University, Shanghai 200032, China.
| | - Zhi-Li Huang
- State Key Laboratory of Medical Neurobiology and MOE Frontiers Center for Brain Science, Institutes of Brain Science, and Department of Pharmacology, School of Basic Medical Sciences, Fudan University, Shanghai 200032, China.
| |
Collapse
|
12
|
No substantial change in the balance between model-free and model-based control via training on the two-step task. PLoS Comput Biol 2019; 15:e1007443. [PMID: 31725719 PMCID: PMC6855413 DOI: 10.1371/journal.pcbi.1007443] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2019] [Accepted: 09/26/2019] [Indexed: 11/20/2022] Open
Abstract
Human decisions can be habitual or goal-directed, also known as model-free (MF) or model-based (MB) control. Previous work suggests that the balance between the two decision systems is impaired in psychiatric disorders such as compulsion and addiction, via overreliance on MF control. However, little is known whether the balance can be altered through task training. Here, 20 healthy participants performed a well-established two-step task that differentiates MB from MF control, across five training sessions. We used computational modelling and functional near-infrared spectroscopy to assess changes in decision-making and brain hemodynamic over time. Mixed-effects modelling revealed overall no substantial changes in MF and MB behavior across training. Although our behavioral and brain findings show task-induced changes in learning rates, these parameters have no direct relation to either MF or MB control or the balance between the two systems, and thus do not support the assumption of training effects on MF or MB strategies. Our findings indicate that training on the two-step paradigm in its current form does not support a shift in the balance between MF and MB control. We discuss these results with respect to implications for restoring the balance between MF and MB control in psychiatric conditions. Psychiatric conditions such as compulsion or addiction are associated with an overreliance on habitual, or model-free, decision-making. Goal-directed, or model-based, decision-making may protect against such overreliance. We therefore asked whether model-free control could be reduced, and model-based control strengthened, via task training. We used the well-characterized two-step task that differentiates model-based from model-free actions. Our results suggest that training on the current form of the two-step task does not support a shift in the balance between model-free and model-based strategies. Factors such as devaluation, demotivation or automatization during training may play a role in the missing emergence of a training effect. Future studies could adapt the two-step task so as to separate such factors from decision-making strategies.
Collapse
|
13
|
Siniscalchi MJ, Wang H, Kwan AC. Enhanced Population Coding for Rewarded Choices in the Medial Frontal Cortex of the Mouse. Cereb Cortex 2019; 29:4090-4106. [PMID: 30615132 PMCID: PMC6735259 DOI: 10.1093/cercor/bhy292] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2018] [Revised: 10/26/2018] [Accepted: 10/31/2018] [Indexed: 12/29/2022] Open
Abstract
Instrumental behavior is characterized by the selection of actions based on the degree to which they lead to a desired outcome. However, we lack a detailed understanding of how rewarded actions are reinforced and preferentially implemented. In rodents, the medial frontal cortex is hypothesized to play an important role in this process, based in part on its capacity to encode chosen actions and their outcomes. We therefore asked how neural representations of choice and outcome might interact to facilitate instrumental behavior. To investigate this question, we imaged neural ensemble activity in layer 2/3 of the secondary motor region (M2) while mice engaged in a two-choice auditory discrimination task with probabilistic outcomes. Correct choices could result in one of three reward amounts (single, double or omitted reward), which allowed us to measure neural and behavioral effects of reward magnitude, as well as its categorical presence or absence. Single-unit and population decoding analyses revealed a consistent influence of outcome on choice signals in M2. Specifically, rewarded choices were more robustly encoded relative to unrewarded choices, with little dependence on the exact magnitude of reinforcement. Our results provide insight into the integration of past choices and outcomes in the rodent brain during instrumental behavior.
Collapse
Affiliation(s)
- Michael J Siniscalchi
- Interdepartmental Neuroscience Program, Yale University School of Medicine, New Haven, CT, USA
| | - Hongli Wang
- Interdepartmental Neuroscience Program, Yale University School of Medicine, New Haven, CT, USA
| | - Alex C Kwan
- Interdepartmental Neuroscience Program, Yale University School of Medicine, New Haven, CT, USA
- Department of Psychiatry, Yale University School of Medicine, New Haven, CT, USA
- Department of Neuroscience, Yale University School of Medicine, New Haven, CT, USA
| |
Collapse
|
14
|
Distributed Encoding of Reinforcement in Rat Cortico-Striatal-Limbic Networks. Neuroscience 2019; 413:169-182. [PMID: 31229632 DOI: 10.1016/j.neuroscience.2019.06.019] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2019] [Revised: 05/24/2019] [Accepted: 06/12/2019] [Indexed: 11/22/2022]
Abstract
Decision-making in the mammalian brain typically involves multiple brain structures within the midbrain, thalamus, striatum, limbic system, and cortex. Although task specific contributions of each brain region have been identified, neurons responding to reinforcement have been found throughout these structures. We sought to determine if any brain area, or cluster of areas, are the source of information, and if the fidelity of information varies among the areas. We recorded simultaneous field potentials (FPs) in rats from seven brain regions as they completed a binary choice task. The FPs of a 0.5 s window following reinforcement were given as input to a classifier that attempted to predict whether or not the rat received reward on each trial. The classifier correctly categorized reward on 77% of trials. Any region-specific signal could be omitted without lowering accuracy. Frequencies above 40 Hz and signals recorded later than 0.25 s following reinforcement were necessary to achieve this accuracy. Further, the classifier was able to predict reinforcement outcome above chance levels when using FPs from any single recorded brain region. Some combinations of structures, however, were more predictive than others. Analysis of FPs prior to reward revealed most regions reflected the prior probability of reward. Lastly, analyses of information flow suggested reinforcement information does not originate within a single structure of the network, within the resolution afforded by FP recordings. These data suggest reward delivery information is rapidly distributed non-uniformly across the network, and there is no canonical flow of information about reward events in the recorded structures.
Collapse
|
15
|
Bari BA, Grossman CD, Lubin EE, Rajagopalan AE, Cressy JI, Cohen JY. Stable Representations of Decision Variables for Flexible Behavior. Neuron 2019; 103:922-933.e7. [PMID: 31280924 DOI: 10.1016/j.neuron.2019.06.001] [Citation(s) in RCA: 70] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2018] [Revised: 05/03/2019] [Accepted: 05/31/2019] [Indexed: 12/25/2022]
Abstract
Decisions occur in dynamic environments. In the framework of reinforcement learning, the probability of performing an action is influenced by decision variables. Discrepancies between predicted and obtained rewards (reward prediction errors) update these variables, but they are otherwise stable between decisions. Although reward prediction errors have been mapped to midbrain dopamine neurons, it is unclear how the brain represents decision variables themselves. We trained mice on a dynamic foraging task in which they chose between alternatives that delivered reward with changing probabilities. Neurons in the medial prefrontal cortex, including projections to the dorsomedial striatum, maintained persistent firing rate changes over long timescales. These changes stably represented relative action values (to bias choices) and total action values (to bias response times) with slow decay. In contrast, decision variables were weakly represented in the anterolateral motor cortex, a region necessary for generating choices. Thus, we define a stable neural mechanism to drive flexible behavior.
Collapse
Affiliation(s)
- Bilal A Bari
- The Solomon H. Snyder Department of Neuroscience, Brain Science Institute, Kavli Neuroscience Discovery Institute, The Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA
| | - Cooper D Grossman
- The Solomon H. Snyder Department of Neuroscience, Brain Science Institute, Kavli Neuroscience Discovery Institute, The Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA
| | - Emily E Lubin
- The Solomon H. Snyder Department of Neuroscience, Brain Science Institute, Kavli Neuroscience Discovery Institute, The Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA
| | - Adithya E Rajagopalan
- The Solomon H. Snyder Department of Neuroscience, Brain Science Institute, Kavli Neuroscience Discovery Institute, The Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA
| | - Jianna I Cressy
- The Solomon H. Snyder Department of Neuroscience, Brain Science Institute, Kavli Neuroscience Discovery Institute, The Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA
| | - Jeremiah Y Cohen
- The Solomon H. Snyder Department of Neuroscience, Brain Science Institute, Kavli Neuroscience Discovery Institute, The Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA.
| |
Collapse
|
16
|
Short-Term Influence of Recent Trial History on Perceptual Choice Changes with Stimulus Strength. Neuroscience 2019; 409:1-15. [PMID: 30986438 DOI: 10.1016/j.neuroscience.2019.04.010] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2018] [Revised: 04/02/2019] [Accepted: 04/03/2019] [Indexed: 12/19/2022]
Abstract
Perceptual decisions, especially for difficult stimuli, can be influenced by choices and outcomes in previous trials. However, it is not well understood how stimulus strength modulates the temporal characteristics as well as the magnitude of trial history influence. We addressed this question using a contrast detection task in freely moving mice. We found that, at lower as compared to higher stimulus contrast, the current choice of the mice was more influenced by choices and outcomes in the past trials and the influence emerged from a longer history. To examine the neural basis of stimulus strength-dependent history influence, we recorded from the secondary motor cortex (M2), a prefrontal region that plays an important role in cue-guided actions and memory-guided behaviors. We found that more M2 neurons conveyed information about choices on the past two trials at lower than at higher contrast. Furthermore, history-trial activity in M2 was important for decoding upcoming choice at low contrast. Thus, trial history influence of perceptual choice is adaptive to the strength of sensory evidence, which may be important for action selection in a dynamic environment.
Collapse
|
17
|
Railo H, Olkoniemi H, Eeronheimo E, Pääkkönen O, Joutsa J, Kaasinen V. Dopamine and eye movement control in Parkinson's disease: deficits in corollary discharge signals? PeerJ 2018; 6:e6038. [PMID: 30568856 PMCID: PMC6287583 DOI: 10.7717/peerj.6038] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2018] [Accepted: 10/30/2018] [Indexed: 11/20/2022] Open
Abstract
Movement in Parkinson's disease (PD) is fragmented, and the patients depend on visual information in their behavior. This suggests that the patients may have deficits in internally monitoring their own movements. Internal monitoring of movements is assumed to rely on corollary discharge signals that enable the brain to predict the sensory consequences of actions. We studied early-stage PD patients (N = 14), and age-matched healthy control participants (N = 14) to examine whether PD patients reveal deficits in updating their sensory representations after eye movements. The participants performed a double-saccade task where, in order to accurately fixate a second target, the participant must correct for the displacement caused by the first saccade. In line with previous reports, the patients had difficulties in fixating the second target when the eye movement was performed without visual guidance. Furthermore, the patients had difficulties in taking into account the error in the first saccade when making a saccade toward the second target, especially when eye movements were made toward the side with dominant motor symptoms. Across PD patients, the impairments in saccadic eye movements correlated with the integrity of the dopaminergic system as measured with [123I]FP-CIT SPECT: Patients with lower striatal (caudate, anterior putamen, and posterior putamen) dopamine transporter binding made larger errors in saccades. This effect was strongest when patients made memory-guided saccades toward the second target. Our results provide tentative evidence that the motor deficits in PD may be partly due to deficits in internal monitoring of movements.
Collapse
Affiliation(s)
- Henry Railo
- Department of Clinical Neurophysiology, University of Turku, Turku, Finland
- Turku Brain and Mind Centre, University of Turku, Turku, Finland
- Department of Psychology, University of Turku, Turku, Finland
| | - Henri Olkoniemi
- Department of Psychology, University of Turku, Turku, Finland
| | - Enni Eeronheimo
- Department of Psychology, University of Turku, Turku, Finland
| | - Oona Pääkkönen
- Department of Psychology, University of Turku, Turku, Finland
| | - Juho Joutsa
- Athinoula A. Martinos Center for Biomedical Imaging, Harvard Medical School, Charlestown, MA, USA
- Berenson-Allen Center for Noninvasive Brain Stimulation, Harvard Medical School, Charlestown, MA, USA
- Department of Neurology, University of Turku, Turku, Finland
- Division of Clinical Neurosciences, Turku University Hospital, Turku, Finland
| | - Valtteri Kaasinen
- Turku Brain and Mind Centre, University of Turku, Turku, Finland
- Department of Neurology, University of Turku, Turku, Finland
- Division of Clinical Neurosciences, Turku University Hospital, Turku, Finland
| |
Collapse
|
18
|
Jung MW, Lee H, Jeong Y, Lee JW, Lee I. Remembering rewarding futures: A simulation-selection model of the hippocampus. Hippocampus 2018; 28:913-930. [PMID: 30155938 PMCID: PMC6587829 DOI: 10.1002/hipo.23023] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2018] [Revised: 07/06/2018] [Accepted: 08/23/2018] [Indexed: 02/06/2023]
Abstract
Despite tremendous progress, the neural circuit dynamics underlying hippocampal mnemonic processing remain poorly understood. We propose a new model for hippocampal function-the simulation-selection model-based on recent experimental findings and neuroecological considerations. Under this model, the mammalian hippocampus evolved to simulate and evaluate arbitrary navigation sequences. Specifically, we suggest that CA3 simulates unexperienced navigation sequences in addition to remembering experienced ones, and CA1 selects from among these CA3-generated sequences, reinforcing those that are likely to maximize reward during offline idling states. High-value sequences reinforced in CA1 may allow flexible navigation toward a potential rewarding location during subsequent navigation. We argue that the simulation-selection functions of the hippocampus have evolved in mammals mostly because of the unique navigational needs of land mammals. Our model may account for why the mammalian hippocampus has evolved not only to remember, but also to imagine episodes, and how this might be implemented in its neural circuits.
Collapse
Affiliation(s)
- Min Whan Jung
- Center for Synaptic Brain Dysfunctions, Institute for Basic ScienceDaejeonSouth Korea
- Department of Biological SciencesKorea Advanced Institute of Science and TechnologyDaejeonSouth Korea
| | - Hyunjung Lee
- Department of AnatomyKyungpook National University School of MedicineDaeguSouth Korea
| | - Yeongseok Jeong
- Center for Synaptic Brain Dysfunctions, Institute for Basic ScienceDaejeonSouth Korea
- Department of Biological SciencesKorea Advanced Institute of Science and TechnologyDaejeonSouth Korea
| | - Jong Won Lee
- Center for Synaptic Brain Dysfunctions, Institute for Basic ScienceDaejeonSouth Korea
| | - Inah Lee
- Department of Brain and Cognitive SciencesSeoul National UniversitySeoulSouth Korea
| |
Collapse
|
19
|
Silveira MM, Tremblay M, Winstanley CA. Dissociable contributions of dorsal and ventral striatal regions on a rodent cost/benefit decision-making task requiring cognitive effort. Neuropharmacology 2018; 137:322-331. [DOI: 10.1016/j.neuropharm.2018.04.025] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2018] [Revised: 03/14/2018] [Accepted: 04/23/2018] [Indexed: 12/19/2022]
|
20
|
Marshall AT, Kirkpatrick K. Reinforcement learning models of risky choice and the promotion of risk-taking by losses disguised as wins in rats. JOURNAL OF EXPERIMENTAL PSYCHOLOGY-ANIMAL LEARNING AND COGNITION 2018; 43:262-279. [PMID: 29120214 DOI: 10.1037/xan0000141] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Risky decisions are inherently characterized by the potential to receive gains or incur losses, and these outcomes have distinct effects on subsequent decision-making. One important factor is that individuals engage in loss-chasing, in which the reception of a loss is followed by relatively increased risk-taking. Unfortunately, the mechanisms of loss-chasing are poorly understood, despite the potential importance for understanding pathological choice behavior. The goal of the present experiment was to illuminate the mechanisms governing individual differences in loss-chasing and risky-choice behaviors. Rats chose between a low-uncertainty outcome that always delivered a variable amount of reward and a high-uncertainty outcome that probabilistically delivered reward. Loss-processing and loss-chasing were assessed in the context of losses disguised as wins (LDWs), which are loss outcomes that are presented along with gain-related stimuli. LDWs have been suggested to interfere with adaptive decision-making in humans and thus potentially increase loss-making. Here, the rats presented with LDWs were riskier, in that they made more choices for the high-uncertainty outcome. A series of nonlinear models were fit to individual rats' data to elucidate the possible psychological mechanisms that best account for individual differences in high-uncertainty choices and loss-chasing behaviors. The models suggested that the rats presented with LDWs were more prone to showing a stay bias following high-uncertainty outcomes compared to rats not presented with LDWs. These results collectively suggest that LDWs acquire conditioned reinforcement properties that encourage continued risk-taking and increase loss-chasing following previous high-risk decisions. (PsycINFO Database Record
Collapse
|
21
|
Elber-Dorozko L, Loewenstein Y. Striatal action-value neurons reconsidered. eLife 2018; 7:e34248. [PMID: 29848442 PMCID: PMC6008056 DOI: 10.7554/elife.34248] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2017] [Accepted: 05/13/2018] [Indexed: 11/13/2022] Open
Abstract
It is generally believed that during economic decisions, striatal neurons represent the values associated with different actions. This hypothesis is based on studies, in which the activity of striatal neurons was measured while the subject was learning to prefer the more rewarding action. Here we show that these publications are subject to at least one of two critical confounds. First, we show that even weak temporal correlations in the neuronal data may result in an erroneous identification of action-value representations. Second, we show that experiments and analyses designed to dissociate action-value representation from the representation of other decision variables cannot do so. We suggest solutions to identifying action-value representation that are not subject to these confounds. Applying one solution to previously identified action-value neurons in the basal ganglia we fail to detect action-value representations. We conclude that the claim that striatal neurons encode action-values must await new experiments and analyses.
Collapse
Affiliation(s)
- Lotem Elber-Dorozko
- The Edmond & Lily Safra Center for Brain SciencesThe Hebrew University of JerusalemJerusalemIsrael
| | - Yonatan Loewenstein
- The Edmond & Lily Safra Center for Brain SciencesThe Hebrew University of JerusalemJerusalemIsrael
- Department of Neurobiology, The Alexander Silberman Institute of Life SciencesThe Hebrew University of JerusalemJerusalemIsrael
- The Federmann Center for the Study of RationalityThe Hebrew University of JerusalemJerusalemIsrael
| |
Collapse
|
22
|
Bergstrom HC, Lipkin AM, Lieberman AG, Pinard CR, Gunduz-Cinar O, Brockway ET, Taylor WW, Nonaka M, Bukalo O, Wills TA, Rubio FJ, Li X, Pickens CL, Winder DG, Holmes A. Dorsolateral Striatum Engagement Interferes with Early Discrimination Learning. Cell Rep 2018; 23:2264-2272. [PMID: 29791838 PMCID: PMC6015733 DOI: 10.1016/j.celrep.2018.04.081] [Citation(s) in RCA: 45] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2017] [Revised: 12/15/2017] [Accepted: 04/18/2018] [Indexed: 12/31/2022] Open
Abstract
In current models, learning the relationship between environmental stimuli and the outcomes of actions involves both stimulus-driven and goal-directed systems, mediated in part by the DLS and DMS, respectively. However, though these models emphasize the importance of the DLS in governing actions after extensive experience has accumulated, there is growing evidence of DLS engagement from the onset of training. Here, we used in vivo photosilencing to reveal that DLS recruitment interferes with early touchscreen discrimination learning. We also show that the direct output pathway of the DLS is preferentially recruited and causally involved in early learning and find that silencing the normal contribution of the DLS produces plasticity-related alterations in a PL-DMS circuit. These data provide further evidence suggesting that the DLS is recruited in the construction of stimulus-elicited actions that ultimately automate behavior and liberate cognitive resources for other demands, but with a cost to performance at the outset of learning.
Collapse
Affiliation(s)
- Hadley C Bergstrom
- Laboratory of Behavioral and Genomic Neuroscience, National Institute on Alcohol Abuse and Alcoholism, NIH, Bethesda, MD, USA.
| | - Anna M Lipkin
- Laboratory of Behavioral and Genomic Neuroscience, National Institute on Alcohol Abuse and Alcoholism, NIH, Bethesda, MD, USA
| | - Abby G Lieberman
- Laboratory of Behavioral and Genomic Neuroscience, National Institute on Alcohol Abuse and Alcoholism, NIH, Bethesda, MD, USA
| | - Courtney R Pinard
- Laboratory of Behavioral and Genomic Neuroscience, National Institute on Alcohol Abuse and Alcoholism, NIH, Bethesda, MD, USA
| | - Ozge Gunduz-Cinar
- Laboratory of Behavioral and Genomic Neuroscience, National Institute on Alcohol Abuse and Alcoholism, NIH, Bethesda, MD, USA
| | - Emma T Brockway
- Laboratory of Behavioral and Genomic Neuroscience, National Institute on Alcohol Abuse and Alcoholism, NIH, Bethesda, MD, USA
| | - William W Taylor
- Laboratory of Behavioral and Genomic Neuroscience, National Institute on Alcohol Abuse and Alcoholism, NIH, Bethesda, MD, USA
| | - Mio Nonaka
- Laboratory of Behavioral and Genomic Neuroscience, National Institute on Alcohol Abuse and Alcoholism, NIH, Bethesda, MD, USA
| | - Olena Bukalo
- Laboratory of Behavioral and Genomic Neuroscience, National Institute on Alcohol Abuse and Alcoholism, NIH, Bethesda, MD, USA
| | - Tiffany A Wills
- Department of Molecular Physiology and Biophysics, Vanderbilt University, Nashville, TN, USA
| | - F Javier Rubio
- Behavioral Neuroscience Research Branch, National Institute on Drug Abuse, NIH, Baltimore, MD, USA
| | - Xuan Li
- Behavioral Neuroscience Research Branch, National Institute on Drug Abuse, NIH, Baltimore, MD, USA
| | - Charles L Pickens
- Laboratory of Behavioral and Genomic Neuroscience, National Institute on Alcohol Abuse and Alcoholism, NIH, Bethesda, MD, USA
| | - Danny G Winder
- Department of Molecular Physiology and Biophysics, Vanderbilt University, Nashville, TN, USA; Vanderbilt Center for Addiction Research, Nashville, TN, USA
| | - Andrew Holmes
- Laboratory of Behavioral and Genomic Neuroscience, National Institute on Alcohol Abuse and Alcoholism, NIH, Bethesda, MD, USA
| |
Collapse
|
23
|
Wang JX, Kurth-Nelson Z, Kumaran D, Tirumala D, Soyer H, Leibo JZ, Hassabis D, Botvinick M. Prefrontal cortex as a meta-reinforcement learning system. Nat Neurosci 2018; 21:860-868. [DOI: 10.1038/s41593-018-0147-8] [Citation(s) in RCA: 258] [Impact Index Per Article: 43.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2017] [Accepted: 04/05/2018] [Indexed: 11/09/2022]
|
24
|
Stolyarova A. Solving the Credit Assignment Problem With the Prefrontal Cortex. Front Neurosci 2018; 12:182. [PMID: 29636659 PMCID: PMC5881225 DOI: 10.3389/fnins.2018.00182] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2017] [Accepted: 03/06/2018] [Indexed: 12/13/2022] Open
Abstract
In naturalistic multi-cue and multi-step learning tasks, where outcomes of behavior are delayed in time, discovering which choices are responsible for rewards can present a challenge, known as the credit assignment problem. In this review, I summarize recent work that highlighted a critical role for the prefrontal cortex (PFC) in assigning credit where it is due in tasks where only a few of the multitude of cues or choices are relevant to the final outcome of behavior. Collectively, these investigations have provided compelling support for specialized roles of the orbitofrontal (OFC), anterior cingulate (ACC), and dorsolateral prefrontal (dlPFC) cortices in contingent learning. However, recent work has similarly revealed shared contributions and emphasized rich and heterogeneous response properties of neurons in these brain regions. Such functional overlap is not surprising given the complexity of reciprocal projections spanning the PFC. In the concluding section, I overview the evidence suggesting that the OFC, ACC and dlPFC communicate extensively, sharing the information about presented options, executed decisions and received rewards, which enables them to assign credit for outcomes to choices on which they are contingent. This account suggests that lesion or inactivation/inhibition experiments targeting a localized PFC subregion will be insufficient to gain a fine-grained understanding of credit assignment during learning and instead poses refined questions for future research, shifting the focus from focal manipulations to experimental techniques targeting cortico-cortical projections.
Collapse
Affiliation(s)
- Alexandra Stolyarova
- Department of Psychology, University of California, Los Angeles, Los Angeles, CA, United States
| |
Collapse
|
25
|
Differential coding of reward and movement information in the dorsomedial striatal direct and indirect pathways. Nat Commun 2018; 9:404. [PMID: 29374173 PMCID: PMC5786099 DOI: 10.1038/s41467-017-02817-1] [Citation(s) in RCA: 45] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2017] [Accepted: 12/31/2017] [Indexed: 11/30/2022] Open
Abstract
The direct and indirect pathways of the basal ganglia have long been thought to mediate behavioral promotion and inhibition, respectively. However, this classic dichotomous model has been recently challenged. To better understand neural processes underlying reward-based learning and movement control, we recorded from direct (dSPNs) and indirect (iSPNs) pathway spiny projection neurons in the dorsomedial striatum of D1-Cre and D2-Cre mice performing a probabilistic Pavlovian conditioning task. dSPNs tend to increase activity while iSPNs decrease activity as a function of reward value, suggesting the striatum represents value in the relative activity levels of dSPNs versus iSPNs. Lick offset-related activity increase is largely dSPN selective, suggesting dSPN involvement in suppressing ongoing licking behavior. Rapid responses to negative outcome and previous reward-related responses are more frequent among iSPNs than dSPNs, suggesting stronger contributions of iSPNs to outcome-dependent behavioral adjustment. These findings provide new insights into striatal neural circuit operations. Classically the direct and indirect pathways of basal ganglia are understood to have opposing roles in movement and reward learning, but recent work suggests a more complicated view. Here the authors further study indirect and direct pathway neurons, in the context of a probabilistic reward task.
Collapse
|
26
|
Achiro JM, Shen J, Bottjer SW. Neural activity in cortico-basal ganglia circuits of juvenile songbirds encodes performance during goal-directed learning. eLife 2017; 6:e26973. [PMID: 29256393 PMCID: PMC5762157 DOI: 10.7554/elife.26973] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2017] [Accepted: 12/02/2017] [Indexed: 11/13/2022] Open
Abstract
Cortico-basal ganglia circuits are thought to mediate goal-directed learning by a process of outcome evaluation to gradually select appropriate motor actions. We investigated spiking activity in core and shell subregions of the cortical nucleus LMAN during development as juvenile zebra finches are actively engaged in evaluating feedback of self-generated behavior in relation to their memorized tutor song (the goal). Spiking patterns of single neurons in both core and shell subregions during singing correlated with acoustic similarity to tutor syllables, suggesting a process of outcome evaluation. Both core and shell neurons encoded tutor similarity via either increases or decreases in firing rate, although only shell neurons showed a significant association at the population level. Tutor similarity predicted firing rates most strongly during early stages of learning, and shell but not core neurons showed decreases in response variability across development, suggesting that the activity of shell neurons reflects the progression of learning.
Collapse
Affiliation(s)
- Jennifer M Achiro
- Neuroscience Graduate ProgramUniversity of Southern CaliforniaLos AngelesUnited States
| | - John Shen
- Neuroscience Graduate ProgramUniversity of Southern CaliforniaLos AngelesUnited States
| | - Sarah W Bottjer
- Section of NeurobiologyUniversity of Southern CaliforniaLos AngelesUnited States
| |
Collapse
|
27
|
Lee SH, Huh N, Lee JW, Ghim JW, Lee I, Jung MW. Neural Signals Related to Outcome Evaluation Are Stronger in CA1 than CA3. Front Neural Circuits 2017. [PMID: 28638322 PMCID: PMC5461339 DOI: 10.3389/fncir.2017.00040] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023] Open
Abstract
We have shown previously that CA1 conveys significant neural signals necessary to update value of the chosen target, namely chosen value and reward signals. To better understand hippocampal neural processes related to valuation, we compared chosen value- and reward-related neural activity between the CA3 and CA1 regions. Single units were recorded with tetrodes from the dorsal CA3 and CA1 regions of rats performing a dynamic foraging task, and chosen value- and reward-related neural activity was estimated using a reinforcement learning model and multiple regression analyses. Neural signals for chosen value and reward converged in both CA3 and CA1 when a trial outcome was revealed. However, these neural signals were stronger in CA1 than CA3. Consequently, neural signals for reward prediction error and updated chosen value were stronger in CA1 than CA3. Together with our previous finding that CA1 conveys stronger value signals than the subiculum, our results raise the possibility that CA1 might play a particularly important role among hippocampal subregions in evaluating experienced events.
Collapse
Affiliation(s)
- Sung-Hyun Lee
- Center for Synaptic Brain Dysfunctions, Institute for Basic ScienceDaejeon, South Korea.,Neuroscience Graduate Program, Ajou University School of MedicineSuwon, South Korea
| | - Namjung Huh
- Center for Synaptic Brain Dysfunctions, Institute for Basic ScienceDaejeon, South Korea
| | - Jong Won Lee
- Center for Synaptic Brain Dysfunctions, Institute for Basic ScienceDaejeon, South Korea
| | - Jeong-Wook Ghim
- Center for Synaptic Brain Dysfunctions, Institute for Basic ScienceDaejeon, South Korea
| | - Inah Lee
- Department of Brain and Cognitive Science, Seoul National UniversitySeoul, South Korea
| | - Min W Jung
- Center for Synaptic Brain Dysfunctions, Institute for Basic ScienceDaejeon, South Korea.,Neuroscience Graduate Program, Ajou University School of MedicineSuwon, South Korea.,Department of Biological Sciences, Korea Advanced Institute of Science and TechnologyDaejeon, South Korea
| |
Collapse
|
28
|
Gao P, de Munck JC, Limpens JHW, Vanderschuren LJMJ, Voorn P. A neuronal activation correlate in striatum and prefrontal cortex of prolonged cocaine intake. Brain Struct Funct 2017; 222:3453-3475. [PMID: 28393262 PMCID: PMC5676843 DOI: 10.1007/s00429-017-1412-4] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2016] [Accepted: 03/22/2017] [Indexed: 01/05/2023]
Abstract
Maladaptive changes in the involvement of striatal and frontal cortical regions in drug use are thought to underlie the progression to habitual drug use and loss of cognitive control over drug intake that occur with accumulating drug experience. The present experiments focus on changes in neuronal activity in these regions associated with short-term (10 days) and long-term (60 days) self-administration of cocaine. Quantitative in situ hybridization for the immediate early gene Mkp1 was combined with statistical parametric mapping to assess the distribution of neuronal activity. We hypothesized that neuronal activity in striatum would increase in its dorsal part and that activity in frontal cortex would decrease with prolonged cocaine self-administration experience. Expression of Mkp1 was profoundly increased after cocaine self-administration, and the magnitude of this effect was greater after short-term compared to long-term self-administration. Increased neuronal activity was seen in both dorsal and ventral sectors of the striatum after 10 days exposure to cocaine. However, enhanced activity was restricted to dorsomedial and dorsocentral striatum after 60 days cocaine self-administration. In virtually all medial prefrontal and most orbitofrontal areas, increased expression of Mkp1 was observed after 10 days of cocaine taking, whereas after 60 days, enhanced expression was restricted to caudal parts of medial prefrontal and caudomedial parts of orbitofrontal cortex. Our data reveal functional changes in cellular activity in striatum and frontal cortex with increasing cocaine self-administration experience. These changes might reflect the neural processes that underlie the descent from recreational drug taking to compulsive cocaine use.
Collapse
Affiliation(s)
- Ping Gao
- Department of Anatomy and Neurosciences, Neuroscience Campus Amsterdam, VU University Medical Center, Amsterdam, The Netherlands
| | - Jan C de Munck
- Department of Physics and Medical Technology, VU University Medical Center, Amsterdam, The Netherlands
| | - Jules H W Limpens
- Department of Translational Neuroscience, Brain Center Rudolf Magnus, University Medical Center Utrecht, Utrecht, The Netherlands
| | - Louk J M J Vanderschuren
- Division of Behavioural Neuroscience, Faculty of Veterinary Medicine, Department of Animals in Science and Society, Utrecht University, Utrecht, The Netherlands
| | - Pieter Voorn
- Department of Anatomy and Neurosciences, Neuroscience Campus Amsterdam, VU University Medical Center, Amsterdam, The Netherlands.
| |
Collapse
|
29
|
Bailey MR, Simpson EH, Balsam PD. Neural substrates underlying effort, time, and risk-based decision making in motivated behavior. Neurobiol Learn Mem 2016; 133:233-256. [PMID: 27427327 PMCID: PMC5007005 DOI: 10.1016/j.nlm.2016.07.015] [Citation(s) in RCA: 76] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2016] [Revised: 07/11/2016] [Accepted: 07/12/2016] [Indexed: 11/16/2022]
Abstract
All mobile organisms rely on adaptive motivated behavior to overcome the challenges of living in an environment in which essential resources may be limited. A variety of influences ranging from an organism's environment, experiential history, and physiological state all influence a cost-benefit analysis which allows motivation to energize behavior and direct it toward specific goals. Here we review the substantial amount of research aimed at discovering the interconnected neural circuits which allow organisms to carry-out the cost-benefit computations which allow them to behave in adaptive ways. We specifically focus on how the brain deals with different types of costs, including effort requirements, delays to reward and payoff riskiness. An examination of this broad literature highlights the importance of the extended neural circuits which enable organisms to make decisions about these different types of costs. This involves Cortical Structures, including the Anterior Cingulate Cortex (ACC), the Orbital Frontal Cortex (OFC), the Infralimbic Cortex (IL), and prelimbic Cortex (PL), as well as the Baso-Lateral Amygdala (BLA), the Nucleus Accumbens (NAcc), the Ventral Pallidal (VP), the Sub Thalamic Nucleus (STN) among others. Some regions are involved in multiple aspects of cost-benefit computations while the involvement of other regions is restricted to information relating to specific types of costs.
Collapse
Affiliation(s)
- Matthew R Bailey
- Department of Psychology, Columbia University, New York, NY, USA.
| | - Eleanor H Simpson
- Department of Psychiatry, Columbia University, New York, NY, USA; New York State Psychiatric Institute, New York, NY, USA
| | - Peter D Balsam
- Department of Psychiatry, Columbia University, New York, NY, USA; New York State Psychiatric Institute, New York, NY, USA; Barnard College, Columbia University, New York, NY, USA
| |
Collapse
|
30
|
Thompson JA, Costabile JD, Felsen G. Mesencephalic representations of recent experience influence decision making. eLife 2016; 5. [PMID: 27454033 PMCID: PMC4987136 DOI: 10.7554/elife.16572] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2016] [Accepted: 07/23/2016] [Indexed: 01/25/2023] Open
Abstract
Decisions are influenced by recent experience, but the neural basis for this phenomenon is not well understood. Here, we address this question in the context of action selection. We focused on activity in the pedunculopontine tegmental nucleus (PPTg), a mesencephalic region that provides input to several nuclei in the action selection network, in well-trained mice selecting actions based on sensory cues and recent trial history. We found that, at the time of action selection, the activity of many PPTg neurons reflected the action on the previous trial and its outcome, and the strength of this activity predicted the upcoming choice. Further, inactivating the PPTg predictably decreased the influence of recent experience on action selection. These findings suggest that PPTg input to downstream motor regions, where it can be integrated with other relevant information, provides a simple mechanism for incorporating recent experience into the computations underlying action selection. DOI:http://dx.doi.org/10.7554/eLife.16572.001 The decisions we make are influenced by recent experience, yet it is not known how this experience is represented in the brain. For decisions about when, where and how to move, researchers have hypothesized that recent experience might influence activity in a region of the brainstem – the central trunk of the brain – that is known to be involved in movement. When deciding when, where and how to move, several areas of the brain are involved in selecting the optimal action. Recent studies suggest that groups of neurons known as locomotor brainstem nuclei may also contribute to making decisions about movements. Thompson et al. investigated whether a brainstem locomotor area called the pedunculopontine tegmental (PPTg) nucleus in mice might contribute to decision making rather than just conveying the selected response. The mice were trained to recognize particular odors and move to either the left or right to collect a food reward. While the mice were selecting an action, the activity of neurons in the PPTg nucleus reflected the action they had chosen on a previous experience and the outcome of that choice (i.e. whether they received a reward). These representations of past experiences influenced the upcoming decision the mice were about to take. The findings of Thompson et al. suggest that the PPTg nucleus might play a critical role in the process of selecting the optimal action. Future work will examine what kinds of information about the environment or recent experience have the biggest effect on the activity of this region. DOI:http://dx.doi.org/10.7554/eLife.16572.002
Collapse
Affiliation(s)
- John A Thompson
- Department of Neurosurgery, University of Colorado School of Medicine, Aurora, United States.,Department of Physiology and Biophysics, University of Colorado School of Medicine, Aurora, United States
| | - Jamie D Costabile
- Department of Physiology and Biophysics, University of Colorado School of Medicine, Aurora, United States
| | - Gidon Felsen
- Department of Physiology and Biophysics, University of Colorado School of Medicine, Aurora, United States
| |
Collapse
|
31
|
Neuronal activity in dorsomedial and dorsolateral striatum under the requirement for temporal credit assignment. Sci Rep 2016; 6:27056. [PMID: 27245401 PMCID: PMC4887996 DOI: 10.1038/srep27056] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2016] [Accepted: 05/13/2016] [Indexed: 11/17/2022] Open
Abstract
To investigate neural processes underlying temporal credit assignment in the striatum, we recorded neuronal activity in the dorsomedial and dorsolateral striatum (DMS and DLS, respectively) of rats performing a dynamic foraging task in which a choice has to be remembered until its outcome is revealed for correct credit assignment. Choice signals appeared sequentially, initially in the DMS and then in the DLS, and they were combined with action value and reward signals in the DLS when choice outcome was revealed. Unlike in conventional dynamic foraging tasks, neural signals for chosen value were elevated in neither brain structure. These results suggest that dynamics of striatal neural signals related to evaluating choice outcome might differ drastically depending on the requirement for temporal credit assignment. In a behavioral context requiring temporal credit assignment, the DLS, but not the DMS, might be in charge of updating the value of chosen action by integrating choice, action value, and reward signals together.
Collapse
|
32
|
Bailey MR, Jensen G, Taylor K, Mezias C, Williamson C, Silver R, Simpson EH, Balsam PD. A novel strategy for dissecting goal-directed action and arousal components of motivated behavior with a progressive hold-down task. Behav Neurosci 2016; 129:269-80. [PMID: 26030428 DOI: 10.1037/bne0000060] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Motivation serves 2 important functions: It guides actions to be goal-directed, and it provides the energy and vigor required to perform the work necessary to meet those goals. Dissociating these 2 processes with existing behavioral assays has been a challenge. In this article, we report a novel experimental strategy to distinguish the 2 processes in mice. First, we characterize a novel motivation assay in which animals must hold down a lever for progressively longer intervals to earn each subsequent reward; we call this the progressive hold-down (PHD) task. We find that performance on the PHD task is sensitive to both food deprivation level and reward value. Next, we use a dose of methamphetamine (METH) 1.0 mg/kg, to evaluate behavior in both the progressive ratio (PR) and PHD tasks. Treatment with METH leads to more persistent lever pressing for food rewards in the PR. In the PHD task, we found that METH increased arousal, which leads to numerous bouts of hyperactive responding but neither increases nor impairs goal-directed action. The results demonstrate that these tools enable a more precise understanding of the underlying processes being altered in manipulations that alter motivated behavior.
Collapse
Affiliation(s)
| | - Greg Jensen
- Department of Psychology, Columbia University
| | | | | | | | - Rae Silver
- Department of Psychology, Barnard College
| | | | | |
Collapse
|
33
|
Neurophysiology of rule switching in the corticostriatal circuit. Neuroscience 2016; 345:64-76. [PMID: 26851774 DOI: 10.1016/j.neuroscience.2016.01.062] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2015] [Revised: 01/12/2016] [Accepted: 01/28/2016] [Indexed: 01/06/2023]
Abstract
The ability to adjust behavioral responses to cues in a changing environment is crucial for survival. Activity in the medial Prefrontal Cortex (mPFC) is thought to both represent rules to guide behavior as well as detect and resolve conflicts between rules in changing contingencies. While lesion and pharmacological studies have supported a crucial role for mPFC in this type of set-shifting, an understanding of how mPFC represents current rules or detects and resolves conflict between different rules is still unclear. Meanwhile, medial dorsal striatum (mDS) receives major projections from mPFC and neural activity of mDS is closely linked to action selection, making the mDS a potential major player for enacting rule-guided action policies. However, exactly what is signaled by mPFC and how this impacts neural signals in mDS is not well known. In this review, we will summarize what is known about neural signals of rules and set shifting in both prefrontal cortex and dorsal striatum, as well as provide questions and directions for future experiments.
Collapse
|
34
|
Devan BD, Chaban N, Piscopello J, Deibel SH, McDonald RJ. Cognitive and Stimulus–Response Habit Functions of the Neo-(Dorsal) Striatum. INNOVATIONS IN COGNITIVE NEUROSCIENCE 2016. [DOI: 10.1007/978-3-319-42743-0_18] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
|
35
|
Kucinski A, Sarter M. Modeling Parkinson's disease falls associated with brainstem cholinergic systems decline. Behav Neurosci 2015; 129:96-104. [PMID: 25798629 DOI: 10.1037/bne0000048] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
Abstract
In addition to the primary disease-defining symptoms, approximately half of patients with Parkinson's disease (PD) suffer from postural instability, impairments in gait control and a propensity for falls. Consistent with evidence from patients, we previously demonstrated that combined striatal dopamine (DA) and basal forebrain (BF) cholinergic cell loss causes falls in rats traversing dynamic surfaces. Because evidence suggests that degeneration of brainstem cholinergic neurons arising from the pedunculopontine nucleus (PPN) also contributes to impaired gait and falls, here we assessed the effects of selective cholinergic PPN lesions in combination with striatal DA loss or BF cholinergic cells loss as well as losses in all 3 regions. Results indicate that all combination losses that included the BF cholinergic system slowed traversal and increased slips and falls. However, the performance of rats with losses in all 3 regions (PPN, BF, and DA) was not more severely impaired than following combined BF cholinergic and striatal DA lesions. These results confirm the hypothesis that BF cholinergic-striatal disruption of attentional-motor interactions is a primary source of falls. Additional losses of PPN cholinergic neurons may worsen posture and gait control in situations not captured by the current testing conditions.
Collapse
|
36
|
Yang JH, Liao RM. Dissociable contribution of nucleus accumbens and dorsolateral striatum to the acquisition of risk choice behavior in the rat. Neurobiol Learn Mem 2015; 126:67-77. [PMID: 26551356 DOI: 10.1016/j.nlm.2015.11.002] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2015] [Revised: 09/14/2015] [Accepted: 11/03/2015] [Indexed: 12/18/2022]
Abstract
While a growing body of research has suggested that the mesocorticolimbic dopamine systems play a key role in decision making under risk, how the nucleus accumbens (NAC) is involved in the acquisition of risk choice behavior remains unclear. This study used a T-maze task to assess risk-based decision making in which the rat was required to assess the risk by choosing to enter either a small and certain reward arm or a large but uncertain reward arm of the maze. The latter option, when chosen, resulted in provision of 2, 4, or 8 sweeten pellets with a probability (p) of 0.5, 0.25, or 0.125, respectively. Thus the latter arm provided three different conditions of reward ratio, compared to the choice of former arm, which always provided 1 pellet with p=1. This risk choice task was then run with the expected value being equality between the binary choice options. The experimental rats first received an excitoneurotoxic lesion affecting either the NAC or the dorsolateral striatum (DLS) and this was followed by post-lesion behavioral examination. The sham lesion control rats acquired a stable risk choice with regard to each reward ratio over a 10-day test. The pattern of choice behavior appeared in risk-seeking when p=0.5 to obtain 2 pellets, and was risk-averse when larger reward resulted in lower p. The NAC lesion significantly disrupted the acquisition of the aforementioned risk choice behavior and apparently shifted the choice into a risk-averse style for all three reward ratios. No such effect was observed in the rats with DLS lesions. Neither the gross motor action nor the discrimination of different reward magnitudes was impaired by the lesions affecting either the NAC or DLS as assessed by an additional experiment. These findings suggest that firstly there is heterogeneity between NAC and DLS with respect to risk-based decision making, and that secondly the NAC is involved and critical to the acquisition of behavioral choice under risk, specially when the expected value of the reward under the two choice options is equal.
Collapse
Affiliation(s)
- Jen-Hau Yang
- Department of Psychology, National Cheng-Chi University, Taipei, Taiwan
| | - Ruey-Ming Liao
- Department of Psychology, National Cheng-Chi University, Taipei, Taiwan; Institute of Neuroscience, National Cheng-Chi University, Taipei, Taiwan; Research Center for Mind, Brain and Learning, National Cheng-Chi University, Taipei, Taiwan.
| |
Collapse
|
37
|
Distinct neural representation in the dorsolateral, dorsomedial, and ventral parts of the striatum during fixed- and free-choice tasks. J Neurosci 2015; 35:3499-514. [PMID: 25716849 DOI: 10.1523/jneurosci.1962-14.2015] [Citation(s) in RCA: 62] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
The striatum is a major input site of the basal ganglia, which play an essential role in decision making. Previous studies have suggested that subareas of the striatum have distinct roles: the dorsolateral striatum (DLS) functions in habitual action, the dorsomedial striatum (DMS) in goal-directed actions, and the ventral striatum (VS) in motivation. To elucidate distinctive functions of subregions of the striatum in decision making, we systematically investigated information represented by phasically active neurons in DLS, DMS, and VS. Rats performed two types of choice tasks: fixed- and free-choice tasks. In both tasks, rats were required to perform nose poking to either the left or right hole after cue-tone presentation. A food pellet was delivered probabilistically depending on the presented cue and the selected action. The reward probability was fixed in fixed-choice task and varied in a block-wise manner in free-choice task. We found the following: (1) when rats began the tasks, a majority of VS neurons increased their firing rates and information regarding task type and state value was most strongly represented in VS; (2) during action selection, information of action and action values was most strongly represented in DMS; (3) action-command information (action representation before action selection) was stronger in the fixed-choice task than in the free-choice task in both DLS and DMS; and (4) action-command information was strongest in DLS, particularly when the same choice was repeated. We propose a hypothesis of hierarchical reinforcement learning in the basal ganglia to coherently explain these results.
Collapse
|
38
|
Characterizing the associative content of brain structures involved in habitual and goal-directed actions in humans: a multivariate FMRI study. J Neurosci 2015; 35:3764-71. [PMID: 25740507 DOI: 10.1523/jneurosci.4677-14.2015] [Citation(s) in RCA: 48] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
While there is accumulating evidence for the existence of distinct neural systems supporting goal-directed and habitual action selection in the mammalian brain, much less is known about the nature of the information being processed in these different brain regions. Associative learning theory predicts that brain systems involved in habitual control, such as the dorsolateral striatum, should contain stimulus and response information only, but not outcome information, while regions involved in goal-directed action, such as ventromedial and dorsolateral prefrontal cortex and dorsomedial striatum, should be involved in processing information about outcomes as well as stimuli and responses. To test this prediction, human participants underwent fMRI while engaging in a binary choice task designed to enable the separate identification of these different representations with a multivariate classification analysis approach. Consistent with our predictions, the dorsolateral striatum contained information about responses but not outcomes at the time of an initial stimulus, while the regions implicated in goal-directed action selection contained information about both responses and outcomes. These findings suggest that differential contributions of these regions to habitual and goal-directed behavioral control may depend in part on basic differences in the type of information that these regions have access to at the time of decision making.
Collapse
|
39
|
Kucinski A, Albin RL, Lustig C, Sarter M. Modeling falls in Parkinson's disease: Slow gait, freezing episodes and falls in rats with extensive striatal dopamine loss. Behav Brain Res 2015; 282:155-64. [PMID: 25595423 DOI: 10.1016/j.bbr.2015.01.012] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2014] [Revised: 12/09/2014] [Accepted: 01/06/2015] [Indexed: 11/17/2022]
Abstract
Falls in patients with Parkinson's disease (PD) are a major and levodopa-unresponsive source of morbidity. We previously described an animal model of falls resulting from impairments in attentional-motor interactions. Reproducing the multisystem dopaminergic-cholinergic cell loss in patients with a history for falls, partial loss of striatal dopamine innervation interacted with loss of forebrain cholinergic neurons to generate falls that was hypothesized to reflect impairments in the attentional control of gait and balance and the sequencing of complex movements [1]. As clinical evidence also indicates that basal ganglia dopamine (DA) loss per se is associated with severe discoordination and thus a greater risk for falls, here we demonstrate that relatively extensive striatal DA loss, in contrast to the lack of effects of smaller, dorsal striatal DA losses and sham lesions, increased falls and slips and caused slowing while traversing dynamic surfaces. Falls in large DA rats were associated specifically with spontaneous or slip-triggered stoppages of forward movement. Collectively, the evidence suggests that low motivation or vigor for movement in general, and for initiating corrective movements in particular, are major sources for falls in rats with large DA losses. Falls are a result of complex cognitive-motor interactions, and rats with large DA losses model the impact of a propensity for freezing of gait when traversing dynamic surfaces.
Collapse
Affiliation(s)
| | - Roger L Albin
- Neurology Service and GRECC, VAAAHS; Department of Neurology
| | - Cindy Lustig
- Department of Psychology; Neuroscience Program, University of Michigan, Ann Arbor, MI, USA
| | - Martin Sarter
- Department of Psychology; Neuroscience Program, University of Michigan, Ann Arbor, MI, USA.
| |
Collapse
|
40
|
Naudé J, Dongelmans M, Faure P. Nicotinic alteration of decision-making. Neuropharmacology 2014; 96:244-54. [PMID: 25498234 DOI: 10.1016/j.neuropharm.2014.11.021] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2014] [Revised: 11/09/2014] [Accepted: 11/26/2014] [Indexed: 10/24/2022]
Abstract
Addiction to nicotine is characterized by impulses, urges and lack of self-control towards cigarettes. A key element in the process of addiction is the development of habits oriented towards nicotine consumption that surpass flexible systems as a consequence of a gradual adaptation to chronic drug exposure. However, the long-term effects of nicotine on brain circuits also induce wide changes in decision-making processes, affecting behaviors unrelated to cigarettes. This review aims at providing an update on the implications of nicotine on general decision-making processes, with an emphasis on impulsivity and risk-taking. As impulsivity is a rather ambiguous behavioral trait, we build on economic and normative theories to better characterize these nicotine-induced alterations in decision-making. Nonetheless, experimental data are sparse and often contradictory. We will discuss how the latest findings on the neurobiological basis of choice behavior may help disentangling these issues. We focus on the role of nicotine acetylcholine receptors and their different subunits, and on the spatio-temporal dynamics (i.e. diversity of the neural circuits, short- and long-term effects) of both endogenous acetylcholine and nicotine action. Finally, we try to link these neurobiological results with neuro-computational models of attention, valuation and action, and of the role of acetylcholine in these decision processes. This article is part of the Special Issue entitled 'The Nicotinic Acetylcholine Receptor: From Molecular Biology to Cognition'.
Collapse
Affiliation(s)
- Jérémie Naudé
- Université Pierre et Marie Curie, CNRS UMR 8246, INSERM U 1130, UPMC UM CR18, 75005 Paris, France
| | - Malou Dongelmans
- Université Pierre et Marie Curie, CNRS UMR 8246, INSERM U 1130, UPMC UM CR18, 75005 Paris, France
| | - Philippe Faure
- Université Pierre et Marie Curie, CNRS UMR 8246, INSERM U 1130, UPMC UM CR18, 75005 Paris, France.
| |
Collapse
|
41
|
Yanike M, Ferrera VP. Interpretive monitoring in the caudate nucleus. eLife 2014; 3. [PMID: 25415238 PMCID: PMC4238052 DOI: 10.7554/elife.03727] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2014] [Accepted: 10/23/2014] [Indexed: 12/05/2022] Open
Abstract
In a dynamic environment an organism has to constantly adjust ongoing behavior to adapt to a given context. This process requires continuous monitoring of ongoing behavior to provide its meaningful interpretation. The caudate nucleus is known to have a role in behavioral monitoring, but the nature of these signals during dynamic behavior is still unclear. We recorded neuronal activity in the caudate nucleus in monkeys during categorization behavior that changed rapidly across contexts. We found that neuronal activity maintained representation of the identity and context of a recently categorized stimulus, as well as interpreted the behavioral meaningfulness of the maintained trace. The accuracy of this cognitive monitoring signal was highest for behavior for which subjects were prone to make errors. Thus, the caudate nucleus provides interpretive monitoring of ongoing behavior, which is necessary for contextually specific decisions to adapt to rapidly changing conditions. DOI:http://dx.doi.org/10.7554/eLife.03727.001 The ability to adapt behavior in a changing environment is a hallmark of intelligent systems. From adjusting our driving speed to match road conditions to responding to a last-minute change of plans, mental flexibility underpins much of our day-to-day functioning. To perform optimally, an animal must continuously monitor its own behavior and adjust it according to circumstances. A region of the brain called the caudate nucleus is thought to contribute to this process by keeping track of the relation between an action and its outcomes, but it is not clear how it monitors cognitive aspects of ongoing behavior. Yanike and Ferrera have clarified this process by recording electrical activity from the caudate nucleus in two monkeys as they categorized visual stimuli. The monkeys viewed a moving stimulus and classified it as ‘fast’ or ‘slow’ relative to a reference speed that varied from trial to trial. The monkeys were trained to use two different references speeds and were told which reference speed to use at the start of each trial. They used an eye movement to indicate their decision. Most neurons within the caudate nucleus responded after the monkey had made a decision, suggesting that these neurons might be involved in evaluating the decision that had just been made. The response of the neurons depended on the stimulus speed, and also on the category (fast or slow) in which the stimulus belonged. This observation indicates that the caudate nucleus tracked the context (reference speed) as well as the stimulus speed. Yanike and Ferrera also showed that the response of the entire population of caudate neurons could be decoded to reveal both the speed of the stimulus and whether the monkey had categorized it as fast or slow. This shows that after a decision has been made, neurons continue to signal both the stimulus and the context in which that stimulus was presented. Such ‘post-decision’ monitoring is important for anticipating the outcome of the decision. Overall the results suggest that the caudate nucleus helps animals to adapt their behavior to rapidly changing circumstances by supporting decision-making that takes context into account. DOI:http://dx.doi.org/10.7554/eLife.03727.002
Collapse
Affiliation(s)
- Marianna Yanike
- Department of Neuroscience, Columbia University, New York, United States
| | - Vincent P Ferrera
- Department of Neuroscience, Columbia University, New York, United States
| |
Collapse
|
42
|
Braun AA, Amos-Kroohs RM, Gutierrez A, Lundgren KH, Seroogy KB, Skelton MR, Vorhees CV, Williams MT. Dopamine depletion in either the dorsomedial or dorsolateral striatum impairs egocentric Cincinnati water maze performance while sparing allocentric Morris water maze learning. Neurobiol Learn Mem 2014; 118:55-63. [PMID: 25451306 DOI: 10.1016/j.nlm.2014.10.009] [Citation(s) in RCA: 41] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2013] [Revised: 10/24/2014] [Accepted: 10/29/2014] [Indexed: 10/24/2022]
Abstract
Both egocentric route-based learning and spatial learning, as assessed by the Cincinnati water maze (CWM) and Morris water maze (MWM), respectively, are impaired following an 80% dopamine (DA) loss in the neostriatum after 6-hydroxydopamine (6-OHDA) administration in rats. The dorsolateral striatum (DLS) and the dorsomedial striatum (DMS) are implicated in different navigational learning types, namely the DLS is implicated in egocentric learning while the DMS is implicated in spatial learning. This experiment tested whether selective DA loss through 6-OHDA lesions in the DMS or DLS would impair one or both types of navigation. Both DLS and DMS DA loss significantly impaired route-based CWM learning, without affecting spatial or cued MWM performance. DLS 6-OHDA lesions produced a 75% DA loss in this region, with no changes in other monoamine levels in the DLS or DMS. DMS 6-OHDA lesions produced a 62% DA loss in this region, without affecting other monoamine levels in the DMS or DLS. The results indicate a role for DA in DLS and DMS regions in route-based egocentric but not spatial learning and memory. Spatial learning deficits may require more pervasive monoamine reductions within each region before deficits are exhibited. This is the first study to implicate DLS and DMS DA in route-based egocentric navigation.
Collapse
Affiliation(s)
- Amanda A Braun
- Division of Neurology, Department of Pediatrics, Cincinnati Children's Research Foundation, University of Cincinnati College of Medicine, Cincinnati, OH 45229, United States
| | - Robyn M Amos-Kroohs
- Division of Neurology, Department of Pediatrics, Cincinnati Children's Research Foundation, University of Cincinnati College of Medicine, Cincinnati, OH 45229, United States
| | - Arnold Gutierrez
- Division of Neurology, Department of Pediatrics, Cincinnati Children's Research Foundation, University of Cincinnati College of Medicine, Cincinnati, OH 45229, United States
| | - Kerstin H Lundgren
- Department of Neurology, University of Cincinnati College of Medicine, Cincinnati, OH 45267, United States
| | - Kim B Seroogy
- Department of Neurology, University of Cincinnati College of Medicine, Cincinnati, OH 45267, United States
| | - Matthew R Skelton
- Division of Neurology, Department of Pediatrics, Cincinnati Children's Research Foundation, University of Cincinnati College of Medicine, Cincinnati, OH 45229, United States
| | - Charles V Vorhees
- Division of Neurology, Department of Pediatrics, Cincinnati Children's Research Foundation, University of Cincinnati College of Medicine, Cincinnati, OH 45229, United States.
| | - Michael T Williams
- Division of Neurology, Department of Pediatrics, Cincinnati Children's Research Foundation, University of Cincinnati College of Medicine, Cincinnati, OH 45229, United States.
| |
Collapse
|
43
|
Oscillatory activity in the medial prefrontal cortex and nucleus accumbens correlates with impulsivity and reward outcome. PLoS One 2014; 9:e111300. [PMID: 25333512 PMCID: PMC4205097 DOI: 10.1371/journal.pone.0111300] [Citation(s) in RCA: 54] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2014] [Accepted: 09/15/2014] [Indexed: 01/22/2023] Open
Abstract
Actions expressed prematurely without regard for their consequences are considered impulsive. Such behaviour is governed by a network of brain regions including the prefrontal cortex (PFC) and nucleus accumbens (NAcb) and is prevalent in disorders including attention deficit hyperactivity disorder (ADHD) and drug addiction. However, little is known of the relationship between neural activity in these regions and specific forms of impulsive behaviour. In the present study we investigated local field potential (LFP) oscillations in distinct sub-regions of the PFC and NAcb on a 5-choice serial reaction time task (5-CSRTT), which measures sustained, spatially-divided visual attention and action restraint. The main findings show that power in gamma frequency (50-60 Hz) LFP oscillations transiently increases in the PFC and NAcb during both the anticipation of a cue signalling the spatial location of a nose-poke response and again following correct responses. Gamma oscillations were coupled to low-frequency delta oscillations in both regions; this coupling strengthened specifically when an error response was made. Theta (7-9 Hz) LFP power in the PFC and NAcb increased during the waiting period and was also related to response outcome. Additionally, both gamma and theta power were significantly affected by upcoming premature responses as rats waited for the visual cue to respond. In a subgroup of rats showing persistently high levels of impulsivity we found that impulsivity was associated with increased error signals following a nose-poke response, as well as reduced signals of previous trial outcome during the waiting period. Collectively, these in-vivo neurophysiological findings further implicate the PFC and NAcb in anticipatory impulsive responses and provide evidence that abnormalities in the encoding of rewarding outcomes may underlie trait-like impulsive behaviour.
Collapse
|
44
|
Seo H, Cai X, Donahue CH, Lee D. Neural correlates of strategic reasoning during competitive games. Science 2014; 346:340-3. [PMID: 25236468 DOI: 10.1126/science.1256254] [Citation(s) in RCA: 65] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
Although human and animal behaviors are largely shaped by reinforcement and punishment, choices in social settings are also influenced by information about the knowledge and experience of other decision-makers. During competitive games, monkeys increased their payoffs by systematically deviating from a simple heuristic learning algorithm and thereby countering the predictable exploitation by their computer opponent. Neurons in the dorsomedial prefrontal cortex (dmPFC) signaled the animal's recent choice and reward history that reflected the computer's exploitative strategy. The strength of switching signals in the dmPFC also correlated with the animal's tendency to deviate from the heuristic learning algorithm. Therefore, the dmPFC might provide control signals for overriding simple heuristic learning algorithms based on the inferred strategies of the opponent.
Collapse
Affiliation(s)
- Hyojung Seo
- Department of Neurobiology, Yale University School of Medicine, New Haven, CT 06510, USA.
| | - Xinying Cai
- Department of Neurobiology, Yale University School of Medicine, New Haven, CT 06510, USA
| | - Christopher H Donahue
- Department of Neurobiology, Yale University School of Medicine, New Haven, CT 06510, USA
| | - Daeyeol Lee
- Department of Neurobiology, Yale University School of Medicine, New Haven, CT 06510, USA. Kavli Institute for Neuroscience, Yale University School of Medicine, New Haven, CT 06510, USA. Department of Psychology, Yale University, New Haven, CT 06510, USA.
| |
Collapse
|
45
|
Where attention falls: Increased risk of falls from the converging impact of cortical cholinergic and midbrain dopamine loss on striatal function. Exp Neurol 2014; 257:120-9. [PMID: 24805070 DOI: 10.1016/j.expneurol.2014.04.032] [Citation(s) in RCA: 81] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2014] [Revised: 04/25/2014] [Accepted: 04/29/2014] [Indexed: 12/13/2022]
Abstract
Falls are a major source of hospitalization, long-term institutionalization, and death in older adults and patients with Parkinson's disease (PD). Limited attentional resources are a major risk factor for falls. In this review, we specify cognitive-behavioral mechanisms that produce falls and map these mechanisms onto a model of multi-system degeneration. Results from PET studies in PD fallers and findings from a recently developed animal model support the hypothesis that falls result from interactions between loss of basal forebrain cholinergic projections to the cortex and striatal dopamine loss. Striatal dopamine loss produces inefficient, low-vigor gait, posture control, and movement. Cortical cholinergic deafferentation impairs a wide range of attentional processes, including monitoring of gait, posture and complex movements. Cholinergic cell loss reveals the full impact of striatal dopamine loss on motor performance, reflecting loss of compensatory attentional supervision of movement. Dysregulation of dorsomedial striatal circuitry is an essential, albeit not exclusive, mediator of falls in this dual-system model. Because cholinergic neuromodulatory activity influences cortical circuitry primarily via stimulation of α4β2* nicotinic acetylcholine receptors, and because agonists at these receptors are known to benefit attentional processes in animals and humans, treating PD fallers with such agonists, as an adjunct to dopaminergic treatment, is predicted to reduce falls. Falls are an informative behavioral endpoint to study attentional-motor integration by striatal circuitry.
Collapse
|
46
|
Abstract
The anterior caudate nucleus is essential for goal-directed behavior because it links outcome information to actions. It is well known that caudate neurons provide a variety of reward-related and action signals. However, it is still unclear how the two signals are integrated. We investigated whether and how outcome risk modulates spatial representation. We recorded neural activity in the anterior caudate nucleus while monkeys made saccades to multiple spatial targets, each associated with either fixed (safe) or variable (risky) amount of reward. We report that individual neurons combined the outcome reward signal with spatial information about the direction of saccades. These signals could be reliably read out from the populations of neurons. Moreover, the prospect of a risky outcome improved the quality of spatial information. These results provide direct evidence that global spatial representation in the caudate is modulated by outcome, which can be important for flexible control of behavior, particularly during learning and habit formation, when outcomes vary.
Collapse
|
47
|
Skelin I, Hakstol R, VanOyen J, Mudiayi D, Molina LA, Holec V, Hong NS, Euston DR, McDonald RJ, Gruber AJ. Lesions of dorsal striatum eliminate lose-switch responding but not mixed-response strategies in rats. Eur J Neurosci 2014; 39:1655-63. [DOI: 10.1111/ejn.12518] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2013] [Revised: 12/07/2013] [Accepted: 01/18/2014] [Indexed: 11/30/2022]
Affiliation(s)
- Ivan Skelin
- Department of Neuroscience; Canadian Centre for Behavioural Neuroscience; University of Lethbridge; 4401 University Dr. W. T1K 3M4 Lethbridge AB Canada
| | - Rhys Hakstol
- Department of Neuroscience; Canadian Centre for Behavioural Neuroscience; University of Lethbridge; 4401 University Dr. W. T1K 3M4 Lethbridge AB Canada
| | - Jenn VanOyen
- Department of Neuroscience; Canadian Centre for Behavioural Neuroscience; University of Lethbridge; 4401 University Dr. W. T1K 3M4 Lethbridge AB Canada
| | - Dominic Mudiayi
- Department of Neuroscience; Canadian Centre for Behavioural Neuroscience; University of Lethbridge; 4401 University Dr. W. T1K 3M4 Lethbridge AB Canada
| | - Leonardo A. Molina
- Department of Neuroscience; Canadian Centre for Behavioural Neuroscience; University of Lethbridge; 4401 University Dr. W. T1K 3M4 Lethbridge AB Canada
| | - Victoria Holec
- Department of Neuroscience; Canadian Centre for Behavioural Neuroscience; University of Lethbridge; 4401 University Dr. W. T1K 3M4 Lethbridge AB Canada
| | - Nancy S. Hong
- Department of Neuroscience; Canadian Centre for Behavioural Neuroscience; University of Lethbridge; 4401 University Dr. W. T1K 3M4 Lethbridge AB Canada
| | - David R. Euston
- Department of Neuroscience; Canadian Centre for Behavioural Neuroscience; University of Lethbridge; 4401 University Dr. W. T1K 3M4 Lethbridge AB Canada
| | - Robert J. McDonald
- Department of Neuroscience; Canadian Centre for Behavioural Neuroscience; University of Lethbridge; 4401 University Dr. W. T1K 3M4 Lethbridge AB Canada
| | - Aaron J. Gruber
- Department of Neuroscience; Canadian Centre for Behavioural Neuroscience; University of Lethbridge; 4401 University Dr. W. T1K 3M4 Lethbridge AB Canada
| |
Collapse
|
48
|
Gersch TM, Foley NC, Eisenberg I, Gottlieb J. Neural correlates of temporal credit assignment in the parietal lobe. PLoS One 2014; 9:e88725. [PMID: 24523935 PMCID: PMC3921206 DOI: 10.1371/journal.pone.0088725] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2013] [Accepted: 01/09/2014] [Indexed: 11/27/2022] Open
Abstract
Empirical studies of decision making have typically assumed that value learning is governed by time, such that a reward prediction error arising at a specific time triggers temporally-discounted learning for all preceding actions. However, in natural behavior, goals must be acquired through multiple actions, and each action can have different significance for the final outcome. As is recognized in computational research, carrying out multi-step actions requires the use of credit assignment mechanisms that focus learning on specific steps, but little is known about the neural correlates of these mechanisms. To investigate this question we recorded neurons in the monkey lateral intraparietal area (LIP) during a serial decision task where two consecutive eye movement decisions led to a final reward. The underlying decision trees were structured such that the two decisions had different relationships with the final reward, and the optimal strategy was to learn based on the final reward at one of the steps (the “F” step) but ignore changes in this reward at the remaining step (the “I” step). In two distinct contexts, the F step was either the first or the second in the sequence, controlling for effects of temporal discounting. We show that LIP neurons had the strongest value learning and strongest post-decision responses during the transition after the F step regardless of the serial position of this step. Thus, the neurons encode correlates of temporal credit assignment mechanisms that allocate learning to specific steps independently of temporal discounting.
Collapse
Affiliation(s)
- Timothy M. Gersch
- Department of Neuroscience, Columbia University, New York, New York, United States of America
| | - Nicholas C. Foley
- Department of Neuroscience, Columbia University, New York, New York, United States of America
| | - Ian Eisenberg
- Department of Neuroscience, Columbia University, New York, New York, United States of America
| | - Jacqueline Gottlieb
- Department of Neuroscience, Columbia University, New York, New York, United States of America
- The Kavli Institute for Brain Science Columbia University, New York, New York, United States of America
- * E-mail:
| |
Collapse
|
49
|
Modeling fall propensity in Parkinson's disease: deficits in the attentional control of complex movements in rats with cortical-cholinergic and striatal-dopaminergic deafferentation. J Neurosci 2013; 33:16522-39. [PMID: 24133257 DOI: 10.1523/jneurosci.2545-13.2013] [Citation(s) in RCA: 55] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
Cognitive symptoms, complex movement deficits, and increased propensity for falls are interrelated and levodopa-unresponsive symptoms in patients with Parkinson's disease (PD). We developed a test system for the assessment of fall propensity in rats and tested the hypothesis that interactions between loss of cortical cholinergic and striatal dopaminergic afferents increase fall propensity. Rats were trained to traverse stationary and rotating rods, placed horizontally or at inclines, and while exposed to distractors. Rats also performed an operant Sustained Attention Task (SAT). Partial cortical cholinergic and/or caudate dopaminergic deafferentation were produced by bilateral infusions of 192 IgG-saporin (SAP) into the basal forebrain and/or 6-hydroxydopamine (6-OHDA) into the caudate nucleus, respectively, modeling the lesions seen in early PD. Rats with dual cholinergic-dopaminergic lesions (DL) fell more frequently than SAP or 6-OHDA rats. Falls in DL rats were associated with incomplete rebalancing after slips and low traversal speed. Ladder rung walking and pasta handling performance did not indicate sensorimotor deficits. SAT performance was impaired in DL and SAP rats; however, SAT performance and falls were correlated only in DL rats. Furthermore, in DL rats, but not in rats with only dopaminergic lesions, the placement and size of dopaminergic lesion correlated significantly with fall rates. The results support the hypothesis that after dual cholinergic-dopaminergic lesions, attentional resources can no longer be recruited to compensate for diminished striatal control of complex movement, thereby "unmasking" impaired striatal control of complex movements and yielding falls.
Collapse
|
50
|
Abstract
In stable environments, decision makers can exploit their previously learned strategies for optimal outcomes, while exploration might lead to better options in unstable environments. Here, to investigate the cortical contributions to exploratory behavior, we analyzed single-neuron activity recorded from four different cortical areas of monkeys performing a matching-pennies task and a visual search task, which encouraged and discouraged exploration, respectively. We found that neurons in multiple regions in the frontal and parietal cortex tended to encode signals related to previously rewarded actions more reliably than unrewarded actions. In addition, signals for rewarded choices in the supplementary eye field were attenuated during the visual search task and were correlated with the tendency to switch choices during the matching-pennies task. These results suggest that the supplementary eye field might play a unique role in encouraging animals to explore alternative decision-making strategies.
Collapse
|