1
|
Pasquereau B, Turner RS. Neural dynamics underlying self-control in the primate subthalamic nucleus. eLife 2023; 12:e83971. [PMID: 37204300 PMCID: PMC10259453 DOI: 10.7554/elife.83971] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2022] [Accepted: 05/18/2023] [Indexed: 05/20/2023] Open
Abstract
The subthalamic nucleus (STN) is hypothesized to play a central role in neural processes that regulate self-control. Still uncertain, however, is how that brain structure participates in the dynamically evolving estimation of value that underlies the ability to delay gratification and wait patiently for a gain. To address that gap in knowledge, we studied the spiking activity of neurons in the STN of monkeys during a task in which animals were required to remain motionless for varying periods of time in order to obtain food reward. At the single-neuron and population levels, we found a cost-benefit integration between the desirability of the expected reward and the imposed delay to reward delivery, with STN signals that dynamically combined both attributes of the reward to form a single integrated estimate of value. This neural encoding of subjective value evolved dynamically across the waiting period that intervened after instruction cue. Moreover, this encoding was distributed inhomogeneously along the antero-posterior axis of the STN such that the most dorso-posterior-placed neurons represented the temporal discounted value most strongly. These findings highlight the selective involvement of the dorso-posterior STN in the representation of temporally discounted rewards. The combination of rewards and time delays into an integrated representation is essential for self-control, the promotion of goal pursuit, and the willingness to bear the costs of time delays.
Collapse
Affiliation(s)
- Benjamin Pasquereau
- Institut des Sciences Cognitives Marc Jeannerod, UMR 5229, Centre National de la Recherche Scientifique, 69675 Bron CedexBronFrance
- Université Claude Bernard Lyon 1, 69100 VilleurbanneVilleurbanneFrance
| | - Robert S Turner
- Department of Neurobiology, Center for Neuroscience and The Center for the Neural Basis of Cognition, University of PittsburghPittsburghUnited States
| |
Collapse
|
2
|
Buschman TJ, Miller EK. Working Memory Is Complex and Dynamic, Like Your Thoughts. J Cogn Neurosci 2023; 35:17-23. [PMID: 36322832 PMCID: PMC9832367 DOI: 10.1162/jocn_a_01940] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/04/2022]
Abstract
Working memory is where thoughts are held and manipulated. For many years, the dominant model was that working memory relied on steady-state neural dynamics. A neural representation was activated and then held in that state. However, as often happens, the more we examine working memory (especially with new technology), the more complex it looks. Recent discoveries show that working memory involves multiple mechanisms, including discontinuous bouts of spiking. Memories are also dynamic, evolving in a task-dependent manner. Cortical rhythms may control those dynamics, thereby endowing top-down "executive" control over our thoughts.
Collapse
|
3
|
Cho YT, Moujaes F, Schleifer CH, Starc M, Ji JL, Santamauro N, Adkinson B, Kolobaric A, Flynn M, Krystal JH, Murray JD, Repovs G, Anticevic A. Reward and loss incentives improve spatial working memory by shaping trial-by-trial posterior frontoparietal signals. Neuroimage 2022; 254:119139. [PMID: 35346841 PMCID: PMC9264479 DOI: 10.1016/j.neuroimage.2022.119139] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2021] [Revised: 03/15/2022] [Accepted: 03/22/2022] [Indexed: 10/29/2022] Open
Abstract
Integrating motivational signals with cognition is critical for goal-directed activities. The mechanisms that link neural changes with motivated working memory continue to be understood. Here, we tested how externally cued and non-cued (internally represented) reward and loss impact spatial working memory precision and neural circuits in human subjects using fMRI. We translated the classic delayed-response spatial working memory paradigm from non-human primate studies to take advantage of a continuous numeric measure of working memory precision, and the wealth of translational neuroscience yielded by these studies. Our results demonstrated that both cued and non-cued reward and loss improved spatial working memory precision. Visual association regions of the posterior prefrontal and parietal cortices, specifically the precentral sulcus (PCS) and intraparietal sulcus (IPS), had increased BOLD signal during incentivized spatial working memory. A subset of these regions had trial-by-trial increases in BOLD signal that were associated with better working memory precision, suggesting that these regions may be critical for linking neural signals with motivated working memory. In contrast, regions straddling executive networks, including areas in the dorsolateral prefrontal cortex, anterior parietal cortex and cerebellum displayed decreased BOLD signal during incentivized working memory. While reward and loss similarly impacted working memory processes, they dissociated during feedback when money won or avoided in loss was given based on working memory performance. During feedback, the trial-by-trial amount and valence of reward/loss received was dissociated amongst regions such as the ventral striatum, habenula and periaqueductal gray. Overall, this work suggests motivated spatial working memory is supported by complex sensory processes, and that the IPS and PCS in the posterior frontoparietal cortices may be key regions for integrating motivational signals with spatial working memory precision.
Collapse
Affiliation(s)
- Youngsun T Cho
- Yale University, Department of Psychiatry, 300 George Street, Suite 901, New Haven, CT, 06511, USA; Yale University, Child Study Center, 230 South Frontage Road, New Haven, CT, 06519, USA; Connecticut Mental Health Center, Clinical Neuroscience Research Unit, 34 Park Street, 3rd floor, New Haven, CT, 06519, USA; Yale University, Interdepartmental Neuroscience Program, Yale University Neuroscience Program, P.O. Box 208074, New Haven, CT, 06520, USA.
| | - Flora Moujaes
- Yale University, Department of Psychiatry, 300 George Street, Suite 901, New Haven, CT, 06511, USA
| | - Charles H Schleifer
- Yale University, Department of Psychiatry, 300 George Street, Suite 901, New Haven, CT, 06511, USA
| | | | - Jie Lisa Ji
- Yale University, Department of Psychiatry, 300 George Street, Suite 901, New Haven, CT, 06511, USA
| | - Nicole Santamauro
- Yale University, Department of Psychiatry, 300 George Street, Suite 901, New Haven, CT, 06511, USA
| | - Brendan Adkinson
- Yale University, Department of Psychiatry, 300 George Street, Suite 901, New Haven, CT, 06511, USA
| | - Antonija Kolobaric
- Yale University, Department of Psychiatry, 300 George Street, Suite 901, New Haven, CT, 06511, USA
| | - Morgan Flynn
- Yale University, Department of Psychiatry, 300 George Street, Suite 901, New Haven, CT, 06511, USA
| | - John H Krystal
- Yale University, Department of Psychiatry, 300 George Street, Suite 901, New Haven, CT, 06511, USA; Yale University, NIAAA Center for Translational Neuroscience of Alcoholism, 34 Park Street, 3rd floor, New Haven, CT 06519 USA
| | - John D Murray
- Yale University, Department of Psychiatry, 300 George Street, Suite 901, New Haven, CT, 06511, USA; Yale University, Interdepartmental Neuroscience Program, Yale University Neuroscience Program, P.O. Box 208074, New Haven, CT, 06520, USA; Yale University, Department of Physics, 217 Prospect Street, New Haven, CT, 06511, USA
| | - Grega Repovs
- University of Ljubljana, Department of Psychology
| | - Alan Anticevic
- Yale University, Department of Psychiatry, 300 George Street, Suite 901, New Haven, CT, 06511, USA; Connecticut Mental Health Center, Clinical Neuroscience Research Unit, 34 Park Street, 3rd floor, New Haven, CT, 06519, USA; Yale University, Interdepartmental Neuroscience Program, Yale University Neuroscience Program, P.O. Box 208074, New Haven, CT, 06520, USA; University of Zagreb, University Psychiatric Hospital Vrapce; Yale University, Department of Psychology, Box 208205, New Haven, CT, 06520-8205, USA; Yale University, NIAAA Center for Translational Neuroscience of Alcoholism, 34 Park Street, 3rd floor, New Haven, CT 06519 USA.
| |
Collapse
|
4
|
Baizer JS. Functional and Neuropathological Evidence for a Role of the Brainstem in Autism. Front Integr Neurosci 2021; 15:748977. [PMID: 34744648 PMCID: PMC8565487 DOI: 10.3389/fnint.2021.748977] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2021] [Accepted: 09/21/2021] [Indexed: 12/27/2022] Open
Abstract
The brainstem includes many nuclei and fiber tracts that mediate a wide range of functions. Data from two parallel approaches to the study of autistic spectrum disorder (ASD) implicate many brainstem structures. The first approach is to identify the functions affected in ASD and then trace the neural systems mediating those functions. While not included as core symptoms, three areas of function are frequently impaired in ASD: (1) Motor control both of the limbs and body and the control of eye movements; (2) Sensory information processing in vestibular and auditory systems; (3) Control of affect. There are critical brainstem nuclei mediating each of those functions. There are many nuclei critical for eye movement control including the superior colliculus. Vestibular information is first processed in the four nuclei of the vestibular nuclear complex. Auditory information is relayed to the dorsal and ventral cochlear nuclei and subsequently processed in multiple other brainstem nuclei. Critical structures in affect regulation are the brainstem sources of serotonin and norepinephrine, the raphe nuclei and the locus ceruleus. The second approach is the analysis of abnormalities from direct study of ASD brains. The structure most commonly identified as abnormal in neuropathological studies is the cerebellum. It is classically a major component of the motor system, critical for coordination. It has also been implicated in cognitive and language functions, among the core symptoms of ASD. This structure works very closely with the cerebral cortex; the cortex and the cerebellum show parallel enlargement over evolution. The cerebellum receives input from cortex via relays in the pontine nuclei. In addition, climbing fiber input to cerebellum comes from the inferior olive of the medulla. Mossy fiber input comes from the arcuate nucleus of the medulla as well as the pontine nuclei. The cerebellum projects to several brainstem nuclei including the vestibular nuclear complex and the red nucleus. There are thus multiple brainstem nuclei distributed at all levels of the brainstem, medulla, pons, and midbrain, that participate in functions affected in ASD. There is direct evidence that the cerebellum may be abnormal in ASD. The evidence strongly indicates that analysis of these structures could add to our understanding of the neural basis of ASD.
Collapse
Affiliation(s)
- Joan S. Baizer
- Department of Physiology and Biophysics, Jacobs School of Medicine and Biomedical Sciences, University at Buffalo, Buffalo, NY, United States
| |
Collapse
|
5
|
Hori Y, Mimura K, Nagai Y, Fujimoto A, Oyama K, Kikuchi E, Inoue KI, Takada M, Suhara T, Richmond BJ, Minamimoto T. Single caudate neurons encode temporally discounted value for formulating motivation for action. eLife 2021; 10:e61248. [PMID: 34328413 PMCID: PMC8352586 DOI: 10.7554/elife.61248] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2020] [Accepted: 07/29/2021] [Indexed: 11/27/2022] Open
Abstract
The term 'temporal discounting' describes both choice preferences and motivation for delayed rewards. Here we show that neuronal activity in the dorsal part of the primate caudate head (dCDh) signals the temporally discounted value needed to compute the motivation for delayed rewards. Macaque monkeys performed an instrumental task, in which visual cues indicated the forthcoming size and delay duration before reward. Single dCDh neurons represented the temporally discounted value without reflecting changes in the animal's physiological state. Bilateral pharmacological or chemogenetic inactivation of dCDh markedly distorted the normal task performance based on the integration of reward size and delay, but did not affect the task performance for different reward sizes without delay. These results suggest that dCDh is involved in encoding the integrated multi-dimensional information critical for motivation.
Collapse
Affiliation(s)
- Yukiko Hori
- Department of Functional Brain Imaging, National Institutes for Quantum and Radiological Science and TechnologyChibaJapan
| | - Koki Mimura
- Department of Functional Brain Imaging, National Institutes for Quantum and Radiological Science and TechnologyChibaJapan
| | - Yuji Nagai
- Department of Functional Brain Imaging, National Institutes for Quantum and Radiological Science and TechnologyChibaJapan
| | - Atsushi Fujimoto
- Department of Functional Brain Imaging, National Institutes for Quantum and Radiological Science and TechnologyChibaJapan
| | - Kei Oyama
- Department of Functional Brain Imaging, National Institutes for Quantum and Radiological Science and TechnologyChibaJapan
| | - Erika Kikuchi
- Department of Functional Brain Imaging, National Institutes for Quantum and Radiological Science and TechnologyChibaJapan
| | - Ken-ichi Inoue
- Systems Neuroscience Section, Primate Research Institute, Kyoto UniversityInuyamaJapan
| | - Masahiko Takada
- Systems Neuroscience Section, Primate Research Institute, Kyoto UniversityInuyamaJapan
| | - Tetsuya Suhara
- Department of Functional Brain Imaging, National Institutes for Quantum and Radiological Science and TechnologyChibaJapan
| | - Barry J Richmond
- Laboratory of Neuropsychology, National Institute of Mental Health, National Institutes of Health, Department of Health and Human ServicesBethesdaUnited States
| | - Takafumi Minamimoto
- Department of Functional Brain Imaging, National Institutes for Quantum and Radiological Science and TechnologyChibaJapan
| |
Collapse
|
6
|
Pomper JK, Spadacenta S, Bunjes F, Arnstein D, Giese MA, Thier P. Representation of the observer's predicted outcome value in mirror and nonmirror neurons of macaque F5 ventral premotor cortex. J Neurophysiol 2020; 124:941-961. [PMID: 32783574 DOI: 10.1152/jn.00234.2020] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
In the search for the function of mirror neurons, a previous study reported that F5 mirror neuron responses are modulated by the value that the observing monkey associates with the grasped object. Yet we do not know whether mirror neurons are modulated by the expected reward value for the observer or also by other variables, which are causally dependent on value (e.g., motivation, attention directed at the observed action, arousal). To clarify this, we trained two rhesus macaques to observe a grasping action on an object kept constant, followed by four fully predictable outcomes of different values (2 outcomes with positive and 2 with negative emotional valence). We found a consistent order in population activity of both mirror and nonmirror neurons that matches the order of the value of this predicted outcome but that does not match the order of the above-mentioned value-dependent variables. These variables were inferred from the probability not to abort a trial, saccade latency, modulation of eye position during action observation, heart rate, and pupil size. Moreover, we found subpopulations of neurons tuned to each of the four predicted outcome values. Multidimensional scaling revealed equal normalized distances of 0.25 between the two positive and between the two negative outcomes suggesting the representation of a relative value, scaled to the task setting. We conclude that F5 mirror neurons and nonmirror neurons represent the observer's predicted outcome value, which in the case of mirror neurons may be transferred to the observed object or action.NEW & NOTEWORTHY Both the populations of F5 mirror neurons and nonmirror neurons represent the predicted value of an outcome resulting from the observation of a grasping action. Value-dependent motivation, arousal, and attention directed at the observed action do not provide a better explanation for this representation. The population activity's metric suggests an optimal scaling of value representation to task setting.
Collapse
Affiliation(s)
- Joern K Pomper
- Department of Cognitive Neurology, Hertie Institute for Clinical Brain Research, University of Tuebingen, Tuebingen, Germany
| | - Silvia Spadacenta
- Department of Cognitive Neurology, Hertie Institute for Clinical Brain Research, University of Tuebingen, Tuebingen, Germany
| | - Friedemann Bunjes
- Department of Cognitive Neurology, Hertie Institute for Clinical Brain Research, University of Tuebingen, Tuebingen, Germany
| | - Daniel Arnstein
- Department of Cognitive Neurology, Hertie Institute for Clinical Brain Research, University of Tuebingen, Tuebingen, Germany
| | - Martin A Giese
- Section for Computational Sensomotorics, Department of Cognitive Neurology, Centre for Integrative Neuroscience and Hertie Institute for Clinical Brain Research, University of Tuebingen, Tuebingen, Germany
| | - Peter Thier
- Department of Cognitive Neurology, Hertie Institute for Clinical Brain Research, University of Tuebingen, Tuebingen, Germany
| |
Collapse
|
7
|
Miller EK, Lundqvist M, Bastos AM. Working Memory 2.0. Neuron 2018; 100:463-475. [PMID: 30359609 PMCID: PMC8112390 DOI: 10.1016/j.neuron.2018.09.023] [Citation(s) in RCA: 390] [Impact Index Per Article: 65.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2018] [Revised: 09/04/2018] [Accepted: 09/12/2018] [Indexed: 12/24/2022]
Abstract
Working memory is the fundamental function by which we break free from reflexive input-output reactions to gain control over our own thoughts. It has two types of mechanisms: online maintenance of information and its volitional or executive control. Classic models proposed persistent spiking for maintenance but have not explicitly addressed executive control. We review recent theoretical and empirical studies that suggest updates and additions to the classic model. Synaptic weight changes between sparse bursts of spiking strengthen working memory maintenance. Executive control acts via interplay between network oscillations in gamma (30-100 Hz) in superficial cortical layers (layers 2 and 3) and alpha and beta (10-30 Hz) in deep cortical layers (layers 5 and 6). Deep-layer alpha and beta are associated with top-down information and inhibition. It regulates the flow of bottom-up sensory information associated with superficial layer gamma. We propose that interactions between different rhythms in distinct cortical layers underlie working memory maintenance and its volitional control.
Collapse
Affiliation(s)
- Earl K Miller
- The Picower Institute for Learning and Memory, Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA 02139, USA.
| | - Mikael Lundqvist
- The Picower Institute for Learning and Memory, Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | - André M Bastos
- The Picower Institute for Learning and Memory, Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| |
Collapse
|
8
|
Encoding of Reward and Decoding Movement from the Frontal Eye Field during Smooth Pursuit Eye Movements. J Neurosci 2018; 38:10515-10524. [PMID: 30355635 DOI: 10.1523/jneurosci.1654-18.2018] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2018] [Revised: 10/02/2018] [Accepted: 10/05/2018] [Indexed: 11/21/2022] Open
Abstract
Expectation of reward potentiates sensorimotor transformations to drive vigorous movements. One of the main challenges in studying reward is to determine how representations of reward interact with the computations that drive behavior. We recorded activity in smooth pursuit neurons in the frontal eye field (FEF) of two male rhesus monkeys while controlling the eye speed by manipulating either reward size or target speed. The neurons encoded the different reward conditions more strongly than the different target speed conditions. This pattern could not be explained by differences in the eye speed, since the eye speed sensitivity of the neurons was also larger for the reward conditions. Pooling the responses by the preferred direction of the neurons attenuated the reward modulation and led to a tighter association between neural activity and behavior. Therefore, a plausible decoder such as the population vector could explain how the FEF both drives behavior and encodes reward beyond behavior.SIGNIFICANCE STATEMENT Motor areas combine sensory and reward information to drive movement. To disambiguate these sources, we manipulated the speed of smooth pursuit eye movements by controlling either the size of the reward or the speed of the visual motion signals. We found that the relationship between activity in frontal eye field and eye kinematics varied: the eye speed sensitivity was larger for the different reward conditions than for the different target speed conditions. Decoders that pooled signals by the preferred direction of the neurons attenuated the reward modulations. These decoders may indicate how reward can be both encoded beyond eye kinematics at the single neuron level and drive movement at the population level.
Collapse
|
9
|
Hartmann K, Veit L, Nieder A. Neurons in the crow nidopallium caudolaterale encode varying durations of visual working memory periods. Exp Brain Res 2017; 236:215-226. [PMID: 29128976 DOI: 10.1007/s00221-017-5120-3] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2017] [Accepted: 11/02/2017] [Indexed: 12/14/2022]
Abstract
Adaptive sequential behaviors rely on the bridging and integration of temporally separate information for the realization of prospective goals. Corvids' remarkable behavioral flexibility is thought to depend on the workings of the nidopallium caudolaterale (NCL), a high-level avian associative forebrain area. We trained carrion crows to remember visual items for three alternating delay durations in a delayed match-to-sample task and recorded single-unit activity from the NCL. Sample-selective delay activity, a correlate of visual working memory, was maintained throughout different working memory durations. Delay responses remained selective for the same preferred sample item across blocks with different delay durations. However, selectivity strength decreased with increasing delay durations, mirroring worsened behavioral performance with longer memory delays. Behavioral relevance of delay activity was further evidenced by reduced encoding of the preferred sample item during error trials. In addition, NCL neurons adapted their time-dependent discharges to blocks of different memory durations, so that delay duration could be successfully classified based on population activity a few trials after the delay duration switched. Therefore, NCL neurons not only maintain information from individual trials, but also keep track of the duration for which this information is needed in the context of the task. These results strengthen the role of corvid NCL in maintaining working memory for flexible control of temporally extended goal-directed behavior.
Collapse
Affiliation(s)
- Konstantin Hartmann
- Animal Physiology, Institute of Neurobiology, University of Tübingen, Auf der Morgenstelle 28, 72076, Tübingen, Germany
| | - Lena Veit
- Animal Physiology, Institute of Neurobiology, University of Tübingen, Auf der Morgenstelle 28, 72076, Tübingen, Germany
| | - Andreas Nieder
- Animal Physiology, Institute of Neurobiology, University of Tübingen, Auf der Morgenstelle 28, 72076, Tübingen, Germany.
| |
Collapse
|
10
|
The necessity to choose causes reward-related anticipatory biasing: Parieto-occipital alpha-band oscillations reveal suppression of low-value targets. Sci Rep 2017; 7:14318. [PMID: 29085041 PMCID: PMC5662762 DOI: 10.1038/s41598-017-14742-w] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2017] [Accepted: 10/12/2017] [Indexed: 01/08/2023] Open
Abstract
Positive outcome of actions can be maximized by choosing the option with the highest reward. For saccades, it has recently been suggested that the necessity to choose is, in fact, an important factor mediating reward effects: latencies to single low-reward targets increased with an increasing proportion of interleaved choice-trials, in which participants were free to choose between two targets to obtain either a high or low reward. Here, we replicate this finding for manual responses, demonstrating that this effect of choice is a more general, effector-independent phenomenon. Oscillatory activity in the alpha and beta band in the preparatory period preceding target onset was analysed for a parieto-occipital and a centrolateral region of interest to identify an anticipatory neural biasing mechanism related to visuospatial attention or motor preparation. When the proportion of interleaved choices was high, an increase in lateralized posterior alpha power indicated that the hemifield associated with a low reward was suppressed in preparation for reward-maximizing target selection. The larger the individual increase in lateralized alpha power, the slower the reaction times to low-reward targets. At a broader level, these findings support the notion that reward only affects responses when behaviour can be optimized to maximize positive outcome.
Collapse
|
11
|
Ramakrishnan A, Byun YW, Rand K, Pedersen CE, Lebedev MA, Nicolelis MAL. Cortical neurons multiplex reward-related signals along with sensory and motor information. Proc Natl Acad Sci U S A 2017; 114:E4841-E4850. [PMID: 28559307 PMCID: PMC5474796 DOI: 10.1073/pnas.1703668114] [Citation(s) in RCA: 46] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Rewards are known to influence neural activity associated with both motor preparation and execution. This influence can be exerted directly upon the primary motor (M1) and somatosensory (S1) cortical areas via the projections from reward-sensitive dopaminergic neurons of the midbrain ventral tegmental areas. However, the neurophysiological manifestation of reward-related signals in M1 and S1 are not well understood. Particularly, it is unclear how the neurons in these cortical areas multiplex their traditional functions related to the control of spatial and temporal characteristics of movements with the representation of rewards. To clarify this issue, we trained rhesus monkeys to perform a center-out task in which arm movement direction, reward timing, and magnitude were manipulated independently. Activity of several hundred cortical neurons was simultaneously recorded using chronically implanted microelectrode arrays. Many neurons (9-27%) in both M1 and S1 exhibited activity related to reward anticipation. Additionally, neurons in these areas responded to a mismatch between the reward amount given to the monkeys and the amount they expected: A lower-than-expected reward caused a transient increase in firing rate in 60-80% of the total neuronal sample, whereas a larger-than-expected reward resulted in a decreased firing rate in 20-35% of the neurons. Moreover, responses of M1 and S1 neurons to reward omission depended on the direction of movements that led to those rewards. These observations suggest that sensorimotor cortical neurons corepresent rewards and movement-related activity, presumably to enable reward-based learning.
Collapse
Affiliation(s)
- Arjun Ramakrishnan
- Department of Neurobiology, Duke University, Durham, NC 27710
- Duke University Center for Neuroengineering, Duke University, Durham, NC 27710
| | - Yoon Woo Byun
- Duke University Center for Neuroengineering, Duke University, Durham, NC 27710
- Department of Biomedical Engineering, Duke University, Durham, NC 27708
| | - Kyle Rand
- Department of Biomedical Engineering, Duke University, Durham, NC 27708
| | - Christian E Pedersen
- Joint Department of Biomedical Engineering, University of North Carolina-Chapel Hill and North Carolina State University, Raleigh, NC 27695
| | - Mikhail A Lebedev
- Department of Neurobiology, Duke University, Durham, NC 27710
- Duke University Center for Neuroengineering, Duke University, Durham, NC 27710
| | - Miguel A L Nicolelis
- Department of Neurobiology, Duke University, Durham, NC 27710;
- Duke University Center for Neuroengineering, Duke University, Durham, NC 27710
- Department of Biomedical Engineering, Duke University, Durham, NC 27708
- Department of Psychology and Neuroscience, Duke University, Durham, NC 27708
- Department of Neurology, Duke University, Durham, NC 27710
- Edmund and Lily Safra International Institute of Neurosciences, Natal 59066060, Brazil
| |
Collapse
|
12
|
Padoa-Schioppa C, Schoenbaum G. Dialogue on economic choice, learning theory, and neuronal representations. Curr Opin Behav Sci 2015; 5:16-23. [PMID: 26613099 DOI: 10.1016/j.cobeha.2015.06.004] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
In recent years, two distinct lines of work have focused on the substrates of associative learning and on the mechanisms of economic decisions. While experiments often focused the same brain regions - most notably the orbitofrontal cortex - the two literatures have remained largely distinct. Here we engage in a dialogue with the intent to clarify the relationship between the two frameworks. We identify a potential correspondence between the concept of outcome defined in learning theory and that of good defined in neuroeconomics, and we specifically discuss the concept of value defined in the two frameworks. While many differences remain unresolved, a common idea is that good/outcome values are subjective, devaluation-sensitive and computed on the fly, not "cached" or pre-computed.
Collapse
Affiliation(s)
- Camillo Padoa-Schioppa
- Departments of Anatomy and Neurobiology, Economics and Biomedical Engineering, Washington University, St. Louis, MO 63110
| | | |
Collapse
|
13
|
Abstract
Rewards are crucial objects that induce learning, approach behavior, choices, and emotions. Whereas emotions are difficult to investigate in animals, the learning function is mediated by neuronal reward prediction error signals which implement basic constructs of reinforcement learning theory. These signals are found in dopamine neurons, which emit a global reward signal to striatum and frontal cortex, and in specific neurons in striatum, amygdala, and frontal cortex projecting to select neuronal populations. The approach and choice functions involve subjective value, which is objectively assessed by behavioral choices eliciting internal, subjective reward preferences. Utility is the formal mathematical characterization of subjective value and a prime decision variable in economic choice theory. It is coded as utility prediction error by phasic dopamine responses. Utility can incorporate various influences, including risk, delay, effort, and social interaction. Appropriate for formal decision mechanisms, rewards are coded as object value, action value, difference value, and chosen value by specific neurons. Although all reward, reinforcement, and decision variables are theoretical constructs, their neuronal signals constitute measurable physical implementations and as such confirm the validity of these concepts. The neuronal reward signals provide guidance for behavior while constraining the free will to act.
Collapse
Affiliation(s)
- Wolfram Schultz
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge, United Kingdom
| |
Collapse
|
14
|
Nakazono T, Sano T, Takahashi S, Sakurai Y. Theta oscillation and neuronal activity in rat hippocampus are involved in temporal discrimination of time in seconds. Front Syst Neurosci 2015; 9:95. [PMID: 26157367 PMCID: PMC4475904 DOI: 10.3389/fnsys.2015.00095] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2015] [Accepted: 06/04/2015] [Indexed: 11/13/2022] Open
Abstract
The discovery of time cells revealed that the rodent hippocampus has information regarding time. Previous studies have suggested that the role of hippocampal time cells is to integrate temporally segregated events into a sequence using working memory with time perception. However, it is unclear whether hippocampal cells contribute to time perception itself because most previous studies employed delayed matching-to-sample tasks that did not separately evaluate time perception from working memory processes. Here, we investigated the function of the rat hippocampus in time perception using a temporal discrimination task. In the task, rats had to discriminate between durations of 1 and 3 s to get a reward, and maintaining task-related information as working memory was not required. We found that some hippocampal neurons showed firing rate modulation similar to that of time cells. Moreover, theta oscillation of local field potentials (LFPs) showed a transient enhancement of power during time discrimination periods. However, there were little relationships between the neuronal activities and theta oscillations. These results suggest that both the individual neuronal activities and theta oscillations of LFPs in the hippocampus have a possibility to be engaged in seconds order time perception; however, they participate in different ways.
Collapse
Affiliation(s)
- Tomoaki Nakazono
- Department of Psychology, Graduate School of Letters, Kyoto University Kyoto, Japan ; Laboratory of Neural Information, Graduate School of Brain Science, Doshisha University Kyotanabe, Japan
| | - Tomomi Sano
- Department of Psychology, Graduate School of Letters, Kyoto University Kyoto, Japan
| | - Susumu Takahashi
- Laboratory of Neural Circuitry, Graduate School of Brain Science, Doshisha University Kyotanabe, Japan
| | - Yoshio Sakurai
- Department of Psychology, Graduate School of Letters, Kyoto University Kyoto, Japan ; Laboratory of Neural Information, Graduate School of Brain Science, Doshisha University Kyotanabe, Japan
| |
Collapse
|
15
|
Kilavik BE, Confais J, Riehle A. Signs of timing in motor cortex during movement preparation and cue anticipation. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2015; 829:121-42. [PMID: 25358708 DOI: 10.1007/978-1-4939-1782-2_7] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/21/2023]
Abstract
The capacity to accurately anticipate the timing of predictable events is essential for sensorimotor behavior. Motor cortex holds an established role in movement preparation and execution. In this chapter we review the different ways in which motor cortical activity is modulated by event timing in sensorimotor delay tasks. During movement preparation, both single neuron and population responses reflect the temporal constraints of the task. Anticipatory modulations prior to sensory cues are also observed in motor cortex when the cue timing is predictable. We propose that the motor cortical activity during cue anticipation and movement preparation is embedded in a timing network that facilitates sensorimotor processing. In this context, the pre-cue and post-cue activity may reflect a presetting mechanism, complementing processing during movement execution, while prohibiting premature responses in situations requiring delayed motor output.
Collapse
Affiliation(s)
- Bjørg Elisabeth Kilavik
- Institut de Neurosciences de la Timone (INT), CNRS - Aix Marseille Université, Marseille, France
| | | | | |
Collapse
|
16
|
Ichihara-Takeda S, Yazawa S, Murahara T, Toyoshima T, Shinozaki J, Ishiguro M, Shiraishi H, Ikeda N, Matsuyama K, Funahashi S, Nagamine T. Modulation of Alpha Activity in the Parieto-occipital Area by Distractors during a Visuospatial Working Memory Task: A Magnetoencephalographic Study. J Cogn Neurosci 2015; 27:453-63. [DOI: 10.1162/jocn_a_00718] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]
Abstract
Abstract
Oscillatory brain activity is known to play an essential role in information processing in working memory. Recent studies have indicated that alpha activity (8–13 Hz) in the parieto-occipital area is strongly modulated in working memory tasks. However, the function of alpha activity in working memory is open to several interpretations, such that alpha activity may be a direct neural correlate of information processing in working memory or may reflect disengagement from information processing in other brain areas. To examine the functional contribution of alpha activity to visuospatial working memory, we introduced visuospatial distractors during a delay period and examined neural activity from the whole brain using magnetoencephalography. The strength of event-related alpha activity was estimated using the temporal spectral evolution (TSE) method. The results were as follows: (1) an increase of alpha activity during the delay period as indicated by elevated TSE curves was observed in parieto-occipital sensors in both the working memory task and a control task that did not require working memory; and (2) an increase of alpha activity during the delay period was not observed when distractors were presented, although TSE curves were constructed only from correct trials. These results indicate that the increase of alpha activity is not directly related to information processing in working memory but rather reflects the disengagement of attention from the visuospatial input.
Collapse
|
17
|
Neurophysiology of Reward-Guided Behavior: Correlates Related to Predictions, Value, Motivation, Errors, Attention, and Action. Curr Top Behav Neurosci 2015; 27:199-230. [PMID: 26276036 DOI: 10.1007/7854_2015_382] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Many brain areas are activated by the possibility and receipt of reward. Are all of these brain areas reporting the same information about reward? Or are these signals related to other functions that accompany reward-guided learning and decision-making? Through carefully controlled behavioral studies, it has been shown that reward-related activity can represent reward expectations related to future outcomes, errors in those expectations, motivation, and signals related to goal- and habit-driven behaviors. These dissociations have been accomplished by manipulating the predictability of positively and negatively valued events. Here, we review single neuron recordings in behaving animals that have addressed this issue. We describe data showing that several brain areas, including orbitofrontal cortex, anterior cingulate, and basolateral amygdala signal reward prediction. In addition, anterior cingulate, basolateral amygdala, and dopamine neurons also signal errors in reward prediction, but in different ways. For these areas, we will describe how unexpected manipulations of positive and negative value can dissociate signed from unsigned reward prediction errors. All of these signals feed into striatum to modify signals that motivate behavior in ventral striatum and guide responding via associative encoding in dorsolateral striatum.
Collapse
|
18
|
Bermudez MA, Schultz W. Timing in reward and decision processes. Philos Trans R Soc Lond B Biol Sci 2014; 369:20120468. [PMID: 24446502 DOI: 10.1098/rstb.2012.0468] [Citation(s) in RCA: 40] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Sensitivity to time, including the time of reward, guides the behaviour of all organisms. Recent research suggests that all major reward structures of the brain process the time of reward occurrence, including midbrain dopamine neurons, striatum, frontal cortex and amygdala. Neuronal reward responses in dopamine neurons, striatum and frontal cortex show temporal discounting of reward value. The prediction error signal of dopamine neurons includes the predicted time of rewards. Neurons in the striatum, frontal cortex and amygdala show responses to reward delivery and activities anticipating rewards that are sensitive to the predicted time of reward and the instantaneous reward probability. Together these data suggest that internal timing processes have several well characterized effects on neuronal reward processing.
Collapse
Affiliation(s)
- Maria A Bermudez
- Department of Physiology, Development and Neuroscience, University of Cambridge, , Cambridge, UK
| | | |
Collapse
|
19
|
From Duration and Distance Comparisons to Goal Encoding in Prefrontal Cortex. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2014; 829:167-86. [DOI: 10.1007/978-1-4939-1782-2_10] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
|
20
|
Matell MS. Searching for the holy grail: temporally informative firing patterns in the rat. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2014; 829:209-34. [PMID: 25358713 DOI: 10.1007/978-1-4939-1782-2_12] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
This chapter reviews our work from the past decade investigating cortical and striatal firing patterns in rats while they time intervals in the multi-seconds range. We have found that both cortical and striatal firing rates contain information that the rat can use to identify how much time has elapsed both from trial onset and from the onset of an active response state. I describe findings showing that the striatal neurons that are modulated by time are also modulated by overt behaviors, suggesting that time modulates the strength of motor coding in the striatum, rather than being represented as an abstract quantity in isolation. I also describe work showing that there are a variety of temporally informative activity patterns in pre-motor cortex, and argue that the heterogeneity of these patterns can enhance an organism's temporal estimate. Finally, I describe recent behavioral work from my lab in which the simultaneous cueing of multiple durations leads to a scalar temporal expectation at an intermediate time, providing strong support for a monotonic representation of time.
Collapse
|
21
|
Prefrontal cortex and neural mechanisms of executive function. ACTA ACUST UNITED AC 2013; 107:471-82. [DOI: 10.1016/j.jphysparis.2013.05.001] [Citation(s) in RCA: 123] [Impact Index Per Article: 11.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2013] [Revised: 04/26/2013] [Accepted: 05/02/2013] [Indexed: 11/23/2022]
|
22
|
Abstract
Recent work has advanced our knowledge of phasic dopamine reward prediction error signals. The error signal is bidirectional, reflects well the higher order prediction error described by temporal difference learning models, is compatible with model-free and model-based reinforcement learning, reports the subjective rather than physical reward value during temporal discounting and reflects subjective stimulus perception rather than physical stimulus aspects. Dopamine activations are primarily driven by reward, and to some extent risk, whereas punishment and salience have only limited activating effects when appropriate controls are respected. The signal is homogeneous in terms of time course but heterogeneous in many other aspects. It is essential for synaptic plasticity and a range of behavioural learning situations.
Collapse
Affiliation(s)
- Wolfram Schultz
- Department of Physiology, Development and Neuroscience, University of Cambridge, Downing Street, Cambridge CB2 3DY, UK.
| |
Collapse
|
23
|
Minamimoto T, Hori Y, Richmond BJ. Is working more costly than waiting in monkeys? PLoS One 2012; 7:e48434. [PMID: 23144881 PMCID: PMC3492350 DOI: 10.1371/journal.pone.0048434] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2012] [Accepted: 09/25/2012] [Indexed: 11/18/2022] Open
Abstract
We studied how value for instrumental action is discounted by predicted effort and delay. The monkeys were trained to perform instrumental trials that required a bar release when a visual target changed from red-to-green. There were two trial conditions. In delay trials, after the monkeys performed one instrumental trial correctly a reward was delivered 0-7 seconds later. In work trials, the monkeys had to perform 0, 1, or 2 additional instrumental trials to obtain a reward. The lengths of trials in delay matched the time it took to complete work trials. The length of delay or number of trials was indicated by a visual cue presented throughout the trial. Our hypothesis was that the monkeys would all show temporal discounting of reward in the delay trials, and that in the work trials the monkeys' performance might reflect an additional cost due to working. The error rate increased linearly as remaining cost increased for all 8 monkeys. For 4 monkeys the error rate was significantly larger in work trials than in delay trials (effort sensitive monkeys). For the other 4 monkeys there was no significant difference in error rate (effort insensitive monkeys). Since the error rate has an inverse relation with value for action, these results suggest that value is discounted hyperbolically by effort as well as by delay. Error rates generally increased as the testing sessions progressed and the total reward accumulated (i.e., effect of reward devaluation). The relative impact of delay and effort on error rates was reasonably stable within subjects. Thus, within the monkey population there seems to be a significant dichotomy in the sensitivity governing whether working is more costly than waiting, possibly arising from a constitutional or genetic trait.
Collapse
Affiliation(s)
- Takafumi Minamimoto
- Department of Molecular Neuroimaging, Molecular Imaging Center, National Institute of Radiological Sciences, Chiba, Japan.
| | | | | |
Collapse
|
24
|
The temporal derivative of expected utility: a neural mechanism for dynamic decision-making. Neuroimage 2012; 65:223-30. [PMID: 22963852 DOI: 10.1016/j.neuroimage.2012.08.063] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2012] [Revised: 07/28/2012] [Accepted: 08/21/2012] [Indexed: 11/20/2022] Open
Abstract
Real world tasks involving moving targets, such as driving a vehicle, are performed based on continuous decisions thought to depend upon the temporal derivative of the expected utility (∂V/∂t), where the expected utility (V) is the effective value of a future reward. However, the neural mechanisms that underlie dynamic decision-making are not well understood. This study investigates human neural correlates of both V and ∂V/∂t using fMRI and a novel experimental paradigm based on a pursuit-evasion game optimized to isolate components of dynamic decision processes. Our behavioral data show that players of the pursuit-evasion game adopt an exponential discounting function, supporting the expected utility theory. The continuous functions of V and ∂V/∂t were derived from the behavioral data and applied as regressors in fMRI analysis, enabling temporal resolution that exceeded the sampling rate of image acquisition, hyper-temporal resolution, by taking advantage of numerous trials that provide rich and independent manipulation of those variables. V and ∂V/∂t were each associated with distinct neural activity. Specifically, ∂V/∂t was associated with anterior and posterior cingulate cortices, superior parietal lobule, and ventral pallidum, whereas V was primarily associated with supplementary motor, pre and post central gyri, cerebellum, and thalamus. The association between the ∂V/∂t and brain regions previously related to decision-making is consistent with the primary role of the temporal derivative of expected utility in dynamic decision-making.
Collapse
|
25
|
Kan JYY, Niel U, Dorris MC. Evidence for a link between the experiential allocation of saccade preparation and visuospatial attention. J Neurophysiol 2011; 107:1413-20. [PMID: 22170962 DOI: 10.1152/jn.00534.2011] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
Whether a link exists between the two orienting processes of saccade preparation and visuospatial attention has typically been studied by using either sensory cues or predetermined rules that instruct subjects where to allocate these limited resources. In the real world, explicit instructions are not always available and presumably expectations shaped by previous experience play an important role in the allocation of these processes. Here we examined whether manipulating two experiential factors that clearly influence saccade preparation--the probability and timing of saccadic responses--also influences the allocation of visuospatial attention. Occasionally, a visual probe was presented whose spatial location and time of presentation varied relative to those of the saccade target. The proportion of erroneous saccades directed toward this probe indexed saccade preparation, and the proportion of correct discriminations of probe orientation indexed visuospatial attention. Overall, preparation and attention were significantly correlated to each other across these manipulations of saccade probability and timing. Saccade probability influenced both preparation and attention processes, whereas saccade timing influenced only preparation processes. Unexpectedly, discrimination ability was not improved in those trials in which the probe triggered an erroneous saccade despite particularly heightened levels of saccade preparation. To account for our results, we propose a conceptual dual-purpose threshold model based on neurophysiological considerations that link the processes of saccade preparation and visuospatial attention. The threshold acts both as the minimum activity level required for eliciting saccades and a maximum level for which neural activity can provide attentional benefits.
Collapse
Affiliation(s)
- Janis Y Y Kan
- Department of Physiology and Centre for Neuroscience Studies, Queen’s University, Kingston, Ontario, Canada
| | | | | |
Collapse
|
26
|
Roesch MR, Bryden DW. Impact of size and delay on neural activity in the rat limbic corticostriatal system. Front Neurosci 2011; 5:130. [PMID: 22363252 PMCID: PMC3277262 DOI: 10.3389/fnins.2011.00130] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2011] [Accepted: 11/04/2011] [Indexed: 11/17/2022] Open
Abstract
A number of factors influence an animal’s economic decisions. Two most commonly studied are the magnitude of and delay to reward. To investigate how these factors are represented in the firing rates of single neurons, we devised a behavioral task that independently manipulated the expected delay to and size of reward. Rats perceived the differently delayed and sized rewards as having different values and were more motivated under short delay and big-reward conditions than under long delay and small reward conditions as measured by percent choice, accuracy, and reaction time. Since the creation of this task, we have recorded from several different brain areas including, orbitofrontal cortex, striatum, amygdala, substantia nigra pars reticulata, and midbrain dopamine neurons. Here, we review and compare those data with a substantial focus on those areas that have been shown to be critical for performance on classic time discounting procedures and provide a potential mechanism by which they might interact when animals are deciding between differently delayed rewards. We found that most brain areas in the cortico-limbic circuit encode both the magnitude and delay to reward delivery in one form or another, but only a few encode them together at the single neuron level.
Collapse
Affiliation(s)
- Matthew R Roesch
- Department of Psychology and Program in Neuroscience and Cognitive Science, University of Maryland College Park, MD, USA
| | | |
Collapse
|
27
|
Kim S, Lee D. Prefrontal cortex and impulsive decision making. Biol Psychiatry 2011; 69:1140-6. [PMID: 20728878 PMCID: PMC2991430 DOI: 10.1016/j.biopsych.2010.07.005] [Citation(s) in RCA: 162] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/01/2010] [Revised: 07/04/2010] [Accepted: 07/07/2010] [Indexed: 10/19/2022]
Abstract
Impulsivity refers to a set of heterogeneous behaviors that are tuned suboptimally along certain temporal dimensions. Impulsive intertemporal choice refers to the tendency to forego a large but delayed reward and to seek an inferior but more immediate reward, whereas impulsive motor responses also result when the subjects fail to suppress inappropriate automatic behaviors. In addition, impulsive actions can be produced when too much emphasis is placed on speed rather than accuracy in a wide range of behaviors, including perceptual decision making. Despite this heterogeneous nature, the prefrontal cortex and its connected areas, such as the basal ganglia, play an important role in gating impulsive actions in a variety of behavioral tasks. Here, we describe key features of computations necessary for optimal decision making and how their failures can lead to impulsive behaviors. We also review the recent findings from neuroimaging and single-neuron recording studies on the neural mechanisms related to impulsive behaviors. Converging approaches in economics, psychology, and neuroscience provide a unique vista for better understanding the nature of behavioral impairments associated with impulsivity.
Collapse
Affiliation(s)
- Soyoun Kim
- Department of Neurobiology, Yale University School of Medicine
| | - Daeyeol Lee
- Department of Neurobiology, Yale University School of Medicine, Kavli Institute for Neuroscience, Yale University School of Medicine, Department of Psychology, Yale University
| |
Collapse
|
28
|
Yumoto N, Lu X, Henry TR, Miyachi S, Nambu A, Fukai T, Takada M. A neural correlate of the processing of multi-second time intervals in primate prefrontal cortex. PLoS One 2011; 6:e19168. [PMID: 21556372 PMCID: PMC3083430 DOI: 10.1371/journal.pone.0019168] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2010] [Accepted: 03/21/2011] [Indexed: 12/02/2022] Open
Abstract
Several areas of the brain are known to participate in temporal processing.
Neurons in the prefrontal cortex (PFC) are thought to contribute to perception
of time intervals. However, it remains unclear whether the PFC itself can
generate time intervals independently of external stimuli. Here we describe a
group of PFC neurons in area 9 that became active when monkeys recognized a
particular elapsed time within the range of 1–7 seconds. Another group of
area 9 neurons became active only when subjects reproduced a specific interval
without external cues. Both types of neurons were individually tuned to
recognize or reproduce particular intervals. Moreover, the injection of
muscimol, a GABA agonist, into this area bilaterally resulted in an increase in
the error rate during time interval reproduction. These results suggest that
area 9 may process multi-second intervals not only in perceptual recognition,
but also in internal generation of time intervals.
Collapse
Affiliation(s)
- Naosugi Yumoto
- Department of System Neuroscience, Tokyo
Metropolitan Institute for Neuroscience, Tokyo Metropolitan Organization for
Medical Research, Fuchu, Tokyo, Japan
| | - Xiaofeng Lu
- Department of System Neuroscience, Tokyo
Metropolitan Institute for Neuroscience, Tokyo Metropolitan Organization for
Medical Research, Fuchu, Tokyo, Japan
- Department of Neurology, School of Medicine,
University of Minnesota, Minneapolis, Minnesota, United States of
America
- Clinical Neuroscience Center, University of
Minnesota Medical Center, Fairview, Minneapolis, Minnesota, United States of
America
- Department of Neurophysiology, School of
Medicine, Juntendo University, Tokyo, Japan
- * E-mail:
| | - Thomas R. Henry
- Department of Neurology, School of Medicine,
University of Minnesota, Minneapolis, Minnesota, United States of
America
- Clinical Neuroscience Center, University of
Minnesota Medical Center, Fairview, Minneapolis, Minnesota, United States of
America
| | - Shigehiro Miyachi
- Department of System Neuroscience, Tokyo
Metropolitan Institute for Neuroscience, Tokyo Metropolitan Organization for
Medical Research, Fuchu, Tokyo, Japan
- Cognitive Neuroscience Section, Primate
Research Institute, Kyoto University, Inuyama, Aichi, Japan
| | - Atsushi Nambu
- Division of System Neurophysiology, National
Institute for Physiological Sciences, Okazaki, Aichi, Japan
| | - Tomoki Fukai
- Laboratory for Neural Circuit Theory, RIKEN
Brain Science Institute, Wako, Saitama, Japan
| | - Masahiko Takada
- Department of System Neuroscience, Tokyo
Metropolitan Institute for Neuroscience, Tokyo Metropolitan Organization for
Medical Research, Fuchu, Tokyo, Japan
- Systems Neuroscience Section, Primate Research
Institute, Kyoto University, Inuyama, Aichi, Japan
| |
Collapse
|
29
|
Berdyyeva TK, Olson CR. Relation of ordinal position signals to the expectation of reward and passage of time in four areas of the macaque frontal cortex. J Neurophysiol 2011; 105:2547-59. [PMID: 21389312 DOI: 10.1152/jn.00903.2010] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
Neurons in several areas of the monkey frontal cortex exhibit rank selectivity, firing differentially as a function of the stage attained during the performance of a serial order task. The activity of these neurons is commonly thought to represent ordinal position within the trial. However, they might also be sensitive to factors correlated with ordinal position including time elapsed during the trial (which is greater for each successive stage) and the degree of anticipation of reward (which probably increases at each successive stage). To compare the influences of these factors, we monitored neuronal activity in the supplementary motor area (SMA), presupplementary motor area (pre-SMA), supplementary eye field (SEF), and dorsolateral prefrontal cortex during the performance of a serial order task (requiring a series of saccades in three specified directions), a variable reward task (in which a cue displayed early in the trial indicated whether the reward received at the end of the trial would be large or small), and a long delay task (in which the monkey had simply to maintain fixation during a period of time approximating the duration of an average trial in the serial order task). We found that rank signals were partially correlated with sensitivity to elapsed time and anticipated reward. The connection to elapsed time was strongest in the pre-SMA. The connection to anticipated reward was most pronounced in the SMA and SEF. However, critically, these factors could not fully explain rank selectivity in any of the areas tested.
Collapse
Affiliation(s)
- Tamara K Berdyyeva
- Center for the Neural Basis of Cognition and Department of Biological Sciences, Carnegie Mellon University, Pittsburgh, Pennsylvania, USA.
| | | |
Collapse
|
30
|
Cai X, Kim S, Lee D. Heterogeneous coding of temporally discounted values in the dorsal and ventral striatum during intertemporal choice. Neuron 2011; 69:170-82. [PMID: 21220107 DOI: 10.1016/j.neuron.2010.11.041] [Citation(s) in RCA: 127] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/19/2010] [Indexed: 10/18/2022]
Abstract
In choosing between different rewards expected after unequal delays, humans and animals often prefer the smaller but more immediate reward, indicating that the subjective value or utility of reward is depreciated according to its delay. Here, we show that neurons in the primate caudate nucleus and ventral striatum modulate their activity according to temporally discounted values of rewards with a similar time course. However, neurons in the caudate nucleus encoded the difference in the temporally discounted values of the two alternative targets more reliably than neurons in the ventral striatum. In contrast, neurons in the ventral striatum largely encoded the sum of the temporally discounted values, and therefore, the overall goodness of available options. These results suggest a more pivotal role for the dorsal striatum in action selection during intertemporal choice.
Collapse
Affiliation(s)
- Xinying Cai
- Department of Neurobiology, Yale University School of Medicine, New Haven, CT 06510, USA
| | | | | |
Collapse
|
31
|
Matell MS, Shea-Brown E, Gooch C, Wilson AG, Rinzel J. A heterogeneous population code for elapsed time in rat medial agranular cortex. Behav Neurosci 2011; 125:54-73. [PMID: 21319888 PMCID: PMC3078766 DOI: 10.1037/a0021954] [Citation(s) in RCA: 41] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
The neural mechanisms underlying the temporal control of behavior are largely unknown. Here we recorded from medial agranular cortex neurons in rats while they freely behaved in a temporal production task, the peak-interval procedure. Due to variability in estimating the time of food availability, robust responding typically bracketed the expected duration, starting some time before and ending some time after the signaled delay. These response periods provided analytic "steady state" windows during which subjects actively indicated their temporal expectation of food availability. Remarkably, during these response periods, a variety of firing patterns were seen that could be broadly described as ramps, peaks, and dips, with different slopes, directions, and times at which maxima or minima occur. Regularized linear discriminant analysis indicated that these patterns provided sufficiently reliable information to discriminate the elapsed duration of responding within these response periods. Modeling this across neuron variability showed that the utilization of ramps, dips, and peaks, with different slopes and minimal/maximal rates at different times, led to a substantial improvement in temporal prediction errors, suggesting that heterogeneity in the neural representation of elapsed time may facilitate temporally controlled behavior.
Collapse
Affiliation(s)
- Matthew S Matell
- Department of Psychology, Villanova University, 800 Lancaster Ave., Villanova, PA 19085, USA.
| | | | | | | | | |
Collapse
|
32
|
Nakahara H, Kaveri S. Internal-time temporal difference model for neural value-based decision making. Neural Comput 2010; 22:3062-106. [PMID: 20858126 DOI: 10.1162/neco_a_00049] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]
Abstract
The temporal difference (TD) learning framework is a major paradigm for understanding value-based decision making and related neural activities (e.g., dopamine activity). The representation of time in neural processes modeled by a TD framework, however, is poorly understood. To address this issue, we propose a TD formulation that separates the time of the operator (neural valuation processes), which we refer to as internal time, from the time of the observer (experiment), which we refer to as conventional time. We provide the formulation and theoretical characteristics of this TD model based on internal time, called internal-time TD, and explore the possible consequences of the use of this model in neural value-based decision making. Due to the separation of the two times, internal-time TD computations, such as TD error, are expressed differently, depending on both the time frame and time unit. We examine this operator-observer problem in relation to the time representation used in previous TD models. An internal time TD value function exhibits the co-appearance of exponential and hyperbolic discounting at different delays in intertemporal choice tasks. We further examine the effects of internal time noise on TD error, the dynamic construction of internal time, and the modulation of internal time with the internal time hypothesis of serotonin function. We also relate the internal TD formulation to research on interval timing and subjective time.
Collapse
Affiliation(s)
- Hiroyuki Nakahara
- Laboratory for Integrated Theoretical Neuroscience, RIKEN Brain Science Institute,Wako, Saitama, 351-0198 Japan.
| | | |
Collapse
|
33
|
Campos M, Breznen B, Andersen RA. A neural representation of sequential states within an instructed task. J Neurophysiol 2010; 104:2831-49. [PMID: 20739594 DOI: 10.1152/jn.01124.2009] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
In the study of the neural basis of sensorimotor transformations, it has become clear that the brain does not always wait to sense external events and afterward select the appropriate responses. If there are predictable regularities in the environment, the brain begins to anticipate the timing of instructional cues and the signals to execute a response, revealing an internal representation of the sequential behavioral states of the task being performed. To investigate neural mechanisms that could represent the sequential states of a task, we recorded neural activity from two oculomotor structures implicated in behavioral timing--the supplementary eye fields (SEF) and the lateral intraparietal area (LIP)--while rhesus monkeys performed a memory-guided saccade task. The neurons of the SEF were found to collectively encode the progression of the task with individual neurons predicting and/or detecting states or transitions between states. LIP neurons, while also encoding information about the current temporal interval, were limited with respect to SEF neurons in two ways. First, LIP neurons tended to be active when the monkey was planning a saccade but not in the precue or intertrial intervals, whereas SEF neurons tended to have activity modulation in all intervals. Second, the LIP neurons were more likely to be spatially tuned than SEF neurons. SEF neurons also show anticipatory activity. The state-selective and anticipatory responses of SEF neurons support two complementary models of behavioral timing, state dependent and accumulator models, and suggest that each model describes a contribution SEF makes to timing at different temporal resolutions.
Collapse
Affiliation(s)
- Michael Campos
- Computation and Neural Systems, Division of Biology, California Institute of Technology, Pasadena, California, USA.
| | | | | |
Collapse
|
34
|
Abstract
A key question in the neurobiology of reward relates to the nature of coding. Rewards are objects that are advantageous or necessary for the survival of individuals in a variety of environmental situations. Thus reward appears to depend on the individual and its environment. The question arises whether neuronal systems in humans and monkeys code reward in subjective terms, objective terms or both. The present review addresses this issue by dealing with two important reward processes, namely the individual discounting of reward value across temporal delays, and the processing of information about risky rewards that depends on individual risk attitudes. The subjective value of rewards decreases with the temporal distance to the reward. In experiments using neurophysiology and brain imaging, dopamine neurons and striatal systems discount reward value across temporal delays of a few seconds, despite unchanged objective reward value, suggesting subjective value coding. The subjective values of risky outcomes depend on the risk attitude of individual decision makers; these values decrease for risk-avoiders and increase for risk-seekers. The signal for risk and the signal for the value of risky reward covary with individual risk attitudes in regions of the human prefrontal cortex, suggesting subjective rather than objective coding of risk and risky value. These data demonstrate that important parameters of reward are coded in a subjective manner in key reward structures of the brain. However, these data do not rule out that other neurons or brain structures may code reward according to its objective value and risk.
Collapse
Affiliation(s)
- Wolfram Schultz
- Department of Physiology, Development and Neuroscience, University of Cambridge, Downing Street, Cambridge CB2 3DY, UK.
| |
Collapse
|
35
|
A role for the ventral premotor cortex beyond performance monitoring. Proc Natl Acad Sci U S A 2009; 106:18815-9. [PMID: 19846763 DOI: 10.1073/pnas.0910524106] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Depending on the circumstances, decision making requires either comparing current sensory information with that showed recently or with that recovered from long-term memory (LTM). In both cases, to learn from past decisions and adapt future ones, memories and outcomes have to be available after the report of a decision. The ventral premotor cortex (PMv) is a good candidate for integrating memory traces and outcomes because it is involved in working-memory, decision-making, and encoding the outcomes. To test this hypothesis we recorded the extracellular unit activity while monkeys performed 2 variants of a visual discrimination task. In one task, the decision was based on the comparison of the orientation of a current stimulus with that of another stimulus recently shown. In the other task, the monkeys had to compare the current orientation of the stimulus with the correct one retrieved from LTM. Here, we report that when the task required retrieval of the stimulus and its use in the following trials, the neurons continue encoding this internal representation together with the outcomes after the monkey has emitted the motor response. However, this codification did not occur when the stimulus was shown recently and updated every trial. These results suggest that the PMv activity represents the information needed to evaluate the consequences of a decision. We interpret these results as evidence that the PMv plays a role in evaluating the outcomes that can serve to learn and thus adapt future decision to environmental demands.
Collapse
|
36
|
Abstract
Several widely accepted models of decision making suggest that, during simple decision tasks, neural activity builds up until a threshold is reached and a decision is made. These models explain error rates and reaction time distributions in a variety of tasks and are supported by neurophysiological studies showing that neural activity in several cortical and subcortical regions gradually builds up at a rate related to task difficulty and reaches a relatively constant level of discharge at a time that predicts movement initiation. The mechanism responsible for this buildup is believed to be related to the temporal integration of sequential samples of sensory information. However, an alternative mechanism that may explain the neural and behavioral data is one in which the buildup of activity is instead attributable to a growing signal related to the urgency to respond, which multiplicatively modulates updated estimates of sensory evidence. These models are difficult to distinguish when, as in previous studies, subjects are presented with constant sensory evidence throughout each trial. To distinguish the models, we presented human subjects with a task in which evidence changed over the course of each trial. Our results are more consistent with "urgency gating" than with temporal integration of sensory samples and suggest a simple mechanism for implementing trade-offs between the speed and accuracy of decisions.
Collapse
|
37
|
Hwang J, Kim S, Lee D. Temporal discounting and inter-temporal choice in rhesus monkeys. Front Behav Neurosci 2009; 3:9. [PMID: 19562091 PMCID: PMC2701682 DOI: 10.3389/neuro.08.009.2009] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2009] [Accepted: 06/01/2009] [Indexed: 11/26/2022] Open
Abstract
Humans and animals are more likely to take an action leading to an immediate reward than actions with delayed rewards of similar magnitudes. Although such devaluation of delayed rewards has been almost universally described by hyperbolic discount functions, the rate of this temporal discounting varies substantially among different animal species. This might be in part due to the differences in how the information about reward is presented to decision makers. In previous animal studies, reward delays or magnitudes were gradually adjusted across trials, so the animals learned the properties of future rewards from the rewards they waited for and consumed previously. In contrast, verbal cues have been used commonly in human studies. In the present study, rhesus monkeys were trained in a novel inter-temporal choice task in which the magnitude and delay of reward were indicated symbolically using visual cues and varied randomly across trials. We found that monkeys could extract the information about reward delays from visual symbols regardless of the number of symbols used to indicate the delay. The rate of temporal discounting observed in the present study was comparable to the previous estimates in other mammals, and the animal's choice behavior was largely consistent with hyperbolic discounting. Our results also suggest that the rate of temporal discounting might be influenced by contextual factors, such as the novelty of the task. The flexibility furnished by this new inter-temporal choice task might be useful for future neurobiological investigations on inter-temporal choice in non-human primates.
Collapse
Affiliation(s)
- Jaewon Hwang
- Department of Brain and Cognitive Sciences, University of Rochester Rochester, NY, USA
| | | | | |
Collapse
|
38
|
Walsh ND, Phillips ML. Interacting outcome retrieval, anticipation, and feedback processes in the human brain. ACTA ACUST UNITED AC 2009; 20:271-81. [PMID: 19429861 DOI: 10.1093/cercor/bhp098] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]
Abstract
Cognitive control is an inherently multivariate phenomenon, and its neural basis is currently unclear. Here we examined using functional magnetic resonance imaging how participants retrieve prelearnt information from memory, use this information to guide responses for an impending decision, and adjust their responses based on outcome feedback. We developed a behavioral task designed to manipulate memory outcome-retrieval load, outcome-anticipation interval, and outcome-feedback processes. This allowed us to understand the neural basis of these cognitive processes in isolation and how they interact. Extending previous work, we found a retrieval-load by outcome-feedback interaction in the left globus pallidus; an outcome-feedback by anticipation-interval interaction in the inferior prefrontal cortex; a retrieval-load by anticipation-interval interaction in the midcingulate gyrus and a load by interval by outcome interaction in the right frontal pole. These results further advance our knowledge of how fundamental cognitive processes interact physiologically to give rise to higher-level behavioral control.
Collapse
Affiliation(s)
- Nicholas D Walsh
- Department of Psychiatry, University of Pittsburgh, Pittsburgh, PA, USA.
| | | |
Collapse
|
39
|
Monkey supplementary eye field neurons signal the ordinal position of both actions and objects. J Neurosci 2009; 29:591-9. [PMID: 19158286 DOI: 10.1523/jneurosci.4803-08.2009] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
When a monkey executes a learned series of eye movements (for example, rightward followed by upward followed by leftward), neurons in the supplementary eye field (SEF) fire differentially in conjunction with the first, second, and third movements. It has not been clear whether such ordinal position signals are truly general, accompanying all forms of sequential behavior, or accompany only learned sequences of movements. To resolve this issue, we trained monkeys to perform both a serial action task (making saccades in a fixed sequence of directions) and a serial object task (making saccades to a fixed sequence of objects). We found concordant ordinal position selectivity in the two tasks. Neuronal selectivity for the passage of time and expectation of reward could not explain such concordance. We conclude that SEF neurons signal ordinal position consistently across different task contexts. These signals presumably underlie the ability of primates including humans to perform a broad range of serial order tasks.
Collapse
|
40
|
Kim S, Hwang J, Seo H, Lee D. Valuation of uncertain and delayed rewards in primate prefrontal cortex. Neural Netw 2009; 22:294-304. [PMID: 19375276 DOI: 10.1016/j.neunet.2009.03.010] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2008] [Revised: 03/09/2009] [Accepted: 03/21/2009] [Indexed: 11/26/2022]
Abstract
Humans and animals often must choose between rewards that differ in their qualities, magnitudes, immediacy, and likelihood, and must estimate these multiple reward parameters from their experience. However, the neural basis for such complex decision making is not well understood. To understand the role of the primate prefrontal cortex in determining the subjective value of delayed or uncertain reward, we examined the activity of individual prefrontal neurons during an inter-temporal choice task and a computer-simulated competitive game. Consistent with the findings from previous studies in humans and other animals, the monkey's behaviors during inter-temporal choice were well accounted for by a hyperbolic discount function. In addition, the activity of many neurons in the lateral prefrontal cortex reflected the signals related to the magnitude and delay of the reward expected from a particular action, and often encoded the difference in temporally discounted values that predicted the animal's choice. During a computerized matching pennies game, the animals approximated the optimal strategy, known as Nash equilibrium, using a reinforcement learning algorithm. We also found that many neurons in the lateral prefrontal cortex conveyed the signals related to the animal's previous choices and their outcomes, suggesting that this cortical area might play an important role in forming associations between actions and their outcomes. These results show that the primate lateral prefrontal cortex plays a central role in estimating the values of alternative actions based on multiple sources of information.
Collapse
Affiliation(s)
- Soyoun Kim
- Department of Neurobiology, Yale University School of Medicine, New Haven, CT 06510, USA
| | | | | | | |
Collapse
|
41
|
Gregorios-Pippas L, Tobler PN, Schultz W. Short-term temporal discounting of reward value in human ventral striatum. J Neurophysiol 2009; 101:1507-23. [PMID: 19164109 PMCID: PMC2666398 DOI: 10.1152/jn.90730.2008] [Citation(s) in RCA: 78] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
Delayed rewards lose their value for economic decisions and constitute weaker reinforcers for learning. Temporal discounting of reward value already occurs within a few seconds in animals, which allows investigations of the underlying neurophysiological mechanisms. However, it is difficult to relate these mechanisms to human discounting behavior, which is usually studied over days and months and may engage different brain processes. Our study aimed to bridge the gap by using very short delays and measuring human functional magnetic resonance responses in one of the key reward centers of the brain, the ventral striatum. We used psychometric methods to assess subjective timing and valuation of monetary rewards with delays of 4.0-13.5 s. We demonstrated hyperbolic and exponential decreases of striatal responses to reward predicting stimuli within this time range, irrespective of changes in reward rate. Lower reward magnitudes induced steeper behavioral and striatal discounting. By contrast, striatal responses following the delivery of reward reflected the uncertainty in subjective timing associated with delayed rewards rather than value discounting. These data suggest that delays of a few seconds affect the neural processing of predicted reward value in the ventral striatum and engage the temporal sensitivity of reward responses. Comparisons with electrophysiological animal data suggest that ventral striatal reward discounting may involve dopaminergic and orbitofrontal inputs.
Collapse
Affiliation(s)
- Lucy Gregorios-Pippas
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge, UK
| | | | | |
Collapse
|
42
|
Minamimoto T, La Camera G, Richmond BJ. Measuring and modeling the interaction among reward size, delay to reward, and satiation level on motivation in monkeys. J Neurophysiol 2009; 101:437-47. [PMID: 18987119 PMCID: PMC2637024 DOI: 10.1152/jn.90959.2008] [Citation(s) in RCA: 53] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2008] [Accepted: 11/03/2008] [Indexed: 11/22/2022] Open
Abstract
Motivation is usually inferred from the likelihood or the intensity with which behavior is carried out. It is sensitive to external factors (e.g., the identity, amount, and timing of a rewarding outcome) and internal factors (e.g., hunger or thirst). We trained macaque monkeys to perform a nonchoice instrumental task (a sequential red-green color discrimination) while manipulating two external factors: reward size and delay-to-reward. We also inferred the state of one internal factor, level of satiation, by monitoring the accumulated reward. A visual cue indicated the forthcoming reward size and delay-to-reward in each trial. The fraction of trials completed correctly by the monkeys increased linearly with reward size and was hyperbolically discounted by delay-to-reward duration, relations that are similar to those found in free operant and choice tasks. The fraction of correct trials also decreased progressively as a function of the satiation level. Similar (albeit noiser) relations were obtained for reaction times. The combined effect of reward size, delay-to-reward, and satiation level on the proportion of correct trials is well described as a multiplication of the effects of the single factors when each factor is examined alone. These results provide a quantitative account of the interaction of external and internal factors on instrumental behavior, and allow us to extend the concept of subjective value of a rewarding outcome, usually confined to external factors, to account also for slow changes in the internal drive of the subject.
Collapse
Affiliation(s)
- Takafumi Minamimoto
- Laboratory of Neuropsychology, National Institute of Mental Health, National Institute Health, Department of Health and Human Services, Bethesda, MD 20892-4415, USA
| | | | | |
Collapse
|
43
|
Kim S, Hwang J, Lee D. Prefrontal coding of temporally discounted values during intertemporal choice. Neuron 2008; 59:161-72. [PMID: 18614037 DOI: 10.1016/j.neuron.2008.05.010] [Citation(s) in RCA: 180] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2008] [Revised: 03/24/2008] [Accepted: 05/12/2008] [Indexed: 11/29/2022]
Abstract
Reward from a particular action is seldom immediate, and the influence of such delayed outcome on choice decreases with delay. It has been postulated that when faced with immediate and delayed rewards, decision makers choose the option with maximum temporally discounted value. We examined the preference of monkeys for delayed reward in an intertemporal choice task and the neural basis for real-time computation of temporally discounted values in the dorsolateral prefrontal cortex. During this task, the locations of the targets associated with small or large rewards and their corresponding delays were randomly varied. We found that prefrontal neurons often encoded the temporally discounted value of reward expected from a particular option. Furthermore, activity tended to increase with [corrected] discounted values for targets [corrected] presented in the neuron's preferred direction, suggesting that activity related to temporally discounted values in the prefrontal cortex might determine the animal's behavior during intertemporal choice.
Collapse
Affiliation(s)
- Soyoun Kim
- Department of Neurobiology, Yale University School of Medicine, New Haven, CT 06510, USA
| | | | | |
Collapse
|
44
|
Rezvani S, Corneil BD. Recruitment of a head-turning synergy by low-frequency activity in the primate superior colliculus. J Neurophysiol 2008; 100:397-411. [PMID: 18497351 DOI: 10.1152/jn.90223.2008] [Citation(s) in RCA: 49] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
Low-frequency activity within the oculomotor system helps bridge sensation and action. Given ocular stability, low-frequency activity sustained by some neurons within the intermediate and deep superior colliculus (dSC) is assumed to be separated from motor output. However, the dSC is an orienting structure and the influence of low-frequency dSC activity at other effectors remains untested. We studied this by simultaneously recording activity from saccade-related dSC neurons and electromyographic (EMG) activity from neck muscles that turn the head. Monkeys performed a gap-saccade paradigm with varying levels of reward expectancy. Despite head restraint and even for relatively small target eccentricities (<or=10 degrees ), increasing reward expectancy for a given target increased the level of low-frequency activity on dSC neurons encoding saccades to the rewarded target and increased the recruitment of a neck muscle synergy that would turn the head toward the target. The magnitude of neck muscle recruitment correlated positively on a trial-by-trial basis with the level of low-frequency dSC activity, and such correlations were optimized when neck muscle activity was shifted about 20 ms later to account for delays in the tecto-reticulo-spinal pathway. Further, dSC activity discriminated about the side of target presentation approximately 11 ms earlier than neck EMG activity. Considered alongside neck EMG responses evoked causally by SC stimulation, our results are consistent with low-frequency dSC activity recruiting a head-turning synergy. Our results support a brain stem circuit wherein the magnitude of neck muscle recruitment reflects the difference in comparative low-frequency activation across both dSCs, perhaps because of mutually inhibitory interactions within downstream head premotor circuits.
Collapse
Affiliation(s)
- Sam Rezvani
- Canadian Institutes of Health Research Group in Action and Perception, London, Ontario, Canada
| | | |
Collapse
|
45
|
Ichihara-Takeda S, Funahashi S. Activity of primate orbitofrontal and dorsolateral prefrontal neurons: effect of reward schedule on task-related activity. J Cogn Neurosci 2008; 20:563-79. [PMID: 18052781 DOI: 10.1162/jocn.2008.20047] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]
Abstract
Recent studies show that task-related activity in the dorsolateral prefrontal cortex (DLPFC) is modulated by the quality and quantity of the reward, suggesting that the subject's motivational state affects cognitive operations in the DLPFC. The orbito-frontal cortex (OFC) is a possible source of motivational inputs to the DLPFC. However, it is not well known whether these two areas exhibit similar motivational effects on task-related activity. We compared motivational effects on task-related activity in these areas while a monkey performed an oculomotor delayed-response (ODR) task under two reward schedules. In the ODR-1 schedule, reward was given only after the successful completion of four consecutive trials, whereas in the ODR-2 schedule, reward was given after every correct trial. Task-related activities in both areas showed spatial selectivity. The spatial characteristics of task-related activity remained constant in both schedules. Task-related activity in both areas, especially delay-period activity, was also affected by the reward schedule and the magnitude of the activity gradually increased depending on the proximity of the reward trial in the ODR-1 schedule. More task-related OFC activities were affected by reward schedules, whereas more task-related DLPFC activities were affected by spatial factors and reward schedules. These results indicate that the OFC plays a role in monitoring the proximity of the reward trial and detecting reward delivery, whereas the DLPFC plays a role in performing cognitive operations and integrating cognitive and motivational information. These results also indicate that spatial information and the animal's motivational state independently affect neuronal activity in both areas.
Collapse
|
46
|
Kalenscher T, Pennartz CM. Is a bird in the hand worth two in the future? The neuroeconomics of intertemporal decision-making. Prog Neurobiol 2008; 84:284-315. [DOI: 10.1016/j.pneurobio.2007.11.004] [Citation(s) in RCA: 114] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2007] [Revised: 11/28/2007] [Accepted: 11/29/2007] [Indexed: 10/22/2022]
|
47
|
Abstract
The current view of brain organization supports the notion that there is a considerable degree of functional specialization and that many regions can be conceptualized as either 'affective' or 'cognitive'. Popular examples are the amygdala in the domain of emotion and the lateral prefrontal cortex in the case of cognition. This prevalent view is problematic for a number of reasons. Here, I will argue that complex cognitive-emotional behaviours have their basis in dynamic coalitions of networks of brain areas, none of which should be conceptualized as specifically affective or cognitive. Central to cognitive-emotional interactions are brain areas with a high degree of connectivity, called hubs, which are critical for regulating the flow and integration of information between regions.
Collapse
Affiliation(s)
- Luiz Pessoa
- Department of Psychological and Brain Sciences, and Programs in Neuroscience and Cognitive Science, Indiana University, Bloomington, Indiana 47405, USA.
| |
Collapse
|
48
|
Order-dependent modulation of directional signals in the supplementary and presupplementary motor areas. J Neurosci 2008; 27:13655-66. [PMID: 18077677 DOI: 10.1523/jneurosci.2982-07.2007] [Citation(s) in RCA: 57] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
To maximize reward and minimize effort, animals must often execute multiple movements in a timely and orderly manner. Such movement sequences must be usually discovered through experience, and during this process, signals related to the animal's action, its ordinal position in the sequence, and subsequent reward need to be properly integrated. To investigate the role of the primate medial frontal cortex in planning and controlling multiple movements, monkeys were trained to produce a series of hand movements instructed by visual stimuli. We manipulated the number of movements in a sequence across trials, making it possible to dissociate the effects of the ordinal position of a given movement and the number of remaining movements necessary to obtain reward. Neurons in the supplementary and presupplementary motor areas modulated their activity according to the number of remaining movements, more often than in relation to the ordinal position, suggesting that they might encode signals related to the timing of reward or its temporally discounted value. In both cortical areas, signals related to the number of remaining movements and those related to movement direction were often combined multiplicatively, suggesting that the gain of the signals related to movements might be modulated by motivational factors. Finally, compared with the supplementary motor area, neurons in the presupplementary motor area were more likely to increase their activity when the number of remaining movements is large. These results suggest that these two areas might play complementary roles in controlling movement sequences.
Collapse
|
49
|
Padoa-Schioppa C, Assad JA. The representation of economic value in the orbitofrontal cortex is invariant for changes of menu. Nat Neurosci 2007; 11:95-102. [PMID: 18066060 DOI: 10.1038/nn2020] [Citation(s) in RCA: 238] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2007] [Accepted: 11/14/2007] [Indexed: 11/09/2022]
Abstract
Economic choice entails assigning values to the available options and is impaired by lesions to the orbitofrontal cortex (OFC). Recent results show that some neurons in the OFC encode the values that monkeys (Macaca mulatta) assign to different goods when they choose between them. A broad and fundamental question is how this neuronal representation of value depends on the behavioral context. Here we show that neuronal responses in the OFC are typically invariant for changes of menu. In other words, the activity of a neuron in response to one particular good usually does not depend on what other goods are available at the same time. Neurons in the OFC encode economic value, not relative preference. The fact that their responses are menu invariant suggests that transitivity, a fundamental trait of economic choice, may be rooted in the activity of individual neurons.
Collapse
|
50
|
Ohmae S, Lu X, Takahashi T, Uchida Y, Kitazawa S. Neuronal activity related to anticipated and elapsed time in macaque supplementary eye field. Exp Brain Res 2007; 184:593-8. [PMID: 18064442 DOI: 10.1007/s00221-007-1234-3] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2007] [Accepted: 11/21/2007] [Indexed: 10/22/2022]
Abstract
It is essential to sense anticipated and elapsed time in our daily life. Several areas of the brain including parietal cortex, prefrontal cortex, basal ganglia and olivo-cerebellar system are known to be related to this temporal processing. We now describe a number of cells in the supplementary eye field (SEF) with phasic, delay activity and postdelay activity modulation that varied with the length of the delay period. This variation occurred in two manners. First, cells became active with the shorter delay periods (GO signal presented earlier). We call these cells "short-delay cells". Second, cells became active with the longer delay periods (GO signal presented later). We call these cells "long-delay cells". However, such changed neuronal activity did not correlate with reaction time. These results suggest that the delay-dependent activity may reflect anticipated and elapsed time during performance of a delayed saccadic eye movement.
Collapse
Affiliation(s)
- Shogo Ohmae
- Department of Neurophysiology, Juntendo University, School of Medicine, 2-1-1 Hongo, Tokyo 113-8421, Japan
| | | | | | | | | |
Collapse
|