1. Feng YY, Bromberg-Martin ES, Monosov IE. Dorsal raphe neurons integrate the values of reward amount, delay, and uncertainty in multi-attribute decision-making. Cell Rep 2024; 43:114341. [PMID: 38878290 DOI: 10.1016/j.celrep.2024.114341] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/09/2023] [Revised: 03/27/2024] [Accepted: 05/23/2024] [Indexed: 06/25/2024] Open
Abstract
The dorsal raphe nucleus (DRN) is implicated in psychiatric disorders that feature impaired sensitivity to reward amount, impulsivity when facing reward delays, and risk-seeking when confronting reward uncertainty. However, it has been unclear whether and how DRN neurons signal reward amount, reward delay, and reward uncertainty during multi-attribute value-based decision-making, where subjects consider these attributes to make a choice. We recorded DRN neurons as monkeys chose between offers whose attributes, namely expected reward amount, reward delay, and reward uncertainty, varied independently. Many DRN neurons signaled offer attributes, and this population tended to integrate the attributes in a manner that reflected monkeys' preferences for amount, delay, and uncertainty. After decision-making, in response to post-decision feedback, these same neurons signaled signed reward prediction errors, suggesting a broader role in tracking value across task epochs and behavioral contexts. Our data illustrate how the DRN participates in value computations, guiding theories about the role of the DRN in decision-making and psychiatric disease.
Affiliation(s)
- Yang-Yang Feng
- Department of Neuroscience, Washington University School of Medicine, St. Louis, MO, USA; Department of Biomedical Engineering, Washington University, St. Louis, MO, USA
- Ilya E Monosov
- Department of Neuroscience, Washington University School of Medicine, St. Louis, MO, USA; Department of Biomedical Engineering, Washington University, St. Louis, MO, USA; Washington University Pain Center, Washington University, St. Louis, MO, USA; Department of Neurosurgery, Washington University, St. Louis, MO, USA; Department of Electrical Engineering, Washington University, St. Louis, MO, USA.
2. Yang L, Fang LZ, Lynch MR, Xu CS, Hahm H, Zhang Y, Heitmeier MR, Costa V, Samineni VK, Creed MC. Transcriptomic landscape of mammalian ventral pallidum at single-cell resolution. bioRxiv 2024:2024.05.24.595793. [PMID: 38826431 PMCID: PMC11142225 DOI: 10.1101/2024.05.24.595793] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/04/2024]
Abstract
The ventral pallidum (VP) is critical for motivated behaviors. While contemporary work has begun to elucidate the functional diversity of VP neurons, the molecular heterogeneity underlying this functional diversity remains incompletely understood. We used snRNA-seq and in situ hybridization to define the transcriptional taxonomy of VP cell types in mice, macaques, and baboons. We found transcriptional conservation between all three species, within the broader neurochemical cell types. Unique dopaminoceptive and cholinergic subclusters were identified and conserved across both primate species but had no homolog in mice. This harmonized consensus VP cellular atlas will pave the way for understanding the structure and function of the VP, and it identifies key neuropeptides, neurotransmitters, and neuroreceptors that could be targeted within specific VP cell types for functional investigations.
3. Lowet AS, Zheng Q, Meng M, Matias S, Drugowitsch J, Uchida N. An opponent striatal circuit for distributional reinforcement learning. bioRxiv 2024:2024.01.02.573966. [PMID: 38260354 PMCID: PMC10802299 DOI: 10.1101/2024.01.02.573966] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 01/24/2024]
Abstract
Machine learning research has achieved large performance gains on a wide range of tasks by expanding the learning target from mean rewards to entire probability distributions of rewards - an approach known as distributional reinforcement learning (RL) [1]. The mesolimbic dopamine system is thought to underlie RL in the mammalian brain by updating a representation of mean value in the striatum [2,3], but little is known about whether, where, and how neurons in this circuit encode information about higher-order moments of reward distributions [4]. To fill this gap, we used high-density probes (Neuropixels) to acutely record striatal activity from well-trained, water-restricted mice performing a classical conditioning task in which reward mean, reward variance, and stimulus identity were independently manipulated. In contrast to traditional RL accounts, we found robust evidence for abstract encoding of variance in the striatum. Remarkably, chronic ablation of dopamine inputs disorganized these distributional representations in the striatum without interfering with mean value coding. Two-photon calcium imaging and optogenetics revealed that the two major classes of striatal medium spiny neurons - D1 and D2 MSNs - contributed to this code by preferentially encoding the right and left tails of the reward distribution, respectively. We synthesize these findings into a new model of the striatum and mesolimbic dopamine that harnesses the opponency between D1 and D2 MSNs [5-15] to reap the computational benefits of distributional RL.
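As a rough illustration of how a learner can come to represent different tails of a reward distribution, the sketch below uses a generic device from the distributional RL literature: a value update with separate learning rates for positive and negative prediction errors. It is not the model proposed in this preprint; the function name `learn_value`, the learning rates, and the two-outcome reward are all assumptions chosen for illustration.

```python
import random


def learn_value(rewards, alpha_pos, alpha_neg, v0=0.0):
    """Delta-rule value learning with asymmetric learning rates.

    alpha_pos scales updates after positive prediction errors and
    alpha_neg scales updates after negative ones (both hypothetical).
    """
    v = v0
    for r in rewards:
        delta = r - v  # reward prediction error
        v += (alpha_pos if delta > 0 else alpha_neg) * delta
    return v


# Illustrative reward distribution: 0 or 10 with equal probability (mean 5).
rng = random.Random(0)
rewards = [rng.choice([0.0, 10.0]) for _ in range(20000)]

# An "optimistic" learner weights positive errors more and settles toward
# the right tail; a "pessimistic" learner settles toward the left tail.
optimistic = learn_value(rewards, alpha_pos=0.016, alpha_neg=0.004)
neutral = learn_value(rewards, alpha_pos=0.010, alpha_neg=0.010)
pessimistic = learn_value(rewards, alpha_pos=0.004, alpha_neg=0.016)

print(pessimistic, neutral, optimistic)
```

With symmetric rates the estimate hovers near the mean, while the asymmetric learners settle above or below it, so a population with mixed asymmetries carries information about the spread of rewards and not only their average.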
Affiliation(s)
- Adam S Lowet
- Center for Brain Science, Harvard University, Cambridge, MA, USA
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, MA, USA
- Program in Neuroscience, Harvard University, Boston, MA, USA
- Qiao Zheng
- Center for Brain Science, Harvard University, Cambridge, MA, USA
- Department of Neurobiology, Harvard Medical School, Boston, MA, USA
- Melissa Meng
- Center for Brain Science, Harvard University, Cambridge, MA, USA
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, MA, USA
- Sara Matias
- Center for Brain Science, Harvard University, Cambridge, MA, USA
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, MA, USA
- Jan Drugowitsch
- Center for Brain Science, Harvard University, Cambridge, MA, USA
- Department of Neurobiology, Harvard Medical School, Boston, MA, USA
- Naoshige Uchida
- Center for Brain Science, Harvard University, Cambridge, MA, USA
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, MA, USA
4. Bromberg-Martin ES, Feng YY, Ogasawara T, White JK, Zhang K, Monosov IE. A neural mechanism for conserved value computations integrating information and rewards. Nat Neurosci 2024; 27:159-175. [PMID: 38177339 PMCID: PMC10774124 DOI: 10.1038/s41593-023-01511-4] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 10/14/2022] [Accepted: 10/30/2023] [Indexed: 01/06/2024]
Abstract
Behavioral and economic theory dictate that we decide between options based on their values. However, humans and animals eagerly seek information about uncertain future rewards, even when this does not provide any objective value. This implies that decisions are made by endowing information with subjective value and integrating it with the value of extrinsic rewards, but the mechanism is unknown. Here, we show that human and monkey value judgements obey strikingly conserved computational principles during multi-attribute decisions trading off information and extrinsic reward. We then identify a neural substrate in a highly conserved ancient structure, the lateral habenula (LHb). LHb neurons signal subjective value, integrating information's value with extrinsic rewards, and the LHb predicts and causally influences ongoing decisions. Neurons in key input areas to the LHb largely signal components of these computations, not integrated value signals. Thus, our data uncover neural mechanisms of conserved computations underlying decisions to seek information about the future.
Affiliation(s)
- Yang-Yang Feng
- Department of Neuroscience, Washington University School of Medicine, St. Louis, MO, USA
- Department of Biomedical Engineering, Washington University, St. Louis, MO, USA
- Takaya Ogasawara
- Department of Neuroscience, Washington University School of Medicine, St. Louis, MO, USA
- J Kael White
- Department of Neuroscience, Washington University School of Medicine, St. Louis, MO, USA
- Kaining Zhang
- Department of Neuroscience, Washington University School of Medicine, St. Louis, MO, USA
- Department of Biomedical Engineering, Washington University, St. Louis, MO, USA
- Ilya E Monosov
- Department of Neuroscience, Washington University School of Medicine, St. Louis, MO, USA.
- Department of Biomedical Engineering, Washington University, St. Louis, MO, USA.
- Department of Neurosurgery, Washington University School of Medicine, St. Louis, MO, USA.
- Department of Electrical Engineering, Washington University, St. Louis, MO, USA.
- Pain Center, Washington University School of Medicine, St. Louis, MO, USA.
5. Li Y, Daddaoua N, Horan M, Foley NC, Gottlieb J. Uncertainty modulates visual maps during noninstrumental information demand. Nat Commun 2022; 13:5911. [PMID: 36207316 PMCID: PMC9547007 DOI: 10.1038/s41467-022-33585-2] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Received: 09/15/2021] [Accepted: 09/22/2022] [Indexed: 11/23/2022] Open
Abstract
Animals are intrinsically motivated to obtain information independently of instrumental incentives. This motivation depends on two factors: a desire to resolve uncertainty by gathering accurate information and a desire to obtain positively-valenced observations, which predict favorable rather than unfavorable outcomes. To understand the neural mechanisms, we recorded parietal cortical activity implicated in prioritizing stimuli for spatial attention and gaze, in a task in which monkeys were free (but not trained) to obtain information about probabilistic non-contingent rewards. We show that valence and uncertainty independently modulated parietal neuronal activity, and uncertainty but not reward-related enhancement consistently correlated with behavioral sensitivity. The findings suggest uncertainty-driven and valence-driven information demand depend on partially distinct pathways, with the former being consistently related to parietal responses and the latter depending on additional mechanisms implemented in downstream structures. Curiosity is motivated by uncertainty and valence, but how uncertainty and valence are encoded in the brain remains poorly understood. Here, the authors show that parietal neurons are enhanced by both factors, but that they specifically predict visual information seeking based on uncertainty.
Affiliation(s)
- Yvonne Li
- Department of Neuroscience, Columbia University, New York, NY, USA; Mortimer B. Zuckerman Mind Brain Behavior Institute, Columbia University, New York, NY, USA
- Nabil Daddaoua
- Department of Neuroscience, Columbia University, New York, NY, USA; Mortimer B. Zuckerman Mind Brain Behavior Institute, Columbia University, New York, NY, USA
- Mattias Horan
- Department of Neuroscience, Columbia University, New York, NY, USA
- Nicholas C Foley
- Department of Neuroscience, Columbia University, New York, NY, USA; Mortimer B. Zuckerman Mind Brain Behavior Institute, Columbia University, New York, NY, USA
- Jacqueline Gottlieb
- Department of Neuroscience, Columbia University, New York, NY, USA; Mortimer B. Zuckerman Mind Brain Behavior Institute, Columbia University, New York, NY, USA; Kavli Institute for Brain Science, Columbia University, New York, NY, USA
6. Zhang K, Bromberg-Martin ES, Sogukpinar F, Kocher K, Monosov IE. Surprise and recency in novelty detection in the primate brain. Curr Biol 2022; 32:2160-2173.e6. [DOI: 10.1016/j.cub.2022.03.064] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/10/2022] [Revised: 02/28/2022] [Accepted: 03/24/2022] [Indexed: 11/16/2022]
7. Monosov IE, Rushworth MFS. Interactions between ventrolateral prefrontal and anterior cingulate cortex during learning and behavioural change. Neuropsychopharmacology 2022; 47:196-210. [PMID: 34234288 PMCID: PMC8617208 DOI: 10.1038/s41386-021-01079-2] [Citation(s) in RCA: 23] [Impact Index Per Article: 11.5] [Received: 04/07/2021] [Revised: 05/27/2021] [Accepted: 06/15/2021] [Indexed: 02/06/2023]
Abstract
Hypotheses and beliefs guide credit assignment - the process of determining which previous events or actions caused an outcome. Adaptive hypothesis formation and testing are crucial in uncertain and changing environments in which associations and meanings are volatile. Despite primates' abilities to form and test hypotheses, establishing what is causally responsible for the occurrence of particular outcomes remains a fundamental challenge for credit assignment and learning. Hypotheses about what surprises are due to stochasticity inherent in an environment as opposed to real, systematic changes are necessary for identifying the environment's predictive features, but are often hard to test. We review evidence that two highly interconnected frontal cortical regions, anterior cingulate cortex and ventrolateral prefrontal area 47/12o, provide a biological substrate for linking two crucial components of hypothesis-formation and testing: the control of information seeking and credit assignment. Neuroimaging, targeted disruptions, and neurophysiological studies link an anterior cingulate - 47/12o circuit to generation of exploratory behaviour, non-instrumental information seeking, and interpretation of subsequent feedback in the service of credit assignment. Our observations support the idea that information seeking and credit assignment are linked at the level of neural circuits and explain why this circuit is important for ensuring behaviour is flexible and adaptive.
Affiliation(s)
- Ilya E Monosov
- Department of Neuroscience, Washington University School of Medicine, St. Louis, MO, USA.
- Department of Biomedical Engineering, Washington University, St. Louis, MO, USA.
- Department of Electrical Engineering, Washington University, St. Louis, MO, USA.
- Department of Neurosurgery, Washington University, St. Louis, MO, USA.
- Pain Center, Washington University, St. Louis, MO, USA.
- Matthew F S Rushworth
- Wellcome Centre for Integrative Neuroimaging (WIN), Department of Experimental Psychology, University of Oxford, Oxford, UK.
8. A primate temporal cortex-zona incerta pathway for novelty seeking. Nat Neurosci 2022; 25:50-60. [PMID: 34903880 DOI: 10.1038/s41593-021-00950-1] [Citation(s) in RCA: 28] [Impact Index Per Article: 14.0] [Received: 04/20/2021] [Accepted: 09/28/2021] [Indexed: 11/08/2022]
Abstract
Primates interact with the world by exploring visual objects; they seek opportunities to view novel objects even when these have no extrinsic reward value. How the brain controls this novelty seeking is unknown. Here we show that novelty seeking in monkeys is regulated by the zona incerta (ZI). As monkeys made eye movements to familiar objects to trigger an opportunity to view novel objects, many ZI neurons were preferentially activated by predictions of novel objects before the gaze shift. Low-intensity ZI stimulation facilitated gaze shifts, whereas ZI inactivation reduced novelty seeking. ZI-dependent novelty seeking was not regulated by neurons in the lateral habenula or by many dopamine neurons in the substantia nigra, traditionally associated with reward seeking. But the anterior ventral medial temporal cortex, an area important for object vision and memory, was a prominent source of novelty predictions. These data uncover a functional pathway in the primate brain that regulates novelty seeking.
9. Jezzini A, Bromberg-Martin ES, Trambaiolli LR, Haber SN, Monosov IE. A prefrontal network integrates preferences for advance information about uncertain rewards and punishments. Neuron 2021; 109:2339-2352.e5. [PMID: 34118190 PMCID: PMC8298287 DOI: 10.1016/j.neuron.2021.05.013] [Citation(s) in RCA: 31] [Impact Index Per Article: 10.3] [Received: 02/10/2021] [Revised: 03/30/2021] [Accepted: 05/10/2021] [Indexed: 02/06/2023]
Abstract
Humans and animals can be strongly motivated to seek information to resolve uncertainty about rewards and punishments. In particular, despite its clinical and societal relevance, very little is known about information seeking about punishments. We show that attitudes toward information about punishments and rewards are distinct and separable at both behavioral and neuronal levels. We demonstrate the existence of prefrontal neuronal populations that anticipate opportunities to gain information in a relatively valence-specific manner, separately anticipating information about either punishments or rewards. These neurons are located in anatomically interconnected subregions of anterior cingulate cortex (ACC) and ventrolateral prefrontal cortex (vlPFC) in area 12o/47. Unlike ACC, vlPFC also contains a population of neurons that integrate attitudes toward both reward and punishment information, to encode the overall preference for information in a bivalent manner. This cortical network is well suited to mediate information seeking by integrating the desire to resolve uncertainty about multiple, distinct motivational outcomes.
Affiliation(s)
- Ahmad Jezzini
- Department of Neuroscience, Washington University School of Medicine, St. Louis, MO 63110, USA
- Ethan S Bromberg-Martin
- Department of Neuroscience, Washington University School of Medicine, St. Louis, MO 63110, USA
- Lucas R Trambaiolli
- Basic Neuroscience, McLean Hospital, Harvard Medical School, Belmont, MA 02478, USA
- Suzanne N Haber
- Department of Pharmacology and Physiology, University of Rochester, Rochester, NY 14627, USA; Basic Neuroscience, McLean Hospital, Harvard Medical School, Belmont, MA 02478, USA
- Ilya E Monosov
- Department of Neuroscience, Washington University School of Medicine, St. Louis, MO 63110, USA; Department of Biomedical Engineering, Washington University, St. Louis, MO 63130, USA; Department of Electrical Engineering, Washington University, St. Louis, MO 63130, USA; Department of Neurosurgery School of Medicine, Washington University, St. Louis, MO 63110, USA; Pain Center, Washington University School of Medicine, St. Louis, MO 63110, USA.
10. Vachez YM, Tooley JR, Abiraman K, Matikainen-Ankney B, Casey E, Earnest T, Ramos LM, Silberberg H, Godynyuk E, Uddin O, Marconi L, Le Pichon CE, Creed MC. Ventral arkypallidal neurons inhibit accumbal firing to promote reward consumption. Nat Neurosci 2021; 24:379-390. [PMID: 33495635 PMCID: PMC7933121 DOI: 10.1038/s41593-020-00772-7] [Citation(s) in RCA: 28] [Impact Index Per Article: 9.3] [Received: 04/03/2020] [Accepted: 12/04/2020] [Indexed: 02/06/2023]
Abstract
The nucleus accumbens shell (NAcSh) and the ventral pallidum (VP) are critical for reward processing, although the question of how coordinated activity within these nuclei orchestrates reward valuation and consumption remains unclear. Inhibition of NAcSh firing is necessary for reward consumption, but the source of this inhibition remains unknown. Here, we report that a subpopulation of VP neurons, the ventral arkypallidal (vArky) neurons, project back to the NAcSh, where they inhibit NAcSh neurons in vivo in mice. Consistent with this pathway driving reward consumption via inhibition of the NAcSh, calcium activity of vArky neurons scaled with reward palatability (which was dissociable from reward seeking) and predicted the subsequent drinking behavior during a free-access paradigm. Activation of the VP-NAcSh pathway increased ongoing reward consumption while amplifying hedonic reactions to reward. These results establish a pivotal role for vArky neurons in the promotion of reward consumption through modulation of NAcSh firing in a value-dependent manner.
Affiliation(s)
- Yvan M. Vachez
- Department of Anesthesiology, Washington University Pain Center, Washington University School of Medicine, St. Louis, Missouri
- Jessica R. Tooley
- Department of Anesthesiology, Washington University Pain Center, Washington University School of Medicine, St. Louis, Missouri
- Kavitha Abiraman
- Department of Anesthesiology, Washington University Pain Center, Washington University School of Medicine, St. Louis, Missouri
- Eric Casey
- Department of Psychiatry, Washington University School of Medicine, St. Louis, Missouri
- Tom Earnest
- Department of Anesthesiology, Washington University Pain Center, Washington University School of Medicine, St. Louis, Missouri; Department of Psychiatry, Washington University School of Medicine, St. Louis, Missouri
- Leana M. Ramos
- Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, Bethesda, Maryland
- Hanna Silberberg
- Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, Bethesda, Maryland
- Elizabeth Godynyuk
- Department of Anesthesiology, Washington University Pain Center, Washington University School of Medicine, St. Louis, Missouri; Department of Psychiatry, Washington University School of Medicine, St. Louis, Missouri
- Olivia Uddin
- University of Maryland, Department of Anatomy and Neurobiology, Baltimore, Maryland
- Lauren Marconi
- University of Pennsylvania, Perelman School of Medicine, Philadelphia, Pennsylvania
- Claire E. Le Pichon
- Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, Bethesda, Maryland
- Meaghan C. Creed
- Department of Anesthesiology, Washington University Pain Center, Washington University School of Medicine, St. Louis, Missouri; Department of Psychiatry, Washington University School of Medicine, St. Louis, Missouri; Departments of Neuroscience and Biomedical Engineering, Washington University School of Medicine, St. Louis, Missouri. Correspondence: Meaghan C. Creed
11. Monosov IE, Haber SN, Leuthardt EC, Jezzini A. Anterior Cingulate Cortex and the Control of Dynamic Behavior in Primates. Curr Biol 2020; 30:R1442-R1454. [PMID: 33290716 PMCID: PMC8197026 DOI: 10.1016/j.cub.2020.10.009] [Citation(s) in RCA: 38] [Impact Index Per Article: 9.5] [Indexed: 12/15/2022]
Abstract
The brain mechanism for controlling continuous behavior in dynamic contexts must mediate action selection and learning across many timescales, responding differentially to the level of environmental uncertainty and volatility. In this review, we argue that a part of the frontal cortex known as the anterior cingulate cortex (ACC) is particularly well suited for this function. First, the ACC is interconnected with prefrontal, parietal, and subcortical regions involved in valuation and action selection. Second, the ACC integrates diverse, behaviorally relevant information across multiple timescales, producing output signals that temporally encapsulate decision and learning processes and encode high-dimensional information about the value and uncertainty of future outcomes and subsequent behaviors. Third, the ACC signals behaviorally relevant information flexibly, displaying the capacity to represent information about current and future states in a valence-, context-, task- and action-specific manner. Fourth, the ACC dynamically controls instrumental- and non-instrumental information seeking behaviors to resolve uncertainty about future outcomes. We review electrophysiological and circuit disruption studies in primates to develop this point, discuss its relationship to novel therapeutics for neuropsychiatric disorders in humans, and conclude by relating ongoing research in primates to studies of medial frontal cortical regions in rodents.
Affiliation(s)
- Ilya E Monosov
- Department of Neuroscience, Washington University School of Medicine, St. Louis, MO 63110, USA; Department of Biomedical Engineering, Washington University, St. Louis, MO 63130, USA; Department of Electrical Engineering, Washington University, St. Louis, MO 63130, USA; Department of Neurosurgery School of Medicine, Washington University, St. Louis, MO 63110, USA; Pain Center, Washington University School of Medicine, St. Louis, MO 63110, USA.
- Suzanne N Haber
- Department of Pharmacology and Physiology, University of Rochester, Rochester, NY 14627, USA; Basic Neuroscience, McLean Hospital, Harvard Medical School, Belmont, MA 02478, USA
- Eric C Leuthardt
- Department of Biomedical Engineering, Washington University, St. Louis, MO 63130, USA; Department of Neurosurgery School of Medicine, Washington University, St. Louis, MO 63110, USA
- Ahmad Jezzini
- Department of Neuroscience, Washington University School of Medicine, St. Louis, MO 63110, USA
12. Soltani A, Rakhshan M, Schafer RJ, Burrows BE, Moore T. Separable Influences of Reward on Visual Processing and Choice. J Cogn Neurosci 2020; 33:248-262. [PMID: 33166195 DOI: 10.1162/jocn_a_01647] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Indexed: 11/04/2022]
Abstract
Primate vision is characterized by constant, sequential processing and selection of visual targets to fixate. Although expected reward is known to influence both processing and selection of visual targets, similarities and differences between these effects remain unclear mainly because they have been measured in separate tasks. Using a novel paradigm, we simultaneously measured the effects of reward outcomes and expected reward on target selection and sensitivity to visual motion in monkeys. Monkeys freely chose between two visual targets and received a juice reward with varying probability for eye movements made to either of them. Targets were stationary apertures of drifting gratings, causing the end points of eye movements to these targets to be systematically biased in the direction of motion. We used this motion-induced bias as a measure of sensitivity to visual motion on each trial. We then performed different analyses to explore effects of objective and subjective reward values on choice and sensitivity to visual motion to find similarities and differences between reward effects on these two processes. Specifically, we used different reinforcement learning models to fit choice behavior and estimate subjective reward values based on the integration of reward outcomes over multiple trials. Moreover, to compare the effects of subjective reward value on choice and sensitivity to motion directly, we considered correlations between each of these variables and integrated reward outcomes on a wide range of timescales. We found that, in addition to choice, sensitivity to visual motion was also influenced by subjective reward value, although the motion was irrelevant for receiving reward. Unlike choice, however, sensitivity to visual motion was not affected by objective measures of reward value. Moreover, choice was determined by the difference in subjective reward values of the two options, whereas sensitivity to motion was influenced by the sum of values. 
Finally, models that best predicted visual processing and choice used sets of estimated reward values based on different types of reward integration and timescales. Together, our results demonstrate separable influences of reward on visual processing and choice, and point to the presence of multiple brain circuits for the integration of reward outcomes.
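The abstract says that reinforcement learning models were used to estimate subjective reward values and that choice followed the difference between the two options' values, while motion sensitivity followed their sum. The sketch below shows one common way such quantities are computed: a delta-rule update and a logistic choice rule. The abstract does not specify the models the authors fit, so the function names `update_value` and `p_choose_left` and the parameters `alpha` and `beta` are assumptions for illustration only.

```python
import math


def update_value(value, reward, alpha):
    """Delta rule: move the value estimate toward the latest outcome.

    alpha (hypothetical) sets how many past trials are effectively integrated.
    """
    return value + alpha * (reward - value)


def p_choose_left(v_left, v_right, beta):
    """Logistic choice rule driven by the difference in subjective values.

    beta (hypothetical) controls how deterministic the choice is.
    """
    return 1.0 / (1.0 + math.exp(-beta * (v_left - v_right)))


# Illustrative run: the left target is chosen and rewarded on 3 of 4 trials.
v_left, v_right = 0.5, 0.5
for reward in [1, 1, 0, 1]:
    v_left = update_value(v_left, reward, alpha=0.3)

value_difference = v_left - v_right  # the quantity linked to choice
value_sum = v_left + v_right         # the quantity linked to motion sensitivity

print(p_choose_left(v_left, v_right, beta=5.0), value_difference, value_sum)
```

Changing `alpha` changes the timescale of reward integration, which is the kind of parameter that would differ if visual processing and choice relied on separate value estimates.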
13. Taghizadeh B, Foley NC, Karimimehr S, Cohanpour M, Semework M, Sheth SA, Lashgari R, Gottlieb J. Reward uncertainty asymmetrically affects information transmission within the monkey fronto-parietal network. Commun Biol 2020; 3:594. [PMID: 33087809 PMCID: PMC7578031 DOI: 10.1038/s42003-020-01320-6] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Received: 04/29/2020] [Accepted: 09/25/2020] [Indexed: 01/02/2023] Open
Abstract
A central hypothesis in research on executive function is that controlled information processing is costly and is allocated according to the behavioral benefits it brings. However, while computational theories predict that the benefits of new information depend on prior uncertainty, the cellular effects of uncertainty on the executive network are incompletely understood. Using simultaneous recordings in monkeys, we describe several mechanisms by which the fronto-parietal network reacts to uncertainty. We show that the variance of expected rewards, independently of the value of the rewards, was encoded in single neuron and population spiking activity and local field potential (LFP) oscillations, and, importantly, asymmetrically affected fronto-parietal information transmission (measured through the coherence between spikes and LFPs). Higher uncertainty selectively enhanced information transmission from the parietal to the frontal lobe and suppressed it in the opposite direction, consistent with Bayesian principles that prioritize sensory information according to a decision maker’s prior uncertainty. Bahareh Taghizadeh and Nicholas Foley et al. show that individual neuronal responses, population spiking activity, and local field potential oscillations encode the variance of expected rewards independent of their value. They also demonstrate that reward uncertainty asymmetrically affects neuronal transmission within the monkey fronto-parietal network.
Affiliation(s)
- Bahareh Taghizadeh
- Brain Engineering Research Center, Institute for Research in Fundamental Sciences, Tehran, Iran; School of Cognitive Sciences, Institute for Research in Fundamental Sciences, Tehran, Iran
- Nicholas C Foley
- Department of Neuroscience, Columbia University, New York, NY, USA; Zuckerman Mind Brain Behavior Institute, Columbia University, New York, NY, USA
- Saeed Karimimehr
- Brain Engineering Research Center, Institute for Research in Fundamental Sciences, Tehran, Iran; School of Cognitive Sciences, Institute for Research in Fundamental Sciences, Tehran, Iran
- Michael Cohanpour
- Department of Neuroscience, Columbia University, New York, NY, USA; Zuckerman Mind Brain Behavior Institute, Columbia University, New York, NY, USA
- Mulugeta Semework
- Department of Neuroscience, Columbia University, New York, NY, USA; Zuckerman Mind Brain Behavior Institute, Columbia University, New York, NY, USA
- Sameer A Sheth
- Department of Neurosurgery, Baylor College of Medicine, Houston, TX, USA
- Reza Lashgari
- Brain Engineering Research Center, Institute for Research in Fundamental Sciences, Tehran, Iran; Zuckerman Mind Brain Behavior Institute, Columbia University, New York, NY, USA
- Jacqueline Gottlieb
- Department of Neuroscience, Columbia University, New York, NY, USA; Zuckerman Mind Brain Behavior Institute, Columbia University, New York, NY, USA; The Kavli Institute for Brain Science, Columbia University, New York, NY, USA
|
14
|
Monosov IE. How Outcome Uncertainty Mediates Attention, Learning, and Decision-Making. Trends Neurosci 2020; 43:795-809. [PMID: 32736849 PMCID: PMC8153236 DOI: 10.1016/j.tins.2020.06.009] [Citation(s) in RCA: 31] [Impact Index Per Article: 7.8] [Received: 04/06/2020] [Revised: 06/16/2020] [Accepted: 06/24/2020] [Indexed: 01/24/2023]
Abstract
Animals and humans evolved sophisticated nervous systems that endowed them with the ability to form internal-models or beliefs and make predictions about the future to survive and flourish in a world in which future outcomes are often uncertain. Crucial to this capacity is the ability to adjust behavioral and learning policies in response to the level of uncertainty. Until recently, the neuronal mechanisms that could underlie such uncertainty-guided control have been largely unknown. In this review, I discuss newly discovered neuronal circuits in primates that represent uncertainty about future rewards and propose how they guide information-seeking, attention, decision-making, and learning to help us survive in an uncertain world. Lastly, I discuss the possible relevance of these findings to learning in artificial systems.
Affiliation(s)
- Ilya E Monosov
- Department of Neuroscience and Neurosurgery, Washington University School of Medicine in St. Louis, MO, USA; Department of Biomedical Engineering, Washington University School of Medicine in St. Louis, MO, USA; Washington University Pain Center, Washington University School of Medicine in St. Louis, MO, USA.
|
15
|
Human decisions about when to act originate within a basal forebrain-nigral circuit. Proc Natl Acad Sci U S A 2020; 117:11799-11810. [PMID: 32385157 PMCID: PMC7260969 DOI: 10.1073/pnas.1921211117] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Indexed: 11/18/2022] Open
Abstract
Decision-making studies often focus on brain mechanisms for selecting between goals and actions; however, another important, and often neglected, aspect of decision-making in humans concerns whether, at any given point in time, it is worth making any action at all. We showed that a considerable portion of the variance in when voluntary actions are emitted can be explained by a simple model that takes into account key features of the current environment. By using ultrahigh-field MRI we identified a multilayered circuit in the human brain originating far beyond the medial frontal areas typically linked to human voluntary action starting in the basal forebrain and brain stem, converging in the dopaminergic midbrain, and only then projecting to striatum and cortex. Decisions about when to act are critical for survival in humans as in animals, but how a desire is translated into the decision that an action is worth taking at any particular point in time is incompletely understood. Here we show that a simple model developed to explain when animals decide it is worth taking an action also explains a significant portion of the variance in timing observed when humans take voluntary actions. The model focuses on the current environment’s potential for reward, the timing of the individual’s own recent actions, and the outcomes of those actions. We show, by using ultrahigh-field MRI scanning, that in addition to anterior cingulate cortex within medial frontal cortex, a group of subcortical structures including striatum, substantia nigra, basal forebrain (BF), pedunculopontine nucleus (PPN), and habenula (HB) encode trial-by-trial variation in action time.
Further analysis of the activity patterns found in each area together with psychophysiological interaction analysis and structural equation modeling suggested a model in which BF integrates contextual information that will influence the decision about when to act and communicates this information, in parallel with PPN and HB influences, to nigrostriatal circuits. It is then in the nigrostriatal circuit that action initiation per se begins.
|
16
|
Abstract
Humans and other animals often show a strong desire to know the uncertain rewards their future has in store, even when they cannot use this information to influence the outcome. However, it is unknown how the brain predicts opportunities to gain information and motivates this information-seeking behavior. Here we show that neurons in a network of interconnected subregions of primate anterior cingulate cortex and basal ganglia predict the moment of gaining information about uncertain rewards. Spontaneous increases in their information prediction signals are followed by gaze shifts toward objects associated with resolving uncertainty, and pharmacologically disrupting this network reduces the motivation to seek information. These findings demonstrate a cortico-basal ganglia mechanism responsible for motivating actions to resolve uncertainty by seeking knowledge about the future. Animals resolve uncertainty by seeking knowledge about the future. How the brain controls this is unclear. The authors show that a network including primate anterior cingulate cortex and basal ganglia encodes opportunities to gain information about uncertain rewards and mediates information seeking.
|
17
|
Grabenhorst F, Tsutsui KI, Kobayashi S, Schultz W. Primate prefrontal neurons signal economic risk derived from the statistics of recent reward experience. eLife 2019; 8:e44838. [PMID: 31343407 PMCID: PMC6658165 DOI: 10.7554/elife.44838] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Received: 01/07/2019] [Accepted: 07/12/2019] [Indexed: 01/28/2023] Open
Abstract
Risk derives from the variation of rewards and governs economic decisions, yet how the brain calculates risk from the frequency of experienced events, rather than from explicit risk-descriptive cues, remains unclear. Here, we investigated whether neurons in dorsolateral prefrontal cortex process risk derived from reward experience. Monkeys performed in a probabilistic choice task in which the statistical variance of experienced rewards evolved continually. During these choices, prefrontal neurons signaled the reward-variance associated with specific objects ('object risk') or actions ('action risk'). Crucially, risk was not derived from explicit, risk-descriptive cues but calculated internally from the variance of recently experienced rewards. Support-vector-machine decoding demonstrated accurate neuronal risk discrimination. Within trials, neuronal signals transitioned from experienced reward to risk (risk updating) and from risk to upcoming choice (choice computation). Thus, prefrontal neurons encode the statistical variance of recently experienced rewards, complying with formal decision variables of object risk and action risk.
Affiliation(s)
- Fabian Grabenhorst
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge, United Kingdom
| | - Ken-Ichiro Tsutsui
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge, United Kingdom
| | - Shunsuke Kobayashi
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge, United Kingdom
| | - Wolfram Schultz
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge, United Kingdom
|
18
|
Noritake A, Nakamura K. Encoding prediction signals during appetitive and aversive Pavlovian conditioning in the primate lateral hypothalamus. J Neurophysiol 2019; 121:396-417. [DOI: 10.1152/jn.00247.2018] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Indexed: 12/18/2022] Open
Abstract
The lateral hypothalamus (LH), which plays a role in homeostatic functions such as appetite regulation, is also linked to arousal and motivational behavior. However, little is known about how these components are encoded in the LH. Thus cynomolgus monkeys were conditioned with two distinct contexts, i.e., an appetitive context with available rewards and an aversive context with predicted air puffs. Different LH neuron groups encoded different degrees of expectation, predictability, and risks of rewards in a specific manner. A nearly equal number of one-third of the recorded LH neurons showed a positive or negative correlation between their response to visual conditioned stimuli (CS) that predicted the probabilistic delivery of rewards (0%, 50%, and 100%) and the associative values. For another one-third of recorded neurons, a nearly equal number showed a positive or negative correlation between their responses to rewards [appetitive unconditioned stimulus (US)] and reward predictability. Some neurons exhibited their highest or lowest trace-period responses in the 50% reward trials. These response modulations were represented independently and overlaid on a consistent excitatory or inhibitory response across the conditioning events. LH neurons also showed consistent responses in the aversive context. However, the responses to aversive conditioning events depending on the air puff value and predictability were less common. The multifaceted modulation of consistent activity related to outcome predictions may reflect motivational and arousal signals. Furthermore, it may underlie the role the LH plays in the integration and relay of signals to cortices for adaptive and goal-directed physiological and behavioral responses to environmental changes. NEW & NOTEWORTHY The lateral hypothalamus (LH) is implicated in motivational and arousal behavior; however, the detailed information carried by single LH neurons remains unclear. 
We demonstrate that primate LH neurons encode multiple combinations of signals concerning different degrees of expectation, appreciation, and uncertainty of rewards in consistent responses across conditioning events and between different contexts. This multifaceted modulation of activity may underlie the role of the LH as a critical node integrating motivational signals with arousal signals.
Affiliation(s)
- Atsushi Noritake
- Department of Physiology, Kansai Medical University, Hirakata-city, Osaka, Japan
- National Institute for Physiological Sciences, Okazaki-city, Aichi, Japan
| | - Kae Nakamura
- Department of Physiology, Kansai Medical University, Hirakata-city, Osaka, Japan
|
19
|
Chen X, Stuphorn V. Inactivation of Medial Frontal Cortex Changes Risk Preference. Curr Biol 2018; 28:3114-3122.e4. [PMID: 30245108 PMCID: PMC6177298 DOI: 10.1016/j.cub.2018.07.043] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Received: 03/19/2018] [Revised: 06/11/2018] [Accepted: 07/13/2018] [Indexed: 12/23/2022]
Abstract
Humans and other animals need to make decisions under varying degrees of uncertainty. These decisions are strongly influenced by an individual's risk preference; however, the neuronal circuitry by which risk preference shapes choice is still unclear [1]. Supplementary eye field (SEF), an oculomotor area within primate medial frontal cortex, is thought to be an essential part of the neuronal circuit underlying oculomotor decision making, including decisions under risk [2-5]. Consistent with this view, risk-related action value and monitoring signals have been observed in SEF [6-8]. However, such activity has also been observed in other frontal areas, including orbitofrontal [9-11], cingulate [12-14], and dorsal-lateral frontal cortex [15]. It is thus unknown whether the activity in SEF causally contributes to risky decisions, or whether it is merely a reflection of neural processes in other cortical regions. Here, we tested a causal role of SEF in risky oculomotor choices. We found that SEF inactivation strongly reduced the frequency of risky choices. This reduction was largely due to a reduced attraction to reward uncertainty and high reward gain, but not due to changes in the subjective estimation of reward probability or average expected reward. Moreover, SEF inactivation also led to increased sensitivity to differences between expected and actual reward during free choice. Nevertheless, it did not affect adjustments of decisions based on reward history.
Affiliation(s)
- Xiaomo Chen
- Department of Neuroscience, Johns Hopkins University School of Medicine and Zanvyl Krieger Mind/Brain Institute, 3400 North Charles Street, Baltimore, MD 21218-2685, USA; Department of Psychological and Brain Sciences, Johns Hopkins University, 3400 North Charles Street, Baltimore, MD 21218-2685, USA
| | - Veit Stuphorn
- Department of Neuroscience, Johns Hopkins University School of Medicine and Zanvyl Krieger Mind/Brain Institute, 3400 North Charles Street, Baltimore, MD 21218-2685, USA; Department of Psychological and Brain Sciences, Johns Hopkins University, 3400 North Charles Street, Baltimore, MD 21218-2685, USA.
|
20
|
The Basal Forebrain Regulates Global Resting-State fMRI Fluctuations. Neuron 2018; 97:940-952.e4. [PMID: 29398365 DOI: 10.1016/j.neuron.2018.01.032] [Citation(s) in RCA: 139] [Impact Index Per Article: 23.2] [Received: 08/26/2017] [Revised: 12/10/2017] [Accepted: 01/12/2018] [Indexed: 01/06/2023]
Abstract
Patterns of spontaneous brain activity, typically measured in humans at rest with fMRI, are used routinely to assess the brain's functional organization. The mechanisms that generate and coordinate the underlying neural fluctuations are largely unknown. Here we investigate the hypothesis that the nucleus basalis of Meynert (NBM), the principal source of widespread cholinergic and GABAergic projections to the cortex, contributes critically to such activity. We reversibly inactivated two distinct sites of the NBM in macaques while measuring fMRI activity across the brain. We found that inactivation led to strong, regionalized suppression of shared or "global" signal components of cortical fluctuations ipsilateral to the injection. At the same time, the commonly studied resting-state networks retained their spatial structure under this suppression. The results indicate that the NBM contributes selectively to the global component of functional connectivity but plays little if any role in the specific correlations that define resting-state networks.
|
21
|
Anterior cingulate is a source of valence-specific information about value and uncertainty. Nat Commun 2017; 8:134. [PMID: 28747623 PMCID: PMC5529456 DOI: 10.1038/s41467-017-00072-y] [Citation(s) in RCA: 61] [Impact Index Per Article: 8.7] [Received: 11/03/2016] [Accepted: 05/30/2017] [Indexed: 01/29/2023] Open
Abstract
Anterior cingulate cortex (ACC) is thought to control a wide range of reward, punishment, and uncertainty-related behaviors. However, how it does so is unclear. Here, in a Pavlovian procedure in which monkeys displayed a diverse repertoire of reward-related, punishment-related, and uncertainty-related behaviors, we show that many ACC-neurons represent expected value and uncertainty in a valence-specific manner, signaling value or uncertainty predictions about either rewards or punishments. Other ACC-neurons signal prediction information about rewards and punishments by displaying excitation to both (rather than excitation to one and inhibition to the other). This diversity in valence representations may support the role of ACC in many behavioral states that are either enhanced by reward and punishment (e.g., vigilance) or specific to either reward or punishment (e.g., approach and avoidance). Also, this first demonstration of punishment-uncertainty signals in the brain suggests that ACC could be a target for the treatment of uncertainty-related disorders of mood. Rewards or punishments elicit diverse behavioral responses; however, the neural circuits underlying such flexibility are unclear. Here Monosov shows that this diversity could be supported by neurons in the anterior cingulate that represent expected value and uncertainty in a valence-specific manner.
|
22
|
Gielow MR, Zaborszky L. The Input-Output Relationship of the Cholinergic Basal Forebrain. Cell Rep 2017; 18:1817-1830. [PMID: 28199851 PMCID: PMC5725195 DOI: 10.1016/j.celrep.2017.01.060] [Citation(s) in RCA: 133] [Impact Index Per Article: 19.0] [Received: 08/06/2016] [Revised: 12/05/2016] [Accepted: 01/24/2017] [Indexed: 12/21/2022] Open
Abstract
Basal forebrain cholinergic neurons influence cortical state, plasticity, learning, and attention. They collectively innervate the entire cerebral cortex, differentially controlling acetylcholine efflux across different cortical areas and timescales. Such control might be achieved by differential inputs driving separable cholinergic outputs, although no input-output relationship on a brain-wide level has ever been demonstrated. Here, we identify input neurons to cholinergic cells projecting to specific cortical regions by infecting cholinergic axon terminals with a monosynaptically restricted viral tracer. This approach revealed several circuit motifs, such as central amygdala neurons synapsing onto basolateral amygdala-projecting cholinergic neurons or strong somatosensory cortical input to motor cortex-projecting cholinergic neurons. The presence of input cells in the parasympathetic midbrain nuclei contacting frontally projecting cholinergic neurons suggests that the network regulating the inner eye muscles is additionally regulating cortical state via acetylcholine efflux. This dataset enables future circuit-level experiments to identify drivers of known cortical cholinergic functions.
Affiliation(s)
- Matthew R Gielow
- Center for Molecular and Behavioral Neuroscience, Rutgers, the State University of New Jersey, Newark, NJ 07102, USA
| | - Laszlo Zaborszky
- Center for Molecular and Behavioral Neuroscience, Rutgers, the State University of New Jersey, Newark, NJ 07102, USA.
|
23
|
White JK, Monosov IE. Neurons in the primate dorsal striatum signal the uncertainty of object-reward associations. Nat Commun 2016; 7:12735. [PMID: 27623750 PMCID: PMC5027277 DOI: 10.1038/ncomms12735] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.1] [Received: 03/29/2016] [Accepted: 07/28/2016] [Indexed: 01/03/2023] Open
Abstract
To learn, obtain reward and survive, humans and other animals must monitor, approach and act on objects that are associated with variable or unknown rewards. However, the neuronal mechanisms that mediate behaviours aimed at uncertain objects are poorly understood. Here we demonstrate that a set of neurons in an internal-capsule bordering region of the primate dorsal striatum, within the putamen and caudate nucleus, signal the uncertainty of object–reward associations. Their uncertainty responses depend on the presence of objects associated with reward uncertainty and evolve rapidly as monkeys learn novel object–reward associations. Therefore, beyond its established role in mediating actions aimed at known or certain rewards, the dorsal striatum also participates in behaviours aimed at reward-uncertain objects. The dorsal striatum (DS) is a brain region that is thought to aim actions at certain or known rewards. Here, the authors show that an internal-capsule bordering region of the primate DS signals the uncertainty of object-reward associations, suggesting a novel role for the DS in behavior under uncertainty.
Affiliation(s)
- J Kael White
- Department of Neuroscience, Washington University School of Medicine, 660 S. Euclid Avenue, St Louis, Missouri 63110, USA
| | - Ilya E Monosov
- Department of Neuroscience, Washington University School of Medicine, 660 S. Euclid Avenue, St Louis, Missouri 63110, USA
|