1
|
Pan-Vazquez A, Sanchez Araujo Y, McMannon B, Louka M, Bandi A, Haetzel L, Faulkner M, Pillow JW, Daw ND, Witten IB. Pre-existing visual responses in a projection-defined dopamine population explain individual learning trajectories. Curr Biol 2024:S0960-9822(24)01286-7. [PMID: 39413788 DOI: 10.1016/j.cub.2024.09.045] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2024] [Revised: 06/11/2024] [Accepted: 09/17/2024] [Indexed: 10/18/2024]
Abstract
A key challenge of learning a new task is that the environment is high dimensional-there are many different sensory features and possible actions, with typically only a small reward-relevant subset. Although animals can learn to perform complex tasks that involve arbitrary associations between stimuli, actions, and rewards,1,2,3,4,5,6 a consistent and striking result across varied experimental paradigms is that in initially acquiring such tasks, large differences between individuals are apparent in the learning process.7,8,9,10,11,12 What neural mechanisms contribute to initial task acquisition, and why do some individuals learn a new task much more quickly than others? To address these questions, we recorded longitudinally from dopaminergic (DA) axon terminals in mice learning a visual decision-making task.7 Across striatum, DA responses tracked idiosyncratic and side-specific learning trajectories, consistent with widespread reward prediction error coding across DA terminals. However, even before any rewards were delivered, contralateral-side-specific visual responses were present in DA terminals, primarily in the dorsomedial striatum (DMS). These pre-existing responses predicted the extent of learning for contralateral stimuli. Moreover, activation of these terminals improved contralateral performance. Thus, the initial conditions of a projection-specific and feature-specific DA signal help explain individual learning trajectories. More broadly, this work suggests that functional heterogeneity across DA projections may serve to bias target regions toward learning about different subsets of task features, providing a potential mechanism to address the dimensionality of the initial task learning problem.
Collapse
Affiliation(s)
- Alejandro Pan-Vazquez
- Princeton Neuroscience Institute, Princeton University, Washington Road, Princeton, NJ 08540, USA
| | - Yoel Sanchez Araujo
- Princeton Neuroscience Institute, Princeton University, Washington Road, Princeton, NJ 08540, USA
| | - Brenna McMannon
- Princeton Neuroscience Institute, Princeton University, Washington Road, Princeton, NJ 08540, USA
| | - Miranta Louka
- Princeton Neuroscience Institute, Princeton University, Washington Road, Princeton, NJ 08540, USA
| | - Akhil Bandi
- Princeton Neuroscience Institute, Princeton University, Washington Road, Princeton, NJ 08540, USA
| | - Laura Haetzel
- Princeton Neuroscience Institute, Princeton University, Washington Road, Princeton, NJ 08540, USA
| | - Mayo Faulkner
- UCL Institute of Ophthalmology, University College London, 11-43 Bath Street, London EC1V 9EL, UK
| | - Jonathan W Pillow
- Princeton Neuroscience Institute, Princeton University, Washington Road, Princeton, NJ 08540, USA
| | - Nathaniel D Daw
- Princeton Neuroscience Institute, Princeton University, Washington Road, Princeton, NJ 08540, USA; Department of Psychology, Princeton University, Washington Road, Princeton, NJ 08540, USA.
| | - Ilana B Witten
- Princeton Neuroscience Institute, Princeton University, Washington Road, Princeton, NJ 08540, USA; Howard Hughes Medical Institute, Princeton University, Washington Road, Princeton, NJ 08540, USA.
| |
Collapse
|
2
|
Venditto SJC, Miller KJ, Brody CD, Daw ND. Dynamic reinforcement learning reveals time-dependent shifts in strategy during reward learning. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.02.28.582617. [PMID: 38464244 PMCID: PMC10925334 DOI: 10.1101/2024.02.28.582617] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/12/2024]
Abstract
Different brain systems have been hypothesized to subserve multiple "experts" that compete to generate behavior. In reinforcement learning, two general processes, one model-free (MF) and one model-based (MB), are often modeled as a mixture of agents (MoA) and hypothesized to capture differences between automaticity vs. deliberation. However, shifts in strategy cannot be captured by a static MoA. To investigate such dynamics, we present the mixture-of-agents hidden Markov model (MoA-HMM), which simultaneously learns inferred action values from a set of agents and the temporal dynamics of underlying "hidden" states that capture shifts in agent contributions over time. Applying this model to a multi-step, reward-guided task in rats reveals a progression of within-session strategies: a shift from initial MB exploration to MB exploitation, and finally to reduced engagement. The inferred states predict changes in both response time and OFC neural encoding during the task, suggesting that these states are capturing real shifts in dynamics.
Collapse
|
3
|
Vloeberghs R, Urai AE, Desender K, Linderman SW. A Bayesian Hierarchical Model of Trial-To-Trial Fluctuations in Decision Criterion. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.07.30.605869. [PMID: 39211219 PMCID: PMC11361103 DOI: 10.1101/2024.07.30.605869] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/04/2024]
Abstract
Classical decision models assume that the parameters giving rise to choice behavior are stable, yet emerging research suggests these parameters may fluctuate over time. Such fluctuations, observed in neural activity and behavioral strategies, have significant implications for understanding decision-making processes. However, empirical studies on fluctuating human decision-making strategies have been limited due to the extensive data requirements for estimating these fluctuations. Here, we introduce hMFC (Hierarchical Model for Fluctuations in Criterion), a Bayesian framework designed to estimate slow fluctuations in the decision criterion from limited data. We first showcase the importance of considering fluctuations in decision criterion: incorrectly assuming a stable criterion gives rise to apparent history effects and underestimates perceptual sensitivity. We then present a hierarchical estimation procedure capable of reliably recovering the underlying state of the fluctuating decision criterion with as few as 500 trials per participant, offering a robust tool for researchers with typical human datasets. Critically, hMFC does not only accurately recover the state of the underlying decision criterion, it also effectively deals with the confounds caused by criterion fluctuations. Lastly, we provide code and a comprehensive demo at www.github.com/robinvloeberghs/hMFC to enable widespread application of hMFC in decision-making research.
Collapse
Affiliation(s)
| | - Anne E. Urai
- Cognitive Psychology, Leiden University, The Netherlands
| | | | - Scott W. Linderman
- Department of Statistics and Wu Tsai Neurosciences Institute, Stanford University, Stanford, CA, USA
| |
Collapse
|
4
|
Ramamurthy DL, Rodriguez L, Cen C, Li S, Chen A, Feldman DE. Reward history guides focal attention in whisker somatosensory cortex. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.07.17.603927. [PMID: 39131281 PMCID: PMC11312476 DOI: 10.1101/2024.07.17.603927] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 08/13/2024]
Abstract
Prior reward is a potent cue for attentional capture, but the underlying neurobiology is largely unknown. In a novel whisker touch detection task, we show that mice flexibly shift attention between specific whiskers on a trial-by-trial timescale, guided by the recent history of stimulus-reward association. Two-photon calcium imaging and spike recordings revealed a robust neurobiological correlate of attention in the somatosensory cortex (S1), boosting sensory responses to the attended whisker in L2/3 and L5, but not L4. Attentional boosting in L2/3 pyramidal cells was topographically precise and whisker-specific, and shifted receptive fields toward the attended whisker. L2/3 VIP interneurons were broadly activated by whisker stimuli, motion, and arousal but did not carry a whisker-specific attentional signal, and thus did not mediate spatially focused tactile attention. Together, these findings establish a new model of focal attention in the mouse whisker tactile system, showing that the history of stimuli and rewards in the recent past can dynamically engage local modulation in cortical sensory maps to guide flexible shifts in ongoing behavior.
Collapse
Affiliation(s)
- Deepa L. Ramamurthy
- Department of Neuroscience and Helen Wills Neuroscience Institute, UC Berkeley
| | - Lucia Rodriguez
- Department of Neuroscience and Helen Wills Neuroscience Institute, UC Berkeley
- Neuroscience PhD Program, UC Berkeley
| | - Celine Cen
- Department of Neuroscience and Helen Wills Neuroscience Institute, UC Berkeley
| | - Siqian Li
- Department of Neuroscience and Helen Wills Neuroscience Institute, UC Berkeley
| | - Andrew Chen
- Department of Neuroscience and Helen Wills Neuroscience Institute, UC Berkeley
| | - Daniel E. Feldman
- Department of Neuroscience and Helen Wills Neuroscience Institute, UC Berkeley
- Lead Contact
| |
Collapse
|
5
|
Carandini M. Sensory choices as logistic classification. Neuron 2024; 112:2854-2868.e1. [PMID: 39013468 PMCID: PMC11377159 DOI: 10.1016/j.neuron.2024.06.016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2024] [Revised: 05/13/2024] [Accepted: 06/19/2024] [Indexed: 07/18/2024]
Abstract
Logistic classification is a simple way to make choices based on a set of factors: give each factor a weight, sum the results, and use the sum to set the log odds of a random draw. This operation is known to describe human and animal choices based on value (economic decisions). There is increasing evidence that it also describes choices based on sensory inputs (perceptual decisions), presented across sensory modalities (multisensory integration) and combined with non-sensory factors such as prior probability, expected value, overall motivation, and recent actions. Logistic classification can also capture the effects of brain manipulations such as local inactivations. The brain may implement it by thresholding stochastic inputs (as in signal detection theory) acquired over time (as in the drift diffusion model). It is the optimal strategy under certain conditions, and the brain appears to use it as a heuristic in a wider set of conditions.
Collapse
Affiliation(s)
- Matteo Carandini
- UCL Institute of Ophthalmology, University College London, London WC1 6BT, UK.
| |
Collapse
|
6
|
Brown E, Zi Y, Vu MA, Bouabid S, Lindsey J, Godfrey-Nwachukwu C, Attarwala A, Litwin-Kumar A, DePasquale B, Howe M. Spatially organized striatal neuromodulator release encodes trajectory errors. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.08.13.607797. [PMID: 39185163 PMCID: PMC11343099 DOI: 10.1101/2024.08.13.607797] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 08/27/2024]
Abstract
Goal-directed navigation requires animals to continuously evaluate their current direction and speed of travel relative to landmarks to discern whether they are approaching or deviating from their goal. Striatal dopamine and acetylcholine are powerful modulators of goal-directed behavior, but it is unclear whether and how neuromodulator dynamics at landmarks incorporate relative motion for effective behavioral guidance. Using optical measurements in mice, we demonstrate that cue-evoked striatal dopamine release encodes bi-directional 'trajectory errors' reflecting relationships between ongoing speed and direction of locomotion and visual flow relative to optimal goal trajectories. Striatum-wide micro-fiber array recordings resolved an anatomical gradient of trajectory error signaling across the anterior-posterior axis, distinct from trajectory error independent cue signals. Dynamic regression modeling revealed that positive and negative trajectory error encoding emerges early and late respectively during learning and over different time courses in the medial and lateral striatum, enabling region specific contributions to learning. Striatal acetylcholine release also encodes trajectory errors, but encoding is more spatially restricted, opposite polarity, and delayed relative to dopamine, supporting distinct roles in modulating striatal output and behavior. Dopamine trajectory error signaling and task performance were reproduced in a reinforcement learning model incorporating a conjunctive state space representation, suggesting a potential neural substrate for trajectory error generation. Our results establish region specific neuromodulator signals positioned to guide the speed and direction of locomotion to reach goals based on environmental landmarks during navigation.
Collapse
Affiliation(s)
- Eleanor Brown
- Department of Psychological & Brain Sciences, Boston University, Boston, MA, USA
| | - Yihan Zi
- Department of Psychological & Brain Sciences, Boston University, Boston, MA, USA
- Department of Biomedical Engineering, Boston University, Boston, MA, USA
| | - Mai-Anh Vu
- Department of Psychological & Brain Sciences, Boston University, Boston, MA, USA
| | - Safa Bouabid
- Department of Psychological & Brain Sciences, Boston University, Boston, MA, USA
| | - Jack Lindsey
- Department of Neuroscience, Columbia University, New York, NY, USA
| | | | - Aaquib Attarwala
- Department of Psychological & Brain Sciences, Boston University, Boston, MA, USA
| | | | - Brian DePasquale
- Department of Biomedical Engineering, Boston University, Boston, MA, USA
| | - Mark Howe
- Department of Psychological & Brain Sciences, Boston University, Boston, MA, USA
| |
Collapse
|
7
|
Marrero K, Aruljothi K, Delgadillo C, Kabbara S, Swatch L, Zagha E. Goal-directed learning is multidimensional and accompanied by diverse and widespread changes in neocortical signaling. Cereb Cortex 2024; 34:bhae328. [PMID: 39110412 PMCID: PMC11304966 DOI: 10.1093/cercor/bhae328] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2024] [Revised: 07/19/2024] [Accepted: 07/25/2024] [Indexed: 08/10/2024] Open
Abstract
New tasks are often learned in stages with each stage reflecting a different learning challenge. Accordingly, each learning stage is likely mediated by distinct neuronal processes. And yet, most rodent studies of the neuronal correlates of goal-directed learning focus on individual outcome measures and individual brain regions. Here, we longitudinally studied mice from naïve to expert performance in a head-fixed, operant conditioning whisker discrimination task. In addition to tracking the primary behavioral outcome of stimulus discrimination, we tracked and compared an array of object-based and temporal-based behavioral measures. These behavioral analyses identify multiple, partially overlapping learning stages in this task, consistent with initial response implementation, early stimulus-response generalization, and late response inhibition. To begin to understand the neuronal foundations of these learning processes, we performed widefield Ca2+ imaging of dorsal neocortex throughout learning and correlated behavioral measures with neuronal activity. We found distinct and widespread correlations between neocortical activation patterns and various behavioral measures. For example, improvements in sensory discrimination correlated with target stimulus evoked activations of response-related cortices along with distractor stimulus evoked global cortical suppression. Our study reveals multidimensional learning for a simple goal-directed learning task and generates hypotheses for the neuronal modulations underlying these various learning processes.
Collapse
Affiliation(s)
- Krista Marrero
- Neuroscience Graduate Program, University of California Riverside, 900 University Avenue, Riverside, CA 92521, USA
| | - Krithiga Aruljothi
- Department of Psychology, University of California Riverside, 900 University Avenue, Riverside, CA 92521, USA
| | - Christian Delgadillo
- Division of Biomedical Sciences, University of California Riverside, 900 University Avenue, Riverside, CA 92521, USA
| | - Sarah Kabbara
- Neuroscience Graduate Program, University of California Riverside, 900 University Avenue, Riverside, CA 92521, USA
| | - Lovleen Swatch
- College of Natural & Agricultural Sciences, University of California Riverside, 900 University Avenue, Riverside, CA 92521, USA
| | - Edward Zagha
- Neuroscience Graduate Program, University of California Riverside, 900 University Avenue, Riverside, CA 92521, USA
- Department of Psychology, University of California Riverside, 900 University Avenue, Riverside, CA 92521, USA
| |
Collapse
|
8
|
Dong Y, Lengyel G, Shivkumar S, Anzai A, DiRisio GF, Haefner RM, DeAngelis GC. How to reward animals based on their subjective percepts: A Bayesian approach to online estimation of perceptual biases. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.07.25.605047. [PMID: 39091868 PMCID: PMC11291170 DOI: 10.1101/2024.07.25.605047] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/04/2024]
Abstract
Elucidating the neural basis of perceptual biases, such as those produced by visual illusions, can provide powerful insights into the neural mechanisms of perceptual inference. However, studying the subjective percepts of animals poses a fundamental challenge: unlike human participants, animals cannot be verbally instructed to report what they see, hear, or feel. Instead, they must be trained to perform a task for reward, and researchers must infer from their responses what the animal perceived. However, animals' responses are shaped by reward feedback, thus raising the major concern that the reward regimen may alter the animal's decision strategy or even intrinsic perceptual biases. We developed a method that estimates perceptual bias during task performance and then computes the reward for each trial based on the evolving estimate of the animal's perceptual bias. Our approach makes use of multiple stimulus contexts to dissociate perceptual biases from decision-related biases. Starting with an informative prior, our Bayesian method updates a posterior over the perceptual bias after each trial. The prior can be specified based on data from past sessions, thus reducing the variability of the online estimates and allowing it to converge to a stable estimate over a small number of trials. After validating our method on synthetic data, we apply it to estimate perceptual biases of monkeys in a motion direction discrimination task in which varying background optic flow induces robust perceptual biases. This method overcomes an important challenge to understanding the neural basis of subjective percepts.
Collapse
Affiliation(s)
- Yelin Dong
- Department of Brain and Cognitive Sciences, University of Rochester, Rochester, NY, USA
| | - Gabor Lengyel
- Department of Brain and Cognitive Sciences, University of Rochester, Rochester, NY, USA
| | - Sabyasachi Shivkumar
- Department of Brain and Cognitive Sciences, University of Rochester, Rochester, NY, USA
- Zuckerman Institute, Columbia University, New York, NY, USA
| | - Akiyuki Anzai
- Department of Brain and Cognitive Sciences, University of Rochester, Rochester, NY, USA
| | - Grace F DiRisio
- Department of Brain and Cognitive Sciences, University of Rochester, Rochester, NY, USA
| | - Ralf M Haefner
- Department of Brain and Cognitive Sciences, University of Rochester, Rochester, NY, USA
| | - Gregory C DeAngelis
- Department of Brain and Cognitive Sciences, University of Rochester, Rochester, NY, USA
| |
Collapse
|
9
|
Goudar V, Kim JW, Liu Y, Dede AJO, Jutras MJ, Skelin I, Ruvalcaba M, Chang W, Ram B, Fairhall AL, Lin JJ, Knight RT, Buffalo EA, Wang XJ. A Comparison of Rapid Rule-Learning Strategies in Humans and Monkeys. J Neurosci 2024; 44:e0231232024. [PMID: 38871463 PMCID: PMC11236592 DOI: 10.1523/jneurosci.0231-23.2024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2023] [Revised: 05/28/2024] [Accepted: 05/31/2024] [Indexed: 06/15/2024] Open
Abstract
Interspecies comparisons are key to deriving an understanding of the behavioral and neural correlates of human cognition from animal models. We perform a detailed comparison of the strategies of female macaque monkeys to male and female humans on a variant of the Wisconsin Card Sorting Test (WCST), a widely studied and applied task that provides a multiattribute measure of cognitive function and depends on the frontal lobe. WCST performance requires the inference of a rule change given ambiguous feedback. We found that well-trained monkeys infer new rules three times more slowly than minimally instructed humans. Input-dependent hidden Markov model-generalized linear models were fit to their choices, revealing hidden states akin to feature-based attention in both species. Decision processes resembled a win-stay, lose-shift strategy with interspecies similarities as well as key differences. Monkeys and humans both test multiple rule hypotheses over a series of rule-search trials and perform inference-like computations to exclude candidate choice options. We quantitatively show that perseveration, random exploration, and poor sensitivity to negative feedback account for the slower task-switching performance in monkeys.
Collapse
Affiliation(s)
- Vishwa Goudar
- Center for Neural Science, New York University, New York 10003
| | - Jeong-Woo Kim
- Center for Neural Science, New York University, New York 10003
| | - Yue Liu
- Center for Neural Science, New York University, New York 10003
| | - Adam J O Dede
- Department of Physiology and Biophysics, University of Washington, Seattle, Washington 98195
| | - Michael J Jutras
- Department of Physiology and Biophysics, University of Washington, Seattle, Washington 98195
| | - Ivan Skelin
- Department of Neurology, University of California, Davis, California 95616
- The Center for Mind and Brain, University of California, Davis, California 95616
| | - Michael Ruvalcaba
- Helen Wills Neuroscience Institute, University of California, Berkeley, California 94720
| | - William Chang
- Helen Wills Neuroscience Institute, University of California, Berkeley, California 94720
| | - Bhargavi Ram
- Department of Neurology, University of California, Davis, California 95616
- The Center for Mind and Brain, University of California, Davis, California 95616
| | - Adrienne L Fairhall
- Department of Physiology and Biophysics, University of Washington, Seattle, Washington 98195
| | - Jack J Lin
- Department of Neurology, University of California, Davis, California 95616
- The Center for Mind and Brain, University of California, Davis, California 95616
| | - Robert T Knight
- Helen Wills Neuroscience Institute, University of California, Berkeley, California 94720
- Department of Psychology, University of California, Berkeley, California 94720
| | - Elizabeth A Buffalo
- Department of Physiology and Biophysics, University of Washington, Seattle, Washington 98195
- Washington Primate Research Center, University of Washington, Seattle, Washington 98195
| | - Xiao-Jing Wang
- Center for Neural Science, New York University, New York 10003
| |
Collapse
|
10
|
Carandini M. Sensory choices as logistic classification. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.01.17.576029. [PMID: 38979189 PMCID: PMC11230223 DOI: 10.1101/2024.01.17.576029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/10/2024]
Abstract
Logistic classification is a simple way to make choices based on a set of factors: give each factor a weight, sum the results, and use the sum to set the log odds of a random draw. This operation is known to describe human and animal choices based on value (economic decisions). There is increasing evidence that it also describes choices based on sensory inputs (perceptual decisions), presented across sensory modalities (multisensory integration) and combined with non-sensory factors such as prior probability, expected value, overall motivation, and recent actions. Logistic classification can also capture the effects of brain manipulations such as local inactivations. The brain may implement by thresholding stochastic inputs (as in signal detection theory) acquired over time (as in the drift diffusion model). It is the optimal strategy under certain conditions, and the brain appears to use it as a heuristic in a wider set of conditions.
Collapse
Affiliation(s)
- Matteo Carandini
- UCL Institute of Ophthalmology, University College London, London WC1 6BT, UK
| |
Collapse
|
11
|
Tyler Boyd-Meredith J, Piet AT, Kopec CD, Brody CD. A cognitive process model captures near-optimal confidence-guided waiting in rats. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.06.07.597954. [PMID: 38895394 PMCID: PMC11185770 DOI: 10.1101/2024.06.07.597954] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/21/2024]
Abstract
Rational decision-makers invest more time pursuing rewards they are more confident they will eventually receive. A series of studies have therefore used willingness to wait for delayed rewards as a proxy for decision confidence. However, interpretation of waiting behavior is limited because it is unclear how environmental statistics influence optimal waiting, and how sources of internal variability influence subjects' behavior. We trained rats to perform a confidence-guided waiting task, and derived expressions for optimal waiting that make relevant environmental statistics explicit, including travel time incurred traveling from one reward opportunity to another. We found that rats waited longer than fully optimal agents, but that their behavior was closely matched by optimal agents with travel times constrained to match their own. We developed a process model describing the decision to stop waiting as an accumulation to bound process, which allowed us to compare the effects of multiple sources of internal variability on waiting. Surprisingly, although mean wait times grew with confidence, variability did not, inconsistent with scalar invariant timing, and best explained by variability in the stopping bound. Our results describe a tractable process model that can capture the influence of environmental statistics and internal sources of variability on subjects' decision process during confidence-guided waiting.
Collapse
Affiliation(s)
- J Tyler Boyd-Meredith
- Princeton Neuroscience Institute, Princeton University, Princeton, United States
- Sainsbury Wellcome Centre, University College London, London, UK
| | - Alex T Piet
- Allen Institute, Seattle, Washington, United States
| | - Chuck D Kopec
- Princeton Neuroscience Institute, Princeton University, Princeton, United States
| | - Carlos D Brody
- Princeton Neuroscience Institute, Princeton University, Princeton, United States
- Howard Hughes Medical Institute, Princeton University, Princeton, United States
| |
Collapse
|
12
|
Prakash SS, Mayo JP, Ray S. Dissociation of attentional state and behavioral outcome using local field potentials. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.08.05.552102. [PMID: 37609148 PMCID: PMC10441331 DOI: 10.1101/2023.08.05.552102] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/24/2023]
Abstract
Successful behavior depends on attentional state and other factors related to decision-making, which may modulate neuronal activity differently. Here, we investigated whether attentional state and behavioral outcome (i.e., whether a target is detected or missed) are distinguishable using the power and phase of local field potential (LFP) recorded bilaterally from area V4 of monkeys performing a cued visual attention task. To link each trial's outcome to pairwise measures of attention that are typically averaged across trials, we used several methods to obtain single-trial estimates of spike count correlation and phase consistency. Surprisingly, while attentional location was best discriminated using gamma and high-gamma power, behavioral outcome was best discriminated by alpha power and steady-state visually evoked potential. Power outperformed absolute phase in attentional/behavioral discriminability, although single-trial gamma phase consistency provided reasonably high attentional discriminability. Our results suggest a dissociation between the neuronal mechanisms that regulate attentional focus and behavioral outcome.
Collapse
Affiliation(s)
- Surya S Prakash
- Centre for Neuroscience, Indian Institute of Science, Bangalore, India, 560012
| | - J Patrick Mayo
- Department of Ophthalmology, University of Pittsburgh, Pittsburgh, Pennsylvania, USA, 15219
| | - Supratim Ray
- Centre for Neuroscience, Indian Institute of Science, Bangalore, India, 560012
| |
Collapse
|
13
|
Piet A, Ponvert N, Ollerenshaw D, Garrett M, Groblewski PA, Olsen S, Koch C, Arkhipov A. Behavioral strategy shapes activation of the Vip-Sst disinhibitory circuit in visual cortex. Neuron 2024; 112:1876-1890.e4. [PMID: 38447579 PMCID: PMC11156560 DOI: 10.1016/j.neuron.2024.02.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2023] [Revised: 11/17/2023] [Accepted: 02/08/2024] [Indexed: 03/08/2024]
Abstract
In complex environments, animals can adopt diverse strategies to find rewards. How distinct strategies differentially engage brain circuits is not well understood. Here, we investigate this question, focusing on the cortical Vip-Sst disinhibitory circuit between vasoactive intestinal peptide-postive (Vip) interneurons and somatostatin-positive (Sst) interneurons. We characterize the behavioral strategies used by mice during a visual change detection task. Using a dynamic logistic regression model, we find that individual mice use mixtures of a visual comparison strategy and a statistical timing strategy. Separately, mice also have periods of task engagement and disengagement. Two-photon calcium imaging shows large strategy-dependent differences in neural activity in excitatory, Sst inhibitory, and Vip inhibitory cells in response to both image changes and image omissions. In contrast, task engagement has limited effects on neural population activity. We find that the diversity of neural correlates of strategy can be understood parsimoniously as the increased activation of the Vip-Sst disinhibitory circuit during the visual comparison strategy, which facilitates task-appropriate responses.
Collapse
Affiliation(s)
- Alex Piet
- Allen Institute, Mindscope Program, Seattle, WA, USA.
| | - Nick Ponvert
- Allen Institute, Mindscope Program, Seattle, WA, USA
| | | | | | | | - Shawn Olsen
- Allen Institute, Mindscope Program, Seattle, WA, USA
| | - Christof Koch
- Allen Institute, Mindscope Program, Seattle, WA, USA
| | | |
Collapse
|
14
|
Zhu Z, Kuchibhotla KV. Performance errors during rodent learning reflect a dynamic choice strategy. Curr Biol 2024; 34:2107-2117.e5. [PMID: 38677279 PMCID: PMC11488394 DOI: 10.1016/j.cub.2024.04.017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2023] [Revised: 02/10/2024] [Accepted: 04/08/2024] [Indexed: 04/29/2024]
Abstract
Humans, even as infants, use cognitive strategies, such as exploration and hypothesis testing, to learn about causal interactions in the environment. In animal learning studies, however, it is challenging to disentangle higher-order behavioral strategies from errors arising from imperfect task knowledge or inherent biases. Here, we trained head-fixed mice on a wheel-based auditory two-choice task and exploited the intra- and inter-animal variability to understand the drivers of errors during learning. During learning, performance errors are dominated by a choice bias, which, despite appearing maladaptive, reflects a dynamic strategy. Early in learning, mice develop an internal model of the task contingencies such that violating their expectation of reward on correct trials (by using short blocks of non-rewarded "probe" trials) leads to an abrupt shift in strategy. During the probe block, mice behave more accurately with less bias, thereby using their learned stimulus-action knowledge to test whether the outcome contingencies have changed. Despite having this knowledge, mice continued to exhibit a strong choice bias during reinforced trials. This choice bias operates on a timescale of tens to hundreds of trials with a dynamic structure, shifting between left, right, and unbiased epochs. Biased epochs also coincided with faster motor kinematics. Although bias decreased across learning, expert mice continued to exhibit short bouts of biased choices interspersed with longer bouts of unbiased choices and higher performance. These findings collectively suggest that during learning, rodents actively probe their environment in a structured manner to refine their decision-making and maintain long-term flexibility.
Collapse
Affiliation(s)
- Ziyi Zhu
- Department of Psychological and Brain Sciences, Johns Hopkins University, Baltimore, MD 21218, USA; Johns Hopkins Kavli Neuroscience Discovery Institute, Johns Hopkins University, Baltimore, MD 21218, USA; The Solomon Snyder Department of Neuroscience, Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA
| | - Kishore V Kuchibhotla
- Department of Psychological and Brain Sciences, Johns Hopkins University, Baltimore, MD 21218, USA; Johns Hopkins Kavli Neuroscience Discovery Institute, Johns Hopkins University, Baltimore, MD 21218, USA; The Solomon Snyder Department of Neuroscience, Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA; Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD 21218, USA.
| |
Collapse
|
15
|
Chakravarty S, Delgado-Sallent C, Kane GA, Xia H, Do QH, Senne RA, Scott BB. A cross-species framework for investigating perceptual evidence accumulation. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.04.17.589945. [PMID: 38659929 PMCID: PMC11042372 DOI: 10.1101/2024.04.17.589945] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/26/2024]
Abstract
Cross-species studies are important for a comprehensive understanding of brain functions. However, direct quantitative comparison of behaviors across species presents a significant challenge. To enable such comparisons in perceptual decision-making, we developed a synchronized evidence accumulation task for rodents and humans, by aligning mechanics, stimuli, and training. Rats, mice and humans readily learned the task and exhibited qualitatively similar performance. Quantitative model comparison revealed that all three species employed an evidence accumulation strategy, but differed in speed, accuracy, and key decision parameters. Human performance prioritized accuracy, whereas rodent performance was limited by internal time-pressure. Rats optimized reward rate, while mice appeared to switch between evidence accumulation and other strategies trial-to-trial. Together, these results reveal striking similarities and species-specific priorities in decision-making. Furthermore, the synchronized behavioral framework we present may facilitate future studies involving cross-species comparisons, such as evaluating the face validity of animal models of neuropsychiatric disorders. Highlights Development of a free response evidence accumulation task for rats and miceSynchronized video game allows direct comparisons with humansRat, mouse and human behavior are well fit by the same decision modelsModel parameters reveal species-specific priorities in accumulation strategy.
Collapse
|
16
|
Bernklau TW, Righetti B, Mehrke LS, Jacob SN. Striatal dopamine signals reflect perceived cue-action-outcome associations in mice. Nat Neurosci 2024; 27:747-757. [PMID: 38291283 PMCID: PMC11001585 DOI: 10.1038/s41593-023-01567-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2022] [Accepted: 12/21/2023] [Indexed: 02/01/2024]
Abstract
Striatal dopamine drives associative learning by acting as a teaching signal. Much work has focused on simple learning paradigms, including Pavlovian and instrumental learning. However, higher cognition requires that animals generate internal concepts of their environment, where sensory stimuli, actions and outcomes become flexibly associated. Here, we performed fiber photometry dopamine measurements across the striatum of male mice as they learned cue-action-outcome associations based on implicit and changing task rules. Reinforcement learning models of the behavioral and dopamine data showed that rule changes lead to adjustments of learned cue-action-outcome associations. After rule changes, mice discarded learned associations and reset outcome expectations. Cue- and outcome-triggered dopamine signals became uncoupled and dependent on the adopted behavioral strategy. As mice learned the new association, coupling between cue- and outcome-triggered dopamine signals and task performance re-emerged. Our results suggest that dopaminergic reward prediction errors reflect an agent's perceived locus of control.
Collapse
Affiliation(s)
- Tobias W Bernklau
- Translational Neurotechnology Laboratory, Department of Neurosurgery, Klinikum rechts der Isar, Technical University of Munich, Munich, Germany
- Graduate School of Systemic Neurosciences, Ludwig-Maximilians-University Munich, Munich, Germany
| | - Beatrice Righetti
- Translational Neurotechnology Laboratory, Department of Neurosurgery, Klinikum rechts der Isar, Technical University of Munich, Munich, Germany
| | - Leonie S Mehrke
- Translational Neurotechnology Laboratory, Department of Neurosurgery, Klinikum rechts der Isar, Technical University of Munich, Munich, Germany
| | - Simon N Jacob
- Translational Neurotechnology Laboratory, Department of Neurosurgery, Klinikum rechts der Isar, Technical University of Munich, Munich, Germany.
| |
Collapse
|
17
|
Griffiths CS, Lebert JM, Sollini J, Bizley JK. Gradient boosted decision trees reveal nuances of auditory discrimination behavior. PLoS Comput Biol 2024; 20:e1011985. [PMID: 38626220 PMCID: PMC11051626 DOI: 10.1371/journal.pcbi.1011985] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2023] [Revised: 04/26/2024] [Accepted: 03/09/2024] [Indexed: 04/18/2024] Open
Abstract
Animal psychophysics can generate rich behavioral datasets, often comprised of many 1000s of trials for an individual subject. Gradient-boosted models are a promising machine learning approach for analyzing such data, partly due to the tools that allow users to gain insight into how the model makes predictions. We trained ferrets to report a target word's presence, timing, and lateralization within a stream of consecutively presented non-target words. To assess the animals' ability to generalize across pitch, we manipulated the fundamental frequency (F0) of the speech stimuli across trials, and to assess the contribution of pitch to streaming, we roved the F0 from word token to token. We then implemented gradient-boosted regression and decision trees on the trial outcome and reaction time data to understand the behavioral factors behind the ferrets' decision-making. We visualized model contributions by implementing SHAPs feature importance and partial dependency plots. While ferrets could accurately perform the task across all pitch-shifted conditions, our models reveal subtle effects of shifting F0 on performance, with within-trial pitch shifting elevating false alarms and extending reaction times. Our models identified a subset of non-target words that animals commonly false alarmed to. Follow-up analysis demonstrated that the spectrotemporal similarity of target and non-target words rather than similarity in duration or amplitude waveform was the strongest predictor of the likelihood of false alarming. Finally, we compared the results with those obtained with traditional mixed effects models, revealing equivalent or better performance for the gradient-boosted models over these approaches.
Collapse
Affiliation(s)
| | - Jules M. Lebert
- Ear Institute, University College London, London, United Kingdom
| | - Joseph Sollini
- Ear Institute, University College London, London, United Kingdom
- Hearing Sciences, University of Nottingham, Nottingham, United Kingdom
| | | |
Collapse
|
18
|
Couto J, Lebreton M, van Maanen L. Specificity and sensitivity of the fixed-point test for binary mixture distributions. Behav Res Methods 2024; 56:2977-2991. [PMID: 37957433 PMCID: PMC11133060 DOI: 10.3758/s13428-023-02244-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/17/2023] [Indexed: 11/15/2023]
Abstract
When two cognitive processes contribute to a behavioral output-each process producing a specific distribution of the behavioral variable of interest-and when the mixture proportion of these two processes varies as a function of an experimental condition, a common density point should be present in the observed distributions of the data across said conditions. In principle, one can statistically test for the presence (or absence) of a fixed point in experimental data to provide evidence in favor of (or against) the presence of a mixture of processes, whose proportions are affected by an experimental manipulation. In this paper, we provide an empirical diagnostic of this test to detect a mixture of processes. We do so using resampling of real experimental data under different scenarios, which mimic variations in the experimental design suspected to affect the sensitivity and specificity of the fixed-point test (i.e., mixture proportion, time on task, and sample size). Resampling such scenarios with real data allows us to preserve important features of data which are typically observed in real experiments while maintaining tight control over the properties of the resampled scenarios. This is of particular relevance considering such stringent assumptions underlying the fixed-point test. With this paper, we ultimately aim at validating the fixed-point property of binary mixture data and at providing some performance metrics to researchers aiming at testing the fixed-point property on their experimental data.
Collapse
Affiliation(s)
- Joaquina Couto
- Department of Psychology, University of Amsterdam, Amsterdam, Netherlands.
- Department of Experimental Psychology, Utrecht University, Utrecht, Netherlands.
| | - Maël Lebreton
- Swiss Center for Affective Sciences, University of Geneva, Geneva, Switzerland
- Paris School of Economics, Paris, France
| | - Leendert van Maanen
- Department of Experimental Psychology, Utrecht University, Utrecht, Netherlands
| |
Collapse
|
19
|
Maggi S, Hock RM, O'Neill M, Buckley M, Moran PM, Bast T, Sami M, Humphries MD. Tracking subjects' strategies in behavioural choice experiments at trial resolution. eLife 2024; 13:e86491. [PMID: 38426402 PMCID: PMC10959529 DOI: 10.7554/elife.86491] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2023] [Accepted: 02/23/2024] [Indexed: 03/02/2024] Open
Abstract
Investigating how, when, and what subjects learn during decision-making tasks requires tracking their choice strategies on a trial-by-trial basis. Here, we present a simple but effective probabilistic approach to tracking choice strategies at trial resolution using Bayesian evidence accumulation. We show this approach identifies both successful learning and the exploratory strategies used in decision tasks performed by humans, non-human primates, rats, and synthetic agents. Both when subjects learn and when rules change the exploratory strategies of win-stay and lose-shift, often considered complementary, are consistently used independently. Indeed, we find the use of lose-shift is strong evidence that subjects have latently learnt the salient features of a new rewarded rule. Our approach can be extended to any discrete choice strategy, and its low computational cost is ideally suited for real-time analysis and closed-loop control.
Collapse
Affiliation(s)
- Silvia Maggi
- School of Psychology, University of NottinghamNottinghamUnited Kingdom
| | - Rebecca M Hock
- School of Psychology, University of NottinghamNottinghamUnited Kingdom
| | - Martin O'Neill
- School of Psychology, University of NottinghamNottinghamUnited Kingdom
- Department of Health & Nutritional Sciences, Atlantic Technological UniversitySligoIreland
| | - Mark Buckley
- Department of Experimental Psychology, University of OxfordOxfordUnited Kingdom
| | - Paula M Moran
- School of Psychology, University of NottinghamNottinghamUnited Kingdom
- Department of Neuroscience, University of NottinghamNottinghamUnited Kingdom
| | - Tobias Bast
- School of Psychology, University of NottinghamNottinghamUnited Kingdom
- Department of Neuroscience, University of NottinghamNottinghamUnited Kingdom
| | - Musa Sami
- Institute of Mental Health, University of NottinghamNottinghamUnited Kingdom
| | - Mark D Humphries
- School of Psychology, University of NottinghamNottinghamUnited Kingdom
| |
Collapse
|
20
|
Colas JT, O’Doherty JP, Grafton ST. Active reinforcement learning versus action bias and hysteresis: control with a mixture of experts and nonexperts. PLoS Comput Biol 2024; 20:e1011950. [PMID: 38552190 PMCID: PMC10980507 DOI: 10.1371/journal.pcbi.1011950] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2023] [Accepted: 02/26/2024] [Indexed: 04/01/2024] Open
Abstract
Active reinforcement learning enables dynamic prediction and control, where one should not only maximize rewards but also minimize costs such as of inference, decisions, actions, and time. For an embodied agent such as a human, decisions are also shaped by physical aspects of actions. Beyond the effects of reward outcomes on learning processes, to what extent can modeling of behavior in a reinforcement-learning task be complicated by other sources of variance in sequential action choices? What of the effects of action bias (for actions per se) and action hysteresis determined by the history of actions chosen previously? The present study addressed these questions with incremental assembly of models for the sequential choice data from a task with hierarchical structure for additional complexity in learning. With systematic comparison and falsification of computational models, human choices were tested for signatures of parallel modules representing not only an enhanced form of generalized reinforcement learning but also action bias and hysteresis. We found evidence for substantial differences in bias and hysteresis across participants-even comparable in magnitude to the individual differences in learning. Individuals who did not learn well revealed the greatest biases, but those who did learn accurately were also significantly biased. The direction of hysteresis varied among individuals as repetition or, more commonly, alternation biases persisting from multiple previous actions. Considering that these actions were button presses with trivial motor demands, the idiosyncratic forces biasing sequences of action choices were robust enough to suggest ubiquity across individuals and across tasks requiring various actions. In light of how bias and hysteresis function as a heuristic for efficient control that adapts to uncertainty or low motivation by minimizing the cost of effort, these phenomena broaden the consilient theory of a mixture of experts to encompass a mixture of expert and nonexpert controllers of behavior.
Collapse
Affiliation(s)
- Jaron T. Colas
- Department of Psychological and Brain Sciences, University of California, Santa Barbara, California, United States of America
- Division of the Humanities and Social Sciences, California Institute of Technology, Pasadena, California, United States of America
- Computation and Neural Systems Program, California Institute of Technology, Pasadena, California, United States of America
| | - John P. O’Doherty
- Division of the Humanities and Social Sciences, California Institute of Technology, Pasadena, California, United States of America
- Computation and Neural Systems Program, California Institute of Technology, Pasadena, California, United States of America
| | - Scott T. Grafton
- Department of Psychological and Brain Sciences, University of California, Santa Barbara, California, United States of America
| |
Collapse
|
21
|
Del Río M, de Lange FP, Fritsche M, Ward J. Perceptual confirmation bias and decision bias underlie adaptation to sequential regularities. J Vis 2024; 24:5. [PMID: 38381426 PMCID: PMC10902869 DOI: 10.1167/jov.24.2.5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2023] [Accepted: 12/18/2023] [Indexed: 02/22/2024] Open
Abstract
Our perception does not depend exclusively on the immediate sensory input. It is also influenced by our internal predictions derived from prior observations and the temporal regularities of the environment, which can result in choice history biases. However, it is unclear how this flexible use of prior information to predict the future influences perceptual decisions. Prior information may bias decisions independently of the current sensory input, or it may modulate the weight of current sensory input based on its consistency with the expectation. To address this question, we used a visual decision-making task and manipulated the transitional probabilities between successive noisy grating stimuli. Using a reverse correlation analysis, we evaluated the contribution of stimulus-independent decision bias and stimulus-dependent sensitivity modulations to choice history biases. We found that both effects coexist, whereby there was increased bias to respond in line with the predicted orientation alongside modulations in perceptual sensitivity to favor perceptual information consistent with the prediction, akin to selective attention. Furthermore, at the individual differences level, we investigated the relationship between autistic-like traits and the adaptation of choice history biases to the sequential statistics of the environment. Over two studies, we found no convincing evidence of reduced adaptation to sequential regularities in individuals with high autistic-like traits. In sum, we present robust evidence for both perceptual confirmation bias and decision bias supporting adaptation to sequential regularities in the environment.
Collapse
Affiliation(s)
| | - Floris P de Lange
- Donders Institute for Brain, Cognition and Behaviour, Radboud University, Nijmegen, the Netherlands
| | - Matthias Fritsche
- Donders Institute for Brain, Cognition and Behaviour, Radboud University, Nijmegen, the Netherlands
- Department of Physiology, Anatomy & Genetics, University of Oxford, Oxford, UK
| | - Jamie Ward
- School of Psychology, University of Sussex, Brighton, UK
| |
Collapse
|
22
|
Gupta D, DePasquale B, Kopec CD, Brody CD. Trial-history biases in evidence accumulation can give rise to apparent lapses in decision-making. Nat Commun 2024; 15:662. [PMID: 38253526 PMCID: PMC10803295 DOI: 10.1038/s41467-024-44880-5] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2023] [Accepted: 01/04/2024] [Indexed: 01/24/2024] Open
Abstract
Trial history biases and lapses are two of the most common suboptimalities observed during perceptual decision-making. These suboptimalities are routinely assumed to arise from distinct processes. However, previous work has suggested that they covary in their prevalence and that their proposed neural substrates overlap. Here we demonstrate that during decision-making, history biases and apparent lapses can both arise from a common cognitive process that is optimal under mistaken beliefs that the world is changing i.e. nonstationary. This corresponds to an accumulation-to-bound model with history-dependent updates to the initial state of the accumulator. We test our model's predictions about the relative prevalence of history biases and lapses, and show that they are robustly borne out in two distinct decision-making datasets of male rats, including data from a novel reaction time task. Our model improves the ability to precisely predict decision-making dynamics within and across trials, by positing a process through which agents can generate quasi-stochastic choices.
Collapse
Affiliation(s)
- Diksha Gupta
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ, USA.
- Sainsbury Wellcome Centre, University College London, London, UK.
| | - Brian DePasquale
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ, USA
- Department of Biomedical Engineering, Boston University, Boston, MA, USA
| | - Charles D Kopec
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ, USA
| | - Carlos D Brody
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ, USA.
- Howard Hughes Medical Institute, Princeton University, Princeton, NJ, USA.
| |
Collapse
|
23
|
Gurnani H, Cayco Gajic NA. Signatures of task learning in neural representations. Curr Opin Neurobiol 2023; 83:102759. [PMID: 37708653 DOI: 10.1016/j.conb.2023.102759] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2023] [Revised: 06/28/2023] [Accepted: 07/20/2023] [Indexed: 09/16/2023]
Abstract
While neural plasticity has long been studied as the basis of learning, the growth of large-scale neural recording techniques provides a unique opportunity to study how learning-induced activity changes are coordinated across neurons within the same circuit. These distributed changes can be understood through an evolution of the geometry of neural manifolds and latent dynamics underlying new computations. In parallel, studies of multi-task and continual learning in artificial neural networks hint at a tradeoff between non-interference and compositionality as guiding principles to understand how neural circuits flexibly support multiple behaviors. In this review, we highlight recent findings from both biological and artificial circuits that together form a new framework for understanding task learning at the population level.
Collapse
Affiliation(s)
- Harsha Gurnani
- Department of Biology, University of Washington, Seattle, WA, USA. https://twitter.com/HarshaGurnani
| | - N Alex Cayco Gajic
- Laboratoire de Neuroscience Cognitives, Ecole Normale Supérieure, Université PSL, Paris, France.
| |
Collapse
|
24
|
Weilnhammer V, Stuke H, Standvoss K, Sterzer P. Sensory processing in humans and mice fluctuates between external and internal modes. PLoS Biol 2023; 21:e3002410. [PMID: 38064502 PMCID: PMC10732408 DOI: 10.1371/journal.pbio.3002410] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2023] [Revised: 12/20/2023] [Accepted: 10/30/2023] [Indexed: 12/21/2023] Open
Abstract
Perception is known to cycle through periods of enhanced and reduced sensitivity to external information. Here, we asked whether such slow fluctuations arise as a noise-related epiphenomenon of limited processing capacity or, alternatively, represent a structured mechanism of perceptual inference. Using 2 large-scale datasets, we found that humans and mice alternate between externally and internally oriented modes of sensory analysis. During external mode, perception aligns more closely with the external sensory information, whereas internal mode is characterized by enhanced biases toward perceptual history. Computational modeling indicated that dynamic changes in mode are enabled by 2 interlinked factors: (i) the integration of subsequent inputs over time and (ii) slow antiphase oscillations in the impact of external sensory information versus internal predictions that are provided by perceptual history. We propose that between-mode fluctuations generate unambiguous error signals that enable optimal inference in volatile environments.
Collapse
Affiliation(s)
- Veith Weilnhammer
- Department of Psychiatry, Charité-Universitätsmedizin Berlin, corporate member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Berlin, Germany
- Berlin Institute of Health, Charité-Universitätsmedizin Berlin and Max Delbrück Center, Berlin, Germany
- Helen Wills Neuroscience Institute, University of California Berkeley, Berkeley, California, United States of America
| | - Heiner Stuke
- Department of Psychiatry, Charité-Universitätsmedizin Berlin, corporate member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Berlin, Germany
- Berlin Institute of Health, Charité-Universitätsmedizin Berlin and Max Delbrück Center, Berlin, Germany
| | - Kai Standvoss
- Department of Psychiatry, Charité-Universitätsmedizin Berlin, corporate member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Berlin, Germany
| | - Philipp Sterzer
- Department of Psychiatry (UPK), University of Basel, Basel, Switzerland
| |
Collapse
|
25
|
Hajnal MA, Tran D, Einstein M, Martelo MV, Safaryan K, Polack PO, Golshani P, Orbán G. Continuous multiplexed population representations of task context in the mouse primary visual cortex. Nat Commun 2023; 14:6687. [PMID: 37865648 PMCID: PMC10590415 DOI: 10.1038/s41467-023-42441-w] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2023] [Accepted: 10/10/2023] [Indexed: 10/23/2023] Open
Abstract
Effective task execution requires the representation of multiple task-related variables that determine how stimuli lead to correct responses. Even the primary visual cortex (V1) represents other task-related variables such as expectations, choice, and context. However, it is unclear how V1 can flexibly accommodate these variables without interfering with visual representations. We trained mice on a context-switching cross-modal decision task, where performance depends on inferring task context. We found that the context signal that emerged in V1 was behaviorally relevant as it strongly covaried with performance, independent from movement. Importantly, this signal was integrated into V1 representation by multiplexing visual and context signals into orthogonal subspaces. In addition, auditory and choice signals were also multiplexed as these signals were orthogonal to the context representation. Thus, multiplexing allows V1 to integrate visual inputs with other sensory modalities and cognitive variables to avoid interference with the visual representation while ensuring the maintenance of task-relevant variables.
Collapse
Affiliation(s)
- Márton Albert Hajnal
- Department of Computational Sciences, Wigner Research Center for Physics, Budapest, 1121, Hungary
| | - Duy Tran
- Department of Neurology, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, 90095, USA
- Albert Einstein College of Medicine, New York, NY, 10461, USA
| | - Michael Einstein
- Department of Neurology, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, 90095, USA
| | - Mauricio Vallejo Martelo
- Department of Neurology, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, 90095, USA
| | - Karen Safaryan
- Department of Neurology, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, 90095, USA
| | - Pierre-Olivier Polack
- Center for Molecular and Behavioral Neuroscience, Rutgers University, Newark, NJ, 07102, USA
| | - Peyman Golshani
- Department of Neurology, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, 90095, USA.
- Integrative Center for Learning and Memory, Brain Research Institute, University of California, Los Angeles, Los Angeles, CA, 90095, USA.
- West Los Angeles VA Medical Center, CA, 90073, Los Angeles, USA.
| | - Gergő Orbán
- Department of Computational Sciences, Wigner Research Center for Physics, Budapest, 1121, Hungary.
| |
Collapse
|
26
|
Le NM, Yildirim M, Wang Y, Sugihara H, Jazayeri M, Sur M. Mixtures of strategies underlie rodent behavior during reversal learning. PLoS Comput Biol 2023; 19:e1011430. [PMID: 37708113 PMCID: PMC10501641 DOI: 10.1371/journal.pcbi.1011430] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2023] [Accepted: 08/09/2023] [Indexed: 09/16/2023] Open
Abstract
In reversal learning tasks, the behavior of humans and animals is often assumed to be uniform within single experimental sessions to facilitate data analysis and model fitting. However, behavior of agents can display substantial variability in single experimental sessions, as they execute different blocks of trials with different transition dynamics. Here, we observed that in a deterministic reversal learning task, mice display noisy and sub-optimal choice transitions even at the expert stages of learning. We investigated two sources of the sub-optimality in the behavior. First, we found that mice exhibit a high lapse rate during task execution, as they reverted to unrewarded directions after choice transitions. Second, we unexpectedly found that a majority of mice did not execute a uniform strategy, but rather mixed between several behavioral modes with different transition dynamics. We quantified the use of such mixtures with a state-space model, block Hidden Markov Model (block HMM), to dissociate the mixtures of dynamic choice transitions in individual blocks of trials. Additionally, we found that blockHMM transition modes in rodent behavior can be accounted for by two different types of behavioral algorithms, model-free or inference-based learning, that might be used to solve the task. Combining these approaches, we found that mice used a mixture of both exploratory, model-free strategies and deterministic, inference-based behavior in the task, explaining their overall noisy choice sequences. Together, our combined computational approach highlights intrinsic sources of noise in rodent reversal learning behavior and provides a richer description of behavior than conventional techniques, while uncovering the hidden states that underlie the block-by-block transitions.
Collapse
Affiliation(s)
- Nhat Minh Le
- Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
- Picower Institute for Learning and Memory, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
| | - Murat Yildirim
- Picower Institute for Learning and Memory, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
- Department of Neurosciences, Cleveland Clinic Lerner Research Institute, Cleveland, Ohio, United States of America
| | - Yizhi Wang
- Picower Institute for Learning and Memory, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
| | - Hiroki Sugihara
- Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
- Picower Institute for Learning and Memory, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
| | - Mehrdad Jazayeri
- Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
- McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
| | - Mriganka Sur
- Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
- Picower Institute for Learning and Memory, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
| |
Collapse
|
27
|
Maselli A, Gordon J, Eluchans M, Lancia GL, Thiery T, Moretti R, Cisek P, Pezzulo G. Beyond simple laboratory studies: Developing sophisticated models to study rich behavior. Phys Life Rev 2023; 46:220-244. [PMID: 37499620 DOI: 10.1016/j.plrev.2023.07.006] [Citation(s) in RCA: 16] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2023] [Accepted: 07/06/2023] [Indexed: 07/29/2023]
Abstract
Psychology and neuroscience are concerned with the study of behavior, of internal cognitive processes, and their neural foundations. However, most laboratory studies use constrained experimental settings that greatly limit the range of behaviors that can be expressed. While focusing on restricted settings ensures methodological control, it risks impoverishing the object of study: by restricting behavior, we might miss key aspects of cognitive and neural functions. In this article, we argue that psychology and neuroscience should increasingly adopt innovative experimental designs, measurement methods, analysis techniques and sophisticated computational models to probe rich, ecologically valid forms of behavior, including social behavior. We discuss the challenges of studying rich forms of behavior as well as the novel opportunities offered by state-of-the-art methodologies and new sensing technologies, and we highlight the importance of developing sophisticated formal models. We exemplify our arguments by reviewing some recent streams of research in psychology, neuroscience and other fields (e.g., sports analytics, ethology and robotics) that have addressed rich forms of behavior in a model-based manner. We hope that these "success cases" will encourage psychologists and neuroscientists to extend their toolbox of techniques with sophisticated behavioral models - and to use them to study rich forms of behavior as well as the cognitive and neural processes that they engage.
Collapse
Affiliation(s)
- Antonella Maselli
- Institute of Cognitive Sciences and Technologies, National Research Council, Rome, Italy
| | - Jeremy Gordon
- University of California, Berkeley, Berkeley, CA, 94704, United States
| | - Mattia Eluchans
- Institute of Cognitive Sciences and Technologies, National Research Council, Rome, Italy; University of Rome "La Sapienza", Rome, Italy
| | - Gian Luca Lancia
- Institute of Cognitive Sciences and Technologies, National Research Council, Rome, Italy; University of Rome "La Sapienza", Rome, Italy
| | - Thomas Thiery
- Department of Psychology, University of Montréal, Montréal, Québec, Canada
| | - Riccardo Moretti
- Institute of Cognitive Sciences and Technologies, National Research Council, Rome, Italy; University of Rome "La Sapienza", Rome, Italy
| | - Paul Cisek
- Department of Neuroscience, University of Montréal, Montréal, Québec, Canada
| | - Giovanni Pezzulo
- Institute of Cognitive Sciences and Technologies, National Research Council, Rome, Italy.
| |
Collapse
|
28
|
Daniels CW, Balsam PD. Prior experience modifies acquisition trajectories via response-strategy sampling. Anim Cogn 2023; 26:1217-1239. [PMID: 37036556 PMCID: PMC11034823 DOI: 10.1007/s10071-023-01769-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2020] [Revised: 03/05/2023] [Accepted: 03/22/2023] [Indexed: 04/11/2023]
Abstract
Few studies have considered how signal detection parameters evolve during acquisition periods. We addressed this gap by training mice with differential prior experience in a conditional discrimination, auditory signal detection task. Naïve mice, mice given separate experience with each of the later correct choice options (Correct Choice Response Transfer, CCRT), and mice experienced in conditional discriminations (Conditional Discrimination Transfer, CDT) were trained to detect the presence or absence of a tone in white noise. We analyzed data assuming a two-period model of acquisition: a pre-solution and solution period (Heinemann EG (1983) in The Presolution period and the detection of statistical associations. In: Quantitative analyses of behavior: discrimination processes, vol. 4, pp. 21-36). Ballinger. http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.536.1978andrep=rep1andtype=pdf ). The pre-solution period was characterized by a selective sampling of biased response strategies until adoption of a conditional responding strategy in the solution period. Correspondingly, discriminability remained low until the solution period; criterion took excursions reflecting response-strategy sampling. Prior experience affected the length and composition of the pre-solution period. Whereas CCRT and CDT mice had shorter pre-solution periods than naïve mice, CDT and Naïve mice developed substantial criterion biases and acquired asymptotic discriminability faster than CCRT mice. To explain these data, we propose a learning model in which mice selectively sample and test different response-strategies and corresponding task structures until they exit the pre-solution period. Upon exit, mice adopt the conditional responding strategy and task structure, with action values updated via inference and generalization from the other task structures. Simulations of representative mouse data illustrate the viability of this model.
Collapse
Affiliation(s)
- Carter W Daniels
- Department of Psychiatry, Columbia University, New York, USA.
- New York State Psychiatric Institute, New York, USA.
| | - Peter D Balsam
- Department of Psychiatry, Columbia University, New York, USA
- New York State Psychiatric Institute, New York, USA
- Department of Psychology, Barnard College, New York, USA
| |
Collapse
|
29
|
Charlton JA, Młynarski WF, Bai YH, Hermundstad AM, Goris RLT. Environmental dynamics shape perceptual decision bias. PLoS Comput Biol 2023; 19:e1011104. [PMID: 37289753 DOI: 10.1371/journal.pcbi.1011104] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2022] [Accepted: 04/13/2023] [Indexed: 06/10/2023] Open
Abstract
To interpret the sensory environment, the brain combines ambiguous sensory measurements with knowledge that reflects context-specific prior experience. But environmental contexts can change abruptly and unpredictably, resulting in uncertainty about the current context. Here we address two questions: how should context-specific prior knowledge optimally guide the interpretation of sensory stimuli in changing environments, and do human decision-making strategies resemble this optimum? We probe these questions with a task in which subjects report the orientation of ambiguous visual stimuli that were drawn from three dynamically switching distributions, representing different environmental contexts. We derive predictions for an ideal Bayesian observer that leverages knowledge about the statistical structure of the task to maximize decision accuracy, including knowledge about the dynamics of the environment. We show that its decisions are biased by the dynamically changing task context. The magnitude of this decision bias depends on the observer's continually evolving belief about the current context. The model therefore not only predicts that decision bias will grow as the context is indicated more reliably, but also as the stability of the environment increases, and as the number of trials since the last context switch grows. Analysis of human choice data validates all three predictions, suggesting that the brain leverages knowledge of the statistical structure of environmental change when interpreting ambiguous sensory signals.
Collapse
Affiliation(s)
- Julie A Charlton
- Center for Perceptual Systems, University of Texas at Austin, Austin, Texas, United States of America
| | | | - Yoon H Bai
- Center for Perceptual Systems, University of Texas at Austin, Austin, Texas, United States of America
| | - Ann M Hermundstad
- Janelia Research Campus, Howard Hughes Medical Institute, Ashburn, Virginia, United States of America
| | - Robbe L T Goris
- Center for Perceptual Systems, University of Texas at Austin, Austin, Texas, United States of America
| |
Collapse
|
30
|
Gupta D, DePasquale B, Kopec CD, Brody CD. Trial-history biases in evidence accumulation can give rise to apparent lapses. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.01.18.524599. [PMID: 36778392 PMCID: PMC9915493 DOI: 10.1101/2023.01.18.524599] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/21/2023]
Abstract
Trial history biases and lapses are two of the most common suboptimalities observed during perceptual decision-making. These suboptimalities are routinely assumed to arise from distinct processes. However, several hints in the literature suggest that they covary in their prevalence and that their proposed neural substrates overlap - what could underlie these links? Here we demonstrate that history biases and apparent lapses can both arise from a common cognitive process that is normative under misbeliefs about non-stationarity in the world. This corresponds to an accumulation-to-bound model with history-dependent updates to the initial state of the accumulator. We test our model's predictions about the relative prevalence of history biases and lapses, and show that they are robustly borne out in two distinct rat decision-making datasets, including data from a novel reaction time task. Our model improves the ability to precisely predict decision-making dynamics within and across trials, by positing a process through which agents can generate quasi-stochastic choices.
Collapse
Affiliation(s)
- Diksha Gupta
- Princeton Neuroscience Institute, Princeton University, Princeton, United States
| | - Brian DePasquale
- Princeton Neuroscience Institute, Princeton University, Princeton, United States
| | - Charles D Kopec
- Princeton Neuroscience Institute, Princeton University, Princeton, United States
| | - Carlos D Brody
- Princeton Neuroscience Institute, Princeton University, Princeton, United States
- Howard Hughes Medical Institute, Princeton University, Princeton, United States
| |
Collapse
|
31
|
Jaffe PI, Poldrack RA, Schafer RJ, Bissett PG. Modelling human behaviour in cognitive tasks with latent dynamical systems. Nat Hum Behav 2023:10.1038/s41562-022-01510-8. [PMID: 36658212 DOI: 10.1038/s41562-022-01510-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2022] [Accepted: 12/06/2022] [Indexed: 01/21/2023]
Abstract
Response time data collected from cognitive tasks are a cornerstone of psychology and neuroscience research, yet existing models of these data either make strong assumptions about the data-generating process or are limited to modelling single trials. We introduce task-DyVA, a deep learning framework in which expressive dynamical systems are trained to reproduce sequences of response times observed in data from individual human subjects. Models fitted to a large task-switching dataset captured subject-specific behavioural differences with high temporal precision, including task-switching costs. Through perturbation experiments and analyses of the models' latent dynamics, we find support for a rational account of switch costs in terms of a stability-flexibility trade-off. Thus, our framework can be used to discover interpretable cognitive theories that explain how the brain dynamically gives rise to behaviour.
Collapse
Affiliation(s)
- Paul I Jaffe
- Department of Psychology, Stanford University, Stanford, CA, USA. .,Lumos Labs, San Francisco, CA, USA.
| | | | | | | |
Collapse
|
32
|
Goudar V, Kim JW, Liu Y, Dede AJO, Jutras MJ, Skelin I, Ruvalcaba M, Chang W, Fairhall AL, Lin JJ, Knight RT, Buffalo EA, Wang XJ. Comparing rapid rule-learning strategies in humans and monkeys. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.01.10.523416. [PMID: 36711889 PMCID: PMC9882042 DOI: 10.1101/2023.01.10.523416] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/18/2023]
Abstract
Inter-species comparisons are key to deriving an understanding of the behavioral and neural correlates of human cognition from animal models. We perform a detailed comparison of macaque monkey and human strategies on an analogue of the Wisconsin Card Sort Test, a widely studied and applied multi-attribute measure of cognitive function, wherein performance requires the inference of a changing rule given ambiguous feedback. We found that well-trained monkeys rapidly infer rules but are three times slower than humans. Model fits to their choices revealed hidden states akin to feature-based attention in both species, and decision processes that resembled a Win-stay lose-shift strategy with key differences. Monkeys and humans test multiple rule hypotheses over a series of rule-search trials and perform inference-like computations to exclude candidates. An attention-set based learning stage categorization revealed that perseveration, random exploration and poor sensitivity to negative feedback explain the under-performance in monkeys.
Collapse
Affiliation(s)
- Vishwa Goudar
- Center for Neural Science, New York University, NY, USA
| | - Jeong-Woo Kim
- Center for Neural Science, New York University, NY, USA
| | - Yue Liu
- Center for Neural Science, New York University, NY, USA
| | - Adam J. O. Dede
- Department of Physiology and Biophysics, University of Washington, Seattle, WA, USA
| | - Michael J. Jutras
- Department of Physiology and Biophysics, University of Washington, Seattle, WA, USA
| | - Ivan Skelin
- Department of Neurology, University of California, Davis, Davis, CA, USA
- The Center for Mind and Brain, University of California, Davis, Davis, CA, USA
| | - Michael Ruvalcaba
- Helen Wills Neuroscience Institute, University of California Berkeley, Berkeley, CA, USA
| | - William Chang
- Helen Wills Neuroscience Institute, University of California Berkeley, Berkeley, CA, USA
| | - Adrienne L. Fairhall
- Department of Physiology and Biophysics, University of Washington, Seattle, WA, USA
| | - Jack J. Lin
- Department of Neurology, University of California, Davis, Davis, CA, USA
- The Center for Mind and Brain, University of California, Davis, Davis, CA, USA
| | - Robert T. Knight
- Department of Psychology, University of California Berkeley, Berkeley, CA, USA
- Helen Wills Neuroscience Institute, University of California Berkeley, Berkeley, CA, USA
| | - Elizabeth A. Buffalo
- Department of Physiology and Biophysics, University of Washington, Seattle, WA, USA
- Washington Primate Research Center, University of Washington, Seattle, WA, USA
| | | |
Collapse
|
33
|
Masís J, Chapman T, Rhee JY, Cox DD, Saxe AM. Strategically managing learning during perceptual decision making. eLife 2023; 12:64978. [PMID: 36786427 PMCID: PMC9928425 DOI: 10.7554/elife.64978] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2020] [Accepted: 01/15/2023] [Indexed: 02/15/2023] Open
Abstract
Making optimal decisions in the face of noise requires balancing short-term speed and accuracy. But a theory of optimality should account for the fact that short-term speed can influence long-term accuracy through learning. Here, we demonstrate that long-term learning is an important dynamical dimension of the speed-accuracy trade-off. We study learning trajectories in rats and formally characterize these dynamics in a theory expressed as both a recurrent neural network and an analytical extension of the drift-diffusion model that learns over time. The model reveals that choosing suboptimal response times to learn faster sacrifices immediate reward, but can lead to greater total reward. We empirically verify predictions of the theory, including a relationship between stimulus exposure and learning speed, and a modulation of reaction time by future learning prospects. We find that rats' strategies approximately maximize total reward over the full learning epoch, suggesting cognitive control over the learning process.
Collapse
Affiliation(s)
- Javier Masís
- Department of Molecular and Cellular Biology, Harvard UniversityCambridgeUnited States,Center for Brain Science, Harvard UniversityCambridgeUnited States
| | - Travis Chapman
- Center for Brain Science, Harvard UniversityCambridgeUnited States
| | - Juliana Y Rhee
- Department of Molecular and Cellular Biology, Harvard UniversityCambridgeUnited States,Center for Brain Science, Harvard UniversityCambridgeUnited States
| | - David D Cox
- Department of Molecular and Cellular Biology, Harvard UniversityCambridgeUnited States,Center for Brain Science, Harvard UniversityCambridgeUnited States
| | - Andrew M Saxe
- Department of Experimental Psychology, University of OxfordOxfordUnited Kingdom
| |
Collapse
|
34
|
Moore S, Kuchibhotla KV. Slow or sudden: Re-interpreting the learning curve for modern systems neuroscience. IBRO Neurosci Rep 2022; 13:9-14. [PMID: 35669385 PMCID: PMC9163689 DOI: 10.1016/j.ibneur.2022.05.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2022] [Revised: 05/20/2022] [Accepted: 05/23/2022] [Indexed: 10/27/2022] Open
|
35
|
Sylwestrak EL, Jo Y, Vesuna S, Wang X, Holcomb B, Tien RH, Kim DK, Fenno L, Ramakrishnan C, Allen WE, Chen R, Shenoy KV, Sussillo D, Deisseroth K. Cell-type-specific population dynamics of diverse reward computations. Cell 2022; 185:3568-3587.e27. [PMID: 36113428 PMCID: PMC10387374 DOI: 10.1016/j.cell.2022.08.019] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2021] [Revised: 06/16/2022] [Accepted: 08/17/2022] [Indexed: 01/26/2023]
Abstract
Computational analysis of cellular activity has developed largely independently of modern transcriptomic cell typology, but integrating these approaches may be essential for full insight into cellular-level mechanisms underlying brain function and dysfunction. Applying this approach to the habenula (a structure with diverse, intermingled molecular, anatomical, and computational features), we identified encoding of reward-predictive cues and reward outcomes in distinct genetically defined neural populations, including TH+ cells and Tac1+ cells. Data from genetically targeted recordings were used to train an optimized nonlinear dynamical systems model and revealed activity dynamics consistent with a line attractor. High-density, cell-type-specific electrophysiological recordings and optogenetic perturbation provided supporting evidence for this model. Reverse-engineering predicted how Tac1+ cells might integrate reward history, which was complemented by in vivo experimentation. This integrated approach describes a process by which data-driven computational models of population activity can generate and frame actionable hypotheses for cell-type-specific investigation in biological systems.
Collapse
Affiliation(s)
- Emily L Sylwestrak
- Department of Biology, University of Oregon, Eugene, OR 97403, USA; Department of Bioengineering, Stanford University, Stanford, CA 94305, USA; Institute of Neuroscience, University of Oregon, Eugene, OR 97403, USA.
| | - YoungJu Jo
- Department of Bioengineering, Stanford University, Stanford, CA 94305, USA; Department of Applied Physics, Stanford University, Stanford, CA 94305, USA
| | - Sam Vesuna
- Department of Bioengineering, Stanford University, Stanford, CA 94305, USA; Department of Psychiatry and Behavioral Sciences, Stanford University, Stanford, CA 94305, USA
| | - Xiao Wang
- Department of Bioengineering, Stanford University, Stanford, CA 94305, USA
| | - Blake Holcomb
- Institute of Neuroscience, University of Oregon, Eugene, OR 97403, USA
| | - Rebecca H Tien
- Department of Bioengineering, Stanford University, Stanford, CA 94305, USA
| | - Doo Kyung Kim
- Department of Bioengineering, Stanford University, Stanford, CA 94305, USA
| | - Lief Fenno
- Department of Bioengineering, Stanford University, Stanford, CA 94305, USA; Department of Psychiatry and Behavioral Sciences, Stanford University, Stanford, CA 94305, USA
| | - Charu Ramakrishnan
- Department of Bioengineering, Stanford University, Stanford, CA 94305, USA
| | - William E Allen
- Department of Bioengineering, Stanford University, Stanford, CA 94305, USA; Neurosciences Interdepartmental Program, Stanford University, Stanford, CA 94303, USA
| | - Ritchie Chen
- Department of Bioengineering, Stanford University, Stanford, CA 94305, USA
| | - Krishna V Shenoy
- Department of Neurobiology, Stanford University, Stanford, CA 94303, USA; Department of Electrical Engineering, Stanford University, Stanford, CA, USA; Wu Tsai Neurosciences Institute, Stanford University, Stanford, CA, USA; Howard Hughes Medical Institute, Stanford University, Stanford, CA 94305, USA
| | - David Sussillo
- Department of Electrical Engineering, Stanford University, Stanford, CA, USA; Wu Tsai Neurosciences Institute, Stanford University, Stanford, CA, USA
| | - Karl Deisseroth
- Department of Bioengineering, Stanford University, Stanford, CA 94305, USA; Department of Psychiatry and Behavioral Sciences, Stanford University, Stanford, CA 94305, USA; Wu Tsai Neurosciences Institute, Stanford University, Stanford, CA, USA; Howard Hughes Medical Institute, Stanford University, Stanford, CA 94305, USA.
| |
Collapse
|
36
|
Walters J, King M, Bissett PG, Ivry RB, Diedrichsen J, Poldrack RA. Predicting brain activation maps for arbitrary tasks with cognitive encoding models. Neuroimage 2022; 263:119610. [PMID: 36064138 PMCID: PMC9981816 DOI: 10.1016/j.neuroimage.2022.119610] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2022] [Revised: 08/12/2022] [Accepted: 09/02/2022] [Indexed: 11/27/2022] Open
Abstract
A deep understanding of the neural architecture of mental function should enable the accurate prediction of a specific pattern of brain activity for any psychological task, based only on the cognitive functions known to be engaged by that task. Encoding models (EMs), which predict neural responses from known features (e.g., stimulus properties), have succeeded in circumscribed domains (e.g., visual neuroscience), but implementing domain-general EMs that predict brain-wide activity for arbitrary tasks has been limited mainly by availability of datasets that 1) sufficiently span a large space of psychological functions, and 2) are sufficiently annotated with such functions to allow robust EM specification. We examine the use of EMs based on a formal specification of psychological function, to predict cortical activation patterns across a broad range of tasks. We utilized the Multi-Domain Task Battery, a dataset in which 24 subjects completed 32 ten-minute fMRI scans, switching tasks every 35 s and engaging in 44 total conditions of diverse psychological manipulations. Conditions were annotated by a group of experts using the Cognitive Atlas ontology to identify putatively engaged functions, and region-wise cognitive EMs (CEMs) were fit, for individual subjects, on neocortical responses. We found that CEMs predicted cortical activation maps of held-out tasks with high accuracy, outperforming a permutation-based null model while approaching the noise ceiling of the data, without being driven solely by either cognitive or perceptual-motor features. Hierarchical clustering on the similarity structure of CEM generalization errors revealed relationships amongst psychological functions. Spatial distributions of feature importances systematically overlapped with large-scale resting-state functional networks (RSNs), supporting the hypothesis of functional specialization within RSNs while grounding their function in an interpretable data-driven manner. Our implementation and validation of CEMs provides a proof of principle for the utility of formal ontologies in cognitive neuroscience and motivates the use of CEMs in the further testing of cognitive theories.
Collapse
Affiliation(s)
- Jonathon Walters
- Department of Psychology, Stanford University, Stanford, CA, USA.
| | - Maedbh King
- Department of Psychology, University of California Berkeley, Berkeley, CA, USA
| | | | - Richard B. Ivry
- Department of Psychology, University of California Berkeley, Berkeley, CA, USA,Helen Wills Neuroscience Institute, University of California Berkeley, Berkeley, CA, USA
| | - Jörn Diedrichsen
- Brain and Mind Institute, Western University, London, Ontario, Canada,Department of Computer Science, Western University, London, Ontario, Canada
| | | |
Collapse
|
37
|
Nour MM, Liu Y, Dolan RJ. Functional neuroimaging in psychiatry and the case for failing better. Neuron 2022; 110:2524-2544. [PMID: 35981525 DOI: 10.1016/j.neuron.2022.07.005] [Citation(s) in RCA: 33] [Impact Index Per Article: 16.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2022] [Revised: 06/06/2022] [Accepted: 07/08/2022] [Indexed: 12/27/2022]
Abstract
Psychiatric disorders encompass complex aberrations of cognition and affect and are among the most debilitating and poorly understood of any medical condition. Current treatments rely primarily on interventions that target brain function (drugs) or learning processes (psychotherapy). A mechanistic understanding of how these interventions mediate their therapeutic effects remains elusive. From the early 1990s, non-invasive functional neuroimaging, coupled with parallel developments in the cognitive neurosciences, seemed to signal a new era of neurobiologically grounded diagnosis and treatment in psychiatry. Yet, despite three decades of intense neuroimaging research, we still lack a neurobiological account for any psychiatric condition. Likewise, functional neuroimaging plays no role in clinical decision making. Here, we offer a critical commentary on this impasse and suggest how the field might fare better and deliver impactful neurobiological insights.
Collapse
Affiliation(s)
- Matthew M Nour
- Max Planck University College London Centre for Computational Psychiatry and Ageing Research, London WC1B 5EH, UK; Wellcome Trust Centre for Human Neuroimaging, University College London, London WC1N 3AR, UK; Department of Psychiatry, University of Oxford, Oxford OX3 7JX, UK.
| | - Yunzhe Liu
- Max Planck University College London Centre for Computational Psychiatry and Ageing Research, London WC1B 5EH, UK; State Key Laboratory of Cognitive Neuroscience and Learning, IDG/McGovern Institute for Brain Research, Beijing Normal University, Beijing 100875, China; Chinese Institute for Brain Research, Beijing 102206, China
| | - Raymond J Dolan
- Max Planck University College London Centre for Computational Psychiatry and Ageing Research, London WC1B 5EH, UK; Wellcome Trust Centre for Human Neuroimaging, University College London, London WC1N 3AR, UK; State Key Laboratory of Cognitive Neuroscience and Learning, IDG/McGovern Institute for Brain Research, Beijing Normal University, Beijing 100875, China.
| |
Collapse
|
38
|
Yin X, Wang Y, Li J, Guo ZV. Lateralization of short-term memory in the frontal cortex. Cell Rep 2022; 40:111190. [PMID: 35977520 DOI: 10.1016/j.celrep.2022.111190] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2021] [Revised: 06/04/2022] [Accepted: 07/20/2022] [Indexed: 11/03/2022] Open
Abstract
Despite essentially symmetric structures in mammalian brains, the left and right hemispheres do not contribute equally to certain cognitive functions. How both hemispheres interact to cause this asymmetry remains unclear. Here, we study this question in the anterior lateral motor cortex (ALM) of mice performing five versions of a tactile-based decision-making task with a short-term memory (STM) component. Unilateral inhibition of ALM produces variable behavioral deficits across tasks, with the left, right, or both ALMs playing critical roles in STM. Neural activity and its encoding capability are similar across hemispheres, despite that only one hemisphere dominates in behavior. Inhibition of the dominant ALM disrupts encoding capability in the non-dominant ALM, but not vice versa. Variable behavioral deficits are predicted by the influence on contralateral activity across sessions, mice, and tasks. Together, these results reveal that the left and right ALM interact asymmetrically, leading to their differential contributions to STM.
Collapse
Affiliation(s)
- Xinxin Yin
- School of Medicine, Tsinghua University, 100084 Beijing, China; IDG/McGovern Institute for Brain Research, Tsinghua University, 100084 Beijing, China; Tsinghua-Peking Joint Center for Life Sciences, 100084 Beijing, China; School of Life Sciences, Tsinghua University, 100084 Beijing, China
| | - Yu Wang
- IDG/McGovern Institute for Brain Research, Tsinghua University, 100084 Beijing, China; Tsinghua-Peking Joint Center for Life Sciences, 100084 Beijing, China; School of Life Sciences, Tsinghua University, 100084 Beijing, China
| | - Jiejue Li
- IDG/McGovern Institute for Brain Research, Tsinghua University, 100084 Beijing, China; Tsinghua-Peking Joint Center for Life Sciences, 100084 Beijing, China; School of Life Sciences, Tsinghua University, 100084 Beijing, China
| | - Zengcai V Guo
- School of Medicine, Tsinghua University, 100084 Beijing, China; IDG/McGovern Institute for Brain Research, Tsinghua University, 100084 Beijing, China; Tsinghua-Peking Joint Center for Life Sciences, 100084 Beijing, China.
| |
Collapse
|
39
|
Tseng SY, Chettih SN, Arlt C, Barroso-Luque R, Harvey CD. Shared and specialized coding across posterior cortical areas for dynamic navigation decisions. Neuron 2022; 110:2484-2502.e16. [PMID: 35679861 PMCID: PMC9357051 DOI: 10.1016/j.neuron.2022.05.012] [Citation(s) in RCA: 19] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2020] [Revised: 01/31/2022] [Accepted: 05/13/2022] [Indexed: 11/25/2022]
Abstract
Animals adaptively integrate sensation, planning, and action to navigate toward goal locations in ever-changing environments, but the functional organization of cortex supporting these processes remains unclear. We characterized encoding in approximately 90,000 neurons across the mouse posterior cortex during a virtual navigation task with rule switching. The encoding of task and behavioral variables was highly distributed across cortical areas but differed in magnitude, resulting in three spatial gradients for visual cue, spatial position plus dynamics of choice formation, and locomotion, with peaks respectively in visual, retrosplenial, and parietal cortices. Surprisingly, the conjunctive encoding of these variables in single neurons was similar throughout the posterior cortex, creating high-dimensional representations in all areas instead of revealing computations specialized for each area. We propose that, for guiding navigation decisions, the posterior cortex operates in parallel rather than hierarchically, and collectively generates a state representation of the behavior and environment, with each area specialized in handling distinct information modalities.
Collapse
Affiliation(s)
- Shih-Yi Tseng
- Department of Neurobiology, Harvard Medical School, Boston, MA 02115, USA
| | - Selmaan N Chettih
- Department of Neurobiology, Harvard Medical School, Boston, MA 02115, USA
| | - Charlotte Arlt
- Department of Neurobiology, Harvard Medical School, Boston, MA 02115, USA
| | | | | |
Collapse
|
40
|
Schreiner DC, Cazares C, Renteria R, Gremel CM. Information normally considered task-irrelevant drives decision-making and affects premotor circuit recruitment. Nat Commun 2022; 13:2134. [PMID: 35440120 PMCID: PMC9018678 DOI: 10.1038/s41467-022-29807-2] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2021] [Accepted: 03/24/2022] [Indexed: 02/02/2023] Open
Abstract
Decision-making is a continuous and dynamic process with prior experience reflected in and used by the brain to guide adaptive behavior. However, most neurobiological studies constrain behavior and/or analyses to task-related variables, not accounting for the continuous internal and temporal space in which they occur. We show mice rely on information learned through recent and longer-term experience beyond just prior actions and reward - including checking behavior and the passage of time - to guide self-initiated, self-paced, and self-generated actions. These experiences are represented in secondary motor cortex (M2) activity and its projections into dorsal medial striatum (DMS). M2 integrates this information to bias strategy-level decision-making, and DMS projections reflect specific aspects of this recent experience to guide actions. This suggests diverse aspects of experience drive decision-making and its neural representation, and shows premotor corticostriatal circuits are crucial for using selective aspects of experiential information to guide adaptive behavior.
Collapse
Affiliation(s)
- Drew C Schreiner
- Department of Psychology, University of California San Diego, La Jolla, CA, 92093, USA
| | - Christian Cazares
- The Neurosciences Graduate Program, University of California San Diego, La Jolla, CA, 92093, USA
| | - Rafael Renteria
- Department of Psychology, University of California San Diego, La Jolla, CA, 92093, USA
| | - Christina M Gremel
- Department of Psychology, University of California San Diego, La Jolla, CA, 92093, USA.
- The Neurosciences Graduate Program, University of California San Diego, La Jolla, CA, 92093, USA.
| |
Collapse
|
41
|
Beron CC, Neufeld SQ, Linderman SW, Sabatini BL. Mice exhibit stochastic and efficient action switching during probabilistic decision making. Proc Natl Acad Sci U S A 2022; 119:e2113961119. [PMID: 35385355 PMCID: PMC9169659 DOI: 10.1073/pnas.2113961119] [Citation(s) in RCA: 20] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2021] [Accepted: 03/03/2022] [Indexed: 12/05/2022] Open
Abstract
In probabilistic and nonstationary environments, individuals must use internal and external cues to flexibly make decisions that lead to desirable outcomes. To gain insight into the process by which animals choose between actions, we trained mice in a task with time-varying reward probabilities. In our implementation of such a two-armed bandit task, thirsty mice use information about recent action and action–outcome histories to choose between two ports that deliver water probabilistically. Here we comprehensively modeled choice behavior in this task, including the trial-to-trial changes in port selection, i.e., action switching behavior. We find that mouse behavior is, at times, deterministic and, at others, apparently stochastic. The behavior deviates from that of a theoretically optimal agent performing Bayesian inference in a hidden Markov model (HMM). We formulate a set of models based on logistic regression, reinforcement learning, and sticky Bayesian inference that we demonstrate are mathematically equivalent and that accurately describe mouse behavior. The switching behavior of mice in the task is captured in each model by a stochastic action policy, a history-dependent representation of action value, and a tendency to repeat actions despite incoming evidence. The models parsimoniously capture behavior across different environmental conditionals by varying the stickiness parameter, and like the mice, they achieve nearly maximal reward rates. These results indicate that mouse behavior reaches near-maximal performance with reduced action switching and can be described by a set of equivalent models with a small number of relatively fixed parameters.
Collapse
Affiliation(s)
- Celia C. Beron
- Department of Neurobiology, Harvard Medical School, Boston, MA 02115
- HHMI, Harvard Medical School, Boston, MA 02115
| | - Shay Q. Neufeld
- Department of Neurobiology, Harvard Medical School, Boston, MA 02115
- HHMI, Harvard Medical School, Boston, MA 02115
| | - Scott W. Linderman
- Department of Statistics, Stanford University, Stanford, CA 94305
- Wu Tsai Neurosciences Institute, Stanford University, Stanford, CA 94305
| | - Bernardo L. Sabatini
- Department of Neurobiology, Harvard Medical School, Boston, MA 02115
- HHMI, Harvard Medical School, Boston, MA 02115
| |
Collapse
|
42
|
Bolkan SS, Stone IR, Pinto L, Ashwood ZC, Iravedra Garcia JM, Herman AL, Singh P, Bandi A, Cox J, Zimmerman CA, Cho JR, Engelhard B, Pillow JW, Witten IB. Opponent control of behavior by dorsomedial striatal pathways depends on task demands and internal state. Nat Neurosci 2022; 25:345-357. [PMID: 35260863 PMCID: PMC8915388 DOI: 10.1038/s41593-022-01021-9] [Citation(s) in RCA: 27] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2020] [Accepted: 01/21/2022] [Indexed: 11/27/2022]
Abstract
A classic view of the striatum holds that activity in direct and indirect pathways oppositely modulates motor output. Whether this involves direct control of movement, or reflects a cognitive process underlying movement, remains unresolved. Here we find that strong, opponent control of behavior by the two pathways of the dorsomedial striatum depends on the cognitive requirements of a task. Furthermore, a latent state model (a hidden Markov model with generalized linear model observations) reveals that-even within a single task-the contribution of the two pathways to behavior is state dependent. Specifically, the two pathways have large contributions in one of two states associated with a strategy of evidence accumulation, compared to a state associated with a strategy of repeating previous choices. Thus, both the demands imposed by a task, as well as the internal state of mice when performing a task, determine whether dorsomedial striatum pathways provide strong and opponent control of behavior.
Collapse
Affiliation(s)
- Scott S Bolkan
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ, USA
| | - Iris R Stone
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ, USA
| | - Lucas Pinto
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ, USA
| | - Zoe C Ashwood
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ, USA
| | | | - Alison L Herman
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ, USA
| | - Priyanka Singh
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ, USA
| | - Akhil Bandi
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ, USA
| | - Julia Cox
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ, USA
| | | | - Jounhong Ryan Cho
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ, USA
| | - Ben Engelhard
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ, USA
| | - Jonathan W Pillow
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ, USA.
- Department of Psychology, Princeton University, Princeton, NJ, USA.
| | - Ilana B Witten
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ, USA.
- Department of Psychology, Princeton University, Princeton, NJ, USA.
| |
Collapse
|
43
|
Nguyen QN, Reinagel P. Different Forms of Variability Could Explain a Difference Between Human and Rat Decision Making. Front Neurosci 2022; 16:794681. [PMID: 35273473 PMCID: PMC8902138 DOI: 10.3389/fnins.2022.794681] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2021] [Accepted: 01/17/2022] [Indexed: 11/16/2022] Open
Abstract
When observers make rapid, difficult perceptual decisions, their response time is highly variable from trial to trial. In a visual motion discrimination task, it has been reported that human accuracy declines with increasing response time, whereas rat accuracy increases with response time. This is of interest because different mathematical theories of decision-making differ in their predictions regarding the correlation of accuracy with response time. On the premise that perceptual decision-making mechanisms are likely to be conserved among mammals, we seek to unify the rodent and primate results in a common theoretical framework. We show that a bounded drift diffusion model (DDM) can explain both effects with variable parameters: trial-to-trial variability in the starting point of the diffusion process produces the pattern typically observed in rats, whereas variability in the drift rate produces the pattern typically observed in humans. We further show that the same effects can be produced by deterministic biases, even in the absence of parameter stochasticity or parameter change within a trial.
Collapse
Affiliation(s)
| | - Pamela Reinagel
- Section of Neurobiology, Division of Biological Sciences, University of California, San Diego, San Diego, CA, United States
| |
Collapse
|
44
|
Cochrane A, Green CS. Assessing the functions underlying learning using by-trial and by-participant models: Evidence from two visual perceptual learning paradigms. J Vis 2021; 21:5. [PMID: 34905053 PMCID: PMC8684311 DOI: 10.1167/jov.21.13.5] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
Abstract
Inferred mechanisms of learning, such as those involved in improvements resulting from perceptual training, are reliant on (and reflect) the functional forms that models of learning take. However, previous investigations of the functional forms of perceptual learning have been limited in ways that are incompatible with the known mechanisms of learning. For instance, previous work has overwhelmingly aggregated learning data across learning participants, learning trials, or both. Here we approach the study of the functional form of perceptual learning on the by-person and by-trial levels at which the mechanisms of learning are expected to act. Each participant completed one of two visual perceptual learning tasks over the course of two days, with the first 75% of task performance using a single reference stimulus (i.e., "training") and the last 25% using an orthogonal reference stimulus (to test generalization). Five learning functions, coming from either the exponential or the power family, were fit to each participant's data. The exponential family was uniformly supported by Bayesian Information Criteria (BIC) model comparisons. The simplest exponential function was the best fit to learning on a texture oddball detection task, while a Weibull (augmented exponential) function tended to be the best fit to learning on a dot-motion discrimination task. The support for the exponential family corroborated previous by-person investigations of the functional form of learning, while the novel evidence supporting the Weibull learning model has implications for both the analysis and the mechanistic bases of the learning.
Collapse
Affiliation(s)
- Aaron Cochrane
- Faculty of Psychology and Education Sciences, University of Geneva, Geneva, Switzerland.,
| | - C Shawn Green
- Department of Psychology, University of Wisconsin-Madison, Madison, WI, USA.,
| |
Collapse
|
45
|
Aguillon-Rodriguez V, Angelaki D, Bayer H, Bonacchi N, Carandini M, Cazettes F, Chapuis G, Churchland AK, Dan Y, Dewitt E, Faulkner M, Forrest H, Haetzel L, Häusser M, Hofer SB, Hu F, Khanal A, Krasniak C, Laranjeira I, Mainen ZF, Meijer G, Miska NJ, Mrsic-Flogel TD, Murakami M, Noel JP, Pan-Vazquez A, Rossant C, Sanders J, Socha K, Terry R, Urai AE, Vergara H, Wells M, Wilson CJ, Witten IB, Wool LE, Zador AM. Standardized and reproducible measurement of decision-making in mice. eLife 2021; 10:63711. [PMID: 34011433 DOI: 10.1101/2020.01.17.909838] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2020] [Accepted: 04/08/2021] [Indexed: 05/25/2023] Open
Abstract
Progress in science requires standardized assays whose results can be readily shared, compared, and reproduced across laboratories. Reproducibility, however, has been a concern in neuroscience, particularly for measurements of mouse behavior. Here, we show that a standardized task to probe decision-making in mice produces reproducible results across multiple laboratories. We adopted a task for head-fixed mice that assays perceptual and value-based decision making, and we standardized training protocol and experimental hardware, software, and procedures. We trained 140 mice across seven laboratories in three countries, and we collected 5 million mouse choices into a publicly available database. Learning speed was variable across mice and laboratories, but once training was complete there were no significant differences in behavior across laboratories. Mice in different laboratories adopted similar reliance on visual stimuli, on past successes and failures, and on estimates of stimulus prior probability to guide their choices. These results reveal that a complex mouse behavior can be reproduced across multiple laboratories. They establish a standard for reproducible rodent behavior, and provide an unprecedented dataset and open-access tools to study decision-making in mice. More generally, they indicate a path toward achieving reproducibility in neuroscience through collaborative open-science approaches.
Collapse
Affiliation(s)
| | - Dora Angelaki
- Center for Neural Science, New York University, New York, United States
| | - Hannah Bayer
- Zuckerman Institute, Columbia University, New York, United States
| | | | - Matteo Carandini
- UCL Institute of Ophthalmology, University College London, London, United Kingdom
| | | | - Gaelle Chapuis
- Wolfson Institute for Biomedical Research, University College London, London, United Kingdom
| | | | - Yang Dan
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, United States
| | - Eric Dewitt
- Champalimaud Centre for the Unknown, Lisbon, Portugal
| | - Mayo Faulkner
- Wolfson Institute for Biomedical Research, University College London, London, United Kingdom
| | - Hamish Forrest
- UCL Institute of Ophthalmology, University College London, London, United Kingdom
| | - Laura Haetzel
- Princeton Neuroscience Institute, Princeton University, Princeton, United States
| | - Michael Häusser
- Wolfson Institute for Biomedical Research, University College London, London, United Kingdom
| | - Sonja B Hofer
- Sainsbury-Wellcome Centre for Neural Circuits and Behaviour, University College London, London, United Kingdom
| | - Fei Hu
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, United States
| | - Anup Khanal
- Cold Spring Harbor Laboratory, New York, United States
| | - Christopher Krasniak
- Cold Spring Harbor Laboratory, New York, United States
- Watson School of Biological Sciences, New York, United States
| | | | | | - Guido Meijer
- Champalimaud Centre for the Unknown, Lisbon, Portugal
| | - Nathaniel J Miska
- Sainsbury-Wellcome Centre for Neural Circuits and Behaviour, University College London, London, United Kingdom
| | - Thomas D Mrsic-Flogel
- Sainsbury-Wellcome Centre for Neural Circuits and Behaviour, University College London, London, United Kingdom
| | | | - Jean-Paul Noel
- Center for Neural Science, New York University, New York, United States
| | | | - Cyrille Rossant
- UCL Queen Square Institute of Neurology, University College London, London, United Kingdom
| | | | - Karolina Socha
- UCL Institute of Ophthalmology, University College London, London, United Kingdom
| | - Rebecca Terry
- UCL Queen Square Institute of Neurology, University College London, London, United Kingdom
| | - Anne E Urai
- Cold Spring Harbor Laboratory, New York, United States
- Cognitive Psychology Unit, Leiden University, Leiden, Netherlands
| | - Hernando Vergara
- Sainsbury-Wellcome Centre for Neural Circuits and Behaviour, University College London, London, United Kingdom
| | - Miles Wells
- UCL Queen Square Institute of Neurology, University College London, London, United Kingdom
| | | | - Ilana B Witten
- Princeton Neuroscience Institute, Princeton University, Princeton, United States
| | - Lauren E Wool
- UCL Queen Square Institute of Neurology, University College London, London, United Kingdom
| | | |
Collapse
|
46
|
Aguillon-Rodriguez V, Angelaki D, Bayer H, Bonacchi N, Carandini M, Cazettes F, Chapuis G, Churchland AK, Dan Y, Dewitt E, Faulkner M, Forrest H, Haetzel L, Häusser M, Hofer SB, Hu F, Khanal A, Krasniak C, Laranjeira I, Mainen ZF, Meijer G, Miska NJ, Mrsic-Flogel TD, Murakami M, Noel JP, Pan-Vazquez A, Rossant C, Sanders J, Socha K, Terry R, Urai AE, Vergara H, Wells M, Wilson CJ, Witten IB, Wool LE, Zador AM. Standardized and reproducible measurement of decision-making in mice. eLife 2021; 10:e63711. [PMID: 34011433 PMCID: PMC8137147 DOI: 10.7554/elife.63711] [Citation(s) in RCA: 63] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2020] [Accepted: 04/08/2021] [Indexed: 12/20/2022] Open
Abstract
Progress in science requires standardized assays whose results can be readily shared, compared, and reproduced across laboratories. Reproducibility, however, has been a concern in neuroscience, particularly for measurements of mouse behavior. Here, we show that a standardized task to probe decision-making in mice produces reproducible results across multiple laboratories. We adopted a task for head-fixed mice that assays perceptual and value-based decision making, and we standardized training protocol and experimental hardware, software, and procedures. We trained 140 mice across seven laboratories in three countries, and we collected 5 million mouse choices into a publicly available database. Learning speed was variable across mice and laboratories, but once training was complete there were no significant differences in behavior across laboratories. Mice in different laboratories adopted similar reliance on visual stimuli, on past successes and failures, and on estimates of stimulus prior probability to guide their choices. These results reveal that a complex mouse behavior can be reproduced across multiple laboratories. They establish a standard for reproducible rodent behavior, and provide an unprecedented dataset and open-access tools to study decision-making in mice. More generally, they indicate a path toward achieving reproducibility in neuroscience through collaborative open-science approaches.
Collapse
Affiliation(s)
- The International Brain Laboratory
- Cold Spring Harbor LaboratoryNew YorkUnited States
- Center for Neural Science, New York UniversityNew YorkUnited States
- Zuckerman Institute, Columbia UniversityNew YorkUnited States
- Champalimaud Centre for the UnknownLisbonPortugal
- UCL Institute of Ophthalmology, University College LondonLondonUnited Kingdom
- Wolfson Institute for Biomedical Research, University College LondonLondonUnited Kingdom
- Department of Molecular and Cell Biology, University of California, BerkeleyBerkeleyUnited States
- Princeton Neuroscience Institute, Princeton UniversityPrincetonUnited States
- Sainsbury-Wellcome Centre for Neural Circuits and Behaviour, University College LondonLondonUnited Kingdom
- Watson School of Biological SciencesNew YorkUnited States
- UCL Queen Square Institute of Neurology, University College LondonLondonUnited Kingdom
- Sanworks LLCNew YorkUnited States
- Cognitive Psychology Unit, Leiden UniversityLeidenNetherlands
| | | | - Dora Angelaki
- Center for Neural Science, New York UniversityNew YorkUnited States
| | - Hannah Bayer
- Zuckerman Institute, Columbia UniversityNew YorkUnited States
| | | | - Matteo Carandini
- UCL Institute of Ophthalmology, University College LondonLondonUnited Kingdom
| | | | - Gaelle Chapuis
- Wolfson Institute for Biomedical Research, University College LondonLondonUnited Kingdom
| | | | - Yang Dan
- Department of Molecular and Cell Biology, University of California, BerkeleyBerkeleyUnited States
| | - Eric Dewitt
- Champalimaud Centre for the UnknownLisbonPortugal
| | - Mayo Faulkner
- Wolfson Institute for Biomedical Research, University College LondonLondonUnited Kingdom
| | - Hamish Forrest
- UCL Institute of Ophthalmology, University College LondonLondonUnited Kingdom
| | - Laura Haetzel
- Princeton Neuroscience Institute, Princeton UniversityPrincetonUnited States
| | - Michael Häusser
- Wolfson Institute for Biomedical Research, University College LondonLondonUnited Kingdom
| | - Sonja B Hofer
- Sainsbury-Wellcome Centre for Neural Circuits and Behaviour, University College LondonLondonUnited Kingdom
| | - Fei Hu
- Department of Molecular and Cell Biology, University of California, BerkeleyBerkeleyUnited States
| | - Anup Khanal
- Cold Spring Harbor LaboratoryNew YorkUnited States
| | - Christopher Krasniak
- Cold Spring Harbor LaboratoryNew YorkUnited States
- Watson School of Biological SciencesNew YorkUnited States
| | | | | | - Guido Meijer
- Champalimaud Centre for the UnknownLisbonPortugal
| | - Nathaniel J Miska
- Sainsbury-Wellcome Centre for Neural Circuits and Behaviour, University College LondonLondonUnited Kingdom
| | - Thomas D Mrsic-Flogel
- Sainsbury-Wellcome Centre for Neural Circuits and Behaviour, University College LondonLondonUnited Kingdom
| | | | - Jean-Paul Noel
- Center for Neural Science, New York UniversityNew YorkUnited States
| | | | - Cyrille Rossant
- UCL Queen Square Institute of Neurology, University College LondonLondonUnited Kingdom
| | | | - Karolina Socha
- UCL Institute of Ophthalmology, University College LondonLondonUnited Kingdom
| | - Rebecca Terry
- UCL Queen Square Institute of Neurology, University College LondonLondonUnited Kingdom
| | - Anne E Urai
- Cold Spring Harbor LaboratoryNew YorkUnited States
- Cognitive Psychology Unit, Leiden UniversityLeidenNetherlands
| | - Hernando Vergara
- Sainsbury-Wellcome Centre for Neural Circuits and Behaviour, University College LondonLondonUnited Kingdom
| | - Miles Wells
- UCL Queen Square Institute of Neurology, University College LondonLondonUnited Kingdom
| | | | - Ilana B Witten
- Princeton Neuroscience Institute, Princeton UniversityPrincetonUnited States
| | - Lauren E Wool
- UCL Queen Square Institute of Neurology, University College LondonLondonUnited Kingdom
| | | |
Collapse
|
47
|
Nienborg H, Meyer EE. Neuroscience needs behavior: inferring psychophysical strategy trial by trial. Neuron 2021; 109:561-563. [PMID: 33600750 DOI: 10.1016/j.neuron.2021.01.025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
Abstract
Tools quantifying dynamic behavior are important to understand brain function. In this issue of Neuron, Roy et al. (2021) extended the available repertoire with PsyTrack, which tracks, trial by trial, how subjects performing psychophysical tasks adjust the way they weigh stimuli and task covariates or change their biases.
Collapse
Affiliation(s)
- Hendrikje Nienborg
- Laboratory of Sensorimotor Research, National Eye Institute, National Institutes of Health, Bethesda, MD 20892, USA.
| | - Emily E Meyer
- Laboratory of Sensorimotor Research, National Eye Institute, National Institutes of Health, Bethesda, MD 20892, USA
| |
Collapse
|
48
|
Ashwood ZC, Roy NA, Bak JH, Pillow JW. Inferring learning rules from animal decision-making. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 2020; 33:3442-3453. [PMID: 36177341 PMCID: PMC9518972] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
How do animals learn? This remains an elusive question in neuroscience. Whereas reinforcement learning often focuses on the design of algorithms that enable artificial agents to efficiently learn new tasks, here we develop a modeling framework to directly infer the empirical learning rules that animals use to acquire new behaviors. Our method efficiently infers the trial-to-trial changes in an animal's policy, and decomposes those changes into a learning component and a noise component. Specifically, this allows us to: (i) compare different learning rules and objective functions that an animal may be using to update its policy; (ii) estimate distinct learning rates for different parameters of an animal's policy; (iii) identify variations in learning across cohorts of animals; and (iv) uncover trial-to-trial changes that are not captured by normative learning rules. After validating our framework on simulated choice data, we applied our model to data from rats and mice learning perceptual decision-making tasks. We found that certain learning rules were far more capable of explaining trial-to-trial changes in an animal's policy. Whereas the average contribution of the conventional REINFORCE learning rule to the policy update for mice learning the International Brain Laboratory's task was just 30%, we found that adding baseline parameters allowed the learning rule to explain 92% of the animals' policy updates under our model. Intriguingly, the best-fitting learning rates and baseline values indicate that an animal's policy update, at each trial, does not occur in the direction that maximizes expected reward. Understanding how an animal transitions from chance-level to high-accuracy performance when learning a new task not only provides neuroscientists with insight into their animals, but also provides concrete examples of biological learning algorithms to the machine learning community.
Collapse
Affiliation(s)
- Zoe C Ashwood
- Princeton Neuroscience Institute, Princeton University
- Dept. of Computer Science, Princeton University
| | | | - Ji Hyun Bak
- Redwood Center for Theoretical Neuroscience, UC Berkeley
| | - Jonathan W Pillow
- Princeton Neuroscience Institute, Princeton University
- Dept. of Psychology, Princeton University
| |
Collapse
|