1
|
Fang C, Aronov D, Abbott LF, Mackevicius EL. Neural learning rules for generating flexible predictions and computing the successor representation. eLife 2023; 12:e80680. [PMID: 36928104 PMCID: PMC10019889 DOI: 10.7554/elife.80680] [Citation(s) in RCA: 14] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2022] [Accepted: 10/26/2022] [Indexed: 03/18/2023] Open
Abstract
The predictive nature of the hippocampus is thought to be useful for memory-guided cognitive behaviors. Inspired by the reinforcement learning literature, this notion has been formalized as a predictive map called the successor representation (SR). The SR captures a number of observations about hippocampal activity. However, the algorithm does not provide a neural mechanism for how such representations arise. Here, we show the dynamics of a recurrent neural network naturally calculate the SR when the synaptic weights match the transition probability matrix. Interestingly, the predictive horizon can be flexibly modulated simply by changing the network gain. We derive simple, biologically plausible learning rules to learn the SR in a recurrent network. We test our model with realistic inputs and match hippocampal data recorded during random foraging. Taken together, our results suggest that the SR is more accessible in neural circuits than previously thought and can support a broad range of cognitive functions.
Collapse
Affiliation(s)
- Ching Fang
- Zuckerman Institute, Department of Neuroscience, Columbia UniversityNew YorkUnited States
| | - Dmitriy Aronov
- Zuckerman Institute, Department of Neuroscience, Columbia UniversityNew YorkUnited States
| | - LF Abbott
- Zuckerman Institute, Department of Neuroscience, Columbia UniversityNew YorkUnited States
| | - Emily L Mackevicius
- Zuckerman Institute, Department of Neuroscience, Columbia UniversityNew YorkUnited States
- Basis Research InstituteNew YorkUnited States
| |
Collapse
|
2
|
Ji-An L, Stefanini F, Benna MK, Fusi S, La Porta CA. Face familiarity detection with complex synapses. iScience 2022; 26:105856. [PMID: 36636347 PMCID: PMC9829748 DOI: 10.1016/j.isci.2022.105856] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2022] [Revised: 11/30/2022] [Accepted: 12/16/2022] [Indexed: 12/24/2022] Open
Abstract
Synaptic plasticity is a complex phenomenon involving multiple biochemical processes that operate on different timescales. Complexity can greatly increase memory capacity when the variables characterizing the synaptic dynamics have limited precision, as shown in simple memory retrieval problems involving random patterns. Here we turn to a real-world problem, face familiarity detection, and we show that synaptic complexity can be harnessed to store in memory a large number of faces that can be recognized at a later time. The number of recognizable faces grows almost linearly with the number of synapses and quadratically with the number of neurons. Complex synapses outperform simple ones characterized by a single variable, even when the total number of dynamical variables is matched. Complex and simple synapses have distinct signatures that are testable in experiments. Our results indicate that a system with complex synapses can be used in real-world tasks such as face familiarity detection.
Collapse
Affiliation(s)
- Li Ji-An
- Zuckerman Institute, Columbia University, New York, NY 10027, USA,Neurosciences Graduate Program, University of California San Diego, La Jolla, CA 92093, USA
| | - Fabio Stefanini
- Zuckerman Institute, Columbia University, New York, NY 10027, USA
| | - Marcus K. Benna
- Zuckerman Institute, Columbia University, New York, NY 10027, USA,Department of Neurobiology, School of Biological Sciences, University of California San Diego, La Jolla, CA 92093, USA,Corresponding author
| | - Stefano Fusi
- Zuckerman Institute, Columbia University, New York, NY 10027, USA,Corresponding author
| | | |
Collapse
|
3
|
Dekker RB, Otto F, Summerfield C. Curriculum learning for human compositional generalization. Proc Natl Acad Sci U S A 2022; 119:e2205582119. [PMID: 36191191 PMCID: PMC9564093 DOI: 10.1073/pnas.2205582119] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2022] [Accepted: 08/29/2022] [Indexed: 11/18/2022] Open
Abstract
Generalization (or transfer) is the ability to repurpose knowledge in novel settings. It is often asserted that generalization is an important ingredient of human intelligence, but its extent, nature, and determinants have proved controversial. Here, we examine this ability with a paradigm that formalizes the transfer learning problem as one of recomposing existing functions to solve unseen problems. We find that people can generalize compositionally in ways that are elusive for standard neural networks and that human generalization benefits from training regimes in which items are axis aligned and temporally correlated. We describe a neural network model based around a Hebbian gating process that can capture how human generalization benefits from different training curricula. We additionally find that adult humans tend to learn composable functions asynchronously, exhibiting discontinuities in learning that resemble those seen in child development.
Collapse
Affiliation(s)
- Ronald B. Dekker
- Department of Experimental Psychology, University of Oxford, Oxford, OX2 6GG, United Kingdom
| | - Fabian Otto
- Department of Experimental Psychology, University of Oxford, Oxford, OX2 6GG, United Kingdom
| | - Christopher Summerfield
- Department of Experimental Psychology, University of Oxford, Oxford, OX2 6GG, United Kingdom
| |
Collapse
|
4
|
|
5
|
Lazartigues L, Mathy F, Lavigne F. Statistical learning of unbalanced exclusive-or temporal sequences in humans. PLoS One 2021; 16:e0246826. [PMID: 33592012 PMCID: PMC7886115 DOI: 10.1371/journal.pone.0246826] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2020] [Accepted: 01/27/2021] [Indexed: 11/26/2022] Open
Abstract
A pervasive issue in statistical learning has been to determine the parameters of regularity extraction. Our hypothesis was that the extraction of transitional probabilities can prevail over frequency if the task involves prediction. Participants were exposed to four repeated sequences of three stimuli (XYZ) with each stimulus corresponding to the position of a red dot on a touch screen that participants were required to touch sequentially. The temporal and spatial structure of the positions corresponded to a serial version of the exclusive-or (XOR) that allowed testing of the respective effect of frequency and first- and second-order transitional probabilities. The XOR allowed the first-order transitional probability to vary while being not completely related to frequency and to vary while the second-order transitional probability was fixed (p(Z|X, Y) = 1). The findings show that first-order transitional probability prevails over frequency to predict the second stimulus from the first and that it also influences the prediction of the third item despite the presence of second-order transitional probability that could have offered a certain prediction of the third item. These results are particularly informative in light of statistical learning models.
Collapse
Affiliation(s)
- Laura Lazartigues
- Department of Psychology, Université Côte d’Azur, CNRS, BCL, Nice, France
- * E-mail:
| | - Fabien Mathy
- Department of Psychology, Université Côte d’Azur, CNRS, BCL, Nice, France
| | - Frédéric Lavigne
- Department of Psychology, Université Côte d’Azur, CNRS, BCL, Nice, France
| |
Collapse
|
6
|
Bouchacourt F, Palminteri S, Koechlin E, Ostojic S. Temporal chunking as a mechanism for unsupervised learning of task-sets. eLife 2020; 9:50469. [PMID: 32149602 PMCID: PMC7108869 DOI: 10.7554/elife.50469] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2019] [Accepted: 02/24/2020] [Indexed: 12/26/2022] Open
Abstract
Depending on environmental demands, humans can learn and exploit multiple concurrent sets of stimulus-response associations. Mechanisms underlying the learning of such task-sets remain unknown. Here we investigate the hypothesis that task-set learning relies on unsupervised chunking of stimulus-response associations that occur in temporal proximity. We examine behavioral and neural data from a task-set learning experiment using a network model. We first show that task-set learning can be achieved provided the timescale of chunking is slower than the timescale of stimulus-response learning. Fitting the model to behavioral data on a subject-by-subject basis confirmed this expectation and led to specific predictions linking chunking and task-set retrieval that were borne out by behavioral performance and reaction times. Comparing the model activity with BOLD signal allowed us to identify neural correlates of task-set retrieval in a functional network involving ventral and dorsal prefrontal cortex, with the dorsal system preferentially engaged when retrievals are used to improve performance.
Collapse
Affiliation(s)
- Flora Bouchacourt
- Laboratoire de Neurosciences Cognitives et Computationnelles, Institut National de la Sante et de la Recherche Medicale, Paris, France.,Departement d'Etudes Cognitives, Ecole Normale Superieure, Paris, France
| | - Stefano Palminteri
- Laboratoire de Neurosciences Cognitives et Computationnelles, Institut National de la Sante et de la Recherche Medicale, Paris, France.,Departement d'Etudes Cognitives, Ecole Normale Superieure, Paris, France.,Institut d'Etudes de la Cognition, Universite de Recherche Paris Sciences et Lettres, Paris, France
| | - Etienne Koechlin
- Laboratoire de Neurosciences Cognitives et Computationnelles, Institut National de la Sante et de la Recherche Medicale, Paris, France.,Departement d'Etudes Cognitives, Ecole Normale Superieure, Paris, France
| | - Srdjan Ostojic
- Laboratoire de Neurosciences Cognitives et Computationnelles, Institut National de la Sante et de la Recherche Medicale, Paris, France.,Departement d'Etudes Cognitives, Ecole Normale Superieure, Paris, France.,Institut d'Etudes de la Cognition, Universite de Recherche Paris Sciences et Lettres, Paris, France
| |
Collapse
|
7
|
Carson RG, Kennedy NC. Modulation of human corticospinal excitability by paired associative stimulation. Front Hum Neurosci 2013; 7:823. [PMID: 24348369 PMCID: PMC3847812 DOI: 10.3389/fnhum.2013.00823] [Citation(s) in RCA: 109] [Impact Index Per Article: 9.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2013] [Accepted: 11/14/2013] [Indexed: 12/04/2022] Open
Abstract
Paired Associative Stimulation (PAS) has come to prominence as a potential therapeutic intervention for the treatment of brain injury/disease, and as an experimental method with which to investigate Hebbian principles of neural plasticity in humans. Prototypically, a single electrical stimulus is directed to a peripheral nerve in advance of transcranial magnetic stimulation (TMS) delivered to the contralateral primary motor cortex (M1). Repeated pairing of the stimuli (i.e., association) over an extended period may increase or decrease the excitability of corticospinal projections from M1, in manner that depends on the interstimulus interval (ISI). It has been suggested that these effects represent a form of associative long-term potentiation (LTP) and depression (LTD) that bears resemblance to spike-timing dependent plasticity (STDP) as it has been elaborated in animal models. With a large body of empirical evidence having emerged since the cardinal features of PAS were first described, and in light of the variations from the original protocols that have been implemented, it is opportune to consider whether the phenomenology of PAS remains consistent with the characteristic features that were initially disclosed. This assessment necessarily has bearing upon interpretation of the effects of PAS in relation to the specific cellular pathways that are putatively engaged, including those that adhere to the rules of STDP. The balance of evidence suggests that the mechanisms that contribute to the LTP- and LTD-type responses to PAS differ depending on the precise nature of the induction protocol that is used. In addition to emphasizing the requirement for additional explanatory models, in the present analysis we highlight the key features of the PAS phenomenology that require interpretation.
Collapse
Affiliation(s)
- Richard G Carson
- Trinity College Institute of Neuroscience and School of Psychology, Trinity College Dublin Dublin, Ireland ; School of Psychology, Queen's University Belfast Belfast, UK
| | - Niamh C Kennedy
- School of Psychology, Queen's University Belfast Belfast, UK ; School of Rehabilitation Sciences University of East Anglia Norwich, UK
| |
Collapse
|
8
|
Iigaya K, Fusi S. Dynamical regimes in neural network models of matching behavior. Neural Comput 2013; 25:3093-112. [PMID: 24047324 DOI: 10.1162/neco_a_00522] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]
Abstract
The matching law constitutes a quantitative description of choice behavior that is often observed in foraging tasks. According to the matching law, organisms distribute their behavior across available response alternatives in the same proportion that reinforcers are distributed across those alternatives. Recently a few biophysically plausible neural network models have been proposed to explain the matching behavior observed in the experiments. Here we study systematically the learning dynamics of these networks while performing a matching task on the concurrent variable interval (VI) schedule. We found that the model neural network can operate in one of three qualitatively different regimes depending on the parameters that characterize the synaptic dynamics and the reward schedule: (1) a matching behavior regime, in which the probability of choosing an option is roughly proportional to the baiting fractional probability of that option; (2) a perseverative regime, in which the network tends to make always the same decision; and (3) a tristable regime, in which the network can either perseverate or choose the two targets randomly approximately with the same probability. Different parameters of the synaptic dynamics lead to different types of deviations from the matching law, some of which have been observed experimentally. We show that the performance of the network depends on the number of stable states of each synapse and that bistable synapses perform close to optimal when the proper learning rate is chosen. Because our model provides a link between synaptic dynamics and qualitatively different behaviors, this work provides us with insight into the effects of neuromodulators on adaptive behaviors and psychiatric disorders.
Collapse
Affiliation(s)
- Kiyohito Iigaya
- Center for Theoretical Neuroscience, Department of Neuroscience, Columbia University Medical Center, New York, NY 10032, and Department of Physics, Columbia University, New York, NY 10027, U.S.A.
| | | |
Collapse
|