1. Dubinsky JM, Hamid AA. The neuroscience of active learning and direct instruction. Neurosci Biobehav Rev 2024;163:105737. PMID: 38796122. DOI: 10.1016/j.neubiorev.2024.105737.
Abstract
Throughout the educational system, students experiencing active learning pedagogy perform better and fail less than those taught through direct instruction. Can this be ascribed to differences in learning from a neuroscientific perspective? This review examines mechanistic, neuroscientific evidence that might explain differences in cognitive engagement contributing to learning outcomes between these instructional approaches. In classrooms, direct instruction comprehensively describes academic content, while active learning provides structured opportunities for learners to explore, apply, and manipulate content. Synaptic plasticity and its modulation by arousal or novelty are central to all learning and both approaches. As a form of social learning, direct instruction relies upon working memory. The reinforcement learning circuit, associated agency, curiosity, and peer-to-peer social interactions combine to enhance motivation, improve retention, and build higher-order-thinking skills in active learning environments. When working memory becomes overwhelmed, additionally engaging the reinforcement learning circuit improves retention, providing an explanation for the benefits of active learning. This analysis provides a mechanistic examination of how emerging neuroscience principles might inform pedagogical choices at all educational levels.
Affiliation(s)
- Janet M Dubinsky
- Department of Neuroscience, University of Minnesota, Minneapolis, MN, USA
- Arif A Hamid
- Department of Neuroscience, University of Minnesota, Minneapolis, MN, USA
2. Fascianelli V, Battista A, Stefanini F, Tsujimoto S, Genovesio A, Fusi S. Neural representational geometries reflect behavioral differences in monkeys and recurrent neural networks. Nat Commun 2024;15:6479. PMID: 39090091. PMCID: PMC11294567. DOI: 10.1038/s41467-024-50503-w.
Abstract
Animals likely use a variety of strategies to solve laboratory tasks. Combining behavioral and neural recording data across subjects that employ different strategies may obscure important signals and give confusing results. Hence, it is essential to develop techniques that can infer strategy at the single-subject level. We analyzed an experiment in which two male monkeys performed a visually cued rule-based task. The analysis of their performance shows no indication that they used different strategies. However, when we examined the geometry of stimulus representations in the state space of the neural activities recorded in dorsolateral prefrontal cortex, we found striking differences between the two monkeys. Our purely neural results prompted us to reanalyze the behavior. The new analysis showed that the differences in representational geometry are associated with differences in the reaction times, revealing behavioral differences we were unaware of. All these analyses suggest that the monkeys are using different strategies. Finally, using recurrent neural network models trained to perform the same task, we show that these strategies correlate with the amount of training, suggesting a possible explanation for the observed neural and behavioral differences.
Affiliation(s)
- Valeria Fascianelli
- Center for Theoretical Neuroscience, Columbia University, New York, NY, USA
- Zuckerman Mind Brain Behavior Institute, Columbia University, New York, NY, USA
- Aldo Battista
- Center for Neural Science, New York University, New York, NY, USA
- Fabio Stefanini
- Center for Theoretical Neuroscience, Columbia University, New York, NY, USA
- Zuckerman Mind Brain Behavior Institute, Columbia University, New York, NY, USA
- Aldo Genovesio
- Department of Physiology and Pharmacology, Sapienza University of Rome, Rome, Italy
- Stefano Fusi
- Center for Theoretical Neuroscience, Columbia University, New York, NY, USA
- Zuckerman Mind Brain Behavior Institute, Columbia University, New York, NY, USA
- Department of Neuroscience, Vagelos College of Physicians and Surgeons, Columbia University Irving Medical Center, New York, NY, USA
- Kavli Institute for Brain Science, Columbia University, New York, NY, USA
3. Scott DN, Mukherjee A, Nassar MR, Halassa MM. Thalamocortical architectures for flexible cognition and efficient learning. Trends Cogn Sci 2024;28:739-756. PMID: 38886139. PMCID: PMC11305962. DOI: 10.1016/j.tics.2024.05.006.
Abstract
The brain exhibits a remarkable ability to learn and execute context-appropriate behaviors. How it achieves such flexibility, without sacrificing learning efficiency, is an important open question. Neuroscience, psychology, and engineering suggest that reusing and repurposing computations are part of the answer. Here, we review evidence that thalamocortical architectures may have evolved to facilitate these objectives of flexibility and efficiency by coordinating distributed computations. Recent work suggests that distributed prefrontal cortical networks compute with flexible codes, and that the mediodorsal thalamus provides regularization to promote efficient reuse. Thalamocortical interactions resemble hierarchical Bayesian computations, and their network implementation can be related to existing gating, synchronization, and hub theories of thalamic function. By reviewing recent findings and providing a novel synthesis, we highlight key research horizons integrating computation, cognition, and systems neuroscience.
Affiliation(s)
- Daniel N Scott
- Department of Neuroscience, Brown University, Providence, RI, USA
- Robert J. and Nancy D. Carney Institute for Brain Science, Brown University, Providence, RI, USA
- Arghya Mukherjee
- Department of Neuroscience, Tufts University School of Medicine, Boston, MA, USA
- Matthew R Nassar
- Department of Neuroscience, Brown University, Providence, RI, USA
- Robert J. and Nancy D. Carney Institute for Brain Science, Brown University, Providence, RI, USA
- Michael M Halassa
- Department of Neuroscience, Tufts University School of Medicine, Boston, MA, USA
- Department of Psychiatry, Tufts University School of Medicine, Boston, MA, USA
4. Lv Q, Chen G, Yang Z, Zhong W, Chen CYC. Meta learning with graph attention networks for low-data drug discovery. IEEE Trans Neural Netw Learn Syst 2024;35:11218-11230. PMID: 37028032. DOI: 10.1109/tnnls.2023.3250324.
Abstract
Finding candidate molecules with favorable pharmacological activity, low toxicity, and proper pharmacokinetic properties is an important task in drug discovery. Deep neural networks have made impressive progress in accelerating and improving drug discovery. However, these techniques rely on a large amount of labeled data to form accurate predictions of molecular properties. At each stage of the drug discovery pipeline, usually only limited biological data on candidate molecules and their derivatives are available, indicating that the application of deep neural networks to low-data drug discovery remains a formidable challenge. Here, we propose a meta-learning architecture with a graph attention network, Meta-GAT, to predict molecular properties in low-data drug discovery. The GAT captures the local effects of atomic groups at the atom level through a triple attentional mechanism and implicitly captures the interactions between different atomic groups at the molecular level. GAT is used to perceive the molecular chemical environment and connectivity, thereby effectively reducing sample complexity. Meta-GAT further develops a meta-learning strategy based on bilevel optimization, which transfers meta-knowledge from other attribute prediction tasks to low-data target tasks. In summary, our work demonstrates how meta-learning can reduce the amount of data required to make meaningful predictions of molecules in low-data scenarios. Meta-learning is likely to become the new learning paradigm in low-data drug discovery. The source code is publicly available at: https://github.com/lol88/Meta-GAT.
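The bilevel optimization at the heart of this abstract follows the general MAML recipe: adapt to each task with an inner gradient step, then update the shared initialization from the post-adaptation loss. A minimal first-order sketch on a one-parameter linear model (the toy tasks and learning rates are invented for illustration; this is not Meta-GAT itself):

```python
# First-order bilevel (MAML-style) meta-learning on a one-parameter
# linear model y = w * x: adapt per task with one inner gradient step,
# then update the shared initialization from the post-adaptation loss.

def inner_adapt(w, x, y, lr=0.1):
    grad = 2 * x * (w * x - y)      # d/dw of the squared error
    return w - lr * grad

w_meta = 0.0
tasks = [(1.0, 2.0), (1.0, 3.0)]    # (input, target) per toy task
for _ in range(100):
    for x, y in tasks:
        w_task = inner_adapt(w_meta, x, y)        # inner loop: adapt
        grad_outer = 2 * x * (w_task * x - y)     # loss after adapting
        w_meta -= 0.05 * grad_outer               # outer (meta) update
```

The meta-initialization settles near the midpoint of the task targets, from which a single inner step moves quickly toward either task.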
5. Zhang R, Pitkow X, Angelaki DE. Inductive biases of neural network modularity in spatial navigation. Sci Adv 2024;10:eadk1256. PMID: 39028809. PMCID: PMC11259174. DOI: 10.1126/sciadv.adk1256.
Abstract
The brain may have evolved a modular architecture for daily tasks, with circuits featuring functionally specialized modules that match the task structure. We hypothesize that this architecture enables better learning and generalization than architectures with less specialized modules. To test this, we trained reinforcement learning agents with various neural architectures on a naturalistic navigation task. We found that the modular agent, with an architecture that segregates computations of state representation, value, and action into specialized modules, achieved better learning and generalization. Its learned state representation combines prediction and observation, weighted by their relative uncertainty, akin to recursive Bayesian estimation. This agent's behavior also resembles macaques' behavior more closely. Our results shed light on the possible rationale for the brain's modularity and suggest that artificial systems can use this insight from neuroscience to improve learning and generalization in natural tasks.
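The uncertainty-weighted fusion of prediction and observation described here is the scalar form of a recursive Bayesian (Kalman-style) update. A minimal sketch, with made-up numbers, of how the posterior mean interpolates between the two sources in proportion to their relative uncertainty:

```python
# Scalar recursive Bayesian (Kalman-style) update: the posterior mean
# combines prediction and observation, weighted by their relative
# uncertainties (variances).

def bayes_update(pred_mean, pred_var, obs, obs_var):
    """Fuse a prediction and an observation of the same latent state."""
    k = pred_var / (pred_var + obs_var)   # weight given to the observation
    mean = pred_mean + k * (obs - pred_mean)
    var = (1.0 - k) * pred_var
    return mean, var

# Equal uncertainties: the estimate lands halfway between the two.
m, v = bayes_update(pred_mean=0.0, pred_var=1.0, obs=2.0, obs_var=1.0)
```

A near-noiseless observation dominates the fused estimate, while a very noisy one is largely ignored.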
Affiliation(s)
- Ruiyi Zhang
- Tandon School of Engineering, New York University, New York, NY, USA
- Xaq Pitkow
- Neuroscience Institute, Carnegie Mellon University, Pittsburgh, PA, USA
- Department of Machine Learning, Carnegie Mellon University, Pittsburgh, PA, USA
- Department of Neuroscience, Baylor College of Medicine, Houston, TX, USA
- Department of Electrical and Computer Engineering, Rice University, Houston, TX, USA
- Center for Neuroscience and Artificial Intelligence, Baylor College of Medicine, Houston, TX, USA
- Dora E. Angelaki
- Tandon School of Engineering, New York University, New York, NY, USA
- Center for Neural Science, New York University, New York, NY, USA
6. Cone I, Clopath C, Shouval HZ. Learning to express reward prediction error-like dopaminergic activity requires plastic representations of time. Nat Commun 2024;15:5856. PMID: 38997276. PMCID: PMC11245539. DOI: 10.1038/s41467-024-50205-3.
Abstract
The dominant theoretical framework to account for reinforcement learning in the brain is temporal difference (TD) learning, whereby certain units signal reward prediction errors (RPE). The TD algorithm has traditionally been mapped onto the dopaminergic system, as the firing properties of dopamine neurons can resemble RPEs. However, certain predictions of TD learning are inconsistent with experimental results, and previous implementations of the algorithm have made unscalable assumptions regarding stimulus-specific fixed temporal bases. We propose an alternate framework to describe dopamine signaling in the brain, FLEX (Flexibly Learned Errors in Expected Reward). In FLEX, dopamine release is similar, but not identical, to RPE, leading to predictions that contrast with those of TD. While FLEX itself is a general theoretical framework, we describe a specific, biophysically plausible implementation, the results of which are consistent with a preponderance of both existing and reanalyzed experimental data.
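For readers comparing FLEX against the classical account, a minimal tabular TD(0) sketch shows the RPE the abstract refers to, delta = r + gamma * V(s') - V(s), and how the prediction migrates from reward to cue with training (illustrative background only, not the FLEX model):

```python
# Minimal tabular TD(0): the reward prediction error (RPE)
# delta = r + gamma * V(s') - V(s) is the quantity classically
# mapped onto dopaminergic firing.

def td0_update(V, s, r, s_next, alpha=0.1, gamma=0.9):
    """Compute the RPE and apply one TD(0) value update in place."""
    rpe = r + gamma * V.get(s_next, 0.0) - V.get(s, 0.0)
    V[s] = V.get(s, 0.0) + alpha * rpe
    return rpe

# Two-state chain: cue -> reward. With training, V(cue) comes to
# predict the upcoming reward, so the RPE at the cue shrinks.
V = {}
for _ in range(200):
    td0_update(V, "cue", 0.0, "reward_state")
    td0_update(V, "reward_state", 1.0, "terminal")
```

After training, V("reward_state") approaches 1 and V("cue") approaches gamma * 1 = 0.9, so the reward itself is no longer surprising.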
Affiliation(s)
- Ian Cone
- Department of Bioengineering, Imperial College London, London, UK
- Department of Neurobiology and Anatomy, University of Texas Medical School at Houston, Houston, TX, USA
- Applied Physics Program, Rice University, Houston, TX, USA
- Claudia Clopath
- Department of Bioengineering, Imperial College London, London, UK
- Harel Z Shouval
- Department of Neurobiology and Anatomy, University of Texas Medical School at Houston, Houston, TX, USA
- Department of Electrical and Computer Engineering, Rice University, Houston, TX, USA
7. Lippl S, Kay K, Jensen G, Ferrera VP, Abbott LF. A mathematical theory of relational generalization in transitive inference. Proc Natl Acad Sci U S A 2024;121:e2314511121. PMID: 38968113. PMCID: PMC11252811. DOI: 10.1073/pnas.2314511121.
Abstract
Humans and animals routinely infer relations between different items or events and generalize these relations to novel combinations of items. This allows them to respond appropriately to radically novel circumstances and is fundamental to advanced cognition. However, how learning systems (including the brain) can implement the necessary inductive biases has been unclear. We investigated transitive inference (TI), a classic relational task paradigm in which subjects must learn a relation (e.g., A > B and B > C) and generalize it to new combinations of items (A > C). Through mathematical analysis, we found that a broad range of biologically relevant learning models (e.g. gradient flow or ridge regression) perform TI successfully and recapitulate signature behavioral patterns long observed in living subjects. First, we found that models with item-wise additive representations automatically encode transitive relations. Second, for more general representations, a single scalar "conjunctivity factor" determines model behavior on TI and, further, the principle of norm minimization (a standard statistical inductive bias) enables models with fixed, partly conjunctive representations to generalize transitively. Finally, neural networks in the "rich regime," which enables representation learning and improves generalization on many tasks, unexpectedly show poor generalization and anomalous behavior on TI. We find that such networks implement a form of norm minimization (over hidden weights) that yields a local encoding mechanism lacking transitivity. Our findings show how minimal statistical learning principles give rise to a classical relational inductive bias (transitivity), explain empirically observed behaviors, and establish a formal approach to understanding the neural basis of relational abstraction.
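The finding that item-wise additive representations plus norm minimization yield transitive generalization can be reproduced in a few lines: train ridge regression on adjacent pairs only and probe an untrained, non-adjacent pair (the item indices and ridge penalty are arbitrary choices for illustration):

```python
import numpy as np

# Train on adjacent pairs of a ranked list (items 0 > 1 > 2 > 3)
# using an item-wise additive code, then probe a non-adjacent pair.
n = 4
X, y = [], []
for i, j in [(0, 1), (1, 2), (2, 3)]:
    v = np.zeros(n)
    v[i], v[j] = 1.0, -1.0        # additive item-wise representation
    X.append(v);  y.append(1.0)   # higher-ranked item on the left
    X.append(-v); y.append(-1.0)  # same pair, reversed presentation
X, y = np.array(X), np.array(y)

# Ridge regression: the norm-minimizing inductive bias.
lam = 0.1
w = np.linalg.solve(X.T @ X + lam * np.eye(n), X.T @ y)

# Untrained, non-adjacent pair (1, 3): a positive output means the
# model orders the pair transitively.
probe = np.zeros(n)
probe[1], probe[3] = 1.0, -1.0
```

The learned weights come out monotone in rank (w[0] > w[1] > w[2] > w[3]), so every untrained pair is ordered correctly.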
Affiliation(s)
- Samuel Lippl
- Mortimer B. Zuckerman Mind Brain Behavior Institute, Department of Neuroscience, Columbia University, New York, NY 10027
- Center for Theoretical Neuroscience, Department of Neuroscience, Columbia University, New York, NY 10027
- Department of Neuroscience, Columbia University Medical Center, New York, NY 10032
- Kenneth Kay
- Mortimer B. Zuckerman Mind Brain Behavior Institute, Department of Neuroscience, Columbia University, New York, NY 10027
- Center for Theoretical Neuroscience, Department of Neuroscience, Columbia University, New York, NY 10027
- Grossman Center for the Statistics of Mind, Columbia University, New York, NY 10027
- Greg Jensen
- Mortimer B. Zuckerman Mind Brain Behavior Institute, Department of Neuroscience, Columbia University, New York, NY 10027
- Department of Neuroscience, Columbia University Medical Center, New York, NY 10032
- Department of Psychology, Reed College, Portland, OR 97202
- Vincent P. Ferrera
- Mortimer B. Zuckerman Mind Brain Behavior Institute, Department of Neuroscience, Columbia University, New York, NY 10027
- Department of Neuroscience, Columbia University Medical Center, New York, NY 10032
- Department of Psychiatry, Columbia University Medical Center, New York, NY 10032
- L. F. Abbott
- Mortimer B. Zuckerman Mind Brain Behavior Institute, Department of Neuroscience, Columbia University, New York, NY 10027
- Center for Theoretical Neuroscience, Department of Neuroscience, Columbia University, New York, NY 10027
- Department of Neuroscience, Columbia University Medical Center, New York, NY 10032
8. Jensen KT, Hennequin G, Mattar MG. A recurrent network model of planning explains hippocampal replay and human behavior. Nat Neurosci 2024;27:1340-1348. PMID: 38849521. PMCID: PMC11239510. DOI: 10.1038/s41593-024-01675-7.
Abstract
When faced with a novel situation, people often spend substantial periods of time contemplating possible futures. For such planning to be rational, the benefits to behavior must compensate for the time spent thinking. Here, we capture these features of behavior by developing a neural network model where planning itself is controlled by the prefrontal cortex. This model consists of a meta-reinforcement learning agent augmented with the ability to plan by sampling imagined action sequences from its own policy, which we call 'rollouts'. In a spatial navigation task, the agent learns to plan when it is beneficial, which provides a normative explanation for empirical variability in human thinking times. Additionally, the patterns of policy rollouts used by the artificial agent closely resemble patterns of rodent hippocampal replays. Our work provides a theory of how the brain could implement planning through prefrontal-hippocampal interactions, where hippocampal replays are triggered by, and adaptively affect, prefrontal dynamics.
Affiliation(s)
- Kristopher T Jensen
- Computational and Biological Learning Lab, Department of Engineering, University of Cambridge, Cambridge, UK
- Sainsbury Wellcome Centre, University College London, London, UK
- Guillaume Hennequin
- Computational and Biological Learning Lab, Department of Engineering, University of Cambridge, Cambridge, UK
- Marcelo G Mattar
- Department of Cognitive Science, University of California, San Diego, CA, USA
- Department of Psychology, New York University, New York, NY, USA
9. Hosoda K, Nishida K, Seno S, Mashita T, Kashioka H, Ohzawa I. A single fast Hebbian-like process enabling one-shot class addition in deep neural networks without backbone modification. Front Neurosci 2024;18:1344114. PMID: 38933813. PMCID: PMC11202076. DOI: 10.3389/fnins.2024.1344114.
Abstract
One-shot learning, the ability to learn a new concept from a single instance, is a distinctive brain function that has garnered substantial interest in machine learning. While modeling physiological mechanisms poses challenges, advancements in artificial neural networks have led to performances in specific tasks that rival human capabilities. Proposing one-shot learning methods that build on these advancements, especially methods involving simple mechanisms, not only enhances technological development but also contributes to neuroscience by offering functionally valid hypotheses. Among the simplest methods for one-shot class addition with deep learning image classifiers is "weight imprinting," which uses the neural activity elicited by an image of a new class as the corresponding new synaptic weights. Despite its simplicity, its relevance to neuroscience is ambiguous, and it often interferes with the original image classification, which is a significant drawback in practical applications. This study introduces a novel interpretation in which part of the weight imprinting process aligns with the Hebbian rule. We show that a single Hebbian-like process enables pre-trained deep learning image classifiers to perform one-shot class addition without any modification to the original classifier's backbone. Using non-parametric normalization to mimic the brain's fast Hebbian plasticity significantly reduces the interference observed in previous methods. Our method is one of the simplest and most practical for one-shot class addition tasks, and its reliance on a single fast Hebbian-like process contributes valuable insights to neuroscience hypotheses.
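The core imprinting step, independent of any particular backbone, is just copying a normalized embedding into the classifier as a new weight row. A toy sketch with random stand-in embeddings (the backbone and dimensions are hypothetical; this illustrates plain weight imprinting, not the paper's full normalization scheme):

```python
import numpy as np

# Weight imprinting: a frozen backbone maps inputs to embeddings; a
# new class is added by copying the normalized embedding of a single
# example into the classifier as a new weight row. The "embeddings"
# here are random stand-ins for backbone outputs.
rng = np.random.default_rng(0)

def normalize(v):
    return v / np.linalg.norm(v)

W = np.stack([normalize(rng.normal(size=8)) for _ in range(2)])  # old classes
novel = normalize(rng.normal(size=8))       # embedding of one novel example

W = np.vstack([W, novel])                   # one-shot class addition

def classify(embedding):
    """Nearest-prototype (cosine) classification over imprinted weights."""
    return int(np.argmax(W @ normalize(embedding)))
```

No gradient step touches the original rows of W, which is why the backbone and existing classes need no modification.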
Affiliation(s)
- Kazufumi Hosoda
- Center for Information and Neural Networks, Advanced ICT Research Institute, National Institute of Information and Communications Technology, Suita, Japan
- Life and Medical Sciences Area, Health Sciences Discipline, Kobe University, Kobe, Japan
- Keigo Nishida
- Laboratory for Computational Molecular Design, RIKEN Center for Biosystems Dynamics Research, Suita, Japan
- Shigeto Seno
- Department of Bioinformatic Engineering, Graduate School of Information Science and Technology, Osaka University, Suita, Japan
- Hideki Kashioka
- Center for Information and Neural Networks, Advanced ICT Research Institute, National Institute of Information and Communications Technology, Suita, Japan
- Izumi Ohzawa
- Center for Information and Neural Networks, Advanced ICT Research Institute, National Institute of Information and Communications Technology, Suita, Japan
10. Gong L, Pasqualetti F, Papouin T, Ching S. Astrocytes as a mechanism for contextually-guided network dynamics and function. PLoS Comput Biol 2024;20:e1012186. PMID: 38820533. PMCID: PMC11168681. DOI: 10.1371/journal.pcbi.1012186.
Abstract
Astrocytes are a ubiquitous and enigmatic type of non-neuronal cell and are found in the brain of all vertebrates. While traditionally viewed as being supportive of neurons, it is increasingly recognized that astrocytes play a more direct and active role in brain function and neural computation. On account of their sensitivity to a host of physiological covariates and ability to modulate neuronal activity and connectivity on slower time scales, astrocytes may be particularly well poised to modulate the dynamics of neural circuits in functionally salient ways. In the current paper, we seek to capture these features via actionable abstractions within computational models of neuron-astrocyte interaction. Specifically, we examine how nested feedback loops of neuron-astrocyte interaction, acting over separated time-scales, may endow astrocytes with the capability to enable learning in context-dependent settings, where fluctuations in task parameters may occur much more slowly than within-task requirements. We pose a general model of neuron-synapse-astrocyte interaction and use formal analysis to characterize how astrocytic modulation may constitute a form of meta-plasticity, altering the ways in which synapses and neurons adapt as a function of time. We then embed this model in a bandit-based reinforcement learning task environment, and show how the presence of time-scale separated astrocytic modulation enables learning over multiple fluctuating contexts. Indeed, these networks learn far more reliably compared to dynamically homogeneous networks and conventional non-network-based bandit algorithms. Our results fuel the notion that neuron-astrocyte interactions in the brain benefit learning over different time-scales and the conveyance of task-relevant contextual information onto circuit dynamics.
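As context for the task environment described here, a minimal non-stationary two-armed bandit with one slow reward-probability switch; a constant learning rate lets value estimates track the drift. This sketches the setting only, not the neuron-astrocyte model (the arm probabilities and hyperparameters are invented):

```python
import random

# Non-stationary two-armed bandit with one slow context switch at the
# midpoint. A constant learning rate keeps value estimates tracking
# the drifting reward probabilities.
random.seed(1)

def run(p_reward, steps=2000, alpha=0.1, eps=0.1):
    q = [0.0, 0.0]
    for t in range(steps):
        if t == steps // 2:
            p_reward = p_reward[::-1]       # slow context switch
        explore = random.random() < eps
        a = random.randrange(2) if explore else q.index(max(q))
        r = 1.0 if random.random() < p_reward[a] else 0.0
        q[a] += alpha * (r - q[a])
    return q

q = run([0.8, 0.2])
```

A purely sample-averaging learner would freeze after the switch; constant-alpha tracking (or, in the paper, slow astrocytic modulation) is what lets the agent re-adapt.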
Affiliation(s)
- Lulu Gong
- Department of Electrical and Systems Engineering, Washington University, St. Louis, Missouri, United States of America
- Fabio Pasqualetti
- Department of Mechanical Engineering, University of California, Riverside, California, United States of America
- Thomas Papouin
- Department of Neuroscience, Washington University School of Medicine, St. Louis, Missouri, United States of America
- ShiNung Ching
- Department of Electrical and Systems Engineering, Washington University, St. Louis, Missouri, United States of America
11. Lakshminarasimhan KJ, Xie M, Cohen JD, Sauerbrei BA, Hantman AW, Litwin-Kumar A, Escola S. Specific connectivity optimizes learning in thalamocortical loops. Cell Rep 2024;43:114059. PMID: 38602873. PMCID: PMC11104520. DOI: 10.1016/j.celrep.2024.114059.
Abstract
Thalamocortical loops have a central role in cognition and motor control, but precisely how they contribute to these processes is unclear. Recent studies showing evidence of plasticity in thalamocortical synapses indicate a role for the thalamus in shaping cortical dynamics through learning. Since signals undergo a compression from the cortex to the thalamus, we hypothesized that the computational role of the thalamus depends critically on the structure of corticothalamic connectivity. To test this, we identified the optimal corticothalamic structure that promotes biologically plausible learning in thalamocortical synapses. We found that corticothalamic projections specialized to communicate an efference copy of the cortical output benefit motor control, while communicating the modes of highest variance is optimal for working memory tasks. We analyzed neural recordings from mice performing grasping and delayed discrimination tasks and found corticothalamic communication consistent with these predictions. These results suggest that the thalamus orchestrates cortical dynamics in a functionally precise manner through structured connectivity.
Affiliation(s)
- Marjorie Xie
- Zuckerman Mind Brain Behavior Institute, Columbia University, New York, NY 10027, USA
- Jeremy D Cohen
- Neuroscience Center, University of North Carolina, Chapel Hill, NC 27559, USA
- Britton A Sauerbrei
- Department of Neurosciences, Case Western Reserve University, Cleveland, OH 44106, USA
- Adam W Hantman
- Neuroscience Center, University of North Carolina, Chapel Hill, NC 27559, USA
- Ashok Litwin-Kumar
- Zuckerman Mind Brain Behavior Institute, Columbia University, New York, NY 10027, USA
- Sean Escola
- Department of Psychiatry, Columbia University, New York, NY 10032, USA
12. Menéndez JA, Hennig JA, Golub MD, Oby ER, Sadtler PT, Batista AP, Chase SM, Yu BM, Latham PE. A theory of brain-computer interface learning via low-dimensional control. bioRxiv 2024:2024.04.18.589952 [Preprint]. PMID: 38712193. PMCID: PMC11071278. DOI: 10.1101/2024.04.18.589952.
Abstract
A remarkable demonstration of the flexibility of mammalian motor systems is primates' ability to learn to control brain-computer interfaces (BCIs). This constitutes a completely novel motor behavior, yet primates are capable of learning to control BCIs under a wide range of conditions. BCIs with carefully calibrated decoders, for example, can be learned with only minutes to hours of practice. With a few weeks of practice, even BCIs with randomly constructed decoders can be learned. What are the biological substrates of this learning process? Here, we develop a theory based on a re-aiming strategy, whereby learning operates within a low-dimensional subspace of task-relevant inputs driving the local population of recorded neurons. Through comprehensive numerical and formal analysis, we demonstrate that this theory can provide a unifying explanation for disparate phenomena previously reported in three different BCI learning tasks, and we derive a novel experimental prediction that we verify with previously published data. By explicitly modeling the underlying neural circuitry, the theory reveals an interpretation of these phenomena in terms of biological constraints on neural activity.
13. Subramoney A, Bellec G, Scherr F, Legenstein R, Maass W. Fast learning without synaptic plasticity in spiking neural networks. Sci Rep 2024;14:8557. PMID: 38609429. PMCID: PMC11015027. DOI: 10.1038/s41598-024-55769-0.
Abstract
Spiking neural networks are of high current interest, both from the perspective of modelling neural networks of the brain and for porting their fast learning capability and energy efficiency into neuromorphic hardware. But so far we have not been able to reproduce fast learning capabilities of the brain in spiking neural networks. Biological data suggest that a synergy of synaptic plasticity on a slow time scale with network dynamics on a faster time scale is responsible for fast learning capabilities of the brain. We show here that a suitable orchestration of this synergy between synaptic plasticity and network dynamics does in fact reproduce fast learning capabilities of generic recurrent networks of spiking neurons. This points to the important role of recurrent connections in spiking networks, since these are necessary for enabling salient network dynamics. We show more specifically that the proposed synergy enables synaptic weights to encode more general information such as priors and task structures, since moment-to-moment processing of new information can be delegated to the network dynamics.
Collapse
Affiliation(s)
- Anand Subramoney: Institute for Theoretical Computer Science, Graz University of Technology, Graz, Austria; Department of Computer Science, Royal Holloway University of London, Egham, UK
- Guillaume Bellec: Institute for Theoretical Computer Science, Graz University of Technology, Graz, Austria; Laboratory of Computational Neuroscience, Ecole Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
- Franz Scherr: Institute for Theoretical Computer Science, Graz University of Technology, Graz, Austria
- Robert Legenstein: Institute for Theoretical Computer Science, Graz University of Technology, Graz, Austria
- Wolfgang Maass: Institute for Theoretical Computer Science, Graz University of Technology, Graz, Austria
14
Pereira-Obilinovic U, Hou H, Svoboda K, Wang XJ. Brain mechanism of foraging: Reward-dependent synaptic plasticity versus neural integration of values. Proc Natl Acad Sci U S A 2024; 121:e2318521121. [PMID: 38551832] [PMCID: PMC10998608] [DOI: 10.1073/pnas.2318521121]
Abstract
During foraging behavior, action values are persistently encoded in neural activity and updated depending on the history of choice outcomes. What is the neural mechanism for action value maintenance and updating? Here, we explore two contrasting network models: synaptic learning of action value versus neural integration. We show that both models can reproduce extant experimental data, but they yield distinct predictions about the underlying biological neural circuits. In particular, the neural integrator model but not the synaptic model requires that reward signals are mediated by neural pools selective for action alternatives and their projections are aligned with linear attractor axes in the valuation system. We demonstrate experimentally observable neural dynamical signatures and feasible perturbations to differentiate the two contrasting scenarios, suggesting that the synaptic model is a more robust candidate mechanism. Overall, this work provides a modeling framework to guide future experimental research on probabilistic foraging.
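The contrast between the two candidate mechanisms can be caricatured in a few lines of Python (our illustration, not the authors' published models; all names and parameter values are ours). With leak equal to one minus the learning rate, a delta-rule synaptic learner and a leaky neural integrator trace out identical value trajectories, which is one way to see why the paper finds that behavioral data alone cannot separate them:

```python
def synaptic_update(value, reward, lr=0.1):
    """Delta rule: a stored synaptic value moves toward each outcome."""
    return value + lr * (reward - value)

def integrator_update(activity, reward, leak=0.9, gain=0.1):
    """Leaky integration: value is held as persistent neural activity."""
    return leak * activity + gain * reward

v = x = 0.0
for r in [1, 1, 0, 1, 0, 0, 1, 1]:  # an arbitrary choice-outcome history
    v = synaptic_update(v, r)
    x = integrator_update(x, r)

# With leak = 1 - lr the two rules are algebraically identical, so only
# circuit-level predictions (not behavior) can distinguish the mechanisms.
```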
Affiliation(s)
- Ulises Pereira-Obilinovic: Center for Neural Science, New York University, New York, NY 10003; Allen Institute for Neural Dynamics, Seattle, WA 98109
- Han Hou: Allen Institute for Neural Dynamics, Seattle, WA 98109
- Karel Svoboda: Allen Institute for Neural Dynamics, Seattle, WA 98109
- Xiao-Jing Wang: Center for Neural Science, New York University, New York, NY 10003
15
Mohebi A, Wei W, Pelattini L, Kim K, Berke JD. Dopamine transients follow a striatal gradient of reward time horizons. Nat Neurosci 2024; 27:737-746. [PMID: 38321294] [PMCID: PMC11001583] [DOI: 10.1038/s41593-023-01566-3]
Abstract
Animals make predictions to guide their behavior and update those predictions through experience. Transient increases in dopamine (DA) are thought to be critical signals for updating predictions. However, it is unclear how this mechanism handles a wide range of behavioral timescales-from seconds or less (for example, if singing a song) to potentially hours or more (for example, if hunting for food). Here we report that DA transients in distinct rat striatal subregions convey prediction errors based on distinct time horizons. DA dynamics systematically accelerated from ventral to dorsomedial to dorsolateral striatum, in the tempo of spontaneous fluctuations, the temporal integration of prior rewards and the discounting of future rewards. This spectrum of timescales for evaluative computations can help achieve efficient learning and adaptive motivation for a broad range of behaviors.
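The spectrum of time horizons can be illustrated with exponential discounting at different rates. A sketch (the discount factors below are hypothetical illustrations of a ventral-to-dorsolateral gradient, not values reported in the paper):

```python
# Hypothetical discount factors along a ventral -> dorsolateral gradient.
horizons = {"ventral": 0.99, "dorsomedial": 0.90, "dorsolateral": 0.50}

def discounted_value(rewards, gamma):
    """Present value of a future reward stream under exponential discounting."""
    return sum(r * gamma ** t for t, r in enumerate(rewards))

delayed_reward = [0, 0, 0, 1]  # a single reward three steps in the future
values = {region: discounted_value(delayed_reward, g)
          for region, g in horizons.items()}
# A long-horizon (ventral-like) evaluator values the delayed reward far more
# than a short-horizon (dorsolateral-like) one, supporting slow behaviors
# like foraging versus fast ones like movement sequencing.
```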
Affiliation(s)
- Ali Mohebi: Department of Neurology, University of California San Francisco, San Francisco, CA, USA
- Wei Wei: Department of Neurology, University of California San Francisco, San Francisco, CA, USA
- Lilian Pelattini: Department of Neurology, University of California San Francisco, San Francisco, CA, USA
- Kyoungjun Kim: Department of Neurology, University of California San Francisco, San Francisco, CA, USA
- Joshua D Berke: Department of Neurology; Department of Psychiatry and Behavioral Sciences; Neuroscience Graduate Program; Kavli Institute for Fundamental Neuroscience; and Weill Institute for Neurosciences, University of California San Francisco, San Francisco, CA, USA
16
Wang X, Wang S, Liang X, Zhao D, Huang J, Xu X, Dai B, Miao Q. Deep Reinforcement Learning: A Survey. IEEE Trans Neural Netw Learn Syst 2024; 35:5064-5078. [PMID: 36170386] [DOI: 10.1109/tnnls.2022.3207346]
Abstract
Deep reinforcement learning (DRL) integrates the feature representation ability of deep learning with the decision-making ability of reinforcement learning so that it can achieve powerful end-to-end learning control capabilities. In the past decade, DRL has made substantial advances in many tasks that require perceiving high-dimensional input and making optimal or near-optimal decisions. However, there are still many challenging problems in the theory and applications of DRL, especially in learning control tasks with limited samples, sparse rewards, and multiple agents. Researchers have proposed various solutions and new theories to solve these problems and promote the development of DRL. In addition, deep learning has stimulated the further development of many subfields of reinforcement learning, such as hierarchical reinforcement learning (HRL), multiagent reinforcement learning, and imitation learning. This article gives a comprehensive overview of the fundamental theories, key algorithms, and primary research domains of DRL. In addition to value-based and policy-based DRL algorithms, the advances in maximum entropy-based DRL are summarized. The future research topics of DRL are also analyzed and discussed.
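The value-based family the survey covers builds on the tabular Q-learning update; deep variants such as DQN replace the table with a neural network but keep the same bootstrapped target. A minimal sketch on a toy chain environment (ours, for orientation only):

```python
import random

random.seed(0)
n_states, n_actions = 5, 2
Q = [[0.0] * n_actions for _ in range(n_states)]  # the "network" is a table
alpha, gamma, eps = 0.5, 0.9, 0.1

def step(s, a):
    """Toy chain: action 1 moves right, action 0 left; reward at the right end."""
    s2 = min(s + 1, n_states - 1) if a == 1 else max(s - 1, 0)
    return s2, (1.0 if s2 == n_states - 1 else 0.0)

for _ in range(500):                      # episodes
    s = 0
    for _ in range(20):                   # steps per episode
        if random.random() < eps:         # epsilon-greedy exploration
            a = random.randrange(n_actions)
        else:                             # greedy with random tie-breaking
            a = max(range(n_actions), key=lambda i: (Q[s][i], random.random()))
        s2, r = step(s, a)
        # Bootstrapped TD target: r + gamma * max_a' Q(s', a')
        Q[s][a] += alpha * (r + gamma * max(Q[s2]) - Q[s][a])
        if r == 1.0:
            break                         # episode ends at the goal
        s = s2
```

After training, the learned values prefer moving right from the start state, and the state adjacent to the goal approaches the undiscounted reward of 1.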
17
Kay K, Biderman N, Khajeh R, Beiran M, Cueva CJ, Shohamy D, Jensen G, Wei XX, Ferrera VP, Abbott LF. Emergent neural dynamics and geometry for generalization in a transitive inference task. PLoS Comput Biol 2024; 20:e1011954. [PMID: 38662797] [PMCID: PMC11125559] [DOI: 10.1371/journal.pcbi.1011954]
Abstract
Relational cognition-the ability to infer relationships that generalize to novel combinations of objects-is fundamental to human and animal intelligence. Despite this importance, it remains unclear how relational cognition is implemented in the brain due in part to a lack of hypotheses and predictions at the levels of collective neural activity and behavior. Here we discovered, analyzed, and experimentally tested neural networks (NNs) that perform transitive inference (TI), a classic relational task (if A > B and B > C, then A > C). We found NNs that (i) generalized perfectly, despite lacking overt transitive structure prior to training, (ii) generalized when the task required working memory (WM), a capacity thought to be essential to inference in the brain, (iii) emergently expressed behaviors long observed in living subjects, in addition to a novel order-dependent behavior, and (iv) expressed different task solutions yielding alternative behavioral and neural predictions. Further, in a large-scale experiment, we found that human subjects performing WM-based TI showed behavior inconsistent with a class of NNs that characteristically expressed an intuitive task solution. These findings provide neural insights into a classical relational ability, with wider implications for how the brain realizes relational cognition.
Affiliation(s)
- Kenneth Kay: Mortimer B. Zuckerman Mind Brain Behavior Institute; Center for Theoretical Neuroscience; and Grossman Center for the Statistics of Mind, Columbia University, New York, NY, USA
- Natalie Biderman: Mortimer B. Zuckerman Mind Brain Behavior Institute and Department of Psychology, Columbia University, New York, NY, USA
- Ramin Khajeh: Mortimer B. Zuckerman Mind Brain Behavior Institute and Center for Theoretical Neuroscience, Columbia University, New York, NY, USA
- Manuel Beiran: Mortimer B. Zuckerman Mind Brain Behavior Institute and Center for Theoretical Neuroscience, Columbia University, New York, NY, USA
- Christopher J. Cueva: Department of Brain and Cognitive Sciences, MIT, Cambridge, MA, USA
- Daphna Shohamy: Mortimer B. Zuckerman Mind Brain Behavior Institute; Department of Psychology; and The Kavli Institute for Brain Science, Columbia University, New York, NY, USA
- Greg Jensen: Mortimer B. Zuckerman Mind Brain Behavior Institute, Columbia University, New York, NY, USA; Department of Neuroscience, Columbia University Medical Center, New York, NY, USA; Department of Psychology, Reed College, Portland, OR, USA
- Xue-Xin Wei: Departments of Neuroscience and Psychology, The University of Texas at Austin, Austin, TX, USA
- Vincent P. Ferrera: Mortimer B. Zuckerman Mind Brain Behavior Institute, Columbia University, New York, NY, USA; Departments of Neuroscience and Psychiatry, Columbia University Medical Center, New York, NY, USA
- LF Abbott: Mortimer B. Zuckerman Mind Brain Behavior Institute; Center for Theoretical Neuroscience; and The Kavli Institute for Brain Science, Columbia University, New York, NY, USA; Department of Neuroscience, Columbia University Medical Center, New York, NY, USA
18
McNamee DC. The generative neural microdynamics of cognitive processing. Curr Opin Neurobiol 2024; 85:102855. [PMID: 38428170] [DOI: 10.1016/j.conb.2024.102855]
Abstract
The entorhinal cortex and hippocampus form a recurrent network that informs many cognitive processes, including memory, planning, navigation, and imagination. Neural recordings from these regions reveal spatially organized population codes corresponding to external environments and abstract spaces. Aligning the former cognitive functionalities with the latter neural phenomena is a central challenge in understanding the entorhinal-hippocampal circuit (EHC). Disparate experiments demonstrate a surprising level of complexity and apparent disorder in the intricate spatiotemporal dynamics of sequential non-local hippocampal reactivations, which occur particularly, though not exclusively, during immobile pauses and rest. We review these phenomena with a particular focus on their apparent lack of physical simulative realism. These observations are then integrated within a theoretical framework and proposed neural circuit mechanisms that normatively characterize this neural complexity by conceiving different regimes of hippocampal microdynamics as neuromarkers of diverse cognitive computations.
19
Kuroki S, Mizuseki K. CA3 Circuit Model Compressing Sequential Information in Theta Oscillation and Replay. Neural Comput 2024; 36:501-548. [PMID: 38457750] [DOI: 10.1162/neco_a_01641]
Abstract
The hippocampus plays a critical role in the compression and retrieval of sequential information. During wakefulness, it achieves this through theta phase precession and theta sequences. Subsequently, during periods of sleep or rest, the compressed information reactivates through sharp-wave ripple events, manifesting as memory replay. However, how these sequential neuronal activities are generated and how they store information about the external environment remain unknown. We developed a hippocampal cornu ammonis 3 (CA3) computational model based on anatomical and electrophysiological evidence from the biological CA3 circuit to address these questions. The model comprises theta rhythm inhibition, place input, and CA3-CA3 plastic recurrent connection. The model can compress the sequence of the external inputs, reproduce theta phase precession and replay, learn additional sequences, and reorganize previously learned sequences. A gradual increase in synaptic inputs, controlled by interactions between theta-paced inhibition and place inputs, explained the mechanism of sequence acquisition. This model highlights the crucial role of plasticity in the CA3 recurrent connection and theta oscillational dynamics and hypothesizes how the CA3 circuit acquires, compresses, and replays sequential information.
Affiliation(s)
- Satoshi Kuroki: Department of Physiology, Graduate School of Medicine, Osaka Metropolitan University, Osaka 545-8585, Japan
- Kenji Mizuseki: Department of Physiology, Graduate School of Medicine, Osaka Metropolitan University, Osaka 545-8585, Japan
20
Wolff M, Halassa MM. The mediodorsal thalamus in executive control. Neuron 2024; 112:893-908. [PMID: 38295791] [DOI: 10.1016/j.neuron.2024.01.002]
Abstract
Executive control, the ability to organize thoughts and action plans in real time, is a defining feature of higher cognition. Classical theories have emphasized cortical contributions to this process, but recent studies have reinvigorated interest in the role of the thalamus. Although it is well established that local thalamic damage diminishes cognitive capacity, such observations have been difficult to inform functional models. Recent progress in experimental techniques is beginning to enrich our understanding of the anatomical, physiological, and computational substrates underlying thalamic engagement in executive control. In this review, we discuss this progress and particularly focus on the mediodorsal thalamus, which regulates the activity within and across frontal cortical areas. We end with a synthesis that highlights frontal thalamocortical interactions in cognitive computations and discusses its functional implications in normal and pathological conditions.
Affiliation(s)
- Mathieu Wolff: University of Bordeaux, CNRS, INCIA, UMR 5287, 33000 Bordeaux, France
- Michael M Halassa: Department of Neuroscience and Department of Psychiatry, Tufts University School of Medicine, Boston, MA, USA
21
Jahn CI, Markov NT, Morea B, Daw ND, Ebitz RB, Buschman TJ. Learning attentional templates for value-based decision-making. Cell 2024; 187:1476-1489.e21. [PMID: 38401541] [DOI: 10.1016/j.cell.2024.01.041]
Abstract
Attention filters sensory inputs to enhance task-relevant information. It is guided by an "attentional template" that represents the stimulus features that are currently relevant. To understand how the brain learns and uses templates, we trained monkeys to perform a visual search task that required them to repeatedly learn new attentional templates. Neural recordings found that templates were represented across the prefrontal and parietal cortex in a structured manner, such that perceptually neighboring templates had similar neural representations. When the task changed, a new attentional template was learned by incrementally shifting the template toward rewarded features. Finally, we found that attentional templates transformed stimulus features into a common value representation that allowed the same decision-making mechanisms to deploy attention, regardless of the identity of the template. Altogether, our results provide insight into the neural mechanisms by which the brain learns to control attention and how attention can be flexibly deployed across tasks.
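The incremental shift toward rewarded features that the abstract describes is, in computational terms, a delta rule applied to the template itself. A schematic sketch (our toy feature space and learning rate, not the authors' fitted model):

```python
def shift_template(template, features, rewarded, lr=0.2):
    """Move the attentional template toward the features of rewarded choices."""
    if not rewarded:
        return template
    return [t + lr * (f - t) for t, f in zip(template, features)]

template = [0.0, 0.0]   # template in a 2D feature space (e.g. color, shape)
target = [1.0, 0.5]     # features of the currently rewarded stimulus
for _ in range(30):     # repeated rewarded trials after a task change
    template = shift_template(template, target, rewarded=True)
# The template converges on the rewarded feature combination.
```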
Affiliation(s)
- Caroline I Jahn: Princeton Neuroscience Institute, Princeton University, Princeton, NJ 08540, USA
- Nikola T Markov: Princeton Neuroscience Institute, Princeton University, Princeton, NJ 08540, USA
- Britney Morea: Princeton Neuroscience Institute, Princeton University, Princeton, NJ 08540, USA
- Nathaniel D Daw: Princeton Neuroscience Institute and Department of Psychology, Princeton University, Princeton, NJ 08540, USA
- R Becket Ebitz: Princeton Neuroscience Institute, Princeton University, Princeton, NJ 08540, USA; Department of Neurosciences, Université de Montréal, Montréal, QC H3C 3J7, Canada
- Timothy J Buschman: Princeton Neuroscience Institute and Department of Psychology, Princeton University, Princeton, NJ 08540, USA
22
Carter F, Cossette MP, Trujillo-Pisanty I, Pallikaras V, Breton YA, Conover K, Caplan J, Solis P, Voisard J, Yaksich A, Shizgal P. Does phasic dopamine release cause policy updates? Eur J Neurosci 2024; 59:1260-1277. [PMID: 38039083] [DOI: 10.1111/ejn.16199]
Abstract
Phasic dopamine activity is believed to both encode reward-prediction errors (RPEs) and to cause the adaptations that these errors engender. If so, a rat working for optogenetic stimulation of dopamine neurons will repeatedly update its policy and/or action values, thus iteratively increasing its work rate. Here, we challenge this view by demonstrating stable, non-maximal work rates in the face of repeated optogenetic stimulation of midbrain dopamine neurons. Furthermore, we show that rats learn to discriminate between world states distinguished only by their history of dopamine activation. Comparison of these results to reinforcement learning simulations suggests that the induced dopamine transients acted more as rewards than RPEs. However, pursuit of dopaminergic stimulation drifted upwards over a time scale of days and weeks, despite its stability within trials. To reconcile the results with prior findings, we consider multiple roles for dopamine signalling.
Affiliation(s)
- Francis Carter: Department of Psychology, Concordia University, Montreal, Quebec, Canada; Montreal Institute for Learning Algorithms, Université de Montréal, Montreal, Quebec, Canada
- Ivan Trujillo-Pisanty: Department of Psychology, Concordia University, Montreal, Quebec, Canada; Department of Psychology, Langara College, Vancouver, British Columbia, Canada
- Kent Conover: Department of Psychology, Concordia University, Montreal, Quebec, Canada
- Jill Caplan: Department of Psychology, Concordia University, Montreal, Quebec, Canada
- Pavel Solis: Department of Psychology, Concordia University, Montreal, Quebec, Canada
- Jacques Voisard: Department of Psychology, Concordia University, Montreal, Quebec, Canada
- Alexandra Yaksich: Department of Psychology, Concordia University, Montreal, Quebec, Canada
- Peter Shizgal: Department of Psychology, Concordia University, Montreal, Quebec, Canada
23
Muller TH, Butler JL, Veselic S, Miranda B, Wallis JD, Dayan P, Behrens TEJ, Kurth-Nelson Z, Kennerley SW. Distributional reinforcement learning in prefrontal cortex. Nat Neurosci 2024; 27:403-408. [PMID: 38200183] [PMCID: PMC10917656] [DOI: 10.1038/s41593-023-01535-w]
Abstract
The prefrontal cortex is crucial for learning and decision-making. Classic reinforcement learning (RL) theories center on learning the expectation of potential rewarding outcomes and explain a wealth of neural data in the prefrontal cortex. Distributional RL, on the other hand, learns the full distribution of rewarding outcomes and better explains dopamine responses. In the present study, we show that distributional RL also better explains macaque anterior cingulate cortex neuronal responses, suggesting that it is a common mechanism for reward-guided learning.
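The core distributional idea can be sketched with an expectile-style update, in which units with different asymmetries between positive and negative prediction errors converge to different statistics of the same reward distribution (an illustrative sketch under our own parameter choices, not the paper's analysis):

```python
import random

random.seed(1)
taus = [0.1, 0.5, 0.9]       # pessimistic, neutral, optimistic units
values = [0.0, 0.0, 0.0]
lr = 0.02

for _ in range(20000):
    r = random.choice([0.0, 1.0])   # bimodal reward: 0 or 1, equally often
    for i, tau in enumerate(taus):
        err = r - values[i]
        # Asymmetric scaling: positive errors weighted by tau,
        # negative errors by (1 - tau).
        values[i] += lr * (tau if err > 0 else 1 - tau) * err
# The units settle near different expectiles of the reward distribution,
# jointly encoding its spread rather than only its mean (which a classic
# expected-value learner would report for every unit).
```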
Affiliation(s)
- Timothy H Muller: Department of Experimental Psychology, University of Oxford, Oxford, UK; Department of Clinical and Movement Neurosciences, University College London, London, UK
- James L Butler: Department of Experimental Psychology, University of Oxford, Oxford, UK; Department of Clinical and Movement Neurosciences, University College London, London, UK
- Sebastijan Veselic: Department of Experimental Psychology, University of Oxford, Oxford, UK; Department of Clinical and Movement Neurosciences, University College London, London, UK; Wellcome Trust Centre for Human Neuroimaging, University College London, London, UK
- Bruno Miranda: Department of Clinical and Movement Neurosciences, University College London, London, UK; Institute of Physiology and Institute of Molecular Medicine, Lisbon School of Medicine, University of Lisbon, Lisbon, Portugal
- Joni D Wallis: Department of Psychology and Helen Wills Neuroscience Institute, University of California Berkeley, Berkeley, CA, USA
- Peter Dayan: Max Planck Institute for Biological Cybernetics, Tübingen, Germany; University of Tübingen, Tübingen, Germany
- Timothy E J Behrens: Wellcome Trust Centre for Human Neuroimaging, University College London, London, UK; Wellcome Centre for Integrative Neuroimaging, University of Oxford, John Radcliffe Hospital, Oxford, UK; Sainsbury Wellcome Centre for Neural Circuits and Behaviour, University College London, London, UK
- Zeb Kurth-Nelson: Google DeepMind, London, UK; Max Planck University College London Centre for Computational Psychiatry and Ageing Research, University College London, London, UK
- Steven W Kennerley: Department of Experimental Psychology, University of Oxford, Oxford, UK; Department of Clinical and Movement Neurosciences, University College London, London, UK; Wellcome Centre for Integrative Neuroimaging, University of Oxford, John Radcliffe Hospital, Oxford, UK
24
Simoens J, Verguts T, Braem S. Learning environment-specific learning rates. PLoS Comput Biol 2024; 20:e1011978. [PMID: 38517916] [PMCID: PMC10990245] [DOI: 10.1371/journal.pcbi.1011978]
Abstract
People often have to switch back and forth between different environments that come with different problems and volatilities. While volatile environments require fast learning (i.e., high learning rates), stable environments call for lower learning rates. Previous studies have shown that people adapt their learning rates, but it remains unclear whether they can also learn about environment-specific learning rates, and instantaneously retrieve them when revisiting environments. Here, using optimality simulations and hierarchical Bayesian analyses across three experiments, we show that people can learn to use different learning rates when switching back and forth between two different environments. We even observe a signature of these environment-specific learning rates when the volatility of both environments is suddenly the same. We conclude that humans can flexibly adapt and learn to associate different learning rates to different environments, offering important insights for developing theories of meta-learning and context-specific control.
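The central claim, that learners store and retrieve a learning rate per environment rather than re-adapting from scratch on each visit, can be sketched as a delta-rule learner with a context-indexed learning rate (our illustrative values, not the fitted parameters):

```python
# Hypothetical per-environment learning rates, retrieved on re-entry.
context_lr = {"volatile": 0.5, "stable": 0.05}

def update(value, reward, context):
    """One delta-rule step using the learning rate stored for this context."""
    return value + context_lr[context] * (reward - value)

# Re-entering the volatile environment immediately yields large updates...
step_volatile = update(0.0, 1.0, "volatile")
# ...while the same surprise in the stable environment moves the estimate little.
step_stable = update(0.0, 1.0, "stable")
```

The point of the sketch is the instantaneous retrieval: no within-environment re-adaptation of the learning rate is needed, matching the signature the authors report when switching between environments.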
Affiliation(s)
- Jonas Simoens: Department of Experimental Psychology, Ghent University, Belgium
- Tom Verguts: Department of Experimental Psychology, Ghent University, Belgium
- Senne Braem: Department of Experimental Psychology, Ghent University, Belgium
25
Hocker D, Constantinople CM, Savin C. Curriculum learning inspired by behavioral shaping trains neural networks to adopt animal-like decision making strategies. bioRxiv [Preprint] 2024:2024.01.12.575461. [PMID: 38318205] [PMCID: PMC10843159] [DOI: 10.1101/2024.01.12.575461]
Abstract
Recurrent neural networks (RNN) are ubiquitously used in neuroscience to capture both neural dynamics and behaviors of living systems. However, when it comes to complex cognitive tasks, traditional methods for training RNNs can fall short in capturing crucial aspects of animal behavior. To address this challenge, we take inspiration from a commonly used (though rarely appreciated) approach from the experimental neuroscientist's toolkit: behavioral shaping. Our solution leverages task compositionality and models the animal's relevant learning experiences prior to the task. Taking as target a temporal wagering task previously studied in rats, we designed a pretraining curriculum of simpler cognitive tasks that are prerequisites for performing it well. These pretraining tasks are not just simplified versions of the temporal wagering task, but reflect relevant sub-computations. We show that this approach is required for RNNs to adopt similar strategies as rats, including long-timescale inference of latent states, which conventional pretraining approaches fail to capture. Mechanistically, our pretraining supports the development of key dynamical systems features needed for implementing both inference and value-based decision making. Overall, our approach addresses a gap in neural network model training by incorporating inductive biases of animals, which is important when modeling complex behaviors that rely on computational abilities acquired from past experiences.
26
Wientjes S, Holroyd CB. The successor representation subserves hierarchical abstraction for goal-directed behavior. PLoS Comput Biol 2024; 20:e1011312. [PMID: 38377074] [PMCID: PMC10906840] [DOI: 10.1371/journal.pcbi.1011312]
Abstract
Humans have the ability to craft abstract, temporally extended and hierarchically organized plans. For instance, when considering how to make spaghetti for dinner, we typically concern ourselves with useful "subgoals" in the task, such as cutting onions, boiling pasta, and cooking a sauce, rather than particulars such as how many cuts to make to the onion, or exactly which muscles to contract. A core question is how such decomposition of a more abstract task into logical subtasks happens in the first place. Previous research has shown that humans are sensitive to a form of higher-order statistical learning named "community structure". Community structure is a common feature of abstract tasks characterized by a logical ordering of subtasks. This structure can be captured by a model where humans learn predictions of upcoming events multiple steps into the future, discounting predictions of events further away in time. One such model is the "successor representation", which has been argued to be useful for hierarchical abstraction. As of yet, no study has convincingly shown that this hierarchical abstraction can be put to use for goal-directed behavior. Here, we investigate whether participants utilize learned community structure to craft hierarchically informed action plans for goal-directed behavior. Participants were asked to search for paintings in a virtual museum, where the paintings were grouped together in "wings" representing community structure in the museum. We find that participants' choices accord with the hierarchical structure of the museum and that their response times are best predicted by a successor representation. The degree to which the response times reflect the community structure of the museum correlates with several measures of performance, including the ability to craft temporally abstract action plans. These results suggest that successor representation learning subserves hierarchical abstractions relevant for goal-directed behavior.
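The successor representation itself has a compact temporal-difference formulation: M[s][s'] estimates the discounted expected future occupancy of state s' starting from s, learned from transitions alone. A minimal tabular sketch on a toy ring of states (ours; the museum task's community structure is richer):

```python
import random

random.seed(0)
n = 4                         # states arranged on a ring
gamma, lr = 0.9, 0.1
M = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]

def sr_update(s, s2):
    """TD update of row s toward one-hot(s) + gamma * M[s2]."""
    for j in range(n):
        target = (1.0 if j == s else 0.0) + gamma * M[s2][j]
        M[s][j] += lr * (target - M[s][j])

s = 0
for _ in range(5000):         # random-walk experience on the ring
    s2 = (s + random.choice([-1, 1])) % n
    sr_update(s, s2)
    s = s2
# Rows of M now encode multi-step predictive proximity: states close in the
# walk have higher expected future occupancy than distant ones, which is the
# structure the paper argues supports hierarchical abstraction.
```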
Affiliation(s)
- Sven Wientjes: Department of Experimental Psychology, Ghent University, Ghent, Belgium
- Clay B. Holroyd: Department of Experimental Psychology, Ghent University, Ghent, Belgium
27
Wise T, Emery K, Radulescu A. Naturalistic reinforcement learning. Trends Cogn Sci 2024; 28:144-158. [PMID: 37777463] [PMCID: PMC10878983] [DOI: 10.1016/j.tics.2023.08.016]
Abstract
Humans possess a remarkable ability to make decisions within real-world environments that are expansive, complex, and multidimensional. Human cognitive computational neuroscience has sought to exploit reinforcement learning (RL) as a framework within which to explain human decision-making, often focusing on constrained, artificial experimental tasks. In this article, we review recent efforts that use naturalistic approaches to determine how humans make decisions in complex environments that better approximate the real world, providing a clearer picture of how humans navigate the challenges posed by real-world decisions. These studies purposely embed elements of naturalistic complexity within experimental paradigms, rather than focusing on simplification, generating insights into the processes that likely underpin humans' ability to navigate complex, multidimensional real-world environments so successfully.
Affiliation(s)
- Toby Wise, Department of Neuroimaging, King's College London, London, UK
- Kara Emery, Center for Data Science, New York University, New York, NY, USA
- Angela Radulescu, Center for Computational Psychiatry, Icahn School of Medicine at Mt. Sinai, New York, NY, USA

28
Blanco-Pozo M, Akam T, Walton ME. Dopamine-independent effect of rewards on choices through hidden-state inference. Nat Neurosci 2024; 27:286-297. PMID: 38216649; PMCID: PMC10849965; DOI: 10.1038/s41593-023-01542-x.
Abstract
Dopamine is implicated in adaptive behavior through reward prediction error (RPE) signals that update value estimates. There is also accumulating evidence that animals in structured environments can use inference processes to facilitate behavioral flexibility. However, it is unclear how these two accounts of reward-guided decision-making should be integrated. Using a two-step task for mice, we show that dopamine reports RPEs using value information inferred from task structure knowledge, alongside information about reward rate and movement. Nonetheless, although rewards strongly influenced choices and dopamine activity, neither activating nor inhibiting dopamine neurons at trial outcome affected future choice. These data were recapitulated by a neural network model where cortex learned to track hidden task states by predicting observations, while basal ganglia learned values and actions via RPEs. This shows that the influence of rewards on choices can stem from dopamine-independent information they convey about the world's state, not the dopaminergic RPEs they produce.
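For readers unfamiliar with the reward prediction error (RPE) account that this abstract contrasts with hidden-state inference, a minimal temporal-difference sketch follows. The two-state chain and parameters are toy illustrations, not the paper's two-step task.

```python
# Minimal TD(0) learner: the prediction error delta = r + gamma*V(s') - V(s)
# is the quantity dopamine is classically thought to report.
GAMMA = 1.0
ALPHA = 0.2

V = {"choice": 0.0, "outcome": 0.0}

def td_update(V, s, r, s_next):
    """One TD(0) update for state s; returns the RPE."""
    v_next = V[s_next] if s_next is not None else 0.0
    delta = r + GAMMA * v_next - V[s]
    V[s] += ALPHA * delta
    return delta

# A repeatedly rewarded outcome: the RPE at reward shrinks as V("outcome")
# is learned, and value propagates back to the predictive "choice" state.
errors = []
for _ in range(50):
    td_update(V, "choice", 0.0, "outcome")
    errors.append(td_update(V, "outcome", 1.0, None))
```

The paper's point is that the values entering this delta can themselves be inferred from task-structure knowledge, and that the reward's effect on choice survives even when the dopaminergic delta is silenced.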
Affiliation(s)
- Marta Blanco-Pozo, Department of Experimental Psychology, Oxford University, Oxford, UK; Wellcome Centre for Integrative Neuroimaging, Oxford University, Oxford, UK
- Thomas Akam, Department of Experimental Psychology, Oxford University, Oxford, UK; Wellcome Centre for Integrative Neuroimaging, Oxford University, Oxford, UK
- Mark E Walton, Department of Experimental Psychology, Oxford University, Oxford, UK; Wellcome Centre for Integrative Neuroimaging, Oxford University, Oxford, UK

29
Valentin S, Kleinegesse S, Bramley NR, Seriès P, Gutmann MU, Lucas CG. Designing optimal behavioral experiments using machine learning. eLife 2024; 13:e86224. PMID: 38261382; PMCID: PMC10805374; DOI: 10.7554/elife.86224.
Abstract
Computational models are powerful tools for understanding human cognition and behavior. They let us express our theories clearly and precisely and offer predictions that can be subtle and often counter-intuitive. However, this same richness and ability to surprise mean that our scientific intuitions and traditional tools are ill-suited to designing experiments to test and compare these models. To avoid these pitfalls and realize the full potential of computational modeling, we require tools to design experiments that provide clear answers about which models explain human behavior and the auxiliary assumptions those models must make. Bayesian optimal experimental design (BOED) formalizes the search for optimal experimental designs by identifying experiments that are expected to yield informative data. In this work, we provide a tutorial on leveraging recent advances in BOED and machine learning to find optimal experiments for any kind of model that we can simulate data from, and show how by-products of this procedure allow for quick and straightforward evaluation of models and their parameters against real experimental data. As a case study, we consider theories of how people balance exploration and exploitation in multi-armed bandit decision-making tasks. We validate the presented approach using simulations and a real-world experiment. As compared to experimental designs commonly used in the literature, we show that our optimal designs more efficiently determine which of a set of models best accounts for individual human behavior, and more efficiently characterize behavior given a preferred model. At the same time, formalizing a scientific question such that it can be adequately addressed with BOED can be challenging, and we discuss several potential caveats and pitfalls that practitioners should be aware of. We provide code to replicate all analyses as well as tutorial notebooks and pointers to adapt the methodology to different experimental settings.
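The core BOED quantity, expected information gain (EIG), can be computed exactly in a stripped-down version of the bandit setting: choose the reward-probability gap between two arms that best discriminates a random responder from a softmax value-based chooser. Both candidate models and the softmax temperature are illustrative assumptions, not the models used in the paper.

```python
# Exact expected information gain for model discrimination from one binary
# choice, as a function of the experimental design (the reward gap).
import math

BETA = 3.0  # assumed softmax inverse temperature for the value-based model

def p_choose_best(model, gap):
    """Probability each candidate model assigns to picking the better arm."""
    if model == "random":
        return 0.5
    return 1.0 / (1.0 + math.exp(-BETA * gap))  # softmax over a 2-arm gap

def expected_information_gain(gap, prior=0.5):
    """EIG about model identity: E_y[ KL( p(m|y,d) || p(m) ) ] in nats."""
    eig = 0.0
    for y in (0, 1):  # y = 1: the better arm was chosen
        p_y_m = {m: p_choose_best(m, gap) if y else 1 - p_choose_best(m, gap)
                 for m in ("random", "softmax")}
        p_y = prior * p_y_m["random"] + (1 - prior) * p_y_m["softmax"]
        for m, w in (("random", prior), ("softmax", 1 - prior)):
            eig += w * p_y_m[m] * math.log(p_y_m[m] / p_y)
    return eig

designs = [g / 10 for g in range(11)]
best = max(designs, key=expected_information_gain)
```

In realistic settings this expectation has no closed form, which is where the paper's simulation-based machine-learning estimators come in; the design-search logic is the same.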
Affiliation(s)
- Simon Valentin, School of Informatics, University of Edinburgh, Edinburgh, United Kingdom
- Neil R Bramley, Department of Psychology, University of Edinburgh, Edinburgh, United Kingdom
- Peggy Seriès, School of Informatics, University of Edinburgh, Edinburgh, United Kingdom
- Michael U Gutmann, School of Informatics, University of Edinburgh, Edinburgh, United Kingdom

30
Algermissen J, Swart JC, Scheeringa R, Cools R, den Ouden HEM. Prefrontal signals precede striatal signals for biased credit assignment in motivational learning biases. Nat Commun 2024; 15:19. PMID: 38168089; PMCID: PMC10762147; DOI: 10.1038/s41467-023-44632-x.
Abstract
Actions are biased by the outcomes they can produce: Humans are more likely to show action under reward prospect, but hold back under punishment prospect. Such motivational biases derive not only from biased response selection, but also from biased learning: humans tend to attribute rewards to their own actions, but are reluctant to attribute punishments to having held back. The neural origin of these biases is unclear. Specifically, it remains open whether motivational biases arise primarily from the architecture of subcortical regions or also reflect cortical influences, the latter being typically associated with increased behavioral flexibility and control beyond stereotyped behaviors. Simultaneous EEG-fMRI allowed us to track which regions encoded biased prediction errors in which order. Biased prediction errors occurred in cortical regions (dorsal anterior and posterior cingulate cortices) before subcortical regions (striatum). These results highlight that biased learning is not a mere feature of the basal ganglia, but arises through prefrontal cortical contributions, revealing motivational biases to be a potentially flexible, sophisticated mechanism.
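The "biased learning" described above can be captured by learning-rate asymmetries: rewards following an action are credited in full, while punishments following inaction are only partially credited. The asymmetric rates below are illustrative assumptions, not the paper's fitted model.

```python
# Sketch of biased credit assignment with an attenuated learning rate for
# the disfavored action-outcome pairing (NoGo followed by punishment).
ALPHA = 0.3
BIAS = 0.5   # fraction of the update retained for NoGo + punishment

def biased_update(q, action, outcome):
    """One biased value update for an (action, outcome) pair."""
    kappa = BIAS if (action == "nogo" and outcome < 0) else 1.0
    return q + ALPHA * kappa * (outcome - q)

# Same outcome magnitudes, opposite pairings: the Go action's value grows
# toward +1 faster than the NoGo value shrinks toward -1.
q_go, q_nogo = 0.0, 0.0
for _ in range(10):
    q_go = biased_update(q_go, "go", +1.0)      # full credit
    q_nogo = biased_update(q_nogo, "nogo", -1.0)  # attenuated credit
```

The paper's EEG-fMRI contribution is about *where* such biased prediction errors are computed (cingulate before striatum), not the update rule itself.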
Affiliation(s)
- Johannes Algermissen, Radboud University, Donders Institute for Brain, Cognition and Behaviour, Nijmegen, The Netherlands
- Jennifer C Swart, Radboud University, Donders Institute for Brain, Cognition and Behaviour, Nijmegen, The Netherlands
- René Scheeringa, Radboud University, Donders Institute for Brain, Cognition and Behaviour, Nijmegen, The Netherlands; Erwin L. Hahn Institute for Magnetic Resonance Imaging, University of Duisburg-Essen, Essen, Germany
- Roshan Cools, Radboud University, Donders Institute for Brain, Cognition and Behaviour, Nijmegen, The Netherlands; Department of Psychiatry, Radboud University Medical Centre, Nijmegen, The Netherlands
- Hanneke E M den Ouden, Radboud University, Donders Institute for Brain, Cognition and Behaviour, Nijmegen, The Netherlands

31
Seifert G, Sealander A, Marzen S, Levin M. From reinforcement learning to agency: Frameworks for understanding basal cognition. Biosystems 2024; 235:105107. PMID: 38128873; DOI: 10.1016/j.biosystems.2023.105107.
Abstract
Organisms play, explore, and mimic those around them. Is there a purpose to this behavior? Are organisms just behaving, or are they trying to achieve goals? We believe this is a false dichotomy. To that end, to understand organisms, we attempt to unify two approaches for understanding complex agents, whether evolved or engineered. We argue that formalisms describing multiscale competencies and goal-directedness in biology (e.g., TAME), and reinforcement learning (RL), can be combined in a symbiotic framework. While RL has been largely focused on higher-level organisms and robots of high complexity, TAME is naturally capable of describing lower-level organisms and minimal agents as well. We propose several novel questions that come from using RL/TAME to understand biology as well as ones that come from using biology to formulate new theory in AI. We hope that the research programs proposed in this piece shape future efforts to understand biological organisms and also future efforts to build artificial agents.
Affiliation(s)
- Gabriella Seifert, Department of Physics, University of Colorado, Boulder, CO 80309, USA; W. M. Keck Science Department, Pitzer, Scripps, and Claremont McKenna College, Claremont, CA 91711, USA
- Ava Sealander, Department of Electrical Engineering, School of Engineering and Applied Sciences, Columbia University, New York, NY 10027, USA; W. M. Keck Science Department, Pitzer, Scripps, and Claremont McKenna College, Claremont, CA 91711, USA
- Sarah Marzen, W. M. Keck Science Department, Pitzer, Scripps, and Claremont McKenna College, Claremont, CA 91711, USA
- Michael Levin, Department of Biology, Tufts University, Medford, MA 02155, USA; Allen Discovery Center at Tufts University, Medford, MA 02155, USA

32
Leimar O, Quiñones AE, Bshary R. Flexible learning in complex worlds. Behav Ecol 2024; 35:arad109. PMID: 38162692; PMCID: PMC10756056; DOI: 10.1093/beheco/arad109.
Abstract
Cognitive flexibility can enhance the ability to adjust to changing environments. Here, we use learning simulations to investigate the possible advantages of flexible learning in volatile (changing) environments. We compare two established learning mechanisms, one with constant learning rates and one with rates that adjust to volatility. We study an ecologically relevant case of volatility, based on observations of developing cleaner fish Labroides dimidiatus that experience a transition from a simpler to a more complex foraging environment. There are other similar transitions in nature, such as migrating to a new and different habitat. We also examine two traditional approaches to volatile environments in experimental psychology and behavioral ecology: reversal learning, and learning set formation (consisting of a sequence of different discrimination tasks). These provide experimental measures of cognitive flexibility. Concerning transitions to a complex world, we show that both constant and flexible learning rates perform well, losing only a small proportion of available rewards in the period after a transition, but flexible rates perform better than constant rates. For reversal learning, flexible rates improve the performance with each successive reversal because of increasing learning rates, but this does not happen for constant rates. For learning set formation, we find no improvement in performance with successive shifts to new stimuli to discriminate for either flexible or constant learning rates. Flexible learning rates might thus explain increasing performance in reversal learning but not in learning set formation, and this can shed light on the nature of cognitive flexibility in a given system.
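The contrast between constant and volatility-adjusted learning rates can be sketched with a Pearce-Hall-style rule, in which the rate tracks recent surprise, used here as an illustrative stand-in for the adaptive mechanism the paper simulates. Parameter values are arbitrary.

```python
# Rescorla-Wagner learner with a constant rate vs. a flexible learner whose
# rate follows recent |prediction error|, so it speeds up after a reversal.
ETA = 0.3          # how fast the flexible rate tracks surprise
ALPHA_FIXED = 0.15

def run(rewards, flexible):
    """Return the trial-by-trial learning rates used on a reward sequence."""
    v, alpha = 0.0, ALPHA_FIXED
    alphas = []
    for r in rewards:
        delta = r - v
        if flexible:
            alpha = (1 - ETA) * alpha + ETA * abs(delta)  # surprise-driven
        alphas.append(alpha)
        v += alpha * delta
    return alphas

# A stable world, then a reversal at trial 30
rewards = [1.0] * 30 + [0.0] * 30
flex = run(rewards, flexible=True)
fixed = run(rewards, flexible=False)
```

The flexible rate decays while the world is stable and spikes right after the reversal, which is the mechanism behind the improvement across successive reversals described above; the constant rate cannot do either.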
Affiliation(s)
- Olof Leimar, Department of Zoology, Stockholm University, 106 91 Stockholm, Sweden
- Andrés E Quiñones, Institute of Biology, University of Neuchâtel, Emile-Argand 11, 2000 Neuchâtel, Switzerland
- Redouan Bshary, Institute of Biology, University of Neuchâtel, Emile-Argand 11, 2000 Neuchâtel, Switzerland

33
Quispe Escudero D. It's all about making new contacts: How being metabotropic and phasicity help D1-like receptors promote LTP in the PFC. Prog Neuropsychopharmacol Biol Psychiatry 2023; 127:110784. PMID: 37169273; DOI: 10.1016/j.pnpbp.2023.110784.
Abstract
D1-like receptors have two important qualities: they are metabotropic, and they are activated by phasic dopamine. After analyzing the molecular implications of each of these qualities separately and then combining them for the specific case of the prefrontal cortex, we propose a model that explains why long-term potentiation in this cortical area depends on the amount of contact between D1-like receptors and dopamine. This simple model also explains why, in order to promote long-term potentiation, dopamine transporters should be scarce in the prefrontal cortex. Additionally, it explains why stimulants like methamphetamine could have such detrimental cognitive effects on regular substance consumers.
Affiliation(s)
- David Quispe Escudero, Departamento de Psicobiología, Facultad de Psicología, Universidad Complutense de Madrid, Madrid E-28040, Spain

34
Bernotat J, Landolfi L, Pasquali D, Nardelli A, Rea F. Remember me - user-centered implementation of working memory architectures on an industrial robot. Front Robot AI 2023; 10:1257690. PMID: 38116169; PMCID: PMC10728719; DOI: 10.3389/frobt.2023.1257690.
Abstract
The present research is innovative as we followed a user-centered approach to implement and train two working memory architectures on an industrial RB-KAIROS + robot: GRU, a state-of-the-art architecture, and WorkMATe, a biologically inspired alternative. Although user-centered approaches are essential to create a comfortable and safe human-robot interaction (HRI), they are still rare in industrial settings. Closing this research gap, we conducted two online user studies with large heterogeneous samples. The major aim of these studies was to evaluate the RB-KAIROS + robot's appearance, movements, and perceived memory functions before (User Study 1) and after the implementation and training of robot working memory (User Study 2). In User Study 1, we furthermore explored participants' ideas about robot memory and what aspects of the robot's movements participants found positive and what aspects they would change. The effects of participants' demographic background and attitudes were controlled for. In User Study 1, participants' overall evaluations of the robot were moderate. Participant age and negative attitudes toward robots led to more negative robot evaluations. According to exploratory analyses, these effects were driven by perceived low experience with robots. Participants expressed clear ideas of robot memory and precise suggestions for a safe, efficient, and comfortable robot navigation which are valuable for further research and development. In User Study 2, the implementation of WorkMATe and GRU led to more positive evaluations of perceived robot memory, but not of robot appearance and movements. Participants' robot evaluations were driven by their positive views of robots. Our results demonstrate that considering potential users' views can greatly contribute to an efficient and positively perceived robot navigation, while users' experience with robots is crucial for a positive HRI.
Affiliation(s)
- Jasmin Bernotat, COgNiTive Architecture for Collaborative Technologies (CONTACT) Unit, Italian Institute of Technology (IIT), Genoa, Italy
- Lorenzo Landolfi, COgNiTive Architecture for Collaborative Technologies (CONTACT) Unit, Italian Institute of Technology (IIT), Genoa, Italy
- Dario Pasquali, COgNiTive Architecture for Collaborative Technologies (CONTACT) Unit, Italian Institute of Technology (IIT), Genoa, Italy
- Alice Nardelli, COgNiTive Architecture for Collaborative Technologies (CONTACT) Unit, Italian Institute of Technology (IIT), Genoa, Italy; Department of Informatics, Bioengineering, Robotics and Systems Engineering (DIBRIS), University of Genoa, Genoa, Italy
- Francesco Rea, COgNiTive Architecture for Collaborative Technologies (CONTACT) Unit, Italian Institute of Technology (IIT), Genoa, Italy

35
Hattori R, Hedrick NG, Jain A, Chen S, You H, Hattori M, Choi JH, Lim BK, Yasuda R, Komiyama T. Meta-reinforcement learning via orbitofrontal cortex. Nat Neurosci 2023; 26:2182-2191. PMID: 37957318; PMCID: PMC10689244; DOI: 10.1038/s41593-023-01485-3.
Abstract
The meta-reinforcement learning (meta-RL) framework, which involves RL over multiple timescales, has been successful in training deep RL models that generalize to new environments. It has been hypothesized that the prefrontal cortex may mediate meta-RL in the brain, but the evidence is scarce. Here we show that the orbitofrontal cortex (OFC) mediates meta-RL. We trained mice and deep RL models on a probabilistic reversal learning task across sessions during which they improved their trial-by-trial RL policy through meta-learning. Ca2+/calmodulin-dependent protein kinase II-dependent synaptic plasticity in OFC was necessary for this meta-learning but not for the within-session trial-by-trial RL in experts. After meta-learning, OFC activity robustly encoded value signals, and OFC inactivation impaired the RL behaviors. Longitudinal tracking of OFC activity revealed that meta-learning gradually shapes population value coding to guide the ongoing behavioral policy. Our results indicate that two distinct RL algorithms with distinct neural mechanisms and timescales coexist in OFC to support adaptive decision-making.
Affiliation(s)
- Ryoma Hattori, Department of Neurobiology, Center for Neural Circuits and Behavior, Department of Neurosciences, and Halıcıoğlu Data Science Institute, University of California San Diego, La Jolla, CA, USA; Department of Neuroscience, The Herbert Wertheim UF Scripps Institute for Biomedical Innovation & Technology, University of Florida, Jupiter, FL, USA
- Nathan G Hedrick, Department of Neurobiology, Center for Neural Circuits and Behavior, Department of Neurosciences, and Halıcıoğlu Data Science Institute, University of California San Diego, La Jolla, CA, USA
- Anant Jain, Max Planck Florida Institute for Neuroscience, Jupiter, FL, USA
- Shuqi Chen, Department of Neurobiology, Center for Neural Circuits and Behavior, Department of Neurosciences, and Halıcıoğlu Data Science Institute, University of California San Diego, La Jolla, CA, USA
- Hanjia You, Department of Neurobiology, Center for Neural Circuits and Behavior, Department of Neurosciences, and Halıcıoğlu Data Science Institute, University of California San Diego, La Jolla, CA, USA
- Mariko Hattori, Department of Neurobiology, Center for Neural Circuits and Behavior, Department of Neurosciences, and Halıcıoğlu Data Science Institute, University of California San Diego, La Jolla, CA, USA
- Jun-Hyeok Choi, Department of Neurobiology, University of California San Diego, La Jolla, CA, USA
- Byung Kook Lim, Department of Neurobiology, University of California San Diego, La Jolla, CA, USA
- Ryohei Yasuda, Max Planck Florida Institute for Neuroscience, Jupiter, FL, USA
- Takaki Komiyama, Department of Neurobiology, Center for Neural Circuits and Behavior, Department of Neurosciences, and Halıcıoğlu Data Science Institute, University of California San Diego, La Jolla, CA, USA

36
Jiménez GA, de la Escalera Hueso A, Gómez-Silva MJ. Reinforcement Learning Algorithms for Autonomous Mission Accomplishment by Unmanned Aerial Vehicles: A Comparative View with DQN, SARSA and A2C. Sensors (Basel) 2023; 23:9013. PMID: 37960711; PMCID: PMC10649256; DOI: 10.3390/s23219013.
Abstract
Unmanned aerial vehicles (UAVs) can be controlled in diverse ways. One of the most common is through artificial intelligence (AI), which comprises different methods, such as reinforcement learning (RL). The article aims to provide a comparison of three RL algorithms (DQN as the benchmark, SARSA as a same-family algorithm, and A2C as a different-structure one) to address the problem of a UAV navigating from departure point A to endpoint B while avoiding obstacles and, simultaneously, using the least possible time and flying the shortest distance. Under fixed premises, this investigation reports the performance obtained for this task. A neighborhood environment was selected because it is likely one of the most common areas of use for commercial drones. Taking DQN as the benchmark and not having previous knowledge of the behavior of SARSA or A2C in the employed environment, the comparison showed that DQN was the only algorithm to achieve the target, while SARSA and A2C did not. However, a deeper analysis of the results led to the conclusion that a fine-tuning of A2C could overcome the performance of DQN under certain conditions, finding the maximum faster with a more straightforward structure.
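The structural difference between two of the compared algorithms fits in a few lines: Q-learning (the tabular ancestor of DQN) bootstraps from the *greedy* next action, while SARSA bootstraps from the action actually taken. The two-state example and parameters are generic illustrations, not the paper's UAV environment.

```python
# Off-policy Q-learning vs. on-policy SARSA: identical experience, different
# backup targets when the next action taken is not the greedy one.
ALPHA, GAMMA = 0.5, 0.9

def q_learning_update(Q, s, a, r, s_next):
    best_next = max(Q[s_next])                 # greedy backup (off-policy)
    Q[s][a] += ALPHA * (r + GAMMA * best_next - Q[s][a])

def sarsa_update(Q, s, a, r, s_next, a_next):
    Q[s][a] += ALPHA * (r + GAMMA * Q[s_next][a_next] - Q[s][a])  # on-policy

# Two states, two actions; in state 1, action 0 is worth 1.0, action 1 is 0.
Q1 = [[0.0, 0.0], [1.0, 0.0]]
Q2 = [[0.0, 0.0], [1.0, 0.0]]
q_learning_update(Q1, 0, 0, r=0.0, s_next=1)          # backs up max = 1.0
sarsa_update(Q2, 0, 0, r=0.0, s_next=1, a_next=1)     # backs up Q[1][1] = 0
print(Q1[0][0], Q2[0][0])  # Q-learning moved, SARSA did not
```

A2C differs more fundamentally: it learns a parameterized policy and a value baseline rather than an action-value table, which is why its tuning behaves so differently in the comparison above.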
Affiliation(s)
- Gonzalo Aguilar Jiménez, Dana SAC Spain, S.A., Dana Off-Highway, C/Abedul S/N, Pol. Ind. Los Huertecillos, 28350 Ciempozuelos, Madrid, Spain
- Arturo de la Escalera Hueso, Intelligent Systems Lab, Universidad Carlos III de Madrid, Avda de la Universidad 30, 28911 Leganés, Madrid, Spain
- Maria J. Gómez-Silva, Department of Computer Architecture and Automation, Facultad de Ciencias Físicas, Universidad Complutense de Madrid, Plaza Ciencias 1, 28040 Madrid, Spain

37
Krausz TA, Comrie AE, Kahn AE, Frank LM, Daw ND, Berke JD. Dual credit assignment processes underlie dopamine signals in a complex spatial environment. Neuron 2023; 111:3465-3478.e7. PMID: 37611585; PMCID: PMC10841332; DOI: 10.1016/j.neuron.2023.07.017.
Abstract
Animals frequently make decisions based on expectations of future reward ("values"). Values are updated by ongoing experience: places and choices that result in reward are assigned greater value. Yet, the specific algorithms used by the brain for such credit assignment remain unclear. We monitored accumbens dopamine as rats foraged for rewards in a complex, changing environment. We observed brief dopamine pulses both at reward receipt (scaling with prediction error) and at novel path opportunities. Dopamine also ramped up as rats ran toward reward ports, in proportion to the value at each location. By examining the evolution of these dopamine place-value signals, we found evidence for two distinct update processes: progressive propagation of value along taken paths, as in temporal difference learning, and inference of value throughout the maze, using internal models. Our results demonstrate that within rich, naturalistic environments dopamine conveys place values that are updated via multiple, complementary learning algorithms.
Affiliation(s)
- Timothy A Krausz, Neuroscience Graduate Program, University of California, San Francisco, San Francisco, CA 94158, USA
- Alison E Comrie, Neuroscience Graduate Program, University of California, San Francisco, San Francisco, CA 94158, USA
- Ari E Kahn, Department of Psychology and Princeton Neuroscience Institute, Princeton University, Princeton, NJ 08544, USA
- Loren M Frank, Neuroscience Graduate Program, University of California, San Francisco, San Francisco, CA 94158, USA; Howard Hughes Medical Institute, Chevy Chase, MD 20815, USA; Department of Physiology, University of California, San Francisco, San Francisco, CA 94158, USA
- Nathaniel D Daw, Department of Psychology and Princeton Neuroscience Institute, Princeton University, Princeton, NJ 08544, USA
- Joshua D Berke, Neuroscience Graduate Program, University of California, San Francisco, San Francisco, CA 94158, USA; Kavli Institute for Fundamental Neuroscience and Weill Institute for Neurosciences, University of California, San Francisco, San Francisco, CA 94158, USA; Department of Neurology and Department of Psychiatry and Behavioral Science, University of California, San Francisco, San Francisco, CA 94158, USA

38
Tsuda B, Richmond BJ, Sejnowski TJ. Exploring strategy differences between humans and monkeys with recurrent neural networks. PLoS Comput Biol 2023; 19:e1011618. PMID: 37983250; PMCID: PMC10695363; DOI: 10.1371/journal.pcbi.1011618.
Abstract
Animal models are used to understand principles of human biology. Within cognitive neuroscience, non-human primates are considered the premier model for studying decision-making behaviors in which direct manipulation experiments are still possible. Some prominent studies have brought to light major discrepancies between monkey and human cognition, highlighting problems with unverified extrapolation from monkey to human. Here, we use a parallel model system, artificial neural networks (ANNs), to investigate a well-established discrepancy identified between monkeys and humans with a working memory task, in which monkeys appear to use a recency-based strategy while humans use a target-selective strategy. We find that ANNs trained on the same task exhibit a progression of behavior from random behavior (untrained) to recency-like behavior (partially trained) and finally to selective behavior (further trained), suggesting monkeys and humans may occupy different points in the same overall learning progression. Surprisingly, what appears to be recency-like behavior in the ANN is in fact an emergent non-recency-based property of the organization of the neural network's state space during its development through training. We find that explicit encouragement of recency behavior during training has a dual effect, not only causing an accentuated recency-like behavior, but also speeding up the learning process altogether, resulting in an efficient shaping mechanism to achieve the optimal strategy. Our results suggest a new explanation for the discrepancy observed between monkeys and humans and reveal that what can appear to be a recency-based strategy in some cases may not be recency at all.
Affiliation(s)
- Ben Tsuda, Computational Neurobiology Laboratory, The Salk Institute for Biological Studies, La Jolla, California, United States of America; Neurosciences Graduate Program, University of California San Diego, La Jolla, California, United States of America; Medical Scientist Training Program, University of California San Diego, La Jolla, California, United States of America
- Barry J. Richmond, Section on Neural Coding and Computation, National Institute of Mental Health, Bethesda, Maryland, United States of America
- Terrence J. Sejnowski, Computational Neurobiology Laboratory, The Salk Institute for Biological Studies, La Jolla, California, United States of America; Institute for Neural Computation, University of California San Diego, La Jolla, California, United States of America; Division of Biological Sciences, University of California San Diego, La Jolla, California, United States of America

39
Soo WWM, Goudar V, Wang XJ. Training biologically plausible recurrent neural networks on cognitive tasks with long-term dependencies. bioRxiv 2023:2023.10.10.561588. PMID: 37873445; PMCID: PMC10592728; DOI: 10.1101/2023.10.10.561588.
Abstract
Training recurrent neural networks (RNNs) has become a go-to approach for generating and evaluating mechanistic neural hypotheses for cognition. The ease and efficiency of training RNNs with backpropagation through time and the availability of robustly supported deep learning libraries have made RNN modeling more approachable and accessible to neuroscience. Yet, a major technical hindrance remains. Cognitive processes such as working memory and decision making involve neural population dynamics over a long period of time within a behavioral trial and across trials. It is difficult to train RNNs to accomplish tasks where neural representations and dynamics have long temporal dependencies without gating mechanisms such as LSTMs or GRUs, which currently lack experimental support and prohibit direct comparison between RNNs and biological neural circuits. We tackled this problem based on the idea of specialized skip-connections through time to support the emergence of task-relevant dynamics, and subsequently reinstitute biological plausibility by reverting to the original architecture. We show that this approach enables RNNs to successfully learn cognitive tasks that prove impractical if not impossible to learn using conventional methods. Over the numerous tasks considered here, we achieve fewer training steps and shorter wall-clock times, particularly in tasks that require learning long-term dependencies via temporal integration over long timescales or maintaining a memory of past events in hidden states. Our methods expand the range of experimental tasks that biologically plausible RNN models can learn, thereby supporting the development of theory for the emergent neural mechanisms of computations involving long-term dependencies.
40
Rajagopalan AE, Darshan R, Hibbard KL, Fitzgerald JE, Turner GC. Reward expectations direct learning and drive operant matching in Drosophila. Proc Natl Acad Sci U S A 2023; 120:e2221415120. [PMID: 37733736 PMCID: PMC10523640 DOI: 10.1073/pnas.2221415120] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2023] [Accepted: 08/11/2023] [Indexed: 09/23/2023] Open
Abstract
Foraging animals must use decision-making strategies that dynamically adapt to the changing availability of rewards in the environment. A wide diversity of animals do this by distributing their choices in proportion to the rewards received from each option, a behavior known as Herrnstein's operant matching law. Theoretical work suggests an elegant mechanistic explanation for this ubiquitous behavior: operant matching follows automatically from simple synaptic plasticity rules acting within behaviorally relevant neural circuits. However, no past work has mapped operant matching onto plasticity mechanisms in the brain, leaving the biological relevance of the theory unclear. Here, we discovered operant matching in Drosophila and showed that it requires synaptic plasticity that acts in the mushroom body and incorporates the expectation of reward. We began by developing a dynamic foraging paradigm to measure choices from individual flies as they learn to associate odor cues with probabilistic rewards. We then built a model of the fly mushroom body to explain each fly's sequential choice behavior using a family of biologically realistic synaptic plasticity rules. As predicted by past theoretical work, we found that synaptic plasticity rules could explain fly matching behavior by incorporating stimulus expectations, reward expectations, or both. However, by optogenetically bypassing the representation of reward expectation, we abolished matching behavior and showed that the plasticity rule must specifically incorporate reward expectations. Altogether, these results reveal the first synapse-level mechanisms of operant matching and provide compelling evidence for the role of reward expectation signals in the fly brain.
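The matching behavior at the heart of this study can be illustrated with a toy simulation, assuming a simple "fractional income" allocation rule rather than the paper's mushroom-body model; the schedule probabilities and decay constant below are illustrative. On a baited (variable-interval-like) two-option schedule, allocating choices in proportion to a leaky estimate of income from each option yields Herrnstein matching: the fraction of choices to an option tracks the fraction of rewards obtained from it.

```python
import random

def simulate_matching(p_bait=(0.4, 0.1), n_trials=20000, tau=0.005, seed=0):
    """Toy two-option dynamic foraging task with baiting: an unchosen
    option keeps its reward until collected.  The agent allocates choices
    in proportion to leaky per-option income estimates."""
    rng = random.Random(seed)
    baited = [False, False]
    income = [0.5, 0.5]            # leaky estimates of reward income per option
    choices = [0, 0]
    rewards = [0, 0]
    for _ in range(n_trials):
        # Baiting: each option arms independently and holds its reward.
        for i in (0, 1):
            if rng.random() < p_bait[i]:
                baited[i] = True
        # Income-proportional (matching) choice allocation.
        p0 = income[0] / (income[0] + income[1])
        c = 0 if rng.random() < p0 else 1
        r = 1 if baited[c] else 0
        baited[c] = False
        choices[c] += 1
        rewards[c] += r
        # Leaky-integrator update: each option's income tracks the
        # reward stream actually obtained from it.
        for i in (0, 1):
            income[i] += tau * ((r if i == c else 0) - income[i])
    choice_frac = choices[0] / n_trials
    reward_frac = rewards[0] / max(1, sum(rewards))
    return choice_frac, reward_frac

choice_frac, reward_frac = simulate_matching()
# Matching law: the fraction of choices to option 0 tracks the
# fraction of obtained rewards that came from option 0.
```

Baiting is what makes matching non-trivial here: with plain Bernoulli rewards, this allocation rule would collapse toward exclusive choice of the richer option, whereas baiting keeps both options worth sampling.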
Affiliation(s)
- Adithya E. Rajagopalan
- Janelia Research Campus, HHMI, Ashburn, VA 20147
- Solomon H. Snyder Department of Neuroscience, Johns Hopkins School of Medicine, Baltimore, MD 21205
| | - Ran Darshan
- Janelia Research Campus, HHMI, Ashburn, VA 20147
- Department of Physiology and Pharmacology, Sackler Faculty of Medicine, Sagol School of Neuroscience, The School of Physics and Astronomy, Tel Aviv University, Tel Aviv 6997801, Israel
41
Miconi T, Kay K. An active neural mechanism for relational learning and fast knowledge reassembly. bioRxiv [Preprint] 2023:2023.07.27.550739. [PMID: 37546842 PMCID: PMC10402151 DOI: 10.1101/2023.07.27.550739] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/08/2023]
Abstract
How do we gain general insights from limited novel experiences? Humans and animals have a striking ability to learn relationships between experienced items, enabling efficient generalization and rapid assimilation of new information. One fundamental instance of such relational learning is transitive inference (learn A>B and B>C, infer A>C), which can be quickly and globally reorganized upon learning a new item (learn A>B>C and D>E>F, then C>D, and infer B>E). Despite considerable study, neural mechanisms of transitive inference and fast reassembly of existing knowledge remain elusive. Here we adopt a meta-learning ("learning-to-learn") approach. We train artificial neural networks, endowed with synaptic plasticity and neuromodulation, to be able to learn novel orderings of arbitrary stimuli from repeated presentation of stimulus pairs. We then obtain a complete mechanistic understanding of this discovered neural learning algorithm. Remarkably, this learning involves active cognition: items from previous trials are selectively reinstated in working memory, enabling delayed, self-generated learning and knowledge reassembly. These findings identify a new mechanism for relational learning and insight, suggest new interpretations of neural activity in cognitive tasks, and highlight a novel approach to discovering neural mechanisms capable of supporting cognitive behaviors.
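The transitive-inference task itself (though not the meta-learned network mechanism the paper uncovers) can be sketched with a simple delta rule that compresses pairwise outcomes into scalar ranks; the learning rate and epoch count are illustrative. Trained only on adjacent premise pairs, the ranks settle into an ordered gradient from which unseen comparisons such as B>D can be read out.

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def train_ranks(pairs, epochs=2000, lr=0.05):
    """Learn one scalar rank per item from (winner, loser) pairs with a
    delta rule: updates are large when the observed outcome is surprising
    under the current ranks, and shrink as the ranks explain the data."""
    items = {i for pair in pairs for i in pair}
    rank = {i: 0.0 for i in items}
    for _ in range(epochs):
        for winner, loser in pairs:
            p_win = sigmoid(rank[winner] - rank[loser])
            rank[winner] += lr * (1.0 - p_win)
            rank[loser] -= lr * (1.0 - p_win)
    return rank

# Train only on adjacent premise pairs: A>B, B>C, C>D, D>E.
adjacent = [("A", "B"), ("B", "C"), ("C", "D"), ("D", "E")]
rank = train_ranks(adjacent)
# Transitive inference: B>D was never presented, but follows from the
# ordered gradient of learned ranks.
infers_b_gt_d = rank["B"] > rank["D"]
```

At equilibrium the interior items space themselves evenly between the end anchors, which is why non-adjacent probes fall out of the ranks for free.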
42
Hennig JA, Romero Pinto SA, Yamaguchi T, Linderman SW, Uchida N, Gershman SJ. Emergence of belief-like representations through reinforcement learning. PLoS Comput Biol 2023; 19:e1011067. [PMID: 37695776 PMCID: PMC10513382 DOI: 10.1371/journal.pcbi.1011067] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2023] [Revised: 09/21/2023] [Accepted: 08/27/2023] [Indexed: 09/13/2023] Open
Abstract
To behave adaptively, animals must learn to predict future reward, or value. To do this, animals are thought to learn reward predictions using reinforcement learning. However, in contrast to classical models, animals must learn to estimate value using only incomplete state information. Previous work suggests that animals estimate value in partially observable tasks by first forming "beliefs", optimal Bayesian estimates of the hidden states in the task. Although this is one way to solve the problem of partial observability, it is not the only way, nor is it the most computationally scalable solution in complex, real-world environments. Here we show that a recurrent neural network (RNN) can learn to estimate value directly from observations, generating reward prediction errors that resemble those observed experimentally, without any explicit objective of estimating beliefs. We integrate statistical, functional, and dynamical systems perspectives on beliefs to show that the RNN's learned representation encodes belief information, but only when the RNN's capacity is sufficiently large. These results illustrate how animals can estimate value in tasks without explicitly estimating beliefs, yielding a representation useful for systems with limited capacity.
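For orientation, the classical reward-prediction-error setup that the RNN is compared against can be sketched with tabular TD(0) on a fully observable chain; partial observability, the paper's central complication, is deliberately omitted, and all parameters are illustrative.

```python
def td_chain(n_states=5, alpha=0.1, n_episodes=500):
    """Tabular TD(0) on a deterministic chain S0 -> S1 -> ... -> reward.
    Returns the learned state values and the reward-prediction errors
    (RPEs) recorded on the final episode."""
    V = [0.0] * n_states
    for _ in range(n_episodes):
        rpes = []
        for s in range(n_states):
            if s < n_states - 1:
                r, v_next = 0.0, V[s + 1]
            else:
                r, v_next = 1.0, 0.0          # reward delivered at the end
            delta = r + v_next - V[s]         # reward-prediction error
            rpes.append(delta)
            V[s] += alpha * delta
    return V, rpes

V, rpes = td_chain()
# After learning, every state predicts the upcoming reward, and the RPE
# at reward delivery has shrunk toward zero: the dopamine-like signature
# that the paper's RNN reproduces from raw observations.
```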
Affiliation(s)
- Jay A. Hennig
- Department of Psychology, Harvard University, Cambridge, Massachusetts, United States of America
- Center for Brain Science, Harvard University, Cambridge, Massachusetts, United States of America
| | - Sandra A. Romero Pinto
- Center for Brain Science, Harvard University, Cambridge, Massachusetts, United States of America
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, Massachusetts, United States of America
- Program in Speech and Hearing Bioscience and Technology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Takahiro Yamaguchi
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, Massachusetts, United States of America
- Future Research Department, Toyota Research Institute of North America, Toyota Motor North America, Ann Arbor, Michigan, United States of America
| | - Scott W. Linderman
- Wu Tsai Neurosciences Institute, Stanford University, Stanford, California, United States of America
- Department of Statistics, Stanford University, Stanford, California, United States of America
| | - Naoshige Uchida
- Center for Brain Science, Harvard University, Cambridge, Massachusetts, United States of America
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, Massachusetts, United States of America
| | - Samuel J. Gershman
- Department of Psychology, Harvard University, Cambridge, Massachusetts, United States of America
- Center for Brain Science, Harvard University, Cambridge, Massachusetts, United States of America
43
Kumar S, Dasgupta I, Daw ND, Cohen JD, Griffiths TL. Disentangling Abstraction from Statistical Pattern Matching in Human and Machine Learning. PLoS Comput Biol 2023; 19:e1011316. [PMID: 37624841 PMCID: PMC10497163 DOI: 10.1371/journal.pcbi.1011316] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2023] [Revised: 09/12/2023] [Accepted: 06/29/2023] [Indexed: 08/27/2023] Open
Abstract
The ability to acquire abstract knowledge is a hallmark of human intelligence and is believed by many to be one of the core differences between humans and neural network models. Agents can be endowed with an inductive bias towards abstraction through meta-learning, where they are trained on a distribution of tasks that share some abstract structure that can be learned and applied. However, because neural networks are hard to interpret, it can be difficult to tell whether agents have learned the underlying abstraction, or alternatively statistical patterns that are characteristic of that abstraction. In this work, we compare the performance of humans and agents in a meta-reinforcement learning paradigm in which tasks are generated from abstract rules. We define a novel methodology for building "task metamers" that closely match the statistics of the abstract tasks but use a different underlying generative process, and evaluate performance on both abstract and metamer tasks. We find that humans perform better at abstract tasks than metamer tasks whereas common neural network architectures typically perform worse on the abstract tasks than the matched metamers. This work provides a foundation for characterizing differences between humans and machine learning that can be used in future work towards developing machines with more human-like behavior.
Affiliation(s)
- Sreejan Kumar
- Neuroscience Institute, Princeton University, Princeton, New Jersey, United States of America
| | | | - Nathaniel D. Daw
- Neuroscience Institute, Princeton University, Princeton, New Jersey, United States of America
- Department of Psychology, Princeton University, Princeton, New Jersey, United States of America
| | - Jonathan D. Cohen
- Neuroscience Institute, Princeton University, Princeton, New Jersey, United States of America
- Department of Psychology, Princeton University, Princeton, New Jersey, United States of America
| | - Thomas L. Griffiths
- Department of Psychology, Princeton University, Princeton, New Jersey, United States of America
- Department of Computer Science, Princeton University, Princeton, New Jersey, United States of America
44
Astle DE, Johnson MH, Akarca D. Toward computational neuroconstructivism: a framework for developmental systems neuroscience. Trends Cogn Sci 2023; 27:726-744. [PMID: 37263856 DOI: 10.1016/j.tics.2023.04.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2022] [Revised: 01/05/2023] [Accepted: 04/19/2023] [Indexed: 06/03/2023]
Abstract
Brain development is underpinned by complex interactions between neural assemblies, driving structural and functional change. This neuroconstructivism (the notion that neural functions are shaped by these interactions) is core to some developmental theories. However, because of this complexity, understanding the underlying developmental mechanisms is challenging. Elsewhere in neurobiology, a computational revolution has shown that mathematical models of hidden biological mechanisms can bridge observations with theory building. Can we build a similar computational framework yielding mechanistic insights for brain development? Here, we outline the conceptual and technical challenges of addressing this theory gap, and demonstrate that there is great potential in specifying brain development as mathematically defined processes operating within physical constraints. We provide examples, alongside the broader ingredients needed, as the field explores computational explanations of system-wide development.
Affiliation(s)
- Duncan E Astle
- Department of Psychiatry, University of Cambridge, Cambridge, CB2 2QQ, UK; MRC Cognition and Brain Sciences Unit, University of Cambridge, Cambridge, CB2 7EF, UK.
| | - Mark H Johnson
- Department of Psychology, University of Cambridge, Cambridge, CB2 3EB, UK; Centre for Brain and Cognitive Development, Birkbeck, University of London, London, WC1E 7JL, UK
| | - Danyal Akarca
- MRC Cognition and Brain Sciences Unit, University of Cambridge, Cambridge, CB2 7EF, UK
45
Sugiyama T, Schweighofer N, Izawa J. Reinforcement learning establishes a minimal metacognitive process to monitor and control motor learning performance. Nat Commun 2023; 14:3988. [PMID: 37422476 PMCID: PMC10329706 DOI: 10.1038/s41467-023-39536-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2022] [Accepted: 06/16/2023] [Indexed: 07/10/2023] Open
Abstract
Humans and animals develop learning-to-learn strategies throughout their lives to accelerate learning. One theory suggests that this is achieved by a metacognitive process of controlling and monitoring learning. Although such learning-to-learn is also observed in motor learning, the metacognitive aspect of learning regulation has not been considered in classical theories of motor learning. Here, we formulated a minimal mechanism of this process as reinforcement learning of motor learning properties, which regulates a policy for memory update in response to sensory prediction error while monitoring its performance. This theory was confirmed in human motor learning experiments, in which the subjective sense of learning-outcome association determined the direction of up- and down-regulation of both learning speed and memory retention. Thus, it provides a simple, unifying account for variations in learning speeds, where the reinforcement learning mechanism monitors and controls the motor learning process.
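The idea of "reinforcement learning of motor learning properties" can be pictured as a two-level loop, sketched below under strong simplifications (a scalar adaptation model and a gradient-free, finite-difference meta-update; this is not the authors' model, and all quantities are illustrative). The inner loop adapts to a perturbation with learning rate alpha; the outer loop adjusts alpha itself based on episode performance.

```python
def inner_episode(alpha, perturbation=1.0, n_trials=10):
    """One motor adaptation episode: update an internal estimate x of a
    constant perturbation from sensory prediction errors, and return the
    total squared error as the episode's cost."""
    x, cost = 0.0, 0.0
    for _ in range(n_trials):
        error = perturbation - x
        cost += error ** 2
        x += alpha * error          # memory update scaled by learning rate
    return cost

def meta_learn_alpha(alpha=0.1, delta=0.02, n_episodes=50):
    """Outer loop: finite-difference hill climbing on episode cost, so
    performance feedback controls the learning rate itself."""
    for _ in range(n_episodes):
        up = inner_episode(min(alpha + delta, 1.0))
        down = inner_episode(max(alpha - delta, 0.0))
        alpha = min(alpha + delta, 1.0) if up < down else max(alpha - delta, 0.0)
    return alpha

alpha_final = meta_learn_alpha()
# Starting slow (alpha = 0.1), the meta-level speeds learning up, because
# in this stationary task faster adaptation always yields lower error.
```

In a task where the perturbation flipped sign frequently, the same outer loop would instead push alpha down, which is the monitoring-and-control intuition behind the paper's account of variable learning speeds.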
Affiliation(s)
- Taisei Sugiyama
- Empowerment Informatics, University of Tsukuba, Tsukuba, Ibaraki, 305-8573, Japan
| | - Nicolas Schweighofer
- Biokinesiology and Physical Therapy, University of Southern California, Los Angeles, CA, 90089-9006, USA
| | - Jun Izawa
- Institute of Systems and Information Engineering, University of Tsukuba, Tsukuba, Ibaraki, 305-8573, Japan.
46
Ambrogioni L, Ólafsdóttir HF. Rethinking the hippocampal cognitive map as a meta-learning computational module. Trends Cogn Sci 2023:S1364-6613(23)00128-6. [PMID: 37357064 DOI: 10.1016/j.tics.2023.05.011] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2023] [Revised: 04/26/2023] [Accepted: 05/24/2023] [Indexed: 06/27/2023]
Abstract
A hallmark of biological intelligence is the ability to adaptively draw on past experience to guide behaviour under novel situations. Yet, the neurobiological principles that underlie this form of meta-learning remain relatively unexplored. In this Opinion, we review the existing literature on hippocampal spatial representations and reinforcement learning theory and describe a novel theoretical framework that aims to account for biological meta-learning. We conjecture that so-called hippocampal cognitive maps of familiar environments are part of a larger meta-representation (meta-map) that encodes information states and sources, which supports exploration and provides a foundation for learning. We also introduce concrete hypotheses on how these generic states can be encoded using a principle of superposition.
Affiliation(s)
- Luca Ambrogioni
- Donders Institute for Brain, Cognition & Behaviour, Radboud Universiteit, Nijmegen, The Netherlands.
| | - H Freyja Ólafsdóttir
- Donders Institute for Brain, Cognition & Behaviour, Radboud Universiteit, Nijmegen, The Netherlands.
47
Poli F, Ghilardi T, Mars RB, Hinne M, Hunnius S. Eight-Month-Old Infants Meta-Learn by Downweighting Irrelevant Evidence. Open Mind (Camb) 2023; 7:141-155. [PMID: 37416070 PMCID: PMC10320826 DOI: 10.1162/opmi_a_00079] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2022] [Accepted: 04/06/2023] [Indexed: 07/08/2023] Open
Abstract
Infants learn to navigate the complexity of the physical and social world at an outstanding pace, but how they accomplish this learning is still largely unknown. Recent advances in human and artificial intelligence research propose that a key feature to achieving quick and efficient learning is meta-learning, the ability to make use of prior experiences to learn how to learn better in the future. Here we show that 8-month-old infants successfully engage in meta-learning within very short timespans after being exposed to a new learning environment. We developed a Bayesian model that captures how infants attribute informativity to incoming events, and how this process is optimized by the meta-parameters of their hierarchical models over the task structure. We fitted the model to infants' gaze behavior during a learning task. Our results reveal how infants actively use past experiences to generate new inductive biases that allow future learning to proceed faster.
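The flavor of this meta-learning, prior experience reshaping the inductive biases that govern future learning, can be captured with a toy empirical-Bayes calculation; the Beta prior and observation counts below are illustrative, not the paper's model. Hyperparameters distilled from past tasks let the learner reach an accurate estimate on a new task from far fewer observations than a naive learner would need.

```python
def beta_posterior_mean(a, b, successes, failures):
    """Posterior mean of a Bernoulli rate under a Beta(a, b) prior."""
    return (a + successes) / (a + b + successes + failures)

# Past tasks all had high success rates: summarize them as a Beta prior by
# simple moment matching (mean 0.9 with pseudo-count 10 -> Beta(9, 1)).
learned_prior = (9.0, 1.0)
uniform_prior = (1.0, 1.0)     # a learner with no meta-learned bias

# New task (true rate 0.9): only two observations, both successes.
obs_s, obs_f = 2, 0
with_meta = beta_posterior_mean(*learned_prior, obs_s, obs_f)
without_meta = beta_posterior_mean(*uniform_prior, obs_s, obs_f)

true_rate = 0.9
meta_error = abs(with_meta - true_rate)      # 11/12, already near the truth
naive_error = abs(without_meta - true_rate)  # 3/4, still far off
```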
Affiliation(s)
- Francesco Poli
- Donders Center for Cognition, Radboud University Nijmegen, Nijmegen, The Netherlands
| | - Tommaso Ghilardi
- Donders Center for Cognition, Radboud University Nijmegen, Nijmegen, The Netherlands
| | - Rogier B. Mars
- Donders Center for Cognition, Radboud University Nijmegen, Nijmegen, The Netherlands
- Nuffield Department of Clinical Neurosciences, Wellcome Centre for Integrative Neuroimaging, FMRIB, University of Oxford, John Radcliffe Hospital, Headington, Oxford, UK
| | - Max Hinne
- Donders Center for Cognition, Radboud University Nijmegen, Nijmegen, The Netherlands
| | - Sabine Hunnius
- Donders Center for Cognition, Radboud University Nijmegen, Nijmegen, The Netherlands
48
Brea J, Clayton NS, Gerstner W. Computational models of episodic-like memory in food-caching birds. Nat Commun 2023; 14:2979. [PMID: 37221167 DOI: 10.1038/s41467-023-38570-x] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2022] [Accepted: 05/08/2023] [Indexed: 05/25/2023] Open
Abstract
Birds of the crow family adapt food-caching strategies to anticipated needs at the time of cache recovery and rely on memory of the what, where and when of previous caching events to recover their hidden food. It is unclear if this behavior can be explained by simple associative learning or if it relies on higher cognitive processes like mental time-travel. We present a computational model and propose a neural implementation of food-caching behavior. The model has hunger variables for motivational control, reward-modulated update of retrieval and caching policies and an associative neural network for remembering caching events with a memory consolidation mechanism for flexible decoding of the age of a memory. Our methodology of formalizing experimental protocols is transferable to other domains and facilitates model evaluation and experiment design. Here, we show that memory-augmented, associative reinforcement learning without mental time-travel is sufficient to explain the results of 28 behavioral experiments with food-caching birds.
Affiliation(s)
- Johanni Brea
- School of Computer and Communication Science, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland.
- School of Life Science, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland.
| | - Nicola S Clayton
- Department of Psychology, University of Cambridge, Cambridge, UK
| | - Wulfram Gerstner
- School of Computer and Communication Science, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
- School of Life Science, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
49
Coldren J. Conditions under which college students cease learning. Front Psychol 2023; 14:1116853. [PMID: 37151351 PMCID: PMC10157072 DOI: 10.3389/fpsyg.2023.1116853] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2022] [Accepted: 03/30/2023] [Indexed: 05/09/2023] Open
Abstract
Introduction: Effective learning involves the acquisition of information toward a goal and cessation upon reaching that goal. Whereas the process of acquisition is well understood, comparatively little is known about how or when learning ceases under naturalistic, open-ended learning conditions in which the criterion for performance is not specified. Ideally, learning should cease once there is no progress toward the goal, although this has never been directly tested in human learners. The present set of experiments explored the conditions under which college students stopped attempting to learn a series of inductive perceptual discrimination problems.
Methods: Each problem varied by whether it was solvable and whether it had a criterion for success. The first problem was solvable and involved a pre-determined criterion. The second problem was solvable but had no criterion for ending it, so that learners eventually achieved a highly accurate level of performance (overlearning). The third problem was unsolvable, as the correct answer varied randomly across features. Measures included the number of trials attempted and the outcome of each problem.
Results and Discussion: Results revealed that college students rarely ceased learning in the overlearning or unsolvable problems, even though there was no possibility of further progress. Learning cessation increased only when time demands for completion were manipulated or the opportunity for reinforcement was reduced. These results suggest that human learners make laudable, but inefficient and unproductive, attempts to master problems they should abandon.
Affiliation(s)
- Jeffrey Coldren
- Department of Psychological Sciences and Counseling, Youngstown State University, Youngstown, OH, United States
50
Goudar V, Peysakhovich B, Freedman DJ, Buffalo EA, Wang XJ. Schema formation in a neural population subspace underlies learning-to-learn in flexible sensorimotor problem-solving. Nat Neurosci 2023; 26:879-890. [PMID: 37024575 DOI: 10.1038/s41593-023-01293-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2021] [Accepted: 02/27/2023] [Indexed: 04/08/2023]
Abstract
Learning-to-learn, a progressive speedup of learning while solving a series of similar problems, represents a core process of knowledge acquisition that draws attention in both neuroscience and artificial intelligence. To investigate its underlying brain mechanism, we trained a recurrent neural network model on arbitrary sensorimotor mappings known to depend on the prefrontal cortex. The network displayed an exponential time course of accelerated learning. The neural substrate of a schema emerges within a low-dimensional subspace of population activity; its reuse in new problems facilitates learning by limiting connection weight changes. Our work highlights the weight-driven modifications of the vector field, which determines the population trajectory of a recurrent network and behavior. Such plasticity is especially important for preserving and reusing the learned schema in spite of undesirable changes of the vector field due to the transition to learning a new problem; the accumulated changes across problems account for the learning-to-learn dynamics.
Affiliation(s)
- Vishwa Goudar
- Center for Neural Science, New York University, New York, NY, USA
| | | | - David J Freedman
- Department of Neurobiology, University of Chicago, Chicago, IL, USA
| | - Elizabeth A Buffalo
- Department of Physiology and Biophysics, University of Washington School of Medicine, Seattle, WA, USA
- Washington National Primate Research Center, Seattle, WA, USA
| | - Xiao-Jing Wang
- Center for Neural Science, New York University, New York, NY, USA.