1
|
Nau M, Schmid AC, Kaplan SM, Baker CI, Kravitz DJ. Centering cognitive neuroscience on task demands and generalization. Nat Neurosci 2024:10.1038/s41593-024-01711-6. [PMID: 39075326 DOI: 10.1038/s41593-024-01711-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2023] [Accepted: 06/17/2024] [Indexed: 07/31/2024]
Abstract
Cognitive neuroscience seeks generalizable theories explaining the relationship between behavioral, physiological and mental states. In pursuit of such theories, we propose a theoretical and empirical framework that centers on understanding task demands and the mutual constraints they impose on behavior and neural activity. Task demands emerge from the interaction between an agent's sensory impressions, goals and behavior, which jointly shape the activity and structure of the nervous system on multiple spatiotemporal scales. Understanding this interaction requires multitask studies that vary more than one experimental component (for example, stimuli and instructions) combined with dense behavioral and neural sampling and explicit testing for generalization across tasks and data modalities. By centering task demands rather than mental processes that tasks are assumed to engage, this framework paves the way for the discovery of new generalizable concepts unconstrained by existing taxonomies, and moves cognitive neuroscience toward an action-oriented, dynamic and integrated view of the brain.
Collapse
Affiliation(s)
- Matthias Nau
- Laboratory of Brain and Cognition, National Institutes of Health, Bethesda, MD, USA.
| | - Alexandra C Schmid
- Laboratory of Brain and Cognition, National Institutes of Health, Bethesda, MD, USA
| | - Simon M Kaplan
- Department of Psychological & Brain Sciences, The George Washington University, Washington, DC, USA
| | - Chris I Baker
- Laboratory of Brain and Cognition, National Institutes of Health, Bethesda, MD, USA.
| | - Dwight J Kravitz
- Department of Psychological & Brain Sciences, The George Washington University, Washington, DC, USA.
- Division of Behavioral and Cognitive Sciences, Directorate for Social, Behavioral, and Economic Sciences, US National Science Foundation, Arlington, VA, USA.
| |
Collapse
|
2
|
Bentivegna F, Papachristou E, Flouri E. A scoping review on self-regulation and reward processing measured with gambling tasks: Evidence from the general youth population. PLoS One 2024; 19:e0301539. [PMID: 38574098 PMCID: PMC10994357 DOI: 10.1371/journal.pone.0301539] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2023] [Accepted: 03/17/2024] [Indexed: 04/06/2024] Open
Abstract
Aberrant reward processing and poor self-regulation have a crucial role in the development of several adverse outcomes in youth, including mental health disorders and risky behaviours. This scoping review aims to map and summarise the evidence for links between aspects and measures of reward processing and self-regulation among children and adolescents in the general population. Specifically, it examined the direct associations between self-regulation (emotional or cognitive regulation) and reward processing. Studies were included if participants were <18 years and representative of the general population. Quantitative measures were used for self-regulation, and gambling tasks were used for reward processing. Of the eighteen studies included only two were longitudinal. Overall, the direction of the significant relationships identified depended on the gambling task used and the self-regulation aspect explored. Emotional regulation was measured with self-report questionnaires only, and was the aspect with the most significant associations. Conversely, cognitive regulation was mainly assessed with cognitive assessments, and most associations with reward processing were non-significant, particularly when the cognitive regulation aspects included planning and organisational skills. Nonetheless, there was some evidence of associations with attention, cognitive control, and overall executive functioning. More longitudinal research is needed to draw accurate conclusions on the direction of the association between self-regulation and reward processing.
Collapse
Affiliation(s)
- Francesca Bentivegna
- Department of Psychology and Human Development, UCL Institute of Education, University College London, London, United Kingdom
| | - Efstathios Papachristou
- Department of Psychology and Human Development, UCL Institute of Education, University College London, London, United Kingdom
| | - Eirini Flouri
- Department of Psychology and Human Development, UCL Institute of Education, University College London, London, United Kingdom
| |
Collapse
|
3
|
Wise T, Emery K, Radulescu A. Naturalistic reinforcement learning. Trends Cogn Sci 2024; 28:144-158. [PMID: 37777463 PMCID: PMC10878983 DOI: 10.1016/j.tics.2023.08.016] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2023] [Revised: 08/23/2023] [Accepted: 08/24/2023] [Indexed: 10/02/2023]
Abstract
Humans possess a remarkable ability to make decisions within real-world environments that are expansive, complex, and multidimensional. Human cognitive computational neuroscience has sought to exploit reinforcement learning (RL) as a framework within which to explain human decision-making, often focusing on constrained, artificial experimental tasks. In this article, we review recent efforts that use naturalistic approaches to determine how humans make decisions in complex environments that better approximate the real world, providing a clearer picture of how humans navigate the challenges posed by real-world decisions. These studies purposely embed elements of naturalistic complexity within experimental paradigms, rather than focusing on simplification, generating insights into the processes that likely underpin humans' ability to navigate complex, multidimensional real-world environments so successfully.
Collapse
Affiliation(s)
- Toby Wise
- Department of Neuroimaging, King's College London, London, UK.
| | - Kara Emery
- Center for Data Science, New York University, New York, NY, USA
| | - Angela Radulescu
- Center for Computational Psychiatry, Icahn School of Medicine at Mt. Sinai, New York, NY, USA
| |
Collapse
|
4
|
Uncertainty-aware transfer across tasks using hybrid model-based successor feature reinforcement learning. Neurocomputing 2023. [DOI: 10.1016/j.neucom.2023.01.076] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/10/2023]
|
5
|
De Martino B, Cortese A. Goals, usefulness and abstraction in value-based choice. Trends Cogn Sci 2023; 27:65-80. [PMID: 36446707 DOI: 10.1016/j.tics.2022.11.001] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2022] [Revised: 10/26/2022] [Accepted: 11/01/2022] [Indexed: 11/27/2022]
Abstract
Colombian drug lord Pablo Escobar, while on the run, purportedly burned two million dollars in banknotes to keep his daughter warm. A stark reminder that, in life, circumstances and goals can quickly change, forcing us to reassess and modify our values on-the-fly. Studies in decision-making and neuroeconomics have often implicitly equated value to reward, emphasising the hedonic and automatic aspect of the value computation, while overlooking its functional (concept-like) nature. Here we outline the computational and biological principles that enable the brain to compute the usefulness of an option or action by creating abstractions that flexibly adapt to changing goals. We present different algorithmic architectures, comparing ideas from artificial intelligence (AI) and cognitive neuroscience with psychological theories and, when possible, drawing parallels.
Collapse
Affiliation(s)
- Benedetto De Martino
- Institute of Cognitive Neuroscience, University College London, London WC1N 3AZ, UK; Computational Neuroscience Laboratories, ATR Institute International, 619-0288 Kyoto, Japan.
| | - Aurelio Cortese
- Institute of Cognitive Neuroscience, University College London, London WC1N 3AZ, UK; Computational Neuroscience Laboratories, ATR Institute International, 619-0288 Kyoto, Japan.
| |
Collapse
|
6
|
Mugruza-Vassallo CA, Granados-Domínguez JL, Flores-Benites V, Córdova-Berríos L. Different Markov chains modulate visual stimuli processing in a Go-Go experiment in 2D, 3D, and augmented reality. Front Hum Neurosci 2022; 16:955534. [PMID: 36569471 PMCID: PMC9769205 DOI: 10.3389/fnhum.2022.955534] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2022] [Accepted: 10/11/2022] [Indexed: 11/23/2022] Open
Abstract
The introduction of Augmented Reality (AR) has attracted several developments, although the people's experience of AR has not been clearly studied or contrasted with the human experience in 2D and 3D environments. Here, the directional task was applied in 2D, 3D, and AR using simplified stimulus in video games to determine whether there is a difference in human answer reaction time prediction using context stimulus. Testing of the directional task adapted was also done. Research question: Are the main differences between 2D, 3D, and AR able to be predicted using Markov chains? Methods: A computer was fitted with a digital acquisition card in order to record, test and validate the reaction time (RT) of participants attached to the arranged RT for the theory of Markov chain probability. A Markov chain analysis was performed on the participants' data. Subsequently, the way certain factors influenced participants RT amongst the three tasks time on the accuracy of the participants was sought in the three tasks (environments) were statistically tested using ANOVA. Results: Markov chains of order 1 and 2 successfully reproduced the average reaction time by participants in 3D and AR tasks, having only 2D tasks with the variance predicted with the current state. Moreover, a clear explanation of delayed RT in every environment was done. Mood and coffee did not show significant differences in RTs on a simplified videogame. Gender differences were found in 3D, where endogenous directional goals are in 3D, but no gender differences appeared in AR where exogenous AR buttons can explain the larger RT that compensate for the gender difference. Our results suggest that unconscious preparation of selective choices is not restricted to current motor preparation. Instead, decisions in different environments and gender evolve from the dynamics of preceding cognitive activity can fit and improve neurocomputational models.
Collapse
Affiliation(s)
- Carlos Andrés Mugruza-Vassallo
- Escuela Profesional de Medicina Humana, Universidad Privada San Juan Bautista (UPSJB), Lima, Peru,*Correspondence: Carlos Andrés Mugruza-Vassallo
| | | | - Victor Flores-Benites
- Facultad de Ingeniería y Arquitectura, Universidad de Lima, Lima, Peru,Universidad de Ingeniería y Tecnología (UTEC), Lima, Peru
| | | |
Collapse
|
7
|
Pouncy T, Gershman SJ. Inductive biases in theory-based reinforcement learning. Cogn Psychol 2022; 138:101509. [PMID: 36152355 DOI: 10.1016/j.cogpsych.2022.101509] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2022] [Revised: 07/16/2022] [Accepted: 08/23/2022] [Indexed: 11/03/2022]
Abstract
Understanding the inductive biases that allow humans to learn in complex environments has been an important goal of cognitive science. Yet, while we have discovered much about human biases in specific learning domains, much of this research has focused on simple tasks that lack the complexity of the real world. In contrast, video games involving agents and objects embedded in richly structured systems provide an experimentally tractable proxy for real-world complexity. Recent work has suggested that key aspects of human learning in domains like video games can be captured by model-based reinforcement learning (RL) with object-oriented relational models-what we term theory-based RL. Restricting the model class in this way provides an inductive bias that dramatically increases learning efficiency, but in this paper we show that humans employ a stronger set of biases in addition to syntactic constraints on the structure of theories. In particular, we catalog a set of semantic biases that constrain the content of theories. Building these semantic biases into a theory-based RL system produces more human-like learning in video game environments.
Collapse
Affiliation(s)
- Thomas Pouncy
- Department of Psychology and Center for Brain Science, Harvard University, United States of America.
| | - Samuel J Gershman
- Department of Psychology and Center for Brain Science, Harvard University, United States of America; Center for Brains, Minds and Machines, MIT, United States of America
| |
Collapse
|
8
|
Abstract
Deciding whether to forgo a good choice in favour of exploring a potentially more rewarding alternative is one of the most challenging arbitrations both in human reasoning and in artificial intelligence. Humans show substantial variability in their exploration, and theoretical (but only limited empirical) work has suggested that excessive exploration is a critical mechanism underlying the psychiatric dimension of impulsivity. In this registered report, we put these theories to test using large online samples, dimensional analyses, and computational modelling. Capitalising on recent advances in disentangling distinct human exploration strategies, we not only demonstrate that impulsivity is associated with a specific form of exploration—value-free random exploration—but also explore links between exploration and other psychiatric dimensions. The Stage 1 protocol for this Registered Report was accepted in principle on 19/03/2021. The protocol, as accepted by the journal, can be found at 10.6084/m9.figshare.14346506.v1. Deciding between known rewarding options and exploring novel avenues is central to decision making. Humans show variability in their exploration. Here, the authors show that impulsivity is associated to an increased usage of a cognitively cheap (and sometimes sub-optimal) exploration strategy.
Collapse
|
9
|
Abstract
Vision and learning have long been considered to be two areas of research linked only distantly. However, recent developments in vision research have changed the conceptual definition of vision from a signal-evaluating process to a goal-oriented interpreting process, and this shift binds learning, together with the resulting internal representations, intimately to vision. In this review, we consider various types of learning (perceptual, statistical, and rule/abstract) associated with vision in the past decades and argue that they represent differently specialized versions of the fundamental learning process, which must be captured in its entirety when applied to complex visual processes. We show why the generalized version of statistical learning can provide the appropriate setup for such a unified treatment of learning in vision, what computational framework best accommodates this kind of statistical learning, and what plausible neural scheme could feasibly implement this framework. Finally, we list the challenges that the field of statistical learning faces in fulfilling the promise of being the right vehicle for advancing our understanding of vision in its entirety. Expected final online publication date for the Annual Review of Vision Science, Volume 8 is September 2022. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.
Collapse
Affiliation(s)
- József Fiser
- Department of Cognitive Science, Center for Cognitive Computation, Central European University, Vienna 1100, Austria;
| | - Gábor Lengyel
- Department of Brain and Cognitive Sciences, University of Rochester, Rochester, New York 14627, USA
| |
Collapse
|
10
|
Sharp PB, Russek EM, Huys QJM, Dolan RJ, Eldar E. Humans perseverate on punishment avoidance goals in multigoal reinforcement learning. eLife 2022; 11:e74402. [PMID: 35199640 PMCID: PMC8912924 DOI: 10.7554/elife.74402] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2021] [Accepted: 02/21/2022] [Indexed: 11/20/2022] Open
Abstract
Managing multiple goals is essential to adaptation, yet we are only beginning to understand computations by which we navigate the resource demands entailed in so doing. Here, we sought to elucidate how humans balance reward seeking and punishment avoidance goals, and relate this to variation in its expression within anxious individuals. To do so, we developed a novel multigoal pursuit task that includes trial-specific instructed goals to either pursue reward (without risk of punishment) or avoid punishment (without the opportunity for reward). We constructed a computational model of multigoal pursuit to quantify the degree to which participants could disengage from the pursuit goals when instructed to, as well as devote less model-based resources toward goals that were less abundant. In general, participants (n = 192) were less flexible in avoiding punishment than in pursuing reward. Thus, when instructed to pursue reward, participants often persisted in avoiding features that had previously been associated with punishment, even though at decision time these features were unambiguously benign. In a similar vein, participants showed no significant downregulation of avoidance when punishment avoidance goals were less abundant in the task. Importantly, we show preliminary evidence that individuals with chronic worry may have difficulty disengaging from punishment avoidance when instructed to seek reward. Taken together, the findings demonstrate that people avoid punishment less flexibly than they pursue reward. Future studies should test in larger samples whether a difficulty to disengage from punishment avoidance contributes to chronic worry.
Collapse
Affiliation(s)
- Paul B Sharp
- The Hebrew University of JerusalemJerusalemIsrael
- Max Planck UCL Centre for Computational Psychiatry and Ageing Research, University College LondonLondonUnited Kingdom
- Wellcome Centre for Human Neuroimaging, University College LondonLondonUnited Kingdom
| | - Evan M Russek
- Max Planck UCL Centre for Computational Psychiatry and Ageing Research, University College LondonLondonUnited Kingdom
- Wellcome Centre for Human Neuroimaging, University College LondonLondonUnited Kingdom
| | - Quentin JM Huys
- Max Planck UCL Centre for Computational Psychiatry and Ageing Research, University College LondonLondonUnited Kingdom
- Division of Psychiatry, University College LondonLondonUnited Kingdom
| | - Raymond J Dolan
- Max Planck UCL Centre for Computational Psychiatry and Ageing Research, University College LondonLondonUnited Kingdom
- Wellcome Centre for Human Neuroimaging, University College LondonLondonUnited Kingdom
| | - Eran Eldar
- The Hebrew University of JerusalemJerusalemIsrael
| |
Collapse
|
11
|
Going beyond primary motor cortex to improve brain–computer interfaces. Trends Neurosci 2022; 45:176-183. [DOI: 10.1016/j.tins.2021.12.006] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2021] [Revised: 12/01/2021] [Accepted: 12/19/2021] [Indexed: 01/08/2023]
|
12
|
Son JY, Bhandari A, FeldmanHall O. Cognitive maps of social features enable flexible inference in social networks. Proc Natl Acad Sci U S A 2021; 118:e2021699118. [PMID: 34518372 PMCID: PMC8488581 DOI: 10.1073/pnas.2021699118] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/01/2021] [Indexed: 11/18/2022] Open
Abstract
In order to navigate a complex web of relationships, an individual must learn and represent the connections between people in a social network. However, the sheer size and complexity of the social world makes it impossible to acquire firsthand knowledge of all relations within a network, suggesting that people must make inferences about unobserved relationships to fill in the gaps. Across three studies (n = 328), we show that people can encode information about social features (e.g., hobbies, clubs) and subsequently deploy this knowledge to infer the existence of unobserved friendships in the network. Using computational models, we test various feature-based mechanisms that could support such inferences. We find that people's ability to successfully generalize depends on two representational strategies: a simple but inflexible similarity heuristic that leverages homophily, and a complex but flexible cognitive map that encodes the statistical relationships between social features and friendships. Together, our studies reveal that people can build cognitive maps encoding arbitrary patterns of latent relations in many abstract feature spaces, allowing social networks to be represented in a flexible format. Moreover, these findings shed light on open questions across disciplines about how people learn and represent social networks and may have implications for generating more human-like link prediction in machine learning algorithms.
Collapse
Affiliation(s)
- Jae-Young Son
- Department of Cognitive, Linguistic, and Psychological Sciences, Brown University, Providence, RI 02912
| | - Apoorva Bhandari
- Department of Cognitive, Linguistic, and Psychological Sciences, Brown University, Providence, RI 02912
| | - Oriel FeldmanHall
- Department of Cognitive, Linguistic, and Psychological Sciences, Brown University, Providence, RI 02912;
- Carney Institute for Brain Sciences, Brown University, Providence, RI 02912
| |
Collapse
|
13
|
Nguyen Duc T, Tran CM, Tan PX, Kamioka E. Domain Adaptation for Imitation Learning Using Generative Adversarial Network. SENSORS 2021; 21:s21144718. [PMID: 34300456 PMCID: PMC8309483 DOI: 10.3390/s21144718] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/11/2021] [Revised: 07/01/2021] [Accepted: 07/07/2021] [Indexed: 11/25/2022]
Abstract
Imitation learning is an effective approach for an autonomous agent to learn control policies when an explicit reward function is unavailable, using demonstrations provided from an expert. However, standard imitation learning methods assume that the agents and the demonstrations provided by the expert are in the same domain configuration. Such an assumption has made the learned policies difficult to apply in another distinct domain. The problem is formalized as domain adaptive imitation learning, which is the process of learning how to perform a task optimally in a learner domain, given demonstrations of the task in a distinct expert domain. We address the problem by proposing a model based on Generative Adversarial Network. The model aims to learn both domain-shared and domain-specific features and utilizes it to find an optimal policy across domains. The experimental results show the effectiveness of our model in a number of tasks ranging from low to complex high-dimensional.
Collapse
Affiliation(s)
- Tho Nguyen Duc
- Graduate School of Engineering and Science, Shibaura Institute of Technology, Tokyo 135-8548, Japan; (T.N.D.); (C.M.T.); (E.K.)
| | - Chanh Minh Tran
- Graduate School of Engineering and Science, Shibaura Institute of Technology, Tokyo 135-8548, Japan; (T.N.D.); (C.M.T.); (E.K.)
| | - Phan Xuan Tan
- Department of Information and Communications Engineering, Shibaura Institute of Technology, Tokyo 135-8548, Japan
- Correspondence:
| | - Eiji Kamioka
- Graduate School of Engineering and Science, Shibaura Institute of Technology, Tokyo 135-8548, Japan; (T.N.D.); (C.M.T.); (E.K.)
| |
Collapse
|