1
|
Butz MV, Mittenbühler M, Schwöbel S, Achimova A, Gumbsch C, Otte S, Kiebel S. Contextualizing predictive minds. Neurosci Biobehav Rev 2025; 168:105948. [PMID: 39580009 DOI: 10.1016/j.neubiorev.2024.105948] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2023] [Revised: 09/13/2024] [Accepted: 11/16/2024] [Indexed: 11/25/2024]
Abstract
The structure of human memory seems to be optimized for efficient prediction, planning, and behavior. We propose that these capacities rely on a tripartite structure of memory that includes concepts, events, and contexts-three layers that constitute the mental world model. We suggest that the mechanism that critically increases adaptivity and flexibility is the tendency to contextualize. This tendency promotes local, context-encoding abstractions, which focus event- and concept-based planning and inference processes on the task and situation at hand. As a result, cognitive contextualization offers a solution to the frame problem-the need to select relevant features of the environment from the rich stream of sensorimotor signals. We draw evidence for our proposal from developmental psychology and neuroscience. Adopting a computational stance, we present evidence from cognitive modeling research which suggests that context sensitivity is a feature that is critical for maximizing the efficiency of cognitive processes. Finally, we turn to recent deep-learning architectures which independently demonstrate how context-sensitive memory can emerge in a self-organized learning system constrained by cognitively-inspired inductive biases.
Collapse
Affiliation(s)
- Martin V Butz
- Cognitive Modeling, Faculty of Science, University of Tübingen, Sand 14, Tübingen 72076, Germany.
| | - Maximilian Mittenbühler
- Cognitive Modeling, Faculty of Science, University of Tübingen, Sand 14, Tübingen 72076, Germany
| | - Sarah Schwöbel
- Cognitive Computational Neuroscience, Faculty of Psychology, TU Dresden, School of Science, Dresden 01062, Germany
| | - Asya Achimova
- Cognitive Modeling, Faculty of Science, University of Tübingen, Sand 14, Tübingen 72076, Germany
| | - Christian Gumbsch
- Cognitive Modeling, Faculty of Science, University of Tübingen, Sand 14, Tübingen 72076, Germany; Chair of Cognitive and Clinical Neuroscience, Faculty of Psychology, TU Dresden, Dresden 01069, Germany
| | - Sebastian Otte
- Cognitive Modeling, Faculty of Science, University of Tübingen, Sand 14, Tübingen 72076, Germany; Adaptive AI Lab, Institute of Robotics and Cognitive Systems, University of Lübeck, Ratzeburger Allee 160, Lübeck 23562, Germany
| | - Stefan Kiebel
- Cognitive Computational Neuroscience, Faculty of Psychology, TU Dresden, School of Science, Dresden 01062, Germany
| |
Collapse
|
2
|
Anokhin P, Sorokin A, Burtsev M, Friston K. Associative Learning and Active Inference. Neural Comput 2024; 36:2602-2635. [PMID: 39312494 DOI: 10.1162/neco_a_01711] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2023] [Accepted: 07/02/2024] [Indexed: 09/25/2024]
Abstract
Associative learning is a behavioral phenomenon in which individuals develop connections between stimuli or events based on their co-occurrence. Initially studied by Pavlov in his conditioning experiments, the fundamental principles of learning have been expanded on through the discovery of a wide range of learning phenomena. Computational models have been developed based on the concept of minimizing reward prediction errors. The Rescorla-Wagner model, in particular, is a well-known model that has greatly influenced the field of reinforcement learning. However, the simplicity of these models restricts their ability to fully explain the diverse range of behavioral phenomena associated with learning. In this study, we adopt the free energy principle, which suggests that living systems strive to minimize surprise or uncertainty under their internal models of the world. We consider the learning process as the minimization of free energy and investigate its relationship with the Rescorla-Wagner model, focusing on the informational aspects of learning, different types of surprise, and prediction errors based on beliefs and values. Furthermore, we explore how well-known behavioral phenomena such as blocking, overshadowing, and latent inhibition can be modeled within the active inference framework. We accomplish this by using the informational and novelty aspects of attention, which share similar ideas proposed by seemingly contradictory models such as Mackintosh and Pearce-Hall models. Thus, we demonstrate that the free energy principle, as a theoretical framework derived from first principles, can integrate the ideas and models of associative learning proposed based on empirical experiments and serve as a framework for a better understanding of the computational processes behind associative learning in the brain.
Collapse
Affiliation(s)
| | | | - Mikhail Burtsev
- London Institute for Mathematical Sciences, Royal Institution, London W1S 4BS, U.K.
| | - Karl Friston
- Queen Square Institute of Neurology, University College London, U.K
- VERSES AI Research Lab, Los Angeles, CA 90016, U.S.A.
| |
Collapse
|
3
|
Savoca PW, Glynn LM, Fox MM, Richards MC, Callaghan BL. Interoception in pregnancy: Implications for peripartum depression. Neurosci Biobehav Rev 2024; 166:105874. [PMID: 39243875 DOI: 10.1016/j.neubiorev.2024.105874] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2024] [Revised: 08/12/2024] [Accepted: 09/02/2024] [Indexed: 09/09/2024]
Affiliation(s)
- Paul W Savoca
- Department of Psychology, University of California, Los Angeles, USA.
| | | | - Molly M Fox
- Department of Anthropology, University of California, Los Angeles, USA; Department of Psychiatry and Biobehavioral Sciences, University of California, Los Angeles, USA
| | - Misty C Richards
- Department of Psychiatry and Biobehavioral Sciences, University of California, Los Angeles, USA; David Geffen School of Medicine, University of California, Los Angeles, CA, USA
| | | |
Collapse
|
4
|
Priorelli M, Stoianov IP. Slow but flexible or fast but rigid? Discrete and continuous processes compared. Heliyon 2024; 10:e39129. [PMID: 39497980 PMCID: PMC11532823 DOI: 10.1016/j.heliyon.2024.e39129] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2024] [Revised: 09/29/2024] [Accepted: 10/08/2024] [Indexed: 11/07/2024] Open
Abstract
A tradeoff exists when dealing with complex tasks composed of multiple steps. High-level cognitive processes can find the best sequence of actions to achieve a goal in uncertain environments, but they are slow and require significant computational demand. In contrast, lower-level processing allows reacting to environmental stimuli rapidly, but with limited capacity to determine optimal actions or to replan when expectations are not met. Through reiteration of the same task, biological organisms find the optimal tradeoff: from action primitives, composite trajectories gradually emerge by creating task-specific neural structures. The two frameworks of active inference - a recent brain paradigm that views action and perception as subject to the same free energy minimization imperative - well capture high-level and low-level processes of human behavior, but how task specialization occurs in these terms is still unclear. In this study, we compare two strategies on a dynamic pick-and-place task: a hybrid (discrete-continuous) model with planning capabilities and a continuous-only model with fixed transitions. Both models rely on a hierarchical (intrinsic and extrinsic) structure, well suited for defining reaching and grasping movements, respectively. Our results show that continuous-only models perform better and with minimal resource expenditure but at the cost of less flexibility. Finally, we propose how discrete actions might lead to continuous attractors and compare the two frameworks with different motor learning phases, laying the foundations for further studies on bio-inspired task adaptation.
Collapse
Affiliation(s)
- Matteo Priorelli
- Institute of Cognitive Sciences and Technologies, National Research Council of Italy, Padova, Italy
| | - Ivilin Peev Stoianov
- Institute of Cognitive Sciences and Technologies, National Research Council of Italy, Padova, Italy
| |
Collapse
|
5
|
Theuer JK, Koch NN, Gumbsch C, Elsner B, Butz MV. Infants infer and predict coherent event interactions: Modeling cognitive development. PLoS One 2024; 19:e0312532. [PMID: 39446862 PMCID: PMC11500850 DOI: 10.1371/journal.pone.0312532] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2024] [Accepted: 10/09/2024] [Indexed: 10/26/2024] Open
Abstract
Mental representations of the environment in infants are sparse and grow richer during their development. Anticipatory eye fixation studies show that infants aged around 7 months start to predict the goal of an observed action, e.g., an object targeted by a reaching hand. Interestingly, goal-predictive gaze shifts occur at an earlier age when the hand subsequently manipulates an object and later when an action is performed by an inanimate actor, e.g., a mechanical claw. We introduce CAPRI2 (Cognitive Action PRediction and Inference in Infants), a computational model that explains this development from a functional, algorithmic perspective. It is based on the theory that infants learn object files and events as they develop a physical reasoning system. In particular, CAPRI2 learns a generative event-predictive model, which it uses to both interpret sensory information and infer goal-directed behavior. When observing object interactions, CAPRI2 (i) interprets the unfolding interactions in terms of event-segmented dynamics, (ii) maximizes the coherence of its event interpretations, updating its internal estimates and (iii) chooses gaze behavior to minimize expected uncertainty. As a result, CAPRI2 mimics the developmental pathway of infants' goal-predictive gaze behavior. Our modeling work suggests that the involved event-predictive representations, longer-term generative model learning, and shorter-term retrospective and active inference principles constitute fundamental building blocks for the effective development of goal-predictive capacities.
Collapse
Affiliation(s)
- Johanna K. Theuer
- Neuro-Cognitive Modeling, Department of Computer Science and Department of Psychology, University of Tübingen, Tübingen, Germany
| | - Nadine N. Koch
- Neuro-Cognitive Modeling, Department of Computer Science and Department of Psychology, University of Tübingen, Tübingen, Germany
| | - Christian Gumbsch
- Neuro-Cognitive Modeling, Department of Computer Science and Department of Psychology, University of Tübingen, Tübingen, Germany
- Chair of Cognitive and Clinical Neuroscience, Faculty of Psychology, Technical University Dresden, Dresden, Germany
| | - Birgit Elsner
- Developmental Psychology, Faculty of Humanities, University of Potsdam, Potsdam, Germany
| | - Martin V. Butz
- Neuro-Cognitive Modeling, Department of Computer Science and Department of Psychology, University of Tübingen, Tübingen, Germany
| |
Collapse
|
6
|
Fisher EL, Smith R, Conn K, Corcoran AW, Milton LK, Hohwy J, Foldi CJ. Psilocybin increases optimistic engagement over time: computational modelling of behaviour in rats. Transl Psychiatry 2024; 14:394. [PMID: 39349428 PMCID: PMC11442808 DOI: 10.1038/s41398-024-03103-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/19/2024] [Revised: 09/18/2024] [Accepted: 09/19/2024] [Indexed: 10/02/2024] Open
Abstract
Psilocybin has shown promise as a novel pharmacological intervention for treatment of depression, where post-acute effects of psilocybin treatment have been associated with increased positive mood and decreased pessimism. Although psilocybin is proving to be effective in clinical trials for treatment of psychiatric disorders, the information processing mechanisms affected by psilocybin are not well understood. Here, we fit active inference and reinforcement learning computational models to a novel two-armed bandit reversal learning task capable of capturing engagement behaviour in rats. The model revealed that after receiving psilocybin, rats achieve more rewards through increased task engagement, mediated by modification of forgetting rates and reduced loss aversion. These findings suggest that psilocybin may afford an optimism bias that arises through altered belief updating, with translational potential for clinical populations characterised by lack of optimism.
Collapse
Affiliation(s)
- Elizabeth L Fisher
- Monash Centre for Consciousness and Contemplative Studies, Monash University, Melbourne, VIC, Australia.
| | - Ryan Smith
- Laureate Institute for Brain Research, University of Tulsa, Tulsa Oklahoma, OK, USA
| | - Kyna Conn
- Anorexia and Feeding Disorders Laboratory, Department of Physiology, Monash University, Melbourne, VIC, Australia
- Biomedicine Discovery Institute, Monash University, Melbourne, VIC, Australia
| | - Andrew W Corcoran
- Monash Centre for Consciousness and Contemplative Studies, Monash University, Melbourne, VIC, Australia
| | - Laura K Milton
- Anorexia and Feeding Disorders Laboratory, Department of Physiology, Monash University, Melbourne, VIC, Australia
- Biomedicine Discovery Institute, Monash University, Melbourne, VIC, Australia
| | - Jakob Hohwy
- Monash Centre for Consciousness and Contemplative Studies, Monash University, Melbourne, VIC, Australia
| | - Claire J Foldi
- Anorexia and Feeding Disorders Laboratory, Department of Physiology, Monash University, Melbourne, VIC, Australia
- Biomedicine Discovery Institute, Monash University, Melbourne, VIC, Australia
| |
Collapse
|
7
|
Bosulu J, Pezzulo G, Hétu S. Needing: An Active Inference Process for Physiological Motivation. J Cogn Neurosci 2024; 36:2011-2028. [PMID: 38940737 DOI: 10.1162/jocn_a_02209] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/29/2024]
Abstract
Need states are internal states that arise from deprivation of crucial biological stimuli. They direct motivation, independently of external learning. Despite their separate origin, they interact with reward processing systems that respond to external stimuli. This article aims to illuminate the functioning of the needing system through the lens of active inference, a framework for understanding brain and cognition. We propose that need states exert a pervasive influence on the organism, which in active inference terms translates to a "pervasive surprise"-a measure of the distance from the organism's preferred state. Crucially, we define needing as an active inference process that seeks to reduce this pervasive surprise. Through a series of simulations, we demonstrate that our proposal successfully captures key aspects of the phenomenology and neurobiology of needing. We show that as need states increase, the tendency to occupy preferred states strengthens, independently of external reward prediction. Furthermore, need states increase the precision of states (stimuli and actions) leading to preferred states, suggesting their ability to amplify the value of reward cues and rewards themselves. Collectively, our model and simulations provide valuable insights into the directional and underlying influence of need states, revealing how this influence amplifies the wanting or liking associated with relevant stimuli.
Collapse
Affiliation(s)
- Juvenal Bosulu
- Université de Montréal
- Centre interdisciplinaire de recherche sur le cerveau et l'apprentissage (CIRCA), Montréal, Québec, Canada
| | - Giovanni Pezzulo
- Institute of Cognitive Sciences and Technologies (ISTC-CNR), Rome, Italy
| | - Sébastien Hétu
- Université de Montréal
- Centre interdisciplinaire de recherche sur le cerveau et l'apprentissage (CIRCA), Montréal, Québec, Canada
| |
Collapse
|
8
|
Hamburg S, Jimenez Rodriguez A, Htet A, Di Nuovo A. Active Inference for Learning and Development in Embodied Neuromorphic Agents. ENTROPY (BASEL, SWITZERLAND) 2024; 26:582. [PMID: 39056944 PMCID: PMC11276484 DOI: 10.3390/e26070582] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/27/2024] [Revised: 06/23/2024] [Accepted: 06/27/2024] [Indexed: 07/28/2024]
Abstract
Taking inspiration from humans can help catalyse embodied AI solutions for important real-world applications. Current human-inspired tools include neuromorphic systems and the developmental approach to learning. However, this developmental neurorobotics approach is currently lacking important frameworks for human-like computation and learning. We propose that human-like computation is inherently embodied, with its interface to the world being neuromorphic, and its learning processes operating across different timescales. These constraints necessitate a unified framework: active inference, underpinned by the free energy principle (FEP). Herein, we describe theoretical and empirical support for leveraging this framework in embodied neuromorphic agents with autonomous mental development. We additionally outline current implementation approaches (including toolboxes) and challenges, and we provide suggestions for next steps to catalyse this important field.
Collapse
Affiliation(s)
- Sarah Hamburg
- Department of Computing, Sheffield Hallam University, Sheffield S1 1WB, UK; (A.J.R.); (A.H.); (A.D.N.)
| | | | | | | |
Collapse
|
9
|
Schwöbel S, Marković D, Smolka MN, Kiebel S. Joint modeling of choices and reaction times based on Bayesian contextual behavioral control. PLoS Comput Biol 2024; 20:e1012228. [PMID: 38968304 PMCID: PMC11290629 DOI: 10.1371/journal.pcbi.1012228] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2023] [Revised: 07/31/2024] [Accepted: 06/04/2024] [Indexed: 07/07/2024] Open
Abstract
In cognitive neuroscience and psychology, reaction times are an important behavioral measure. However, in instrumental learning and goal-directed decision making experiments, findings often rely only on choice probabilities from a value-based model, instead of reaction times. Recent advancements have shown that it is possible to connect value-based decision models with reaction time models. However, typically these models do not provide an integrated account of both value-based choices and reaction times, but simply link two types of models. Here, we propose a novel integrative joint model of both choices and reaction times by combining a computational account of Bayesian sequential decision making with a sampling procedure. This allows us to describe how internal uncertainty in the planning process shapes reaction time distributions. Specifically, we use a recent context-specific Bayesian forward planning model which we extend by a Markov chain Monte Carlo (MCMC) sampler to obtain both choices and reaction times. As we will show this makes the sampler an integral part of the decision making process and enables us to reproduce, using simulations, well-known experimental findings in value based-decision making as well as classical inhibition and switching tasks. Specifically, we use the proposed model to explain both choice behavior and reaction times in instrumental learning and automatized behavior, in the Eriksen flanker task and in task switching. These findings show that the proposed joint behavioral model may describe common underlying processes in these different decision making paradigms.
Collapse
Affiliation(s)
- Sarah Schwöbel
- Department of Psychology, Technische Universität Dresden, Dresden, Germany
| | - Dimitrije Marković
- Department of Psychology, Technische Universität Dresden, Dresden, Germany
| | - Michael N. Smolka
- Department of Psychiatry and Psychotherapy, Technische Universität Dresden, Dresden, Germany
| | - Stefan Kiebel
- Department of Psychology, Technische Universität Dresden, Dresden, Germany
- Centre for Tactile Internet with Human-in-the-Loop (CeTI), Technische Universität Dresden, Dresden, Germany
| |
Collapse
|
10
|
Paul A, Isomura T, Razi A. On Predictive Planning and Counterfactual Learning in Active Inference. ENTROPY (BASEL, SWITZERLAND) 2024; 26:484. [PMID: 38920492 PMCID: PMC11202763 DOI: 10.3390/e26060484] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/23/2024] [Revised: 05/27/2024] [Accepted: 05/28/2024] [Indexed: 06/27/2024]
Abstract
Given the rapid advancement of artificial intelligence, understanding the foundations of intelligent behaviour is increasingly important. Active inference, regarded as a general theory of behaviour, offers a principled approach to probing the basis of sophistication in planning and decision-making. This paper examines two decision-making schemes in active inference based on "planning" and "learning from experience". Furthermore, we also introduce a mixed model that navigates the data complexity trade-off between these strategies, leveraging the strengths of both to facilitate balanced decision-making. We evaluate our proposed model in a challenging grid-world scenario that requires adaptability from the agent. Additionally, our model provides the opportunity to analyse the evolution of various parameters, offering valuable insights and contributing to an explainable framework for intelligent decision-making.
Collapse
Affiliation(s)
- Aswin Paul
- Turner Institute for Brain and Mental Health, School of Psychological Sciences, Monash University, Clayton 3800, Australia;
- IITB-Monash Research Academy, Mumbai 400076, India
- Department of Electrical Engineering, IIT Bombay, Mumbai 400076 , India
| | - Takuya Isomura
- Brain Intelligence Theory Unit, RIKEN Center for Brain Science, Wako, Saitama 351-0106, Japan;
| | - Adeel Razi
- Turner Institute for Brain and Mental Health, School of Psychological Sciences, Monash University, Clayton 3800, Australia;
- Wellcome Trust Centre for Human Neuroimaging, University College London, London WC1N 3AR, UK
- CIFAR Azrieli Global Scholars Program, CIFAR, Toronto, ON M5G 1M1, Canada
| |
Collapse
|
11
|
Poli F, O'Reilly JX, Mars RB, Hunnius S. Curiosity and the dynamics of optimal exploration. Trends Cogn Sci 2024; 28:441-453. [PMID: 38413257 DOI: 10.1016/j.tics.2024.02.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2023] [Revised: 02/01/2024] [Accepted: 02/01/2024] [Indexed: 02/29/2024]
Abstract
What drives our curiosity remains an elusive and hotly debated issue, with multiple hypotheses proposed but a cohesive account yet to be established. This review discusses traditional and emergent theories that frame curiosity as a desire to know and a drive to learn, respectively. We adopt a model-based approach that maps the temporal dynamics of various factors underlying curiosity-based exploration, such as uncertainty, information gain, and learning progress. In so doing, we identify the limitations of past theories and posit an integrated account that harnesses their strengths in describing curiosity as a tool for optimal environmental exploration. In our unified account, curiosity serves as a 'common currency' for exploration, which must be balanced with other drives such as safety and hunger to achieve efficient action.
Collapse
Affiliation(s)
- Francesco Poli
- Donders Institute for Brain, Cognition, and Behaviour, Radboud University, Nijmegen, The Netherlands.
| | - Jill X O'Reilly
- Wellcome Centre for Integrative Neuroimaging, Centre for Functional MRI of the Brain (FMRIB), Nuffield Department of Clinical Neurosciences, John Radcliffe Hospital, University of Oxford, Oxford, UK
| | - Rogier B Mars
- Donders Institute for Brain, Cognition, and Behaviour, Radboud University, Nijmegen, The Netherlands; Wellcome Centre for Integrative Neuroimaging, Centre for Functional MRI of the Brain (FMRIB), Nuffield Department of Clinical Neurosciences, John Radcliffe Hospital, University of Oxford, Oxford, UK
| | - Sabine Hunnius
- Donders Institute for Brain, Cognition, and Behaviour, Radboud University, Nijmegen, The Netherlands
| |
Collapse
|
12
|
Boban L, Boulic R, Herbelin B. In Case of Doubt, One Follows One's Self: The Implicit Guidance of the Embodied Self-Avatar. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS 2024; 30:2109-2118. [PMID: 38437112 DOI: 10.1109/tvcg.2024.3372042] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/06/2024]
Abstract
The sense of embodiment in virtual reality (VR) is commonly understood as the subjective experience that one's physical body is substituted by a virtual counterpart, and is typically achieved when the avatar's body, seen from a first-person view, moves like one's physical body. Embodiment can also be experienced in other circumstances (e.g., in third-person view) or with imprecise or distorted visuo-motor coupling. It was moreover observed, in various cases of small or progressive temporal and spatial manipulations of avatars' movements, that participants may spontaneously follow the movement shown by the avatar. The present work investigates whether, in some specific contexts, participants would follow what their avatar does even when large movement discrepancies occur, thereby extending the scope of understanding of the self-avatar follower effect beyond subtle changes of motion or speed manipulations. We conducted an experimental study in which we introduced uncertainty about which movement to perform at specific times and analyzed participants' movements and subjective feedback after their avatar showed them an incorrect movement. Results show that, when in doubt, participants were influenced by their avatar's movements, leading them to perform that particular error twice more often than normal. Importantly, results of the embodiment score indicate that participants experienced a dissociation with their avatar at those times. Overall, these observations not only demonstrate the possibility of provoking situations in which participants follow the guidance of their avatar for large motor distortions, despite their awareness about the avatar movement disruption and on the possible influence it had on their choice, and, importantly, exemplify how the cognitive mechanism of embodiment is deeply rooted in the necessity of having a body.
Collapse
|
13
|
Zhang Z, Xu F. An Overview of the Free Energy Principle and Related Research. Neural Comput 2024; 36:963-1021. [PMID: 38457757 DOI: 10.1162/neco_a_01642] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2023] [Accepted: 11/20/2023] [Indexed: 03/10/2024]
Abstract
The free energy principle and its corollary, the active inference framework, serve as theoretical foundations in the domain of neuroscience, explaining the genesis of intelligent behavior. This principle states that the processes of perception, learning, and decision making-within an agent-are all driven by the objective of "minimizing free energy," evincing the following behaviors: learning and employing a generative model of the environment to interpret observations, thereby achieving perception, and selecting actions to maintain a stable preferred state and minimize the uncertainty about the environment, thereby achieving decision making. This fundamental principle can be used to explain how the brain processes perceptual information, learns about the environment, and selects actions. Two pivotal tenets are that the agent employs a generative model for perception and planning and that interaction with the world (and other agents) enhances the performance of the generative model and augments perception. With the evolution of control theory and deep learning tools, agents based on the FEP have been instantiated in various ways across different domains, guiding the design of a multitude of generative models and decision-making algorithms. This letter first introduces the basic concepts of the FEP, followed by its historical development and connections with other theories of intelligence, and then delves into the specific application of the FEP to perception and decision making, encompassing both low-dimensional simple situations and high-dimensional complex situations. It compares the FEP with model-based reinforcement learning to show that the FEP provides a better objective function. We illustrate this using numerical studies of Dreamer3 by adding expected information gain into the standard objective function. In a complementary fashion, existing reinforcement learning, and deep learning algorithms can also help implement the FEP-based agents. Finally, we discuss the various capabilities that agents need to possess in complex environments and state that the FEP can aid agents in acquiring these capabilities.
Collapse
Affiliation(s)
- Zhengquan Zhang
- Key Laboratory of Information Science of Electromagnetic Waves, Fudan University, Shanghai, P.R.C.
| | - Feng Xu
- Key Laboratory of Information Science of Electromagnetic Waves, Fudan University, Shanghai, P.R.C.
| |
Collapse
|
14
|
Schiller D, Yu ANC, Alia-Klein N, Becker S, Cromwell HC, Dolcos F, Eslinger PJ, Frewen P, Kemp AH, Pace-Schott EF, Raber J, Silton RL, Stefanova E, Williams JHG, Abe N, Aghajani M, Albrecht F, Alexander R, Anders S, Aragón OR, Arias JA, Arzy S, Aue T, Baez S, Balconi M, Ballarini T, Bannister S, Banta MC, Barrett KC, Belzung C, Bensafi M, Booij L, Bookwala J, Boulanger-Bertolus J, Boutros SW, Bräscher AK, Bruno A, Busatto G, Bylsma LM, Caldwell-Harris C, Chan RCK, Cherbuin N, Chiarella J, Cipresso P, Critchley H, Croote DE, Demaree HA, Denson TF, Depue B, Derntl B, Dickson JM, Dolcos S, Drach-Zahavy A, Dubljević O, Eerola T, Ellingsen DM, Fairfield B, Ferdenzi C, Friedman BH, Fu CHY, Gatt JM, de Gelder B, Gendolla GHE, Gilam G, Goldblatt H, Gooding AEK, Gosseries O, Hamm AO, Hanson JL, Hendler T, Herbert C, Hofmann SG, Ibanez A, Joffily M, Jovanovic T, Kahrilas IJ, Kangas M, Katsumi Y, Kensinger E, Kirby LAJ, Koncz R, Koster EHW, Kozlowska K, Krach S, Kret ME, Krippl M, Kusi-Mensah K, Ladouceur CD, Laureys S, Lawrence A, Li CSR, Liddell BJ, Lidhar NK, Lowry CA, Magee K, Marin MF, Mariotti V, Martin LJ, Marusak HA, Mayer AV, Merner AR, Minnier J, Moll J, Morrison RG, Moore M, Mouly AM, Mueller SC, Mühlberger A, Murphy NA, Muscatello MRA, Musser ED, Newton TL, Noll-Hussong M, Norrholm SD, Northoff G, Nusslock R, Okon-Singer H, Olino TM, Ortner C, Owolabi M, Padulo C, Palermo R, Palumbo R, Palumbo S, Papadelis C, Pegna AJ, Pellegrini S, Peltonen K, Penninx BWJH, Pietrini P, Pinna G, Lobo RP, Polnaszek KL, Polyakova M, Rabinak C, Helene Richter S, Richter T, Riva G, Rizzo A, Robinson JL, Rosa P, Sachdev PS, Sato W, Schroeter ML, Schweizer S, Shiban Y, Siddharthan A, Siedlecka E, Smith RC, Soreq H, Spangler DP, Stern ER, Styliadis C, Sullivan GB, Swain JE, Urben S, Van den Stock J, Vander Kooij MA, van Overveld M, Van Rheenen TE, VanElzakker MB, Ventura-Bort C, Verona E, Volk T, Wang Y, Weingast LT, Weymar M, Williams C, Willis ML, Yamashita P, Zahn R, Zupan B, Lowe L. The Human Affectome. Neurosci Biobehav Rev 2024; 158:105450. [PMID: 37925091 PMCID: PMC11003721 DOI: 10.1016/j.neubiorev.2023.105450] [Citation(s) in RCA: 14] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2022] [Revised: 10/26/2023] [Accepted: 10/27/2023] [Indexed: 11/06/2023]
Abstract
Over the last decades, theoretical perspectives in the interdisciplinary field of the affective sciences have proliferated rather than converged due to differing assumptions about what human affective phenomena are and how they work. These metaphysical and mechanistic assumptions, shaped by academic context and values, have dictated affective constructs and operationalizations. However, an assumption about the purpose of affective phenomena can guide us to a common set of metaphysical and mechanistic assumptions. In this capstone paper, we home in on a nested teleological principle for human affective phenomena in order to synthesize metaphysical and mechanistic assumptions. Under this framework, human affective phenomena can collectively be considered algorithms that either adjust based on the human comfort zone (affective concerns) or monitor those adaptive processes (affective features). This teleologically-grounded framework offers a principled agenda and launchpad for both organizing existing perspectives and generating new ones. Ultimately, we hope the Human Affectome brings us a step closer to not only an integrated understanding of human affective phenomena, but an integrated field for affective research.
Collapse
Affiliation(s)
- Daniela Schiller
- Department of Psychiatry, the Nash Family Department of Neuroscience, and the Friedman Brain Institute, at the Icahn School of Medicine at Mount Sinai, New York, NY, United States.
| | - Alessandra N C Yu
- Nash Family Department of Neuroscience, Icahn School of Medicine at Mount Sinai, New York, NY 10029, United States.
| | - Nelly Alia-Klein
- Department of Psychiatry and Neuroscience, Icahn School of Medicine at Mount Sinai, New York, NY, United States
| | - Susanne Becker
- Department of Cognitive and Clinical Neuroscience, Central Institute of Mental Health, Medical Faculty Mannheim, Heidelberg University, J5, 68159 Mannheim, Germany; Integrative Spinal Research Group, Department of Chiropractic Medicine, University Hospital Balgrist, University of Zurich, Balgrist Campus, Lengghalde 5, 8008 Zurich, Switzerland
| | - Howard C Cromwell
- J.P. Scott Center for Neuroscience, Mind and Behavior, Department of Psychology, Bowling Green State University, Bowling Green, OH 43403, United States
| | - Florin Dolcos
- Beckman Institute for Advanced Science & Technology, University of Illinois at Urbana-Champaign, Urbana, IL, United States; Department of Psychology, University of Illinois at Urbana-Champaign, Champaign, IL, United States
| | - Paul J Eslinger
- Departments of Neurology, Neural & Behavioral Science, Radiology, and Public Health Sciences, Penn State Hershey Medical Center and College of Medicine, Hershey, PA, United States
| | - Paul Frewen
- Departments of Psychiatry, Psychology and Neuroscience at the University of Western Ontario, London, Ontario, Canada
| | - Andrew H Kemp
- School of Psychology, Faculty of Medicine, Health & Life Science, Swansea University, Swansea, United Kingdom
| | - Edward F Pace-Schott
- Harvard Medical School and Massachusetts General Hospital, Department of Psychiatry, Boston, MA, United States; Department of Psychiatry, Massachusetts General Hospital, Boston, MA, United States
| | - Jacob Raber
- Department of Behavioral Neuroscience, Oregon Health & Science University, Portland, OR 97239, United States; Departments of Neurology, Radiation Medicine, Psychiatry, and Division of Neuroscience, ONPRC, Oregon Health & Science University, Portland, OR, United States
| | - Rebecca L Silton
- Department of Psychology, Loyola University Chicago, Chicago, IL, United States
| | - Elka Stefanova
- Faculty of Medicine, University of Belgrade, Serbia; Neurology Clinic, Clinical Center of Serbia, Serbia
| | - Justin H G Williams
- Griffith University, Gold Coast Campus, 1 Parklands Dr, Southport, QLD 4215, Australia
| | - Nobuhito Abe
- Institute for the Future of Human Society, Kyoto University, 46 Shimoadachi-cho, Yoshida Sakyo-ku, Kyoto, Japan
| | - Moji Aghajani
- Institute of Education & Child Studies, Section Forensic Family & Youth Care, Leiden University, the Netherlands; Department of Psychiatry, Amsterdam UMC, Location VUMC, GGZ InGeest Research & Innovation, Amsterdam Neuroscience, the Netherlands
| | - Franziska Albrecht
- Max Planck Institute for Human Cognitive and Brain Sciences, Leipzig, Germany; Clinic for Cognitive Neurology, University Hospital Leipzig, Leipzig, Germany; Division of Physiotherapy, Department of Neurobiology, Care Sciences and Society, Karolinska Institutet, Stockholm, Sweden; Karolinska University Hospital, Women's Health and Allied Health Professionals Theme, Medical unit Occupational Therapy & Physiotherapy, Stockholm, Sweden
| | - Rebecca Alexander
- Neuroscience Research Australia, Randwick, Sydney, NSW, Australia; Australian National University, Canberra, ACT, Australia
| | - Silke Anders
- Department of Neurology, University of Lübeck, Lübeck, Germany; Center of Brain, Behavior and Metabolism, University of Lübeck, Lübeck, Germany
| | - Oriana R Aragón
- Yale University, 2 Hillhouse Ave, New Haven, CT, United States; Cincinnati University, Marketing Department, 2906 Woodside Drive, Cincinnati, OH 45221-0145, United States
| | - Juan A Arias
- School of Psychology, Faculty of Medicine, Health & Life Science, Swansea University, Swansea, United Kingdom; Department of Statistics, Mathematical Analysis, and Operational Research, Universidade de Santiago de Compostela, Spain; The Galician Center for Mathematical Research and Technology (CITMAga), 15782 Santiago de Compostela, Spain
| | - Shahar Arzy
- Department of Medical Neurobiology, Hebrew University, Jerusalem, Israel
| | - Tatjana Aue
- Institute of Psychology, University of Bern, Fabrikstr. 8, 3012 Bern, Switzerland
| | | | - Michela Balconi
- International Research Center for Cognitive Applied Neuroscience, Catholic University of Milan, Milan, Italy
| | - Tommaso Ballarini
- Max Planck Institute for Human Cognitive and Brain Sciences, Leipzig, Germany
| | - Scott Bannister
- Durham University, Palace Green, DH1 RL3 Durham, United Kingdom
| | | | - Karen Caplovitz Barrett
- Department of Human Development & Family Studies, Colorado State University, Fort Collins, CO, United States; Department of Community & Behavioral Health, Colorado School of Public Health, Denver, CO, United States
| | | | - Moustafa Bensafi
- Research Center in Neurosciences of Lyon, CNRS UMR5292, INSERM U1028, Claude Bernard University Lyon 1, Lyon, Centre Hospitalier Le Vinatier, 95 bd Pinel, 69675 Bron Cedex, France
| | - Linda Booij
- Department of Psychology, Concordia University, Montreal, Canada; CHU Sainte-Justine, University of Montreal, Montreal, Canada
| | - Jamila Bookwala
- Department of Psychology, Lafayette College, Easton, PA, United States
| | - Julie Boulanger-Bertolus
- Department of Anesthesiology and Center for Consciousness Science, University of Michigan, Ann Arbor, MI, United States
| | - Sydney Weber Boutros
- Department of Behavioral Neuroscience, Oregon Health & Science University, Portland, OR 97239, United States
| | - Anne-Kathrin Bräscher
- Department of Clinical Psychology, Psychotherapy and Experimental Psychopathology, University of Mainz, Wallstr. 3, 55122 Mainz, Germany; Nathan Kline Institute for Psychiatric Research, Orangeburg, NY, United States
| | - Antonio Bruno
- Department of Biomedical, Dental Sciences and Morpho-Functional Imaging - University of Messina, Italy
| | - Geraldo Busatto
- Laboratory of Psychiatric Neuroimaging (LIM-21), Departamento e Instituto de Psiquiatria, Hospital das Clinicas HCFMUSP, Faculdade de Medicina, Universidade de Sao Paulo, Sao Paulo, Brazil
| | - Lauren M Bylsma
- Departments of Psychiatry and Psychology; and the Center for Neural Basis of Cognition, University of Pittsburgh School of Medicine, Pittsburgh, PA, United States
| | | | - Raymond C K Chan
- Neuropsychology and Applied Cognitive Neuroscience Laboratory, CAS Key Laboratory of Mental Health, Institute of Psychology, Chinese Academy of Sciences, Beijing, China
| | - Nicolas Cherbuin
- Centre for Research on Ageing, Health, and Wellbeing, Australian National University, Canberra, ACT, Australia
| | - Julian Chiarella
- Department of Psychology, Concordia University, Montreal, Canada; CHU Sainte-Justine, University of Montreal, Montreal, Canada
| | - Pietro Cipresso
- Applied Technology for Neuro-Psychology Lab., Istituto Auxologico Italiano (IRCCS), Milan, Italy; Department of Psychology, University of Turin, Turin, Italy
| | - Hugo Critchley
- Psychiatry, Department of Neuroscience, Brighton and Sussex Medical School (BSMS), University of Sussex, Sussex, United Kingdom
| | - Denise E Croote
- Departments of Psychiatry and Neuroscience, Icahn School of Medicine at Mount Sinai and Friedman Brain Institute, New York, NY 10029, United States; Hospital Universitário Gaffrée e Guinle, Universidade do Rio de Janeiro, Brazil
| | - Heath A Demaree
- Department of Psychological Sciences, Case Western Reserve University, Cleveland, OH, United States
| | - Thomas F Denson
- School of Psychology, University of New South Wales, Sydney, NSW, Australia
| | - Brendan Depue
- Departments of Psychological and Brain Sciences and Anatomical Sciences and Neurobiology, University of Louisville, Louisville, KY, United States
| | - Birgit Derntl
- Department of Psychiatry and Psychotherapy, Tübingen Center for Mental Health, University of Tübingen, Tübingen, Germany
| | - Joanne M Dickson
- Edith Cowan University, Psychology Discipline, School of Arts and Humanities, 270 Joondalup Dr, Joondalup, WA 6027, Australia
| | - Sanda Dolcos
- Beckman Institute for Advanced Science & Technology, University of Illinois at Urbana-Champaign, Urbana, IL, United States; Department of Psychology, University of Illinois at Urbana-Champaign, Champaign, IL, United States
| | - Anat Drach-Zahavy
- The Faculty of Health and Welfare Sciences, University of Haifa, Haifa, Israel
| | - Olga Dubljević
- Neurology Clinic, Clinical Center of Serbia, Serbia; Institute for Biological Research "Siniša Stanković", National Institute of Republic of Serbia, Belgrade, Serbia
| | - Tuomas Eerola
- Durham University, Palace Green, DH1 RL3 Durham, United Kingdom
| | - Dan-Mikael Ellingsen
- Department of Diagnostic Physics, Division of Radiology and Nuclear Medicine, Oslo University Hospital, Oslo, Norway
| | - Beth Fairfield
- Department of Humanistic Studies, University of Naples Federico II, Naples, Italy; UniCamillus, International Medical University, Rome, Italy
| | - Camille Ferdenzi
- Research Center in Neurosciences of Lyon, CNRS UMR5292, INSERM U1028, Claude Bernard University Lyon 1, Lyon, Centre Hospitalier Le Vinatier, 95 bd Pinel, 69675 Bron Cedex, France
| | - Bruce H Friedman
- Department of Psychology, Virginia Tech, Blacksburg, VA, United States
| | - Cynthia H Y Fu
- School of Psychology, University of East London, United Kingdom; Centre for Affective Disorders, Institute of Psychiatry, Psychology and Neuroscience, King's College London, United Kingdom
| | - Justine M Gatt
- Neuroscience Research Australia, Randwick, Sydney, NSW, Australia; School of Psychology, University of New South Wales, Randwick, Sydney, NSW, Australia
| | - Beatrice de Gelder
- Department of Psychology and Neuroscience, Maastricht University, Maastricht, the Netherlands
| | - Guido H E Gendolla
- Geneva Motivation Lab, University of Geneva, FPSE, Section of Psychology, CH-1211 Geneva 4, Switzerland
| | - Gadi Gilam
- The Institute of Biomedical and Oral Research, Faculty of Dental Medicine, Hebrew University of Jerusalem, Jerusalem, Israel; Systems Neuroscience and Pain Laboratory, Stanford University School of Medicine, CA, United States
| | - Hadass Goldblatt
- Department of Nursing, Faculty of Social Welfare & Health Sciences, University of Haifa, Haifa, Israel
| | | | - Olivia Gosseries
- Coma Science Group, GIGA Consciousness & Centre du Cerveau2, University and University Hospital of Liege, Liege, Belgium
| | - Alfons O Hamm
- Department of Biological and Clinical Psychology/Psychotherapy, University of Greifswald, Greifswald, Germany
| | - Jamie L Hanson
- Department of Psychology, University of Pittsburgh, Pittsburgh, PA 15206, United States
| | - Talma Hendler
- Tel Aviv Center for Brain Function, Wohl Institute for Advanced Imaging, Tel Aviv Sourasky Medical Center, Tel Aviv, Israel; School of Psychological Sciences, Tel-Aviv University, Tel Aviv, Israel
| | - Cornelia Herbert
- Department of Applied Emotion and Motivation Psychology, Institute of Psychology and Education, Ulm University, Ulm, Germany
| | - Stefan G Hofmann
- Department of Clinical Psychology, Philipps University Marburg, Germany
| | - Agustin Ibanez
- Universidad de San Andres, Buenos Aires, Argentina; National Scientific and Technical Research Council (CONICET), Buenos Aires, Argentina; Latin American Brain Health Institute (BrainLat), Universidad Adolfo Ibáñez, Santiago, Chile; Global Brain Health Institute (GBHI), University of California San Francisco (UCSF), United States and Trinity Collegue Dublin (TCD), Ireland
| | - Mateus Joffily
- Groupe d'Analyse et de Théorie Economique (GATE), 93 Chemin des Mouilles, 69130 Écully, France
| | - Tanja Jovanovic
- Department of Psychiatry and Behavaioral Neurosciences, Wayne State University, Detroit, MI, United States
| | - Ian J Kahrilas
- Department of Psychology, Loyola University Chicago, Chicago, IL, United States
| | - Maria Kangas
- Department of Psychology, Macquarie University, Sydney, Australia
| | - Yuta Katsumi
- Department of Psychology, University of Illinois at Urbana-Champaign, Champaign, IL, United States; Department of Neurology, Massachusetts General Hospital and Harvard Medical School, Charlestown, MA, USA
| | - Elizabeth Kensinger
- Department of Psychology and Neuroscience, Boston College, Boston, MA, United States
| | - Lauren A J Kirby
- Department of Psychology and Counseling, University of Texas at Tyler, Tyler, TX, United States
| | - Rebecca Koncz
- Centre for Healthy Brain Ageing, Discipline of Psychiatry and Mental Health, University of New South Wales, Sydney, Australia; Specialty of Psychiatry, The University of Sydney, Concord, New South Wales, Australia
| | - Ernst H W Koster
- Department of Experimental Clinical and Health Psychology, Ghent University, Ghent, Belgium
| | | | - Sören Krach
- Social Neuroscience Lab, Translational Psychiatry Unit, University of Lübeck, Lübeck, Germany
| | - Mariska E Kret
- Leiden University, Cognitive Psychology, Pieter de la Court, Waassenaarseweg 52, Leiden 2333 AK, the Netherlands
| | - Martin Krippl
- Faculty of Natural Sciences, Department of Psychology, Otto von Guericke University Magdeburg, Universitätsplatz 2, Magdeburg, Germany
| | - Kwabena Kusi-Mensah
- Department of Psychiatry, Komfo Anokye Teaching Hospital, P. O. Box 1934, Kumasi, Ghana; Department of Psychiatry, University of Cambridge, Darwin College, Silver Street, CB3 9EU Cambridge, United Kingdom; Behavioural Sciences Department, School of Medicine and Dentistry, Kwame Nkrumah University of Science and Technology, Kumasi, Ghana
| | - Cecile D Ladouceur
- Departments of Psychiatry and Psychology and the Center for Neural Basis of Cognition (CNBC), University of Pittsburgh, Pittsburgh, PA, United States
| | - Steven Laureys
- Coma Science Group, GIGA Consciousness & Centre du Cerveau2, University and University Hospital of Liege, Liege, Belgium
| | - Alistair Lawrence
- Scotland's Rural College, King's Buildings, Edinburgh, Scotland; The Roslin Institute, University of Edinburgh, Easter Bush, Scotland
| | - Chiang-Shan R Li
- Connecticut Mental Health Centre, Yale University, New Haven, CT, United States
| | - Belinda J Liddell
- School of Psychology, University of New South Wales, Randwick, Sydney, NSW, Australia
| | - Navdeep K Lidhar
- Department of Psychology, University of Toronto Mississauga, Mississauga, ON, Canada
| | - Christopher A Lowry
- Department of Integrative Physiology and Center for Neuroscience, University of Colorado Boulder, Boulder, CO, United States
| | - Kelsey Magee
- Department of Psychological Sciences, Case Western Reserve University, Cleveland, OH, United States
| | - Marie-France Marin
- Department of Psychology, Université du Québec à Montréal, Montreal, Canada; Research Center, Institut universitaire en santé mentale de Montréal, Montreal, Canada
| | - Veronica Mariotti
- Department of Clinical and Experimental Medicine, University of Pisa, Pisa, Italy
| | - Loren J Martin
- Department of Psychology, University of Toronto Mississauga, Mississauga, ON, Canada
| | - Hilary A Marusak
- Department of Psychiatry and Behavaioral Neurosciences, Wayne State University, Detroit, MI, United States; Merrill Palmer Skillman Institute for Child and Family Development, Wayne State University, Detroit, MI, United States
| | - Annalina V Mayer
- Social Neuroscience Lab, Translational Psychiatry Unit, University of Lübeck, Lübeck, Germany
| | - Amanda R Merner
- Department of Psychological Sciences, Case Western Reserve University, Cleveland, OH, United States
| | - Jessica Minnier
- School of Public Health, Oregon Health & Science University, Portland, OR, United States
| | - Jorge Moll
- Cognitive Neuroscience and Neuroinformatics Unit, D'Or Institute for Research and Education, Rio de Janeiro, Brazil
| | - Robert G Morrison
- Department of Psychology, Loyola University Chicago, Chicago, IL, United States
| | - Matthew Moore
- Beckman Institute for Advanced Science & Technology, University of Illinois at Urbana-Champaign, Urbana, IL, United States; Department of Psychology, University of Illinois at Urbana-Champaign, Champaign, IL, United States; War Related Illness and Injury Study Center (WRIISC), Veterans Affairs Palo Alto Health Care System, Palo Alto, CA, United States
| | - Anne-Marie Mouly
- Lyon Neuroscience Research Center, CNRS-UMR 5292, INSERM U1028, Universite Lyon, Lyon, France
| | - Sven C Mueller
- Department of Experimental Clinical and Health Psychology, Ghent University, Ghent, Belgium
| | - Andreas Mühlberger
- Department of Psychology (Clinical Psychology and Psychotherapy), University of Regensburg, Regensburg, Germany
| | - Nora A Murphy
- Department of Psychology, Loyola Marymount University, Los Angeles, CA, United States
| | | | - Erica D Musser
- Center for Children and Families, Department of Psychology, Florida International University, Miami, FL, United States
| | - Tamara L Newton
- Department of Psychological and Brain Sciences, University of Louisville, Louisville, KY, United States
| | - Michael Noll-Hussong
- Psychosomatic Medicine and Psychotherapy, TU Muenchen, Langerstrasse 3, D-81675 Muenchen, Germany
| | - Seth Davin Norrholm
- Department of Psychiatry and Behavaioral Neurosciences, Wayne State University, Detroit, MI, United States
| | - Georg Northoff
- Mind, Brain Imaging and Neuroethics Research Unit, University of Ottawa Institute of Mental Health Research, Royal Ottawa Mental Health Centre, Canada
| | - Robin Nusslock
- Department of Psychology and Institute for Policy Research, Northwestern University, 2029 Sheridan Road, Evanston, IL, United States
| | - Hadas Okon-Singer
- School of Psychological Sciences, University of Haifa, Haifa, Israel
| | - Thomas M Olino
- Department of Psychology, Temple University, 1701N. 13th St, Philadelphia, PA, United States
| | - Catherine Ortner
- Thompson Rivers University, Department of Psychology, 805 TRU Way, Kamloops, BC, Canada
| | - Mayowa Owolabi
- Department of Medicine and Center for Genomic and Precision Medicine, College of Medicine, University of Ibadan; University College Hospital, Ibadan, Oyo State, Nigeria; Blossom Specialist Medical Center Ibadan, Oyo State, Nigeria
| | - Caterina Padulo
- Department of Psychological, Health and Territorial Sciences, University of Chieti, Chieti, Italy
| | - Romina Palermo
- School of Psychological Science, University of Western Australia, Perth, WA, Australia
| | - Rocco Palumbo
- Department of Psychological, Health and Territorial Sciences, University of Chieti, Chieti, Italy
| | - Sara Palumbo
- Department of Surgical, Medical and Molecular Pathology and of Critical Care, University of Pisa, Pisa, Italy
| | - Christos Papadelis
- Jane and John Justin Neuroscience Center, Cook Children's Health Care System, Fort Worth, TX, United States; Department of Bioengineering, University of Texas at Arlington, Arlington, TX, United States
| | - Alan J Pegna
- School of Psychology, University of Queensland, Saint Lucia, Queensland, Australia
| | - Silvia Pellegrini
- Department of Clinical and Experimental Medicine, University of Pisa, Pisa, Italy
| | - Kirsi Peltonen
- Research Centre for Child Psychiatry, University of Turku, Turku, Finland; INVEST Research Flagship, University of Turku, Turku, Finland
| | - Brenda W J H Penninx
- Department of Psychiatry, Amsterdam UMC, Location VUMC, GGZ InGeest Research & Innovation, Amsterdam Neuroscience, the Netherlands
| | | | - Graziano Pinna
- The Psychiatric Institute, Department of Psychiatry, University of Illinois at Chicago, Chicago, IL, United States
| | - Rosario Pintos Lobo
- Center for Children and Families, Department of Psychology, Florida International University, Miami, FL, United States
| | - Kelly L Polnaszek
- Department of Psychology, Loyola University Chicago, Chicago, IL, United States
| | - Maryna Polyakova
- Neurology Department, Max Planck Institute for Human Cognitive and Brain Sciences, Leipzig, Germany
| | - Christine Rabinak
- Department of Pharmacy Practice, Wayne State University, Detroit, MI, United States
| | - S Helene Richter
- Department of Behavioural Biology, University of Münster, Badestraße 13, Münster, Germany
| | - Thalia Richter
- School of Psychological Sciences, University of Haifa, Haifa, Israel
| | - Giuseppe Riva
- Applied Technology for Neuro-Psychology Lab., Istituto Auxologico Italiano (IRCCS), Milan, Italy; Humane Technology Lab., Università Cattolica del Sacro Cuore, Milan, Italy
| | - Amelia Rizzo
- Department of Biomedical, Dental Sciences and Morpho-Functional Imaging - University of Messina, Italy
| | | | - Pedro Rosa
- Laboratory of Psychiatric Neuroimaging (LIM-21), Departamento e Instituto de Psiquiatria, Hospital das Clinicas HCFMUSP, Faculdade de Medicina, Universidade de Sao Paulo, Sao Paulo, Brazil
| | - Perminder S Sachdev
- Centre for Healthy Brain Ageing, Discipline of Psychiatry and Mental Health, University of New South Wales, Sydney, Australia; Neuropsychiatric Institute, The Prince of Wales Hospital, Sydney, Australia
| | - Wataru Sato
- Psychological Process Research Team, Guardian Robot Project, RIKEN, 2-2-2 Hikaridai, Seika-cho, Soraku-gun, Kyoto, Japan
| | - Matthias L Schroeter
- Max Planck Institute for Human Cognitive and Brain Sciences, Leipzig, Germany; Clinic for Cognitive Neurology, University Hospital Leipzig, Leipzig, Germany
| | - Susanne Schweizer
- Department of Psychology, University of Cambridge, Cambridge, United Kingdom; School of Psychology, University of New South Wales, Sydney, Australia
| | - Youssef Shiban
- Department of Psychology (Clinical Psychology and Psychotherapy), University of Regensburg, Regensburg, Germany; Department of Psychology (Clinical Psychology and Psychotherapy Research), PFH - Private University of Applied Sciences, Gottingen, Germany
| | - Advaith Siddharthan
- Knowledge Media Institute, The Open University, Milton Keynes MK7 6AA, United Kingdom
| | - Ewa Siedlecka
- School of Psychology, University of New South Wales, Sydney, NSW, Australia
| | - Robert C Smith
- Departments of Medicine and Psychiatry, Michigan State University, East Lansing, MI, United States
| | - Hermona Soreq
- Department of Biological Chemistry, Edmond and Lily Safra Center of Brain Science and The Institute of Life Sciences, Hebrew University, Jerusalem, Israel
| | - Derek P Spangler
- Department of Biobehavioral Health, The Pennsylvania State University, State College, PA, United States
| | - Emily R Stern
- Nathan Kline Institute for Psychiatric Research, Orangeburg, NY, United States; New York University School of Medicine, New York, NY, United States
| | - Charis Styliadis
- Neuroscience of Cognition and Affection group, Lab of Medical Physics and Digital Innovation, School of Medicine, Aristotle University of Thessaloniki, Thessaloniki, Greece
| | | | - James E Swain
- Departments of Psychiatry & Behavioral Health, Psychology, Obstetrics, Gynecology & Reproductive Medicine, and Program in Public Health, Renaissance School of Medicine at Stony Brook University, New York, United States
| | - Sébastien Urben
- Division of Child and Adolescent Psychiatry, Lausanne University Hospital (CHUV) and University of Lausanne, Lausanne, Switzerland
| | - Jan Van den Stock
- Neuropsychiatry, Department of Neurosciences, Leuven Brain Institute, KU Leuven, Leuven, Belgium
| | - Michael A Vander Kooij
- Translational Psychiatry, Department of Psychiatry and Psychotherapy, Universitatsmedizin der Johannes Guttenberg University Medical Center, Mainz, Germany
| | | | - Tamsyn E Van Rheenen
- University of Melbourne, Melbourne Neuropsychiatry Centre, Department of Psychiatry, 161 Barry Street, Carlton, VIC, Australia
| | - Michael B VanElzakker
- Division of Neurotherapeutics, Massachusetts General Hospital, Boston, MA, United States
| | - Carlos Ventura-Bort
- Department of Biological Psychology and Affective Science, Faculty of Human Sciences, University of Potsdam, Potsdam, Germany
| | - Edelyn Verona
- Department of Psychology, University of South Florida, Tampa, FL, United States
| | - Tyler Volk
- Professor Emeritus of Biology and Environmental Studies, New York University, New York, NY, United States
| | - Yi Wang
- Neuropsychology and Applied Cognitive Neuroscience Laboratory, CAS Key Laboratory of Mental Health, Institute of Psychology, Chinese Academy of Sciences, Beijing, China
| | - Leah T Weingast
- Department of Social Work and Human Services and the Department of Psychological Sciences, Center for Young Adult Addiction and Recovery, Kennesaw State University, Kennesaw, GA, United States
| | - Mathias Weymar
- Department of Biological Psychology and Affective Science, Faculty of Human Sciences, University of Potsdam, Potsdam, Germany; Faculty of Health Sciences Brandenburg, University of Potsdam, Germany
| | - Claire Williams
- School of Psychology, Faculty of Medicine, Health & Life Science, Swansea University, Swansea, United Kingdom; Elysium Neurological Services, Elysium Healthcare, The Avalon Centre, United Kingdom
| | - Megan L Willis
- School of Behavioural and Health Sciences, Australian Catholic University, Sydney, NSW, Australia
| | - Paula Yamashita
- Department of Integrative Physiology and Center for Neuroscience, University of Colorado Boulder, Boulder, CO, United States
| | - Roland Zahn
- Centre for Affective Disorders, Institute of Psychiatry, Psychology and Neuroscience, King's College London, United Kingdom
| | - Barbra Zupan
- Central Queensland University, School of Health, Medical and Applied Sciences, Bruce Highway, Rockhampton, QLD, Australia
| | - Leroy Lowe
- Neuroqualia (NGO), Truro, Nova Scotia, Canada.
| |
Collapse
|
15
|
Hodson R, Mehta M, Smith R. The empirical status of predictive coding and active inference. Neurosci Biobehav Rev 2024; 157:105473. [PMID: 38030100 DOI: 10.1016/j.neubiorev.2023.105473] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2023] [Revised: 10/27/2023] [Accepted: 11/16/2023] [Indexed: 12/01/2023]
Abstract
Research on predictive processing models has focused largely on two specific algorithmic theories: Predictive Coding for perception and Active Inference for decision-making. While these interconnected theories possess broad explanatory potential, they have only recently begun to receive direct empirical evaluation. Here, we review recent studies of Predictive Coding and Active Inference with a focus on evaluating the degree to which they are empirically supported. For Predictive Coding, we find that existing empirical evidence offers modest support. However, some positive results can also be explained by alternative feedforward (e.g., feature detection-based) models. For Active Inference, most empirical studies have focused on fitting these models to behavior as a means of identifying and explaining individual or group differences. While Active Inference models tend to explain behavioral data reasonably well, there has not been a focus on testing empirical validity of active inference theory per se, which would require formal comparison to other models (e.g., non-Bayesian or model-free reinforcement learning models). This review suggests that, while promising, a number of specific research directions are still necessary to evaluate the empirical adequacy and explanatory power of these algorithms.
Collapse
Affiliation(s)
| | | | - Ryan Smith
- Laureate Institute for Brain Research, USA.
| |
Collapse
|
16
|
Novicky F, Parr T, Friston K, Mirza MB, Sajid N. Bistable perception, precision and neuromodulation. Cereb Cortex 2024; 34:bhad401. [PMID: 37950879 PMCID: PMC10793076 DOI: 10.1093/cercor/bhad401] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2022] [Revised: 10/04/2023] [Accepted: 10/05/2023] [Indexed: 11/13/2023] Open
Abstract
Bistable perception follows from observing a static, ambiguous, (visual) stimulus with two possible interpretations. Here, we present an active (Bayesian) inference account of bistable perception and posit that perceptual transitions between different interpretations (i.e. inferences) of the same stimulus ensue from specific eye movements that shift the focus to a different visual feature. Formally, these inferences are a consequence of precision control that determines how confident beliefs are and change the frequency with which one can perceive-and alternate between-two distinct percepts. We hypothesized that there are multiple, but distinct, ways in which precision modulation can interact to give rise to a similar frequency of bistable perception. We validated this using numerical simulations of the Necker cube paradigm and demonstrate the multiple routes that underwrite the frequency of perceptual alternation. Our results provide an (enactive) computational account of the intricate precision balance underwriting bistable perception. Importantly, these precision parameters can be considered the computational homologs of particular neurotransmitters-i.e. acetylcholine, noradrenaline, dopamine-that have been previously implicated in controlling bistable perception, providing a computational link between the neurochemistry and perception.
Collapse
Affiliation(s)
- Filip Novicky
- Department of Neurophysics, Radboud University, Heyendaalseweg 135, 6525 AJ, Nijmegen, Netherlands
- Faculty of Psychology and Neuroscience, Maastricht University, Universiteitssingel 406229 ER, Maastricht, Netherlands
| | - Thomas Parr
- Wellcome Centre for Human Neuroimaging, UCL, 12 Queen Square London WC1N 3AR, United Kingdom
| | - Karl Friston
- Wellcome Centre for Human Neuroimaging, UCL, 12 Queen Square London WC1N 3AR, United Kingdom
| | - Muammer Berk Mirza
- Department of Psychology, University of Cambridge, Downing Pl, Cambridge CB2 3EB, United Kingdom
| | - Noor Sajid
- Wellcome Centre for Human Neuroimaging, UCL, 12 Queen Square London WC1N 3AR, United Kingdom
| |
Collapse
|
17
|
Bramley NR, Zhao B, Quillien T, Lucas CG. Local Search and the Evolution of World Models. Top Cogn Sci 2023. [PMID: 37850714 DOI: 10.1111/tops.12703] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2023] [Revised: 10/02/2023] [Accepted: 10/03/2023] [Indexed: 10/19/2023]
Abstract
An open question regarding how people develop their models of the world is how new candidates are generated for consideration out of infinitely many possibilities. We discuss the role that evolutionary mechanisms play in this process. Specifically, we argue that when it comes to developing a global world model, innovation is necessarily incremental, involving the generation and selection among random local mutations and recombinations of (parts of) one's current model. We argue that, by narrowing and guiding exploration, this feature of cognitive search is what allows human learners to discover better theories, without ever grappling directly with the problem of finding a "global optimum," or best possible world model. We suggest this aspect of cognitive processing works analogously to how blind variation and selection mechanisms drive biological evolution. We propose algorithms developed for program synthesis provide candidate mechanisms for how human minds might achieve this. We discuss objections and implications of this perspective, finally suggesting that a better process-level understanding of how humans incrementally explore compositional theory spaces can shed light on how we think, and provide explanatory traction on fundamental cognitive biases, including anchoring, probability matching, and confirmation bias.
Collapse
Affiliation(s)
| | - Bonan Zhao
- Department of Psychology, University of Edinburgh
| | - Tadeg Quillien
- Institute of Language, Cognition & Computation, Informatics University of Edinburgh
| | - Christopher G Lucas
- Institute of Language, Cognition & Computation, Informatics University of Edinburgh
| |
Collapse
|
18
|
Llobera J, Charbonnier C. Physics-based character animation and human motor control. Phys Life Rev 2023; 46:190-219. [PMID: 37480729 DOI: 10.1016/j.plrev.2023.06.012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2023] [Accepted: 06/25/2023] [Indexed: 07/24/2023]
Abstract
Motor neuroscience and physics-based character animation (PBCA) approach human and humanoid control from different perspectives. The primary goal of PBCA is to control the movement of a ragdoll (humanoid or animal) applying forces and torques within a physical simulation. The primary goal of motor neuroscience is to understand the contribution of different parts of the nervous system to generate coordinated movements. We review the functional principles and the functional anatomy of human motor control and the main strategies used in PBCA. We then explore common research points by discussing the functional anatomy and ongoing debates in motor neuroscience from the perspective of PBCA. We also suggest there are several benefits to be found in studying sensorimotor integration and human-character coordination through closer collaboration between these two fields.
Collapse
Affiliation(s)
- Joan Llobera
- Artanim Foundation, 40, chemin du Grand-Puits, 1217 Meyrin - Geneva, Switzerland.
| | - Caecilia Charbonnier
- Artanim Foundation, 40, chemin du Grand-Puits, 1217 Meyrin - Geneva, Switzerland
| |
Collapse
|
19
|
Parr T, Holmes E, Friston KJ, Pezzulo G. Cognitive effort and active inference. Neuropsychologia 2023; 184:108562. [PMID: 37080424 PMCID: PMC10636588 DOI: 10.1016/j.neuropsychologia.2023.108562] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2022] [Revised: 04/03/2023] [Accepted: 04/11/2023] [Indexed: 04/22/2023]
Abstract
This paper aims to integrate some key constructs in the cognitive neuroscience of cognitive control and executive function by formalising the notion of cognitive (or mental) effort in terms of active inference. To do so, we call upon a task used in neuropsychology to assess impulse inhibition-a Stroop task. In this task, participants must suppress the impulse to read a colour word and instead report the colour of the text of the word. The Stroop task is characteristically effortful, and we unpack a theory of mental effort in which, to perform this task accurately, participants must overcome prior beliefs about how they would normally act. However, our interest here is not in overt action, but in covert (mental) action. Mental actions change our beliefs but have no (direct) effect on the outside world-much like deploying covert attention. This account of effort as mental action lets us generate multimodal (choice, reaction time, and electrophysiological) data of the sort we might expect from a human participant engaging in this task. We analyse how parameters determining cognitive effort influence simulated responses and demonstrate that-when provided only with performance data-these parameters can be recovered, provided they are within a certain range.
Collapse
Affiliation(s)
- Thomas Parr
- Wellcome Centre for Human Neuroimaging, Queen Square Institute of Neurology, UK.
| | - Emma Holmes
- Wellcome Centre for Human Neuroimaging, Queen Square Institute of Neurology, UK
| | - Karl J Friston
- Wellcome Centre for Human Neuroimaging, Queen Square Institute of Neurology, UK
| | - Giovanni Pezzulo
- Institute of Cognitive Sciences and Technologies, National Research Council, Rome, Italy
| |
Collapse
|
20
|
Kim CS. Free energy and inference in living systems. Interface Focus 2023; 13:20220041. [PMID: 37065269 PMCID: PMC10102732 DOI: 10.1098/rsfs.2022.0041] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2022] [Accepted: 01/18/2023] [Indexed: 04/18/2023] Open
Abstract
Organisms are non-equilibrium, stationary systems self-organized via spontaneous symmetry breaking and undergoing metabolic cycles with broken detailed balance in the environment. The thermodynamic free-energy (FE) principle describes an organism's homeostasis as the regulation of biochemical work constrained by the physical FE cost. By contrast, recent research in neuroscience and theoretical biology explains a higher organism's homeostasis and allostasis as Bayesian inference facilitated by the informational FE. As an integrated approach to living systems, this study presents an FE minimization theory overarching the essential features of both the thermodynamic and neuroscientific FE principles. Our results reveal that the perception and action of animals result from active inference entailed by FE minimization in the brain, and the brain operates as a Schrödinger's machine conducting the neural mechanics of minimizing sensory uncertainty. A parsimonious model suggests that the Bayesian brain develops the optimal trajectories in neural manifolds and induces a dynamic bifurcation between neural attractors in the process of active inference.
Collapse
Affiliation(s)
- Chang Sub Kim
- Department of Physics, Chonnam National University, Gwangju 61186, Republic of Korea
| |
Collapse
|
21
|
Alicea B, Gordon R, Parent J. Embodied cognitive morphogenesis as a route to intelligent systems. Interface Focus 2023; 13:20220067. [PMID: 37065267 PMCID: PMC10102728 DOI: 10.1098/rsfs.2022.0067] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2022] [Accepted: 01/20/2023] [Indexed: 04/18/2023] Open
Abstract
The embryological view of development is that coordinated gene expression, cellular physics and migration provides the basis for phenotypic complexity. This stands in contrast with the prevailing view of embodied cognition, which claims that informational feedback between organisms and their environment is key to the emergence of intelligent behaviours. We aim to unite these two perspectives as embodied cognitive morphogenesis, in which morphogenetic symmetry breaking produces specialized organismal subsystems which serve as a substrate for the emergence of autonomous behaviours. As embodied cognitive morphogenesis produces fluctuating phenotypic asymmetry and the emergence of information processing subsystems, we observe three distinct properties: acquisition, generativity and transformation. Using a generic organismal agent, such properties are captured through models such as tensegrity networks, differentiation trees and embodied hypernetworks, providing a means to identify the context of various symmetry-breaking events in developmental time. Related concepts that help us define this phenotype further include concepts such as modularity, homeostasis and 4E (embodied, enactive, embedded and extended) cognition. We conclude by considering these autonomous developmental systems as a process called connectogenesis, connecting various parts of the emerged phenotype into an approach useful for the analysis of organisms and the design of bioinspired computational agents.
Collapse
Affiliation(s)
- Bradly Alicea
- OpenWorm Foundation, Boston, MA, USA
- Orthogonal Research and Education Laboratory, Champaign-Urbana, IL, USA
| | - Richard Gordon
- Department of Obstetrics and Gynecology, Wayne State University, Detroit, MI 48201, USA
| | - Jesse Parent
- Orthogonal Research and Education Laboratory, Champaign-Urbana, IL, USA
| |
Collapse
|
22
|
Yang Z, Diaz GJ, Fajen BR, Bailey R, Ororbia AG. A neural active inference model of perceptual-motor learning. Front Comput Neurosci 2023; 17:1099593. [PMID: 36890967 PMCID: PMC9986490 DOI: 10.3389/fncom.2023.1099593] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2022] [Accepted: 01/30/2023] [Indexed: 02/22/2023] Open
Abstract
The active inference framework (AIF) is a promising new computational framework grounded in contemporary neuroscience that can produce human-like behavior through reward-based learning. In this study, we test the ability for the AIF to capture the role of anticipation in the visual guidance of action in humans through the systematic investigation of a visual-motor task that has been well-explored-that of intercepting a target moving over a ground plane. Previous research demonstrated that humans performing this task resorted to anticipatory changes in speed intended to compensate for semi-predictable changes in target speed later in the approach. To capture this behavior, our proposed "neural" AIF agent uses artificial neural networks to select actions on the basis of a very short term prediction of the information about the task environment that these actions would reveal along with a long-term estimate of the resulting cumulative expected free energy. Systematic variation revealed that anticipatory behavior emerged only when required by limitations on the agent's movement capabilities, and only when the agent was able to estimate accumulated free energy over sufficiently long durations into the future. In addition, we present a novel formulation of the prior mapping function that maps a multi-dimensional world-state to a uni-dimensional distribution of free-energy/reward. Together, these results demonstrate the use of AIF as a plausible model of anticipatory visually guided behavior in humans.
Collapse
Affiliation(s)
- Zhizhuo Yang
- Golisano College of Computing and Information Sciences, Rochester Institute of Technology, Rochester, NY, United States
| | - Gabriel J Diaz
- Chester F. Carlson Center for Imaging Science, Rochester Institute of Technology, Rochester, NY, United States
| | - Brett R Fajen
- Department of Cognitive Science, Rensselaer Polytechnic Institute, Troy, NY, United States
| | - Reynold Bailey
- Golisano College of Computing and Information Sciences, Rochester Institute of Technology, Rochester, NY, United States
| | - Alexander G Ororbia
- Golisano College of Computing and Information Sciences, Rochester Institute of Technology, Rochester, NY, United States
| |
Collapse
|
23
|
Priorelli M, Stoianov IP. Flexible intentions: An Active Inference theory. Front Comput Neurosci 2023; 17:1128694. [PMID: 37021085 PMCID: PMC10067605 DOI: 10.3389/fncom.2023.1128694] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2022] [Accepted: 03/03/2023] [Indexed: 04/07/2023] Open
Abstract
We present a normative computational theory of how the brain may support visually-guided goal-directed actions in dynamically changing environments. It extends the Active Inference theory of cortical processing according to which the brain maintains beliefs over the environmental state, and motor control signals try to fulfill the corresponding sensory predictions. We propose that the neural circuitry in the Posterior Parietal Cortex (PPC) compute flexible intentions-or motor plans from a belief over targets-to dynamically generate goal-directed actions, and we develop a computational formalization of this process. A proof-of-concept agent embodying visual and proprioceptive sensors and an actuated upper limb was tested on target-reaching tasks. The agent behaved correctly under various conditions, including static and dynamic targets, different sensory feedbacks, sensory precisions, intention gains, and movement policies; limit conditions were individuated, too. Active Inference driven by dynamic and flexible intentions can thus support goal-directed behavior in constantly changing environments, and the PPC might putatively host its core intention mechanism. More broadly, the study provides a normative computational basis for research on goal-directed behavior in end-to-end settings and further advances mechanistic theories of active biological systems.
Collapse
|
24
|
Gijsen S, Grundei M, Blankenburg F. Active inference and the two-step task. Sci Rep 2022; 12:17682. [PMID: 36271279 PMCID: PMC9586964 DOI: 10.1038/s41598-022-21766-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2022] [Accepted: 09/30/2022] [Indexed: 01/18/2023] Open
Abstract
Sequential decision problems distill important challenges frequently faced by humans. Through repeated interactions with an uncertain world, unknown statistics need to be learned while balancing exploration and exploitation. Reinforcement learning is a prominent method for modeling such behaviour, with a prevalent application being the two-step task. However, recent studies indicate that the standard reinforcement learning model sometimes describes features of human task behaviour inaccurately and incompletely. We investigated whether active inference, a framework proposing a trade-off to the exploration-exploitation dilemma, could better describe human behaviour. Therefore, we re-analysed four publicly available datasets of the two-step task, performed Bayesian model selection, and compared behavioural model predictions. Two datasets, which revealed more model-based inference and behaviour indicative of directed exploration, were better described by active inference, while the models scored similarly for the remaining datasets. Learning using probability distributions appears to contribute to the improved model fits. Further, approximately half of all participants showed sensitivity to information gain as formulated under active inference, although behavioural exploration effects were not fully captured. These results contribute to the empirical validation of active inference as a model of human behaviour and the study of alternative models for the influential two-step task.
Collapse
Affiliation(s)
- Sam Gijsen
- Neurocomputation and Neuroimaging Unit, Freie Universität Berlin, 14195, Berlin, Germany.
- Berlin School of Mind and Brain, Humboldt-Universität zu Berlin, 10117, Berlin, Germany.
| | - Miro Grundei
- Neurocomputation and Neuroimaging Unit, Freie Universität Berlin, 14195, Berlin, Germany
- Berlin School of Mind and Brain, Humboldt-Universität zu Berlin, 10117, Berlin, Germany
| | - Felix Blankenburg
- Neurocomputation and Neuroimaging Unit, Freie Universität Berlin, 14195, Berlin, Germany
- Berlin School of Mind and Brain, Humboldt-Universität zu Berlin, 10117, Berlin, Germany
| |
Collapse
|
25
|
Shin JY, Kim C, Hwang HJ. Prior preference learning from experts: Designing a reward with active inference. Neurocomputing 2022. [DOI: 10.1016/j.neucom.2021.12.042] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
|
26
|
Harris DJ, Arthur T, Broadbent DP, Wilson MR, Vine SJ, Runswick OR. An Active Inference Account of Skilled Anticipation in Sport: Using Computational Models to Formalise Theory and Generate New Hypotheses. Sports Med 2022; 52:2023-2038. [PMID: 35503403 PMCID: PMC9388417 DOI: 10.1007/s40279-022-01689-w] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/06/2022] [Indexed: 11/30/2022]
Abstract
Optimal performance in time-constrained and dynamically changing environments depends on making reliable predictions about future outcomes. In sporting tasks, performers have been found to employ multiple information sources to maximise the accuracy of their predictions, but questions remain about how different information sources are weighted and integrated to guide anticipation. In this paper, we outline how predictive processing approaches, and active inference in particular, provide a unifying account of perception and action that explains many of the prominent findings in the sports anticipation literature. Active inference proposes that perception and action are underpinned by the organism’s need to remain within certain stable states. To this end, decision making approximates Bayesian inference and actions are used to minimise future prediction errors during brain–body–environment interactions. Using a series of Bayesian neurocomputational models based on a partially observable Markov process, we demonstrate that key findings from the literature can be recreated from the first principles of active inference. In doing so, we formulate a number of novel and empirically falsifiable hypotheses about human anticipation capabilities that could guide future investigations in the field.
Collapse
Affiliation(s)
- David J Harris
- School of Sport and Health Sciences, College of Life and Environmental Sciences, University of Exeter, St Luke's Campus, Exeter, EX1 2LU, UK.
| | - Tom Arthur
- School of Sport and Health Sciences, College of Life and Environmental Sciences, University of Exeter, St Luke's Campus, Exeter, EX1 2LU, UK
| | - David P Broadbent
- Division of Sport, Health and Exercise Sciences, Department of Life Sciences, Brunel University London, London, UK
| | - Mark R Wilson
- School of Sport and Health Sciences, College of Life and Environmental Sciences, University of Exeter, St Luke's Campus, Exeter, EX1 2LU, UK
| | - Samuel J Vine
- School of Sport and Health Sciences, College of Life and Environmental Sciences, University of Exeter, St Luke's Campus, Exeter, EX1 2LU, UK
| | - Oliver R Runswick
- Department of Psychology, Institute of Psychiatry, Psychology, and Neuroscience, King's College London, London, UK
| |
Collapse
|
27
|
Champion T, Da Costa L, Bowman H, Grześ M. Branching Time Active Inference: The theory and its generality. Neural Netw 2022; 151:295-316. [PMID: 35468491 DOI: 10.1016/j.neunet.2022.03.036] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2021] [Revised: 03/25/2022] [Accepted: 03/28/2022] [Indexed: 12/01/2022]
Abstract
Over the last 10 to 15 years, active inference has helped to explain various brain mechanisms from habit formation to dopaminergic discharge and even modelling curiosity. However, the current implementations suffer from an exponential (space and time) complexity class when computing the prior over all the possible policies up to the time-horizon. Fountas et al. (2020) used Monte Carlo tree search to address this problem, leading to impressive results in two different tasks. In this paper, we present an alternative framework that aims to unify tree search and active inference by casting planning as a structure learning problem. Two tree search algorithms are then presented. The first propagates the expected free energy forward in time (i.e., towards the leaves), while the second propagates it backward (i.e., towards the root). Then, we demonstrate that forward and backward propagations are related to active inference and sophisticated inference, respectively, thereby clarifying the differences between those two planning strategies.
Collapse
Affiliation(s)
- Théophile Champion
- University of Kent, School of Computing, Canterbury CT2 7NZ, United Kingdom.
| | - Lancelot Da Costa
- Imperial College London, Department of Mathematics, London SW7 2AZ, United Kingdom; Wellcome Centre for Human Neuroimaging, University College London, London, WC1N 3AR, United Kingdom.
| | - Howard Bowman
- University of Birmingham, School of Psychology, Birmingham B15 2TT, United Kingdom; University of Kent, School of Computing, Canterbury CT2 7NZ, United Kingdom.
| | - Marek Grześ
- University of Kent, School of Computing, Canterbury CT2 7NZ, United Kingdom.
| |
Collapse
|
28
|
Wauthier ST, De Boom C, Çatal O, Verbelen T, Dhoedt B. Model Reduction Through Progressive Latent Space Pruning in Deep Active Inference. Front Neurorobot 2022; 16:795846. [PMID: 35360827 PMCID: PMC8961807 DOI: 10.3389/fnbot.2022.795846] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2021] [Accepted: 02/14/2022] [Indexed: 11/17/2022] Open
Abstract
Although still not fully understood, sleep is known to play an important role in learning and in pruning synaptic connections. From the active inference perspective, this can be cast as learning parameters of a generative model and Bayesian model reduction, respectively. In this article, we show how to reduce dimensionality of the latent space of such a generative model, and hence model complexity, in deep active inference during training through a similar process. While deep active inference uses deep neural networks for state space construction, an issue remains in that the dimensionality of the latent space must be specified beforehand. We investigate two methods that are able to prune the latent space of deep active inference models. The first approach functions similar to sleep and performs model reduction post hoc. The second approach is a novel method which is more similar to reflection, operates during training and displays “aha” moments when the model is able to reduce latent space dimensionality. We show for two well-known simulated environments that model performance is retained in the first approach and only diminishes slightly in the second approach. We also show that reconstructions from a real world example are indistinguishable before and after reduction. We conclude that the most important difference constitutes a trade-off between training time and model performance in terms of accuracy and the ability to generalize, via minimization of model complexity.
Collapse
|
29
|
Da Costa L, Lanillos P, Sajid N, Friston K, Khan S. How Active Inference Could Help Revolutionise Robotics. ENTROPY (BASEL, SWITZERLAND) 2022; 24:361. [PMID: 35327872 PMCID: PMC8946999 DOI: 10.3390/e24030361] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/21/2022] [Revised: 02/24/2022] [Accepted: 02/28/2022] [Indexed: 02/05/2023]
Abstract
Recent advances in neuroscience have characterised brain function using mathematical formalisms and first principles that may be usefully applied elsewhere. In this paper, we explain how active inference-a well-known description of sentient behaviour from neuroscience-can be exploited in robotics. In short, active inference leverages the processes thought to underwrite human behaviour to build effective autonomous systems. These systems show state-of-the-art performance in several robotics settings; we highlight these and explain how this framework may be used to advance robotics.
Collapse
Affiliation(s)
- Lancelot Da Costa
- Department of Mathematics, Imperial College London, London SW7 2AZ, UK
- Wellcome Centre for Human Neuroimaging, University College London, London WC1N 3AR, UK; (N.S.); (K.F.)
| | - Pablo Lanillos
- Department of Artificial Intelligence, Donders Institute for Brain, Cognition and Behavior, Radboud University, 6525 XZ Nijmegen, The Netherlands;
| | - Noor Sajid
- Wellcome Centre for Human Neuroimaging, University College London, London WC1N 3AR, UK; (N.S.); (K.F.)
| | - Karl Friston
- Wellcome Centre for Human Neuroimaging, University College London, London WC1N 3AR, UK; (N.S.); (K.F.)
| | - Shujhat Khan
- Milton Keynes Hospital, Oxford Deanery, Milton Keynes MK6 5LD, UK;
| |
Collapse
|
30
|
Mazzaglia P, Verbelen T, Çatal O, Dhoedt B. The Free Energy Principle for Perception and Action: A Deep Learning Perspective. ENTROPY (BASEL, SWITZERLAND) 2022; 24:301. [PMID: 35205595 PMCID: PMC8871280 DOI: 10.3390/e24020301] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/21/2021] [Revised: 02/14/2022] [Accepted: 02/15/2022] [Indexed: 02/05/2023]
Abstract
The free energy principle, and its corollary active inference, constitute a bio-inspired theory that assumes biological agents act to remain in a restricted set of preferred states of the world, i.e., they minimize their free energy. Under this principle, biological agents learn a generative model of the world and plan actions in the future that will maintain the agent in an homeostatic state that satisfies its preferences. This framework lends itself to being realized in silico, as it comprehends important aspects that make it computationally affordable, such as variational inference and amortized planning. In this work, we investigate the tool of deep learning to design and realize artificial agents based on active inference, presenting a deep-learning oriented presentation of the free energy principle, surveying works that are relevant in both machine learning and active inference areas, and discussing the design choices that are involved in the implementation process. This manuscript probes newer perspectives for the active inference framework, grounding its theoretical aspects into more pragmatic affairs, offering a practical guide to active inference newcomers and a starting point for deep learning practitioners that would like to investigate implementations of the free energy principle.
Collapse
Affiliation(s)
- Pietro Mazzaglia
- IDLab, Ghent University, 9052 Gent, Belgium; (T.V.); (O.Ç.); (B.D.)
| | | | | | | |
Collapse
|
31
|
Isomura T, Shimazaki H, Friston KJ. Canonical neural networks perform active inference. Commun Biol 2022; 5:55. [PMID: 35031656 PMCID: PMC8760273 DOI: 10.1038/s42003-021-02994-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2021] [Accepted: 12/21/2021] [Indexed: 12/03/2022] Open
Abstract
This work considers a class of canonical neural networks comprising rate coding models, wherein neural activity and plasticity minimise a common cost function-and plasticity is modulated with a certain delay. We show that such neural networks implicitly perform active inference and learning to minimise the risk associated with future outcomes. Mathematical analyses demonstrate that this biological optimisation can be cast as maximisation of model evidence, or equivalently minimisation of variational free energy, under the well-known form of a partially observed Markov decision process model. This equivalence indicates that the delayed modulation of Hebbian plasticity-accompanied with adaptation of firing thresholds-is a sufficient neuronal substrate to attain Bayes optimal inference and control. We corroborated this proposition using numerical analyses of maze tasks. This theory offers a universal characterisation of canonical neural networks in terms of Bayesian belief updating and provides insight into the neuronal mechanisms underlying planning and adaptive behavioural control.
Collapse
Affiliation(s)
- Takuya Isomura
- Brain Intelligence Theory Unit, RIKEN Center for Brain Science, Wako, Saitama, 351-0198, Japan.
| | - Hideaki Shimazaki
- Center for Human Nature, Artificial Intelligence, and Neuroscience (CHAIN), Hokkaido University, Sapporo, Hokkaido, 060-0812, Japan
| | - Karl J Friston
- Wellcome Centre for Human Neuroimaging, Institute of Neurology, University College London, 12 Queen Square, London, WC1N 3AR, UK
| |
Collapse
|
32
|
Chase HW, Wilson RC, Waltz JA. Editorial: Computational accounts of reinforcement learning and decision making in psychiatric disorders. Front Psychiatry 2022; 13:966369. [PMID: 35958661 PMCID: PMC9358282 DOI: 10.3389/fpsyt.2022.966369] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/10/2022] [Accepted: 07/05/2022] [Indexed: 11/13/2022] Open
Affiliation(s)
- Henry W Chase
- Department of Psychiatry, University of Pittsburgh School of Medicine, Pittsburgh, PA, United States
| | - Robert C Wilson
- Department of Psychology, University of Arizona, Tucson, AZ, United States
| | - James A Waltz
- Maryland Psychiatric Research Center, University of Maryland School of Medicine, Baltimore, MD, United States
| |
Collapse
|
33
|
Whyte CJ, Hohwy J, Smith R. An active inference model of conscious access: How cognitive action selection reconciles the results of report and no-report paradigms. CURRENT RESEARCH IN NEUROBIOLOGY 2022; 3:100036. [PMID: 36304590 PMCID: PMC9593308 DOI: 10.1016/j.crneur.2022.100036] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2022] [Revised: 03/08/2022] [Accepted: 03/18/2022] [Indexed: 10/31/2022] Open
Abstract
Cognitive theories of consciousness, such as global workspace theory and higher-order theories, posit that frontoparietal circuits play a crucial role in conscious access. However, recent studies using no-report paradigms have posed a challenge to cognitive theories by demonstrating conscious accessibility in the apparent absence of prefrontal cortex (PFC) activation. To address this challenge, this paper presents a computational model of conscious access, based upon active inference, that treats working memory gating as a cognitive action. We simulate a visual masking task and show that late P3b-like event-related potentials (ERPs), and increased PFC activity, are induced by the working memory demands of self-report generation. When reporting demands are removed, these late ERPs vanish and PFC activity is reduced. These results therefore reproduce, and potentially explain, results from no-report paradigms. However, even without reporting demands, our model shows that simulated PFC activity on visible stimulus trials still crosses the threshold for reportability - maintaining the link between PFC and conscious access. Therefore, our simulations show that evidence provided by no-report paradigms does not necessarily contradict cognitive theories of consciousness.
Collapse
Affiliation(s)
- Christopher J. Whyte
- MRC Cognition and Brain Sciences Unit, University of Cambridge, Cambridge, UK
- Corresponding author. MRC Cognition and Brain Sciences Unit, 15 Chaucer Rd, Cambridge, CB2 7EF, UK.
| | - Jakob Hohwy
- Centre for Consciousness & Contemplative Studies, Monash University, Melbourne, Australia
| | - Ryan Smith
- Laureate Institute for Brain Research, Tulsa, OK, USA
| |
Collapse
|
34
|
Goekoop R, de Kleijn R. Permutation Entropy as a Universal Disorder Criterion: How Disorders at Different Scale Levels Are Manifestations of the Same Underlying Principle. ENTROPY (BASEL, SWITZERLAND) 2021; 23:1701. [PMID: 34946007 PMCID: PMC8700347 DOI: 10.3390/e23121701] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/04/2021] [Revised: 12/10/2021] [Accepted: 12/13/2021] [Indexed: 12/12/2022]
Abstract
What do bacteria, cells, organs, people, and social communities have in common? At first sight, perhaps not much. They involve totally different agents and scale levels of observation. On second thought, however, perhaps they share everything. A growing body of literature suggests that living systems at different scale levels of observation follow the same architectural principles and process information in similar ways. Moreover, such systems appear to respond in similar ways to rising levels of stress, especially when stress levels approach near-lethal levels. To explain such communalities, we argue that all organisms (including humans) can be modeled as hierarchical Bayesian controls systems that are governed by the same biophysical principles. Such systems show generic changes when taxed beyond their ability to correct for environmental disturbances. Without exception, stressed organisms show rising levels of 'disorder' (randomness, unpredictability) in internal message passing and overt behavior. We argue that such changes can be explained by a collapse of allostatic (high-level integrative) control, which normally synchronizes activity of the various components of a living system to produce order. The selective overload and cascading failure of highly connected (hub) nodes flattens hierarchical control, producing maladaptive behavior. Thus, we present a theory according to which organic concepts such as stress, a loss of control, disorder, disease, and death can be operationalized in biophysical terms that apply to all scale levels of organization. Given the presumed universality of this mechanism, 'losing control' appears to involve the same process anywhere, whether involving bacteria succumbing to an antibiotic agent, people suffering from physical or mental disorders, or social systems slipping into warfare. On a practical note, measures of disorder may serve as early warning signs of system failure even when catastrophic failure is still some distance away.
Collapse
Affiliation(s)
- Rutger Goekoop
- Parnassia Group, PsyQ Parnassia Academy, Department of Anxiety Disorders, Early Detection and Intervention Team (EDIT), Lijnbaan 4, 2512 VA Den Haag, The Netherlands
| | - Roy de Kleijn
- Cognitive Psychology Unit, Institute of Psychology & Leiden Institute for Brain and Cognition, Leiden University, Wassenaarseweg 52, 2333 AK Leiden, The Netherlands;
| |
Collapse
|
35
|
Horii T, Nagai Y. Active Inference Through Energy Minimization in Multimodal Affective Human-Robot Interaction. Front Robot AI 2021; 8:684401. [PMID: 34901166 PMCID: PMC8662315 DOI: 10.3389/frobt.2021.684401] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2021] [Accepted: 10/25/2021] [Indexed: 11/25/2022] Open
Abstract
During communication, humans express their emotional states using various modalities (e.g., facial expressions and gestures), and they estimate the emotional states of others by paying attention to multimodal signals. To ensure that a communication robot with limited resources can pay attention to such multimodal signals, the main challenge involves selecting the most effective modalities among those expressed. In this study, we propose an active perception method that involves selecting the most informative modalities using a criterion based on energy minimization. This energy-based model can learn the probability of the network state using energy values, whereby a lower energy value represents a higher probability of the state. A multimodal deep belief network, which is an energy-based model, was employed to represent the relationships between the emotional states and multimodal sensory signals. Compared to other active perception methods, the proposed approach demonstrated improved accuracy using limited information in several contexts associated with affective human–robot interaction. We present the differences and advantages of our method compared to other methods through mathematical formulations using, for example, information gain as a criterion. Further, we evaluate performance of our method, as pertains to active inference, which is based on the free energy principle. Consequently, we establish that our method demonstrated superior performance in tasks associated with mutually correlated multimodal information.
Collapse
Affiliation(s)
- Takato Horii
- Graduate School of Engineering Science, Osaka University, Osaka, Japan.,International Research Center for Neurointelligence (WPI-IRCN), The University of Tokyo, Tokyo, Japan
| | - Yukie Nagai
- International Research Center for Neurointelligence (WPI-IRCN), The University of Tokyo, Tokyo, Japan.,Institute for AI and Beyond, The University of Tokyo, Tokyo, Japan
| |
Collapse
|
36
|
Koudahl MT, Kouw WM, de Vries B. On Epistemics in Expected Free Energy for Linear Gaussian State Space Models. ENTROPY 2021; 23:e23121565. [PMID: 34945871 PMCID: PMC8700494 DOI: 10.3390/e23121565] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/28/2021] [Revised: 11/19/2021] [Accepted: 11/23/2021] [Indexed: 01/20/2023]
Abstract
Active Inference (AIF) is a framework that can be used both to describe information processing in naturally intelligent systems, such as the human brain, and to design synthetic intelligent systems (agents). In this paper we show that Expected Free Energy (EFE) minimisation, a core feature of the framework, does not lead to purposeful explorative behaviour in linear Gaussian dynamical systems. We provide a simple proof that, due to the specific construction used for the EFE, the terms responsible for the exploratory (epistemic) drive become constant in the case of linear Gaussian systems. This renders AIF equivalent to KL control. From a theoretical point of view this is an interesting result since it is generally assumed that EFE minimisation will always introduce an exploratory drive in AIF agents. While the full EFE objective does not lead to exploration in linear Gaussian dynamical systems, the principles of its construction can still be used to design objectives that include an epistemic drive. We provide an in-depth analysis of the mechanics behind the epistemic drive of AIF agents and show how to design objectives for linear Gaussian dynamical systems that do include an epistemic drive. Concretely, we show that focusing solely on epistemics and dispensing with goal-directed terms leads to a form of maximum entropy exploration that is heavily dependent on the type of control signals driving the system. Additive controls do not permit such exploration. From a practical point of view this is an important result since linear Gaussian dynamical systems with additive controls are an extensively used model class, encompassing for instance Linear Quadratic Gaussian controllers. On the other hand, linear Gaussian dynamical systems driven by multiplicative controls such as switching transition matrices do permit an exploratory drive.
Collapse
Affiliation(s)
- Magnus T. Koudahl
- Department of Electrical Engineering, Eindhoven University of Technology, 5612 AZ Eindhoven, The Netherlands; (W.M.K.); (B.d.V.)
- Correspondence:
| | - Wouter M. Kouw
- Department of Electrical Engineering, Eindhoven University of Technology, 5612 AZ Eindhoven, The Netherlands; (W.M.K.); (B.d.V.)
| | - Bert de Vries
- Department of Electrical Engineering, Eindhoven University of Technology, 5612 AZ Eindhoven, The Netherlands; (W.M.K.); (B.d.V.)
- GN Hearing, JF Kennedylaan 2, 5612 AB Eindhoven, The Netherlands
| |
Collapse
|
37
|
Parr T, Pezzulo G. Understanding, Explanation, and Active Inference. Front Syst Neurosci 2021; 15:772641. [PMID: 34803619 PMCID: PMC8602880 DOI: 10.3389/fnsys.2021.772641] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2021] [Accepted: 10/15/2021] [Indexed: 11/13/2022] Open
Abstract
While machine learning techniques have been transformative in solving a range of problems, an important challenge is to understand why they arrive at the decisions they output. Some have argued that this necessitates augmenting machine intelligence with understanding such that, when queried, a machine is able to explain its behaviour (i.e., explainable AI). In this article, we address the issue of machine understanding from the perspective of active inference. This paradigm enables decision making based upon a model of how data are generated. The generative model contains those variables required to explain sensory data, and its inversion may be seen as an attempt to explain the causes of these data. Here we are interested in explanations of one's own actions. This implies a deep generative model that includes a model of the world, used to infer policies, and a higher-level model that attempts to predict which policies will be selected based upon a space of hypothetical (i.e., counterfactual) explanations-and which can subsequently be used to provide (retrospective) explanations about the policies pursued. We illustrate the construct validity of this notion of understanding in relation to human understanding by highlighting the similarities in computational architecture and the consequences of its dysfunction.
Collapse
Affiliation(s)
- Thomas Parr
- Wellcome Centre for Human Neuroimaging, Queen Square Institute of Neurology, University College London, London, United Kingdom
| | - Giovanni Pezzulo
- Institute of Cognitive Sciences and Technologies, National Research Council, Rome, Italy
| |
Collapse
|
38
|
Rorot W. Bayesian theories of consciousness: a review in search for a minimal unifying model. Neurosci Conscious 2021; 2021:niab038. [PMID: 34650816 PMCID: PMC8512254 DOI: 10.1093/nc/niab038] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2021] [Revised: 09/10/2021] [Accepted: 09/22/2021] [Indexed: 11/30/2022] Open
Abstract
The goal of the paper is to review existing work on consciousness within the frameworks of Predictive Processing, Active Inference, and Free Energy Principle. The emphasis is put on the role played by the precision and complexity of the internal generative model. In the light of those proposals, these two properties appear to be the minimal necessary components for the emergence of conscious experience-a Minimal Unifying Model of consciousness.
Collapse
Affiliation(s)
- Wiktor Rorot
- Faculty of Philosophy and Faculty of Psychology, University of Warsaw, ul. Krakowskie Przedmieście 3, 00-927, Stawki 5/7, Warsaw 00-183, Poland
| |
Collapse
|
39
|
van de Laar T, Wymeersch H, Şenöz İ, Özçelikkale A. Chance-Constrained Active Inference. Neural Comput 2021; 33:2710-2735. [PMID: 34280254 DOI: 10.1162/neco_a_01427] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2021] [Accepted: 05/03/2021] [Indexed: 11/04/2022]
Abstract
Active inference (ActInf) is an emerging theory that explains perception and action in biological agents in terms of minimizing a free energy bound on Bayesian surprise. Goal-directed behavior is elicited by introducing prior beliefs on the underlying generative model. In contrast to prior beliefs, which constrain all realizations of a random variable, we propose an alternative approach through chance constraints, which allow for a (typically small) probability of constraint violation, and demonstrate how such constraints can be used as intrinsic drivers for goal-directed behavior in ActInf. We illustrate how chance-constrained ActInf weights all imposed (prior) constraints on the generative model, allowing, for example, for a trade-off between robust control and empirical chance constraint violation. Second, we interpret the proposed solution within a message passing framework. Interestingly, the message passing interpretation is not only relevant to the context of ActInf, but also provides a general-purpose approach that can account for chance constraints on graphical models. The chance constraint message updates can then be readily combined with other prederived message update rules without the need for custom derivations. The proposed chance-constrained message passing framework thus accelerates the search for workable models in general and can be used to complement message-passing formulations on generative neural models.
Collapse
Affiliation(s)
- Thijs van de Laar
- Eindhoven University of Technology, 5612 AP, Eindhoven, The Netherlands
| | - Henk Wymeersch
- Chalmers University of Technology, 41296, Gothenburg, Sweden
| | - İsmail Şenöz
- Eindhoven University of Technology, 5612 AP, Eindhoven, The Netherlands
| | | |
Collapse
|
40
|
An Active Inference Model of Collective Intelligence. ENTROPY 2021; 23:e23070830. [PMID: 34210008 PMCID: PMC8306784 DOI: 10.3390/e23070830] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/01/2021] [Revised: 06/11/2021] [Accepted: 06/12/2021] [Indexed: 12/05/2022]
Abstract
Collective intelligence, an emergent phenomenon in which a composite system of multiple interacting agents performs at levels greater than the sum of its parts, has long compelled research efforts in social and behavioral sciences. To date, however, formal models of collective intelligence have lacked a plausible mathematical description of the relationship between local-scale interactions between autonomous sub-system components (individuals) and global-scale behavior of the composite system (the collective). In this paper we use the Active Inference Formulation (AIF), a framework for explaining the behavior of any non-equilibrium steady state system at any scale, to posit a minimal agent-based model that simulates the relationship between local individual-level interaction and collective intelligence. We explore the effects of providing baseline AIF agents (Model 1) with specific cognitive capabilities: Theory of Mind (Model 2), Goal Alignment (Model 3), and Theory of Mind with Goal Alignment (Model 4). These stepwise transitions in sophistication of cognitive ability are motivated by the types of advancements plausibly required for an AIF agent to persist and flourish in an environment populated by other highly autonomous AIF agents, and have also recently been shown to map naturally to canonical steps in human cognitive ability. Illustrative results show that stepwise cognitive transitions increase system performance by providing complementary mechanisms for alignment between agents’ local and global optima. Alignment emerges endogenously from the dynamics of interacting AIF agents themselves, rather than being imposed exogenously by incentives to agents’ behaviors (contra existing computational models of collective intelligence) or top-down priors for collective behavior (contra existing multiscale simulations of AIF). These results shed light on the types of generic information-theoretic patterns conducive to collective intelligence in human and other complex adaptive systems.
Collapse
|
41
|
Sajid N, Holmes E, Hope TM, Fountas Z, Price CJ, Friston KJ. Simulating lesion-dependent functional recovery mechanisms. Sci Rep 2021; 11:7475. [PMID: 33811259 PMCID: PMC8018968 DOI: 10.1038/s41598-021-87005-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2020] [Accepted: 03/22/2021] [Indexed: 01/13/2023] Open
Abstract
Functional recovery after brain damage varies widely and depends on many factors, including lesion site and extent. When a neuronal system is damaged, recovery may occur by engaging residual (e.g., perilesional) components. When damage is extensive, recovery depends on the availability of other intact neural structures that can reproduce the same functional output (i.e., degeneracy). A system's response to damage may occur rapidly, require learning or both. Here, we simulate functional recovery from four different types of lesions, using a generative model of word repetition that comprised a default premorbid system and a less used alternative system. The synthetic lesions (i) completely disengaged the premorbid system, leaving the alternative system intact, (ii) partially damaged both premorbid and alternative systems, and (iii) limited the experience-dependent plasticity of both. The results, across 1000 trials, demonstrate that (i) a complete disconnection of the premorbid system naturally invoked the engagement of the other, (ii) incomplete damage to both systems had a much more devastating long-term effect on model performance and (iii) the effect of reducing learning capacity within each system. These findings contribute to formal frameworks for interpreting the effect of different types of lesions.
Collapse
Affiliation(s)
- Noor Sajid
- Wellcome Centre for Human Neuroimaging, University College London, UCL Queen Square Institute of Neurology, 12 Queen Square, London, WC1N 3AR, UK.
| | - Emma Holmes
- Wellcome Centre for Human Neuroimaging, University College London, UCL Queen Square Institute of Neurology, 12 Queen Square, London, WC1N 3AR, UK
| | - Thomas M Hope
- Wellcome Centre for Human Neuroimaging, University College London, UCL Queen Square Institute of Neurology, 12 Queen Square, London, WC1N 3AR, UK
| | - Zafeirios Fountas
- Wellcome Centre for Human Neuroimaging, University College London, UCL Queen Square Institute of Neurology, 12 Queen Square, London, WC1N 3AR, UK
- Huawei 2012 Laboratories, London, UK
| | - Cathy J Price
- Wellcome Centre for Human Neuroimaging, University College London, UCL Queen Square Institute of Neurology, 12 Queen Square, London, WC1N 3AR, UK
| | - Karl J Friston
- Wellcome Centre for Human Neuroimaging, University College London, UCL Queen Square Institute of Neurology, 12 Queen Square, London, WC1N 3AR, UK
| |
Collapse
|
42
|
The 2-D Cluster Variation Method: Topography Illustrations and Their Enthalpy Parameter Correlations. ENTROPY 2021; 23:e23030319. [PMID: 33800360 PMCID: PMC7999889 DOI: 10.3390/e23030319] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/30/2020] [Revised: 02/22/2021] [Accepted: 03/01/2021] [Indexed: 01/02/2023]
Abstract
One of the biggest challenges in characterizing 2-D image topographies is finding a low-dimensional parameter set that can succinctly describe, not so much image patterns themselves, but the nature of these patterns. The 2-D cluster variation method (CVM), introduced by Kikuchi in 1951, can characterize very local image pattern distributions using configuration variables, identifying nearest-neighbor, next-nearest-neighbor, and triplet configurations. Using the 2-D CVM, we can characterize 2-D topographies using just two parameters; the activation enthalpy (ε0) and the interaction enthalpy (ε1). Two different initial topographies (“scale-free-like” and “extreme rich club-like”) were each computationally brought to a CVM free energy minimum, for the case where the activation enthalpy was zero and different values were used for the interaction enthalpy. The results are: (1) the computational configuration variable results differ significantly from the analytically-predicted values well before ε1 approaches the known divergence as ε1→0.881, (2) the range of potentially useful parameter values, favoring clustering of like-with-like units, is limited to the region where ε0<3 and ε1<0.25, and (3) the topographies in the systems that are brought to a free energy minimum show interesting visual features, such as extended “spider legs” connecting previously unconnected “islands,” and as well as evolution of “peninsulas” in what were previously solid masses.
Collapse
|
43
|
Sajid N, Parr T, Hope TM, Price CJ, Friston KJ. Degeneracy and Redundancy in Active Inference. Cereb Cortex 2020; 30:5750-5766. [PMID: 32488244 PMCID: PMC7899066 DOI: 10.1093/cercor/bhaa148] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2020] [Revised: 05/11/2020] [Accepted: 05/11/2020] [Indexed: 12/16/2022] Open
Abstract
The notions of degeneracy and redundancy are important constructs in many areas, ranging from genomics through to network science. Degeneracy finds a powerful role in neuroscience, explaining key aspects of distributed processing and structure-function relationships in the brain. For example, degeneracy accounts for the superadditive effect of lesions on functional deficits in terms of a "many-to-one" structure-function mapping. In this paper, we offer a principled account of degeneracy and redundancy, when function is operationalized in terms of active inference, namely, a formulation of perception and action as belief updating under generative models of the world. In brief, "degeneracy" is quantified by the "entropy" of posterior beliefs about the causes of sensations, while "redundancy" is the "complexity" cost incurred by forming those beliefs. From this perspective, degeneracy and redundancy are complementary: Active inference tries to minimize redundancy while maintaining degeneracy. This formulation is substantiated using statistical and mathematical notions of degenerate mappings and statistical efficiency. We then illustrate changes in degeneracy and redundancy during the learning of a word repetition task. Finally, we characterize the effects of lesions-to intrinsic and extrinsic connections-using in silico disconnections. These numerical analyses highlight the fundamental difference between degeneracy and redundancy-and how they score distinct imperatives for perceptual inference and structure learning that are relevant to synthetic and biological intelligence.
Collapse
Affiliation(s)
- Noor Sajid
- Wellcome Centre for Human Neuroimaging, UCL Queen Square Institute of Neurology, London, WC1N 3AR, UK
| | - Thomas Parr
- Wellcome Centre for Human Neuroimaging, UCL Queen Square Institute of Neurology, London, WC1N 3AR, UK
| | - Thomas M Hope
- Wellcome Centre for Human Neuroimaging, UCL Queen Square Institute of Neurology, London, WC1N 3AR, UK
| | - Cathy J Price
- Wellcome Centre for Human Neuroimaging, UCL Queen Square Institute of Neurology, London, WC1N 3AR, UK
| | - Karl J Friston
- Wellcome Centre for Human Neuroimaging, UCL Queen Square Institute of Neurology, London, WC1N 3AR, UK
| |
Collapse
|