1
|
Rolls ET. The memory systems of the human brain and generative artificial intelligence. Heliyon 2024; 10:e31965. [PMID: 38841455 PMCID: PMC11152951 DOI: 10.1016/j.heliyon.2024.e31965] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2023] [Revised: 05/11/2024] [Accepted: 05/24/2024] [Indexed: 06/07/2024] Open
Abstract
Generative Artificial Intelligence foundation models (for example Generative Pre-trained Transformer - GPT - models) can generate the next token given a sequence of tokens. How can this 'generative AI' be compared with the 'real' intelligence of the human brain, when for example a human generates a whole memory in response to an incomplete retrieval cue, and then generates further prospective thoughts? Here these two types of generative intelligence, artificial in machines and real in the human brain are compared, and it is shown how when whole memories are generated by hippocampal recall in response to an incomplete retrieval cue, what the human brain computes, and how it computes it, are very different from generative AI. Key differences are the use of local associative learning rules in the hippocampal memory system, and of non-local backpropagation of error learning in AI. Indeed, it is argued that the whole operation of the human brain is performed computationally very differently to what is implemented in generative AI. Moreover, it is emphasized that the primate including human hippocampal system includes computations about spatial view and where objects and people are in scenes, whereas in rodents the emphasis is on place cells and path integration by movements between places. This comparison with generative memory and processing in the human brain has interesting implications for the further development of generative AI and for neuroscience research.
Collapse
Affiliation(s)
- Edmund T. Rolls
- Oxford Centre for Computational Neuroscience, Oxford, UK
- Department of Computer Science, University of Warwick, Coventry, CV4 7AL, UK
- Institute of Science and Technology for Brain Inspired Intelligence, Fudan University, Shanghai, 200403, China
| |
Collapse
|
2
|
Pezzulo G, Parr T, Cisek P, Clark A, Friston K. Generating meaning: active inference and the scope and limits of passive AI. Trends Cogn Sci 2024; 28:97-112. [PMID: 37973519 DOI: 10.1016/j.tics.2023.10.002] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2023] [Revised: 10/03/2023] [Accepted: 10/05/2023] [Indexed: 11/19/2023]
Abstract
Prominent accounts of sentient behavior depict brains as generative models of organismic interaction with the world, evincing intriguing similarities with current advances in generative artificial intelligence (AI). However, because they contend with the control of purposive, life-sustaining sensorimotor interactions, the generative models of living organisms are inextricably anchored to the body and world. Unlike the passive models learned by generative AI systems, they must capture and control the sensory consequences of action. This allows embodied agents to intervene upon their worlds in ways that constantly put their best models to the test, thus providing a solid bedrock that is - we argue - essential to the development of genuine understanding. We review the resulting implications and consider future directions for generative AI.
Collapse
Affiliation(s)
- Giovanni Pezzulo
- Institute of Cognitive Sciences and Technologies, National Research Council, Rome, Italy.
| | - Thomas Parr
- Nuffield Department of Clinical Neurosciences, University of Oxford
| | - Paul Cisek
- Department of Neuroscience, University of Montréal, Montréal, Québec, Canada
| | - Andy Clark
- Department of Philosophy, University of Sussex, Brighton, UK; Department of Informatics, University of Sussex, Brighton, UK; Department of Philosophy, Macquarie University, Sydney, New South Wales, Australia
| | - Karl Friston
- Wellcome Centre for Human Neuroimaging, Queen Square Institute of Neurology, University College London, London, UK; VERSES AI Research Lab, Los Angeles, CA, USA
| |
Collapse
|
3
|
Song S, Yu F, Jiang X, Zhu J, Cheng W, Fang X. Loop closure detection of visual SLAM based on variational autoencoder. Front Neurorobot 2024; 17:1301785. [PMID: 38313328 PMCID: PMC10837850 DOI: 10.3389/fnbot.2023.1301785] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2023] [Accepted: 12/26/2023] [Indexed: 02/06/2024] Open
Abstract
Loop closure detection is an important module for simultaneous localization and mapping (SLAM). Correct detection of loops can reduce the cumulative drift in positioning. Because traditional detection methods rely on handicraft features, false positive detections can occur when the environment changes, resulting in incorrect estimates and an inability to obtain accurate maps. In this research paper, a loop closure detection method based on a variational autoencoder (VAE) is proposed. It is intended to be used as a feature extractor to extract image features through neural networks to replace the handicraft features used in traditional methods. This method extracts a low-dimensional vector as the representation of the image. At the same time, the attention mechanism is added to the network and constraints are added to improve the loss function for better image representation. In the back-end feature matching process, geometric checking is used to filter out the wrong matching for the false positive problem. Finally, through numerical experiments, the proposed method is demonstrated to have a better precision-recall curve than the traditional method of the bag-of-words model and other deep learning methods and is highly robust to environmental changes. In addition, experiments on datasets from three different scenarios also demonstrate that the method can be applied in real-world scenarios and that it has a good performance.
Collapse
Affiliation(s)
- Shibin Song
- Department of College of Electrical Engineering and Automation, Shandong University of Science and Technology, Qingdao, China
| | - Fengjie Yu
- Department of College of Electrical Engineering and Automation, Shandong University of Science and Technology, Qingdao, China
| | - Xiaojie Jiang
- Yantai Tulan Electronic Technology Co., Ltd, Yantai, China
| | - Jie Zhu
- Department of College of Electrical Engineering and Automation, Shandong University of Science and Technology, Qingdao, China
| | - Weihao Cheng
- Department of College of Electrical Engineering and Automation, Shandong University of Science and Technology, Qingdao, China
| | - Xiao Fang
- Department of College of Electrical Engineering and Automation, Shandong University of Science and Technology, Qingdao, China
| |
Collapse
|
4
|
de Tinguy D, Van de Maele T, Verbelen T, Dhoedt B. Spatial and Temporal Hierarchy for Autonomous Navigation Using Active Inference in Minigrid Environment. ENTROPY (BASEL, SWITZERLAND) 2024; 26:83. [PMID: 38248208 PMCID: PMC11154534 DOI: 10.3390/e26010083] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/11/2023] [Revised: 01/05/2024] [Accepted: 01/12/2024] [Indexed: 01/23/2024]
Abstract
Robust evidence suggests that humans explore their environment using a combination of topological landmarks and coarse-grained path integration. This approach relies on identifiable environmental features (topological landmarks) in tandem with estimations of distance and direction (coarse-grained path integration) to construct cognitive maps of the surroundings. This cognitive map is believed to exhibit a hierarchical structure, allowing efficient planning when solving complex navigation tasks. Inspired by human behaviour, this paper presents a scalable hierarchical active inference model for autonomous navigation, exploration, and goal-oriented behaviour. The model uses visual observation and motion perception to combine curiosity-driven exploration with goal-oriented behaviour. Motion is planned using different levels of reasoning, i.e., from context to place to motion. This allows for efficient navigation in new spaces and rapid progress toward a target. By incorporating these human navigational strategies and their hierarchical representation of the environment, this model proposes a new solution for autonomous navigation and exploration. The approach is validated through simulations in a mini-grid environment.
Collapse
Affiliation(s)
| | | | - Tim Verbelen
- VERSES AI Research Lab, Los Angeles, CA 90016, USA;
| | | |
Collapse
|
5
|
Sanfey J. Simultaneity of consciousness with physical reality: the key that unlocks the mind-matter problem. Front Psychol 2023; 14:1173653. [PMID: 37842692 PMCID: PMC10568466 DOI: 10.3389/fpsyg.2023.1173653] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2023] [Accepted: 09/12/2023] [Indexed: 10/17/2023] Open
Abstract
The problem of explaining the relationship between subjective experience and physical reality remains difficult and unresolved. In most explanations, consciousness is epiphenomenal, without causal power. The most notable exception is Integrated Information Theory (IIT), which provides a causal explanation for consciousness. However, IIT relies on an identity between subjectivity and a particular type of physical structure, namely with an information structure that has intrinsic causal power greater than the sum of its parts. Any theory that relies on a psycho-phyiscal identity must eventually appeal to panpsychism, which undermines that theory's claim to be fundamental. IIT has recently pivoted towards a strong version of causal emergence, but macroscopic structures cannot be stronger causally than their microphysical parts without some new physical law or governing principle. The approach taken here is designed to uncover such a principle. The decisive argument is entirely deductive from initial premises that are phenomenologically certain. If correct, the arguments prove that conscious experience is sufficient to create additional degrees of causal freedom independently of the content of experience, and in a manner that is unpredictable and unobservable by any temporally sequential means. This provides a fundamental principle about consciousness, and a conceptual bridge between it and the physics describing what is experienced. The principle makes testable predictions about brain function, with notable differences from IIT, some of which are also empirically testable.
Collapse
|
6
|
Dumont NSY, Furlong PM, Orchard J, Eliasmith C. Exploiting semantic information in a spiking neural SLAM system. Front Neurosci 2023; 17:1190515. [PMID: 37476829 PMCID: PMC10354246 DOI: 10.3389/fnins.2023.1190515] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2023] [Accepted: 06/16/2023] [Indexed: 07/22/2023] Open
Abstract
To navigate in new environments, an animal must be able to keep track of its position while simultaneously creating and updating an internal map of features in the environment, a problem formulated as simultaneous localization and mapping (SLAM) in the field of robotics. This requires integrating information from different domains, including self-motion cues, sensory, and semantic information. Several specialized neuron classes have been identified in the mammalian brain as being involved in solving SLAM. While biology has inspired a whole class of SLAM algorithms, the use of semantic information has not been explored in such work. We present a novel, biologically plausible SLAM model called SSP-SLAM-a spiking neural network designed using tools for large scale cognitive modeling. Our model uses a vector representation of continuous spatial maps, which can be encoded via spiking neural activity and bound with other features (continuous and discrete) to create compressed structures containing semantic information from multiple domains (e.g., spatial, temporal, visual, conceptual). We demonstrate that the dynamics of these representations can be implemented with a hybrid oscillatory-interference and continuous attractor network of head direction cells. The estimated self-position from this network is used to learn an associative memory between semantically encoded landmarks and their positions, i.e., an environment map, which is used for loop closure. Our experiments demonstrate that environment maps can be learned accurately and their use greatly improves self-position estimation. Furthermore, grid cells, place cells, and object vector cells are observed by this model. We also run our path integrator network on the NengoLoihi neuromorphic emulator to demonstrate feasibility for a full neuromorphic implementation for energy efficient SLAM.
Collapse
|
7
|
Safron A. Integrated world modeling theory expanded: Implications for the future of consciousness. Front Comput Neurosci 2022; 16:642397. [PMID: 36507308 PMCID: PMC9730424 DOI: 10.3389/fncom.2022.642397] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2020] [Accepted: 10/24/2022] [Indexed: 11/27/2022] Open
Abstract
Integrated world modeling theory (IWMT) is a synthetic theory of consciousness that uses the free energy principle and active inference (FEP-AI) framework to combine insights from integrated information theory (IIT) and global neuronal workspace theory (GNWT). Here, I first review philosophical principles and neural systems contributing to IWMT's integrative perspective. I then go on to describe predictive processing models of brains and their connections to machine learning architectures, with particular emphasis on autoencoders (perceptual and active inference), turbo-codes (establishment of shared latent spaces for multi-modal integration and inferential synergy), and graph neural networks (spatial and somatic modeling and control). Future directions for IIT and GNWT are considered by exploring ways in which modules and workspaces may be evaluated as both complexes of integrated information and arenas for iterated Bayesian model selection. Based on these considerations, I suggest novel ways in which integrated information might be estimated using concepts from probabilistic graphical models, flow networks, and game theory. Mechanistic and computational principles are also considered with respect to the ongoing debate between IIT and GNWT regarding the physical substrates of different kinds of conscious and unconscious phenomena. I further explore how these ideas might relate to the "Bayesian blur problem," or how it is that a seemingly discrete experience can be generated from probabilistic modeling, with some consideration of analogies from quantum mechanics as potentially revealing different varieties of inferential dynamics. I go on to describe potential means of addressing critiques of causal structure theories based on network unfolding, and the seeming absurdity of conscious expander graphs (without cybernetic symbol grounding). Finally, I discuss future directions for work centered on attentional selection and the evolutionary origins of consciousness as facilitated "unlimited associative learning." While not quite solving the Hard problem, this article expands on IWMT as a unifying model of consciousness and the potential future evolution of minds.
Collapse
Affiliation(s)
- Adam Safron
- Department of Psychiatry and Behavioral Sciences, Johns Hopkins University School of Medicine, Center for Psychedelic and Consciousness Research, Baltimore, MD, United States
- Cognitive Science Program, Indiana University, Bloomington, IN, United States
- Institute for Advanced Consciousness Studies (IACS), Santa Monica, CA, United States
| |
Collapse
|