1
|
Kawai Y, Park J, Tsuda I, Asada M. Learning long-term motor timing/patterns on an orthogonal basis in random neural networks. Neural Netw 2023; 163:298-311. [PMID: 37087852 DOI: 10.1016/j.neunet.2023.04.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2022] [Revised: 03/15/2023] [Accepted: 04/05/2023] [Indexed: 04/25/2023]
Abstract
The ability of the brain to generate complex spatiotemporal patterns with specific timings is essential for motor learning and temporal processing. An approach that can model this function, using the spontaneous activity of a random neural network (RNN), is associated with orbital instability. We propose a simple system that learns an arbitrary time series as the linear sum of stable trajectories produced by several small network modules. New finding in computer experiments is that the trajectories of the module outputs are orthogonal to each other. They created a dynamic orthogonal basis acquiring a high representational capacity, which enabled the system to learn the timing of extremely long intervals, such as tens of seconds for a millisecond computation unit, and also the complex time series of Lorenz attractors. This self-sustained system satisfies the stability and orthogonality requirements and thus provides a new neurocomputing framework and perspective for the neural mechanisms of motor learning.
Collapse
Affiliation(s)
- Yuji Kawai
- Symbiotic Intelligent Systems Research Center, Institute for Open and Transdisciplinary Research Initiatives, Osaka University, 1-1 Yamadaoka, Suita, Osaka 565-0871, Japan.
| | - Jihoon Park
- Symbiotic Intelligent Systems Research Center, Institute for Open and Transdisciplinary Research Initiatives, Osaka University, 1-1 Yamadaoka, Suita, Osaka 565-0871, Japan; Center for Information and Neural Networks, National Institute of Information and Communications Technology, 1-4 Yamadaoka, Suita, Osaka 565-0871, Japan
| | - Ichiro Tsuda
- Chubu University Academy of Emerging Sciences/Center for Mathematical Science and Artificial Intelligence, Chubu University, 1200 Matsumoto-cho, Kasugai, Aichi 487-8501, Japan
| | - Minoru Asada
- Symbiotic Intelligent Systems Research Center, Institute for Open and Transdisciplinary Research Initiatives, Osaka University, 1-1 Yamadaoka, Suita, Osaka 565-0871, Japan; Center for Information and Neural Networks, National Institute of Information and Communications Technology, 1-4 Yamadaoka, Suita, Osaka 565-0871, Japan; Chubu University Academy of Emerging Sciences/Center for Mathematical Science and Artificial Intelligence, Chubu University, 1200 Matsumoto-cho, Kasugai, Aichi 487-8501, Japan; International Professional University of Technology in Osaka, 3-3-1 Umeda, Kita-ku, Osaka 530-0001, Japan
| |
Collapse
|
2
|
Kage H. Implementing associative memories by Echo State Network for the applications of natural language processing. MACHINE LEARNING WITH APPLICATIONS 2023. [DOI: 10.1016/j.mlwa.2023.100449] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023] Open
|
3
|
Scleidorovich P, Weitzenfeld A, Fellous JM, Dominey PF. Integration of velocity-dependent spatio-temporal structure of place cell activation during navigation in a reservoir model of prefrontal cortex. BIOLOGICAL CYBERNETICS 2022; 116:585-610. [PMID: 36222887 DOI: 10.1007/s00422-022-00945-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/07/2022] [Accepted: 09/10/2022] [Indexed: 06/16/2023]
Abstract
Sequential behavior unfolds both in space and in time. The same spatial trajectory can be realized in different manners in the same overall time by changing instantaneous speeds. The current research investigates how speed profiles might be given behavioral significance and how cortical networks might encode this information. We first demonstrate that rats can associate different speed patterns on the same trajectory with distinct behavioral choices. In this novel experimental paradigm, rats follow a small baited robot in a large megaspace environment where the rat's speed is precisely controlled by the robot's speed. Based on this proof of concept and research showing that recurrent reservoir networks are ideal for representing spatio-temporal structures, we then test reservoir networks in simulated navigation contexts and demonstrate they can discriminate between traversals of the same path with identical durations but different speed profiles. We then test the networks in an embodied robotic setup, where we use place cell representations from physically navigating robots as input and again successfully discriminate between traversals. To demonstrate that this capability is inherent to recurrent networks, we compared the model against simple linear integrators. Interestingly, although the linear integrators could also perform the speed profile discrimination, a clear difference emerged when examining information coding in both models. Reservoir neurons displayed a form of statistical mixed selectivity as a complex interaction between spatial location and speed that was not as abundant in the linear integrators. This mixed selectivity is characteristic of cortex and reservoirs and allows us to generate specific predictions about the neural activity that will be recorded in rat cortex in future experiments.
Collapse
Affiliation(s)
- Pablo Scleidorovich
- Department of Computer Science and Engineering, University of South Florida, Tampa, USA
| | - Alfredo Weitzenfeld
- Department of Computer Science and Engineering, University of South Florida, Tampa, USA
| | - Jean-Marc Fellous
- Departments of Psychology and Biomedical Engineering, University of Arizona, Tucson, USA
| | - Peter Ford Dominey
- INSERM UMR1093-CAPS, Université Bourgogne Franche-Comté, UFR Des Sciences du Sport, 21000, Dijon, France.
- Robot Cognition Laboratory, Institute Marey, Dijon, France.
| |
Collapse
|
4
|
Roy M, Senapati A, Poria S, Mishra A, Hens C. Role of assortativity in predicting burst synchronization using echo state network. Phys Rev E 2022; 105:064205. [PMID: 35854538 DOI: 10.1103/physreve.105.064205] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2021] [Accepted: 05/11/2022] [Indexed: 06/15/2023]
Abstract
In this study, we use a reservoir computing based echo state network (ESN) to predict the collective burst synchronization of neurons. Specifically, we investigate the ability of ESN in predicting the burst synchronization of an ensemble of Rulkov neurons placed on a scale-free network. We have shown that a limited number of nodal dynamics used as input in the machine can capture the real trend of burst synchronization in this network. Further, we investigate the proper selection of nodal inputs of degree-degree (positive and negative) correlated networks. We show that for a disassortative network, selection of different input nodes based on degree has no significant role in the machine's prediction. However, in the case of assortative network, training the machine with the information (i.e., time series) of low degree nodes gives better results in predicting the burst synchronization. The results are found to be consistent with the investigation carried out with a continuous time Hindmarsh-Rose neuron model. Furthermore, the role of hyperparameters like spectral radius and leaking parameter of ESN on the prediction process has been examined. Finally, we explain the underlying mechanism responsible for observing these differences in the prediction in a degree correlated network.
Collapse
Affiliation(s)
- Mousumi Roy
- Department of Applied Mathematics, University of Calcutta, 92, A.P.C. Road, Kolkata 700009, India
| | - Abhishek Senapati
- Center for Advanced Systems Understanding (CASUS), 02826 Görlitz, Germany
| | - Swarup Poria
- Department of Applied Mathematics, University of Calcutta, 92, A.P.C. Road, Kolkata 700009, India
| | - Arindam Mishra
- Division of Dynamics, Lodz University of Technology, Stefanowskiego 1/15, 90924 Lodz, Poland
| | - Chittaranjan Hens
- Physics and Applied Mathematics Unit, Indian Statistical Institute, Kolkata 700108, India
| |
Collapse
|
5
|
Pedrelli L, Hinaut X. Hierarchical-Task Reservoir for Online Semantic Analysis From Continuous Speech. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2022; 33:2654-2663. [PMID: 34570710 DOI: 10.1109/tnnls.2021.3095140] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
In this article, we propose a novel architecture called hierarchical-task reservoir (HTR) suitable for real-time applications for which different levels of abstraction are available. We apply it to semantic role labeling (SRL) based on continuous speech recognition. Taking inspiration from the brain, this demonstrates the hierarchies of representations from perceptive to integrative areas, and we consider a hierarchy of four subtasks with increasing levels of abstraction (phone, word, part-of-speech (POS), and semantic role tags). These tasks are progressively learned by the layers of the HTR architecture. Interestingly, quantitative and qualitative results show that the hierarchical-task approach provides an advantage to improve the prediction. In particular, the qualitative results show that a shallow or a hierarchical reservoir, considered as baselines, does not produce estimations as good as the HTR model would. Moreover, we show that it is possible to further improve the accuracy of the model by designing skip connections and by considering word embedding (WE) in the internal representations. Overall, the HTR outperformed the other state-of-the-art reservoir-based approaches and it resulted in extremely efficient with respect to typical recurrent neural networks (RNNs) in deep learning (DL) [e.g., long short term memory (LSTMs)]. The HTR architecture is proposed as a step toward the modeling of online and hierarchical processes at work in the brain during language comprehension.
Collapse
|
6
|
D’Mello SK, Tay L, Southwell R. Psychological Measurement in the Information Age: Machine-Learned Computational Models. CURRENT DIRECTIONS IN PSYCHOLOGICAL SCIENCE 2022. [DOI: 10.1177/09637214211056906] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Psychological science can benefit from and contribute to emerging approaches from the computing and information sciences driven by the availability of real-world data and advances in sensing and computing. We focus on one such approach, machine-learned computational models (MLCMs)—computer programs learned from data, typically with human supervision. We introduce MLCMs and discuss how they contrast with traditional computational models and assessment in the psychological sciences. Examples of MLCMs from cognitive and affective science, neuroscience, education, organizational psychology, and personality and social psychology are provided. We consider the accuracy and generalizability of MLCM-based measures, cautioning researchers to consider the underlying context and intended use when interpreting their performance. We conclude that in addition to known data privacy and security concerns, the use of MLCMs entails a reconceptualization of fairness, bias, interpretability, and responsible use.
Collapse
Affiliation(s)
- Sidney K. D’Mello
- Institute of Cognitive Science, University of Colorado Boulder
- Department of Computer Science, University of Colorado Boulder
| | - Louis Tay
- Department of Psychological Sciences, Purdue University
| | - Rosy Southwell
- Institute of Cognitive Science, University of Colorado Boulder
| |
Collapse
|
7
|
Grammatical structure detection by Instinct Plasticity based Echo State Networks with Genetic Algorithm. Neurocomputing 2022. [DOI: 10.1016/j.neucom.2021.09.073] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
|
8
|
Pitti A, Quoy M, Lavandier C, Boucenna S, Swaileh W, Weidmann C. In Search of a Neural Model for Serial Order: a Brain Theory for Memory Development and Higher-Level Cognition. IEEE Trans Cogn Dev Syst 2022. [DOI: 10.1109/tcds.2022.3168046] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
|
9
|
Giorgi I, Golosio B, Esposito M, Cangelosi A, Masala GL. Modeling Multiple Language Learning in a Developmental Cognitive Architecture. IEEE Trans Cogn Dev Syst 2021. [DOI: 10.1109/tcds.2020.3033963] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
|
10
|
Dominey PF. Narrative event segmentation in the cortical reservoir. PLoS Comput Biol 2021; 17:e1008993. [PMID: 34618804 PMCID: PMC8525778 DOI: 10.1371/journal.pcbi.1008993] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2021] [Revised: 10/19/2021] [Accepted: 09/08/2021] [Indexed: 01/04/2023] Open
Abstract
Recent research has revealed that during continuous perception of movies or stories, humans display cortical activity patterns that reveal hierarchical segmentation of event structure. Thus, sensory areas like auditory cortex display high frequency segmentation related to the stimulus, while semantic areas like posterior middle cortex display a lower frequency segmentation related to transitions between events. These hierarchical levels of segmentation are associated with different time constants for processing. Likewise, when two groups of participants heard the same sentence in a narrative, preceded by different contexts, neural responses for the groups were initially different and then gradually aligned. The time constant for alignment followed the segmentation hierarchy: sensory cortices aligned most quickly, followed by mid-level regions, while some higher-order cortical regions took more than 10 seconds to align. These hierarchical segmentation phenomena can be considered in the context of processing related to comprehension. In a recently described model of discourse comprehension word meanings are modeled by a language model pre-trained on a billion word corpus. During discourse comprehension, word meanings are continuously integrated in a recurrent cortical network. The model demonstrates novel discourse and inference processing, in part because of two fundamental characteristics: real-world event semantics are represented in the word embeddings, and these are integrated in a reservoir network which has an inherent gradient of functional time constants due to the recurrent connections. Here we demonstrate how this model displays hierarchical narrative event segmentation properties beyond the embeddings alone, or their linear integration. The reservoir produces activation patterns that are segmented by a hidden Markov model (HMM) in a manner that is comparable to that of humans. Context construction displays a continuum of time constants across reservoir neuron subsets, while context forgetting has a fixed time constant across these subsets. Importantly, virtual areas formed by subgroups of reservoir neurons with faster time constants segmented with shorter events, while those with longer time constants preferred longer events. This neurocomputational recurrent neural network simulates narrative event processing as revealed by the fMRI event segmentation algorithm provides a novel explanation of the asymmetry in narrative forgetting and construction. The model extends the characterization of online integration processes in discourse to more extended narrative, and demonstrates how reservoir computing provides a useful model of cortical processing of narrative structure.
Collapse
Affiliation(s)
- Peter Ford Dominey
- INSERM UMR1093-CAPS, Université Bourgogne Franche-Comté, UFR des Sciences du Sport, Dijon
- Robot Cognition Laboratory, Institute Marey, Dijon
| |
Collapse
|
11
|
Ghosh S, Senapati A, Mishra A, Chattopadhyay J, Dana SK, Hens C, Ghosh D. Reservoir computing on epidemic spreading: A case study on COVID-19 cases. Phys Rev E 2021; 104:014308. [PMID: 34412296 DOI: 10.1103/physreve.104.014308] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2020] [Accepted: 06/23/2021] [Indexed: 12/19/2022]
Abstract
A reservoir computing based echo state network (ESN) is used here for the purpose of predicting the spread of a disease. The current infection trends of a disease in some targeted locations are efficiently captured by the ESN when it is fed with the infection data for other locations. The performance of the ESN is first tested with synthetic data generated by numerical simulations of independent uncoupled patches, each governed by the classical susceptible-infected-recovery model for a choice of distributed infection parameters. From a large pool of synthetic data, the ESN predicts the current trend of infection in 5% patches by exploiting the uncorrelated infection trend of 95% patches. The prediction remains consistent for most of the patches for approximately 4 to 5 weeks. The machine's performance is further tested with real data on the current COVID-19 pandemic collected for different countries. We show that our proposed scheme is able to predict the trend of the disease for up to 3 weeks for some targeted locations. An important point is that no detailed information on the epidemiological rate parameters is needed; the success of the machine rather depends on the history of the disease progress represented by the time-evolving data sets of a large number of locations. Finally, we apply a modified version of our proposed scheme for the purpose of future forecasting.
Collapse
Affiliation(s)
- Subrata Ghosh
- Physics and Applied Mathematics Unit, Indian Statistical Institute, 203 B. T. Road, Kolkata 700108, India
| | - Abhishek Senapati
- Agricultural and Ecological Research Unit, Indian Statistical Institute, 203 B. T. Road, Kolkata 700108, India.,Center for Advanced Systems Understanding (CASUS), Goerlitz, Germany
| | - Arindam Mishra
- Department of Mathematics, Jadavpur University, Kolkata 700032, India
| | - Joydev Chattopadhyay
- Agricultural and Ecological Research Unit, Indian Statistical Institute, 203 B. T. Road, Kolkata 700108, India
| | - Syamal K Dana
- Department of Mathematics, Jadavpur University, Kolkata 700032, India
| | - Chittaranjan Hens
- Physics and Applied Mathematics Unit, Indian Statistical Institute, 203 B. T. Road, Kolkata 700108, India
| | - Dibakar Ghosh
- Physics and Applied Mathematics Unit, Indian Statistical Institute, 203 B. T. Road, Kolkata 700108, India
| |
Collapse
|
12
|
Asano R, Boeckx C, Seifert U. Hierarchical control as a shared neurocognitive mechanism for language and music. Cognition 2021; 216:104847. [PMID: 34311153 DOI: 10.1016/j.cognition.2021.104847] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2020] [Revised: 05/14/2021] [Accepted: 07/11/2021] [Indexed: 12/16/2022]
Abstract
Although comparative research has made substantial progress in clarifying the relationship between language and music as neurocognitive systems from both a theoretical and empirical perspective, there is still no consensus about which mechanisms, if any, are shared and how they bring about different neurocognitive systems. In this paper, we tackle these two questions by focusing on hierarchical control as a neurocognitive mechanism underlying syntax in language and music. We put forward the Coordinated Hierarchical Control (CHC) hypothesis: linguistic and musical syntax rely on hierarchical control, but engage this shared mechanism differently depending on the current control demand. While linguistic syntax preferably engages the abstract rule-based control circuit, musical syntax rather employs the coordination of the abstract rule-based and the more concrete motor-based control circuits. We provide evidence for our hypothesis by reviewing neuroimaging as well as neuropsychological studies on linguistic and musical syntax. The CHC hypothesis makes a set of novel testable predictions to guide future work on the relationship between language and music.
Collapse
Affiliation(s)
- Rie Asano
- Systematic Musicology, Institute of Musicology, University of Cologne, Germany.
| | - Cedric Boeckx
- Section of General Linguistics, University of Barcelona, Spain; University of Barcelona Institute for Complex Systems (UBICS), Spain; Catalan Institute for Advanced Studies and Research (ICREA), Spain
| | - Uwe Seifert
- Systematic Musicology, Institute of Musicology, University of Cologne, Germany
| |
Collapse
|
13
|
Chen C, Lu Q, Beukers A, Baldassano C, Norman KA. Learning to perform role-filler binding with schematic knowledge. PeerJ 2021; 9:e11046. [PMID: 33850650 PMCID: PMC8019313 DOI: 10.7717/peerj.11046] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2019] [Accepted: 02/10/2021] [Indexed: 11/20/2022] Open
Abstract
Through specific experiences, humans learn the relationships that underlie the structure of events in the world. Schema theory suggests that we organize this information in mental frameworks called "schemata," which represent our knowledge of the structure of the world. Generalizing knowledge of structural relationships to new situations requires role-filler binding, the ability to associate specific "fillers" with abstract "roles." For instance, when we hear the sentence Alice ordered a tea from Bob, the role-filler bindings customer:Alice, drink:tea and barista:Bob allow us to understand and make inferences about the sentence. We can perform these bindings for arbitrary fillers-we understand this sentence even if we have never heard the names Alice, tea, or Bob before. In this work, we define a model as capable of performing role-filler binding if it can recall arbitrary fillers corresponding to a specified role, even when these pairings violate correlations seen during training. Previous work found that models can learn this ability when explicitly told what the roles and fillers are, or when given fillers seen during training. We show that networks with external memory learn to bind roles to arbitrary fillers, without explicitly labeled role-filler pairs. We further show that they can perform these bindings on role-filler pairs that violate correlations seen during training, while retaining knowledge of training correlations. We apply analyses inspired by neural decoding to interpret what the networks have learned.
Collapse
Affiliation(s)
- Catherine Chen
- Department of Computer Science, Princeton University, Princeton, NJ, USA
| | - Qihong Lu
- Department of Psychology, Princeton University, Princeton, NJ, USA
| | - Andre Beukers
- Department of Psychology, Princeton University, Princeton, NJ, USA
| | | | - Kenneth A Norman
- Department of Psychology, Princeton University, Princeton, NJ, USA
| |
Collapse
|
14
|
Pointeau G, Mirliaz S, Mealier AL, Dominey PF. Learning to Use Narrative Function Words for the Organization and Communication of Experience. Front Psychol 2021; 12:591703. [PMID: 33762991 PMCID: PMC7982915 DOI: 10.3389/fpsyg.2021.591703] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2020] [Accepted: 01/22/2021] [Indexed: 11/16/2022] Open
Abstract
How do people learn to talk about the causal and temporal relations between events, and the motivation behind why people do what they do? The narrative practice hypothesis of Hutto and Gallagher holds that children are exposed to narratives that provide training for understanding and expressing reasons for why people behave as they do. In this context, we have recently developed a model of narrative processing where a structured model of the developing situation (the situation model) is built up from experienced events, and enriched by sentences in a narrative that describe event meanings. The main interest is to develop a proof of concept for how narrative can be used to structure, organize and describe experience. Narrative sentences describe events, and they also define temporal and causal relations between events. These relations are specified by a class of narrative function words, including “because, before, after, first, finally.” The current research develops a proof of concept that by observing how people describe social events, a developmental robotic system can begin to acquire early knowledge of how to explain the reasons for events. We collect data from naïve subjects who use narrative function words to describe simple scenes of human-robot interaction, and then employ algorithms for extracting the statistical structure of how narrative function words link events in the situation model. By using these statistical regularities, the robot can thus learn from human experience about how to properly employ in question-answering dialogues with the human, and in generating canonical narratives for new experiences. The behavior of the system is demonstrated over several behavioral interactions, and associated narrative interaction sessions, while a more formal extended evaluation and user study will be the subject of future research. Clearly this is far removed from the power of the full blown narrative practice capability, but it provides a first step in the development of an experimental infrastructure for the study of socially situated narrative practice in human-robot interaction.
Collapse
Affiliation(s)
- Gregoire Pointeau
- INSERM UMR 1093-CAPS, Université Bourgogne Franche-Comté, UFR des Sciences du Sport, Dijon, France.,Robot Cognition Laboratory, Marey Institute, Dijon, France
| | - Solène Mirliaz
- INSERM UMR 1093-CAPS, Université Bourgogne Franche-Comté, UFR des Sciences du Sport, Dijon, France.,Robot Cognition Laboratory, Marey Institute, Dijon, France.,École Normale Supérieure de Rennes, Bruz, France
| | - Anne-Laure Mealier
- INSERM UMR 1093-CAPS, Université Bourgogne Franche-Comté, UFR des Sciences du Sport, Dijon, France.,Robot Cognition Laboratory, Marey Institute, Dijon, France
| | - Peter Ford Dominey
- INSERM UMR 1093-CAPS, Université Bourgogne Franche-Comté, UFR des Sciences du Sport, Dijon, France.,Robot Cognition Laboratory, Marey Institute, Dijon, France
| |
Collapse
|
15
|
Esposito M, Masala GL, Golosio B, Cangelosi A. Editorial: Language Representation and Learning in Cognitive and Artificial Intelligence Systems. Front Robot AI 2021; 7:69. [PMID: 33501236 PMCID: PMC7807395 DOI: 10.3389/frobt.2020.00069] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2020] [Accepted: 04/27/2020] [Indexed: 11/16/2022] Open
Affiliation(s)
- Massimo Esposito
- Institute for High Performance Computing and Networking of the National Research Council of Italy, Naples, Italy
| | | | | | | |
Collapse
|
16
|
Uchida T, Lair N, Ishiguro H, Dominey PF. A Model of Online Temporal-Spatial Integration for Immediacy and Overrule in Discourse Comprehension. NEUROBIOLOGY OF LANGUAGE (CAMBRIDGE, MASS.) 2021; 2:83-105. [PMID: 37213417 PMCID: PMC10174358 DOI: 10.1162/nol_a_00026] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/26/2020] [Accepted: 10/12/2020] [Indexed: 05/23/2023]
Abstract
During discourse comprehension, information from prior processing is integrated and appears to be immediately accessible. This was remarkably demonstrated by an N400 for "salted" and not "in love" in response to "The peanut was salted/in love." Discourse overrule was induced by prior discourse featuring the peanut as an animate agent. Immediate discourse overrule requires a model that integrates information at two timescales. One is over the lifetime and includes event knowledge and word semantics. The second is over the discourse in an event context. We propose a model where both are accounted for by temporal-to-spatial integration of experience into distributed spatial representations, providing immediate access to experience accumulated over different timescales. For lexical semantics, this is modeled by a word embedding system trained by sequential exposure to the entire Wikipedia corpus. For discourse, this is modeled by a recurrent reservoir network trained to generate a discourse vector for input sequences of words. The N400 is modeled as the difference between the instantaneous discourse vector and the target word. We predict this model can account for semantic immediacy and discourse overrule. The model simulates lexical priming and discourse overrule in the "Peanut in love" discourse, and it demonstrates that an unexpected word elicits reduced N400 if it is generally related to the event described in prior discourse, and that this effect disappears when the discourse context is removed. This neurocomputational model is the first to simulate immediacy and overrule in discourse-modulated N400, and contributes to characterization of online integration processes in discourse.
Collapse
Affiliation(s)
- Takahisa Uchida
- Ishiguro Lab, Graduate School of Engineering Science, Osaka University, Osaka, Japan
| | - Nicolas Lair
- INSERM UMR1093-CAPS, Université Bourgogne Franche-Comté, UFR des Sciences du Sport, Dijon, France
- Robot Cognition Laboratory, Marey Institute, Dijon, France
| | - Hiroshi Ishiguro
- Ishiguro Lab, Graduate School of Engineering Science, Osaka University, Osaka, Japan
| | | |
Collapse
|
17
|
Gärdenfors P. Primary Cognitive Categories Are Determined by Their Invariances. Front Psychol 2020; 11:584017. [PMID: 33363496 PMCID: PMC7753358 DOI: 10.3389/fpsyg.2020.584017] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2020] [Accepted: 11/13/2020] [Indexed: 11/18/2022] Open
Abstract
The world as we perceive it is structured into objects, actions and places that form parts of events. In this article, my aim is to explain why these categories are cognitively primary. From an empiricist and evolutionary standpoint, it is argued that the reduction of the complexity of sensory signals is based on the brain's capacity to identify various types of invariances that are evolutionarily relevant for the activities of the organism. The first aim of the article is to explain why places, object and actions are primary cognitive categories in our constructions of the external world. It is shown that the invariances that determine these categories have their separate characteristics and that they are, by and large, independent of each other. This separation is supported by what is known about the neural mechanisms. The second aim is to show that the category of events can be analyzed as being constituted of the primary categories. The category of numbers is briefly discussed. Some implications for computational models of the categories are also presented.
Collapse
Affiliation(s)
- Peter Gärdenfors
- Cognitive Science, Department of Philosophy, Lund University, Lund, Sweden.,Faculty of Humanities, Palaeo-Research Institute, University of Johannesburg, Johannesburg, South Africa
| |
Collapse
|
18
|
Jirak D, Tietz S, Ali H, Wermter S. Echo State Networks and Long Short-Term Memory for Continuous Gesture Recognition: a Comparative Study. Cognit Comput 2020. [DOI: 10.1007/s12559-020-09754-0] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
AbstractRecent developments of sensors that allow tracking of human movements and gestures enable rapid progress of applications in domains like medical rehabilitation or robotic control. Especially the inertial measurement unit (IMU) is an excellent device for real-time scenarios as it rapidly delivers data input. Therefore, a computational model must be able to learn gesture sequences in a fast yet robust way. We recently introduced an echo state network (ESN) framework for continuous gesture recognition (Tietz et al., 2019) including novel approaches for gesture spotting, i.e., the automatic detection of the start and end phase of a gesture. Although our results showed good classification performance, we identified significant factors which also negatively impact the performance like subgestures and gesture variability. To address these issues, we include experiments with Long Short-Term Memory (LSTM) networks, which is a state-of-the-art model for sequence processing, to compare the obtained results with our framework and to evaluate their robustness regarding pitfalls in the recognition process. In this study, we analyze the two conceptually different approaches processing continuous, variable-length gesture sequences, which shows interesting results comparing the distinct gesture accomplishments. In addition, our results demonstrate that our ESN framework achieves comparably good performance as the LSTM network but has significantly lower training times. We conclude from the present work that ESNs are viable models for continuous gesture recognition delivering reasonable performance for applications requiring real-time performance as in robotic or rehabilitation tasks. From our discussion of this comparative study, we suggest prospective improvements on both the experimental and network architecture level.
Collapse
|
19
|
Simov K, Koprinkova-Hristova P, Popov A, Osenova P. A Reservoir Computing Approach to Word Sense Disambiguation. Cognit Comput 2020. [DOI: 10.1007/s12559-020-09758-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
20
|
Hinaut X, Twiefel J. Teach Your Robot Your Language! Trainable Neural Parser for Modeling Human Sentence Processing: Examples for 15 Languages. IEEE Trans Cogn Dev Syst 2020. [DOI: 10.1109/tcds.2019.2957006] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
21
|
Cazin N, Scleidorovich P, Weitzenfeld A, Dominey PF. Real-time sensory-motor integration of hippocampal place cell replay and prefrontal sequence learning in simulated and physical rat robots for novel path optimization. BIOLOGICAL CYBERNETICS 2020; 114:249-268. [PMID: 32095878 DOI: 10.1007/s00422-020-00820-2] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/03/2019] [Accepted: 02/04/2020] [Indexed: 06/10/2023]
Abstract
An open problem in the cognitive dimensions of navigation concerns how previous exploratory experience is reorganized in order to allow the creation of novel efficient navigation trajectories. This behavior is revealed in the "traveling salesrat problem" (TSP) when rats discover the shortest path linking baited food wells after a few exploratory traversals. We have recently published a model of navigation sequence learning, where sharp wave ripple replay of hippocampal place cells transmit "snippets" of the recent trajectories that the animal has explored to the prefrontal cortex (PFC) (Cazin et al. in PLoS Comput Biol 15:e1006624, 2019). PFC is modeled as a recurrent reservoir network that is able to assemble these snippets into the efficient sequence (trajectory of spatial locations coded by place cell activation). The model of hippocampal replay generates a distribution of snippets as a function of their proximity to a reward, thus implementing a form of spatial credit assignment that solves the TSP task. The integrative PFC reservoir reconstructs the efficient TSP sequence based on exposure to this distribution of snippets that favors paths that are most proximal to rewards. While this demonstrates the theoretical feasibility of the PFC-HIPP interaction, the integration of such a dynamic system into a real-time sensory-motor system remains a challenge. In the current research, we test the hypothesis that the PFC reservoir model can operate in a real-time sensory-motor loop. Thus, the main goal of the paper is to validate the model in simulated and real robot scenarios. Place cell activation encoding the current position of the simulated and physical rat robot feeds the PFC reservoir which generates the successor place cell activation that represents the next step in the reproduced sequence in the readout. This is input to the robot, which advances to the coded location and then generates de novo the current place cell activation. This allows demonstration of the crucial role of embodiment. If the spatial code readout from PFC is played back directly into PFC, error can accumulate, and the system can diverge from desired trajectories. This required a spatial filter to decode the PFC code to a location and then recode a new place cell code for that location. In the robot, the place cell vector output of PFC is used to physically displace the robot and then generate a new place cell coded input to the PFC, replacing part of the software recoding procedure that was required otherwise. We demonstrate how this integrated sensory-motor system can learn simple navigation sequences and then, importantly, how it can synthesize novel efficient sequences based on prior experience, as previously demonstrated (Cazin et al. 2019). This contributes to the understanding of hippocampal replay in novel navigation sequence formation and the important role of embodiment.
Collapse
Affiliation(s)
- Nicolas Cazin
- INSERM UMR1093-CAPS, Université Bourgogne Franche-Comté, UFR des Sciences du Sport, 21000, Dijon, France
- Robot Cognition Laboratory, Institut Marey, INSERM U1093 CAPS, UBFC, Dijon, France
| | | | | | - Peter Ford Dominey
- INSERM UMR1093-CAPS, Université Bourgogne Franche-Comté, UFR des Sciences du Sport, 21000, Dijon, France.
- Robot Cognition Laboratory, Institut Marey, INSERM U1093 CAPS, UBFC, Dijon, France.
| |
Collapse
|
22
|
|
23
|
Abstract
Gated working memory is defined as the capacity of holding arbitrary information at any time in order to be used at a later time. Based on electrophysiological recordings, several computational models have tackled the problem using dedicated and explicit mechanisms. We propose instead to consider an implicit mechanism based on a random recurrent neural network. We introduce a robust yet simple reservoir model of gated working memory with instantaneous updates. The model is able to store an arbitrary real value at random time over an extended period of time. The dynamics of the model is a line attractor that learns to exploit reentry and a nonlinearity during the training phase using only a few representative values. A deeper study of the model shows that there is actually a large range of hyperparameters for which the results hold (e.g., number of neurons, sparsity, global weight scaling) such that any large enough population, mixing excitatory and inhibitory neurons, can quickly learn to realize such gated working memory. In a nutshell, with a minimal set of hypotheses, we show that we can have a robust model of working memory. This suggests this property could be an implicit property of any random population, that can be acquired through learning. Furthermore, considering working memory to be a physically open but functionally closed system, we give account on some counterintuitive electrophysiological recordings.
Collapse
Affiliation(s)
- Anthony Strock
- Inria Bordeaux Sud-Ouest, 33405 Talence Cedex, France; LaBRI, Université de Bordeaux, Institut Polytechnique de Bordeaux, Centre National de la Recherche Scientifique, 33405 Talence Cedex, France; and Institut des Maladies Neurodégénératives, Université de Bordeaux, Centre National de la Recherche Scientifique, 33076 Cedex, Bordeaux, France
| | - Xavier Hinaut
- Inria Bordeaux Sud-Ouest, 33405 Talence Cedex, France; LaBRI, Université de Bordeaux, Institut Polytechnique de Bordeaux, Centre National de la Recherche Scientifique, 33405 Talence Cedex, France; and Institut des Maladies Neurodégénératives, Université de Bordeaux, Centre National de la Recherche Scientifique, 33076 Cedex, Bordeaux, France
| | - Nicolas P Rougier
- Inria Bordeaux Sud-Ouest, 33405 Talence Cedex, France; LaBRI, Université de Bordeaux, Institut Polytechnique de Bordeaux, Centre National de la Recherche Scientifique, 33405 Talence Cedex, France; and Institut des Maladies Neurodégénératives, Université de Bordeaux, Centre National de la Recherche Scientifique, 33076 Cedex, Bordeaux, France
| |
Collapse
|
24
|
Pitti A, Quoy M, Lavandier C, Boucenna S. Gated spiking neural network using Iterative Free-Energy Optimization and rank-order coding for structure learning in memory sequences (INFERNO GATE). Neural Netw 2019; 121:242-258. [PMID: 31581065 DOI: 10.1016/j.neunet.2019.09.023] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2019] [Revised: 09/16/2019] [Accepted: 09/17/2019] [Indexed: 11/16/2022]
Abstract
We present a framework based on iterative free-energy optimization with spiking neural networks for modeling the fronto-striatal system (PFC-BG) for the generation and recall of audio memory sequences. In line with neuroimaging studies carried out in the PFC, we propose a genuine coding strategy using the gain-modulation mechanism to represent abstract sequences based solely on the rank and location of items within them. Based on this mechanism, we show that we can construct a repertoire of neurons sensitive to the temporal structure in sequences from which we can represent any novel sequences. Free-energy optimization is then used to explore and to retrieve the missing indices of the items in the correct order for executive control and compositionality. We show that the gain-modulation mechanism permits the network to be robust to variabilities and to have long-term dependencies as it implements a gated recurrent neural network. This model, called Inferno Gate, is an extension of the neural architecture Inferno standing for Iterative Free-Energy Optimization of Recurrent Neural Networks with Gating or Gain-modulation. In experiments performed with an audio database of ten thousand MFCC vectors, Inferno Gate is capable of encoding efficiently and retrieving chunks of fifty items length. We then discuss the potential of our network to model the features of working memory in the PFC-BG loop for structural learning, goal-direction and hierarchical reinforcement learning.
Collapse
Affiliation(s)
- Alexandre Pitti
- Laboratoire ETIS UMR 8051, Université Paris-Seine, Université de Cergy-Pontoise, ENSEA, CNRS, France.
| | - Mathias Quoy
- Laboratoire ETIS UMR 8051, Université Paris-Seine, Université de Cergy-Pontoise, ENSEA, CNRS, France.
| | - Catherine Lavandier
- Laboratoire ETIS UMR 8051, Université Paris-Seine, Université de Cergy-Pontoise, ENSEA, CNRS, France.
| | - Sofiane Boucenna
- Laboratoire ETIS UMR 8051, Université Paris-Seine, Université de Cergy-Pontoise, ENSEA, CNRS, France.
| |
Collapse
|
25
|
Language is not a gadget. Behav Brain Sci 2019; 42:e175. [PMID: 31511101 DOI: 10.1017/s0140525x19001092] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
Heyes does well to argue that some of the apparently innate human capabilities for cultural learning can be considered in terms of more general-purpose mechanisms. In the application of this to language, she overlooks some of its most interesting properties. I review three, and then illustrate how mindreading can come from general-purpose mechanism via language.
Collapse
|
26
|
Seoane LF. Evolutionary aspects of reservoir computing. Philos Trans R Soc Lond B Biol Sci 2019; 374:20180377. [PMID: 31006369 PMCID: PMC6553587 DOI: 10.1098/rstb.2018.0377] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/22/2018] [Indexed: 01/31/2023] Open
Abstract
Reservoir computing (RC) is a powerful computational paradigm that allows high versatility with cheap learning. While other artificial intelligence approaches need exhaustive resources to specify their inner workings, RC is based on a reservoir with highly nonlinear dynamics that does not require a fine tuning of its parts. These dynamics project input signals into high-dimensional spaces, where training linear readouts to extract input features is vastly simplified. Thus, inexpensive learning provides very powerful tools for decision-making, controlling dynamical systems, classification, etc. RC also facilitates solving multiple tasks in parallel, resulting in a high throughput. Existing literature focuses on applications in artificial intelligence and neuroscience. We review this literature from an evolutionary perspective. RC's versatility makes it a great candidate to solve outstanding problems in biology, which raises relevant questions. Is RC as abundant in nature as its advantages should imply? Has it evolved? Once evolved, can it be easily sustained? Under what circumstances? (In other words, is RC an evolutionarily stable computing paradigm?) To tackle these issues, we introduce a conceptual morphospace that would map computational selective pressures that could select for or against RC and other computing paradigms. This guides a speculative discussion about the questions above and allows us to propose a solid research line that brings together computation and evolution with RC as test model of the proposed hypotheses. This article is part of the theme issue 'Liquid brains, solid brains: How distributed cognitive architectures process information'.
Collapse
Affiliation(s)
- Luís F. Seoane
- ICREA-Complex Systems Lab, Universitat Pompeu Fabra, Barcelona 08003, Spain
- Institut de Biologia Evolutiva (CSIC-UPF), Barcelona 08003, Spain
| |
Collapse
|
27
|
Kawai Y, Park J, Asada M. A small-world topology enhances the echo state property and signal propagation in reservoir computing. Neural Netw 2019; 112:15-23. [DOI: 10.1016/j.neunet.2019.01.002] [Citation(s) in RCA: 58] [Impact Index Per Article: 11.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2018] [Revised: 10/19/2018] [Accepted: 01/07/2019] [Indexed: 01/22/2023]
|
28
|
Tanaka G, Yamane T, Héroux JB, Nakane R, Kanazawa N, Takeda S, Numata H, Nakano D, Hirose A. Recent advances in physical reservoir computing: A review. Neural Netw 2019; 115:100-123. [PMID: 30981085 DOI: 10.1016/j.neunet.2019.03.005] [Citation(s) in RCA: 301] [Impact Index Per Article: 60.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2018] [Revised: 02/24/2019] [Accepted: 03/07/2019] [Indexed: 02/06/2023]
Abstract
Reservoir computing is a computational framework suited for temporal/sequential data processing. It is derived from several recurrent neural network models, including echo state networks and liquid state machines. A reservoir computing system consists of a reservoir for mapping inputs into a high-dimensional space and a readout for pattern analysis from the high-dimensional states in the reservoir. The reservoir is fixed and only the readout is trained with a simple method such as linear regression and classification. Thus, the major advantage of reservoir computing compared to other recurrent neural networks is fast learning, resulting in low training cost. Another advantage is that the reservoir without adaptive updating is amenable to hardware implementation using a variety of physical systems, substrates, and devices. In fact, such physical reservoir computing has attracted increasing attention in diverse fields of research. The purpose of this review is to provide an overview of recent advances in physical reservoir computing by classifying them according to the type of the reservoir. We discuss the current issues and perspectives related to physical reservoir computing, in order to further expand its practical applications and develop next-generation machine learning systems.
Collapse
Affiliation(s)
- Gouhei Tanaka
- Institute for Innovation in International Engineering Education, Graduate School of Engineering, The University of Tokyo, Tokyo 113-8656, Japan; Department of Electrical Engineering and Information Systems, Graduate School of Engineering, The University of Tokyo, Tokyo 113-8656, Japan.
| | | | | | - Ryosho Nakane
- Institute for Innovation in International Engineering Education, Graduate School of Engineering, The University of Tokyo, Tokyo 113-8656, Japan; Department of Electrical Engineering and Information Systems, Graduate School of Engineering, The University of Tokyo, Tokyo 113-8656, Japan
| | | | | | | | | | - Akira Hirose
- Institute for Innovation in International Engineering Education, Graduate School of Engineering, The University of Tokyo, Tokyo 113-8656, Japan; Department of Electrical Engineering and Information Systems, Graduate School of Engineering, The University of Tokyo, Tokyo 113-8656, Japan
| |
Collapse
|
29
|
Giavazzi M, Daland R, Palminteri S, Peperkamp S, Brugières P, Jacquemot C, Schramm C, Cleret de Langavant L, Bachoud-Lévi AC. The role of the striatum in linguistic selection: Evidence from Huntington's disease and computational modeling. Cortex 2018; 109:189-204. [DOI: 10.1016/j.cortex.2018.08.031] [Citation(s) in RCA: 25] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2017] [Revised: 07/04/2018] [Accepted: 08/05/2018] [Indexed: 11/29/2022]
|
30
|
Moulin-Frier C, Fischer T, Petit M, Pointeau G, Puigbo JY, Pattacini U, Low SC, Camilleri D, Nguyen P, Hoffmann M, Chang HJ, Zambelli M, Mealier AL, Damianou A, Metta G, Prescott TJ, Demiris Y, Dominey PF, Verschure PFMJ. DAC-h3: A Proactive Robot Cognitive Architecture to Acquire and Express Knowledge About the World and the Self. IEEE Trans Cogn Dev Syst 2018. [DOI: 10.1109/tcds.2017.2754143] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
31
|
Sensorimotor input as a language generalisation tool: a neurorobotics model for generation and generalisation of noun-verb combinations with sensorimotor inputs. Auton Robots 2018. [DOI: 10.1007/s10514-018-9793-7] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
|
32
|
Söderström P, Horne M, Mannfolk P, van Westen D, Roll M. Tone-grammar association within words: Concurrent ERP and fMRI show rapid neural pre-activation and involvement of left inferior frontal gyrus in pseudoword processing. BRAIN AND LANGUAGE 2017; 174:119-126. [PMID: 28850882 DOI: 10.1016/j.bandl.2017.08.004] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/12/2017] [Revised: 07/26/2017] [Accepted: 08/16/2017] [Indexed: 06/07/2023]
Abstract
Using a concurrent ERP/fMRI paradigm, we investigated how listeners take advantage of morphologically relevant tonal information at the beginning of words to predict and pre-activate likely word endings. More predictive, low tone word stems gave rise to a 'pre-activation negativity' (PrAN) in the ERPs, a brain potential which has previously been found to increase along with the degree of predictive certainty as regards how a word is going to end. It is suggested that more predictive, low tone stems lead to rapid access to word endings with processing subserved by the left primary auditory cortex as well as the supramarginal gyrus, while high tone stems - which are less predictive - decrease predictive certainty, leading to increased competition between activated word endings, which needs to be resolved by the left inferior frontal gyrus.
Collapse
Affiliation(s)
- Pelle Söderström
- Department of Linguistics, Centre for Languages and Literature, Lund University, Box 201, 221 00 Lund, Sweden.
| | - Merle Horne
- Department of Linguistics, Centre for Languages and Literature, Lund University, Box 201, 221 00 Lund, Sweden.
| | - Peter Mannfolk
- Skane University Hospital, Department of Medical Imaging and Physiology, Lund, Sweden.
| | - Danielle van Westen
- Lund University, Skane University Hospital, Department of Clinical Sciences Lund, Diagnostic Radiology, Lund, Sweden.
| | - Mikael Roll
- Department of Linguistics, Centre for Languages and Literature, Lund University, Box 201, 221 00 Lund, Sweden.
| |
Collapse
|
33
|
Mealier AL, Pointeau G, Mirliaz S, Ogawa K, Finlayson M, Dominey PF. Narrative Constructions for the Organization of Self Experience: Proof of Concept via Embodied Robotics. Front Psychol 2017; 8:1331. [PMID: 28861011 PMCID: PMC5559541 DOI: 10.3389/fpsyg.2017.01331] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2016] [Accepted: 07/20/2017] [Indexed: 11/13/2022] Open
Abstract
It has been proposed that starting from meaning that the child derives directly from shared experience with others, adult narrative enriches this meaning and its structure, providing causal links between unseen intentional states and actions. This would require a means for representing meaning from experience-a situation model-and a mechanism that allows information to be extracted from sentences and mapped onto the situation model that has been derived from experience, thus enriching that representation. We present a hypothesis and theory concerning how the language processing infrastructure for grammatical constructions can naturally be extended to narrative constructions to provide a mechanism for using language to enrich meaning derived from physical experience. Toward this aim, the grammatical construction models are augmented with additional structures for representing relations between events across sentences. Simulation results demonstrate proof of concept for how the narrative construction model supports multiple successive levels of meaning creation which allows the system to learn about the intentionality of mental states, and argument substitution which allows extensions to metaphorical language and analogical problem solving. Cross-linguistic validity of the system is demonstrated in Japanese. The narrative construction model is then integrated into the cognitive system of a humanoid robot that provides the memory systems and world-interaction required for representing meaning in a situation model. In this context proof of concept is demonstrated for how the system enriches meaning in the situation model that has been directly derived from experience. In terms of links to empirical data, the model predicts strong usage based effects: that is, that the narrative constructions used by children will be highly correlated with those that they experience. It also relies on the notion of narrative or discourse function words. Both of these are validated in the experimental literature.
Collapse
Affiliation(s)
- Anne-Laure Mealier
- Human and Robot Cognitive Systems, Stem Cell and Brain Research Institute U1208, Institut National de la Santé et de la Recherche MédicaleLyon, France
| | - Gregoire Pointeau
- Human and Robot Cognitive Systems, Stem Cell and Brain Research Institute U1208, Institut National de la Santé et de la Recherche MédicaleLyon, France
| | - Solène Mirliaz
- Human and Robot Cognitive Systems, Stem Cell and Brain Research Institute U1208, Institut National de la Santé et de la Recherche MédicaleLyon, France
- Computer Science Department, Ecole Normale Supérieure de RennesRennes, France
| | - Kenji Ogawa
- Graduate School of Letters, Hokkaido UniversitySapporo, Japan
| | - Mark Finlayson
- School of Computing and Information Sciences, Florida International UniversityMiami, FL, United States
| | - Peter F. Dominey
- Human and Robot Cognitive Systems, Stem Cell and Brain Research Institute U1208, Institut National de la Santé et de la Recherche MédicaleLyon, France
| |
Collapse
|
34
|
Pointeau G, Dominey PF. The Role of Autobiographical Memory in the Development of a Robot Self. Front Neurorobot 2017; 11:27. [PMID: 28676751 PMCID: PMC5476692 DOI: 10.3389/fnbot.2017.00027] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2016] [Accepted: 05/22/2017] [Indexed: 11/13/2022] Open
Abstract
This article briefly reviews research in cognitive development concerning the nature of the human self. It then reviews research in developmental robotics that has attempted to retrace parts of the developmental trajectory of the self. This should be of interest to developmental psychologists, and researchers in developmental robotics. As a point of departure, one of the most characteristic aspects of human social interaction is cooperation-the process of entering into a joint enterprise to achieve a common goal. Fundamental to this ability to cooperate is the underlying ability to enter into, and engage in, a self-other relation. This suggests that if we intend for robots to cooperate with humans, then to some extent robots must engage in these self-other relations, and hence they must have some aspect of a self. Decades of research in human cognitive development indicate that the self is not fully present from the outset, but rather that it is developed in a usage-based fashion, that is, through engaging with the world, including the physical world and the social world of animate intentional agents. In an effort to characterize the self, Ulric Neisser noted that self is not unitary, and he thus proposed five types of self-knowledge that correspond to five distinct components of self: ecological, interpersonal, conceptual, temporally extended, and private. He emphasized the ecological nature of each of these levels, how they are developed through the engagement of the developing child with the physical and interpersonal worlds. Crucially, development of the self has been shown to rely on the child's autobiographical memory. From the developmental robotics perspective, this suggests that in principal it would be possible to develop certain aspects of self in a robot cognitive system where the robot is engaged in the physical and social world, equipped with an autobiographical memory system. We review a series of developmental robotics studies that make progress in this enterprise. We conclude with a summary of the properties that are required for the development of these different levels of self, and we identify topics for future research.
Collapse
Affiliation(s)
- Gregoire Pointeau
- Institut National de la Santé et de la Recherche Médicale, Stem Cell and Brain Research Institute U1208, Univ Lyon, Université Claude Bernard Lyon 1Lyon, France.,Robot Cognition Laboratory, Centre National de la Recherche ScientifiqueLyon, France
| | - Peter Ford Dominey
- Institut National de la Santé et de la Recherche Médicale, Stem Cell and Brain Research Institute U1208, Univ Lyon, Université Claude Bernard Lyon 1Lyon, France.,Robot Cognition Laboratory, Centre National de la Recherche ScientifiqueLyon, France
| |
Collapse
|
35
|
Pitti A, Gaussier P, Quoy M. Iterative free-energy optimization for recurrent neural networks (INFERNO). PLoS One 2017; 12:e0173684. [PMID: 28282439 PMCID: PMC5345841 DOI: 10.1371/journal.pone.0173684] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2016] [Accepted: 02/24/2017] [Indexed: 11/19/2022] Open
Abstract
The intra-parietal lobe coupled with the Basal Ganglia forms a working memory that demonstrates strong planning capabilities for generating robust yet flexible neuronal sequences. Neurocomputational models however, often fails to control long range neural synchrony in recurrent spiking networks due to spontaneous activity. As a novel framework based on the free-energy principle, we propose to see the problem of spikes' synchrony as an optimization problem of the neurons sub-threshold activity for the generation of long neuronal chains. Using a stochastic gradient descent, a reinforcement signal (presumably dopaminergic) evaluates the quality of one input vector to move the recurrent neural network to a desired activity; depending on the error made, this input vector is strengthened to hill-climb the gradient or elicited to search for another solution. This vector can be learned then by one associative memory as a model of the basal-ganglia to control the recurrent neural network. Experiments on habit learning and on sequence retrieving demonstrate the capabilities of the dual system to generate very long and precise spatio-temporal sequences, above two hundred iterations. Its features are applied then to the sequential planning of arm movements. In line with neurobiological theories, we discuss its relevance for modeling the cortico-basal working memory to initiate flexible goal-directed neuronal chains of causation and its relation to novel architectures such as Deep Networks, Neural Turing Machines and the Free-Energy Principle.
Collapse
Affiliation(s)
- Alexandre Pitti
- ETIS Laboratory, CNRS UMR 8051, University of Cergy-Pontoise, ENSEA, Paris-Seine, Cergy-Pontoise, France
| | - Philippe Gaussier
- ETIS Laboratory, CNRS UMR 8051, University of Cergy-Pontoise, ENSEA, Paris-Seine, Cergy-Pontoise, France
| | - Mathias Quoy
- ETIS Laboratory, CNRS UMR 8051, University of Cergy-Pontoise, ENSEA, Paris-Seine, Cergy-Pontoise, France
| |
Collapse
|
36
|
Szalisznyó K, Silverstein D, Teichmann M, Duffau H, Smits A. Cortico-striatal language pathways dynamically adjust for syntactic complexity: A computational study. BRAIN AND LANGUAGE 2017; 164:53-62. [PMID: 27792887 DOI: 10.1016/j.bandl.2016.08.005] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/17/2016] [Revised: 08/05/2016] [Accepted: 08/14/2016] [Indexed: 06/06/2023]
Abstract
A growing body of literature supports a key role of fronto-striatal circuits in language perception. It is now known that the striatum plays a role in engaging attentional resources and linguistic rule computation while also serving phonological short-term memory capabilities. The ventral semantic and the dorsal phonological stream dichotomy assumed for spoken language processing also seems to play a role in cortico-striatal perception. Based on recent studies that correlate deep Broca-striatal pathways with complex syntax performance, we used a previously developed computational model of frontal-striatal syntax circuits and hypothesized that different parallel language pathways may contribute to canonical and non-canonical sentence comprehension separately. We modified and further analyzed a thematic role assignment task and corresponding reservoir computing model of language circuits, as previously developed by Dominey and coworkers. We examined the models performance under various parameter regimes, by influencing how fast the presented language input decays and altering the temporal dynamics of activated word representations. This enabled us to quantify canonical and non-canonical sentence comprehension abilities. The modeling results suggest that separate cortico-cortical and cortico-striatal circuits may be recruited differently for processing syntactically more difficult and less complicated sentences. Alternatively, a single circuit would need to dynamically and adaptively adjust to syntactic complexity.
Collapse
Affiliation(s)
- Krisztina Szalisznyó
- Department of Neuroscience, Psychiatry, University Hospital, Uppsala University, 751 85 Uppsala, Sweden; Computational Neuroscience Group, Wigner Research Institute, Hungarian Academy of Sciences, P.O. Box 49, Budapest, Hungary.
| | - David Silverstein
- Department of Computational Science and Technology, KTH Royal Institute of Technology; Stockholm Brain Institute, Karolinska Institutet, Stockholm, Sweden
| | - Marc Teichmann
- Department of Neurology, Institut de la mémoire et de la maladie d'Alzheimer, Centre de Référence Démences Rares, Hopital de la Pitié-Salpetriére, AP-HP, Paris, France; Institut du Cerveau et de la Moelle Epiniére (ICM), ICM-INSERM 1127, FrontLab, Paris, France
| | - Hugues Duffau
- Department of Neurosurgery, Gui de Chauliac Hospital, University of Montpellier, Institute for Neurosciences of Montpellier, Montpellier, France
| | - Anja Smits
- Department of Neuroscience, Neurology, Uppsala University, Uppsala, Sweden; Department of Clinical Neurosciences and Rehabilitation, Sahlgrenska Academy, Gothenburg, Institute of Neurosciences and Physiology, University of Gothenburg, Gothenburg, Sweden
| |
Collapse
|
37
|
Carmantini GS, Beim Graben P, Desroches M, Rodrigues S. A modular architecture for transparent computation in recurrent neural networks. Neural Netw 2016; 85:85-105. [PMID: 27814468 DOI: 10.1016/j.neunet.2016.09.001] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2015] [Revised: 08/30/2016] [Accepted: 09/05/2016] [Indexed: 10/20/2022]
Abstract
Computation is classically studied in terms of automata, formal languages and algorithms; yet, the relation between neural dynamics and symbolic representations and operations is still unclear in traditional eliminative connectionism. Therefore, we suggest a unique perspective on this central issue, to which we would like to refer as transparent connectionism, by proposing accounts of how symbolic computation can be implemented in neural substrates. In this study we first introduce a new model of dynamics on a symbolic space, the versatile shift, showing that it supports the real-time simulation of a range of automata. We then show that the Gödelization of versatile shifts defines nonlinear dynamical automata, dynamical systems evolving on a vectorial space. Finally, we present a mapping between nonlinear dynamical automata and recurrent artificial neural networks. The mapping defines an architecture characterized by its granular modularity, where data, symbolic operations and their control are not only distinguishable in activation space, but also spatially localizable in the network itself, while maintaining a distributed encoding of symbolic representations. The resulting networks simulate automata in real-time and are programmed directly, in the absence of network training. To discuss the unique characteristics of the architecture and their consequences, we present two examples: (i) the design of a Central Pattern Generator from a finite-state locomotive controller, and (ii) the creation of a network simulating a system of interactive automata that supports the parsing of garden-path sentences as investigated in psycholinguistics experiments.
Collapse
Affiliation(s)
| | - Peter Beim Graben
- Bernstein Center for Computational Neuroscience Berlin, Humboldt-Universität zu Berlin, Berlin, Germany.
| | | | - Serafim Rodrigues
- School of Computing and Mathematics, Plymouth University, Plymouth, United Kingdom.
| |
Collapse
|
38
|
Stock AK, Steenbergen L, Colzato L, Beste C. The system neurophysiological basis of non-adaptive cognitive control: Inhibition of implicit learning mediated by right prefrontal regions. Hum Brain Mapp 2016; 37:4511-4522. [PMID: 27477001 DOI: 10.1002/hbm.23325] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2016] [Revised: 07/13/2016] [Accepted: 07/18/2016] [Indexed: 12/24/2022] Open
Abstract
Cognitive control is adaptive in the sense that it inhibits automatic processes to optimize goal-directed behavior, but high levels of control may also have detrimental effects in case they suppress beneficial automatisms. Until now, the system neurophysiological mechanisms and functional neuroanatomy underlying these adverse effects of cognitive control have remained elusive. This question was examined by analyzing the automatic exploitation of a beneficial implicit predictive feature under conditions of high versus low cognitive control demands, combining event-related potentials (ERPs) and source localization. It was found that cognitive control prohibits the beneficial automatic exploitation of additional implicit information when task demands are high. Bottom-up perceptual and attentional selection processes (P1 and N1 ERPs) are not modulated by this, but the automatic exploitation of beneficial predictive information in case of low cognitive control demands was associated with larger response-locked P3 amplitudes and stronger activation of the right inferior frontal gyrus (rIFG, BA47). This suggests that the rIFG plays a key role in the detection of relevant task cues, the exploitation of alternative task sets, and the automatic (bottom-up) implementation and reprogramming of action plans. Moreover, N450 amplitudes were larger under high cognitive control demands, which was associated with activity differences in the right medial frontal gyrus (BA9). This most likely reflects a stronger exploitation of explicit task sets which hinders the exploration of the implicit beneficial information in case of high cognitive control demands. Hum Brain Mapp 37:4511-4522, 2016. © 2016 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Ann-Kathrin Stock
- Cognitive Neurophysiology, Department of Child and Adolescent Psychiatry, Faculty of Medicine of the TU Dresden, Dresden, Germany
| | - Laura Steenbergen
- Cognitive Psychology Unit & Leiden Institute for Brain and Cognition, Leiden University, Leiden, The Netherlands
| | - Lorenza Colzato
- Cognitive Psychology Unit & Leiden Institute for Brain and Cognition, Leiden University, Leiden, The Netherlands
| | - Christian Beste
- Cognitive Neurophysiology, Department of Child and Adolescent Psychiatry, Faculty of Medicine of the TU Dresden, Dresden, Germany.,Experimental Neurobiology, National Institute of Mental Health, Klecany, Czech Republic
| |
Collapse
|
39
|
Yamada T, Murata S, Arie H, Ogata T. Dynamical Integration of Language and Behavior in a Recurrent Neural Network for Human-Robot Interaction. Front Neurorobot 2016; 10:5. [PMID: 27471463 PMCID: PMC4946379 DOI: 10.3389/fnbot.2016.00005] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2016] [Accepted: 06/23/2016] [Indexed: 12/03/2022] Open
Abstract
To work cooperatively with humans by using language, robots must not only acquire a mapping between language and their behavior but also autonomously utilize the mapping in appropriate contexts of interactive tasks online. To this end, we propose a novel learning method linking language to robot behavior by means of a recurrent neural network. In this method, the network learns from correct examples of the imposed task that are given not as explicitly separated sets of language and behavior but as sequential data constructed from the actual temporal flow of the task. By doing this, the internal dynamics of the network models both language-behavior relationships and the temporal patterns of interaction. Here, "internal dynamics" refers to the time development of the system defined on the fixed-dimensional space of the internal states of the context layer. Thus, in the execution phase, by constantly representing where in the interaction context it is as its current state, the network autonomously switches between recognition and generation phases without any explicit signs and utilizes the acquired mapping in appropriate contexts. To evaluate our method, we conducted an experiment in which a robot generates appropriate behavior responding to a human's linguistic instruction. After learning, the network actually formed the attractor structure representing both language-behavior relationships and the task's temporal pattern in its internal dynamics. In the dynamics, language-behavior mapping was achieved by the branching structure. Repetition of human's instruction and robot's behavioral response was represented as the cyclic structure, and besides, waiting to a subsequent instruction was represented as the fixed-point attractor. Thanks to this structure, the robot was able to interact online with a human concerning the given task by autonomously switching phases.
Collapse
Affiliation(s)
- Tatsuro Yamada
- Department of Intermedia Art and Science, Waseda University, Tokyo, Japan
| | - Shingo Murata
- Department of Modern Mechanical Engineering, Waseda University, Tokyo, Japan
| | - Hiroaki Arie
- Department of Modern Mechanical Engineering, Waseda University, Tokyo, Japan
| | - Tetsuya Ogata
- Department of Intermedia Art and Science, Waseda University, Tokyo, Japan
| |
Collapse
|
40
|
Enel P, Procyk E, Quilodran R, Dominey PF. Reservoir Computing Properties of Neural Dynamics in Prefrontal Cortex. PLoS Comput Biol 2016; 12:e1004967. [PMID: 27286251 PMCID: PMC4902312 DOI: 10.1371/journal.pcbi.1004967] [Citation(s) in RCA: 70] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2015] [Accepted: 05/08/2016] [Indexed: 11/25/2022] Open
Abstract
Primates display a remarkable ability to adapt to novel situations. Determining what is most pertinent in these situations is not always possible based only on the current sensory inputs, and often also depends on recent inputs and behavioral outputs that contribute to internal states. Thus, one can ask how cortical dynamics generate representations of these complex situations. It has been observed that mixed selectivity in cortical neurons contributes to represent diverse situations defined by a combination of the current stimuli, and that mixed selectivity is readily obtained in randomly connected recurrent networks. In this context, these reservoir networks reproduce the highly recurrent nature of local cortical connectivity. Recombining present and past inputs, random recurrent networks from the reservoir computing framework generate mixed selectivity which provides pre-coded representations of an essentially universal set of contexts. These representations can then be selectively amplified through learning to solve the task at hand. We thus explored their representational power and dynamical properties after training a reservoir to perform a complex cognitive task initially developed for monkeys. The reservoir model inherently displayed a dynamic form of mixed selectivity, key to the representation of the behavioral context over time. The pre-coded representation of context was amplified by training a feedback neuron to explicitly represent this context, thereby reproducing the effect of learning and allowing the model to perform more robustly. This second version of the model demonstrates how a hybrid dynamical regime combining spatio-temporal processing of reservoirs, and input driven attracting dynamics generated by the feedback neuron, can be used to solve a complex cognitive task. We compared reservoir activity to neural activity of dorsal anterior cingulate cortex of monkeys which revealed similar network dynamics. We argue that reservoir computing is a pertinent framework to model local cortical dynamics and their contribution to higher cognitive function. One of the most noteworthy properties of primate behavior is its diversity and adaptability. Human and non-human primates can learn an astonishing variety of novel behaviors that could not have been directly anticipated by evolution. How then can the nervous system be prewired to anticipate the ability to represent such an open class of behaviors? Recent developments in a branch of recurrent neural networks, referred to as reservoir computing, begins to shed light on this question. The novelty of reservoir computing is that the recurrent connections in the network are fixed, and only the connections from these neurons to the output neurons change with learning. The fixed recurrent connections provide the network with an inherent high dimensional dynamics that creates essentially all possible spatial and temporal combinations of the inputs which can then be selected, by learning, to perform the desired task. This high dimensional mixture of activity inherent to reservoirs has begun to be found in the primate cortex. Here we make direct comparisons between dynamic coding in the cortex and in reservoirs performing the same task, and contribute to the emerging evidence that cortex has significant reservoir properties.
Collapse
Affiliation(s)
- Pierre Enel
- Univ Lyon, Université Lyon 1, Inserm, Stem Cell and Brain Research Institute U1208, Bron, France
- Icahn School of Medicine at Mount Sinai, New York, New York, United States of America
| | - Emmanuel Procyk
- Univ Lyon, Université Lyon 1, Inserm, Stem Cell and Brain Research Institute U1208, Bron, France
| | - René Quilodran
- Univ Lyon, Université Lyon 1, Inserm, Stem Cell and Brain Research Institute U1208, Bron, France
- Escuela de Medicina, Departamento de Pre-clínicas, Universidad de Valparaíso, Hontaneda, Valparaíso, Chile
| | - Peter Ford Dominey
- Univ Lyon, Université Lyon 1, Inserm, Stem Cell and Brain Research Institute U1208, Bron, France
- * E-mail:
| |
Collapse
|
41
|
Tepper JA, Shertil MS, Powell HM. On the importance of sluggish state memory for learning long term dependency. Knowl Based Syst 2016. [DOI: 10.1016/j.knosys.2015.12.024] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
|
42
|
Reservoir computing and the Sooner-is-Better bottleneck. Behav Brain Sci 2016; 39:e73. [DOI: 10.1017/s0140525x15000783] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
AbstractPrior language input is not lost but integrated with the current input. This principle is demonstrated by “reservoir computing”: Untrained recurrent neural networks project input sequences onto a random point in high-dimensional state space. Earlier inputs can be retrieved from this projection, albeit less reliably so as more input is received. The bottleneck is therefore not “Now-or-Never” but “Sooner-is-Better.”
Collapse
|
43
|
van Dijk D, van der Velde F. A central pattern generator for controlling sequential activation in a neural architecture for sentence processing. Neurocomputing 2015. [DOI: 10.1016/j.neucom.2014.12.113] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
|
44
|
Golosio B, Cangelosi A, Gamotina O, Masala GL. A Cognitive Neural Architecture Able to Learn and Communicate through Natural Language. PLoS One 2015; 10:e0140866. [PMID: 26560154 PMCID: PMC4641699 DOI: 10.1371/journal.pone.0140866] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2015] [Accepted: 10/01/2015] [Indexed: 11/18/2022] Open
Abstract
Communicative interactions involve a kind of procedural knowledge that is used by the human brain for processing verbal and nonverbal inputs and for language production. Although considerable work has been done on modeling human language abilities, it has been difficult to bring them together to a comprehensive tabula rasa system compatible with current knowledge of how verbal information is processed in the brain. This work presents a cognitive system, entirely based on a large-scale neural architecture, which was developed to shed light on the procedural knowledge involved in language elaboration. The main component of this system is the central executive, which is a supervising system that coordinates the other components of the working memory. In our model, the central executive is a neural network that takes as input the neural activation states of the short-term memory and yields as output mental actions, which control the flow of information among the working memory components through neural gating mechanisms. The proposed system is capable of learning to communicate through natural language starting from tabula rasa, without any a priori knowledge of the structure of phrases, meaning of words, role of the different classes of words, only by interacting with a human through a text-based interface, using an open-ended incremental learning process. It is able to learn nouns, verbs, adjectives, pronouns and other word classes, and to use them in expressive language. The model was validated on a corpus of 1587 input sentences, based on literature on early language assessment, at the level of about 4-years old child, and produced 521 output sentences, expressing a broad range of language processing functionalities.
Collapse
Affiliation(s)
- Bruno Golosio
- POLCOMING Department, Section of Engineering and Information Technologies, University of Sassari, Sassari, Italy
- * E-mail:
| | - Angelo Cangelosi
- Centre for Robotics and Neural Systems, School of Computing and Mathematics, University of Plymouth, Plymouth, United Kingdom
| | - Olesya Gamotina
- POLCOMING Department, Section of Engineering and Information Technologies, University of Sassari, Sassari, Italy
| | - Giovanni Luca Masala
- POLCOMING Department, Section of Engineering and Information Technologies, University of Sassari, Sassari, Italy
| |
Collapse
|
45
|
Hinaut X, Lance F, Droin C, Petit M, Pointeau G, Dominey PF. Corticostriatal response selection in sentence production: Insights from neural network simulation with reservoir computing. BRAIN AND LANGUAGE 2015; 150:54-68. [PMID: 26335997 DOI: 10.1016/j.bandl.2015.08.002] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/16/2015] [Revised: 07/18/2015] [Accepted: 08/04/2015] [Indexed: 06/05/2023]
Abstract
Language production requires selection of the appropriate sentence structure to accommodate the communication goal of the speaker - the transmission of a particular meaning. Here we consider event meanings, in terms of predicates and thematic roles, and we address the problem that a given event can be described from multiple perspectives, which poses a problem of response selection. We present a model of response selection in sentence production that is inspired by the primate corticostriatal system. The model is implemented in the context of reservoir computing where the reservoir - a recurrent neural network with fixed connections - corresponds to cortex, and the readout corresponds to the striatum. We demonstrate robust learning, and generalization properties of the model, and demonstrate its cross linguistic capabilities in English and Japanese. The results contribute to the argument that the corticostriatal system plays a role in response selection in language production, and to the stance that reservoir computing is a valid potential model of corticostriatal processing.
Collapse
Affiliation(s)
- Xavier Hinaut
- CNPS, UMR CNRS 8195, University Paris-Sud, Orsay, France
| | - Florian Lance
- INSERM Stem Cell and Brain Research Institute, Human and Robot Cognitive Systems, 18 Ave Lepine, 69675 Bron Cedex, France
| | - Colas Droin
- INSERM Stem Cell and Brain Research Institute, Human and Robot Cognitive Systems, 18 Ave Lepine, 69675 Bron Cedex, France
| | - Maxime Petit
- INSERM Stem Cell and Brain Research Institute, Human and Robot Cognitive Systems, 18 Ave Lepine, 69675 Bron Cedex, France
| | - Gregoire Pointeau
- INSERM Stem Cell and Brain Research Institute, Human and Robot Cognitive Systems, 18 Ave Lepine, 69675 Bron Cedex, France
| | - Peter Ford Dominey
- INSERM Stem Cell and Brain Research Institute, Human and Robot Cognitive Systems, 18 Ave Lepine, 69675 Bron Cedex, France.
| |
Collapse
|
46
|
Batens K, De Letter M, Raedt R, Duyck W, Vanhoutte S, Van Roost D, Santens P. Subthalamic nucleus stimulation and spontaneous language production in Parkinson's disease: A double laterality problem. BRAIN AND LANGUAGE 2015; 147:76-84. [PMID: 26099950 DOI: 10.1016/j.bandl.2015.06.002] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/11/2014] [Revised: 05/19/2015] [Accepted: 06/01/2015] [Indexed: 06/04/2023]
Abstract
BACKGROUND Asymmetric degeneration of dopaminergic neurons, are characteristic for Parkinson's disease (PD). Despite the lateralized representation of language, the correlation of asymmetric degeneration of nigrostriatal networks in PD with language performance has scarcely been examined. OBJECTIVE/HYPOTHESIS The laterality of dopamine depletion influences language deficits in PD and thus modulates the effects of subthalamic nucleus (STN) stimulation on language production. METHODS The spontaneous language production of patients with predominant dopamine depletion of the left (PD-left) and right (PD-right) hemisphere was compared in four stimulation conditions. RESULTS PD-right made comparatively more verb inflection errors than PD-left. Bilateral STN stimulation improves spontaneous language production only for PD-left. CONCLUSIONS The laterality of dopamine depletion influences spontaneous language production and the effect of STN stimulation on linguistic functions. However, it is probably only one of the many variables influencing the effect of STN stimulation on language production.
Collapse
Affiliation(s)
- Katja Batens
- Department of Neurology, Ghent University Hospital, De Pintelaan 185, B-9000 Ghent, Belgium.
| | - Miet De Letter
- Department of Neurology, Ghent University Hospital, De Pintelaan 185, B-9000 Ghent, Belgium; Department of Speech, Language and Hearing Sciences, Ghent University, De Pintelaan 185, B-9000 Ghent, Belgium
| | - Robrecht Raedt
- Department of Neurology, Ghent University Hospital, De Pintelaan 185, B-9000 Ghent, Belgium; Department of Internal Medicine, Neurology, Ghent University, De Pintelaan 185, B-9000 Ghent, Belgium
| | - Wouter Duyck
- Faculty of Psychology and Educational Sciences, Department of Experimental Psychology, Ghent University, Henri Dunantlaan 2, B-9000 Ghent, Belgium
| | - Sarah Vanhoutte
- Department of Internal Medicine, Neurology, Ghent University, De Pintelaan 185, B-9000 Ghent, Belgium
| | - Dirk Van Roost
- Department of Neurosurgery, Ghent University Hospital, De Pintelaan 185, B-9000 Ghent, Belgium
| | - Patrick Santens
- Department of Neurology, Ghent University Hospital, De Pintelaan 185, B-9000 Ghent, Belgium; Faculty of Psychology and Educational Sciences, Department of Experimental Psychology, Ghent University, Henri Dunantlaan 2, B-9000 Ghent, Belgium
| |
Collapse
|
47
|
Dasgupta S, Wörgötter F, Manoonpong P. Neuromodulatory adaptive combination of correlation-based learning in cerebellum and reward-based learning in basal ganglia for goal-directed behavior control. Front Neural Circuits 2014; 8:126. [PMID: 25389391 PMCID: PMC4211401 DOI: 10.3389/fncir.2014.00126] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2014] [Accepted: 09/30/2014] [Indexed: 12/30/2022] Open
Abstract
Goal-directed decision making in biological systems is broadly based on associations between conditional and unconditional stimuli. This can be further classified as classical conditioning (correlation-based learning) and operant conditioning (reward-based learning). A number of computational and experimental studies have well established the role of the basal ganglia in reward-based learning, where as the cerebellum plays an important role in developing specific conditioned responses. Although viewed as distinct learning systems, recent animal experiments point toward their complementary role in behavioral learning, and also show the existence of substantial two-way communication between these two brain structures. Based on this notion of co-operative learning, in this paper we hypothesize that the basal ganglia and cerebellar learning systems work in parallel and interact with each other. We envision that such an interaction is influenced by reward modulated heterosynaptic plasticity (RMHP) rule at the thalamus, guiding the overall goal directed behavior. Using a recurrent neural network actor-critic model of the basal ganglia and a feed-forward correlation-based learning model of the cerebellum, we demonstrate that the RMHP rule can effectively balance the outcomes of the two learning systems. This is tested using simulated environments of increasing complexity with a four-wheeled robot in a foraging task in both static and dynamic configurations. Although modeled with a simplified level of biological abstraction, we clearly demonstrate that such a RMHP induced combinatorial learning mechanism, leads to stabler and faster learning of goal-directed behaviors, in comparison to the individual systems. Thus, in this paper we provide a computational model for adaptive combination of the basal ganglia and cerebellum learning systems by way of neuromodulated plasticity for goal-directed decision making in biological and bio-mimetic organisms.
Collapse
Affiliation(s)
- Sakyasingha Dasgupta
- Institute for Physics - Biophysics, George-August-UniversityGöttingen, Germany
- Bernstein Center for Computational Neuroscience, George-August-UniversityGöttingen, Germany
| | - Florentin Wörgötter
- Institute for Physics - Biophysics, George-August-UniversityGöttingen, Germany
- Bernstein Center for Computational Neuroscience, George-August-UniversityGöttingen, Germany
| | - Poramate Manoonpong
- Bernstein Center for Computational Neuroscience, George-August-UniversityGöttingen, Germany
- Center for Biorobotics, Maersk Mc-Kinney Møller Institute, University of Southern DenmarkOdense, Denmark
| |
Collapse
|
48
|
Kempen G. Prolegomena to a neurocomputational architecture for human grammatical encoding and decoding. Neuroinformatics 2014; 12:111-42. [PMID: 23872869 DOI: 10.1007/s12021-013-9191-4] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
Abstract
This study develops a neurocomputational architecture for grammatical processing in language production and language comprehension (grammatical encoding and decoding, respectively). It seeks to answer two questions. First, how is online syntactic structure formation of the complexity required by natural-language grammars possible in a fixed, preexisting neural network without the need for online creation of new connections or associations? Second, is it realistic to assume that the seemingly disparate instantiations of syntactic structure formation in grammatical encoding and grammatical decoding can run on the same neural infrastructure? This issue is prompted by accumulating experimental evidence for the hypothesis that the mechanisms for grammatical decoding overlap with those for grammatical encoding to a considerable extent, thus inviting the hypothesis of a single "grammatical coder." The paper answers both questions by providing the blueprint for a syntactic structure formation mechanism that is entirely based on prewired circuitry (except for referential processing, which relies on the rapid learning capacity of the hippocampal complex), and can subserve decoding as well as encoding tasks. The model builds on the "Unification Space" model of syntactic parsing developed by Vosse and Kempen (Cognition 75:105-143, 2000; Cognitive Neurodynamics 3:331-346, 2009a). The design includes a neurocomputational mechanism for the treatment of an important class of grammatical movement phenomena.
Collapse
Affiliation(s)
- Gerard Kempen
- Max Planck Institute for Psycholinguistics, PO Box 310, 6500 AH, Nijmegen, The Netherlands,
| |
Collapse
|
49
|
Hinaut X, Petit M, Pointeau G, Dominey PF. Exploring the acquisition and production of grammatical constructions through human-robot interaction with echo state networks. Front Neurorobot 2014; 8:16. [PMID: 24834050 PMCID: PMC4018555 DOI: 10.3389/fnbot.2014.00016] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2014] [Accepted: 03/27/2014] [Indexed: 11/13/2022] Open
Abstract
One of the principal functions of human language is to allow people to coordinate joint action. This includes the description of events, requests for action, and their organization in time. A crucial component of language acquisition is learning the grammatical structures that allow the expression of such complex meaning related to physical events. The current research investigates the learning of grammatical constructions and their temporal organization in the context of human-robot physical interaction with the embodied sensorimotor humanoid platform, the iCub. We demonstrate three noteworthy phenomena. First, a recurrent network model is used in conjunction with this robotic platform to learn the mappings between grammatical forms and predicate-argument representations of meanings related to events, and the robot's execution of these events in time. Second, this learning mechanism functions in the inverse sense, i.e., in a language production mode, where rather than executing commanded actions, the robot will describe the results of human generated actions. Finally, we collect data from naïve subjects who interact with the robot via spoken language, and demonstrate significant learning and generalization results. This allows us to conclude that such a neural language learning system not only helps to characterize and understand some aspects of human language acquisition, but also that it can be useful in adaptive human-robot interaction.
Collapse
Affiliation(s)
- Xavier Hinaut
- Stem Cell and Brain Research Institute, INSERM U846 Bron, France ; Université de Lyon, Université Lyon I Lyon, France
| | - Maxime Petit
- Stem Cell and Brain Research Institute, INSERM U846 Bron, France ; Université de Lyon, Université Lyon I Lyon, France
| | - Gregoire Pointeau
- Stem Cell and Brain Research Institute, INSERM U846 Bron, France ; Université de Lyon, Université Lyon I Lyon, France
| | - Peter Ford Dominey
- Stem Cell and Brain Research Institute, INSERM U846 Bron, France ; Université de Lyon, Université Lyon I Lyon, France ; Centre National de la Recherche Scientifique Bron, France
| |
Collapse
|
50
|
Design and Construction of a Brain-Like Computer: A New Class of Frequency-Fractal Computing Using Wireless Communication in a Supramolecular Organic, Inorganic System. INFORMATION 2014. [DOI: 10.3390/info5010028] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
|