1
|
Aubret A, Matignon L, Hassas S. An Information-Theoretic Perspective on Intrinsic Motivation in Reinforcement Learning: A Survey. ENTROPY (BASEL, SWITZERLAND) 2023; 25:327. [PMID: 36832693 PMCID: PMC9954873 DOI: 10.3390/e25020327] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/28/2022] [Revised: 01/23/2023] [Accepted: 02/07/2023] [Indexed: 06/18/2023]
Abstract
The reinforcement learning (RL) research area is very active, with an important number of new contributions, especially considering the emergent field of deep RL (DRL). However, a number of scientific and technical challenges still need to be resolved, among which we acknowledge the ability to abstract actions or the difficulty to explore the environment in sparse-reward settings which can be addressed by intrinsic motivation (IM). We propose to survey these research works through a new taxonomy based on information theory: we computationally revisit the notions of surprise, novelty, and skill-learning. This allows us to identify advantages and disadvantages of methods and exhibit current outlooks of research. Our analysis suggests that novelty and surprise can assist the building of a hierarchy of transferable skills which abstracts dynamics and makes the exploration process more robust.
Collapse
|
2
|
Ding Z, Xie H, Li P, Xu X. A structural developmental neural network with information saturation for continual unsupervised learning. CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY 2023. [DOI: 10.1049/cit2.12169] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023] Open
Affiliation(s)
- Zhiyong Ding
- College of Intelligence Science and Technology National University of Defense Technology Changsha China
| | - Haibin Xie
- College of Intelligence Science and Technology National University of Defense Technology Changsha China
| | - Peng Li
- College of Intelligence Science and Technology National University of Defense Technology Changsha China
| | - Xin Xu
- College of Intelligence Science and Technology National University of Defense Technology Changsha China
| |
Collapse
|
3
|
Li A, Ma X. Scalable Cognitive Developmental Network:a strategy for integrating new perception online using relation evolution SOINN. COGN SYST RES 2023. [DOI: 10.1016/j.cogsys.2023.02.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/12/2023]
|
4
|
Duczek N, Kerzel M, Allgeuer P, Wermter S. Self-organized Learning from Synthetic and Real-World Data for a Humanoid Exercise Robot. Front Robot AI 2022; 9:669719. [PMID: 36274912 PMCID: PMC9585214 DOI: 10.3389/frobt.2022.669719] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2021] [Accepted: 07/07/2022] [Indexed: 11/13/2022] Open
Abstract
We propose a neural learning approach for a humanoid exercise robot that can automatically analyze and correct physical exercises. Such an exercise robot should be able to train many different human partners over time and thus requires the ability for lifelong learning. To this end, we develop a modified Grow-When-Required (GWR) network with recurrent connections, episodic memory and a novel subnode mechanism for learning spatiotemporal relationships of body movements and poses. Once an exercise is successfully demonstrated, the information of pose and movement per frame is stored in the Subnode-GWR network. For every frame, the current pose and motion pair is compared against a predicted output of the GWR, allowing for feedback not only on the pose but also on the velocity of the motion. Since both the pose and motion depend on a user's body morphology, the exercise demonstration by one individual cannot easily be used as a reference for further users. We allow the GWR to grow online with each further demonstration. The subnode mechanism ensures that exercise information for individual humans is stored and retrieved correctly and is not forgotten over time. In the application scenario, a physical exercise is performed in the presence of an expert like a physiotherapist and then used as a reference for a humanoid robot like Pepper to give feedback on further executions of the same exercise. For evaluation, we developed a new synthetic exercise dataset with virtual avatars. We also test our method on real-world data recorded in an office scenario. Overall, we claim that our novel GWR-based architecture can use a learned exercise reference for different body variations through incremental online learning while preventing catastrophic forgetting, enabling an engaging long-term human-robot experience with a humanoid robot.
Collapse
Affiliation(s)
| | - Matthias Kerzel
- Knowledge Technology, Department of Informatics, University of Hamburg, Hamburg, Germany
| | | | | |
Collapse
|
5
|
Gryshchuk V, Weber C, Loo CK, Wermter S. Go ahead and do not forget: Modular lifelong learning from event-based data. Neurocomputing 2022. [DOI: 10.1016/j.neucom.2022.05.101] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
6
|
Zhong C, Liu S, Lu Q, Zhang B, Wang J, Wu Q. Topological structural analysis based on self-adaptive growing neural network for shape feature extraction. Neurocomputing 2022. [DOI: 10.1016/j.neucom.2022.06.086] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
|
7
|
Non-Uniform Input-Based Adaptive Growing Neural Gas for Unstructured Environment Map Construction. APPLIED SCIENCES-BASEL 2022. [DOI: 10.3390/app12126110] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
The research and development of special robots such as excavation robots is an important way to achieve safe and efficient production in coal mines. Affected by the unstructured environment such as complex working conditions and unsteady factor disturbances, the real-time construction of section environment maps that can accurately describe the environment and facilitate trajectory planning and decision making has become a key scientific problem to be solved as soon as possible. Therefore, non-uniform input based adaptive growing neural gas for unstructured environment map construction has been proposed. Considering complex load identification, real-time location identification, and the types of unsteady disturbance factors and working conditions, a set of environment identification models has been established based on a large amount of underground measured data training. These models can express whether the section environment has changed, as well as the type and magnitude of the change, to realize the overall knowledge extraction and parametric representation of the unstructured environment. Then, in order to solve the problems of inaccurate topology, excessive aging of connecting edges, and excessive deletion of nodes in non-uniform input environment, an adaptive growing neural gas algorithm based on non-uniform input environment (AGNG-NU) is proposed. Featured by a dynamic response deletion mechanism and adaptive adjustment mechanism of neuron parameters, the generated nodes and their topology can be dynamically adjusted according to the density of regional sample points. Several sets of non-uniform input environments are set to test the algorithm. The experimental results show that the topological maps established by AGNG-NU express clearer environmental information and, at the same time, the accuracy and distribution are improved by 8% and 15%, respectively, compared with the basic GNG algorithm. The accuracy and the distribution have also been significantly improved compared with other common SOM and GCS algorithms.
Collapse
|
8
|
Lifelong 3D object recognition and grasp synthesis using dual memory recurrent self-organization networks. Neural Netw 2022; 150:167-180. [DOI: 10.1016/j.neunet.2022.02.027] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2021] [Revised: 01/18/2022] [Accepted: 02/28/2022] [Indexed: 11/22/2022]
|
9
|
Churamani N, Barros P, Gunes H, Wermter S. Affect-Driven Learning of Robot Behaviour for Collaborative Human-Robot Interactions. Front Robot AI 2022; 9:717193. [PMID: 35265672 PMCID: PMC8898942 DOI: 10.3389/frobt.2022.717193] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2021] [Accepted: 01/17/2022] [Indexed: 11/29/2022] Open
Abstract
Collaborative interactions require social robots to share the users’ perspective on the interactions and adapt to the dynamics of their affective behaviour. Yet, current approaches for affective behaviour generation in robots focus on instantaneous perception to generate a one-to-one mapping between observed human expressions and static robot actions. In this paper, we propose a novel framework for affect-driven behaviour generation in social robots. The framework consists of (i) a hybrid neural model for evaluating facial expressions and speech of the users, forming intrinsic affective representations in the robot, (ii) an Affective Core, that employs self-organising neural models to embed behavioural traits like patience and emotional actuation that modulate the robot’s affective appraisal, and (iii) a Reinforcement Learning model that uses the robot’s appraisal to learn interaction behaviour. We investigate the effect of modelling different affective core dispositions on the affective appraisal and use this affective appraisal as the motivation to generate robot behaviours. For evaluation, we conduct a user study (n = 31) where the NICO robot acts as a proposer in the Ultimatum Game. The effect of the robot’s affective core on its negotiation strategy is witnessed by participants, who rank a patient robot with high emotional actuation higher on persistence, while an impatient robot with low emotional actuation is rated higher on its generosity and altruistic behaviour.
Collapse
Affiliation(s)
- Nikhil Churamani
- Department of Computer Science and Technology, University of Cambridge, Cambridge, United Kingdom
- *Correspondence: Nikhil Churamani,
| | - Pablo Barros
- Cognitive Architecture for Collaborative Technologies (CONTACT) Unit, Istituto Italiano di Tecnologia, Genova, Italy
| | - Hatice Gunes
- Department of Computer Science and Technology, University of Cambridge, Cambridge, United Kingdom
| | - Stefan Wermter
- Knowledge Technology, Department of Informatics, University of Hamburg, Hamburg, Germany
| |
Collapse
|
10
|
Zhang HT, Park TJ, Islam ANMN, Tran DSJ, Manna S, Wang Q, Mondal S, Yu H, Banik S, Cheng S, Zhou H, Gamage S, Mahapatra S, Zhu Y, Abate Y, Jiang N, Sankaranarayanan SKRS, Sengupta A, Teuscher C, Ramanathan S. Reconfigurable perovskite nickelate electronics for artificial intelligence. Science 2022; 375:533-539. [PMID: 35113713 DOI: 10.1126/science.abj7943] [Citation(s) in RCA: 35] [Impact Index Per Article: 17.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Reconfigurable devices offer the ability to program electronic circuits on demand. In this work, we demonstrated on-demand creation of artificial neurons, synapses, and memory capacitors in post-fabricated perovskite NdNiO3 devices that can be simply reconfigured for a specific purpose by single-shot electric pulses. The sensitivity of electronic properties of perovskite nickelates to the local distribution of hydrogen ions enabled these results. With experimental data from our memory capacitors, simulation results of a reservoir computing framework showed excellent performance for tasks such as digit recognition and classification of electrocardiogram heartbeat activity. Using our reconfigurable artificial neurons and synapses, simulated dynamic networks outperformed static networks for incremental learning scenarios. The ability to fashion the building blocks of brain-inspired computers on demand opens up new directions in adaptive networks.
Collapse
Affiliation(s)
- Hai-Tian Zhang
- School of Materials Engineering, Purdue University, West Lafayette, IN 47907, USA
| | - Tae Joon Park
- School of Materials Engineering, Purdue University, West Lafayette, IN 47907, USA
| | - A N M Nafiul Islam
- Department of Electrical Engineering, Pennsylvania State University, University Park, PA, 16802, USA
| | - Dat S J Tran
- Department of Electrical and Computer Engineering, Santa Clara University, Santa Clara, CA 95053, USA
| | - Sukriti Manna
- Center for Nanoscale Materials, Argonne National Laboratory, Argonne, IL 60439, USA.,Department of Mechanical and Industrial Engineering, University of Illinois Chicago, Chicago, IL 60607, USA
| | - Qi Wang
- School of Materials Engineering, Purdue University, West Lafayette, IN 47907, USA
| | - Sandip Mondal
- School of Materials Engineering, Purdue University, West Lafayette, IN 47907, USA
| | - Haoming Yu
- School of Materials Engineering, Purdue University, West Lafayette, IN 47907, USA
| | - Suvo Banik
- Center for Nanoscale Materials, Argonne National Laboratory, Argonne, IL 60439, USA.,Department of Mechanical and Industrial Engineering, University of Illinois Chicago, Chicago, IL 60607, USA
| | - Shaobo Cheng
- Department of Condensed Matter Physics and Materials Science, Brookhaven National Laboratory, Upton, NY 11973, USA
| | - Hua Zhou
- X-ray Science Division, Advanced Photon Source, Argonne National Laboratory, Lemont, IL 60439, USA
| | - Sampath Gamage
- Department of Physics and Astronomy, University of Georgia, Athens, GA 30602, USA
| | - Sayantan Mahapatra
- Department of Chemistry, University of Illinois Chicago, Chicago, IL 60607, USA
| | - Yimei Zhu
- Department of Condensed Matter Physics and Materials Science, Brookhaven National Laboratory, Upton, NY 11973, USA
| | - Yohannes Abate
- Department of Physics and Astronomy, University of Georgia, Athens, GA 30602, USA
| | - Nan Jiang
- Department of Chemistry, University of Illinois Chicago, Chicago, IL 60607, USA
| | - Subramanian K R S Sankaranarayanan
- Center for Nanoscale Materials, Argonne National Laboratory, Argonne, IL 60439, USA.,Department of Mechanical and Industrial Engineering, University of Illinois Chicago, Chicago, IL 60607, USA
| | - Abhronil Sengupta
- Department of Electrical Engineering, Pennsylvania State University, University Park, PA, 16802, USA
| | - Christof Teuscher
- Department of Electrical and Computer Engineering, Portland State University, Portland, OR 97201, USA
| | - Shriram Ramanathan
- School of Materials Engineering, Purdue University, West Lafayette, IN 47907, USA
| |
Collapse
|
11
|
Carta A, Cossu A, Errica F, Bacciu D. Catastrophic Forgetting in Deep Graph Networks: A Graph Classification Benchmark. Front Artif Intell 2022; 5:824655. [PMID: 35187476 PMCID: PMC8855050 DOI: 10.3389/frai.2022.824655] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2021] [Accepted: 01/11/2022] [Indexed: 11/29/2022] Open
Abstract
In this work, we study the phenomenon of catastrophic forgetting in the graph representation learning scenario. The primary objective of the analysis is to understand whether classical continual learning techniques for flat and sequential data have a tangible impact on performances when applied to graph data. To do so, we experiment with a structure-agnostic model and a deep graph network in a robust and controlled environment on three different datasets. The benchmark is complemented by an investigation on the effect of structure-preserving regularization techniques on catastrophic forgetting. We find that replay is the most effective strategy in so far, which also benefits the most from the use of regularization. Our findings suggest interesting future research at the intersection of the continual and graph representation learning fields. Finally, we provide researchers with a flexible software framework to reproduce our results and carry out further experiments.
Collapse
Affiliation(s)
- Antonio Carta
- Computer Science Department, University of Pisa, Pisa, Italy
- *Correspondence: Antonio Carta
| | - Andrea Cossu
- Computer Science Department, University of Pisa, Pisa, Italy
- Scuola Normale Superiore, Pisa, Italy
| | - Federico Errica
- Computer Science Department, University of Pisa, Pisa, Italy
| | - Davide Bacciu
- Computer Science Department, University of Pisa, Pisa, Italy
| |
Collapse
|
12
|
Self-Organizing Map Network for the Decision Making in Combined Mode Conduction-Radiation Heat Transfer in Porous Medium. ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING 2022. [DOI: 10.1007/s13369-021-06489-4] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
|
13
|
Szadkowski R, Drchal J, Faigl J. Continually trained life-long classification. Neural Comput Appl 2022. [DOI: 10.1007/s00521-021-06154-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
14
|
Gandolfi M, Ceotto M. Unsupervised Machine Learning Neural Gas Algorithm for Accurate Evaluations of the Hessian Matrix in Molecular Dynamics. J Chem Theory Comput 2021; 17:6733-6746. [PMID: 34705463 PMCID: PMC8582248 DOI: 10.1021/acs.jctc.1c00707] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2021] [Indexed: 11/29/2022]
Abstract
The Hessian matrix of the potential energy of molecular systems is employed not only in geometry optimizations or high-order molecular dynamics integrators but also in many other molecular procedures, such as instantaneous normal mode analysis, force field construction, instanton calculations, and semiclassical initial value representation molecular dynamics, to name a few. Here, we present an algorithm for the calculation of the approximated Hessian in molecular dynamics. The algorithm belongs to the family of unsupervised machine learning methods, and it is based on the neural gas idea, where neurons are molecular configurations whose Hessians are adopted for groups of molecular dynamics configurations with similar geometries. The method is tested on several molecular systems of different dimensionalities both in terms of accuracy and computational time versus calculating the Hessian matrix at each time-step, that is, without any approximation, and other Hessian approximation schemes. Finally, the method is applied to the on-the-fly, full-dimensional simulation of a small synthetic peptide (the 46 atom N-acetyl-l-phenylalaninyl-l-methionine amide) at the level of DFT-B3LYP-D/6-31G* theory, from which the semiclassical vibrational power spectrum is calculated.
Collapse
Affiliation(s)
- Michele Gandolfi
- Dipartimento di Chimica, Università
degli Studi di Milano, via Golgi 19, 20133 Milano, Italy
| | - Michele Ceotto
- Dipartimento di Chimica, Università
degli Studi di Milano, via Golgi 19, 20133 Milano, Italy
| |
Collapse
|
15
|
Logacjov A, Kerzel M, Wermter S. Learning Then, Learning Now, and Every Second in Between: Lifelong Learning With a Simulated Humanoid Robot. Front Neurorobot 2021; 15:669534. [PMID: 34276332 PMCID: PMC8281815 DOI: 10.3389/fnbot.2021.669534] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2021] [Accepted: 05/18/2021] [Indexed: 11/13/2022] Open
Abstract
Long-term human-robot interaction requires the continuous acquisition of knowledge. This ability is referred to as lifelong learning (LL). LL is a long-standing challenge in machine learning due to catastrophic forgetting, which states that continuously learning from novel experiences leads to a decrease in the performance of previously acquired knowledge. Two recently published LL approaches are the Growing Dual-Memory (GDM) and the Self-organizing Incremental Neural Network+ (SOINN+). Both are growing neural networks that create new neurons in response to novel sensory experiences. The latter approach shows state-of-the-art clustering performance on sequentially available data with low memory requirements regarding the number of nodes. However, classification capabilities are not investigated. Two novel contributions are made in our research paper: (I) An extended SOINN+ approach, called associative SOINN+ (A-SOINN+), is proposed. It adopts two main properties of the GDM model to facilitate classification. (II) A new LL object recognition dataset (v-NICO-World-LL) is presented. It is recorded in a nearly photorealistic virtual environment, where a virtual humanoid robot manipulates 100 different objects belonging to 10 classes. Real-world and artificially created background images, grouped into four different complexity levels, are utilized. The A-SOINN+ reaches similar state-of-the-art classification accuracy results as the best GDM architecture of this work and consists of 30 to 350 times fewer neurons, evaluated on two LL object recognition datasets, the novel v-NICO-World-LL and the well-known CORe50. Furthermore, we observe an approximately 268 times lower training time. These reduced numbers result in lower memory and computational requirements, indicating higher suitability for autonomous social robots with low computational resources to facilitate a more efficient LL during long-term human-robot interactions.
Collapse
Affiliation(s)
- Aleksej Logacjov
- Department of Informatics, Research Group Knowledge Technology, Universität Hamburg, Hamburg, Germany
| | | | | |
Collapse
|
16
|
Jirak D, Biertimpel D, Kerzel M, Wermter S. Solving visual object ambiguities when pointing: an unsupervised learning approach. Neural Comput Appl 2021. [DOI: 10.1007/s00521-020-05109-w] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
AbstractWhenever we are addressing a specific object or refer to a certain spatial location, we are using referential or deictic gestures usually accompanied by some verbal description. Particularly, pointing gestures are necessary to dissolve ambiguities in a scene and they are of crucial importance when verbal communication may fail due to environmental conditions or when two persons simply do not speak the same language. With the currently increasing advances of humanoid robots and their future integration in domestic domains, the development of gesture interfaces complementing human–robot interaction scenarios is of substantial interest. The implementation of an intuitive gesture scenario is still challenging because both the pointing intention and the corresponding object have to be correctly recognized in real time. The demand increases when considering pointing gestures in a cluttered environment, as is the case in households. Also, humans perform pointing in many different ways and those variations have to be captured. Research in this field often proposes a set of geometrical computations which do not scale well with the number of gestures and objects and use specific markers or a predefined set of pointing directions. In this paper, we propose an unsupervised learning approach to model the distribution of pointing gestures using a growing-when-required (GWR) network. We introduce an interaction scenario with a humanoid robot and define the so-called ambiguity classes. Our implementation for the hand and object detection is independent of any markers or skeleton models; thus, it can be easily reproduced. Our evaluation comparing a baseline computer vision approach with our GWR model shows that the pointing-object association is well learned even in cases of ambiguities resulting from close object proximity.
Collapse
|
17
|
Ezenkwu CP, Starkey A. Unsupervised Temporospatial Neural Architecture for Sensorimotor Map Learning. IEEE Trans Cogn Dev Syst 2021. [DOI: 10.1109/tcds.2019.2934643] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
18
|
Tuyen NTV, Elibol A, Chong NY. Learning Bodily Expression of Emotion for Social Robots Through Human Interaction. IEEE Trans Cogn Dev Syst 2021. [DOI: 10.1109/tcds.2020.3005907] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
|
19
|
Whitelam S. Improving the Accuracy of Nearest-Neighbor Classification Using Principled Construction and Stochastic Sampling of Training-Set Centroids. ENTROPY (BASEL, SWITZERLAND) 2021; 23:149. [PMID: 33530507 PMCID: PMC7911166 DOI: 10.3390/e23020149] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/03/2021] [Accepted: 01/21/2021] [Indexed: 11/16/2022]
Abstract
A conceptually simple way to classify images is to directly compare test-set data and training-set data. The accuracy of this approach is limited by the method of comparison used, and by the extent to which the training-set data cover configuration space. Here we show that this coverage can be substantially increased using coarse-graining (replacing groups of images by their centroids) and stochastic sampling (using distinct sets of centroids in combination). We use the MNIST and Fashion-MNIST data sets to show that a principled coarse-graining algorithm can convert training images into fewer image centroids without loss of accuracy of classification of test-set images by nearest-neighbor classification. Distinct batches of centroids can be used in combination as a means of stochastically sampling configuration space, and can classify test-set data more accurately than can the unaltered training set. On the MNIST and Fashion-MNIST data sets this approach converts nearest-neighbor classification from a mid-ranking- to an upper-ranking member of the set of classical machine-learning techniques.
Collapse
Affiliation(s)
- Stephen Whitelam
- Molecular Foundry, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA
| |
Collapse
|
20
|
Online state space generation by a growing self-organizing map and differential learning for reinforcement learning. Appl Soft Comput 2020. [DOI: 10.1016/j.asoc.2020.106723] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
|
21
|
The GNG neural network in analyzing consumer behaviour patterns: empirical research on a purchasing behaviour processes realized by the elderly consumers. ADV DATA ANAL CLASSI 2020. [DOI: 10.1007/s11634-020-00415-6] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
AbstractThe paper sheds light on the use of a self-learning GNG neural network for identification and exploration of the purchasing behaviour patterns. The test has been conducted on the data collected from consumers aged 60 years and over, with regard to three product purchases. The primary data used to explore the purchasing behaviour patterns was collected during a survey carried out among the elderly students at the Universities of Third Age in Slovenia, the Czech Republic and Poland, in the years 2017–2018. Finally, a total of six different types of purchasing patterns have been identified, namely the ‘thoughtful decision’, the ‘sensitive to recommendation’, the ‘beneficiary, the ‘short thoughtful decision’, the ‘habitual decision’ and ‘multiple’ patterns.
The most significant differences in the purchasing patterns of the three national samples have been identified with regard to the process of purchasing a smartphone, while the most repetitive patterns have been identified with regard to the purchasing of a new product. The results significantly support the GNG network’s validity for identification of consumer behaviour patterns. The application of this method allowed quick and effective to identify and segment consumers groups as well as facilitated the mapping of the differences among these groups and to compare the consumption behaviour expressed by consumers on different markets. The identified consumer purchase patterns may play a basic role for marketers to understand consumer behaviour and then propose tailored strategies in international marketing.
Collapse
|
22
|
GeoBoost: An Incremental Deep Learning Approach toward Global Mapping of Buildings from VHR Remote Sensing Images. REMOTE SENSING 2020. [DOI: 10.3390/rs12111794] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Modern convolutional neural networks (CNNs) are often trained on pre-set data sets with a fixed size. As for the large-scale applications of satellite images, for example, global or regional mappings, these images are collected incrementally by multiple stages in general. In other words, the sizes of training datasets might be increased for the tasks of mapping rather than be fixed beforehand. In this paper, we present a novel algorithm, called GeoBoost, for the incremental-learning tasks of semantic segmentation via convolutional neural networks. Specifically, the GeoBoost algorithm is trained in an end-to-end manner on the newly available data, and it does not decrease the performance of previously trained models. The effectiveness of the GeoBoost algorithm is verified on the large-scale data set of DREAM-B. This method avoids the need for training on the enlarged data set from scratch and would become more effective along with more available data.
Collapse
|
23
|
Horváth G, Kovács E, Molontay R, Nováczki S. Copula-Based Anomaly Scoring and Localization for Large-Scale, High-Dimensional Continuous Data. ACM T INTEL SYST TEC 2020. [DOI: 10.1145/3372274] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
Abstract
The anomaly detection method presented by this article has a special feature: it not only indicates whether or not an observation is anomalous but also tells what exactly makes an anomalous observation unusual. Hence, it provides support to localize the reason of the anomaly.
The proposed approach is model based; it relies on the multivariate probability distribution associated with the observations. Since the rare events are present in the tails of the probability distributions, we use copula functions, which are able to model the fat-tailed distributions well. The presented procedure scales well; it can cope with a large number of high-dimensional samples. Furthermore, our procedure can cope with missing values as well, which occur frequently in high-dimensional datasets.
In the second part of the article, we demonstrate the usability of the method through a case study, where we analyze a large dataset consisting of the performance counters of a real mobile telecommunication network. Since such networks are complex systems, the signs of sub-optimal operation can remain hidden for a potentially long time. With the proposed procedure, many such hidden issues can be isolated and indicated to the network operator.
Collapse
Affiliation(s)
- Gábor Horváth
- Budapest University of Technology and Economics, Budapest, Hungary
| | - Edith Kovács
- University of Debrecen and Budapest University of Technology and Economics, Budapest, Hungary
| | - Roland Molontay
- University of Debrecen and Budapest University of Technology and Economics, Budapest, Hungary
| | | |
Collapse
|
24
|
A scalable multi-signal approach for the parallelization of self-organizing neural networks. Neural Netw 2020; 123:108-117. [DOI: 10.1016/j.neunet.2019.11.016] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2019] [Revised: 10/08/2019] [Accepted: 11/19/2019] [Indexed: 11/23/2022]
|
25
|
|
26
|
Barros P, Eppe M, Parisi GI, Liu X, Wermter S. Expectation Learning for Stimulus Prediction Across Modalities Improves Unisensory Classification. Front Robot AI 2019; 6:137. [PMID: 33501152 PMCID: PMC7806099 DOI: 10.3389/frobt.2019.00137] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2019] [Accepted: 11/25/2019] [Indexed: 11/25/2022] Open
Abstract
Expectation learning is a unsupervised learning process which uses multisensory bindings to enhance unisensory perception. For instance, as humans, we learn to associate a barking sound with the visual appearance of a dog, and we continuously fine-tune this association over time, as we learn, e.g., to associate high-pitched barking with small dogs. In this work, we address the problem of developing a computational model that addresses important properties of expectation learning, in particular focusing on the lack of explicit external supervision other than temporal co-occurrence. To this end, we present a novel hybrid neural model based on audio-visual autoencoders and a recurrent self-organizing network for multisensory bindings that facilitate stimulus reconstructions across different sensory modalities. We refer to this mechanism as stimulus prediction across modalities and demonstrate that the proposed model is capable of learning concept bindings by evaluating it on unisensory classification tasks for audio-visual stimuli using the 43,500 Youtube videos from the animal subset of the AudioSet corpus.
Collapse
Affiliation(s)
- Pablo Barros
- Knowledge Technology, Department of Informatics, University of Hamburg, Hamburg, Germany
| | - Manfred Eppe
- Knowledge Technology, Department of Informatics, University of Hamburg, Hamburg, Germany
| | - German I Parisi
- Knowledge Technology, Department of Informatics, University of Hamburg, Hamburg, Germany
| | - Xun Liu
- Department of Psychology, University of CAS, Beijing, China
| | - Stefan Wermter
- Knowledge Technology, Department of Informatics, University of Hamburg, Hamburg, Germany
| |
Collapse
|
27
|
Pitonakova L, Bullock S. The robustness-fidelity trade-off in Grow When Required neural networks performing continuous novelty detection. Neural Netw 2019; 122:183-195. [PMID: 31683146 DOI: 10.1016/j.neunet.2019.10.015] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2019] [Revised: 10/24/2019] [Accepted: 10/24/2019] [Indexed: 10/25/2022]
Abstract
Novelty detection allows robots to recognise unexpected data in their sensory field and can thus be utilised in applications such as reconnaissance, surveillance, self-monitoring, etc. We assess the suitability of Grow When Required Neural Networks (GWRNNs) for detecting novel features in a robot's visual input in the context of randomised physics-based simulation environments. We compare, for the first time, several GWRNN architectures, including new Plastic architectures in which the number of activated input connections for individual neurons is adjusted dynamically as the robot senses a varying number of salient environmental features. The networks are studied in both one-shot and continuous novelty reporting tasks and we demonstrate that there is a trade-off, not unique to this type of novelty detector, between robustness and fidelity. Robustness is achieved through generalisation over the input space which minimises the impact of network parameters on performance, whereas high fidelity results from learning detailed models of the input space and is especially important when a robot encounters multiple novelties consecutively or must detect that previously encountered objects have disappeared from the environment. We propose a number of improvements that could mitigate the robustness-fidelity trade-off and demonstrate one of them, where localisation information is added to the input data stream being monitored.
Collapse
Affiliation(s)
- Lenka Pitonakova
- Department of Computer Science, University of Bristol, Merchant Venturers' Building, Woodland Road, Bristol, BS8 1UB, United Kingdom
| | - Seth Bullock
- Department of Computer Science, University of Bristol, Merchant Venturers' Building, Woodland Road, Bristol, BS8 1UB, United Kingdom.
| |
Collapse
|
28
|
Mici L, Parisi GI, Wermter S. Compositional Learning of Human Activities With a Self-Organizing Neural Architecture. Front Robot AI 2019; 6:72. [PMID: 33501087 PMCID: PMC7805845 DOI: 10.3389/frobt.2019.00072] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2019] [Accepted: 07/30/2019] [Indexed: 12/02/2022] Open
Abstract
An important step for assistive systems and robot companions operating in human environments is to learn the compositionality of human activities, i.e., recognize both activities and their comprising actions. Most existing approaches address action and activity recognition as separate tasks, i.e., actions need to be inferred before the activity labels, and are thus highly sensitive to the correct temporal segmentation of the activity sequences. In this paper, we present a novel learning approach that jointly learns human activities on two levels of semantic and temporal complexity: (1) transitive actions such as reaching and opening, e.g., a cereal box, and (2) high-level activities such as having breakfast. Our model consists of a hierarchy of GWR networks which process and learn inherent spatiotemporal dependencies of multiple visual cues extracted from the human body skeletal representation and the interaction with objects. The neural architecture learns and semantically segments input RGB-D sequences of high-level activities into their composing actions, without supervision. We investigate the performance of our architecture with a set of experiments on a publicly available benchmark dataset. The experimental results show that our approach outperforms the state of the art with respect to the classification of the high-level activities. Additionally, we introduce a novel top-down modulation mechanism to the architecture which uses the actions and activity labels as constraints during the learning phase. In our experiments, we show how this mechanism can be used to control the network's neural growth without decreasing the overall performance.
Collapse
|
29
|
Itano F, Pires R, de Abreu de Sousa MA, Del-Moral-Hernandez E. Human actions recognition in video scenes from multiple camera viewpoints. COGN SYST RES 2019. [DOI: 10.1016/j.cogsys.2019.03.010] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
|
30
|
Contreras-Cruz MA, Ramirez-Paredes JP, Hernandez-Belmonte UH, Ayala-Ramirez V. Vision-Based Novelty Detection Using Deep Features and Evolved Novelty Filters for Specific Robotic Exploration and Inspection Tasks. SENSORS (BASEL, SWITZERLAND) 2019; 19:E2965. [PMID: 31284410 PMCID: PMC6651515 DOI: 10.3390/s19132965] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/08/2019] [Revised: 06/18/2019] [Accepted: 06/28/2019] [Indexed: 11/26/2022]
Abstract
One of the essential abilities in animals is to detect novelties within their environment. From the computational point of view, novelty detection consists of finding data that are different in some aspect to the known data. In robotics, researchers have incorporated novelty modules in robots to develop automatic exploration and inspection tasks. The visual sensor is one of the preferred sensors to perform this task. However, there exist problems as illumination changes, occlusion, and scale, among others. Besides, novelty detectors vary their performance depending on the specific application scenario. In this work, we propose a visual novelty detection framework for specific exploration and inspection tasks based on evolved novelty detectors. The system uses deep features to represent the visual information captured by the robots and applies a global optimization technique to design novelty detectors for specific robotics applications. We verified the performance of the proposed system against well-established state-of-the-art methods in a challenging scenario. This scenario was an outdoor environment covering typical problems in computer vision such as illumination changes, occlusion, and geometric transformations. The proposed framework presented high-novelty detection accuracy with competitive or even better results than the baseline methods.
Collapse
Affiliation(s)
- Marco Antonio Contreras-Cruz
- Department of Electronics Engineering, University of Guanajuato, Campus Irapuato-Salamanca, Carr. Salamanca-Valle de Santiago Km 3.5 + 1.8, Comunidad de Palo Blanco, Salamanca 36885, Mexico
| | - Juan Pablo Ramirez-Paredes
- Department of Electronics Engineering, University of Guanajuato, Campus Irapuato-Salamanca, Carr. Salamanca-Valle de Santiago Km 3.5 + 1.8, Comunidad de Palo Blanco, Salamanca 36885, Mexico
| | - Uriel Haile Hernandez-Belmonte
- Department of Art and Enterprise, University of Guanajuato, Campus Irapuato-Salamanca, Carr. Salamanca-Valle de Santiago Km 3.5 + 1.8, Comunidad de Palo Blanco, Salamanca 36885, Mexico
| | - Victor Ayala-Ramirez
- Department of Electronics Engineering, University of Guanajuato, Campus Irapuato-Salamanca, Carr. Salamanca-Valle de Santiago Km 3.5 + 1.8, Comunidad de Palo Blanco, Salamanca 36885, Mexico.
| |
Collapse
|
31
|
Masuyama N, Loo CK, Wermter S. A Kernel Bayesian Adaptive Resonance Theory with A Topological Structure. Int J Neural Syst 2019; 29:1850052. [DOI: 10.1142/s0129065718500521] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
This paper attempts to solve the typical problems of self-organizing growing network models, i.e. (a) an influence of the order of input data on the self-organizing ability, (b) an instability to high-dimensional data and an excessive sensitivity to noise, and (c) an expensive computational cost by integrating Kernel Bayes Rule (KBR) and Correntropy-Induced Metric (CIM) into Adaptive Resonance Theory (ART) framework. KBR performs a covariance-free Bayesian computation which is able to maintain a fast and stable computation. CIM is a generalized similarity measurement which can maintain a high-noise reduction ability even in a high-dimensional space. In addition, a Growing Neural Gas (GNG)-based topology construction process is integrated into the ART framework to enhance its self-organizing ability. The simulation experiments with synthetic and real-world datasets show that the proposed model has an outstanding stable self-organizing ability for various test environments.
Collapse
Affiliation(s)
- Naoki Masuyama
- Department of Computer Science and Intelligent Systems, Graduate School of Engineering, Osaka Prefecture University, 1-1 Gakuen-cho Naka-ku, Sakai-Shi, Osaka 599-8531, Japan
| | - Chu Kiong Loo
- Department of Artificial Intelligence, Faculty of Computer Science and Information Technology, University of Malaya, 50603 Kuala Lumpur, Malaysia
| | - Stefan Wermter
- Department of Informatics, Faculty of Mathematics, Computer Science and Natural Sciences, University of Hamburg, Vogt-Koelln-Str. 30, 22527 Hamburg, Germany
| |
Collapse
|
32
|
Continual lifelong learning with neural networks: A review. Neural Netw 2019; 113:54-71. [DOI: 10.1016/j.neunet.2019.01.012] [Citation(s) in RCA: 322] [Impact Index Per Article: 64.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2018] [Revised: 01/18/2019] [Accepted: 01/22/2019] [Indexed: 10/27/2022]
|
33
|
Zhou X, Bai T, Gao Y, Han Y. Vision-Based Robot Navigation through Combining Unsupervised Learning and Hierarchical Reinforcement Learning. SENSORS (BASEL, SWITZERLAND) 2019; 19:E1576. [PMID: 30939807 PMCID: PMC6479296 DOI: 10.3390/s19071576] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/15/2019] [Revised: 03/21/2019] [Accepted: 03/25/2019] [Indexed: 11/16/2022]
Abstract
Extensive studies have shown that many animals' capability of forming spatial representations for self-localization, path planning, and navigation relies on the functionalities of place and head-direction (HD) cells in the hippocampus. Although there are numerous hippocampal modeling approaches, only a few span the wide functionalities ranging from processing raw sensory signals to planning and action generation. This paper presents a vision-based navigation system that involves generating place and HD cells through learning from visual images, building topological maps based on learned cell representations and performing navigation using hierarchical reinforcement learning. First, place and HD cells are trained from sequences of visual stimuli in an unsupervised learning fashion. A modified Slow Feature Analysis (SFA) algorithm is proposed to learn different cell types in an intentional way by restricting their learning to separate phases of the spatial exploration. Then, to extract the encoded metric information from these unsupervised learning representations, a self-organized learning algorithm is adopted to learn over the emerged cell activities and to generate topological maps that reveal the topology of the environment and information about a robot's head direction, respectively. This enables the robot to perform self-localization and orientation detection based on the generated maps. Finally, goal-directed navigation is performed using reinforcement learning in continuous state spaces which are represented by the population activities of place cells. In particular, considering that the topological map provides a natural hierarchical representation of the environment, hierarchical reinforcement learning (HRL) is used to exploit this hierarchy to accelerate learning. The HRL works on different spatial scales, where a high-level policy learns to select subgoals and a low-level policy learns over primitive actions to specialize on the selected subgoals. Experimental results demonstrate that our system is able to navigate a robot to the desired position effectively, and the HRL shows a much better learning performance than the standard RL in solving our navigation tasks.
Collapse
Affiliation(s)
- Xiaomao Zhou
- College of Automation, Harbin Engineering University, Harbin 150001, China.
| | - Tao Bai
- College of Automation, Harbin Engineering University, Harbin 150001, China.
| | - Yanbin Gao
- College of Automation, Harbin Engineering University, Harbin 150001, China.
| | - Yuntao Han
- College of Automation, Harbin Engineering University, Harbin 150001, China.
| |
Collapse
|
34
|
Mici L, Parisi GI, Wermter S. An Incremental Self-Organizing Architecture for Sensorimotor Learning and Prediction. IEEE Trans Cogn Dev Syst 2018. [DOI: 10.1109/tcds.2018.2832844] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
|
35
|
Parisi GI, Tani J, Weber C, Wermter S. Lifelong Learning of Spatiotemporal Representations With Dual-Memory Recurrent Self-Organization. Front Neurorobot 2018; 12:78. [PMID: 30546302 PMCID: PMC6279894 DOI: 10.3389/fnbot.2018.00078] [Citation(s) in RCA: 25] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2018] [Accepted: 11/06/2018] [Indexed: 11/28/2022] Open
Abstract
Artificial autonomous agents and robots interacting in complex environments are required to continually acquire and fine-tune knowledge over sustained periods of time. The ability to learn from continuous streams of information is referred to as lifelong learning and represents a long-standing challenge for neural network models due to catastrophic forgetting in which novel sensory experience interferes with existing representations and leads to abrupt decreases in the performance on previously acquired knowledge. Computational models of lifelong learning typically alleviate catastrophic forgetting in experimental scenarios with given datasets of static images and limited complexity, thereby differing significantly from the conditions artificial agents are exposed to. In more natural settings, sequential information may become progressively available over time and access to previous experience may be restricted. Therefore, specialized neural network mechanisms are required that adapt to novel sequential experience while preventing disruptive interference with existing representations. In this paper, we propose a dual-memory self-organizing architecture for lifelong learning scenarios. The architecture comprises two growing recurrent networks with the complementary tasks of learning object instances (episodic memory) and categories (semantic memory). Both growing networks can expand in response to novel sensory experience: the episodic memory learns fine-grained spatiotemporal representations of object instances in an unsupervised fashion while the semantic memory uses task-relevant signals to regulate structural plasticity levels and develop more compact representations from episodic experience. For the consolidation of knowledge in the absence of external sensory input, the episodic memory periodically replays trajectories of neural reactivations. We evaluate the proposed model on the CORe50 benchmark dataset for continuous object recognition, showing that we significantly outperform current methods of lifelong learning in three different incremental learning scenarios.
Collapse
Affiliation(s)
- German I. Parisi
- Knowledge Technology, Department of Informatics, Universität Hamburg, Hamburg, Germany
| | - Jun Tani
- Cognitive Neurorobotics Research Unit, Okinawa Institute of Science and Technology, Okinawa, Japan
| | - Cornelius Weber
- Knowledge Technology, Department of Informatics, Universität Hamburg, Hamburg, Germany
| | - Stefan Wermter
- Knowledge Technology, Department of Informatics, Universität Hamburg, Hamburg, Germany
| |
Collapse
|
36
|
|
37
|
Stepanova K, Klein FB, Cangelosi A, Vavrecka M. Mapping Language to Vision in a Real-World Robotic Scenario. IEEE Trans Cogn Dev Syst 2018. [DOI: 10.1109/tcds.2018.2819359] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
38
|
Bouguelia MR, Nowaczyk S, Payberah AH. An adaptive algorithm for anomaly and novelty detection in evolving data streams. Data Min Knowl Discov 2018. [DOI: 10.1007/s10618-018-0571-0] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
39
|
Park YS, Chon TS, Bae MJ, Kim DH, Lek S. Multivariate Data Analysis by Means of Self-Organizing Maps. ECOL INFORM 2018. [DOI: 10.1007/978-3-319-59928-1_12] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
|
40
|
Toprak S, Navarro-Guerrero N, Wermter S. Evaluating Integration Strategies for Visuo-Haptic Object Recognition. Cognit Comput 2017; 10:408-425. [PMID: 29881470 PMCID: PMC5971043 DOI: 10.1007/s12559-017-9536-7] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2017] [Accepted: 12/05/2017] [Indexed: 11/24/2022]
Abstract
In computational systems for visuo-haptic object recognition, vision and haptics are often modeled as separate processes. But this is far from what really happens in the human brain, where cross- as well as multimodal interactions take place between the two sensory modalities. Generally, three main principles can be identified as underlying the processing of the visual and haptic object-related stimuli in the brain: (1) hierarchical processing, (2) the divergence of the processing onto substreams for object shape and material perception, and (3) the experience-driven self-organization of the integratory neural circuits. The question arises whether an object recognition system can benefit in terms of performance from adopting these brain-inspired processing principles for the integration of the visual and haptic inputs. To address this, we compare the integration strategy that incorporates all three principles to the two commonly used integration strategies in the literature. We collected data with a NAO robot enhanced with inexpensive contact microphones as tactile sensors. The results of our experiments involving every-day objects indicate that (1) the contact microphones are a good alternative to capturing tactile information and that (2) organizing the processing of the visual and haptic inputs hierarchically and in two pre-processing streams is helpful performance-wise. Nevertheless, further research is needed to effectively quantify the role of each identified principle by itself as well as in combination with others.
Collapse
Affiliation(s)
- Sibel Toprak
- Knowledge Technology, Department of Informatics, Universität Hamburg, Vogt-Kölln-Str. 30, 22527 Hamburg, Germany
| | - Nicolás Navarro-Guerrero
- Knowledge Technology, Department of Informatics, Universität Hamburg, Vogt-Kölln-Str. 30, 22527 Hamburg, Germany
| | - Stefan Wermter
- Knowledge Technology, Department of Informatics, Universität Hamburg, Vogt-Kölln-Str. 30, 22527 Hamburg, Germany
| |
Collapse
|
41
|
Parisi GI, Tani J, Weber C, Wermter S. Lifelong learning of human actions with deep neural network self-organization. Neural Netw 2017; 96:137-149. [DOI: 10.1016/j.neunet.2017.09.001] [Citation(s) in RCA: 41] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2017] [Revised: 08/23/2017] [Accepted: 09/01/2017] [Indexed: 10/18/2022]
|
42
|
|
43
|
Ghesmoune M, Azzag H, Benbernou S, Lebbah M, Duong T, Ouziri M. Big Data: from collection to visualization. Mach Learn 2017. [DOI: 10.1007/s10994-016-5622-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
44
|
Menegas W, Babayan BM, Uchida N, Watabe-Uchida M. Opposite initialization to novel cues in dopamine signaling in ventral and posterior striatum in mice. eLife 2017; 6. [PMID: 28054919 PMCID: PMC5271609 DOI: 10.7554/elife.21886] [Citation(s) in RCA: 147] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2016] [Accepted: 01/04/2017] [Indexed: 01/02/2023] Open
Abstract
Dopamine neurons are thought to encode novelty in addition to reward prediction error (the discrepancy between actual and predicted values). In this study, we compared dopamine activity across the striatum using fiber fluorometry in mice. During classical conditioning, we observed opposite dynamics in dopamine axon signals in the ventral striatum (‘VS dopamine’) and the posterior tail of the striatum (‘TS dopamine’). TS dopamine showed strong excitation to novel cues, whereas VS dopamine showed no responses to novel cues until they had been paired with a reward. TS dopamine cue responses decreased over time, depending on what the cue predicted. Additionally, TS dopamine showed excitation to several types of stimuli including rewarding, aversive, and neutral stimuli whereas VS dopamine showed excitation only to reward or reward-predicting cues. Together, these results demonstrate that dopamine novelty signals are localized in TS along with general salience signals, while VS dopamine reliably encodes reward prediction error. DOI:http://dx.doi.org/10.7554/eLife.21886.001 New experiences trigger a variety of responses in animals. We are surprised by, move towards, and often explore new objects. But how does the brain control these responses? Dopamine is a molecule that controls many processes in the brain and plays critical roles in various mental disorders, diseases that affect movement, and addiction. Rewarding experiences (like a glass of cold water on a hot day) can trigger dopamine neurons and studies have also shown that dopamine neurons respond to new experiences. This suggested that novelty may be rewarding in itself, or that novelty may signal the potential for future reward. On the other hand, it may be that different groups of dopamine neurons play different roles in responding to new or rewarding experiences. In 2015, it was reported that dopamine neurons connected to the rear part of an area in the brain called the striatum receive signals from different parts of the brain than most other dopamine neurons. The dopamine neurons connected to this “tail” of the striatum preferentially received inputs from regions involved in arousal rather than reward, suggesting that they may have a unique role and transmit a different type of information. Now, Menegas et al. have shown that dopamine signals in different areas of the striatum separate reward from novelty and other signals in mice. The results demonstrate that new odors activate dopamine neurons projecting to the tail of the striatum, but that this activity fades as the novelty wears off (as the mice learn to associate the odor with a particular outcome). By contrast, dopamine neurons projecting to the front of the striatum do not respond to novelty, but rather become more active as mice learn which odors accompany rewards (only responding to odors that predict reward). The experiments also show that dopamine neurons in the tail of the striatum encode information about the importance of a stimulus. Together, these findings indicate that some of the roles dopamine plays in the brain may not be related to reward, but are instead linked to the novelty and importance of the stimulus. The next challenge will be to find out how the separate reward and novelty signals in dopamine neurons relate to the animals’ behavior. This may help us to better understand dopamine-related psychiatric conditions, such as depression and addiction. DOI:http://dx.doi.org/10.7554/eLife.21886.002
Collapse
Affiliation(s)
- William Menegas
- Department of Molecular and Cellular Biology, Center for Brain Science, Harvard University, Cambridge, United States
| | - Benedicte M Babayan
- Department of Molecular and Cellular Biology, Center for Brain Science, Harvard University, Cambridge, United States
| | - Naoshige Uchida
- Department of Molecular and Cellular Biology, Center for Brain Science, Harvard University, Cambridge, United States
| | - Mitsuko Watabe-Uchida
- Department of Molecular and Cellular Biology, Center for Brain Science, Harvard University, Cambridge, United States
| |
Collapse
|
45
|
|
46
|
Xing Y, Shi X, Shen F, Zhou K, Zhao J. A Self-Organizing Incremental Neural Network based on local distribution learning. Neural Netw 2016; 84:143-160. [PMID: 27718392 DOI: 10.1016/j.neunet.2016.08.011] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2016] [Revised: 08/25/2016] [Accepted: 08/26/2016] [Indexed: 11/18/2022]
Abstract
In this paper, we propose an unsupervised incremental learning neural network based on local distribution learning, which is called Local Distribution Self-Organizing Incremental Neural Network (LD-SOINN). The LD-SOINN combines the advantages of incremental learning and matrix learning. It can automatically discover suitable nodes to fit the learning data in an incremental way without a priori knowledge such as the structure of the network. The nodes of the network store rich local information regarding the learning data. The adaptive vigilance parameter guarantees that LD-SOINN is able to add new nodes for new knowledge automatically and the number of nodes will not grow unlimitedly. While the learning process continues, nodes that are close to each other and have similar principal components are merged to obtain a concise local representation, which we call a relaxation data representation. A denoising process based on density is designed to reduce the influence of noise. Experiments show that the LD-SOINN performs well on both artificial and real-word data.
Collapse
Affiliation(s)
- Youlu Xing
- The National Key Laboratory for Novel Software Technology, Nanjing University, China; School of Computer Science and Technology, Anhui University, Hefei, 230601, China.
| | - Xiaofeng Shi
- The National Key Laboratory for Novel Software Technology, Nanjing University, China.
| | - Furao Shen
- The National Key Laboratory for Novel Software Technology, Nanjing University, China.
| | - Ke Zhou
- School of Statistics at University of International Business and Economics, Beijing, China.
| | - Jinxi Zhao
- The National Key Laboratory for Novel Software Technology, Nanjing University, China.
| |
Collapse
|
47
|
Ghesmoune M, Lebbah M, Azzag H. A new Growing Neural Gas for clustering data streams. Neural Netw 2016; 78:36-50. [PMID: 26997530 DOI: 10.1016/j.neunet.2016.02.003] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2014] [Revised: 12/22/2015] [Accepted: 02/09/2016] [Indexed: 11/17/2022]
Abstract
Clustering data streams is becoming the most efficient way to cluster a massive dataset. This task requires a process capable of partitioning observations continuously with restrictions of memory and time. In this paper we present a new algorithm, called G-Stream, for clustering data streams by making one pass over the data. G-Stream is based on growing neural gas, that allows us to discover clusters of arbitrary shapes without any assumptions on the number of clusters. By using a reservoir, and applying a fading function, the quality of clustering is improved. The performance of the proposed algorithm is evaluated on public datasets.
Collapse
Affiliation(s)
- Mohammed Ghesmoune
- University of Paris 13, Sorbonne Paris City LIPN-UMR 7030 - CNRS, 99, av. J-B Clément-F-93430 Villetaneuse, France.
| | - Mustapha Lebbah
- University of Paris 13, Sorbonne Paris City LIPN-UMR 7030 - CNRS, 99, av. J-B Clément-F-93430 Villetaneuse, France.
| | - Hanene Azzag
- University of Paris 13, Sorbonne Paris City LIPN-UMR 7030 - CNRS, 99, av. J-B Clément-F-93430 Villetaneuse, France.
| |
Collapse
|
48
|
Vallim RMM, de Mello RF. Unsupervised change detection in data streams: an application in music analysis. PROGRESS IN ARTIFICIAL INTELLIGENCE 2015. [DOI: 10.1007/s13748-015-0063-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
|
49
|
|
50
|
|