1
|
Adami C. The Elements of Intelligence. ARTIFICIAL LIFE 2023; 29:293-307. [PMID: 37490705 DOI: 10.1162/artl_a_00410] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/27/2023]
Affiliation(s)
- Christoph Adami
- Michigan State University, Department of Microbiology and Molecular Genetics, Department of Physics and Astronomy.
| |
Collapse
|
2
|
Abstract
This selective review explores biologically inspired learning as a model for intelligent robot control and sensing technology on the basis of specific examples. Hebbian synaptic learning is discussed as a functionally relevant model for machine learning and intelligence, as explained on the basis of examples from the highly plastic biological neural networks of invertebrates and vertebrates. Its potential for adaptive learning and control without supervision, the generation of functional complexity, and control architectures based on self-organization is brought forward. Learning without prior knowledge based on excitatory and inhibitory neural mechanisms accounts for the process through which survival-relevant or task-relevant representations are either reinforced or suppressed. The basic mechanisms of unsupervised biological learning drive synaptic plasticity and adaptation for behavioral success in living brains with different levels of complexity. The insights collected here point toward the Hebbian model as a choice solution for “intelligent” robotics and sensor systems.
Collapse
|
3
|
Gumbsch C, Butz MV, Martius G. Autonomous Identification and Goal-Directed Invocation of Event-Predictive Behavioral Primitives. IEEE Trans Cogn Dev Syst 2021. [DOI: 10.1109/tcds.2019.2925890] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
|
4
|
Zai AT, Cavé-Lopez S, Rolland M, Giret N, Hahnloser RHR. Sensory substitution reveals a manipulation bias. Nat Commun 2020; 11:5940. [PMID: 33230182 PMCID: PMC7684286 DOI: 10.1038/s41467-020-19686-w] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2019] [Accepted: 10/14/2020] [Indexed: 01/01/2023] Open
Abstract
Sensory substitution is a promising therapeutic approach for replacing a missing or diseased sensory organ by translating inaccessible information into another sensory modality. However, many substitution systems are not well accepted by subjects. To explore the effect of sensory substitution on voluntary action repertoires and their associated affective valence, we study deaf songbirds to which we provide visual feedback as a substitute of auditory feedback. Surprisingly, deaf birds respond appetitively to song-contingent binary visual stimuli. They skillfully adapt their songs to increase the rate of visual stimuli, showing that auditory feedback is not required for making targeted changes to vocal repertoires. We find that visually instructed song learning is basal-ganglia dependent. Because hearing birds respond aversively to the same visual stimuli, sensory substitution reveals a preference for actions that elicit sensory feedback over actions that do not, suggesting that substitution systems should be designed to exploit the drive to manipulate.
Collapse
Affiliation(s)
- Anja T Zai
- Institute of Neuroinformatics, University of Zurich and ETH Zurich, 8057, Zurich, Switzerland
- Neuroscience Center Zurich (ZNZ), University of Zurich and ETH Zurich, Zurich, Switzerland
| | - Sophie Cavé-Lopez
- Institute of Neuroinformatics, University of Zurich and ETH Zurich, 8057, Zurich, Switzerland
| | - Manon Rolland
- Institut des Neurosciences Paris Saclay, CNRS, Université Paris Saclay, Orsay, France
| | - Nicolas Giret
- Institut des Neurosciences Paris Saclay, CNRS, Université Paris Saclay, Orsay, France
| | - Richard H R Hahnloser
- Institute of Neuroinformatics, University of Zurich and ETH Zurich, 8057, Zurich, Switzerland.
- Neuroscience Center Zurich (ZNZ), University of Zurich and ETH Zurich, Zurich, Switzerland.
| |
Collapse
|
5
|
Abstract
AbstractJoining multiple decision-makers together is a powerful way to obtain more sophisticated decision-making systems, but requires to address the questions of division of labor and specialization. We investigate in how far information constraints in hierarchies of experts not only provide a principled method for regularization but also to enforce specialization. In particular, we devise an information-theoretically motivated on-line learning rule that allows partitioning of the problem space into multiple sub-problems that can be solved by the individual experts. We demonstrate two different ways to apply our method: (i) partitioning problems based on individual data samples and (ii) based on sets of data samples representing tasks. Approach (i) equips the system with the ability to solve complex decision-making problems by finding an optimal combination of local expert decision-makers. Approach (ii) leads to decision-makers specialized in solving families of tasks, which equips the system with the ability to solve meta-learning problems. We show the broad applicability of our approach on a range of problems including classification, regression, density estimation, and reinforcement learning problems, both in the standard machine learning setup and in a meta-learning setting.
Collapse
|
6
|
Hangl S, Dunjko V, Briegel HJ, Piater J. Skill Learning by Autonomous Robotic Playing Using Active Learning and Exploratory Behavior Composition. Front Robot AI 2020; 7:42. [PMID: 33501210 PMCID: PMC7806109 DOI: 10.3389/frobt.2020.00042] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2019] [Accepted: 03/09/2020] [Indexed: 11/13/2022] Open
Abstract
We consider the problem of autonomous acquisition of manipulation skills where problem-solving strategies are initially available only for a narrow range of situations. We propose to extend the range of solvable situations by autonomous play with the object. By applying previously-trained skills and behaviors, the robot learns how to prepare situations for which a successful strategy is already known. The information gathered during autonomous play is additionally used to train an environment model. This model is exploited for active learning and the generation of novel preparatory behaviors compositions. We apply our approach to a wide range of different manipulation tasks, e.g., book grasping, grasping of objects of different sizes by selecting different grasping strategies, placement on shelves, and tower disassembly. We show that the composite behavior generation mechanism enables the robot to solve previously-unsolvable tasks, e.g., tower disassembly. We use success statistics gained during real-world experiments to simulate the convergence behavior of our system. Simulation experiments show that the learning speed can be improved by around 30% by using active learning.
Collapse
Affiliation(s)
- Simon Hangl
- Intelligent and Interactive Systems, Department of Informatics, University of Innsbruck, Innsbruck, Austria
| | | | - Hans J. Briegel
- Institute for Theoretical Physics, University of Innsbruck, Innsbruck, Austria
| | - Justus Piater
- Intelligent and Interactive Systems, Department of Informatics, University of Innsbruck, Innsbruck, Austria
| |
Collapse
|
7
|
Santucci VG, Oudeyer PY, Barto A, Baldassarre G. Editorial: Intrinsically Motivated Open-Ended Learning in Autonomous Robots. Front Neurorobot 2020; 13:115. [PMID: 32009927 PMCID: PMC6978885 DOI: 10.3389/fnbot.2019.00115] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2019] [Accepted: 12/19/2019] [Indexed: 11/26/2022] Open
Affiliation(s)
- Vieri Giuliano Santucci
- Istituto di Scienze e Tecnologie della Cognizione, Consiglio Nazionale delle Ricerche, Rome, Italy
| | - Pierre-Yves Oudeyer
- Institut National de Recherche en Informatique et en Automatique (INRIA), Bordeaux, France
| | - Andrew Barto
- College of Information and Computer Sciences, University of Massachusetts, Amherst, MA, United States
| | - Gianluca Baldassarre
- Istituto di Scienze e Tecnologie della Cognizione, Consiglio Nazionale delle Ricerche, Rome, Italy
| |
Collapse
|
8
|
Roli A, Ligot A, Birattari M. Complexity Measures: Open Questions and Novel Opportunities in the Automatic Design and Analysis of Robot Swarms. Front Robot AI 2019; 6:130. [PMID: 33501145 PMCID: PMC7805888 DOI: 10.3389/frobt.2019.00130] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2018] [Accepted: 11/11/2019] [Indexed: 11/13/2022] Open
Abstract
Complexity measures and information theory metrics in general have recently been attracting the interest of multi-agent and robotics communities, owing to their capability of capturing relevant features of robot behaviors, while abstracting from implementation details. We believe that theories and tools from complex systems science and information theory may be fruitfully applied in the near future to support the automatic design of robot swarms and the analysis of their dynamics. In this paper we discuss opportunities and open questions in this scenario.
Collapse
Affiliation(s)
- Andrea Roli
- Department of Computer Science and Engineering, Campus of Cesena, Alma Mater Studiorum Università di Bologna, Bologna, Italy
| | - Antoine Ligot
- IRIDIA, Université libre de Bruxelles, Brussels, Belgium
| | | |
Collapse
|
9
|
Svenkeson A, West BJ. Persistent random motion with maximally correlated fluctuations. Phys Rev E 2019; 100:022119. [PMID: 31574651 DOI: 10.1103/physreve.100.022119] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2019] [Indexed: 11/07/2022]
Abstract
How often should a random walker change its direction of motion in order to maximize correlation in velocity fluctuations over a finite time interval? We address this optimal diffusion problem in the context of the one-dimensional persistent random walk, where we evaluate the correlation and mutual information in velocity trajectories as a function of the persistence level and the observation time. We find the optimal persistence level corresponds to the average number of direction reversals asymptotically scaling as the square root of the observation time. This square-root scaling law makes the relative growth between the average number of direction reversals and the persistence length invariant with respect to changes in the overall time duration of the random walk.
Collapse
Affiliation(s)
- Adam Svenkeson
- Vehicle Technology Directorate, Army Research Laboratory, Aberdeen Proving Ground, Aberdeen, Maryland 21005, USA
| | - Bruce J West
- Information Science Directorate, Army Research Office, Research Triangle Park, North Carolina 27703, USA
| |
Collapse
|
10
|
Koglin T, Sándor B, Gros C. When the goal is to generate a series of activities: A self-organized simulated robot arm. PLoS One 2019; 14:e0217004. [PMID: 31216272 PMCID: PMC6584010 DOI: 10.1371/journal.pone.0217004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2019] [Accepted: 04/09/2019] [Indexed: 11/19/2022] Open
Abstract
Behavior is characterized by sequences of goal oriented conducts, such as food uptake, socializing and resting. Classically, one would define for each task a corresponding satisfaction level, with the agent engaging, at a given time, in the activity having the lowest satisfaction level. Alternatively, one may consider that the agent follows the overarching objective to generate sequences of distinct activities. To achieve a balanced distribution of activities would then be the primary goal, and not to master a specific task. In this setting the agent would show two types of behaviors, task-oriented and task-searching phases, with the latter interseeding the former. We study the emergence of autonomous task switching for the case of a simulated robot arm. Grasping one of several moving objects corresponds in this setting to a specific activity. Overall, the arm should follow a given object temporarily and then move away, in order to search for a new target and reengage. We show that this behavior can be generated robustly when modeling the arm as an adaptive dynamical system. The dissipation function is in this approach time dependent. The arm is in a dissipative state when searching for a nearby object, dissipating energy on approach. Once close, the dissipation function starts to increase, with the eventual sign change implying that the arm will take up energy and wander off. The resulting explorative state ends when the dissipation function becomes again negative and the arm selects a new target. We believe that our approach may be generalized to generate self-organized sequences of activities in general.
Collapse
Affiliation(s)
- Tim Koglin
- Institute for Theoretical Physics, Goethe University Frankfurt, Frankfurt am Main, Germany
| | - Bulcsú Sándor
- Department of Physics, Babeș-Bolyai University, Cluj-Napoca, Romania
- * E-mail:
| | - Claudius Gros
- Institute for Theoretical Physics, Goethe University Frankfurt, Frankfurt am Main, Germany
| |
Collapse
|
11
|
Aguilera M, Bedia MG. Exploring Criticality as a Generic Adaptive Mechanism. Front Neurorobot 2018; 12:55. [PMID: 30333741 PMCID: PMC6176217 DOI: 10.3389/fnbot.2018.00055] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2018] [Accepted: 08/20/2018] [Indexed: 11/19/2022] Open
Abstract
The activity of many biological and cognitive systems is not poised deep within a specific regime of activity. Instead, they operate near points of critical behavior located at the boundary between different phases. Certain authors link some of the properties of criticality with the ability of living systems to generate autonomous or intrinsically generated behavior. However, these claims remain highly speculative. In this paper, we intend to explore the connection between criticality and autonomous behavior through conceptual models that show how embodied agents may adapt themselves toward critical points. We propose to exploit maximum entropy models and their formal descriptions of indicators of criticality to present a learning model that drives generic agents toward critical points. Specifically, we derive such a learning model in an embodied Boltzmann machine by implementing a gradient ascent rule that maximizes the heat capacity of the controller in order to make the network maximally sensitive to external perturbations. We test and corroborate the model by implementing an embodied agent in the Mountain Car benchmark test, which is controlled by a Boltzmann machine that adjusts its weights according to the model. We find that the neural controller reaches an apparent point of criticality, which coincides with a transition point of the behavior of the agent between two regimes of behavior, maximizing the synergistic information between its sensors and the combination of hidden and motor neurons. Finally, we discuss the potential of our learning model to answer questions about the connection between criticality and the capabilities of living systems to autonomously generate intrinsic constraints on their behavior. We suggest that these “critical agents” are able to acquire flexible behavioral patterns that are useful for the development of successful strategies in different contexts.
Collapse
Affiliation(s)
- Miguel Aguilera
- Information and Autonomous Systems-Research Center for Life, Mind, and Society, University of the Basque Country, Donostia, Spain.,Department of Computer Science and Systems Engineering, University of Zaragoza, Zaragoza, Spain
| | - Manuel G Bedia
- Department of Computer Science and Systems Engineering, University of Zaragoza, Zaragoza, Spain.,Interactive Systems, Adaptivity, Autonomy and Cognition Lab, Aragón Institute of Engineering Research, University of Zaragoza, Zaragoza, Spain
| |
Collapse
|
12
|
Biehl M, Guckelsberger C, Salge C, Smith SC, Polani D. Expanding the Active Inference Landscape: More Intrinsic Motivations in the Perception-Action Loop. Front Neurorobot 2018; 12:45. [PMID: 30214404 PMCID: PMC6125413 DOI: 10.3389/fnbot.2018.00045] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2018] [Accepted: 07/02/2018] [Indexed: 11/13/2022] Open
Abstract
Active inference is an ambitious theory that treats perception, inference, and action selection of autonomous agents under the heading of a single principle. It suggests biologically plausible explanations for many cognitive phenomena, including consciousness. In active inference, action selection is driven by an objective function that evaluates possible future actions with respect to current, inferred beliefs about the world. Active inference at its core is independent from extrinsic rewards, resulting in a high level of robustness across e.g., different environments or agent morphologies. In the literature, paradigms that share this independence have been summarized under the notion of intrinsic motivations. In general and in contrast to active inference, these models of motivation come without a commitment to particular inference and action selection mechanisms. In this article, we study if the inference and action selection machinery of active inference can also be used by alternatives to the originally included intrinsic motivation. The perception-action loop explicitly relates inference and action selection to the environment and agent memory, and is consequently used as foundation for our analysis. We reconstruct the active inference approach, locate the original formulation within, and show how alternative intrinsic motivations can be used while keeping many of the original features intact. Furthermore, we illustrate the connection to universal reinforcement learning by means of our formalism. Active inference research may profit from comparisons of the dynamics induced by alternative intrinsic motivations. Research on intrinsic motivations may profit from an additional way to implement intrinsically motivated agents that also share the biological plausibility of active inference.
Collapse
Affiliation(s)
| | - Christian Guckelsberger
- Computational Creativity Group, Department of Computing, Goldsmiths, University of London, London, United Kingdom
| | - Christoph Salge
- Game Innovation Lab, Department of Computer Science and Engineering, New York University, New York, NY, United States
- Sepia Lab, Adaptive Systems Research Group, Department of Computer Science, University of Hertfordshire, Hatfield, United Kingdom
| | - Simón C. Smith
- Sepia Lab, Adaptive Systems Research Group, Department of Computer Science, University of Hertfordshire, Hatfield, United Kingdom
- Institute of Perception, Action and Behaviour, School of Informatics, The University of Edinburgh, Edinburgh, United Kingdom
| | - Daniel Polani
- Sepia Lab, Adaptive Systems Research Group, Department of Computer Science, University of Hertfordshire, Hatfield, United Kingdom
| |
Collapse
|
13
|
Sándor B, Nowak M, Koglin T, Martin L, Gros C. Kick Control: Using the Attracting States Arising Within the Sensorimotor Loop of Self-Organized Robots as Motor Primitives. Front Neurorobot 2018; 12:40. [PMID: 30050427 PMCID: PMC6051224 DOI: 10.3389/fnbot.2018.00040] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2017] [Accepted: 06/20/2018] [Indexed: 12/03/2022] Open
Abstract
Self-organized robots may develop attracting states within the sensorimotor loop, that is within the phase space of neural activity, body and environmental variables. Fixpoints, limit cycles and chaotic attractors correspond in this setting to a non-moving robot, to directed, and to irregular locomotion respectively. Short higher-order control commands may hence be used to kick the system from one self-organized attractor robustly into the basin of attraction of a different attractor, a concept termed here as kick control. The individual sensorimotor states serve in this context as highly compliant motor primitives. We study different implementations of kick control for the case of simulated and real-world wheeled robots, for which the dynamics of the distinct wheels is generated independently by local feedback loops. The feedback loops are mediated by rate-encoding neurons disposing exclusively of propriosensoric inputs in terms of projections of the actual rotational angle of the wheel. The changes of the neural activity are then transmitted into a rotational motion by a simulated transmission rod akin to the transmission rods used for steam locomotives. We find that the self-organized attractor landscape may be morphed both by higher-level control signals, in the spirit of kick control, and by interacting with the environment. Bumping against a wall destroys the limit cycle corresponding to forward motion, with the consequence that the dynamical variables are then attracted in phase space by the limit cycle corresponding to backward moving. The robot, which does not dispose of any distance or contact sensors, hence reverses direction autonomously.
Collapse
Affiliation(s)
- Bulcsú Sándor
- Department of Physics, Babes-Bolyai University, Cluj-Napoca, Romania.,Institute for Theoretical Physics, Goethe University Frankfurt, Frankfurt am Main, Germany
| | - Michael Nowak
- Institute for Theoretical Physics, Goethe University Frankfurt, Frankfurt am Main, Germany
| | - Tim Koglin
- Institute for Theoretical Physics, Goethe University Frankfurt, Frankfurt am Main, Germany
| | - Laura Martin
- Institute for Theoretical Physics, Goethe University Frankfurt, Frankfurt am Main, Germany
| | - Claudius Gros
- Institute for Theoretical Physics, Goethe University Frankfurt, Frankfurt am Main, Germany
| |
Collapse
|
14
|
Der R, Martius G. Self-Organized Behavior Generation for Musculoskeletal Robots. Front Neurorobot 2017; 11:8. [PMID: 28360852 PMCID: PMC5352682 DOI: 10.3389/fnbot.2017.00008] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2016] [Accepted: 02/07/2017] [Indexed: 11/13/2022] Open
Abstract
With the accelerated development of robot technologies, control becomes one of the central themes of research. In traditional approaches, the controller, by its internal functionality, finds appropriate actions on the basis of specific objectives for the task at hand. While very successful in many applications, self-organized control schemes seem to be favored in large complex systems with unknown dynamics or which are difficult to model. Reasons are the expected scalability, robustness, and resilience of self-organizing systems. The paper presents a self-learning neurocontroller based on extrinsic differential plasticity introduced recently, applying it to an anthropomorphic musculoskeletal robot arm with attached objects of unknown physical dynamics. The central finding of the paper is the following effect: by the mere feedback through the internal dynamics of the object, the robot is learning to relate each of the objects with a very specific sensorimotor pattern. Specifically, an attached pendulum pilots the arm into a circular motion, a half-filled bottle produces axis oriented shaking behavior, a wheel is getting rotated, and wiping patterns emerge automatically in a table-plus-brush setting. By these object-specific dynamical patterns, the robot may be said to recognize the object's identity, or in other words, it discovers dynamical affordances of objects. Furthermore, when including hand coordinates obtained from a camera, a dedicated hand-eye coordination self-organizes spontaneously. These phenomena are discussed from a specific dynamical system perspective. Central is the dedicated working regime at the border to instability with its potentially infinite reservoir of (limit cycle) attractors “waiting” to be excited. Besides converging toward one of these attractors, variate behavior is also arising from a self-induced attractor morphing driven by the learning rule. We claim that experimental investigations with this anthropomorphic, self-learning robot not only generate interesting and potentially useful behaviors, but may also help to better understand what subjective human muscle feelings are, how they can be rooted in sensorimotor patterns, and how these concepts may feed back on robotics.
Collapse
Affiliation(s)
- Ralf Der
- Institute for Computer Science, University of Leipzig Leipzig, Germany
| | - Georg Martius
- IST AustriaKlosterneuburg, Austria; Autonomous Learning Group, Max Planck Institute for Intelligent SystemsTübingen, Germany
| |
Collapse
|
15
|
|
16
|
Martin L, Sándor B, Gros C. Closed-loop Robots Driven by Short-Term Synaptic Plasticity: Emergent Explorative vs. Limit-Cycle Locomotion. Front Neurorobot 2016; 10:12. [PMID: 27803661 PMCID: PMC5067527 DOI: 10.3389/fnbot.2016.00012] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2016] [Accepted: 10/03/2016] [Indexed: 11/13/2022] Open
Abstract
We examine the hypothesis, that short-term synaptic plasticity (STSP) may generate self-organized motor patterns. We simulated sphere-shaped autonomous robots, within the LPZRobots simulation package, containing three weights moving along orthogonal internal rods. The position of a weight is controlled by a single neuron receiving excitatory input from the sensor, measuring its actual position, and inhibitory inputs from the other two neurons. The inhibitory connections are transiently plastic, following physiologically inspired STSP-rules. We find that a wide palette of motion patterns are generated through the interaction of STSP, robot, and environment (closed-loop configuration), including various forward meandering and circular motions, together with chaotic trajectories. The observed locomotion is robust with respect to additional interactions with obstacles. In the chaotic phase the robot is seemingly engaged in actively exploring its environment. We believe that our results constitute a concept of proof that transient synaptic plasticity, as described by STSP, may potentially be important for the generation of motor commands and for the emergence of complex locomotion patterns, adapting seamlessly also to unexpected environmental feedback. We observe spontaneous and collision induced mode switchings, finding in addition, that locomotion may follow transiently limit cycles which are otherwise unstable. Regular locomotion corresponds to stable limit cycles in the sensorimotor loop, which may be characterized in turn by arbitrary angles of propagation. This degeneracy is, in our analysis, one of the drivings for the chaotic wandering observed for selected parameter settings, which is induced by the smooth diffusion of the angle of propagation.
Collapse
Affiliation(s)
| | - Bulcsú Sándor
- Institute for Theoretical Physics, Goethe University FrankfurtFrankfurt am Main, Germany
| | | |
Collapse
|
17
|
Der R. In Search for the Neural Mechanisms of Individual Development: Behavior-Driven Differential Hebbian Learning. Front Robot AI 2016. [DOI: 10.3389/frobt.2015.00037] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
|
18
|
Sándor B, Jahn T, Martin L, Gros C. The Sensorimotor Loop as a Dynamical System: How Regular Motion Primitives May Emerge from Self-Organized Limit Cycles. Front Robot AI 2015. [DOI: 10.3389/frobt.2015.00031] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
|
19
|
Soltoggio A, van der Velde F. Editorial: Neural plasticity for rich and uncertain robotic information streams. Front Neurorobot 2015; 9:12. [PMID: 26578947 PMCID: PMC4621436 DOI: 10.3389/fnbot.2015.00012] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2015] [Accepted: 10/08/2015] [Indexed: 01/22/2023] Open
|
20
|
Novel plasticity rule can explain the development of sensorimotor intelligence. Proc Natl Acad Sci U S A 2015; 112:E6224-32. [PMID: 26504200 DOI: 10.1073/pnas.1508400112] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Grounding autonomous behavior in the nervous system is a fundamental challenge for neuroscience. In particular, self-organized behavioral development provides more questions than answers. Are there special functional units for curiosity, motivation, and creativity? This paper argues that these features can be grounded in synaptic plasticity itself, without requiring any higher-level constructs. We propose differential extrinsic plasticity (DEP) as a new synaptic rule for self-learning systems and apply it to a number of complex robotic systems as a test case. Without specifying any purpose or goal, seemingly purposeful and adaptive rhythmic behavior is developed, displaying a certain level of sensorimotor intelligence. These surprising results require no system-specific modifications of the DEP rule. They rather arise from the underlying mechanism of spontaneous symmetry breaking, which is due to the tight brain body environment coupling. The new synaptic rule is biologically plausible and would be an interesting target for neurobiological investigation. We also argue that this neuronal mechanism may have been a catalyst in natural evolution.
Collapse
|
21
|
Marzen SE, DeWeese MR, Crutchfield JP. Time resolution dependence of information measures for spiking neurons: scaling and universality. Front Comput Neurosci 2015; 9:105. [PMID: 26379538 PMCID: PMC4551861 DOI: 10.3389/fncom.2015.00105] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2015] [Accepted: 07/30/2015] [Indexed: 11/15/2022] Open
Abstract
The mutual information between stimulus and spike-train response is commonly used to monitor neural coding efficiency, but neuronal computation broadly conceived requires more refined and targeted information measures of input-output joint processes. A first step toward that larger goal is to develop information measures for individual output processes, including information generation (entropy rate), stored information (statistical complexity), predictable information (excess entropy), and active information accumulation (bound information rate). We calculate these for spike trains generated by a variety of noise-driven integrate-and-fire neurons as a function of time resolution and for alternating renewal processes. We show that their time-resolution dependence reveals coarse-grained structural properties of interspike interval statistics; e.g., τ-entropy rates that diverge less quickly than the firing rate indicated by interspike interval correlations. We also find evidence that the excess entropy and regularized statistical complexity of different types of integrate-and-fire neurons are universal in the continuous-time limit in the sense that they do not depend on mechanism details. This suggests a surprising simplicity in the spike trains generated by these model neurons. Interestingly, neurons with gamma-distributed ISIs and neurons whose spike trains are alternating renewal processes do not fall into the same universality class. These results lead to two conclusions. First, the dependence of information measures on time resolution reveals mechanistic details about spike train generation. Second, information measures can be used as model selection tools for analyzing spike train processes.
Collapse
Affiliation(s)
- Sarah E. Marzen
- Department of Physics, University of California, BerkeleyBerkeley, CA, USA
| | - Michael R. DeWeese
- Department of Physics, University of California, BerkeleyBerkeley, CA, USA
- Helen Wills Neuroscience Institute and Redwood Center for Theoretical Neuroscience, University of California, BerkeleyBerkeley, CA, USA
| | - James P. Crutchfield
- Complexity Sciences Center and Department of Physics, University of California, DavisDavis, CA, USA
| |
Collapse
|
22
|
Informational and Causal Architecture of Discrete-Time Renewal Processes. ENTROPY 2015. [DOI: 10.3390/e17074891] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
|
23
|
|
24
|
The Fisher Information as a Neural Guiding Principle for Independent Component Analysis. ENTROPY 2015. [DOI: 10.3390/e17063838] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
|
25
|
Nakajima K, Schmidt N, Pfeifer R. Measuring information transfer in a soft robotic arm. BIOINSPIRATION & BIOMIMETICS 2015; 10:035007. [PMID: 25970447 DOI: 10.1088/1748-3190/10/3/035007] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/04/2023]
Abstract
Soft robots can exhibit diverse behaviors with simple types of actuation by partially outsourcing control to the morphological and material properties of their soft bodies, which is made possible by the tight coupling between control, body, and environment. In this paper, we present a method that will quantitatively characterize these diverse spatiotemporal dynamics of a soft body based on the information-theoretic approach. In particular, soft bodies have the ability to propagate the effect of actuation through the entire body, with a certain time delay, due to their elasticity. Our goal is to capture this delayed interaction in a quantitative manner based on a measure called momentary information transfer. We extend this measure to soft robotic applications and demonstrate its power using a physical soft robotic platform inspired by the octopus. Our approach is illustrated in two ways. First, we statistically characterize the delayed actuation propagation through the body as a strength of information transfer. Second, we capture this information propagation directly as local information dynamics. As a result, we show that our approach can successfully characterize the spatiotemporal dynamics of the soft robotic platform, explicitly visualizing how information transfers through the entire body with delays. Further extension scenarios of our approach are discussed for soft robotic applications in general.
Collapse
Affiliation(s)
- K Nakajima
- The Hakubi Center for Advanced Research, Kyoto University, 606-8501 Kyoto, Japan. Department of Applied Analysis and Complex Dynamical Systems, Graduate School of Informatics, Kyoto University, 606-8501 Kyoto, Japan
| | | | | |
Collapse
|
26
|
|
27
|
Fernandez-Leon JA, Acosta GG, Rozenfeld A. How simple autonomous decisions evolve into robust behaviours? A review from neurorobotics, cognitive, self-organized and artificial immune systems fields. Biosystems 2014; 124:7-20. [PMID: 25149273 DOI: 10.1016/j.biosystems.2014.08.003] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2014] [Revised: 08/13/2014] [Accepted: 08/15/2014] [Indexed: 10/24/2022]
Abstract
Researchers in diverse fields, such as in neuroscience, systems biology and autonomous robotics, have been intrigued by the origin and mechanisms for biological robustness. Darwinian evolution, in general, has suggested that adaptive mechanisms as a way of reaching robustness, could evolve by natural selection acting successively on numerous heritable variations. However, is this understanding enough for realizing how biological systems remain robust during their interactions with the surroundings? Here, we describe selected studies of bio-inspired systems that show behavioral robustness. From neurorobotics, cognitive, self-organizing and artificial immune system perspectives, our discussions focus mainly on how robust behaviors evolve or emerge in these systems, having the capacity of interacting with their surroundings. These descriptions are twofold. Initially, we introduce examples from autonomous robotics to illustrate how the process of designing robust control can be idealized in complex environments for autonomous navigation in terrain and underwater vehicles. We also include descriptions of bio-inspired self-organizing systems. Then, we introduce other studies that contextualize experimental evolution with simulated organisms and physical robots to exemplify how the process of natural selection can lead to the evolution of robustness by means of adaptive behaviors.
Collapse
Affiliation(s)
- Jose A Fernandez-Leon
- Centre for Computational Neuroscience and Robotics (CCNR), Informatics, University of Sussex, United Kingdom
| | - Gerardo G Acosta
- INTELYMEC-CIFICEN-CONICET, Engineering Faculty, Universidad Nacional del Centro de la Prov. de Buenos Aires and CONICET, Olavarría, Argentina; GEE - Department of Physics, Universitat de les Illes Balears, Palma de Mallorca, Spain.
| | - Alejandro Rozenfeld
- INTELYMEC-CIFICEN-CONICET, Engineering Faculty, Universidad Nacional del Centro de la Prov. de Buenos Aires and CONICET, Olavarría, Argentina; Rui Nabeiro Biodiversity Chair, CIBIO, University of Évora, Évora, Portugal.
| |
Collapse
|
28
|
Peng Z, Genewein T, Braun DA. Assessing randomness and complexity in human motion trajectories through analysis of symbolic sequences. Front Hum Neurosci 2014; 8:168. [PMID: 24744716 PMCID: PMC3978291 DOI: 10.3389/fnhum.2014.00168] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2013] [Accepted: 03/07/2014] [Indexed: 11/23/2022] Open
Abstract
Complexity is a hallmark of intelligent behavior consisting both of regular patterns and random variation. To quantitatively assess the complexity and randomness of human motion, we designed a motor task in which we translated subjects' motion trajectories into strings of symbol sequences. In the first part of the experiment participants were asked to perform self-paced movements to create repetitive patterns, copy pre-specified letter sequences, and generate random movements. To investigate whether the degree of randomness can be manipulated, in the second part of the experiment participants were asked to perform unpredictable movements in the context of a pursuit game, where they received feedback from an online Bayesian predictor guessing their next move. We analyzed symbol sequences representing subjects' motion trajectories with five common complexity measures: predictability, compressibility, approximate entropy, Lempel-Ziv complexity, as well as effective measure complexity. We found that subjects' self-created patterns were the most complex, followed by drawing movements of letters and self-paced random motion. We also found that participants could change the randomness of their behavior depending on context and feedback. Our results suggest that humans can adjust both complexity and regularity in different movement types and contexts and that this can be assessed with information-theoretic measures of the symbolic sequences generated from movement trajectories.
Collapse
Affiliation(s)
- Zhen Peng
- Max Planck Institute for Biological Cybernetics Tübingen, Germany ; Max Planck Institute for Intelligent Systems Tübingen, Germany ; Graduate Training Centre of Neuroscience Tübingen, Germany
| | - Tim Genewein
- Max Planck Institute for Biological Cybernetics Tübingen, Germany ; Max Planck Institute for Intelligent Systems Tübingen, Germany ; Graduate Training Centre of Neuroscience Tübingen, Germany
| | - Daniel A Braun
- Max Planck Institute for Biological Cybernetics Tübingen, Germany ; Max Planck Institute for Intelligent Systems Tübingen, Germany
| |
Collapse
|
29
|
Zahedi K, Martius G, Ay N. Linear combination of one-step predictive information with an external reward in an episodic policy gradient setting: a critical analysis. Front Psychol 2013; 4:801. [PMID: 24204351 PMCID: PMC3816314 DOI: 10.3389/fpsyg.2013.00801] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2013] [Accepted: 10/10/2013] [Indexed: 11/23/2022] Open
Abstract
One of the main challenges in the field of embodied artificial intelligence is the open-ended autonomous learning of complex behaviors. Our approach is to use task-independent, information-driven intrinsic motivation(s) to support task-dependent learning. The work presented here is a preliminary step in which we investigate the predictive information (the mutual information of the past and future of the sensor stream) as an intrinsic drive, ideally supporting any kind of task acquisition. Previous experiments have shown that the predictive information (PI) is a good candidate to support autonomous, open-ended learning of complex behaviors, because a maximization of the PI corresponds to an exploration of morphology- and environment-dependent behavioral regularities. The idea is that these regularities can then be exploited in order to solve any given task. Three different experiments are presented and their results lead to the conclusion that the linear combination of the one-step PI with an external reward function is not generally recommended in an episodic policy gradient setting. Only for hard tasks a great speed-up can be achieved at the cost of an asymptotic performance lost.
Collapse
Affiliation(s)
- Keyan Zahedi
- Max Planck Institute for Mathematics in the Sciences Leipzig, Germany
| | | | | |
Collapse
|