Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Manoonpong P, Geng T, Kulvicius T, Porr B, Wörgötter F. Adaptive, fast walking in a biped robot under neuronal control and learning. PLoS Comput Biol 2008;3:e134. [PMID: 17630828 PMCID: PMC1914373 DOI: 10.1371/journal.pcbi.0030134] [Citation(s) in RCA: 77] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2007] [Accepted: 05/30/2007] [Indexed: 11/22/2022] Open

For:	Manoonpong P, Geng T, Kulvicius T, Porr B, Wörgötter F. Adaptive, fast walking in a biped robot under neuronal control and learning. PLoS Comput Biol 2008;3:e134. [PMID: 17630828 PMCID: PMC1914373 DOI: 10.1371/journal.pcbi.0030134] [Citation(s) in RCA: 77] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2007] [Accepted: 05/30/2007] [Indexed: 11/22/2022] Open

Number

Cited by Other Article(s)

Ramdya P, Ijspeert AJ. The neuromechanics of animal locomotion: From biology to robotics and back. Sci Robot 2023;8:eadg0279. [PMID: 37256966 DOI: 10.1126/scirobotics.adg0279] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2022] [Accepted: 05/05/2023] [Indexed: 06/02/2023]

Thor M, Manoonpong P. Versatile modular neural locomotion control with fast learning. NAT MACH INTELL 2022. [DOI: 10.1038/s42256-022-00444-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Song S, Kidziński Ł, Peng XB, Ong C, Hicks J, Levine S, Atkeson CG, Delp SL. Deep reinforcement learning for modeling human locomotion control in neuromechanical simulation. J Neuroeng Rehabil 2021;18:126. [PMID: 34399772 PMCID: PMC8365920 DOI: 10.1186/s12984-021-00919-y] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2020] [Accepted: 07/29/2021] [Indexed: 11/10/2022] Open

Abstract

Modeling human motor control and predicting how humans will move in novel environments is a grand scientific challenge. Researchers in the fields of biomechanics and motor control have proposed and evaluated motor control models via neuromechanical simulations, which produce physically correct motions of a musculoskeletal model. Typically, researchers have developed control models that encode physiologically plausible motor control hypotheses and compared the resulting simulation behaviors to measurable human motion data. While such plausible control models were able to simulate and explain many basic locomotion behaviors (e.g. walking, running, and climbing stairs), modeling higher layer controls (e.g. processing environment cues, planning long-term motion strategies, and coordinating basic motor skills to navigate in dynamic and complex environments) remains a challenge. Recent advances in deep reinforcement learning lay a foundation for modeling these complex control processes and controlling a diverse repertoire of human movement; however, reinforcement learning has been rarely applied in neuromechanical simulation to model human control. In this paper, we review the current state of neuromechanical simulations, along with the fundamentals of reinforcement learning, as it applies to human locomotion. We also present a scientific competition and accompanying software platform, which we have organized to accelerate the use of reinforcement learning in neuromechanical simulations. This “Learn to Move” competition was an official competition at the NeurIPS conference from 2017 to 2019 and attracted over 1300 teams from around the world. Top teams adapted state-of-the-art deep reinforcement learning techniques and produced motions, such as quick turning and walk-to-stand transitions, that have not been demonstrated before in neuromechanical simulations without utilizing reference motion data. We close with a discussion of future opportunities at the intersection of human movement simulation and reinforcement learning and our plans to extend the Learn to Move competition to further facilitate interdisciplinary collaboration in modeling human motor control for biomechanics and rehabilitation research

Collapse

Zamboni R, Owaki D, Hayashibe M. Adaptive and Energy-Efficient Optimal Control in CPGs Through Tegotae-Based Feedback. Front Robot AI 2021;8:632804. [PMID: 34124172 PMCID: PMC8187776 DOI: 10.3389/frobt.2021.632804] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2020] [Accepted: 05/03/2021] [Indexed: 11/29/2022] Open

Owaki D, Horikiri SY, Nishii J, Ishiguro A. Tegotae-Based Control Produces Adaptive Inter- and Intra-limb Coordination in Bipedal Walking. Front Neurorobot 2021;15:629595. [PMID: 34054453 PMCID: PMC8149599 DOI: 10.3389/fnbot.2021.629595] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2020] [Accepted: 04/07/2021] [Indexed: 11/16/2022] Open

Urbain G, Barasuol V, Semini C, Dambre J, wyffels F. Effect of compliance on morphological control of dynamic locomotion with HyQ. Auton Robots 2021. [DOI: 10.1007/s10514-021-09974-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]

Haeufle DFB, Stollenmaier K, Heinrich I, Schmitt S, Ghazi-Zahedi K. Morphological Computation Increases From Lower- to Higher-Level of Biological Motor Control Hierarchy. Front Robot AI 2020;7:511265. [PMID: 33501299 PMCID: PMC7805613 DOI: 10.3389/frobt.2020.511265] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2019] [Accepted: 08/24/2020] [Indexed: 11/29/2022] Open

Abstract

Voluntary movements, like point-to-point or oscillatory human arm movements, are generated by the interaction of several structures. High-level neuronal circuits in the brain are responsible for planning and initiating a movement. Spinal circuits incorporate proprioceptive feedback to compensate for deviations from the desired movement. Muscle biochemistry and contraction dynamics generate movement driving forces and provide an immediate physical response to external forces, like a low-level decentralized controller. A simple central neuronal command like "initiate a movement" then recruits all these biological structures and processes leading to complex behavior, e.g., generate a stable oscillatory movement in resonance with an external spring-mass system. It has been discussed that the spinal feedback circuits, the biochemical processes, and the biomechanical muscle dynamics contribute to the movement generation, and, thus, take over some parts of the movement generation and stabilization which would otherwise have to be performed by the high-level controller. This contribution is termed morphological computation and can be quantified with information entropy-based approaches. However, it is unknown whether morphological computation actually differs between these different hierarchical levels of the control system. To investigate this, we simulated point-to-point and oscillatory human arm movements with a neuro-musculoskeletal model. We then quantify morphological computation on the different hierarchy levels. The results show that morphological computation is highest for the most central (highest) level of the modeled control hierarchy, where the movement initiation and timing are encoded. Furthermore, they show that the lowest neuronal control layer, the muscle stimulation input, exploits the morphological computation of the biochemical and biophysical muscle characteristics to generate smooth dynamic movements. This study provides evidence that the system's design in the mechanical as well as in the neurological structure can take over important contributions to control, which would otherwise need to be performed by the higher control levels.

Collapse

A model for the transfer of control from the brain to the spinal cord through synaptic learning. J Comput Neurosci 2020;48:365-375. [PMID: 33009635 DOI: 10.1007/s10827-020-00767-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2019] [Revised: 08/06/2020] [Accepted: 09/11/2020] [Indexed: 12/20/2022]

Nassour J, Duy Hoa T, Atoofi P, Hamker F. Concrete Action Representation Model: From Neuroscience to Robotics. IEEE Trans Cogn Dev Syst 2020. [DOI: 10.1109/tcds.2019.2896300] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]

Ishihara K, Itoh TD, Morimoto J. Full-Body Optimal Control Toward Versatile and Agile Behaviors in a Humanoid Robot. IEEE Robot Autom Lett 2020. [DOI: 10.1109/lra.2019.2947001] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]

Dingwell JB, Cusumano JP. Humans use multi-objective control to regulate lateral foot placement when walking. PLoS Comput Biol 2019;15:e1006850. [PMID: 30840620 PMCID: PMC6422313 DOI: 10.1371/journal.pcbi.1006850] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2018] [Revised: 03/18/2019] [Accepted: 02/06/2019] [Indexed: 01/01/2023] Open

Abstract

A fundamental question in human motor neuroscience is to determine how the nervous system generates goal-directed movements despite inherent physiological noise and redundancy. Walking exhibits considerable variability and equifinality of task solutions. Existing models of bipedal walking do not yet achieve both continuous dynamic balance control and the equifinality of foot placement humans exhibit. Appropriate computational models are critical to disambiguate the numerous possibilities of how to regulate stepping movements to achieve different walking goals. Here, we extend a theoretical and computational Goal Equivalent Manifold (GEM) framework to generate predictive models, each posing a different experimentally testable hypothesis. These models regulate stepping movements to achieve any of three hypothesized goals, either alone or in combination: maintain lateral position, maintain lateral speed or “heading”, and/or maintain step width. We compared model predictions against human experimental data. Uni-objective control models demonstrated clear redundancy between stepping variables, but could not replicate human stepping dynamics. Most multi-objective control models that balanced maintaining two of the three hypothesized goals also failed to replicate human stepping dynamics. However, multi-objective models that strongly prioritized regulating step width over lateral position did successfully replicate all of the relevant step-to-step dynamics observed in humans. Independent analyses confirmed this control was consistent with linear error correction and replicated step-to-step dynamics of individual foot placements. Thus, the regulation of lateral stepping movements is inherently multi-objective and balances task-specific trade-offs between competing task goals. To determine how people walk in their environment requires understanding both walking biomechanics and how the nervous system regulates movements from step-to-step. Analogous to mechanical “templates” of locomotor biomechanics, our models serve as “control templates” for how humans regulate stepping movements from each step to the next. These control templates are symbiotic with well-established mechanical templates, providing complimentary insights into walking regulation.

When we walk, we walk in real-world contexts and with specific goal to achieve. Side-to-side movements are paramount because walking bipeds (humans, animals, robots, etc.) are inherently more unstable laterally. This is particularly important in older adults as sideways falls greatly increase hip fracture risk. Additionally, we normally walk on paths that limit (more or less) our lateral movements. Appropriately regulating lateral stepping movements is thus critical to achieving successful locomotion in any such context. Here, we use appropriate models to test competing hypotheses about how humans regulate lateral stepping movements from each step to the next to identify what task goals they try to achieve. Our work both bridges and unifies perspectives from dynamic walking and computational motor control to provide a coherent theoretical and computational framework from which to study motor regulation in the context of goal-directedness across a wide range of walking tasks and/or conditions.

Collapse

Marjaninejad A, Urbina-Meléndez D, Cohn BA, Valero-Cuevas FJ. Autonomous Functional Movements in a Tendon-Driven Limb via Limited Experience. NAT MACH INTELL 2019;1:144-154. [PMID: 31161156 PMCID: PMC6544439 DOI: 10.1038/s42256-019-0029-0] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2018] [Accepted: 02/05/2019] [Indexed: 11/09/2022]

Meng L, Macleod CA, Porr B, Gollee H. Bipedal robotic walking control derived from analysis of human locomotion. BIOLOGICAL CYBERNETICS 2018;112:277-290. [PMID: 29399713 PMCID: PMC6002472 DOI: 10.1007/s00422-018-0750-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/11/2016] [Accepted: 01/09/2018] [Indexed: 06/07/2023]

Aoi S, Manoonpong P, Ambe Y, Matsuno F, Wörgötter F. Adaptive Control Strategies for Interlimb Coordination in Legged Robots: A Review. Front Neurorobot 2017;11:39. [PMID: 28878645 PMCID: PMC5572352 DOI: 10.3389/fnbot.2017.00039] [Citation(s) in RCA: 59] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2016] [Accepted: 07/31/2017] [Indexed: 12/02/2022] Open

Der R, Martius G. Self-Organized Behavior Generation for Musculoskeletal Robots. Front Neurorobot 2017;11:8. [PMID: 28360852 PMCID: PMC5352682 DOI: 10.3389/fnbot.2017.00008] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2016] [Accepted: 02/07/2017] [Indexed: 11/13/2022] Open

Abstract

With the accelerated development of robot technologies, control becomes one of the central themes of research. In traditional approaches, the controller, by its internal functionality, finds appropriate actions on the basis of specific objectives for the task at hand. While very successful in many applications, self-organized control schemes seem to be favored in large complex systems with unknown dynamics or which are difficult to model. Reasons are the expected scalability, robustness, and resilience of self-organizing systems. The paper presents a self-learning neurocontroller based on extrinsic differential plasticity introduced recently, applying it to an anthropomorphic musculoskeletal robot arm with attached objects of unknown physical dynamics. The central finding of the paper is the following effect: by the mere feedback through the internal dynamics of the object, the robot is learning to relate each of the objects with a very specific sensorimotor pattern. Specifically, an attached pendulum pilots the arm into a circular motion, a half-filled bottle produces axis oriented shaking behavior, a wheel is getting rotated, and wiping patterns emerge automatically in a table-plus-brush setting. By these object-specific dynamical patterns, the robot may be said to recognize the object's identity, or in other words, it discovers dynamical affordances of objects. Furthermore, when including hand coordinates obtained from a camera, a dedicated hand-eye coordination self-organizes spontaneously. These phenomena are discussed from a specific dynamical system perspective. Central is the dedicated working regime at the border to instability with its potentially infinite reservoir of (limit cycle) attractors “waiting” to be excited. Besides converging toward one of these attractors, variate behavior is also arising from a self-induced attractor morphing driven by the learning rule. We claim that experimental investigations with this anthropomorphic, self-learning robot not only generate interesting and potentially useful behaviors, but may also help to better understand what subjective human muscle feelings are, how they can be rooted in sensorimotor patterns, and how these concepts may feed back on robotics.

Collapse

Meng L, Porr B, Macleod CA, Gollee H. A functional electrical stimulation system for human walking inspired by reflexive control principles. Proc Inst Mech Eng H 2017;231:315-325. [PMID: 28332444 PMCID: PMC5405833 DOI: 10.1177/0954411917693879] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2016] [Accepted: 01/16/2017] [Indexed: 11/24/2022]

Shaikh D, Manoonpong P. An Adaptive Neural Mechanism for Acoustic Motion Perception with Varying Sparsity. Front Neurorobot 2017;11:11. [PMID: 28337137 PMCID: PMC5343069 DOI: 10.3389/fnbot.2017.00011] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2016] [Accepted: 02/20/2017] [Indexed: 11/14/2022] Open

An Adaptive Neural Mechanism with a Lizard Ear Model for Binaural Acoustic Tracking. ACTA ACUST UNITED AC 2016. [DOI: 10.1007/978-3-319-43488-9_8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register]

Fujiki S, Aoi S, Funato T, Tomita N, Senda K, Tsuchiya K. Adaptation mechanism of interlimb coordination in human split-belt treadmill walking through learning of foot contact timing: a robotics study. J R Soc Interface 2016;12:0542. [PMID: 26289658 PMCID: PMC4614464 DOI: 10.1098/rsif.2015.0542] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023] Open

Abstract

Human walking behaviour adaptation strategies have previously been examined using split-belt treadmills, which have two parallel independently controlled belts. In such human split-belt treadmill walking, two types of adaptations have been identified: early and late. Early-type adaptations appear as rapid changes in interlimb and intralimb coordination activities when the belt speeds of the treadmill change between tied (same speed for both belts) and split-belt (different speeds for each belt) configurations. By contrast, late-type adaptations occur after the early-type adaptations as a gradual change and only involve interlimb coordination. Furthermore, interlimb coordination shows after-effects that are related to these adaptations. It has been suggested that these adaptations are governed primarily by the spinal cord and cerebellum, but the underlying mechanism remains unclear. Because various physiological findings suggest that foot contact timing is crucial to adaptive locomotion, this paper reports on the development of a two-layered control model for walking composed of spinal and cerebellar models, and on its use as the focus of our control model. The spinal model generates rhythmic motor commands using an oscillator network based on a central pattern generator and modulates the commands formulated in immediate response to foot contact, while the cerebellar model modifies motor commands through learning based on error information related to differences between the predicted and actual foot contact timings of each leg. We investigated adaptive behaviour and its mechanism by split-belt treadmill walking experiments using both computer simulations and an experimental bipedal robot. Our results showed that the robot exhibited rapid changes in interlimb and intralimb coordination that were similar to the early-type adaptations observed in humans. In addition, despite the lack of direct interlimb coordination control, gradual changes and after-effects in the interlimb coordination appeared in a manner that was similar to the late-type adaptations and after-effects observed in humans. The adaptation results of the robot were then evaluated in comparison with human split-belt treadmill walking, and the adaptation mechanism was clarified from a dynamic viewpoint.

Collapse

Song S, Desai R, Geyer H. Integration of an adaptive swing control into a neuromuscular human walking model. ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL INTERNATIONAL CONFERENCE 2015;2013:4915-8. [PMID: 24110837 DOI: 10.1109/embc.2013.6610650] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]

Dingwell JB, Cusumano JP. Identifying stride-to-stride control strategies in human treadmill walking. PLoS One 2015;10:e0124879. [PMID: 25910253 PMCID: PMC4409060 DOI: 10.1371/journal.pone.0124879] [Citation(s) in RCA: 52] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2014] [Accepted: 03/18/2015] [Indexed: 01/05/2023] Open

Abstract

Variability is ubiquitous in human movement, arising from internal and external noise, inherent biological redundancy, and from the neurophysiological control actions that help regulate movement fluctuations. Increased walking variability can lead to increased energetic cost and/or increased fall risk. Conversely, biological noise may be beneficial, even necessary, to enhance motor performance. Indeed, encouraging more variability actually facilitates greater improvements in some forms of locomotor rehabilitation. Thus, it is critical to identify the fundamental principles humans use to regulate stride-to-stride fluctuations in walking. This study sought to determine how humans regulate stride-to-stride fluctuations in stepping movements during treadmill walking. We developed computational models based on pre-defined goal functions to compare if subjects, from each stride to the next, tried to maintain the same speed as the treadmill, or instead stay in the same position on the treadmill. Both strategies predicted average behaviors empirically indistinguishable from each other and from that of humans. These strategies, however, predicted very different stride-to-stride fluctuation dynamics. Comparisons to experimental data showed that human stepping movements were generally well-predicted by the speed-control model, but not by the position-control model. Human subjects also exhibited no indications they corrected deviations in absolute position only intermittently: i.e., closer to the boundaries of the treadmill. Thus, humans clearly do not adopt a control strategy whose primary goal is to maintain some constant absolute position on the treadmill. Instead, humans appear to regulate their stepping movements in a way most consistent with a strategy whose primary goal is to try to maintain the same speed as the treadmill at each consecutive stride. These findings have important implications both for understanding how biological systems regulate walking in general and for being able to harness these mechanisms to develop more effective rehabilitation interventions to improve locomotor performance.

Collapse

Dasgupta S, Wörgötter F, Manoonpong P. Neuromodulatory adaptive combination of correlation-based learning in cerebellum and reward-based learning in basal ganglia for goal-directed behavior control. Front Neural Circuits 2014;8:126. [PMID: 25389391 PMCID: PMC4211401 DOI: 10.3389/fncir.2014.00126] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2014] [Accepted: 09/30/2014] [Indexed: 12/30/2022] Open

Abstract

Goal-directed decision making in biological systems is broadly based on associations between conditional and unconditional stimuli. This can be further classified as classical conditioning (correlation-based learning) and operant conditioning (reward-based learning). A number of computational and experimental studies have well established the role of the basal ganglia in reward-based learning, where as the cerebellum plays an important role in developing specific conditioned responses. Although viewed as distinct learning systems, recent animal experiments point toward their complementary role in behavioral learning, and also show the existence of substantial two-way communication between these two brain structures. Based on this notion of co-operative learning, in this paper we hypothesize that the basal ganglia and cerebellar learning systems work in parallel and interact with each other. We envision that such an interaction is influenced by reward modulated heterosynaptic plasticity (RMHP) rule at the thalamus, guiding the overall goal directed behavior. Using a recurrent neural network actor-critic model of the basal ganglia and a feed-forward correlation-based learning model of the cerebellum, we demonstrate that the RMHP rule can effectively balance the outcomes of the two learning systems. This is tested using simulated environments of increasing complexity with a four-wheeled robot in a foraging task in both static and dynamic configurations. Although modeled with a simplified level of biological abstraction, we clearly demonstrate that such a RMHP induced combinatorial learning mechanism, leads to stabler and faster learning of goal-directed behaviors, in comparison to the individual systems. Thus, in this paper we provide a computational model for adaptive combination of the basal ganglia and cerebellum learning systems by way of neuromodulated plasticity for goal-directed decision making in biological and bio-mimetic organisms.

Collapse

Macleod CA, Meng L, Conway BA, Porr B. Reflex control of robotic gait using human walking data. PLoS One 2014;9:e109959. [PMID: 25347544 PMCID: PMC4210155 DOI: 10.1371/journal.pone.0109959] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2014] [Accepted: 09/08/2014] [Indexed: 11/23/2022] Open

Ijspeert AJ. Biorobotics: Using robots to emulate and investigate agile locomotion. Science 2014;346:196-203. [DOI: 10.1126/science.1254486] [Citation(s) in RCA: 288] [Impact Index Per Article: 28.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]

He B, Wang Z, Shen R, Hu S. Real-time Walking Pattern Generation for a Biped Robot with Hybrid CPG-ZMP Algorithm. INT J ADV ROBOT SYST 2014. [DOI: 10.5772/58845] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022] Open

Marques HG, Bharadwaj A, Iida F. From spontaneous motor activity to coordinated behaviour: a developmental model. PLoS Comput Biol 2014;10:e1003653. [PMID: 25057775 PMCID: PMC4109855 DOI: 10.1371/journal.pcbi.1003653] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2013] [Accepted: 04/18/2014] [Indexed: 01/09/2023] Open

Abstract

In mammals, the developmental path that links the primary behaviours observed during foetal stages to the full fledged behaviours observed in adults is still beyond our understanding. Often theories of motor control try to deal with the process of incremental learning in an abstract and modular way without establishing any correspondence with the mammalian developmental stages. In this paper, we propose a computational model that links three distinct behaviours which appear at three different stages of development. In order of appearance, these behaviours are: spontaneous motor activity (SMA), reflexes, and coordinated behaviours, such as locomotion. The goal of our model is to address in silico four hypotheses that are currently hard to verify in vivo: First, the hypothesis that spinal reflex circuits can be self-organized from the sensor and motor activity induced by SMA. Second, the hypothesis that supraspinal systems can modulate reflex circuits to achieve coordinated behaviour. Third, the hypothesis that, since SMA is observed in an organism throughout its entire lifetime, it provides a mechanism suitable to maintain the reflex circuits aligned with the musculoskeletal system, and thus adapt to changes in body morphology. And fourth, the hypothesis that by changing the modulation of the reflex circuits over time, one can switch between different coordinated behaviours. Our model is tested in a simulated musculoskeletal leg actuated by six muscles arranged in a number of different ways. Hopping is used as a case study of coordinated behaviour. Our results show that reflex circuits can be self-organized from SMA, and that, once these circuits are in place, they can be modulated to achieve coordinated behaviour. In addition, our results show that our model can naturally adapt to different morphological changes and perform behavioural transitions.

Collapse

Nassour J, Hénaff P, Benouezdou F, Cheng G. Multi-layered multi-pattern CPG for adaptive locomotion of humanoid robots. BIOLOGICAL CYBERNETICS 2014;108:291-303. [PMID: 24570353 DOI: 10.1007/s00422-014-0592-8] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/08/2013] [Accepted: 02/03/2014] [Indexed: 06/03/2023]

Toutounji H, Pasemann F. Behavior control in the sensorimotor loop with short-term synaptic dynamics induced by self-regulating neurons. Front Neurorobot 2014;8:19. [PMID: 24904403 PMCID: PMC4033235 DOI: 10.3389/fnbot.2014.00019] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2013] [Accepted: 05/07/2014] [Indexed: 12/02/2022] Open

Abstract

The behavior and skills of living systems depend on the distributed control provided by specialized and highly recurrent neural networks. Learning and memory in these systems is mediated by a set of adaptation mechanisms, known collectively as neuronal plasticity. Translating principles of recurrent neural control and plasticity to artificial agents has seen major strides, but is usually hampered by the complex interactions between the agent's body and its environment. One of the important standing issues is for the agent to support multiple stable states of behavior, so that its behavioral repertoire matches the requirements imposed by these interactions. The agent also must have the capacity to switch between these states in time scales that are comparable to those by which sensory stimulation varies. Achieving this requires a mechanism of short-term memory that allows the neurocontroller to keep track of the recent history of its input, which finds its biological counterpart in short-term synaptic plasticity. This issue is approached here by deriving synaptic dynamics in recurrent neural networks. Neurons are introduced as self-regulating units with a rich repertoire of dynamics. They exhibit homeostatic properties for certain parameter domains, which result in a set of stable states and the required short-term memory. They can also operate as oscillators, which allow them to surpass the level of activity imposed by their homeostatic operation conditions. Neural systems endowed with the derived synaptic dynamics can be utilized for the neural behavior control of autonomous mobile agents. The resulting behavior depends also on the underlying network structure, which is either engineered or developed by evolutionary techniques. The effectiveness of these self-regulating units is demonstrated by controlling locomotion of a hexapod with 18 degrees of freedom, and obstacle-avoidance of a wheel-driven robot.

Collapse

Silva P, Matos V, Santos CP. Visually guided gait modifications for stepping over an obstacle: a bio-inspired approach. BIOLOGICAL CYBERNETICS 2014;108:103-119. [PMID: 24469319 DOI: 10.1007/s00422-014-0586-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/30/2013] [Accepted: 01/13/2014] [Indexed: 06/03/2023]

Li C, Lowe R, Ziemke T. Humanoids Learning to Walk: A Natural CPG-Actor-Critic Architecture. Front Neurorobot 2013;7:5. [PMID: 23675345 PMCID: PMC3619089 DOI: 10.3389/fnbot.2013.00005] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2012] [Accepted: 03/06/2013] [Indexed: 11/13/2022] Open

Schröder-Schetelig J, Manoonpong P, Wörgötter F. Using efference copy and a forward internal model for adaptive biped walking. Auton Robots 2010. [DOI: 10.1007/s10514-010-9199-7] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]

Dingwell JB, John J, Cusumano JP. Do humans optimally exploit redundancy to control step variability in walking? PLoS Comput Biol 2010;6:e1000856. [PMID: 20657664 PMCID: PMC2904769 DOI: 10.1371/journal.pcbi.1000856] [Citation(s) in RCA: 138] [Impact Index Per Article: 9.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2009] [Accepted: 06/10/2010] [Indexed: 11/30/2022] Open

Abstract

It is widely accepted that humans and animals minimize energetic cost while walking. While such principles predict average behavior, they do not explain the variability observed in walking. For robust performance, walking movements must adapt at each step, not just on average. Here, we propose an analytical framework that reconciles issues of optimality, redundancy, and stochasticity. For human treadmill walking, we defined a goal function to formulate a precise mathematical definition of one possible control strategy: maintain constant speed at each stride. We recorded stride times and stride lengths from healthy subjects walking at five speeds. The specified goal function yielded a decomposition of stride-to-stride variations into new gait variables explicitly related to achieving the hypothesized strategy. Subjects exhibited greatly decreased variability for goal-relevant gait fluctuations directly related to achieving this strategy, but far greater variability for goal-irrelevant fluctuations. More importantly, humans immediately corrected goal-relevant deviations at each successive stride, while allowing goal-irrelevant deviations to persist across multiple strides. To demonstrate that this was not the only strategy people could have used to successfully accomplish the task, we created three surrogate data sets. Each tested a specific alternative hypothesis that subjects used a different strategy that made no reference to the hypothesized goal function. Humans did not adopt any of these viable alternative strategies. Finally, we developed a sequence of stochastic control models of stride-to-stride variability for walking, based on the Minimum Intervention Principle. We demonstrate that healthy humans are not precisely "optimal," but instead consistently slightly over-correct small deviations in walking speed at each stride. Our results reveal a new governing principle for regulating stride-to-stride fluctuations in human walking that acts independently of, but in parallel with, minimizing energetic cost. Thus, humans exploit task redundancies to achieve robust control while minimizing effort and allowing potentially beneficial motor variability.

Collapse

Duff A, Verschure PF. Unifying perceptual and behavioral learning with a correlative subspace learning rule. Neurocomputing 2010. [DOI: 10.1016/j.neucom.2009.11.048] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]

Friston KJ, Daunizeau J, Kilner J, Kiebel SJ. Action and behavior: a free-energy formulation. BIOLOGICAL CYBERNETICS 2010;102:227-60. [PMID: 20148260 DOI: 10.1007/s00422-010-0364-z] [Citation(s) in RCA: 347] [Impact Index Per Article: 24.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/30/2009] [Accepted: 01/19/2010] [Indexed: 05/19/2023]

Iida F, Tedrake R. Minimalistic control of biped walking in rough terrain. Auton Robots 2010. [DOI: 10.1007/s10514-009-9174-3] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]

Yavuz M, Ocak H, Hetherington VJ, Davis BL. Prediction of plantar shear stress distribution by artificial intelligence methods. J Biomech Eng 2009;131:091007. [PMID: 19725696 DOI: 10.1115/1.3130453] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]

Braun D, Goldfarb M. A Control Approach for Actuated Dynamic Walking in Biped Robots. IEEE T ROBOT 2009. [DOI: 10.1109/tro.2009.2028762] [Citation(s) in RCA: 62] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]

Pitti A, Lungarella M, Kuniyoshi Y. Generating spatiotemporal joint torque patterns from dynamical synchronization of distributed pattern generators. Front Neurorobot 2009;3:2. [PMID: 20011216 PMCID: PMC2790947 DOI: 10.3389/neuro.12.002.2009] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2009] [Accepted: 09/13/2009] [Indexed: 11/13/2022] Open

Friston KJ, Daunizeau J, Kiebel SJ. Reinforcement learning or active inference? PLoS One 2009;4:e6421. [PMID: 19641614 PMCID: PMC2713351 DOI: 10.1371/journal.pone.0006421] [Citation(s) in RCA: 176] [Impact Index Per Article: 11.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2009] [Accepted: 03/19/2009] [Indexed: 11/18/2022] Open

Genetic algorithm-based optimal bipedal walking gait synthesis considering tradeoff between stability margin and speed. ROBOTICA 2009. [DOI: 10.1017/s026357470800475x] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]

Kolodziejski C, Porr B, Wörgötter F. Mathematical properties of neuronal TD-rules and differential Hebbian learning: a comparison. BIOLOGICAL CYBERNETICS 2008;98:259-272. [PMID: 18196266 PMCID: PMC2798052 DOI: 10.1007/s00422-007-0209-6] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/19/2007] [Accepted: 12/19/2007] [Indexed: 05/25/2023]

Abstract

A confusingly wide variety of temporally asymmetric learning rules exists related to reinforcement learning and/or to spike-timing dependent plasticity, many of which look exceedingly similar, while displaying strongly different behavior. These rules often find their use in control tasks, for example in robotics and for this rigorous convergence and numerical stability is required. The goal of this article is to review these rules and compare them to provide a better overview over their different properties. Two main classes will be discussed: temporal difference (TD) rules and correlation based (differential hebbian) rules and some transition cases. In general we will focus on neuronal implementations with changeable synaptic weights and a time-continuous representation of activity. In a machine learning (non-neuronal) context, for TD-learning a solid mathematical theory has existed since several years. This can partly be transferred to a neuronal framework, too. On the other hand, only now a more complete theory has also emerged for differential Hebb rules. In general rules differ by their convergence conditions and their numerical stability, which can lead to very undesirable behavior, when wanting to apply them. For TD, convergence can be enforced with a certain output condition assuring that the delta-error drops on average to zero (output control). Correlation based rules, on the other hand, converge when one input drops to zero (input control). Temporally asymmetric learning rules treat situations where incoming stimuli follow each other in time. Thus, it is necessary to remember the first stimulus to be able to relate it to the later occurring second one. To this end different types of so-called eligibility traces are being used by these two different types of rules. This aspect leads again to different properties of TD and differential Hebbian learning as discussed here. Thus, this paper, while also presenting several novel mathematical results, is mainly meant to provide a road map through the different neuronally emulated temporal asymmetrical learning rules and their behavior to provide some guidance for possible applications.

Collapse