1
|
Penna MF, Trigili E, Amato L, Eken H, Dell'Agnello F, Lanotte F, Gruppioni E, Vitiello N, Crea S. Decoding Upper-Limb Movement Intention Through Adaptive Dynamic Movement Primitives: A Proof-of-Concept Study with a Shoulder-Elbow Exoskeleton. IEEE Int Conf Rehabil Robot 2023; 2023:1-6. [PMID: 37941281 DOI: 10.1109/icorr58425.2023.10304723] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2023]
Abstract
This work presents an intention decoding algorithm that can be used to control a 4 degrees-of-freedom shoulder-elbow exoskeleton in reaching tasks. The algorithm was designed to assist the movement of users with upper-limb impairments who can initiate the movement by themselves. It relies on the observation of the initial part of the user's movement through joint angle measures and aims to estimate in real-time the phase of the movement and predict the goal position of the hand in the reaching task. The algorithm is based on adaptive Dynamic Movement Primitives and Gaussian Mixture Models. The performance of the algorithm was verified in robot-assisted planar reaching movements performed by one healthy subject wearing the exoskeleton. Tests included movements of different amplitudes and orientations. Results showed that the algorithm could predict the hand's final position with an error lower than 5 cm after 0.25 s from the movement onset, and that the final position reached during the tests was on average less than 4 cm far from the target position. Finally, the effects of the assistance were observed in a reduction of the activation of the Biceps Brachii and of the time to execute the reaching tasks.
Collapse
|
2
|
Floegel M, Kasper J, Perrier P, Kell CA. How the conception of control influences our understanding of actions. Nat Rev Neurosci 2023; 24:313-329. [PMID: 36997716 DOI: 10.1038/s41583-023-00691-z] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/28/2023] [Indexed: 04/01/2023]
Abstract
Wilful movement requires neural control. Commonly, neural computations are thought to generate motor commands that bring the musculoskeletal system - that is, the plant - from its current physical state into a desired physical state. The current state can be estimated from past motor commands and from sensory information. Modelling movement on the basis of this concept of plant control strives to explain behaviour by identifying the computational principles for control signals that can reproduce the observed features of movements. From an alternative perspective, movements emerge in a dynamically coupled agent-environment system from the pursuit of subjective perceptual goals. Modelling movement on the basis of this concept of perceptual control aims to identify the controlled percepts and their coupling rules that can give rise to the observed characteristics of behaviour. In this Perspective, we discuss a broad spectrum of approaches to modelling human motor control and their notions of control signals, internal models, handling of sensory feedback delays and learning. We focus on the influence that the plant control and the perceptual control perspective may have on decisions when modelling empirical data, which may in turn shape our understanding of actions.
Collapse
Affiliation(s)
- Mareike Floegel
- Department of Neurology and Brain Imaging Center, Goethe University Frankfurt, Frankfurt, Germany
| | - Johannes Kasper
- Department of Neurology and Brain Imaging Center, Goethe University Frankfurt, Frankfurt, Germany
| | - Pascal Perrier
- Univ. Grenoble Alpes, CNRS, Grenoble INP, GIPSA-lab, Grenoble, France
| | - Christian A Kell
- Department of Neurology and Brain Imaging Center, Goethe University Frankfurt, Frankfurt, Germany.
| |
Collapse
|
3
|
Deep Intelligence: What AI Should Learn from Nature’s Imagination. Cognit Comput 2023. [DOI: 10.1007/s12559-023-10124-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/08/2023]
|
4
|
Wang K, Fan Y, Sakuma I. Robot Programming from a Single Demonstration for High Precision Industrial Insertion. SENSORS (BASEL, SWITZERLAND) 2023; 23:2514. [PMID: 36904718 PMCID: PMC10006960 DOI: 10.3390/s23052514] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/01/2023] [Revised: 02/06/2023] [Accepted: 02/20/2023] [Indexed: 06/18/2023]
Abstract
We propose a novel approach for robotic industrial insertion tasks using the Programming by Demonstration technique. Our method allows robots to learn a high-precision task by observing human demonstration once, without requiring any prior knowledge of the object. We introduce an Imitated-to-Finetuned approach that generates imitated approach trajectories by cloning the human hand's movements and then fine-tunes the goal position with a visual servoing approach. To identify features on the object used in visual servoing, we model object tracking as the moving object detection problem, separating each demonstration video frame into the moving foreground that includes the object and demonstrator's hand and the static background. Then a hand keypoints estimation function is used to remove the redundant features on the hand. The experiment shows that the proposed method can make robots learn precision industrial insertion tasks from a single human demonstration.
Collapse
Affiliation(s)
- Kaimeng Wang
- FANUC Advanced Research Laboratory, FANUC America Corporation, Union City, CA 94587, USA
| | - Yongxiang Fan
- FANUC Advanced Research Laboratory, FANUC America Corporation, Union City, CA 94587, USA
| | - Ichiro Sakuma
- Department of Precision Engineering, The University of Tokyo, Tokyo 113-8654, Japan
| |
Collapse
|
5
|
Köksal Ersöz E, Chossat P, Krupa M, Lavigne F. Dynamic branching in a neural network model for probabilistic prediction of sequences. J Comput Neurosci 2022; 50:537-557. [PMID: 35948839 DOI: 10.1007/s10827-022-00830-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2022] [Revised: 07/11/2022] [Accepted: 07/13/2022] [Indexed: 10/15/2022]
Abstract
An important function of the brain is to predict which stimulus is likely to occur based on the perceived cues. The present research studied the branching behavior of a computational network model of populations of excitatory and inhibitory neurons, both analytically and through simulations. Results show how synaptic efficacy, retroactive inhibition and short-term synaptic depression determine the dynamics of selection between different branches predicting sequences of stimuli of different probabilities. Further results show that changes in the probability of the different predictions depend on variations of neuronal gain. Such variations allow the network to optimize the probability of its predictions to changing probabilities of the sequences without changing synaptic efficacy.
Collapse
Affiliation(s)
- Elif Köksal Ersöz
- Univ Rennes, INSERM, LTSI - UMR 1099, Campus Beaulieu, Rennes, F-35000, France. .,Project Team MathNeuro, INRIA-CNRS-UNS, 2004 route des Lucioles-BP 93, Sophia Antipolis, 06902, France.
| | - Pascal Chossat
- Project Team MathNeuro, INRIA-CNRS-UNS, 2004 route des Lucioles-BP 93, Sophia Antipolis, 06902, France.,Université Côte d'Azur, Laboratoire Jean-Alexandre Dieudonné, Campus Valrose, Nice, 06300, France
| | - Martin Krupa
- Project Team MathNeuro, INRIA-CNRS-UNS, 2004 route des Lucioles-BP 93, Sophia Antipolis, 06902, France.,Université Côte d'Azur, Laboratoire Jean-Alexandre Dieudonné, Campus Valrose, Nice, 06300, France
| | - Frédéric Lavigne
- Université Côte d'Azur, CNRS-BCL, Campus Saint Jean d'Angely, Nice, 06300, France
| |
Collapse
|
6
|
Yu X, Liu P, He W, Liu Y, Chen Q, Ding L. Human-Robot Variable Impedance Skills Transfer Learning Based on Dynamic Movement Primitives. IEEE Robot Autom Lett 2022. [DOI: 10.1109/lra.2022.3154469] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Affiliation(s)
- Xinbo Yu
- Institute of Artificial Intelligence, University of Science and Technology Beijing, Beijing, China
| | - Peisen Liu
- Institute of Artificial Intelligence, School of Automation and Electrical Engineering, and Beijing Advanced Innovation Center for Materials Genome Engineering, University of Science and Technology Beijing, Beijing, China
| | - Wei He
- Institute of Artificial Intelligence, School of Automation and Electrical Engineering, and Beijing Advanced Innovation Center for Materials Genome Engineering, University of Science and Technology Beijing, Beijing, China
| | - Yu Liu
- School of Automation Science and Engineering, South China University of Technology, Guangzhou, China
| | - Qi Chen
- Institute of Artificial Intelligence, School of Automation and Electrical Engineering, and Beijing Advanced Innovation Center for Materials Genome Engineering, University of Science and Technology Beijing, Beijing, China
| | - Liang Ding
- State Key Laboratory of Robotics and Systems, Harbin Institute of Technology, Harbin, China
| |
Collapse
|
7
|
Sharifi M, Zakerimanesh A, Mehr JK, Torabi A, Mushahwar VK, Tavakoli M. Impedance Variation and Learning Strategies in Human-Robot Interaction. IEEE TRANSACTIONS ON CYBERNETICS 2022; 52:6462-6475. [PMID: 33449901 DOI: 10.1109/tcyb.2020.3043798] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/27/2023]
Abstract
In this survey, various concepts and methodologies developed over the past two decades for varying and learning the impedance or admittance of robotic systems that physically interact with humans are explored. For this purpose, the assumptions and mathematical formulations for the online adjustment of impedance models and controllers for physical human-robot interaction (HRI) are categorized and compared. In this systematic review, studies on: 1) variation and 2) learning of appropriate impedance elements are taken into account. These strategies are classified and described in terms of their objectives, points of view (approaches), and signal requirements (including position, HRI force, and electromyography activity). Different methods involving linear/nonlinear analyses (e.g., optimal control design and nonlinear Lyapunov-based stability guarantee) and the Gaussian approximation algorithms (e.g., Gaussian mixture model-based and dynamic movement primitives-based strategies) are reviewed. Current challenges and research trends in physical HRI are finally discussed.
Collapse
|
8
|
Zhang X, Seigler TM, Hoagg JB. The Impact of Nonminimum-Phase Zeros on Human-in-the-Loop Control Systems. IEEE TRANSACTIONS ON CYBERNETICS 2022; 52:5098-5112. [PMID: 33151888 DOI: 10.1109/tcyb.2020.3027502] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
We present results from an experiment in which 33 human subjects interact with a dynamic system 40 times over a one-week period. The subjects are divided into three groups. For each interaction, a subject performs a command-following task, where the reference command is the same for all trials and all subjects. However, each group interacts with a different dynamic system, which is represented by a transfer function. The transfer functions have the same poles but different zeros. One has a minimum-phase zero , another has a nonminimum-phase zero , and the last has a slower (i.e., closer to the imaginary axis) nonminimum-phase zero zsn ∈ (0,zn) . The experimental results show that nonminimum-phase zeros tend to make dynamic systems more difficult for humans to learn to control. We use a subsystem identification algorithm to identify the control strategy that each subject uses on each trial. The identification results show that the identified feedforward controllers approximate the inverse dynamics of the system with which the subjects interact better on the last trial than on the first trial. However, the subjects interacting with the minimum-phase system are able to approximate the inverse dynamics in feedforward more accurately than the subjects interacting with the nonminimum-phase system. This observation suggests that nonminimum-phase zeros are an impediment to approximating inverse dynamics in feedforward. Finally, we provide evidence that humans rely on feedforward-step-like-control strategies with systems (e.g., nonminimum-phase systems) for which it is difficult to approximate the inverse dynamics in feedforward.
Collapse
|
9
|
Seepanomwan K, Caligiore D, O'Regan KJ, Baldassarre G. Intrinsic Motivations and Planning to Explain Tool-Use Development: A Study With a Simulated Robot Model. IEEE Trans Cogn Dev Syst 2022. [DOI: 10.1109/tcds.2020.2986411] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
|
10
|
Abstract
To control unmanned aerial systems, we rarely have a perfect system model. Safe and aggressive planning is also challenging for nonlinear and under-actuated systems. Expert pilots, however, demonstrate maneuvers that are deemed at the edge of plane envelope. Inspired by biological systems, in this paper, we introduce a framework that leverages methods in the field of control theory and reinforcement learning to generate feasible, possibly aggressive, trajectories. For the control policies, Dynamic Movement Primitives (DMPs) imitate pilot-induced primitives, and DMPs are combined in parallel to generate trajectories to reach original or different goal points. The stability properties of DMPs and their overall systems are analyzed using contraction theory. For reinforcement learning, Policy Improvement with Path Integrals (PI2) was used for the maneuvers. The results in this paper show that PI2 updated policies are a feasible and parallel combination of different updated primitives transfer the learning in the contraction regions. Our proposed methodology can be used to imitate, reshape, and improve feasible, possibly aggressive, maneuvers. In addition, we can exploit trajectories generated by optimization methods, such as Model Predictive Control (MPC), and a library of maneuvers can be instantly generated. For application, 3-DOF (degrees of freedom) Helicopter and 2D-UAV (unmanned aerial vehicle) models are utilized to demonstrate the main results.
Collapse
|
11
|
Lanotte F, McKinney Z, Grazi L, Chen B, Crea S, Vitiello N. Adaptive Control Method for Dynamic Synchronization of Wearable Robotic Assistance to Discrete Movements: Validation for Use Case of Lifting Tasks. IEEE T ROBOT 2021. [DOI: 10.1109/tro.2021.3073836] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
12
|
Frölich S, Marković D, Kiebel SJ. Neuronal Sequence Models for Bayesian Online Inference. Front Artif Intell 2021; 4:530937. [PMID: 34095815 PMCID: PMC8176225 DOI: 10.3389/frai.2021.530937] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2021] [Accepted: 04/13/2021] [Indexed: 11/13/2022] Open
Abstract
Various imaging and electrophysiological studies in a number of different species and brain regions have revealed that neuronal dynamics associated with diverse behavioral patterns and cognitive tasks take on a sequence-like structure, even when encoding stationary concepts. These neuronal sequences are characterized by robust and reproducible spatiotemporal activation patterns. This suggests that the role of neuronal sequences may be much more fundamental for brain function than is commonly believed. Furthermore, the idea that the brain is not simply a passive observer but an active predictor of its sensory input, is supported by an enormous amount of evidence in fields as diverse as human ethology and physiology, besides neuroscience. Hence, a central aspect of this review is to illustrate how neuronal sequences can be understood as critical for probabilistic predictive information processing, and what dynamical principles can be used as generators of neuronal sequences. Moreover, since different lines of evidence from neuroscience and computational modeling suggest that the brain is organized in a functional hierarchy of time scales, we will also review how models based on sequence-generating principles can be embedded in such a hierarchy, to form a generative model for recognition and prediction of sensory input. We shortly introduce the Bayesian brain hypothesis as a prominent mathematical description of how online, i.e., fast, recognition, and predictions may be computed by the brain. Finally, we briefly discuss some recent advances in machine learning, where spatiotemporally structured methods (akin to neuronal sequences) and hierarchical networks have independently been developed for a wide range of tasks. We conclude that the investigation of specific dynamical and structural principles of sequential brain activity not only helps us understand how the brain processes information and generates predictions, but also informs us about neuroscientific principles potentially useful for designing more efficient artificial neuronal networks for machine learning tasks.
Collapse
Affiliation(s)
- Sascha Frölich
- Department of Psychology, Technische Universität Dresden, Dresden, Germany
| | | | | |
Collapse
|
13
|
Abstract
SUMMARYDynamic movement primitives (DMP) are motion building blocks suitable for real-world tasks. We suggest a methodology for learning the manifold of task and DMP parameters, which facilitates runtime adaptation to changes in task requirements while ensuring predictable and robust performance. For efficient learning, the parameter space is analyzed using principal component analysis and locally linear embedding. Two manifold learning methods: kernel estimation and deep neural networks, are investigated for a ball throwing task in simulation and in a physical environment. Low runtime estimation errors are obtained for both learning methods, with an advantage to kernel estimation when data sets are small.
Collapse
|
14
|
Gafos A, van Lieshout P. Editorial: Models and Theories of Speech Production. Front Psychol 2020; 11:1238. [PMID: 32636783 PMCID: PMC7316957 DOI: 10.3389/fpsyg.2020.01238] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2020] [Accepted: 05/12/2020] [Indexed: 11/13/2022] Open
Affiliation(s)
- Adamantios Gafos
- Department of Linguistics and Excellence Area of Cognitive Sciences, University of Potsdam, Potsdam, Germany
| | - Pascal van Lieshout
- Department of Speech-Language Pathology, Oral Dynamics Laboratory, University of Toronto, Toronto, ON, Canada
| |
Collapse
|
15
|
Bazzi S, Sternad D. Human control of complex objects: Towards more dexterous robots. Adv Robot 2020; 34:1137-1155. [PMID: 33100448 PMCID: PMC7577404 DOI: 10.1080/01691864.2020.1777198] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2020] [Revised: 05/08/2020] [Accepted: 05/27/2020] [Indexed: 10/24/2022]
Abstract
Manipulation of objects with underactuated dynamics remains a challenge for robots. In contrast, humans excel at 'tool use' and more insight into human control strategies may inform robotic control architectures. We examined human control of objects that exhibit complex - underactuated, nonlinear, and potentially chaotic dynamics, such as transporting a cup of coffee. Simple control strategies appropriate for unconstrained movements, such as maximizing smoothness, fail as interaction forces have to be compensated or preempted. However, predictive control based on internal models appears daunting when the objects have nonlinear and unpredictable dynamics. We hypothesized that humans learn strategies that make these interactions predictable. Using a virtual environment subjects interacted with a virtual cup and rolling ball using a robotic visual and haptic interface. Two different metrics quantified predictability: stability or contraction, and mutual information between controller and object. In point-to-point displacements subjects exploited the contracting regions of the object dynamics to safely navigate perturbations. Control contraction metrics showed that subjects used a controller that exponentially stabilized trajectories. During continuous cup-and-ball displacements subjects developed predictable solutions sacrificing smoothness and energy efficiency. These results may stimulate control strategies for dexterous robotic manipulators and human-robot interaction.
Collapse
Affiliation(s)
- Salah Bazzi
- Department of Biology, Northeastern University, Boston, Massachusetts 02115, USA
- Department of Electrical and Computer Engineering, Northeastern University, Boston, Massachusetts 02115, USA
| | - Dagmar Sternad
- Department of Biology, Northeastern University, Boston, Massachusetts 02115, USA
- Department of Electrical and Computer Engineering, Northeastern University, Boston, Massachusetts 02115, USA
- Department of Physics, Northeastern University, Boston, Massachusetts 02115, USA
| |
Collapse
|
16
|
Bazzi S, Sternad D. Robustness in Human Manipulation of Dynamically Complex Objects through Control Contraction Metrics. IEEE Robot Autom Lett 2020; 5:2578-2585. [PMID: 32219173 PMCID: PMC7098464 DOI: 10.1109/lra.2020.2972863] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
Control and manipulation of objects with underactuated dynamics remains a challenge for robots. Due to their typically nonlinear dynamics, it is computationally taxing to implement model-based planning and control techniques. Yet humans can skillfully manipulate such objects, seemingly with ease. More insight into human control strategies may inform how to enhance control strategies in robots. This study examined human control of objects that exhibit complex - underactuated and nonlinear - dynamics. We hypothesized that humans seek to make their trajectories exponentially stable to achieve robustness in the face of external perturbations. A stable trajectory is also robust to the high levels of noise in the human neuromotor system. Motivated by the task of carrying a cup of coffee, a virtual implementation of transporting a cart-pendulum system was developed. Subjects interacted with the virtual system via a robotic manipulandum that provided a haptic and visual interface. Human subjects were instructed to transport this simplified system to a target position as fast as possible without 'spilling coffee', while accommodating different visible perturbations that could be anticipated. To test the hypothesis of exponential convergence, tools from the framework of control contraction metrics were leveraged to analyze human trajectories. Results showed that with practice the trajectories indeed became exponentially stable, selectively around the perturbation. While these findings are agnostic about the involvement of feedback and feedforward control, they do support the hypothesis that humans learn to make trajectories stable, consistent with achieving predictability.
Collapse
Affiliation(s)
- Salah Bazzi
- Salah Bazzi and Dagmar Sternad are in the Department of Electrical and Computer Engineering and the Department of Biology, Northeastern University, Boston, Massachusetts 02115
| | - Dagmar Sternad
- Salah Bazzi and Dagmar Sternad are in the Department of Electrical and Computer Engineering and the Department of Biology, Northeastern University, Boston, Massachusetts 02115
| |
Collapse
|
17
|
Parrell B, Lammert AC. Bridging Dynamical Systems and Optimal Trajectory Approaches to Speech Motor Control With Dynamic Movement Primitives. Front Psychol 2019; 10:2251. [PMID: 31681077 PMCID: PMC6802577 DOI: 10.3389/fpsyg.2019.02251] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2019] [Accepted: 09/19/2019] [Indexed: 11/13/2022] Open
Abstract
Current models of speech motor control rely on either trajectory-based control (DIVA, GEPPETO, ACT) or a dynamical systems approach based on feedback control (Task Dynamics, FACTS). While both approaches have provided insights into the speech motor system, it is difficult to connect these findings across models given the distinct theoretical and computational bases of the two approaches. We propose a new extension of the most widely used dynamical systems approach, Task Dynamics, that incorporates many of the strengths of trajectory-based approaches, providing a way to bridge the theoretical divide between what have been two separate approaches to understanding speech motor control. The Task Dynamics (TD) model posits that speech gestures are governed by point attractor dynamics consistent with a critically damped harmonic oscillator. Kinematic trajectories associated with such gestures should therefore be consistent with a second-order dynamical system, possibly modified by blending with temporally overlapping gestures or altering oscillator parameters. This account of observed kinematics is powerful and theoretically appealing, but may be insufficient to account for deviations from predicted kinematics-i.e., changes produced in response to some external perturbations to the jaw, changes in control during acquisition and development, or effects of word/syllable frequency. Optimization, such as would be needed to minimize articulatory effort, is also incompatible with the current TD model, though the idea that the speech production systems economizes effort has a long history and, importantly, also plays a critical role in current theories of domain-general human motor control. To address these issues, we use Dynamic Movement Primitives (DMPs) to expand a dynamical systems framework for speech motor control to allow modification of kinematic trajectories by incorporating a simple, learnable forcing term into existing point attractor dynamics. We show that integration of DMPs with task-based point-attractor dynamics enhances the potential explanatory power of TD in a number of critical ways, including the ability to account for external forces in planning and optimizing both kinematic and dynamic movement costs. At the same time, this approach preserves the successes of Task Dynamics in handling multi-gesture planning and coordination.
Collapse
Affiliation(s)
- Benjamin Parrell
- Department of Communication Sciences & Disorders, University of Wisconsin-Madison, Madison, WI, United States
| | - Adam C. Lammert
- Department of Biomedical Engineering, Worcester Polytechnic Institute, Worcester, MA, United States
- Bioengineering Systems & Technologies, MIT Lincoln Laboratory, Lexington, MA, United States
| |
Collapse
|
18
|
Motion encoding with asynchronous trajectories of repetitive teleoperation tasks and its extension to human-agent shared teleoperation. Auton Robots 2019. [DOI: 10.1007/s10514-019-09853-4] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
|
19
|
Brandherm F, Peters J, Neumann G, Akrour R. Learning Replanning Policies With Direct Policy Search. IEEE Robot Autom Lett 2019. [DOI: 10.1109/lra.2019.2901656] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
|
20
|
Deng Z, Guan H, Huang R, Liang H, Zhang L, Zhang J. Combining Model-Based $Q$ -Learning With Structural Knowledge Transfer for Robot Skill Learning. IEEE Trans Cogn Dev Syst 2019. [DOI: 10.1109/tcds.2017.2718938] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
21
|
Abstract
Small-variance asymptotics is emerging as a useful technique for inference in large-scale Bayesian non-parametric mixture models. This paper analyzes the online learning of robot manipulation tasks with Bayesian non-parametric mixture models under small-variance asymptotics. The analysis yields a scalable online sequence clustering (SOSC) algorithm that is non-parametric in the number of clusters and the subspace dimension of each cluster. SOSC groups the new datapoint in low-dimensional subspaces by online inference in a non-parametric mixture of probabilistic principal component analyzers (MPPCA) based on a Dirichlet process, and captures the state transition and state duration information online in a hidden semi-Markov model (HSMM) based on a hierarchical Dirichlet process. A task-parameterized formulation of our approach autonomously adapts the model to changing environmental situations during manipulation. We apply the algorithm in a teleoperation setting to recognize the intention of the operator and remotely adjust the movement of the robot using the learned model. The generative model is used to synthesize both time-independent and time-dependent behaviors by relying on the principles of shared and autonomous control. Experiments with the Baxter robot yield parsimonious clusters that adapt online with new demonstrations and assist the operator in performing remote manipulation tasks.
Collapse
Affiliation(s)
- Ajay Kumar Tanwani
- Idiap Research Institute and EPFL, Martigny, Switzerland
- University of California, Berkeley, CA, USA
| | | |
Collapse
|
22
|
Petric T, Gams A, Colasanto L, Ijspeert AJ, Ude A. Accelerated Sensorimotor Learning of Compliant Movement Primitives. IEEE T ROBOT 2018. [DOI: 10.1109/tro.2018.2861921] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
23
|
Raffalt PC, Vallabhajosula S, Renz JJ, Mukherjee M, Stergiou N. Lower limb joint angle variability and dimensionality are different in stairmill climbing and treadmill walking. ROYAL SOCIETY OPEN SCIENCE 2018; 5:180996. [PMID: 30662723 PMCID: PMC6304153 DOI: 10.1098/rsos.180996] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/20/2018] [Accepted: 10/18/2018] [Indexed: 06/09/2023]
Abstract
The present study tested if the quadratic relationship which exists between stepping frequency and gait dynamics in walking can be generalized to stairmill climbing. To accomplish this, we investigated the joint angle dynamics and variability during continuous stairmill climbing at stepping frequencies both above and below the preferred stepping frequency (PSF). Nine subjects performed stairmill climbing at 80, 90, 100, 110 and 120% PSF and treadmill walking at preferred walking speed during which sagittal hip, knee and ankle angles were extracted. Joint angle dynamics were quantified by the largest Lyapunov exponent (LyE) and correlation dimension (CoD). Joint angle variability was estimated by the mean ensemble standard deviation (meanSD). MeanSD and CoD for all joints were significantly higher during stairmill climbing but there were no task differences in LyE. Changes in stepping frequency had only limited effect on joint angle variability and did not affect joint angle dynamics. Thus, we concluded that the quadratic relationship between stepping frequency and gait dynamics observed in walking is not present in stairmill climbing based on the investigated parameters.
Collapse
Affiliation(s)
- P. C. Raffalt
- Julius Wolff Institute for Biomechanics and Musculoskeletal Regeneration, Charité – Universitätsmedizin Berlin, Berlin, Germany
- Department of Biomedical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - S. Vallabhajosula
- Department of Physical Therapy Education, School of Health Sciences, Elon University, Elon, NC, USA
| | - J. J. Renz
- Department of Biomechanics, College of Education, University of Nebraska Medical Center, Omaha, NE, USA
| | - M. Mukherjee
- Department of Biomechanics, College of Education, University of Nebraska Medical Center, Omaha, NE, USA
| | - N. Stergiou
- Department of Biomechanics, College of Education, University of Nebraska Medical Center, Omaha, NE, USA
- Department of Environmental Agricultural and Occupational Health, College of Public Health, University of Nebraska Medical Center, Omaha, NE, USA
| |
Collapse
|
24
|
Calvo Tapia C, Tyukin IY, Makarov VA. Fast social-like learning of complex behaviors based on motor motifs. Phys Rev E 2018; 97:052308. [PMID: 29906958 DOI: 10.1103/physreve.97.052308] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2018] [Indexed: 01/01/2023]
Abstract
Social learning is widely observed in many species. Less experienced agents copy successful behaviors exhibited by more experienced individuals. Nevertheless, the dynamical mechanisms behind this process remain largely unknown. Here we assume that a complex behavior can be decomposed into a sequence of n motor motifs. Then a neural network capable of activating motor motifs in a given sequence can drive an agent. To account for (n-1)! possible sequences of motifs in a neural network, we employ the winnerless competition approach. We then consider a teacher-learner situation: one agent exhibits a complex movement, while another one aims at mimicking the teacher's behavior. Despite the huge variety of possible motif sequences we show that the learner, equipped with the provided learning model, can rewire "on the fly" its synaptic couplings in no more than (n-1) learning cycles and converge exponentially to the durations of the teacher's motifs. We validate the learning model on mobile robots. Experimental results show that the learner is indeed capable of copying the teacher's behavior composed of six motor motifs in a few learning cycles. The reported mechanism of learning is general and can be used for replicating different functions, including, for example, sound patterns or speech.
Collapse
Affiliation(s)
- Carlos Calvo Tapia
- Instituto de Matemática Interdisciplinar, Faculty of Mathematics, Universidad Complutense de Madrid, Plaza Ciencias 3, 28040 Madrid, Spain
| | - Ivan Y Tyukin
- University of Leicester, Department of Mathematics, University Road, LE1 7RH, United Kingdom
| | - Valeri A Makarov
- Instituto de Matemática Interdisciplinar, Faculty of Mathematics, Universidad Complutense de Madrid, Plaza Ciencias 3, 28040 Madrid, Spain.,Lobachevsky State University of Nizhny Novgorod, Gagarin Ave. 23, 603950 Nizhny Novgorod, Russia
| |
Collapse
|
25
|
Ravichandar HC, Dani A. Learning position and orientation dynamics from demonstrations via contraction analysis. Auton Robots 2018. [DOI: 10.1007/s10514-018-9758-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
|
26
|
Abstract
Highly articulated systems are capable of executing a variety of behaviors by coordinating their many internal degrees of freedom to help them move more effectively in complex terrains. However, this inherent variety poses significant challenges that have been the subject of a great deal of previous work: What are the most effective or most efficient methods for achieving the intrinsic coordination necessary to produce desired global objectives? This work takes these questions one step further, asking how different levels of coordination, which we quantify in terms of kinematic coupling, affect articulated locomotion in environments with different degrees of underlying structure. We introduce shape functions as the analytical basis for specifying kinematic coupling relationships that constrain the relative motion among the internal degrees of freedom for a given system during its nominal locomotion. Furthermore, we show how shape functions are used to derive shape-based controllers (SBCs) that manage the compliant interaction between articulated bodies and the environment while explicitly preserving the inter-joint coupling defined by shape functions. Initial experimental evidence provides a comparison of the benefits of different levels of coordination for two separate platforms in environments with different degrees of inherent structure. The experimental results show that decentralized implementations, where there is relatively little inter-joint coupling, perform well across a spectrum of different terrains but that there are potential benefits to higher degrees of coupling in structured terrains. We discuss how this observation has implications related to future planning and control approaches that actively “tune” their underlying structure by dynamically varying the assumed level of coupling as a function of task specification and local environmental conditions.
Collapse
Affiliation(s)
- Matthew Travers
- The Robotics Institute, Carnegie Mellon University, Pittsburgh, PA, USA
| | - Julian Whitman
- The Robotics Institute, Carnegie Mellon University, Pittsburgh, PA, USA
| | - Howie Choset
- The Robotics Institute, Carnegie Mellon University, Pittsburgh, PA, USA
| |
Collapse
|
27
|
Zhang X, Wang S, Hoagg JB, Seigler TM. The Roles of Feedback and Feedforward as Humans Learn to Control Unknown Dynamic Systems. IEEE TRANSACTIONS ON CYBERNETICS 2018; 48:543-555. [PMID: 28141541 DOI: 10.1109/tcyb.2016.2646483] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
We present results from an experiment in which human subjects interact with an unknown dynamic system 40 times during a two-week period. During each interaction, subjects are asked to perform a command-following (i.e., pursuit tracking) task. Each subject's performance at that task improves from the first trial to the last trial. For each trial, we use subsystem identification to estimate each subject's feedforward (or anticipatory) control, feedback (or reactive) control, and feedback time delay. Over the 40 trials, the magnitudes of the identified feedback controllers and the identified feedback time delays do not change significantly. In contrast, the identified feedforward controllers do change significantly. By the last trial, the average identified feedforward controller approximates the inverse of the dynamic system. This observation provides evidence that a fundamental component of human learning is updating the anticipatory control until it models the inverse dynamics.
Collapse
|
28
|
Casellato C, Pedrocchi A, Ferrigno G. Whole-Body Movements in Long-Term Weightlessness: Hierarchies of the Controlled Variables Are Gravity-Dependent. J Mot Behav 2016; 49:568-579. [PMID: 28027021 DOI: 10.1080/00222895.2016.1247032] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
Abstract
Switching between contexts affects the mechanisms underlying motion planning, in particular it may entail reranking the variables to be controlled in defining the motor solutions. Three astronauts performed multiple sessions of whole-body pointing, in normogravity before launch, in prolonged weightlessness onboard the International Space Station, and after return. The effect of gravity context on kinematic and dynamic components was evaluated. Hand trajectory was gravity independent; center-of-mass excursion was highly variable within and between subjects. The body-environment effort exchange, expressed as inertial ankle momentum, was systematically lower in weightlessness than in normogravity. After return on Earth, the system underwent a rapid 1-week readaptation. The study indicates that minimizing the control effort is given greater weight when optimizing the motor plan in weightlessness compared to normogravity: the hierarchies of the controlled variables are gravity dependent.
Collapse
Affiliation(s)
| | - Alessandra Pedrocchi
- a NeuroEngineering and Medical Robotics Laboratory, Department of Electronics, Information and Bioengineering , Politecnico di Milano , Milano , Italy
| | - Giancarlo Ferrigno
- a NeuroEngineering and Medical Robotics Laboratory, Department of Electronics, Information and Bioengineering , Politecnico di Milano , Milano , Italy
| |
Collapse
|
29
|
Abstract
SUMMARYThe paper addresses the problem of speed adaptation of movements subject to environmental constraints. Our approach relies on a novel formulation of velocity profiles as an extension of dynamic movement primitives (DMP). The framework allows for compact representation of non-uniformly accelerated motion as well as simple modulation of the movement parameters. In the paper, we evaluate two model free methods by which optimal parameters can be obtained: iterative learning control (ILC) and policy search based reinforcement learning (RL). The applicability of each method is discussed and evaluated on two distinct cases, which are hard to model using standard techniques. The first deals with hard contacts with the environment while the second process involves liquid dynamics. We find ILC to be very efficient in cases where task parameters can be easily described with an error function. On the other hand, RL has stronger convergence properties and can therefore provide a solution in the general case.
Collapse
|
30
|
Zhou C, Wang B, Zhu Q, Wu J. An online gait generator for quadruped walking using motor primitives. INT J ADV ROBOT SYST 2016. [DOI: 10.1177/1729881416657960] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
This article presents implementation of an online gait generator on a quadruped robot. Firstly, the design of a quadruped robot is presented. The robot contains four leg modules each of which is constructed by a 2 degrees of freedom (2-DOF) five-bar parallel linkage mechanism. Together with other two rotational DOF, the leg module is able to perform 4-DOF movement. The parallel mechanism of the robot allows all the servos attached on the body frame, so that the leg mass is decreased and motor load can be balanced. Secondly, an online gait generator based on dynamic movement primitives for the walking control is presented. Dynamic movement primitives provide an approach to generate periodic trajectories and they can be modulated in real time, which makes the online adjustment of walking gaits possible. This gait controller is tested by the quadruped robot in regulating walking speed, switching between forward\backward movements and steering. The controller is easy to apply, expand and is quite effective on phase coordination and online trajectory modulation. Results of simulated experiments are presented.
Collapse
Affiliation(s)
| | | | | | - Jun Wu
- Zhejiang University, Hangzhou, China
| |
Collapse
|
31
|
Meola VC, Caligiore D, Sperati V, Zollo L, Ciancio AL, Taffoni F, Guglielmelli E, Baldassarre G. Interplay of Rhythmic and Discrete Manipulation Movements During Development: A Policy-Search Reinforcement-Learning Robot Model. IEEE Trans Cogn Dev Syst 2016. [DOI: 10.1109/tamd.2015.2494460] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
|
32
|
Kamali K, Akbari AA, Akbarzadeh A. Trajectory generation and control of a knee exoskeleton based on dynamic movement primitives for sit-to-stand assistance. Adv Robot 2016. [DOI: 10.1080/01691864.2016.1154800] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
33
|
|
34
|
Karigiannis JN, Tzafestas CS. Model-free learning on robot kinematic chains using a nested multi-agent topology. J EXP THEOR ARTIF IN 2015. [DOI: 10.1080/0952813x.2015.1042923] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
|
35
|
Abstract
In the attempt to build adaptive and intelligent machines, roboticists have looked at neuroscience for more than half a century as a source of inspiration for perception and control. More recently, neuroscientists have resorted to robots for testing hypotheses and validating models of biological nervous systems. Here, we give an overview of the work at the intersection of robotics and neuroscience and highlight the most promising approaches and areas where interactions between the two fields have generated significant new insights. We articulate the work in three sections, invertebrate, vertebrate and primate neuroscience. We argue that robots generate valuable insight into the function of nervous systems, which is intimately linked to behaviour and embodiment, and that brain-inspired algorithms and devices give robots life-like capabilities.
Collapse
Affiliation(s)
- Dario Floreano
- Laboratory of Intelligent Systems, Ecole Polytechnique Fédérale de Lausanne, Station 11, Lausanne, CH 1015, Switzerland.
| | - Auke Jan Ijspeert
- Biorobotics Laboratory, Ecole Polytechnique Fédérale de Lausanne, Station 14, Lausanne, CH 1015, Switzerland
| | - Stefan Schaal
- Max-Planck-Institute for Intelligent Systems, Spemannstrasse 41, 72076 Tübingen, Germany, & University of Southern California, Ronald Tutor Hall RTH 401, 3710 S. McClintock Avenue, Los Angeles, CA 90089-2905, USA
| |
Collapse
|
36
|
González-Fierro M, Hernández-García D, Nanayakkara T, Balaguer C. Behavior sequencing based on demonstrations: a case of a humanoid opening a door while walking. Adv Robot 2015. [DOI: 10.1080/01691864.2014.992955] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
|
37
|
The role of prediction and outcomes in adaptive cognitive control. ACTA ACUST UNITED AC 2015; 109:38-52. [PMID: 25698177 DOI: 10.1016/j.jphysparis.2015.02.001] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2014] [Revised: 01/18/2015] [Accepted: 02/07/2015] [Indexed: 02/04/2023]
Abstract
Humans adaptively perform actions to achieve their goals. This flexible behaviour requires two core abilities: the ability to anticipate the outcomes of candidate actions and the ability to select and implement actions in a goal-directed manner. The ability to predict outcomes has been extensively researched in reinforcement learning paradigms, but this work has often focused on simple actions that are not embedded in hierarchical and sequential structures that are characteristic of goal-directed human behaviour. On the other hand, the ability to select actions in accordance with high-level task goals, particularly in the presence of alternative responses and salient distractors, has been widely researched in cognitive control paradigms. Cognitive control research, however, has often paid less attention to the role of action outcomes. The present review attempts to bridge these accounts by proposing an outcome-guided mechanism for selection of extended actions. Our proposal builds on constructs from the hierarchical reinforcement learning literature, which emphasises the concept of reaching and evaluating informative states, i.e., states that constitute subgoals in complex actions. We develop an account of the neural mechanisms that allow outcome-guided action selection to be achieved in a network that relies on projections from cortical areas to the basal ganglia and back-projections from the basal ganglia to the cortex. These cortico-basal ganglia-thalamo-cortical 'loops' allow convergence - and thus integration - of information from non-adjacent cortical areas (for example between sensory and motor representations). This integration is essential in action sequences, for which achieving an anticipated sensory state signals the successful completion of an action. We further describe how projection pathways within the basal ganglia allow selection between representations, which may pertain to movements, actions, or extended action plans. The model lastly envisages a role for hierarchical projections from the striatum to dopaminergic midbrain areas that enable more rostral frontal areas to bias the selection of inputs from more posterior frontal areas via their respective representations in the basal ganglia.
Collapse
|
38
|
Mickevičienė D, Skurvydas A, Karanauskienė D. Is intraindividual variability different between unimanual and bimanual speed-accuracy movements? Percept Mot Skills 2015; 120:125-38. [PMID: 25668076 DOI: 10.2466/25.pms.120v14x3] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Dominant (right) and nondominant (left) hand differences in motor performance variables were investigated during targeted rapid aiming unimanual and bimanual (to two separate targets) speed-accuracy tasks. The performance of the dominant and nondominant hands were compared during rapid targeted unimanual and bimanual reaching tasks (50 repetitions) in adult, neurologically intact, right-handed men (n = 20). As task difficulty increased from unimanual to bimanual tasks, reaction time increased and velocity (average and maximal) of performance decreased significantly. The effect of hand was significant only on average movement velocity and on intraindividual variability of accuracy (in both cases performance in the left hand was worse). Variability of movement path (accuracy) was less in the right hand during both unimanual and bimanual tasks.
Collapse
Affiliation(s)
- Dalia Mickevičienė
- 1 Department of Applied Biology and Rehabilitation, Lithuanian Sports University, Kaunas, Lithuania
| | | | | |
Collapse
|
39
|
Gams A, Nemec B, Ijspeert AJ, Ude A. Coupling Movement Primitives: Interaction With the Environment and Bimanual Tasks. IEEE T ROBOT 2014. [DOI: 10.1109/tro.2014.2304775] [Citation(s) in RCA: 121] [Impact Index Per Article: 12.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
|
40
|
Ankarali MM, Tutkun Sen H, De A, Okamura AM, Cowan NJ. Haptic feedback enhances rhythmic motor control by reducing variability, not improving convergence rate. J Neurophysiol 2013; 111:1286-99. [PMID: 24371296 DOI: 10.1152/jn.00140.2013] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
Stability and performance during rhythmic motor behaviors such as locomotion are critical for survival across taxa: falling down would bode well for neither cheetah nor gazelle. Little is known about how haptic feedback, particularly during discrete events such as the heel-strike event during walking, enhances rhythmic behavior. To determine the effect of haptic cues on rhythmic motor performance, we investigated a virtual paddle juggling behavior, analogous to bouncing a table tennis ball on a paddle. Here, we show that a force impulse to the hand at the moment of ball-paddle collision categorically improves performance over visual feedback alone, not by regulating the rate of convergence to steady state (e.g., via higher gain feedback or modifying the steady-state hand motion), but rather by reducing cycle-to-cycle variability. This suggests that the timing and state cues afforded by haptic feedback decrease the nervous system's uncertainty of the state of the ball to enable more accurate control but that the feedback gain itself is unaltered. This decrease in variability leads to a substantial increase in the mean first passage time, a measure of the long-term metastability of a stochastic dynamical system. Rhythmic tasks such as locomotion and juggling involve intermittent contact with the environment (i.e., hybrid transitions), and the timing of such transitions is generally easy to sense via haptic feedback. This timing information may improve metastability, equating to less frequent falls or other failures depending on the task.
Collapse
Affiliation(s)
- M Mert Ankarali
- Department of Mechanical Engineering, Johns Hopkins University, Baltimore, Maryland
| | | | | | | | | |
Collapse
|
41
|
Abstract
Reinforcement learning offers to robotics a framework and set of tools for the design of sophisticated and hard-to-engineer behaviors. Conversely, the challenges of robotic problems provide both inspiration, impact, and validation for developments in reinforcement learning. The relationship between disciplines has sufficient promise to be likened to that between physics and mathematics. In this article, we attempt to strengthen the links between the two research communities by providing a survey of work in reinforcement learning for behavior generation in robots. We highlight both key challenges in robot reinforcement learning as well as notable successes. We discuss how contributions tamed the complexity of the domain and study the role of algorithms, representations, and prior knowledge in achieving these successes. As a result, a particular focus of our paper lies on the choice between model-based and model-free as well as between value-function-based and policy-search methods. By analyzing a simple problem in some detail we demonstrate how reinforcement learning approaches may be profitably applied, and we note throughout open questions and the tremendous potential for future research.
Collapse
Affiliation(s)
- Jens Kober
- Bielefeld University, CoR-Lab Research Institute for Cognition and Robotics, Bielefeld, Germany
- Honda Research Institute Europe, Offenbach/Main, Germany
| | | | - Jan Peters
- Max Planck Institute for Intelligent Systems, Department of Empirical Inference, Tübingen, Germany
- Technische Universität Darmstadt, FB Informatik, FG Intelligent Autonomous Systems, Darmstadt, Germany
| |
Collapse
|
42
|
|
43
|
Petrič T, Gams A, Babič J, Žlajpah L. Reflexive stability control framework for humanoid robots. Auton Robots 2013. [DOI: 10.1007/s10514-013-9329-0] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
44
|
Ijspeert AJ, Nakanishi J, Hoffmann H, Pastor P, Schaal S. Dynamical Movement Primitives: Learning Attractor Models for Motor Behaviors. Neural Comput 2013; 25:328-73. [DOI: 10.1162/neco_a_00393] [Citation(s) in RCA: 847] [Impact Index Per Article: 77.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]
Abstract
Nonlinear dynamical systems have been used in many disciplines to model complex behaviors, including biological motor control, robotics, perception, economics, traffic prediction, and neuroscience. While often the unexpected emergent behavior of nonlinear systems is the focus of investigations, it is of equal importance to create goal-directed behavior (e.g., stable locomotion from a system of coupled oscillators under perceptual guidance). Modeling goal-directed behavior with nonlinear systems is, however, rather difficult due to the parameter sensitivity of these systems, their complex phase transitions in response to subtle parameter changes, and the difficulty of analyzing and predicting their long-term behavior; intuition and time-consuming parameter tuning play a major role. This letter presents and reviews dynamical movement primitives, a line of research for modeling attractor behaviors of autonomous nonlinear dynamical systems with the help of statistical learning techniques. The essence of our approach is to start with a simple dynamical system, such as a set of linear differential equations, and transform those into a weakly nonlinear system with prescribed attractor dynamics by means of a learnable autonomous forcing term. Both point attractors and limit cycle attractors of almost arbitrary complexity can be generated. We explain the design principle of our approach and evaluate its properties in several example applications in motor control and robotics.
Collapse
Affiliation(s)
- Auke Jan Ijspeert
- Ecole Polytechnique Fédérale de Lausanne, Lausanne CH-1015, Switzerland
| | - Jun Nakanishi
- School of Informatics, University of Edinburgh, Edinburgh EH8 9AB, U.K
| | - Heiko Hoffmann
- Computer Science, Neuroscience, and Biomedical Engineering, University of Southern California, Los Angeles, CA 90089, U.S.A
| | - Peter Pastor
- Computer Science, Neuroscience, and Biomedical Engineering, University of Southern California, Los Angeles, CA 90089, U.S.A
| | - Stefan Schaal
- Computer Science, Neuroscience, and Biomedical Engineering, University of Southern California, Los Angeles, CA 90089, U.S.A.; Max-Planck-Institute for Intelligent Systems, Tübingen 72076, Germany; and ATR Computational Neuroscience Laboratories, Kyoto 619-0288, Japan
| |
Collapse
|
45
|
Mülling K, Kober J, Kroemer O, Peters J. Learning to select and generalize striking movements in robot table tennis. Int J Rob Res 2013. [DOI: 10.1177/0278364912472380] [Citation(s) in RCA: 199] [Impact Index Per Article: 18.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Learning new motor tasks from physical interactions is an important goal for both robotics and machine learning. However, when moving beyond basic skills, most monolithic machine learning approaches fail to scale. For more complex skills, methods that are tailored for the domain of skill learning are needed. In this paper, we take the task of learning table tennis as an example and present a new framework that allows a robot to learn cooperative table tennis from physical interaction with a human. The robot first learns a set of elementary table tennis hitting movements from a human table tennis teacher by kinesthetic teach-in, which is compiled into a set of motor primitives represented by dynamical systems. The robot subsequently generalizes these movements to a wider range of situations using our mixture of motor primitives approach. The resulting policy enables the robot to select appropriate motor primitives as well as to generalize between them. Finally, the robot plays with a human table tennis partner and learns online to improve its behavior. We show that the resulting setup is capable of playing table tennis using an anthropomorphic robot arm.
Collapse
Affiliation(s)
- Katharina Mülling
- Max Planck Institute for Intelligent Systems, Department of Empirical Inference, Tübingen, Germany
- Technische Universität Darmstadt, FG Intelligente Autonome Systeme, Darmstadt, Germany
| | - Jens Kober
- Max Planck Institute for Intelligent Systems, Department of Empirical Inference, Tübingen, Germany
- Technische Universität Darmstadt, FG Intelligente Autonome Systeme, Darmstadt, Germany
| | - Oliver Kroemer
- Technische Universität Darmstadt, FG Intelligente Autonome Systeme, Darmstadt, Germany
| | - Jan Peters
- Max Planck Institute for Intelligent Systems, Department of Empirical Inference, Tübingen, Germany
- Technische Universität Darmstadt, FG Intelligente Autonome Systeme, Darmstadt, Germany
| |
Collapse
|
46
|
Ciancio AL, Zollo L, Baldassarre G, Caligiore D, Guglielmelli E. The Role of Learning and Kinematic Features in Dexterous Manipulation: A Comparative Study with Two Robotic Hands. INT J ADV ROBOT SYST 2013. [DOI: 10.5772/56479] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022] Open
Abstract
Dexterous movements performed by the human hand are by far more sophisticated than those achieved by current humanoid robotic hands and systems used to control them. This work aims at providing a contribution in order to overcome this gap by proposing a bio-inspired control architecture that captures two key elements underlying human dexterity. The first is the progressive development of skilful control, often starting from – or involving – cyclic movements, based on trial-and-error learning processes and central pattern generators. The second element is the exploitation of a particular kinematic features of the human hand, i.e. the thumb opposition. The architecture is tested with two simulated robotic hands having different kinematic features and engaged in rotating spheres, cylinders, and cubes of different sizes. The results support the feasibility of the proposed approach and show the potential of the model to allow a better understanding of the control mechanisms and kinematic principles underlying human dexterity and make them transferable to anthropomorphic robotic hands.
Collapse
Affiliation(s)
- Anna Lisa Ciancio
- Laboratory of Biomedical Robotics and Biomicrosystem, Università Campus Bio-Medico di Roma, Roma, Italy
| | - Loredana Zollo
- Laboratory of Biomedical Robotics and Biomicrosystem, Università Campus Bio-Medico di Roma, Roma, Italy
| | - Gianluca Baldassarre
- Laboratory of Computational Embodied Neuroscience, Istituto di Scienze e Tecnologie della Cognizione, Consiglio Nazionale delle Ricerche (LOCEN-ISTC-CNR), Roma, Italy
| | - Daniele Caligiore
- Laboratory of Computational Embodied Neuroscience, Istituto di Scienze e Tecnologie della Cognizione, Consiglio Nazionale delle Ricerche (LOCEN-ISTC-CNR), Roma, Italy
| | - Eugenio Guglielmelli
- Laboratory of Biomedical Robotics and Biomicrosystem, Università Campus Bio-Medico di Roma, Roma, Italy
| |
Collapse
|
47
|
Pearce TM, Moran DW. Strategy-dependent encoding of planned arm movements in the dorsal premotor cortex. Science 2012; 337:984-8. [PMID: 22821987 DOI: 10.1126/science.1220642] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
Abstract
The kinematic strategy encoded in motor cortical areas for classic straight-line reaching is remarkably simple and consistent across subjects, despite the complicated musculoskeletal dynamics that are involved. As tasks become more challenging, however, different conscious strategies may be used to improve perceived behavioral performance. We identified additional spatial information that appeared both in single neurons and in the population code of monkey dorsal premotor cortex when obstacles impeded direct reach paths. The neural correlate of movement planning varied between subjects in a manner consistent with the use of different strategies to optimize task completion. These distinct planning strategies were manifested in the timing and strength of the information contained in the neural population code.
Collapse
Affiliation(s)
- Thomas M Pearce
- Washington University in St. Louis, St. Louis, MO 63130, USA
| | | |
Collapse
|
48
|
|
49
|
|
50
|
Kormushev P, Calinon S, Caldwell DG. Imitation Learning of Positional and Force Skills Demonstrated via Kinesthetic Teaching and Haptic Input. Adv Robot 2012. [DOI: 10.1163/016918611x558261] [Citation(s) in RCA: 176] [Impact Index Per Article: 14.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Affiliation(s)
- Petar Kormushev
- a Department of Advanced Robotics, Italian Institute of Technology, 16163 Genova, Italy
| | - Sylvain Calinon
- b Department of Advanced Robotics, Italian Institute of Technology, 16163 Genova, Italy
| | - Darwin G. Caldwell
- c Department of Advanced Robotics, Italian Institute of Technology, 16163 Genova, Italy
| |
Collapse
|