1
|
Sandini G, Sciutti A, Morasso P. Artificial cognition vs. artificial intelligence for next-generation autonomous robotic agents. Front Comput Neurosci 2024; 18:1349408. [PMID: 38585280 PMCID: PMC10995397 DOI: 10.3389/fncom.2024.1349408] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2023] [Accepted: 02/20/2024] [Indexed: 04/09/2024] Open
Abstract
The trend in industrial/service robotics is to develop robots that can cooperate with people, interacting with them in an autonomous, safe and purposive way. These are the fundamental elements characterizing the fourth and the fifth industrial revolutions (4IR, 5IR): the crucial innovation is the adoption of intelligent technologies that can allow the development of cyber-physical systems, similar if not superior to humans. The common wisdom is that intelligence might be provided by AI (Artificial Intelligence), a claim that is supported more by media coverage and commercial interests than by solid scientific evidence. AI is currently conceived in a quite broad sense, encompassing LLMs and a lot of other things, without any unifying principle, but self-motivating for the success in various areas. The current view of AI robotics mostly follows a purely disembodied approach that is consistent with the old-fashioned, Cartesian mind-body dualism, reflected in the software-hardware distinction inherent to the von Neumann computing architecture. The working hypothesis of this position paper is that the road to the next generation of autonomous robotic agents with cognitive capabilities requires a fully brain-inspired, embodied cognitive approach that avoids the trap of mind-body dualism and aims at the full integration of Bodyware and Cogniware. We name this approach Artificial Cognition (ACo) and ground it in Cognitive Neuroscience. It is specifically focused on proactive knowledge acquisition based on bidirectional human-robot interaction: the practical advantage is to enhance generalization and explainability. Moreover, we believe that a brain-inspired network of interactions is necessary for allowing humans to cooperate with artificial cognitive agents, building a growing level of personal trust and reciprocal accountability: this is clearly missing, although actively sought, in current AI. The ACo approach is a work in progress that can take advantage of a number of research threads, some of them antecedent the early attempts to define AI concepts and methods. In the rest of the paper we will consider some of the building blocks that need to be re-visited in a unitary framework: the principles of developmental robotics, the methods of action representation with prospection capabilities, and the crucial role of social interaction.
Collapse
Affiliation(s)
| | | | - Pietro Morasso
- Italian Institute of Technology, Cognitive Architecture for Collaborative Technologies (CONTACT) and Robotics, Brain and Cognitive Sciences (RBCS) Research Units, Genoa, Italy
| |
Collapse
|
2
|
Shen M, Huang X, Zhao Y, Wang Y, Li H, Jiang Z. Human-like acceleration and deceleration control of a robot astronaut floating in a space station. ISA TRANSACTIONS 2024:S0019-0578(24)00097-1. [PMID: 38458904 DOI: 10.1016/j.isatra.2024.02.034] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/10/2023] [Revised: 02/28/2024] [Accepted: 02/28/2024] [Indexed: 03/10/2024]
Abstract
The acceleration and deceleration (AD) motions are the basic motion modes of robot astronauts moving in a space station. Controlling the locomotion of the robot astronaut is very challenging due to the strong nonlinearity of its complex multi-body dynamics in a gravity-free environment. However, after training, humans can move well in space stations by pushing the bulkhead, and the motion mechanism of humans is a good reference for robot astronauts. The contribution of this study is modeling the human AD motion in a microgravity environment and proposing a human-like control method for robot astronauts moving in space stations. Specifically, the movement and contact force data of the human body during AD motion were collected on an air-floating platform. Through human AD modeling analysis, the mechanism of human motion is discovered, and semi-sinusoidal primitives of contact forces are proposed for AD motion. Then, a dynamic guidance model of human AD motion is built to complete motion planning under contact constraints, which is used as the expected model for the AD control of robot astronauts. Benefiting from the force primitives, accurate and safe planning of human-like AD motion can be completed. The characteristics and mechanism of human AD motion have been analyzed from the perspective of optimization. Lastly, based on the proposed dynamic guidance model, the AD motion policy is mapped to the robot astronaut system via a system control method based on the equivalent mapping of dynamic responses (force, velocity and pose). Through comparative analysis with real human motion data and simulation results under different conditions, the proposed AD control method can achieve human-like motion efficiently and stably. Even when confronted with errors in the robot's contact velocities and inertia parameters, the method can significantly reduce the motion errors while ensuring stability.
Collapse
Affiliation(s)
- Minghui Shen
- School of Mechatronical Engineering, Beijing Institute of Technology, Beijing 100081, China; Key Laboratory of Biomimetic Robots and Systems of Chinese Ministry of Education, Beijing, 100081, China
| | - Xiao Huang
- School of Mechatronical Engineering, Beijing Institute of Technology, Beijing 100081, China; Key Laboratory of Biomimetic Robots and Systems of Chinese Ministry of Education, Beijing, 100081, China
| | - Yan Zhao
- School of Mechatronical Engineering, Beijing Institute of Technology, Beijing 100081, China; Key Laboratory of Biomimetic Robots and Systems of Chinese Ministry of Education, Beijing, 100081, China
| | - Yunlai Wang
- School of Mechatronical Engineering, Beijing Institute of Technology, Beijing 100081, China; Key Laboratory of Biomimetic Robots and Systems of Chinese Ministry of Education, Beijing, 100081, China
| | - Hui Li
- School of Mechatronical Engineering, Beijing Institute of Technology, Beijing 100081, China; Key Laboratory of Biomimetic Robots and Systems of Chinese Ministry of Education, Beijing, 100081, China.
| | - Zhihong Jiang
- School of Mechatronical Engineering, Beijing Institute of Technology, Beijing 100081, China; Key Laboratory of Biomimetic Robots and Systems of Chinese Ministry of Education, Beijing, 100081, China.
| |
Collapse
|
3
|
Chen W, Jin J, Gerontitis D, Qiu L, Zhu J. Improved Recurrent Neural Networks for Text Classification and Dynamic Sylvester Equation Solving. Neural Process Lett 2023. [DOI: 10.1007/s11063-023-11176-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/06/2023]
|
4
|
Li Z, Li G, Wu X, Kan Z, Su H, Liu Y. Asymmetric Cooperation Control of Dual-Arm Exoskeletons Using Human Collaborative Manipulation Models. IEEE TRANSACTIONS ON CYBERNETICS 2022; 52:12126-12139. [PMID: 34637389 DOI: 10.1109/tcyb.2021.3113709] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
The exoskeleton is mainly used by subjects who suffer muscle injury to enhance motor ability in the daily life environment. Previous research seldom considers extending human collaboration skills to human-robot collaborations. In this article, two models, that is: 1) the following the better model and 2) the interpersonal goal integration model, are designed to facilitate the human-human collaborative manipulation in tracking a moving target. Integrated with dual-arm exoskeletons, these two models can enable the robot to successfully perform target tracking with two human partners. Specifically, the manipulation workspace of the human-exoskeleton system is divided into a human region and a robot region. In the human region, the human acts as the leader during cooperation, while, in the robot region, the robot takes the leading role. A novel region-based Barrier Lyapunov function (BLF) is then designed to handle the change of leader roles between the human and the robot and ensures the operation within the constrained human and robot regions when driving the dual-arm exoskeleton to track the moving target. The designed adaptive controller ensures the convergence of tracking errors in the presence of region switches. Experiments are performed on the dual-arm robotic exoskeleton for the subject with muscle damage or some degree of motor dysfunctions to evaluate the proposed controller in tracking a moving target, and the experimental results demonstrate the effectiveness of the developed control.
Collapse
|
5
|
Gerontitis D, Behera R, Shi Y, Stanimirović PS. A robust noise tolerant zeroing neural network for solving time-varying linear matrix equations. Neurocomputing 2022. [DOI: 10.1016/j.neucom.2022.08.036] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/15/2022]
|
6
|
Li Q, Wang Y. A Novel Teacher-Assistance-Based Method to Detect and Handle Bad Training Demonstrations in Learning From Demonstration. IEEE Trans Cogn Dev Syst 2022. [DOI: 10.1109/tcds.2021.3097251] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Affiliation(s)
- Qin Li
- School of Automation, Central South University, Changsha, China
| | - Yong Wang
- School of Automation, Central South University, Changsha, China
| |
Collapse
|
7
|
Trajectory Tracking within a Hierarchical Primitive-Based Learning Approach. ENTROPY 2022; 24:e24070889. [PMID: 35885112 PMCID: PMC9321877 DOI: 10.3390/e24070889] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/31/2022] [Revised: 06/20/2022] [Accepted: 06/23/2022] [Indexed: 11/30/2022]
Abstract
A hierarchical learning control framework (HLF) has been validated on two affordable control laboratories: an active temperature control system (ATCS) and an electrical rheostatic braking system (EBS). The proposed HLF is data-driven and model-free, while being applicable on general control tracking tasks which are omnipresent. At the lowermost level, L1, virtual state-feedback control is learned from input–output data, using a recently proposed virtual state-feedback reference tuning (VSFRT) principle. L1 ensures a linear reference model tracking (or matching) and thus, indirect closed-loop control system (CLCS) linearization. On top of L1, an experiment-driven model-free iterative learning control (EDMFILC) is then applied for learning reference input–controlled outputs pairs, coined as primitives. The primitives’ signals at the L2 level encode the CLCS dynamics, which are not explicitly used in the learning phase. Data reusability is applied to derive monotonic and safely guaranteed learning convergence. The learning primitives in the L2 level are finally used in the uppermost and final L3 level, where a decomposition/recomposition operation enables prediction of the optimal reference input assuring optimal tracking of a previously unseen trajectory, without relearning by repetitions, as it was in level L2. Hence, the HLF enables control systems to generalize their tracking behavior to new scenarios by extrapolating their current knowledge base. The proposed HLF framework endows the CLCSs with learning, memorization and generalization features which are specific to intelligent organisms. This may be considered as an advancement towards intelligent, generalizable and adaptive control systems.
Collapse
|
8
|
Learning Motion Primitives Automata for Autonomous Driving Applications. MATHEMATICAL AND COMPUTATIONAL APPLICATIONS 2022. [DOI: 10.3390/mca27040054] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
Motion planning methods often rely on libraries of primitives. The selection of primitives is then crucial for assuring feasible solutions and good performance within the motion planner. In the literature, the library is usually designed by either learning from demonstration, relying entirely on data, or by model-based approaches, with the advantage of exploiting the dynamical system’s property, e.g., symmetries. In this work, we propose a method combining data with a dynamical model to optimally select primitives. The library is designed based on primitives with highest occurrences within the data set, while Lie group symmetries from a model are analysed in the available data to allow for structure-exploiting primitives. We illustrate our technique in an autonomous driving application. Primitives are identified based on data from human driving, with the freedom to build libraries of different sizes as a parameter of choice. We also compare the extracted library with a custom selection of primitives regarding the performance of obtained solutions for a street layout based on a real-world scenario.
Collapse
|
9
|
Model Reference Tracking Control Solutions for a Visual Servo System Based on a Virtual State from Unknown Dynamics. ENERGIES 2021. [DOI: 10.3390/en15010267] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
This paper focuses on validating a model-free Value Iteration Reinforcement Learning (MFVI-RL) control solution on a visual servo tracking system in a comprehensive manner starting from theoretical convergence analysis to detailed hardware and software implementation. Learning is based on a virtual state representation reconstructed from input-output (I/O) system samples under nonlinear observability and unknown dynamics assumptions, while the goal is to ensure linear output reference model (ORM) tracking. Secondary, a competitive model-free Virtual State-Feedback Reference Tuning (VSFRT) is learned from the same I/O data using the same virtual state representation, demonstrating the framework’s learning capability. A model-based two degrees-of-freedom (2DOF) output feedback controller serving as a comparisons baseline is designed and tuned using an identified system model. With similar complexity and linear controller structure, MFVI-RL is shown to be superior, confirming that the model-based design issue of poor identified system model and control performance degradation can be solved in a direct data-driven style. Apart from establishing a formal connection between output feedback control, state feedback control and also between classical control and artificial intelligence methods, the results also point out several practical trade-offs, such as I/O data exploration quality and control performance leverage with data volume, control goal and controller complexity.
Collapse
|
10
|
Abstract
A general control system tracking learning framework is proposed, by which an optimal learned tracking behavior called ‘primitive’ is extrapolated to new unseen trajectories without requiring relearning. This is considered intelligent behavior and strongly related to the neuro-motor cognitive control of biological (human-like) systems that deliver suboptimal executions for tasks outside of their current knowledge base, by using previously memorized experience. However, biological systems do not solve explicit mathematical equations for solving learning and prediction tasks. This stimulates the proposed hierarchical cognitive-like learning framework, based on state-of-the-art model-free control: (1) at the low-level L1, an approximated iterative Value Iteration for linearizing the closed-loop system (CLS) behavior by a linear reference model output tracking is first employed; (2) an experiment-driven Iterative Learning Control (EDILC) applied to the CLS from the reference input to the controlled output learns simple tracking tasks called ‘primitives’ in the secondary L2 level, and (3) the tertiary level L3 extrapolates the primitives’ optimal tracking behavior to new tracking tasks without trial-based relearning. The learning framework relies only on input-output system data to build a virtual state space representation of the underlying controlled system that is assumed to be observable. It has been shown to be effective by experimental validation on a representative, coupled, nonlinear, multivariable real-world system. Able to cope with new unseen scenarios in an optimal fashion, the hierarchical learning framework is an advance toward cognitive control systems.
Collapse
|
11
|
Virtual State Feedback Reference Tuning and Value Iteration Reinforcement Learning for Unknown Observable Systems Control. ENERGIES 2021. [DOI: 10.3390/en14041006] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
In this paper, a novel Virtual State-feedback Reference Feedback Tuning (VSFRT) and Approximate Iterative Value Iteration Reinforcement Learning (AI-VIRL) are applied for learning linear reference model output (LRMO) tracking control of observable systems with unknown dynamics. For the observable system, a new state representation in terms of input/output (IO) data is derived. Consequently, the Virtual State Feedback Tuning (VRFT)-based solution is redefined to accommodate virtual state feedback control, leading to an original stability-certified Virtual State-Feedback Reference Tuning (VSFRT) concept. Both VSFRT and AI-VIRL use neural networks controllers. We find that AI-VIRL is significantly more computationally demanding and more sensitive to the exploration settings, while leading to inferior LRMO tracking performance when compared to VSFRT. It is not helped either by transfer learning the VSFRT control as initialization for AI-VIRL. State dimensionality reduction using machine learning techniques such as principal component analysis and autoencoders does not improve on the best learned tracking performance however it trades off the learning complexity. Surprisingly, unlike AI-VIRL, the VSFRT control is one-shot (non-iterative) and learns stabilizing controllers even in poorly, open-loop explored environments, proving to be superior in learning LRMO tracking control. Validation on two nonlinear coupled multivariable complex systems serves as a comprehensive case study.
Collapse
|