1
Sun T, Yang J, Pan Y, Yu H. Repetitive Impedance Learning-Based Physically Human-Robot Interactive Control. IEEE Transactions on Neural Networks and Learning Systems 2024; 35:10629-10638. [PMID: 37027552] [DOI: 10.1109/tnnls.2023.3243091]
Abstract
Model-based impedance learning control can provide variable impedance regulation for robots through online impedance learning without interaction force sensing. However, the existing results only guarantee that the closed-loop control systems are uniformly ultimately bounded (UUB) and require the human impedance profiles to be periodic, iteration-dependent, or slowly varying. In this article, a repetitive impedance learning control approach is proposed for physical human-robot interaction (PHRI) in repetitive tasks. The proposed control is composed of a proportional-differential (PD) control term, an adaptive control term, and a repetitive impedance learning term. Differential adaptation with projection modification is designed to estimate robotic parameter uncertainties in the time domain, while fully saturated repetitive learning is proposed to estimate time-varying human impedance uncertainties in the iteration domain. Uniform convergence of the tracking errors is guaranteed by the PD control and by the use of projection and full saturation in the uncertainty estimation, and is theoretically proved through a Lyapunov-like analysis. In the impedance profiles, the stiffness and damping consist of an iteration-independent term and an iteration-dependent disturbance, which are estimated by the repetitive learning and suppressed by the PD control, respectively. The developed approach can therefore be applied to PHRI where iteration-dependent disturbances exist in the stiffness and damping. The control effectiveness and advantages are validated by simulations on a parallel robot performing a repetitive following task.
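As background for readers, the kind of target impedance relation such controllers regulate can be sketched as a mass-damper-spring law linking position error to interaction force. This is a generic illustration only; the gains M, D, K, the constant forcing, and the Euler integration are assumptions for the sketch, not the authors' control law.

```python
def simulate_impedance(M, D, K, f_ext, dt=0.001, steps=2000):
    """Integrate a target impedance model M*e'' + D*e' + K*e = f(t),
    where e is the position error driven by the interaction force
    f_ext(t), using semi-implicit Euler."""
    e, de = 0.0, 0.0
    for i in range(steps):
        dde = (f_ext(i * dt) - D * de - K * e) / M
        de += dde * dt  # update velocity first (semi-implicit)
        e += de * dt
    return e

# A constant 1 N interaction force against stiffness K = 50 N/m settles
# near the static deflection e = F/K = 0.02 m after the transient.
e_ss = simulate_impedance(M=1.0, D=20.0, K=50.0, f_ext=lambda t: 1.0)
```

In a learning controller, K and D would themselves be time-varying estimates rather than the fixed constants used here.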
2
Diaz MA, Vos M, Dillen A, Tassignon B, Flynn L, Geeroms J, Meeusen R, Verstraten T, Babic J, Beckerle P, De Pauw K. Human-in-the-Loop Optimization of Wearable Robotic Devices to Improve Human-Robot Interaction: A Systematic Review. IEEE Transactions on Cybernetics 2023; 53:7483-7496. [PMID: 37015459] [DOI: 10.1109/tcyb.2022.3224895]
Abstract
This article presents a systematic review of wearable robotic devices that use human-in-the-loop optimization (HILO) strategies to improve human-robot interaction. A total of 46 HILO studies were identified and divided into upper- and lower-limb robotic devices. The main aspects of HILO were identified, reviewed, and classified into four areas: 1) human-machine systems; 2) optimization methods; 3) control strategies; and 4) experimental protocols. A variety of objective functions (physiological, biomechanical, and subjective), optimization strategies, and optimized control parameter configurations used in different control strategies are presented and analyzed. An overview of experimental protocols is provided, including the metrics, tasks, and conditions tested, and the relevance given to training or adaptation periods is explored. We outline an HILO framework that encompasses current wearable robots, optimization strategies, objective functions, control strategies, and experimental protocols, and conclude by highlighting current research gaps and defining future directions for the development of advanced HILO strategies in upper- and lower-limb wearable robots.
3
Gu S, Kshirsagar A, Du Y, Chen G, Peters J, Knoll A. A human-centered safe robot reinforcement learning framework with interactive behaviors. Front Neurorobot 2023; 17:1280341. [PMID: 38023448] [PMCID: PMC10665848] [DOI: 10.3389/fnbot.2023.1280341]
Abstract
Deploying Reinforcement Learning (RL) algorithms for real-world robotics applications requires ensuring the safety of the robot and its environment. Safe Robot RL (SRRL) is a crucial step toward achieving human-robot coexistence. In this paper, we envision a human-centered SRRL framework consisting of three stages: safe exploration, safety value alignment, and safe collaboration. We examine the research gaps in these areas and propose leveraging interactive behaviors for SRRL. Interactive behaviors enable bi-directional information transfer between humans and robots, as in conversational robots powered by ChatGPT. We argue that interactive behaviors deserve further attention from the SRRL community, and we discuss four open challenges related to the robustness, efficiency, transparency, and adaptability of SRRL with interactive behaviors.
Affiliation(s)
- Shangding Gu
- Department of Computer Science, Technical University of Munich, Munich, Germany
- Alap Kshirsagar
- Department of Computer Science, Technical University of Darmstadt, Darmstadt, Germany
- Yali Du
- Department of Informatics, King's College London, London, United Kingdom
- Guang Chen
- College of Electronic and Information Engineering, Tongji University, Shanghai, China
- Jan Peters
- Department of Computer Science, Technical University of Darmstadt, Darmstadt, Germany
- Alois Knoll
- Department of Computer Science, Technical University of Munich, Munich, Germany
4
Yang J, Sun T, Yang H. Spatial hybrid adaptive impedance learning control for robots in repetitive interactive tasks. ISA Transactions 2023; 138:151-159. [PMID: 36828703] [DOI: 10.1016/j.isatra.2023.02.021]
Abstract
Existing model-based impedance learning control methods can provide variable impedance regulation for physical human-robot interaction (PHRI) in repetitive tasks without interaction force sensing. However, these methods require the repetitive tasks to be completed in constant time, which restricts their applications. For PHRI in repetitive tasks with varying completion times, this paper proposes a spatial hybrid adaptive impedance learning control (SHAILC) strategy that exploits the spatial periodicity of the tasks. In the spatial hybrid adaptation, spatial periodic adaptation estimates the time-varying human impedance, while differential adaptation estimates the robot's unknown constant parameters. Deadzone modifications in the hybrid adaptation maintain the accuracy of the parameter estimation when the tracking error is small relative to the modeling error. The control stability is established through a Lyapunov-based analysis in the spatial domain, and the control effectiveness and advantages are illustrated on a parallel robot in repetitive tasks with different completion times.
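The deadzone modification mentioned above admits a few-line sketch: a gradient-type parameter update is applied only while the tracking error exceeds a threshold, so noise-level errors do not make the estimate drift. The gain, threshold, and scalar regressor form here are illustrative assumptions, not the paper's exact update law.

```python
def deadzone_adapt(theta_hat, error, regressor, gain=0.5, deadzone=0.05):
    """Gradient-type parameter adaptation with a deadzone modification:
    freeze the update when |error| falls within the deadzone, where the
    tracking error is dominated by modeling error and noise."""
    if abs(error) <= deadzone:
        return theta_hat  # no adaptation inside the deadzone
    return theta_hat + gain * error * regressor

theta = 1.0
theta = deadzone_adapt(theta, error=0.01, regressor=2.0)  # frozen: stays 1.0
theta = deadzone_adapt(theta, error=0.10, regressor=2.0)  # updated to 1.1
```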
Affiliation(s)
- Jiantao Yang
- School of Health Science and Engineering, University of Shanghai for Science and Technology, Shanghai 200093, China
- Tairen Sun
- School of Health Science and Engineering, University of Shanghai for Science and Technology, Shanghai 200093, China
- Hongjun Yang
- State Key Laboratory of Multimodal Artificial Intelligence Systems, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China
5
Qin L, Ji H, Chen M, Wang K. A Self-Coordinating Controller with Balance-Guiding Ability for Lower-Limb Rehabilitation Exoskeleton Robot. Sensors (Basel) 2023; 23:5311. [PMID: 37300038] [DOI: 10.3390/s23115311]
Abstract
The restricted posture and unrestricted compliance imposed by the controller during human-exoskeleton interaction (HEI) can cause patients to lose balance or even fall. In this article, a self-coordinated velocity vector (SCVV) double-layer controller with balance-guiding ability was developed for a lower-limb rehabilitation exoskeleton robot (LLRER). In the outer loop, an adaptive trajectory generator that follows the gait cycle was devised to generate a harmonious hip-knee reference trajectory in a non-time-varying (NTV) phase space. In the inner loop, velocity control was adopted. By searching for the minimum L2 norm between the reference phase trajectory and the current configuration, desired velocity vectors were obtained whose encouraging and correcting effects are self-coordinated according to the L2 norm. The controller was simulated using an electromechanical coupling model, and experiments were carried out with a self-developed exoskeleton device. Both the simulations and the experiments validated the effectiveness of the controller.
Affiliation(s)
- Li Qin
- School of Electrical Engineering, Yanshan University, Qinhuangdao 066012, China
- Houzhao Ji
- School of Electrical Engineering, Yanshan University, Qinhuangdao 066012, China
- Minghao Chen
- School of Electrical Engineering, Yanshan University, Qinhuangdao 066012, China
- Ke Wang
- School of Electrical Engineering, Yanshan University, Qinhuangdao 066012, China
6
Yang R, Zheng J, Song R. Continuous mode adaptation for cable-driven rehabilitation robot using reinforcement learning. Front Neurorobot 2022; 16:1068706. [PMID: 36620486] [PMCID: PMC9813438] [DOI: 10.3389/fnbot.2022.1068706]
Abstract
Continuous mode adaptation is important for satisfying different user rehabilitation needs and improving human-robot interaction (HRI) performance in rehabilitation robots. Hence, we propose a reinforcement-learning-based optimal admittance control (RLOAC) strategy for a cable-driven rehabilitation robot (CDRR), which realizes continuous mode adaptation between the passive and active working modes. First, to obviate the need for knowledge of the human and robot dynamics models, a reinforcement learning algorithm is employed to obtain the optimal admittance parameters by minimizing a cost function composed of the trajectory error and the human voluntary force. Second, the contribution weights of the cost function are modulated according to the human voluntary force, which enables the CDRR to adapt continuously between the passive and active working modes. Finally, simulations and experiments with 10 subjects were conducted to investigate the feasibility and effectiveness of the RLOAC strategy. The experimental results indicated that the desired performance was obtained; moreover, the tracking error and energy per unit distance of the RLOAC strategy were notably lower than those of the traditional admittance control method. The RLOAC strategy is thus effective in improving tracking accuracy and robot compliance, and has potential for use in rehabilitation robots.
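The two ingredients described above, an admittance model mapping human force to a trajectory offset and a force-dependent weighting that trades the tracking-error term against the human-force term of the cost, can be sketched as follows. The virtual mass and damping stand in for the parameters the RLOAC strategy tunes online; all gains and the blending function are assumptions for illustration, not the paper's learned values.

```python
class Admittance:
    """Minimal admittance filter: the measured human force f_h drives a
    virtual mass-damper (m*v' + d*v = f_h) whose output velocity is
    integrated into an offset added to the reference trajectory."""
    def __init__(self, m=1.0, d=10.0):
        self.m, self.d = m, d
        self.v, self.x = 0.0, 0.0

    def step(self, f_h, dt=0.01):
        self.v += (f_h - self.d * self.v) / self.m * dt
        self.x += self.v * dt
        return self.x

def tracking_weight(f_h, f_scale=5.0):
    """Weight on the trajectory-error term of the cost: near 1 when the
    human is passive (robot leads, passive mode), decaying toward 0 as
    the voluntary force grows (human leads, active mode)."""
    return 1.0 / (1.0 + (abs(f_h) / f_scale) ** 2)

adm = Admittance()
for _ in range(500):   # a constant 10 N push for 5 s
    adm.step(10.0)     # offset velocity settles near f_h / d = 1.0 m/s
```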
Affiliation(s)
- Renyu Yang
- Key Laboratory of Sensing Technology and Biomedical Instrument of Guangdong Province, School of Biomedical Engineering, Sun Yat-sen University, Guangzhou, China; School of Biomedical Engineering, Shenzhen Campus of Sun Yat-sen University, Shenzhen, China
- Jianlin Zheng
- Key Laboratory of Sensing Technology and Biomedical Instrument of Guangdong Province, School of Biomedical Engineering, Sun Yat-sen University, Guangzhou, China; School of Biomedical Engineering, Shenzhen Campus of Sun Yat-sen University, Shenzhen, China
- Rong Song (corresponding author)
- Key Laboratory of Sensing Technology and Biomedical Instrument of Guangdong Province, School of Biomedical Engineering, Sun Yat-sen University, Guangzhou, China; School of Biomedical Engineering, Shenzhen Campus of Sun Yat-sen University, Shenzhen, China
7
Li J, Ma Y, Gao R, Cao Z, Lim A, Song W, Zhang J. Deep Reinforcement Learning for Solving the Heterogeneous Capacitated Vehicle Routing Problem. IEEE Transactions on Cybernetics 2022; 52:13572-13585. [PMID: 34554923] [DOI: 10.1109/tcyb.2021.3111082]
Abstract
Existing deep reinforcement learning (DRL)-based methods for solving the capacitated vehicle routing problem (CVRP) intrinsically assume a homogeneous vehicle fleet, treated as repetitions of a single vehicle. Hence, constructing a solution reduces to selecting the next node (customer) to visit, with no vehicle selection involved. However, vehicles in real-world scenarios are likely to be heterogeneous, with different characteristics affecting their capacity (or travel speed), which renders existing DRL methods less effective. In this article, we tackle the heterogeneous CVRP (HCVRP), where vehicles are mainly characterized by different capacities. We consider both min-max and min-sum objectives, which aim to minimize the longest or the total travel time of the vehicles in the fleet, respectively. To solve these problems, we propose a DRL method based on the attention mechanism, with a vehicle selection decoder accounting for the heterogeneous fleet constraint and a node selection decoder accounting for route construction; the model learns to construct a solution by automatically selecting both a vehicle and a node for that vehicle at each step. Experimental results on randomly generated instances show that, with desirable generalization to various problem sizes, our method outperforms the state-of-the-art DRL method and most conventional heuristics, and delivers competitive performance against the state-of-the-art heuristic method, slack induction by string removal. Extended experiments further demonstrate that our method can also solve CVRPLib instances with satisfactory performance.
8
Incorporating rivalry in reinforcement learning for a competitive game. Neural Comput Appl 2022. [DOI: 10.1007/s00521-022-07746-9]
Abstract
Recent advances in reinforcement learning with social agents have allowed such models to achieve human-level performance on certain interaction tasks. However, most interactive scenarios do not have performance alone as an end goal; the social impact of these agents when interacting with humans is equally important and largely unexplored. In this regard, this work proposes a novel reinforcement learning mechanism based on the social impact of rivalry behavior. Our proposed model aggregates objective and social perception mechanisms to derive a rivalry score that modulates the learning of artificial agents. To investigate the model, we design an interactive game scenario using the Chef's Hat Card Game and examine how rivalry modulation changes the agent's playing style and how this affects the experience of human players. Our results show that humans can detect specific social characteristics when playing against rival agents compared with common agents, which directly affects the human players' performance in subsequent games. We conclude by discussing how the different social and objective features that compose the artificial rivalry score contribute to these results.
9
Hu B, Guan ZH, Chen G, Chen CLP. Neuroscience and Network Dynamics Toward Brain-Inspired Intelligence. IEEE Transactions on Cybernetics 2022; 52:10214-10227. [PMID: 33909581] [DOI: 10.1109/tcyb.2021.3071110]
Abstract
This article surveys the interdisciplinary research of neuroscience, network science, and dynamic systems, with emphasis on the emergence of brain-inspired intelligence. To replicate brain intelligence, a practical way is to reconstruct cortical networks with dynamic activities that nourish the brain functions, instead of using only artificial computing networks. The survey provides a complex network and spatiotemporal dynamics (abbr. network dynamics) perspective for understanding the brain and cortical networks and, furthermore, develops integrated approaches of neuroscience and network dynamics toward building brain-inspired intelligence with learning and resilience functions. Presented are fundamental concepts and principles of complex networks, neuroscience, and hybrid dynamic systems, as well as relevant studies about the brain and intelligence. Other promising research directions, such as brain science, data science, quantum information science, and machine behavior are also briefly discussed toward future applications.
10
Asymmetric constrained control scheme design with discrete output feedback in unknown robot–environment interaction system. Robotica 2022. [DOI: 10.1017/s0263574722001138]
Abstract
In this paper, an overall structure with an asymmetric constrained controller is constructed for human–robot interaction in uncertain environments. The control structure consists of two decoupled loops. In the outer loop, a discrete output feedback adaptive dynamic programming (OPFB ADP) algorithm is proposed to deal with the unknown environment dynamics and the unobservable environment position, and a discount factor is added to the algorithm to improve its convergence speed. In the inner loop, a constrained controller is developed on the basis of an asymmetric barrier Lyapunov function, and a neural network is applied to approximate the dynamic characteristics of the uncertain system model. With this controller, the robot can track the prescribed trajectory precisely within a security boundary. Simulation and experimental results demonstrate the effectiveness of the proposed controller.
11
Cognitive Learning and Robotics: Innovative Teaching for Inclusivity. Multimodal Technologies and Interaction 2022. [DOI: 10.3390/mti6080065]
Abstract
We present the interdisciplinary CoWriting Kazakh project, in which a social robot acts as a peer in learning the new Kazakh Latin alphabet, to which Kazakhstan is going to shift from the current Kazakh Cyrillic by 2030. We discuss the past literature on cognitive learning and script acquisition in depth and present a theoretical framing for this study. The results of word and letter analyses from two user studies conducted between 2019 and 2020 are presented. Learning the new alphabet through Kazakh words with two or more syllables and special native letters resulted in significant learning gains. These results suggest that reciprocal Cyrillic-to-Latin script learning yields considerable cognitive benefits owing to mental conversion, word choice, and handwriting practice. Overall, the system enables school-age children to practice the new Kazakh Latin script in an engaging learning scenario. The proposed theoretical framework illuminates teaching and learning within the multimodal robot-assisted script learning scenario and beyond.
12
Wu HN, Zhang XM, Li RG. Synthesis With Guaranteed Cost and Less Human Intervention for Human-in-the-Loop Control Systems. IEEE Transactions on Cybernetics 2022; 52:7541-7551. [PMID: 33417574] [DOI: 10.1109/tcyb.2020.3041033]
Abstract
This article studies the problem of synthesis with guaranteed cost and less human intervention for linear human-in-the-loop (HiTL) control systems. First, human behavior is modeled via a hidden controlled Markov process, which not only captures the stochasticity of inference and the uncertainty of observation of the human internal state but also takes the control input to the human into account. Then, to integrate the human and machine models as well as their interaction, a hidden controlled Markov jump system (HCMJS) is constructed. With the aid of a stochastic Lyapunov functional together with the bilinear matrix inequality technique, a sufficient condition for the existence of human-assistance controllers is derived on the basis of the HCMJS model, which guarantees the stochastic stability of the closed-loop HiTL system and provides a prescribed upper bound on the quadratic cost function. Moreover, to achieve less human intervention while meeting the desired cost level, an algorithm combining particle swarm optimization with the linear matrix inequality technique is proposed to seek a suitable feedback control law for the human and a human-assistance control law for the machine. Finally, the proposed method is applied to a driver-assistance system to verify its effectiveness.
13
Kobayashi T. Optimistic reinforcement learning by forward Kullback–Leibler divergence optimization. Neural Netw 2022; 152:169-180. [DOI: 10.1016/j.neunet.2022.04.021]
14
Sharifi M, Zakerimanesh A, Mehr JK, Torabi A, Mushahwar VK, Tavakoli M. Impedance Variation and Learning Strategies in Human-Robot Interaction. IEEE Transactions on Cybernetics 2022; 52:6462-6475. [PMID: 33449901] [DOI: 10.1109/tcyb.2020.3043798]
Abstract
In this survey, various concepts and methodologies developed over the past two decades for varying and learning the impedance or admittance of robotic systems that physically interact with humans are explored. For this purpose, the assumptions and mathematical formulations for the online adjustment of impedance models and controllers for physical human-robot interaction (HRI) are categorized and compared. In this systematic review, studies on: 1) variation and 2) learning of appropriate impedance elements are taken into account. These strategies are classified and described in terms of their objectives, points of view (approaches), and signal requirements (including position, HRI force, and electromyography activity). Different methods involving linear/nonlinear analyses (e.g., optimal control design and nonlinear Lyapunov-based stability guarantee) and the Gaussian approximation algorithms (e.g., Gaussian mixture model-based and dynamic movement primitives-based strategies) are reviewed. Current challenges and research trends in physical HRI are finally discussed.
15
Yang J, Sun T. Finite-Time Interactive Control of Robots with Multiple Interaction Modes. Sensors (Basel) 2022; 22:3668. [PMID: 35632080] [PMCID: PMC9147656] [DOI: 10.3390/s22103668]
Abstract
This paper proposes a finite-time multi-modal robotic control strategy for physical human-robot interaction. The proposed multi-modal controller consists of a modified super-twisting-based finite-time control term designed for each interaction mode and a continuity-guaranteed control term. The finite-time control term guarantees finite-time achievement of the desired impedance dynamics in the active interaction mode (AIM), drives the tracking error of the reference trajectory to zero in finite time in the passive interaction mode (PIM), and guarantees that the robot's motion stops in finite time in the safety-stop mode (SSM). Meanwhile, the continuity-guaranteed control term ensures control input continuity and steady transitions between interaction modes. The finite-time closed-loop stability and control effectiveness are validated by Lyapunov-based theoretical analysis and simulations on a robot manipulator.
16
Lv P, Wang X, Cheng Y, Duan Z, Chen CLP. Integrated Double Estimator Architecture for Reinforcement Learning. IEEE Transactions on Cybernetics 2022; 52:3111-3122. [PMID: 33027028] [DOI: 10.1109/tcyb.2020.3023033]
Abstract
Estimation bias is an important index for evaluating the performance of reinforcement learning (RL) algorithms. Popular RL algorithms, such as Q-learning and the deep Q-network (DQN), often suffer from overestimation due to the maximum operation used to estimate the maximum expected action values of the next states, while double Q-learning (DQ) and double DQN may fall into underestimation by using a double estimator (DE) to avoid overestimation. To balance overestimation and underestimation, we propose a novel integrated DE (IDE) architecture that combines the maximum operation and the DE operation to estimate the maximum expected action value. Based on IDE, two RL algorithms are proposed: 1) integrated DQ (IDQ) and 2) its deep network version, integrated double DQN (IDDQN). The main idea is that the maximum and DE operations are integrated to eliminate the estimation bias: one estimator is stochastically used to perform action selection based on the maximum operation, and a convex combination of the two estimators is used to carry out action evaluation. We theoretically analyze why estimation bias arises from using a nonmaximum operation to estimate the maximum expected value, investigate the possible causes of underestimation in DQ, and prove the unbiasedness of IDE and the convergence of IDQ. Experiments on grid-world and Atari 2600 games indicate that IDQ and IDDQN can reduce or even eliminate estimation bias effectively, make learning more stable and balanced, and improve performance.
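The selection/evaluation split described above can be sketched as a tabular update: the greedy next action comes from one stochastically chosen estimator (the maximum operation), while its value is evaluated with a convex combination of both estimators. The dict-based tables, the choice of which table receives the TD update, and all hyperparameters are sketch-level assumptions, not the paper's exact formulation.

```python
import random

def ide_q_update(Q_A, Q_B, s, a, r, s_next, alpha=0.1, gamma=0.99, beta=0.5):
    """One tabular update in the spirit of the integrated double estimator.
    Tables are dicts keyed by (state, action); unseen pairs default to 0."""
    # Stochastically pick the estimator used for greedy action selection;
    # the same table receives the TD update here (a sketch-level choice).
    Q_sel = Q_A if random.random() < 0.5 else Q_B
    actions = [act for (st, act) in Q_sel if st == s_next]
    if actions:
        # Maximum operation on the selected estimator...
        a_star = max(actions, key=lambda act: Q_sel[(s_next, act)])
        # ...but evaluation by a convex combination of both estimators.
        q_next = (beta * Q_A.get((s_next, a_star), 0.0)
                  + (1 - beta) * Q_B.get((s_next, a_star), 0.0))
    else:
        q_next = 0.0  # terminal or unvisited next state
    old = Q_sel.get((s, a), 0.0)
    Q_sel[(s, a)] = old + alpha * (r + gamma * q_next - old)
    return Q_sel[(s, a)]
```

With beta = 1 this collapses toward single-estimator evaluation (overestimation-prone), and with the evaluation moved entirely to the other table it recovers double Q-learning, which is the trade-off the IDE architecture interpolates.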
17
Jin Z, Liu A, Zhang WA, Yu L. An Optimal Variable Impedance Control With Consideration of the Stability. IEEE Robot Autom Lett 2022. [DOI: 10.1109/lra.2022.3141759]
18
Zhang L, Zhang R, Wu T, Weng R, Han M, Zhao Y. Safe Reinforcement Learning With Stability Guarantee for Motion Planning of Autonomous Vehicles. IEEE Transactions on Neural Networks and Learning Systems 2021; 32:5435-5444. [PMID: 34242172] [DOI: 10.1109/tnnls.2021.3084685]
Abstract
Reinforcement learning with safety constraints is promising for autonomous vehicles, where various failures may result in disastrous losses. In general, a safe policy is trained by constrained optimization algorithms, in which the average constraint return, as a function of states and actions, must remain below a predefined bound. However, most existing safe learning-based algorithms capture states via multiple high-precision sensors, which complicates the hardware system and consumes power. This article focuses on safe motion planning with a stability guarantee for autonomous vehicles of limited size and power. To this end, a risk-identification method and a Lyapunov function are integrated with the well-known soft actor-critic (SAC) algorithm. By borrowing the concept of Lyapunov functions from control theory, the learned policy can theoretically guarantee that the state trajectory always stays in a safe area. A novel risk-sensitive learning-based algorithm with a stability guarantee is proposed to train policies for the motion planning of autonomous vehicles. The learned policy is implemented on a differential-drive vehicle in a simulation environment, and the experimental results show that the proposed algorithm achieves a higher success rate than SAC.
19
Bahrami V, Kalhor A, Masouleh MT. Dynamic model estimating and designing controller for the 2-DoF planar robot in interaction with cable-driven robot based on adaptive neural network. Journal of Intelligent & Fuzzy Systems 2021. [DOI: 10.3233/jifs-210180]
Abstract
This study investigates dynamic model estimation and the design of an adaptive neural network-based controller for a passive planar robot performing a 2-DoF motion pattern in interaction with an actuated cable-driven robot. The main goal of this structure is to use a number of light cables to drive the serial robot's links so that the robot's end-effector tracks a desired reference model. The system under study can serve as a rehabilitation setup for people with arm disabilities. Upon applying sliding-mode error dynamics, it is necessary to determine a vector containing the matrices of the robot dynamics. Finding these matrices, however, requires computational approaches such as the Newton-Euler or Lagrange formulations, and as the number of links and degrees of freedom increases, deriving the robot dynamics becomes more difficult. Therefore, an Adaptive Neural Network (ANN) with specific inputs is used to estimate the unknown matrices of the system, and the controller is designed on this basis. The main motivation for an adaptive controller is that no prior knowledge of the system's dynamic model is available, since the human arm can have different dynamic properties. The controller is thus formed by an ANN and a robust term. The adaptation laws of the parameters are derived via a Lyapunov approach, which guarantees the asymptotic stability of the whole system. Simulation results confirm the efficiency of the proposed method. Finally, using the Root Mean Square Error (RMSE) criterion, it is shown that, in the presence of bounded disturbances of different amplitudes, adding the robust term to the controller improves the tracking error by about 34% and 62%, respectively.
Affiliation(s)
- Vahid Bahrami, School of Electrical and Computer Engineering, University of Tehran, Tehran, Iran, Human and Robot Interaction Laboratory
- Ahmad Kalhor, School of Electrical and Computer Engineering, University of Tehran, Tehran, Iran, Human and Robot Interaction Laboratory
- Mehdi Tale Masouleh, School of Electrical and Computer Engineering, University of Tehran, Tehran, Iran, Human and Robot Interaction Laboratory
20

Singh B, Kumar R, Singh VP. Reinforcement learning in robotic applications: a comprehensive survey. Artif Intell Rev 2021. [DOI: 10.1007/s10462-021-09997-9]
21

Yu X, He W, Li Y, Xue C, Li J, Zou J, Yang C. Bayesian Estimation of Human Impedance and Motion Intention for Human-Robot Collaboration. IEEE TRANSACTIONS ON CYBERNETICS 2021; 51:1822-1834. [PMID: 31647450 DOI: 10.1109/tcyb.2019.2940276]
Abstract
This article proposes a Bayesian method to estimate human impedance and motion intention in a human-robot collaborative task. By combining prior knowledge of human stiffness with measurements, Bayesian estimation yields a stiffness estimate obeying a Gaussian distribution, from which human motion intention can also be estimated. An adaptive impedance control strategy is employed to track a target impedance model, and neural networks are used to compensate for uncertainties in the robot dynamics. Comparative simulations verify the effectiveness of the estimation method and emphasize the advantages of the proposed control strategy. An experiment performed on the Baxter robot platform illustrates good system performance.
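The conjugate-Gaussian update behind this kind of stiffness estimation can be sketched in a few lines. This is a generic illustration, not the authors' estimator: the scalar stiffness model, the prior values, and the noise variance below are all assumptions made for the sketch.

```python
import numpy as np

def bayes_stiffness_update(mu0, var0, obs, obs_var):
    """Conjugate-Gaussian update of a scalar stiffness estimate.

    mu0, var0 : prior mean/variance of human stiffness
    obs       : array of noisy stiffness observations
    obs_var   : assumed measurement-noise variance
    Returns the posterior mean and variance.
    """
    n = len(obs)
    post_var = 1.0 / (1.0 / var0 + n / obs_var)
    post_mu = post_var * (mu0 / var0 + np.sum(obs) / obs_var)
    return post_mu, post_var

# Hypothetical numbers: prior belief ~300 N/m, ten noisy samples near 350 N/m
# pull the estimate toward the data while shrinking its variance.
mu, var = bayes_stiffness_update(300.0, 50.0**2, np.full(10, 350.0), 20.0**2)
```

The posterior variance is always smaller than the prior variance, which is what makes the estimate usable online: each interaction sample tightens the Gaussian belief.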
22

23

Kobayashi T, Ilboudo WEL. t-soft update of target network for deep reinforcement learning. Neural Netw 2021; 136:63-71. [PMID: 33450653 DOI: 10.1016/j.neunet.2020.12.023]
Abstract
This paper proposes a new robust update rule for the target network in deep reinforcement learning (DRL), replacing the conventional update rule given as an exponential moving average. The target network smoothly generates the reference signals for the main network in DRL, thereby reducing learning variance. The problem with the conventional update rule is that all parameters are copied smoothly at the same speed from the main network, even when some of them are updating in the wrong direction; this behavior increases the risk of generating wrong reference signals. Although slowing down the overall update speed is a naive way to mitigate wrong updates, it would also decrease learning speed. To update the parameters robustly while maintaining learning speed, a t-soft update method, inspired by the Student-t distribution, is derived from the analogy between the exponential moving average and the normal distribution. Through analysis of the derived t-soft update, we show that it inherits the properties of the Student-t distribution. Specifically, owing to the heavy-tailed property of the Student-t distribution, the t-soft update automatically excludes extreme updates that differ from past experiences; when the updates are similar to past experiences, it can mitigate learning delay by increasing the update amount. In PyBullet robotics simulations for DRL, an online actor-critic algorithm with the t-soft update outperformed the conventional methods in terms of the obtained return and/or its variance. From the training process, we found that the t-soft update is globally consistent with the standard soft update, with the update rates adjusted locally for acceleration or suppression.
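The contrast between the conventional soft (EMA) update and a heavy-tailed alternative can be sketched as follows. The weighting here only follows the spirit of the t-soft rule described in the abstract; the paper's exact formulation differs, and the `sigma2` deviation tracking is a simplified stand-in.

```python
import numpy as np

def soft_update(target, main, tau=0.01):
    """Conventional EMA update: every parameter moves at the same rate tau."""
    return (1.0 - tau) * target + tau * main

def t_soft_update(target, main, sigma2, tau=0.01, nu=1.0):
    """Heavy-tailed variant in the spirit of the t-soft rule.

    Deviations that are large relative to the running scale sigma2 receive a
    small Student-t weight, so outlier updates are automatically damped.
    Returns the new target parameters and the updated scale estimate.
    """
    delta = main - target
    w = (nu + 1.0) / (nu + delta**2 / sigma2)        # per-parameter t-weight
    new_target = target + tau * w * delta
    new_sigma2 = (1.0 - tau) * sigma2 + tau * delta**2  # track deviation scale
    return new_target, new_sigma2

target = np.zeros(3)
main = np.array([0.1, 0.1, 50.0])    # last entry mimics an extreme update
new_t, _ = t_soft_update(target, main, sigma2=np.full(3, 0.01))
```

With these numbers the two ordinary parameters move at essentially the plain soft-update rate, while the outlier's step is cut by several orders of magnitude, which is the qualitative behavior the abstract describes.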
24

Xu J, Xu L, Li Y, Cheng G, Shi J, Liu J, Chen S. A Multi-Channel Reinforcement Learning Framework for Robotic Mirror Therapy. IEEE Robot Autom Lett 2020. [DOI: 10.1109/lra.2020.3007408]
25

Köpf F, Westermann J, Flad M, Hohmann S. Adaptive optimal control for reference tracking independent of exo-system dynamics. Neurocomputing 2020. [DOI: 10.1016/j.neucom.2020.04.140]
26

Khoramshahi M, Billard A. A dynamical system approach for detection and reaction to human guidance in physical human–robot interaction. Auton Robots 2020. [DOI: 10.1007/s10514-020-09934-9]
Abstract
A seamless interaction requires two robotic behaviors: the leader role, where the robot rejects external perturbations and focuses on autonomous execution of the task, and the follower role, where the robot ignores the task and complies with intentional human forces. The goal of this work is to provide (1) a unified robotic architecture to produce these two roles and (2) a human-guidance detection algorithm to switch between them. In the absence of human guidance, the robot performs its task autonomously; upon detection of such guidance, the robot passively follows the human motions. We employ dynamical systems to generate task-specific motion and admittance control to generate reactive motions toward the human guidance. This structure enables the robot to reject undesirable perturbations, track motions precisely, react to human guidance with properly compliant behavior, and re-plan the motion reactively. We provide an analytical investigation of our method in terms of tracking and compliant behavior. Finally, we evaluate our method experimentally using a 6-DoF manipulator.
27

Compliant Manipulation Method for a Nursing Robot Based on Physical Structure of Human Limb. J INTELL ROBOT SYST 2020. [DOI: 10.1007/s10846-020-01221-0]
28

Li Y, Wen Y, Tao D, Guan K. Transforming Cooling Optimization for Green Data Center via Deep Reinforcement Learning. IEEE TRANSACTIONS ON CYBERNETICS 2020; 50:2002-2013. [PMID: 31352360 DOI: 10.1109/tcyb.2019.2927410]
Abstract
Data centers (DCs) play an important role in supporting services such as e-commerce and cloud computing. The energy consumption resulting from this growing market has drawn significant attention, and notably almost half of the energy cost is used to cool the DC to a particular temperature. It is thus a critical operational challenge to curb the cooling energy cost without sacrificing the thermal safety of a DC. Existing solutions typically follow a two-step approach in which the system is first modeled based on expert knowledge and the operational actions are then determined with heuristics and/or best practices. These approaches are often hard to generalize and may yield suboptimal performance due to intrinsic model errors in large-scale systems. In this paper, we propose optimizing DC cooling control via the emerging deep reinforcement learning (DRL) framework. Compared to existing approaches, our solution provides an end-to-end cooling control algorithm (CCA) via an off-policy, offline version of the deep deterministic policy gradient (DDPG) algorithm, in which an evaluation network is trained to predict the DC energy cost along with the resulting cooling effects, and a policy network is trained to gauge optimized control settings. Moreover, we introduce a de-underestimation (DUE) validation mechanism for the critic network to reduce the potential underestimation of risk caused by neural approximation. The proposed algorithm is evaluated on an EnergyPlus simulation platform and on a real data trace collected from the National Super Computing Centre (NSCC) of Singapore. The numerical results show that the proposed CCA can achieve up to 11% cooling cost reduction on the simulation platform compared with a manually configured baseline control algorithm; in the conservative trace-based study, it achieves about 15% cooling energy savings on the NSCC data trace. Our approach sheds new light on the application of DRL to optimize and automate DC operations and management.
29

Wan Z, Jiang C, Fahad M, Ni Z, Guo Y, He H. Robot-Assisted Pedestrian Regulation Based on Deep Reinforcement Learning. IEEE TRANSACTIONS ON CYBERNETICS 2020; 50:1669-1682. [PMID: 30475740 DOI: 10.1109/tcyb.2018.2878977]
Abstract
Pedestrian regulation can prevent crowd accidents and improve crowd safety in densely populated areas. Recent studies use mobile robots to regulate pedestrian flows for desired collective motion through the effect of passive human-robot interaction (HRI). This paper formulates a robot motion planning problem for the optimization of two merging pedestrian flows moving through a bottleneck exit. To address the challenge of feature representation of complex human motion dynamics under the effect of HRI, we propose using a deep neural network to model the mapping from the image input of pedestrian environments to the output of robot motion decisions. The robot motion planner is trained end-to-end using a deep reinforcement learning algorithm, which avoids hand-crafted feature detection and extraction, thus improving the learning capability for complex dynamic problems. Our proposed approach is validated in simulated experiments, and its performance is evaluated. The results demonstrate that the robot is able to find optimal motion decisions that maximize the pedestrian outflow in different flow conditions, and the pedestrian-accumulated outflow increases significantly compared to cases without robot regulation and with random robot motion.
30

Gui K, Tan UX, Liu H, Zhang D. A New Impedance Controller Based on Nonlinear Model Reference Adaptive Control for Exoskeleton Systems. INT J HUM ROBOT 2019. [DOI: 10.1142/s0219843619500208]
Abstract
Robotic exoskeletons are expected to show high compliance and low impedance in human–robot interaction (HRI). Our study introduces a novel method based on nonlinear model reference adaptive control (MRAC) to reduce the inherent impedance and replace the traditional impedance controller in HRI. The control law and adaptive law are designed according to a candidate Lyapunov function. A simple system identification and initialization method for the nonlinear MRAC is put forward, providing a set of better initial values for the controller. Simulation and experimental results show that our controller can reduce the mechanical impedance and achieve high compliance for HRI. Both adaptive control and compliance control are achieved by the proposed nonlinear MRAC framework.
Affiliation(s)
- Kai Gui, State Key Laboratory of Mechanical System and Vibration, Shanghai Jiao Tong University, Shanghai 200240, P. R. China
- U-Xuan Tan, Singapore University of Technology and Design, Singapore
- Honghai Liu, State Key Laboratory of Mechanical System and Vibration, Shanghai Jiao Tong University, Shanghai 200240, P. R. China
- Dingguo Zhang, State Key Laboratory of Mechanical System and Vibration, Shanghai Jiao Tong University, Shanghai 200240, P. R. China; Department of Electronic & Electrical Engineering, University of Bath, UK

31
Abstract
The optimal tracking problem is addressed in the robotics literature using a variety of robust and adaptive control approaches. However, these schemes face implementation limitations, such as applicability in uncertain dynamical environments with completely or partially model-based control structures, complexity and integrity in discrete-time environments, and scalability in complex coupled dynamical systems. An online adaptive learning mechanism is developed to tackle these limitations and provide a generalized solution platform for a class of tracking control problems. This scheme minimizes the tracking errors and optimizes the overall dynamical behavior using simultaneous linear feedback control strategies. Reinforcement learning approaches based on value iteration are adopted to solve the underlying Bellman optimality equations. The resulting control strategies are updated in real time in an interactive manner without requiring any information about the dynamics of the underlying systems. Adaptive critics are employed to approximate the optimal value functions and the associated control strategies in real time. The proposed adaptive tracking mechanism is illustrated in simulation by controlling a flexible-wing aircraft in an uncertain aerodynamic learning environment.
32

Song R, Xie Y, Zhang Z. Data-driven finite-horizon optimal tracking control scheme for completely unknown discrete-time nonlinear systems. Neurocomputing 2019. [DOI: 10.1016/j.neucom.2019.05.026]
33
Abstract
Cooperation between humans and robots is becoming increasingly important in our society. Consequently, there is growing interest in models that can enhance and enrich the interaction between humans and robots. A key challenge in the Human-Robot Interaction (HRI) field is to provide robots with cognitive and affective capabilities by developing architectures that let them establish empathetic relationships with users. Over the last several years, multiple models have been proposed to address this open challenge, and this work surveys the most relevant of them. In detail, it offers an overview of the architectures in the literature, focusing on three specific aspects of HRI: the development of adaptive behavioral models, the design of cognitive architectures, and the ability to establish empathy with the user. The research was conducted within two databases: Scopus and Web of Science. Accurate exclusion criteria were applied to screen the 4916 articles found; in the end, 56 articles were selected. Each work's model is evaluated, and its pros and cons are detailed by analyzing the aspects that can be improved to establish an enjoyable interaction between robots and users.
34

Hentout A, Aouache M, Maoudj A, Akli I. Human–robot interaction in industrial collaborative robotics: a literature review of the decade 2008–2017. Adv Robot 2019. [DOI: 10.1080/01691864.2019.1636714]
Affiliation(s)
- Abdelfetah Hentout, Centre de Développement des Technologies Avancées (CDTA), Algiers, Algeria; Division Productique et Robotique (DPR), Algiers, Algeria
- Mustapha Aouache, Centre de Développement des Technologies Avancées (CDTA), Algiers, Algeria; Division Telecom (DT), Algiers, Algeria
- Abderraouf Maoudj, Centre de Développement des Technologies Avancées (CDTA), Algiers, Algeria; Division Productique et Robotique (DPR), Algiers, Algeria
- Isma Akli, Centre de Développement des Technologies Avancées (CDTA), Algiers, Algeria; Division Productique et Robotique (DPR), Algiers, Algeria

35

Senoo T, Murakami K, Ishikawa M. Deformation Control of a Manipulator Based on the Zener Model. JOURNAL OF ROBOTICS AND MECHATRONICS 2019. [DOI: 10.20965/jrm.2019.p0263]
Abstract
In this study, passive dynamic control of a manipulator is designed and realized. According to the control strategy, the shift in the position and orientation of an end effector attributable to an external force is regarded as deformation of the robot. The Zener model, known as a standard linear solid model, is used to generate the deformable behavior, which describes the combination of plastic and elastic deformation. Based on the relation analysis between the Zener model and two other deformable models, two types of control methods are proposed in terms of the model’s expression. Physical simulations with a robotic arm are executed to validate the proposed control laws.
36

Yu J, Ji J, Miao Z, Zhou J. Neural network-based region reaching formation control for multi-robot systems in obstacle environment. Neurocomputing 2019. [DOI: 10.1016/j.neucom.2018.12.051]
37

Improved learning algorithm for two-layer neural networks for identification of nonlinear systems. Neurocomputing 2019. [DOI: 10.1016/j.neucom.2018.10.008]
38

Ghannadi B, Sharif Razavian R, McPhee J. Configuration-Dependent Optimal Impedance Control of an Upper Extremity Stroke Rehabilitation Manipulandum. Front Robot AI 2018; 5:124. [PMID: 33501003 PMCID: PMC7805823 DOI: 10.3389/frobt.2018.00124]
Abstract
Robots are becoming a popular means of rehabilitation since they can decrease the laborious work of a therapist and the associated costs while providing well-controlled, repeatable tasks. Many researchers have postulated that human motor control can be represented mathematically using optimal control theories, whereby some cost function is effectively maximized or minimized; such abilities, however, are compromised in stroke patients. In this study, a rehabilitation robot controller has been developed using optimal control theory to promote stroke rehabilitation. Despite numerous studies of control strategies for rehabilitation, only a limited number of rehabilitation robots use optimal control theory. The main idea of this work is to show that impedance control gains cannot be kept constant for optimal performance of the robot under a feedback linearization approach. Hence, a general method for the real-time, optimal impedance control of an end-effector-based rehabilitation robot is proposed. The controller is developed for a 2 degree-of-freedom upper extremity stroke rehabilitation robot and compared to a feedback linearization approach that uses the standard optimal impedance derived from covariance propagation equations. The new method assigns optimal impedance gains at each configuration of the robot while performing a rehabilitation task. The proposed controller is a linear quadratic regulator mapped from the operational space to the joint space. The parameters of the two controllers were tuned using a unified biomechatronic model of the human and robot, and the controllers' performance was compared while operating the robot under four conditions of human movement (impaired, healthy, delayed, and time-advanced) along a reference trajectory, in both simulations and experiments. Despite the idealized and approximate nature of the human-robot model, the proposed controller worked well in experiments. Simulation and experimental results showed that, compared to the standard optimal controller, the rehabilitation system with the proposed optimal controller assists more in active-assist therapy while resisting in the active-constrained case. Furthermore, in passive therapy, the proposed optimal controller keeps the position error and interaction forces in safer regions. This is the result of updating the impedance in the operational space using a linear time-variant impedance model.
39

Mu C, Wang D, He H. Data-Driven Finite-Horizon Approximate Optimal Control for Discrete-Time Nonlinear Systems Using Iterative HDP Approach. IEEE TRANSACTIONS ON CYBERNETICS 2018; 48:2948-2961. [PMID: 29028219 DOI: 10.1109/tcyb.2017.2752845]
Abstract
This paper presents a data-based finite-horizon optimal control approach for discrete-time nonlinear affine systems. Iterative adaptive dynamic programming (ADP) is used to approximately solve the Hamilton-Jacobi-Bellman equation by minimizing the cost function in finite time. The idea is implemented with heuristic dynamic programming (HDP) involving a model network, so that the iterative control at the first step can be obtained without knowledge of the system function; meanwhile, the action network is used to obtain the approximate optimal control law, and the critic network is utilized to approximate the optimal cost function. The convergence of the iterative ADP algorithm and the stability of the weight estimation errors based on the HDP structure are analyzed in depth. Finally, two simulation examples are provided to demonstrate the theoretical results and show the performance of the proposed method.
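For the linear-quadratic special case, the backward Bellman recursion that iterative ADP approximates for general nonlinear systems can be solved exactly. A scalar sketch of that textbook special case (not the paper's neural-network implementation) is:

```python
def finite_horizon_lqr(a, b, q, r, horizon):
    """Backward dynamic-programming (Riccati) recursion for the scalar
    system x[k+1] = a*x[k] + b*u[k] with stage cost q*x^2 + r*u^2.

    Returns the time-varying feedback gains for u[k] = -K[k]*x[k].
    """
    p = q            # terminal cost weight
    gains = []
    for _ in range(horizon):
        k = (b * p * a) / (r + b * p * b)   # optimal gain at this stage
        p = q + a * p * a - a * p * b * k   # Riccati backward step
        gains.append(k)
    gains.reverse()                         # gains[0] applies at k = 0
    return gains

# Hypothetical unstable scalar plant; early gains settle to the
# stationary (infinite-horizon) value, late gains shrink.
gains = finite_horizon_lqr(a=1.1, b=1.0, q=1.0, r=1.0, horizon=20)
```

The point of the finite horizon shows up in the gain schedule: far from the terminal time the recursion has converged to the stationary gain, while near the end the gains drop because future cost no longer accumulates.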
40

Decentralized robust optimal control for modular robot manipulators via critic-identifier structure-based adaptive dynamic programming. Neural Comput Appl 2018. [DOI: 10.1007/s00521-018-3714-8]
41

42

43

Learning assistive strategies for exoskeleton robots from user-robot physical interaction. Pattern Recognit Lett 2017. [DOI: 10.1016/j.patrec.2017.04.007]
44

General value iteration based reinforcement learning for solving optimal tracking control problem of continuous-time affine nonlinear systems. Neurocomputing 2017. [DOI: 10.1016/j.neucom.2017.03.038]
45

Variable Admittance Control Based on Fuzzy Reinforcement Learning for Minimally Invasive Surgery Manipulator. SENSORS 2017; 17:s17040844. [PMID: 28417944 PMCID: PMC5424721 DOI: 10.3390/s17040844]
Abstract
In order to get natural and intuitive physical interaction in the pose adjustment of the minimally invasive surgery manipulator, a hybrid variable admittance model based on Fuzzy Sarsa(λ)-learning is proposed in this paper. The proposed model provides continuous variable virtual damping to the admittance controller to respond to human intentions, and it effectively enhances the comfort level during the task execution by modifying the generated virtual damping dynamically. A fuzzy partition defined over the state space is used to capture the characteristics of the operator in physical human-robot interaction. For the purpose of maximizing the performance index in the long run, according to the identification of the current state input, the virtual damping compensations are determined by a trained strategy which can be learned through the experience generated from interaction with humans, and the influence caused by humans and the changing dynamics in the robot are also considered in the learning process. To evaluate the performance of the proposed model, some comparative experiments in joint space are conducted on our experimental minimally invasive surgical manipulator.
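The admittance relation that such a controller modulates can be sketched as follows. The `variable_damping` schedule is a hypothetical stand-in for the learned Fuzzy Sarsa(λ) policy, and all numeric parameters are assumptions for illustration.

```python
def admittance_step(f_ext, v, damping, mass=2.0, dt=0.01):
    """One Euler step of the admittance model  m*dv/dt + d*v = f_ext.

    f_ext   : measured human interaction force (N)
    v       : current commanded velocity (m/s)
    damping : virtual damping d (N*s/m), the quantity varied online
    Returns the next commanded velocity.
    """
    dv = (f_ext - damping * v) / mass
    return v + dv * dt

def variable_damping(v, d_min=5.0, d_max=40.0, v_ref=0.2):
    """Hypothetical stand-in for the learned policy: high damping at low
    speed (precise positioning), low damping at high speed (easy motion)."""
    scale = max(0.0, 1.0 - abs(v) / v_ref)
    return d_min + (d_max - d_min) * scale

v = 0.0
for _ in range(200):                 # constant 5 N human push for 2 s
    v = admittance_step(5.0, v, variable_damping(v))
```

Lowering the virtual damping as the operator speeds up lets the same force produce a larger velocity, which is the comfort effect the abstract attributes to the learned damping compensation.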
46

Erden MS, Billard A. Robotic Assistance by Impedance Compensation for Hand Movements While Manual Welding. IEEE TRANSACTIONS ON CYBERNETICS 2016; 46:2459-2472. [PMID: 26452294 DOI: 10.1109/tcyb.2015.2478656]
Abstract
In this paper, we present a robotic assistance scheme which allows for impedance compensation with stiffness, damping, and mass parameters for hand manipulation tasks and we apply it to manual welding. The impedance compensation does not assume a preprogrammed hand trajectory. Rather, the intention of the human for the hand movement is estimated in real time using a smooth Kalman filter. The movement is restricted by compensatory virtual impedance in the directions perpendicular to the estimated direction of movement. With airbrush painting experiments, we test three sets of values for the impedance parameters as inspired from impedance measurements with manual welding. We apply the best of the tested sets for assistance in manual welding and perform welding experiments with professional and novice welders. We contrast three conditions: 1) welding with the robot's assistance; 2) with the robot when the robot is passive; and 3) welding without the robot. We demonstrate the effectiveness of the assistance through quantitative measures of both task performance and perceived user's satisfaction. The performance of both the novice and professional welders improves significantly with robotic assistance compared to welding with a passive robot. The assessment of user satisfaction shows that all novice and most professional welders appreciate the robotic assistance as it suppresses the tremors in the directions perpendicular to the movement for welding.
47

Modares H, Lewis FL, Jiang ZP. H∞ tracking control of completely unknown continuous-time systems via off-policy reinforcement learning. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2015; 26:2550-2562. [PMID: 26111401 DOI: 10.1109/tnnls.2015.2441749]
Abstract
This paper deals with the design of an H ∞ tracking controller for nonlinear continuous-time systems with completely unknown dynamics. A general bounded L2 -gain tracking problem with a discounted performance function is introduced for the H ∞ tracking. A tracking Hamilton-Jacobi-Isaac (HJI) equation is then developed that gives a Nash equilibrium solution to the associated min-max optimization problem. A rigorous analysis of bounded L2 -gain and stability of the control solution obtained by solving the tracking HJI equation is provided. An upper-bound is found for the discount factor to assure local asymptotic stability of the tracking error dynamics. An off-policy reinforcement learning algorithm is used to learn the solution to the tracking HJI equation online without requiring any knowledge of the system dynamics. Convergence of the proposed algorithm to the solution to the tracking HJI equation is shown. Simulation examples are provided to verify the effectiveness of the proposed method.