1
|
Li M, Wang D, Zhao M, Qiao J. Event-triggered constrained neural critic control of nonlinear continuous-time multiplayer nonzero-sum games. Inf Sci (N Y) 2023. [DOI: 10.1016/j.ins.2023.02.081] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/05/2023]
|
2
|
Han H, Zhang J, Yang H, Hou Y, Qiao J. Data-Driven Robust Optimal Control for Nonlinear System with Uncertain Disturbances. Inf Sci (N Y) 2022. [DOI: 10.1016/j.ins.2022.11.092] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
3
|
Fan QY, Wang D, Xu B. H ∞ Codesign for Uncertain Nonlinear Control Systems Based on Policy Iteration Method. IEEE TRANSACTIONS ON CYBERNETICS 2022; 52:10101-10110. [PMID: 33877997 DOI: 10.1109/tcyb.2021.3065995] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
In this article, the problem of H∞ codesign for nonlinear control systems with unmatched uncertainties and adjustable parameters is investigated. The main purpose is to solve the adjustable parameters and H∞ controller simultaneously so that better robust control performance can be achieved. By introducing a bounded function and defining a special cost function, the problem of solving the Hamilton-Jacobi-Isaacs equation is transformed into an optimization problem with nonlinear inequality constraints. Based on the sum of squares technique, a novel policy iteration algorithm is proposed to solve the problem of the H∞ codesign. Moreover, one modified algorithm for optimizing the robust performance index is given. The convergence and the performance improvement of new iteration policy algorithms are proved. Simulation results are presented to demonstrate the effectiveness of the proposed algorithms.
Collapse
|
4
|
Wang X, Deng H, Ye X. Model-free nonlinear robust control design via online critic learning. ISA TRANSACTIONS 2022; 129:446-459. [PMID: 34983736 DOI: 10.1016/j.isatra.2021.12.017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/12/2021] [Revised: 12/10/2021] [Accepted: 12/11/2021] [Indexed: 06/14/2023]
Abstract
Online critic learning or solving robust control problems of complex systems usually requires knowledge about system dynamics. In order to achieve these goals in data-driven method, a new performance index related to the decreasing rate of the conventional cost is designed. The corresponding optimal control policy can be approximated online using a new actor-critic scheme with three neural networks, without depending on initial stable control and knowledge about system dynamics. The learning process and the learned control policy show excellent robustness. Numerical simulations and an inverted pendulum experiment show that compared with benchmark methods, the proposed method relaxes the dependence on initial admissible control and exhibits better disturbance attenuation performance.
Collapse
Affiliation(s)
- Xiaoyang Wang
- College of Intelligent Systems Science and Engineering, Harbin Engineering University, 150001 Harbin, China.
| | - Hao Deng
- Pengcheng Laboratory, 518055 Shenzhen, China.
| | - Xiufen Ye
- College of Intelligent Systems Science and Engineering, Harbin Engineering University, 150001 Harbin, China.
| |
Collapse
|
5
|
Goal representation adaptive critic design for discrete-time uncertain systems subjected to input constraints: The event-triggered case. Neurocomputing 2022. [DOI: 10.1016/j.neucom.2021.12.057] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
|
6
|
Fu Y, Hong C, Fu J, Chai T. Approximate Optimal Tracking Control of Nondifferentiable Signals for a Class of Continuous-Time Nonlinear Systems. IEEE TRANSACTIONS ON CYBERNETICS 2022; 52:4441-4450. [PMID: 33141675 DOI: 10.1109/tcyb.2020.3027344] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
In this article, for a class of continuous-time nonlinear nonaffine systems with unknown dynamics, a robust approximate optimal tracking controller (RAOTC) is proposed in the framework of adaptive dynamic programming (ADP). The distinguishing contribution of this article is that a new Lyapunov function is constructed, by using which the derivative information of tracking errors is not required in computing its time derivative along with the solution of the closed-loop system. Thus, the proposed method can make the system states follow nondifferentiable reference signals, which removes the common assumption that the reference signals have to be continuous for tracking control of continuous-time nonlinear systems in the literature. The theoretical analysis, simulation, and application results well illustrate the effectiveness and superiority of the proposed method.
Collapse
|
7
|
Zhang S, Zhao B, Zhang Y. Event-triggered control for input constrained non-affine nonlinear systems based on neuro-dynamic programming. Neurocomputing 2021. [DOI: 10.1016/j.neucom.2021.01.116] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
8
|
Wu A, Liu H, Zeng Z. Observer Design and H ∞ Performance for Discrete-Time Uncertain Fuzzy-Logic Systems. IEEE TRANSACTIONS ON CYBERNETICS 2021; 51:2398-2408. [PMID: 31725404 DOI: 10.1109/tcyb.2019.2948562] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
The observer design and H∞ algorithm are proposed for the discrete-time fuzzy-logic systems in this article. The considered fuzzy-logic systems are subject to parameter uncertainty and unmeasurable state variables. To appropriately deal with parameter uncertainty and unmeasurable state variables, we first disintegrate the space of premise variables, then the total partitioned regions will be divided into two kinds: 1) crisp regions and 2) fuzzy regions. With the aid of partitioned regions, piecewise fuzzy H∞ observers are presented. Availability of the piecewise fuzzy H∞ observers gives us an accurate picture of the overall evolution of the augmented systems leading therefore to an improved situational awareness for the noncoordination of premise variables and external interference and, hence, the augmented systems achieve the performance conditions: asymptotic convergence and H∞ performance. The simulation results of the proposed approach show the accuracy of the resulting state estimates.
Collapse
|
9
|
Liang Y, Zhang H, Duan J, Sun S. Event-triggered reinforcement learning H∞control design for constrained-input nonlinear systems subject to actuator failures. Inf Sci (N Y) 2021. [DOI: 10.1016/j.ins.2020.07.055] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
|
10
|
Event-triggered constrained control with DHP implementation for nonaffine discrete-time systems. Inf Sci (N Y) 2020. [DOI: 10.1016/j.ins.2020.01.020] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
|
11
|
Wang D, Qiao J. Approximate neural optimal control with reinforcement learning for a torsional pendulum device. Neural Netw 2019; 117:1-7. [DOI: 10.1016/j.neunet.2019.04.026] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2019] [Revised: 04/10/2019] [Accepted: 04/30/2019] [Indexed: 11/16/2022]
|
12
|
Wen G, Ge SS, Chen CLP, Tu F, Wang S. Adaptive Tracking Control of Surface Vessel Using Optimized Backstepping Technique. IEEE TRANSACTIONS ON CYBERNETICS 2019; 49:3420-3431. [PMID: 29994688 DOI: 10.1109/tcyb.2018.2844177] [Citation(s) in RCA: 39] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
In this paper, a tracking control approach for surface vessel is developed based on the new control technique named optimized backstepping (OB), which considers optimization as a backstepping design principle. Since surface vessel systems are modeled by second-order dynamic in strict feedback form, backstepping is an ideal technique for finishing the tracking task. In the backstepping control of surface vessel, the virtual and actual controls are designed to be the optimized solutions of corresponding subsystems, therefore the overall control is optimized. In general, optimization control is designed based on the solution of Hamilton-Jacobi-Bellman equation. However, solving the equation is very difficult or even impossible due to the inherent nonlinearity and complexity. In order to overcome the difficulty, the reinforcement learning (RL) strategy of actor-critic architecture is usually considered, of which the critic and actor are utilized for evaluating the control performance and executing the control behavior, respectively. By employing the actor-critic RL algorithm for both virtual and actual controls of the vessel, it is proven that the desired optimizing and tracking performances can be arrived. Simulation results further demonstrate effectiveness of the proposed surface vessel control.
Collapse
|
13
|
Reinforcement learning for robust adaptive control of partially unknown nonlinear systems subject to unmatched uncertainties. Inf Sci (N Y) 2018. [DOI: 10.1016/j.ins.2018.06.022] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
14
|
Policy iteration based robust co-design for nonlinear control systems with state constraints. Inf Sci (N Y) 2018. [DOI: 10.1016/j.ins.2018.08.006] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
|
15
|
Wen G, Ge SS, Tu F. Optimized Backstepping for Tracking Control of Strict-Feedback Systems. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2018; 29:3850-3862. [PMID: 29993615 DOI: 10.1109/tnnls.2018.2803726] [Citation(s) in RCA: 35] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
In this paper, a control technique named optimized backstepping is first proposed by implementing tracking control for a class of strict-feedback systems, which considers optimization as a design philosophy of the high-order system control. The basic idea is that designing the actual and virtual controls of backstepping is the optimized solutions of the corresponding subsystems so that overall control of the high-order system is optimized. In general, optimization control is designed based on the solution of Hamilton-Jacobi-Bellman equation, but solving the equation is very difficult due to the inherent nonlinearity and intractability. In order to overcome the difficulty, the neural network (NN)-based reinforcement learning strategy of actor-critic architecture is used. In every backstepping step, the actor and critic NNs are constructed for executing control behavior and evaluating control performance, respectively. According to the Lyapunov stability theorem, it is proven that the desired control performance can be obtained. Finally, a simulation example is carried out to further demonstrate the effectiveness of the proposed control approach.
Collapse
|
16
|
Self-learning robust optimal control for continuous-time nonlinear systems with mismatched disturbances. Neural Netw 2018; 99:19-30. [DOI: 10.1016/j.neunet.2017.11.022] [Citation(s) in RCA: 47] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2017] [Revised: 10/16/2017] [Accepted: 11/28/2017] [Indexed: 11/19/2022]
|
17
|
Distributed algorithm for dissensus of a class of networked multiagent systems using output information. Soft comput 2018. [DOI: 10.1007/s00500-016-2332-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
18
|
Wang D, He H, Liu D. Improving the Critic Learning for Event-Based Nonlinear $H_{\infty }$ Control Design. IEEE TRANSACTIONS ON CYBERNETICS 2017; 47:3417-3428. [PMID: 28166513 DOI: 10.1109/tcyb.2017.2653800] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
In this paper, we aim at improving the critic learning criterion to cope with the event-based nonlinear H∞ state feedback control design. First of all, the H∞ control problem is regarded as a two-player zero-sum game and the adaptive critic mechanism is used to achieve the minimax optimization under event-based environment. Then, based on an improved updating rule, the event-based optimal control law and the time-based worst-case disturbance law are obtained approximately by training a single critic neural network. The initial stabilizing control is no longer required during the implementation process of the new algorithm. Next, the closed-loop system is formulated as an impulsive model and its stability issue is handled by incorporating the improved learning criterion. The infamous Zeno behavior of the present event-based design is also avoided through theoretical analysis on the lower bound of the minimal intersample time. Finally, the applications to an aircraft dynamics and a robot arm plant are carried out to verify the efficient performance of the present novel design method.
Collapse
|
19
|
Wang D, He H, Liu D. Adaptive Critic Nonlinear Robust Control: A Survey. IEEE TRANSACTIONS ON CYBERNETICS 2017; 47:3429-3451. [PMID: 28682269 DOI: 10.1109/tcyb.2017.2712188] [Citation(s) in RCA: 80] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
Adaptive dynamic programming (ADP) and reinforcement learning are quite relevant to each other when performing intelligent optimization. They are both regarded as promising methods involving important components of evaluation and improvement, at the background of information technology, such as artificial intelligence, big data, and deep learning. Although great progresses have been achieved and surveyed when addressing nonlinear optimal control problems, the research on robustness of ADP-based control strategies under uncertain environment has not been fully summarized. Hence, this survey reviews the recent main results of adaptive-critic-based robust control design of continuous-time nonlinear systems. The ADP-based nonlinear optimal regulation is reviewed, followed by robust stabilization of nonlinear systems with matched uncertainties, guaranteed cost control design of unmatched plants, and decentralized stabilization of interconnected systems. Additionally, further comprehensive discussions are presented, including event-based robust control design, improvement of the critic learning rule, nonlinear H∞ control design, and several notes on future perspectives. By applying the ADP-based optimal and robust control methods to a practical power system and an overhead crane plant, two typical examples are provided to verify the effectiveness of theoretical results. Overall, this survey is beneficial to promote the development of adaptive critic control methods with robustness guarantee and the construction of higher level intelligent systems.
Collapse
|
20
|
Neural-network-based adaptive guaranteed cost control of nonlinear dynamical systems with matched uncertainties. Neurocomputing 2017. [DOI: 10.1016/j.neucom.2017.03.047] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
21
|
General value iteration based reinforcement learning for solving optimal tracking control problem of continuous–time affine nonlinear systems. Neurocomputing 2017. [DOI: 10.1016/j.neucom.2017.03.038] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
|
22
|
Wang Y, Song Y, Krstic M, Wen C. Adaptive finite time coordinated consensus for high-order multi-agent systems: Adjustable fraction power feedback approach. Inf Sci (N Y) 2016. [DOI: 10.1016/j.ins.2016.08.054] [Citation(s) in RCA: 40] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
23
|
Wang D, Mu C, Zhang Q, Liu D. Event-based input-constrained nonlinear H∞ state feedback with adaptive critic and neural implementation. Neurocomputing 2016. [DOI: 10.1016/j.neucom.2016.07.002] [Citation(s) in RCA: 40] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
24
|
Yang X, Liu D, Luo B, Li C. Data-based robust adaptive control for a class of unknown nonlinear constrained-input systems via integral reinforcement learning. Inf Sci (N Y) 2016. [DOI: 10.1016/j.ins.2016.07.051] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|