Optimized tracking control using reinforcement learning strategy for a class of nonlinear systems

Cited by: 12
Authors
Yang, Xue [1 ]
Li, Bin [1 ]
Affiliations
[1] Qilu Univ Technol, Shandong Acad Sci, Sch Math & Stat, Jinan 250353, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Lyapunov function; neural networks (NNs); nonlinear systems; optimal control; reinforcement learning (RL); CONSTRAINED OPTIMAL-CONTROL; BACKSTEPPING CONTROL; ALGORITHM; DESIGN; STATE;
DOI
10.1002/asjc.2866
Chinese Library Classification (CLC)
TP [Automation Technology, Computer Technology]
Discipline Code
0812
Abstract
This paper develops a simplified optimized tracking control scheme using a reinforcement learning (RL) strategy for a class of nonlinear systems. Because a nonlinear control gain function is included in the system model, extending existing RL-based optimal methods to tracking control is challenging: their algorithms are complex, and they require several strict conditions to be satisfied. Unlike existing RL-based optimal methods, which derive the actor and critic training laws from the square of the Bellman residual error, a complex function consisting of multiple nonlinear terms, the proposed optimized scheme derives the two RL training laws from the negative gradient of a simple positive function, so the algorithm is significantly simplified. Moreover, the actor and critic in RL are constructed by employing neural networks (NNs) to approximate the solution of the Hamilton-Jacobi-Bellman (HJB) equation. Finally, the feasibility of the proposed method is demonstrated by both Lyapunov stability theory and a simulation example.
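The central simplification described above, replacing squared-Bellman-residual gradients with the negative gradient of a simple positive function as the actor/critic training law, can be sketched in miniature. The sketch below is NOT the paper's exact algorithm: the plant, the NN basis phi, the particular positive function P, and all learning gains are illustrative assumptions chosen only to show the update structure W_dot = -gamma * dP/dW.

```python
import numpy as np

def phi(x):
    """Assumed NN basis for both actor and critic: polynomial features."""
    return np.array([x, x**2])

def simulate(steps=2000, dt=0.01, gamma_c=5.0, gamma_a=5.0):
    """Toy actor-critic run on a scalar plant xdot = -x + u.

    Weight updates follow the negative gradient of an assumed simple
    positive function P(Wc, Wa) = 0.5*|Wc|^2 + 0.5*|Wa - Wc|^2, in the
    spirit of the scheme described in the abstract (the paper's actual
    P and laws differ).
    """
    x = 1.0                      # plant state
    Wc = np.array([0.5, 0.5])    # critic NN weight estimate
    Wa = np.array([2.0, -1.0])   # actor NN weight estimate
    for _ in range(steps):
        u = -Wa @ phi(x)                 # actor control law (assumed form)
        x += dt * (-x + u)               # Euler step of the toy plant
        dP_dWc = 2.0 * Wc - Wa           # gradient of P w.r.t. critic weights
        dP_dWa = Wa - Wc                 # gradient of P w.r.t. actor weights
        Wc += dt * (-gamma_c * dP_dWc)   # critic law: negative gradient of P
        Wa += dt * (-gamma_a * dP_dWa)   # actor law: negative gradient of P
    return x, Wc, Wa
```

Because each training law is a plain gradient-descent step on one positive scalar function, no term-by-term differentiation of a multi-term Bellman residual is needed, which is the computational simplification the abstract claims.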
Pages: 2095-2104 (10 pages)