Wavelet Reduced Order Observer Based Adaptive Tracking Control for a Class of Uncertain Multiple Time Delay Nonlinear Systems Subjected to Actuator Saturation Using Actor Critic Architecture

被引:2
作者
Sharma, Manish [1 ]
Verma, Ajay [2 ]
机构
[1] Medicaps Inst Technol & Management, Indore 453331, Madhya Pradesh, India
[2] Inst Engn & Technol, Indore 452001, Madhya Pradesh, India
来源
INTERNATIONAL CONFERENCE ON MODELLING OPTIMIZATION AND COMPUTING | 2012年 / 38卷
关键词
Wavelet neural networks; reduced order observer; adaptive control; optimal control; reinforcement learning; Lyapunov Krasovskii functional; multiple point delay; actuator saturation; DESIGN;
D O I
10.1016/j.proeng.2012.06.160
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This Paper investigates the mean to design the reduced order observer and observer based controller for a class of uncertain delayed nonlinear system subjected to actuator saturation using Actor Critic architecture. A new design approach of wavelet based adaptive reduced order observer is proposed. The task of the proposed wavelet adaptive reduced order observer is to identify the unknown system dynamics and to reconstruct the states of the system. Wavelet neural network (WNN) is implemented to approximate the uncertainties present in the system as well as to identify and compensate the nonlinearities introduced in the system due to actuator saturation. Reinforcement learning is applied through Actor-Critic architecture where a separate structure is for both perception (critic) and action (actor). Reinforcement learning is used via two Wavelet Neural networks (WNN), critic WNN and action WNN, which are combined to form an adaptive WNN controller. The critic WNN approximates the "strategic" utility function which is then minimized by the action WNN. Using the feedback control, based on reconstructed states, the behavior of closed loop system is investigated. By Lyapunov-Krasovskii approach, the closed-loop tracking error is proved to be uniformly ultimate bounded. A numerical example is provided to verify the effectiveness of theoretical development. (C) 2012 Published by Elsevier Ltd. Selection and/or peer-review under responsibility of Noorul Islam Centre for Higher Education
引用
收藏
页码:1297 / 1308
页数:12
相关论文
共 18 条
[1]   A stable neural network-based observer with application to flexible-joint manipulators [J].
Abdollahi, F ;
Talebi, HA ;
Patel, RV .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2006, 17 (01) :118-129
[2]  
[Anonymous], 2008, IFAC P, DOI DOI 10.3182/20080706-5-KR-1001.01678
[3]  
Bartolini G., 2008, P 47 IEEE C DEC CONT
[4]  
Crespo LG, 2003, P AMER CONTR CONF, P4219
[5]   Delay-dependent robust stabilization for uncertain singular systems with multiple input delays [J].
Du, Zhao-Ping ;
Zhang, Qing-Ling ;
Liu, Li-Li .
Zidonghua Xuebao/ Acta Automatica Sinica, 2009, 35 (02) :162-167
[6]   An improved stabilization method for linear time-delay systems [J].
Fridman, E ;
Shaked, U .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2002, 47 (11) :1931-1937
[7]   Reinforcement learning neural-network-based controller for nonlinear discrete-time systems with input constraints [J].
He, Pingan ;
Jagannathan, S. .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2007, 37 (02) :425-436
[8]   Adaptive critic anti-slip control of wheeled autonomous robot [J].
Lin, W.-S. ;
Chang, L.-H. ;
Yang, P.-C. .
IET CONTROL THEORY AND APPLICATIONS, 2007, 1 (01) :51-57
[9]   Reduced-order observer-based control design for nonlinear stochastic systems [J].
Liu, YG ;
Zhang, JF .
SYSTEMS & CONTROL LETTERS, 2004, 52 (02) :123-135
[10]   Nonlinear antiwindup applied to Euler-Lagrange systems [J].
Morabito, F ;
Teel, AR ;
Zaccarian, L .
IEEE TRANSACTIONS ON ROBOTICS AND AUTOMATION, 2004, 20 (03) :526-537