Fault-tolerant optimised tracking control for unknown discrete-time linear systems using a combined reinforcement learning and residual compensation methodology

被引：11

作者：

Han, Ke-Zhen ^{[1
]}

Feng, Jian ^{[1
]}

Cui, Xiaohong ^{[1
]}

机构：

[1] Northeastern Univ, Sch Informat Sci & Engn, Shenyang, Liaoning, Peoples R China

来源：

INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE | 2017年 / 48卷 / 13期

基金：

中国国家自然科学基金;

关键词：

Fault compensation; H theory; reinforcement Q-learning; residual compensation; tracking control; DATA-DRIVEN; DESIGN; DIAGNOSIS;

D O I：

10.1080/00207721.2017.1344890

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper considers the fault-tolerant optimised tracking control (FTOTC) problem for unknown discrete-time linear system. A research scheme is proposed on the basis of data-based parity space identification, reinforcement learning and residual compensation techniques. The main characteristic of this research scheme lies in the parity-space-identification-based simultaneous tracking control and residual compensation. The specific technical line consists of four main contents: apply subspace aided method to design observer-based residual generator; use reinforcement Q-learning approach to solve optimised tracking control policy; rely on robust H theory to achieve noise attenuation; adopt fault estimation triggered by residual generator to perform fault compensation. To clarify the design and implementation procedures, an integrated algorithm is further constructed to link up these four functional units. The detailed analysis and proof are subsequently given to explain the guaranteed FTOTC performance of the proposed conclusions. Finally, a case simulation is provided to verify its effectiveness.

引用

页码：2811 / 2825

页数：15

共 45 条

[1] Model-free Q-learning designs for linear discrete-time zero-sum games with application to H-infinity control [J].

Al-Tamimi, Asma ;

Lewis, Frank L. ;

Abu-Khalaf, Murad .

AUTOMATICA, 2007, 43 (03) :473-481

[2]

[Anonymous], 2012, STAT MONITORING COMP, DOI [DOI 10.1002/9780470517253, 10.1002/9780470517253]

[3]

[Anonymous], 2008, DYNAMIC MODELING PRE

[4]

Blanke M., 2006, Diagnosis and fault-tolerant control

[5] Mode-independent H∞ filters for Markovian jump linear systems [J].

de Souza, Carlos E. ;

Trofino, Alexandre ;

Barbosa, Karina A. .

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2006, 51 (11) :1837-1841

[6]

Ding S., 2008, MODEL BASED FAULT DI

[7] Data-driven design of monitoring and diagnosis systems for dynamic processes: A review of subspace technique based schemes and some recent results [J].

Ding, S. X. .

JOURNAL OF PROCESS CONTROL, 2014, 24 (02) :431-449

[8] Feedback Control Structures, Embedded Residual Signals, and Feedback Control Schemes With an Integrated Residual Access [J].

Ding, S. X. ;

Yang, G. ;

Zhang, P. ;

Ding, E. L. ;

Jeinsch, T. ;

Weinhold, N. ;

Schultalbers, M. .

IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2010, 18 (02) :352-367

[9] Data-driven realizations of kernel and image representations and their application to fault detection and control system design [J].

Ding, Steven X. ;

Yang, Ying ;

Zhang, Yong ;

Li, Linlin .

AUTOMATICA, 2014, 50 (10) :2615-2623

[10]

Ding SX, 2014, ADV IND CONTROL, P1, DOI 10.1007/978-1-4471-6410-4

← 1 2 3 4 5 →