Optimal Active Fault Diagnosis by Temporal-Difference Learning

被引:0
作者
Skach, Jan [1 ]
Puncochar, Ivo [1 ]
Lewis, Frank L. [2 ]
机构
[1] Univ West Bohemia, Fac fo Appl Sci, European Ctr Excellence NTIS, Univ 8, Plzen 30614, Czech Republic
[2] Univ Texas Arlington, UTA Res Inst UTARI, Ft Worth, TX 76118 USA
来源
2016 IEEE 55TH CONFERENCE ON DECISION AND CONTROL (CDC) | 2016年
关键词
DESIGN;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, a novel solution to the active fault diagnosis problem for stochastic linear Markovian switching systems on the infinite-time horizon is proposed. The imperfect state information problem of designing an active fault detector that minimizes a general detection cost criterion is reformulated as the perfect state information problem using sufficient statistics. The reformulation decreases theoretical complexity and enables to find a suboptimal solution by dynamic programming. However, classical approaches are computationally complex or fail to identify the most representative states of the system. This paper combines the active fault detection, state estimation, and reinforcement learning. In the proposed algorithm, temporal difference learning is used to train the active fault detector based on input-output data from the system simulation. The designed detector can be then used online. A numerical example is presented to verify the proposed algorithm.
引用
收藏
页码:2146 / 2151
页数:6
相关论文
共 21 条
[1]  
[Anonymous], 2013, Optimal adaptive control and differential games by reinforcement learning principles
[2]  
Bertsekas D. P., 1996, NEURODYNAMIC PROGRAM
[3]  
Bertsekas D.P., 2001, DYNAMIC PROGRAMMING, V2
[4]  
Bertsekas D. P., 2000, DYNAMIC PROGRAMMING, VI
[5]  
Bertsekas DP, 1995, PROCEEDINGS OF THE 34TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-4, P560, DOI 10.1109/CDC.1995.478953
[6]   Active Estimation for Jump Markov Linear Systems [J].
Blackmore, Lars ;
Rajamanoharan, Senthooran ;
Williams, Brian C. .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2008, 53 (10) :2223-2236
[7]  
Campbell S.L., 2004, PRIN SER APPL MATH
[8]  
Gao ZW, 2015, IEEE T IND ELECTRON, V62, P3768, DOI [10.1109/TIE.2015.2417501, 10.1109/TIE.2015.2419013]
[9]  
GARCES F, 2003, STRATEGIES FEEDBACK
[10]  
Howard R., 1960, DYNAMIC PROGRAMMING