On Robust Model-Free Reduced-Dimensional Reinforcement Learning Control for Singularly Perturbed Systems

被引:0
作者
Mukherjee, Sayak [1 ]
Bai, He [2 ]
Chakrabortty, Aranya [1 ]
机构
[1] North Carolina State Univ, Dept Elect & Comp Engn, Raleigh, NC 27695 USA
[2] Oklahoma State Univ, Sch Mech & Aerosp Engn, Stillwater, OK 74078 USA
来源
2020 AMERICAN CONTROL CONFERENCE (ACC) | 2020年
关键词
Reinforcement learning (RL); singular perturbation; reduced-dimensional control; dynamic uncertainties; robustness; input-to-state stability (ISS);
D O I
10.23919/acc45564.2020.9147523
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We present a robust design for reinforcement learning (RL) based optimal control of continuous-time linear time-invariant singularly perturbed (SP) dynamic systems in the presence of dynamic uncertainties. We consider the dynamic model of both the plant and the uncertainty to be unknown. Assuming that the uncertainty satisfies an input-to-state stability (ISS) condition, we propose a variant of the adaptive dynamic programming (ADP) method that learns a sub-optimal controller using measurements of only the slow states of the plant. The resulting RL controller is, therefore, significantly reduced-dimensional, and enjoys reduced learning time. We illustrate our design with simulations of a SP system and of a clustered multi-agent consensus network.
引用
收藏
页码:3914 / 3919
页数:6
相关论文
共 14 条
[1]  
[Anonymous], 2007, Approximate Dynamic Programming: Solving the curses of
[2]  
Boker A., 2015, AM CONTR C
[3]   DECOMPOSITION OF NEAR-OPTIMUM REGULATORS FOR SYSTEMS WITH SLOW AND FAST MODES [J].
CHOW, JH ;
KOKOTOVIC, PV .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1976, 21 (05) :701-705
[4]  
Dashkovskiy S, 2005, IEEE DECIS CONTR P, P5633
[5]  
Jiang Y., 2017, Robust adaptive dynamic programming
[6]  
Jiang Y, 2013, IEEE SIG PROC MED
[7]   Computational adaptive optimal control for continuous-time linear systems with completely unknown dynamics [J].
Jiang, Yu ;
Jiang, Zhong-Ping .
AUTOMATICA, 2012, 48 (10) :2699-2704
[8]   SMALL-GAIN THEOREM FOR ISS SYSTEMS AND APPLICATIONS [J].
JIANG, ZP ;
TEEL, AR ;
PRALY, L .
MATHEMATICS OF CONTROL SIGNALS AND SYSTEMS, 1994, 7 (02) :95-120
[9]   CONTROL STRATEGIES FOR DECISION MAKERS USING DIFFERENT MODELS OF SAME SYSTEM [J].
KHALIL, HK ;
KOKOTOVIC, PV .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1978, 23 (02) :289-298
[10]  
Khalil HK, 2002, Nonlinear systems, V3