Stochastic Optimal Control of Unknown Linear Networked Control System using Q-Learning Methodology

Cited by: 0
Authors
Xu, Hao [1]
Jagannathan, S. [1]
Affiliations
[1] Missouri Univ Sci & Technol, Dept Elect & Comp Engn, Rolla, MO 65409 USA
Source
2011 AMERICAN CONTROL CONFERENCE | 2011
Keywords
Networked Control System (NCS); Q-function; Adaptive Estimator (AE); Optimal Control;
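DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract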
In this paper, the Bellman equation is used forward in time for the stochastic optimal control of a Networked Control System (NCS) with unknown system dynamics in the presence of unknown random delays and packet losses. The proposed stochastic optimal control approach, commonly referred to as adaptive dynamic programming, uses an adaptive estimator (AE) and ideas from Q-learning to solve the infinite-horizon optimal regulation problem for an NCS with unknown system dynamics. Update laws are derived for tuning the unknown parameters of the AE online to obtain the time-based Q-function. Lyapunov theory is used to show that all signals are asymptotically stable (AS) and that the approximated control signals converge to the optimal control inputs. Simulation results are included to show the effectiveness of the proposed scheme.
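To make the Q-function idea in the abstract concrete, the following is a minimal sketch and not the paper's method. It assumes a deterministic, delay-free linear plant with no packet losses, and it uses batch least-squares policy iteration rather than the time-based adaptive estimator update laws the abstract describes. The Q-function is taken as the quadratic form Q(x, u) = z^T H z with z = [x; u], and H is estimated from measured transitions only. The matrices A and B, the cost weights, and all function names are hypothetical, chosen for illustration; A and B are used only to simulate the plant, never by the learner.

```python
import numpy as np

def quad_basis(z):
    """Upper-triangular entries of z z^T, off-diagonals doubled,
    so that quad_basis(z) @ theta == z^T H z for symmetric H."""
    i, j = np.triu_indices(len(z))
    scale = np.where(i == j, 1.0, 2.0)
    return np.outer(z, z)[i, j] * scale

def unpack(theta, size):
    """Rebuild the symmetric kernel H from its upper-triangular entries."""
    H = np.zeros((size, size))
    i, j = np.triu_indices(size)
    H[i, j] = theta
    H[j, i] = theta
    return H

def q_learning_lqr(A, B, Qc, Rc, K0, iters=10, samples=200, seed=0):
    """Policy iteration on the Q-function kernel H (illustrative only).
    K0 must be stabilizing; A and B act as a black-box simulator."""
    rng = np.random.default_rng(seed)
    n, m = B.shape
    K = K0.copy()
    for _ in range(iters):
        Phi, cost = [], []
        for _ in range(samples):
            x = rng.normal(size=n)
            u = -K @ x + rng.normal(size=m)   # exploration noise
            x_next = A @ x + B @ u            # plant response (measured)
            u_next = -K @ x_next              # current policy at next state
            z = np.concatenate([x, u])
            z_next = np.concatenate([x_next, u_next])
            # Bellman equation: z^T H z - z_next^T H z_next = stage cost
            Phi.append(quad_basis(z) - quad_basis(z_next))
            cost.append(x @ Qc @ x + u @ Rc @ u)
        theta, *_ = np.linalg.lstsq(np.array(Phi), np.array(cost), rcond=None)
        H = unpack(theta, n + m)
        # Policy improvement: u = -H_uu^{-1} H_ux x minimizes z^T H z over u
        K = np.linalg.solve(H[n:, n:], H[n:, :n])
    return K, H

# Hypothetical open-loop stable plant, so the zero gain is stabilizing
A = np.array([[0.9, 0.1], [0.0, 0.8]])
B = np.array([[0.0], [1.0]])
Qc = np.eye(2)
Rc = np.array([[1.0]])
K_learned, H_learned = q_learning_lqr(A, B, Qc, Rc, K0=np.zeros((1, 2)))
print("learned gain:", K_learned)
```

In this simplified setting the learned gain can be checked against the gain obtained from the discrete-time Riccati equation. Handling random delays and packet losses, as the paper does, would require an augmented stochastic model that this sketch omits.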
Pages: 2819-2824
Page count: 6
Related papers
50 in total
[31]   An Optimal Hybrid Learning Approach for Attack Detection in Linear Networked Control Systems [J].
Niu, Haifeng ;
Sahoo, Avimanyu ;
Bhowmick, Chandreyee ;
Jagannathan, S. .
IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2019, 6 (06) :1404-1416
[32]   Adaptive Optimal Control via Q-Learning for Ito Fuzzy Stochastic Nonlinear Continuous-Time Systems With Stackelberg Game [J].
Ming, Zhongyang ;
Zhang, Huaguang ;
Yan, Ying ;
Yang, Liu .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2024, 32 (04) :2029-2038
[33]   Iterative Q-Learning for Model-Free Optimal Control With Adjustable Convergence Rate [J].
Wang, Ding ;
Wang, Yuan ;
Zhao, Mingming ;
Qiao, Junfei .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2024, 71 (04) :2224-2228
[34]   Reinforcement Learning for Adaptive Optimal Stationary Control of Linear Stochastic Systems [J].
Pang, Bo ;
Jiang, Zhong-Ping .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2023, 68 (04) :2383-2390
[35]   Stochastic Linear Quadratic Optimal Control Problem: A Reinforcement Learning Method [J].
Li, Na ;
Li, Xun ;
Peng, Jing ;
Xu, Zuo Quan .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2022, 67 (09) :5009-5016
[36]   Linear quadratic optimal sampled data control of linear systems with unknown switched modes and stochastic disturbances [J].
Liu, Feng ;
Li, Peng ;
Lei, ZhiBang ;
Song, Yongduan .
OPTIMAL CONTROL APPLICATIONS & METHODS, 2016, 37 (05) :1085-1100
[37]   Stochastic optimal control and network co-design for networked control systems [J].
Ji, Kun ;
Kim, Won-Jong .
INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2007, 5 (05) :515-525
[38]   Time Delay Dependent Optimal Control of Networked Control System [J].
Ma Hui ;
Yuan Zhongkai ;
Tang Gong-You ;
Zhang Bao-Lin .
2013 32ND CHINESE CONTROL CONFERENCE (CCC), 2013, :6634-6637
[39]   MODELLING AND OPTIMAL CONTROL OF NETWORKED SYSTEMS WITH STOCHASTIC COMMUNICATION PROTOCOLS [J].
Zhu, Chaoqun ;
Yang, Bin ;
Zhu, Xiang .
KYBERNETIKA, 2020, 56 (02) :239-260
[40]   Safe Q-Learning for Data-Driven Nonlinear Optimal Control with Asymmetric State Constraints [J].
Zhao, Mingming ;
Wang, Ding ;
Song, Shijie ;
Qiao, Junfei .
IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2024, 11 (12) :2408-2422