Stochastic Optimal Control of Unknown Linear Networked Control System using Q-Learning Methodology

Cited by: 0

Authors:

Xu, Hao [1]

Jagannathan, S. [1]

Affiliations:

[1] Missouri Univ Sci & Technol, Dept Elect & Comp Engn, Rolla, MO 65409 USA

Source:

2011 AMERICAN CONTROL CONFERENCE | 2011

Keywords:

Networked Control System (NCS); Q-function; Adaptive Estimator (AE); Optimal Control;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology]

Subject classification code:

0812

Abstract:

In this paper, the Bellman equation is used forward in time for the stochastic optimal control of a Networked Control System (NCS) with unknown system dynamics in the presence of unknown random delays and packet losses. The proposed stochastic optimal control approach, commonly referred to as adaptive dynamic programming, uses an adaptive estimator (AE) and ideas from Q-learning to solve the infinite-horizon optimal regulation problem for an NCS with unknown system dynamics. Update laws are derived for tuning the unknown parameters of the AE online to obtain the time-based Q-function. Lyapunov theory is used to show that all signals are asymptotically stable (AS) and that the approximated control signals converge to the optimal control inputs. Simulation results are included to show the effectiveness of the proposed scheme.


Pages: 2819-2824

Number of pages: 6

