Learning Optimal Scheduling Policy for Remote State Estimation Under Uncertain Channel Condition

被引:30
作者
Wu, Shuang [1 ]
Ren, Xiaoqiang [2 ]
Jia, Qing-Shan [3 ]
Johansson, Karl Henrik [4 ]
Shi, Ling [1 ]
机构
[1] Hong Kong Univ Sci & Technol, Dept Elect & Comp Engn, Hong Kong, Peoples R China
[2] Shanghai Univ, Sch Mechatron Engn & Automat, Shanghai 200444, Peoples R China
[3] Tsinghua Univ, Beijing Natl Res Ctr Informat Sci & Technol, Dept Automat, Ctr Intelligent & Networked Syst, Beijing 100084, Peoples R China
[4] KTH Royal Inst Technol, Sch Elect Engn & Comp Sci, S-11428 Stockholm, Sweden
来源
IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS | 2020年 / 7卷 / 02期
基金
中国国家自然科学基金; 瑞典研究理事会;
关键词
Learning algorithm; scheduling; state estimation; threshold structure;
D O I
10.1109/TCNS.2019.2959162
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We consider optimal sensor scheduling with unknown communication channel statistics. We formulate two types of scheduling problems with the communication rate being a soft or hard constraint, respectively. We first present some structural results on the optimal scheduling policy using dynamic programming and assuming that the channel statistics is known. We prove that the Q-factor is monotonic and submodular, which leads to thresholdlike structures in both problems. Then, we develop a stochastic approximation and parameter learning frameworks to deal with the two scheduling problems with unknown channel statistics. We utilize their structures to design specialized learning algorithms. We, then prove the convergence of these algorithms. Performance improvement compared with the standard Q-learning algorithm is shown through numerical examples, which will also discuss an alternative method based on recursive estimation of the channel quality.
引用
收藏
页码:579 / 591
页数:13
相关论文
共 31 条