Optimal redundant transmission scheduling for remote state estimation via reinforcement learning approach

被引：2

作者：

Jia, Yijin ^{[1
]}

Yang, Lixin ^{[2
]}

Zhao, Yao ^{[1
]}

Li, Jun-Yi ^{[1
]}

Lv, Weijun ^{[1
]}

机构：

[1] Guangdong Univ Technol, Sch Automat, Prov Key Lab Intelligent Decis & Cooperat Control, Guangzhou 510006, Peoples R China

[2] Queensland Univ Technol, Sch Elect Engn & Robot, Brisbane, Qld 4000, Australia

来源：

NEUROCOMPUTING | 2024年 / 576卷

基金：

中国国家自然科学基金;

关键词：

Redundant transmission scheduling; Remote state estimation; Markov decision process; Reinforcement learning; NETWORKED CONTROL-SYSTEMS; NEURAL-NETWORKS; DELAYS;

D O I：

10.1016/j.neucom.2024.127337

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper studies the optimal redundant transmission scheduling for remote state estimation. Multiple smart sensors observe some systems, and transmit the local state estimates via independent channels to a remote estimator (RE), where packet losses may occur with some particular probabilities. To improve the estimation performance, some redundant channels are adopted for the data transmission. Since the number of redundant channels is fixed, the optimal redundant scheduling for multiple sensors is worth investigating to determine how to allocate the redundant channels. To address this problem, the redundant transmission scheduling is modeled as a Markov decision process (MDP) to minimize the estimation error for all systems. By constructing a sufficient condition, one ensures that the MDP has an optimal deterministic and stationary policy. Meanwhile, the threshold structure of the redundant transmission scheduling policy is verified to further decrease the complexity of the calculation. Reinforcement learning (RL) is used for this problem, and a near -optimal policy is obtained by dueling double -deep Q -networks (D3QN) algorithm. Finally, an illustrative simulation is presented to demonstrate its effectiveness.

引用

页数：9

共 42 条

[11] An Autonomous Sigfox Wireless Sensor Node for Environmental Monitoring [J].

Joris, Laura ;

Dupont, Francois ;

Laurent, Philippe ;

Bellier, Pierre ;

Stoukatch, Serguei ;

Redoute, Jean-Michel .

IEEE SENSORS LETTERS, 2019, 3 (07)

[12] A Strategy for Elimination of Data Redundancy in Internet of Things (IoT) Based Wireless Sensor Network (WSN) [J].

Kumar, Shishupal ;

Chaurasiya, Vijay Kumar .

IEEE SYSTEMS JOURNAL, 2019, 13 (02) :1650-1657

[13] Deep reinforcement learning for wireless sensor scheduling in cyber-physical systems [J].

Leong, Alex S. ;

Ramaswamy, Arunselvan ;

Quevedo, Daniel E. ;

Karl, Holger ;

Shi, Ling .

AUTOMATICA, 2020, 113

[14]

Leong AS, 2015, 2015 EUROPEAN CONTROL CONFERENCE (ECC), P927, DOI 10.1109/ECC.2015.7330661

[15] An overview of packet reordering in Transmission Control Protocol (TCP): Problems, solutions, and challenges [J].

Leung, Ka-Cheong ;

Li, Victor O. K. ;

Yang, Daiqin .

IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2007, 18 (04) :522-535

[16] Analysis and Improvement of Send-and-Wait Automatic Repeat-reQuest Protocols for Wireless Sensor Networks [J].

Liu, Yuxin ;

Liu, Anfeng ;

Chen, Zhigang .

WIRELESS PERSONAL COMMUNICATIONS, 2015, 81 (03) :923-959

[17] Noisy Sensor Scheduling in Wireless Networked Control Systems: Freshness or Precision [J].

Ma, He ;

Zhou, Shidong .

IEEE WIRELESS COMMUNICATIONS LETTERS, 2022, 11 (05) :1107-1111

[18] Redundant data transmission in control/estimation over lossy networks [J].

Mesquita, Alexandre R. ;

Hespanha, Joao P. ;

Nair, Girish N. .

AUTOMATICA, 2012, 48 (08) :1612-1620

[19] Human-level control through deep reinforcement learning [J].

Mnih, Volodymyr ;

Kavukcuoglu, Koray ;

Silver, David ;

Rusu, Andrei A. ;

Veness, Joel ;

Bellemare, Marc G. ;

Graves, Alex ;

Riedmiller, Martin ;

Fidjeland, Andreas K. ;

Ostrovski, Georg ;

Petersen, Stig ;

Beattie, Charles ;

Sadik, Amir ;

Antonoglou, Ioannis ;

King, Helen ;

Kumaran, Dharshan ;

Wierstra, Daan ;

Legg, Shane ;

Hassabis, Demis .

NATURE, 2015, 518 (7540) :529-533

[20]

Ni YQ, 2017, ASIA CONTROL CONF AS, P934, DOI 10.1109/ASCC.2017.8287296

← 1 2 3 4 5 →