Optimal redundant transmission scheduling for remote state estimation via reinforcement learning approach

Cited by: 2

Authors:

Jia, Yijin ^{[1]}

Yang, Lixin ^{[2]}

Zhao, Yao ^{[1]}

Li, Jun-Yi ^{[1]}

Lv, Weijun ^{[1]}

Affiliations:

[1] Guangdong Univ Technol, Sch Automat, Prov Key Lab Intelligent Decis & Cooperat Control, Guangzhou 510006, Peoples R China

[2] Queensland Univ Technol, Sch Elect Engn & Robot, Brisbane, Qld 4000, Australia

Source:

NEUROCOMPUTING | 2024 / Vol. 576

Funding:

National Natural Science Foundation of China;

Keywords:

Redundant transmission scheduling; Remote state estimation; Markov decision process; Reinforcement learning; NETWORKED CONTROL-SYSTEMS; NEURAL-NETWORKS; DELAYS;

DOI:

10.1016/j.neucom.2024.127337

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper studies optimal redundant transmission scheduling for remote state estimation. Multiple smart sensors observe some systems and transmit their local state estimates via independent channels to a remote estimator (RE); packets on these channels may be lost with given probabilities. To improve the estimation performance, some redundant channels are adopted for the data transmission. Since the number of redundant channels is fixed, the question is how to allocate them among the sensors. To address this problem, the redundant transmission scheduling is modeled as a Markov decision process (MDP) that minimizes the estimation error over all systems. A sufficient condition is constructed under which the MDP has an optimal deterministic and stationary policy. The threshold structure of the redundant transmission scheduling policy is also verified, which further reduces the computational complexity. Reinforcement learning (RL) is then applied to the problem, and a near-optimal policy is obtained with the dueling double deep Q-network (D3QN) algorithm. Finally, an illustrative simulation is presented to demonstrate the effectiveness of the approach.
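To make the MDP formulation in the abstract concrete, the following is a minimal sketch of a toy instance solved by value iteration, not the paper's model or its D3QN method. Everything specific here is an assumption made for illustration: two sensors sharing one redundant channel, the state taken as each sensor's holding time (steps since the RE last received its estimate, capped at `T`), scalar systems with unit noise, a discounted cost criterion, and all numeric values.

```python
import numpy as np

# All numbers below are illustrative assumptions, not values from the paper.
T = 10                  # cap on the holding time (truncates the state space)
A_GAIN = (1.3, 1.1)     # scalar system gains; sensor 0 watches the less stable system
P_LOSS = (0.4, 0.4)     # packet-loss probability of each sensor's own channel
R_LOSS = 0.3            # packet-loss probability of the single redundant channel
GAMMA = 0.95            # discount factor (assumed criterion for this sketch)


def error_cost(a, t):
    """Error-variance proxy after t consecutive losses: h^t(1) with h(x) = a^2 x + 1."""
    return sum(a ** (2 * k) for k in range(t + 1))


def success_prob(i, action):
    """Arrival probability of sensor i; the redundant channel helps only if action == i."""
    loss = P_LOSS[i] * (R_LOSS if action == i else 1.0)
    return 1.0 - loss


def q_values(V, t1, t2):
    """One-step lookahead Q(s, a) for both allocations of the redundant channel."""
    stage = error_cost(A_GAIN[0], t1) + error_cost(A_GAIN[1], t2)
    n1, n2 = min(t1 + 1, T), min(t2 + 1, T)  # holding times after a loss
    q = []
    for action in (0, 1):
        s1, s2 = success_prob(0, action), success_prob(1, action)
        future = (s1 * s2 * V[0, 0] + s1 * (1 - s2) * V[0, n2]
                  + (1 - s1) * s2 * V[n1, 0] + (1 - s1) * (1 - s2) * V[n1, n2])
        q.append(stage + GAMMA * future)
    return q


def value_iteration(tol=1e-8, max_iter=10_000):
    """Iterate the Bellman operator to a fixed point, then read off the greedy policy."""
    V = np.zeros((T + 1, T + 1))
    for _ in range(max_iter):
        V_new = np.array([[min(q_values(V, t1, t2)) for t2 in range(T + 1)]
                          for t1 in range(T + 1)])
        converged = np.max(np.abs(V_new - V)) < tol
        V = V_new
        if converged:
            break
    policy = np.array([[int(np.argmin(q_values(V, t1, t2))) for t2 in range(T + 1)]
                       for t1 in range(T + 1)])
    return V, policy


V, policy = value_iteration()
print(policy)  # entry [t1, t2]: index of the sensor given the redundant channel
```

Under these assumed parameters, the redundant channel goes to sensor 0 in state `(T, 0)` and to sensor 1 in state `(0, T)`. The printed table can be inspected for a switching curve between the two regions, which is the kind of threshold structure the abstract refers to. A learned method such as D3QN would replace the exact table `V` with a neural approximation when the state space is too large to enumerate.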

References

Pages: 9

42 references in total

[1] Hybrid Automatic Repeat Request (HARQ) in Wireless Communications Systems and Standards: A Contemporary Survey [J]. Ahmed, Ashfaq; Al-Dweik, Arafat; Iraqi, Youssef; Mukhtar, Husameldin; Naeem, Muhammad; Hossain, Ekram. IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2021, 23 (04): 2711-2752

[2] Survey on artificial intelligence based techniques for emerging robotic communication [J]. Alsamhi, S. H.; Ma, Ou; Ansari, Mohd Samar. TELECOMMUNICATION SYSTEMS, 2019, 72 (03): 483-503

[3] Anderson BD., 2012, OPTIMAL FILTERING

[4] Anghel L, 2000, 13TH SYMPOSIUM ON INTEGRATED CIRCUITS AND SYSTEMS DESIGN, PROCEEDINGS, P237, DOI 10.1109/SBCCI.2000.876036

[5] An adaptive retransmit mechanism for delay differentiated services in industrial WSNs [J]. Chen, Ye; Liu, Wei; Wang, Tian; Deng, Qingyong; Liu, Anfeng; Song, Houbing. EURASIP JOURNAL ON WIRELESS COMMUNICATIONS AND NETWORKING, 2019, 2019 (01)

[6] State estimation of Markov jump neural networks with random delays by redundant channels [J]. Chen, Yun; Ren, Jing; Zhao, Xiaodong; Xue, Anke. NEUROCOMPUTING, 2021, 453: 493-501

[7] Cloud J, 2015, IEEE INFOCOM SER

[8] Average optimality for Markov decision processes in Borel spaces: A new condition and approach [J]. Guo, Xianping; Zhu, Quanxin. JOURNAL OF APPLIED PROBABILITY, 2006, 43 (02): 318-334

[9] Hernandez-Lerma O., 1996, DISCRETE TIME MARKOV

[10] ESRRA-IoT: Edge-based spatial redundancy reduction approach for Internet of Things [J]. Ismael, Waleed M.; Gao, Mingsheng; Yemeni, Zaid. INTERNET OF THINGS, 2021, 14
