Distributed localization for IoT with multi-agent reinforcement learning

被引：6

作者：

Jia, Jie ^{[1
,2
]}

Yu, Ruoying ^{[1
]}

Du, Zhenjun ^{[3
]}

Chen, Jian ^{[1
]}

Wang, Qinghu ^{[1
,2
]}

Wang, Xingwei ^{[1
,2
]}

机构：

[1] Northeastern Univ, Sch Comp Sci & Engn, Shenyang 110819, Peoples R China

[2] Minist Educ, Engn Res Ctr Secur Technol Complex Network Syst, Shenyang 110819, Peoples R China

[3] SIASUN Robot & Automat CO Ltd, Shenyang, Peoples R China

来源：

NEURAL COMPUTING & APPLICATIONS | 2022年 / 34卷 / 09期

基金：

中国国家自然科学基金;

关键词：

Distributed localization; Q-learning; Internet of things (IoT); Multi-agent reinforcement learning; PERIODIC-SOLUTION; WIRELESS; ALGORITHM;

D O I：

10.1007/s00521-021-06855-1

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Localization has become one of the important techniques for Internet of Things (IoT). However, most existing localization methods need a central controller and operate on an off-line manner, which cannot satisfy the requirements of real-time IoT applications. In order to address this issue, a novel distributed localization scheme based on multi-agent reinforcement learning (MARL) is proposed. The localization problem is first reformulated as a stochastic game for maximizing the sum of the negative localization error. Each non-anchor node is then modeled as an intelligent agent, where its action space corresponds to possible locations. After that, we invoke a MARL framework on the basis of conventional Q-learning framework to learn the optimal policy, and to maximize the long-term expected reward. The novel strategy is also proposed to reduce the localization error. Extensive simulations demonstrate that the proposed localization method is superior to game theoretic-based distributed localization algorithm and virtual force-based distributed localization algorithm in terms of both localization accuracy and convergence speed, and is suitable for on-line localization scenarios.

引用

页码：7227 / 7240

页数：14

共 42 条

[1] Abed-Alguni Bilal H., 2016, International Journal of Artificial Intelligence, V14, P71
[2] Abed-Alguni B. H., 2018, INT J ARTIF INTELL, V16, P41
[3] Abed-Alguni BH., 2017, ARAB J SCI ENG, V1, P1
[4] Abed-alguni BH., 2015, Vietnam J. Comp. Sci, V2, P213, DOI [10.1007/s40595-015-0045-x, DOI 10.1007/S40595-015-0045-X]
[5] Reduction the secular solution to periodic solution in the generalized restricted three-body problem
Abouelmagd, Elbaz I.
Awad, M. E.
Elzayat, E. M. A.
Abbas, Ibrahim A.
[J]. ASTROPHYSICS AND SPACE SCIENCE, 2014, 350 (02) : 495 - 505
[6] Internet of Things: A Survey on Enabling Technologies, Protocols, and Applications
Al-Fuqaha, Ala
Guizani, Mohsen
Mohammadi, Mehdi
Aledhari, Mohammed
Ayyash, Moussa
[J]. IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2015, 17 (04): : 2347 - 2376
[7] An Autonomous Learning-Based Algorithm for Joint Channel and Power Level Selection by D2D Pairs in Heterogeneous Cellular Networks
Asheralieva, Alia
Miyanaga, Yoshikazu
[J]. IEEE TRANSACTIONS ON COMMUNICATIONS, 2016, 64 (09) : 3996 - 4012
[8] Finite-region asynchronous H∞ control for 2D Markov jump systems
Cheng, Peng
He, Shuping
Luan, Xiaoli
Liu, Fei
[J]. AUTOMATICA, 2021, 129
[9] A Security Localization Algorithm Based on DV-Hop Against Sybil Attack in Wireless Sensor Networks
Dong, Shi
Zhang, Xin-gang
Zhou, Wen-gang
[J]. JOURNAL OF ELECTRICAL ENGINEERING & TECHNOLOGY, 2020, 15 (02) : 919 - 926
[10] Existence and asymptotic behavior results of periodic solution for discrete-time neutral-type neural networks
Du, Bo
Liu, Yurong
Abbas, Ibrahim Atiatallah
[J]. JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2016, 353 (02): : 448 - 461

← 1 2 3 4 5 →