Distributed localization for IoT with multi-agent reinforcement learning

被引:6
作者
Jia, Jie [1 ,2 ]
Yu, Ruoying [1 ]
Du, Zhenjun [3 ]
Chen, Jian [1 ]
Wang, Qinghu [1 ,2 ]
Wang, Xingwei [1 ,2 ]
机构
[1] Northeastern Univ, Sch Comp Sci & Engn, Shenyang 110819, Peoples R China
[2] Minist Educ, Engn Res Ctr Secur Technol Complex Network Syst, Shenyang 110819, Peoples R China
[3] SIASUN Robot & Automat CO Ltd, Shenyang, Peoples R China
基金
中国国家自然科学基金;
关键词
Distributed localization; Q-learning; Internet of things (IoT); Multi-agent reinforcement learning; PERIODIC-SOLUTION; WIRELESS; ALGORITHM;
D O I
10.1007/s00521-021-06855-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Localization has become one of the important techniques for Internet of Things (IoT). However, most existing localization methods need a central controller and operate on an off-line manner, which cannot satisfy the requirements of real-time IoT applications. In order to address this issue, a novel distributed localization scheme based on multi-agent reinforcement learning (MARL) is proposed. The localization problem is first reformulated as a stochastic game for maximizing the sum of the negative localization error. Each non-anchor node is then modeled as an intelligent agent, where its action space corresponds to possible locations. After that, we invoke a MARL framework on the basis of conventional Q-learning framework to learn the optimal policy, and to maximize the long-term expected reward. The novel strategy is also proposed to reduce the localization error. Extensive simulations demonstrate that the proposed localization method is superior to game theoretic-based distributed localization algorithm and virtual force-based distributed localization algorithm in terms of both localization accuracy and convergence speed, and is suitable for on-line localization scenarios.
引用
收藏
页码:7227 / 7240
页数:14
相关论文
共 42 条
  • [1] Abed-Alguni Bilal H., 2016, International Journal of Artificial Intelligence, V14, P71
  • [2] Abed-Alguni B. H., 2018, INT J ARTIF INTELL, V16, P41
  • [3] Abed-Alguni BH., 2017, ARAB J SCI ENG, V1, P1
  • [4] Abed-alguni BH., 2015, Vietnam J. Comp. Sci, V2, P213, DOI [10.1007/s40595-015-0045-x, DOI 10.1007/S40595-015-0045-X]
  • [5] Reduction the secular solution to periodic solution in the generalized restricted three-body problem
    Abouelmagd, Elbaz I.
    Awad, M. E.
    Elzayat, E. M. A.
    Abbas, Ibrahim A.
    [J]. ASTROPHYSICS AND SPACE SCIENCE, 2014, 350 (02) : 495 - 505
  • [6] Internet of Things: A Survey on Enabling Technologies, Protocols, and Applications
    Al-Fuqaha, Ala
    Guizani, Mohsen
    Mohammadi, Mehdi
    Aledhari, Mohammed
    Ayyash, Moussa
    [J]. IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2015, 17 (04): : 2347 - 2376
  • [7] An Autonomous Learning-Based Algorithm for Joint Channel and Power Level Selection by D2D Pairs in Heterogeneous Cellular Networks
    Asheralieva, Alia
    Miyanaga, Yoshikazu
    [J]. IEEE TRANSACTIONS ON COMMUNICATIONS, 2016, 64 (09) : 3996 - 4012
  • [8] Finite-region asynchronous H∞ control for 2D Markov jump systems
    Cheng, Peng
    He, Shuping
    Luan, Xiaoli
    Liu, Fei
    [J]. AUTOMATICA, 2021, 129
  • [9] A Security Localization Algorithm Based on DV-Hop Against Sybil Attack in Wireless Sensor Networks
    Dong, Shi
    Zhang, Xin-gang
    Zhou, Wen-gang
    [J]. JOURNAL OF ELECTRICAL ENGINEERING & TECHNOLOGY, 2020, 15 (02) : 919 - 926
  • [10] Existence and asymptotic behavior results of periodic solution for discrete-time neutral-type neural networks
    Du, Bo
    Liu, Yurong
    Abbas, Ibrahim Atiatallah
    [J]. JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2016, 353 (02): : 448 - 461