A Bisection Reinforcement Learning Approach to 3-D Indoor Localization

被引:27
作者
Dou, Fei [1 ]
Lu, Jin [2 ]
Xu, Tingyang [3 ]
Huang, Chun-Hsi [4 ]
Bi, Jinbo [1 ]
机构
[1] Univ Connecticut, Dept Comp Sci & Engn, Storrs, CT 06269 USA
[2] Univ Michigan, Dept Comp & Informat Sci, Dearborn, MI 48128 USA
[3] Tencent Inc, Tencent AI Lab, Shenzhen 518057, Peoples R China
[4] Southern Illinois Univ, Sch Comp, Carbondale, IL 62901 USA
基金
美国国家科学基金会;
关键词
Deep Q-network (DQN); deep reinforcement learning (DRL); dynamic environment; indoor localization; Internet of Things (IoT); multifloor; received signal strength indicator (RSSI); Wi-Fi fingerprint; NEURAL-NETWORKS;
D O I
10.1109/JIOT.2020.3041204
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The demand for indoor localization services in the Internet of Things (IoT) has been increasing dramatically during the last decade. Many indoor localization systems adopt Wi-Fi fingerprinting with received signal strength indicators (RSSIs) as a source of sensors to localize an object because it is cost effective and can give high accuracy. However, the fluctuation of wireless signals resulting from environmental uncertainties leads to considerable variations in RSSIs, which poses a challenge to accurate localization on a single floor, not to mention multifloor or even 3-D localization. Most existing multifloor methods employ a sequential approach where a different algorithm is tailored for each step in the sequence to determine the floor and then the location of an object. In this article, we formulate the indoor localization problem as a Markov decision process rather than a typical classification or regression problem. A deep reinforcement learning method is used to bisect the search space in a hierarchy from the entire building down to a prespecified distance scale to the object position. This approach significantly reduces the time complexity of the searching from O(N-3) to O(logN), where N indicates the localization resolution. The proposed method tackles environmental dynamics with Wi-Fi fingerprinting for 3-D continuous space. The experimental results demonstrate the high accuracy, efficiency, and robustness of the proposed approach.
引用
收藏
页码:6519 / 6535
页数:17
相关论文
共 44 条
[1]  
[Anonymous], 2016, P DEEP REINF LEARN W
[2]  
Anschel O, 2017, 34 INT C MACHINE LEA, V70
[3]   The Internet of Things: A survey [J].
Atzori, Luigi ;
Iera, Antonio ;
Morabito, Giacomo .
COMPUTER NETWORKS, 2010, 54 (15) :2787-2805
[4]   Active Object Localization with Deep Reinforcement Learning [J].
Caicedo, Juan C. ;
Lazebnik, Svetlana .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :2488-2496
[5]  
Cheng Y., 2018, ROBUST NEURAL MACHIN
[6]   Efficient kNN classification algorithm for big data [J].
Deng, Zhenyun ;
Zhu, Xiaoshu ;
Cheng, Debo ;
Zong, Ming ;
Zhang, Shichao .
NEUROCOMPUTING, 2016, 195 :143-148
[7]   Top-Down Indoor Localization with Wi-Fi Fingerprints using Deep Q-Network [J].
Dou, Fei ;
Lu, Jin ;
Wang, Zigeng ;
Xiao, Xia ;
Bi, Jinbo ;
Huang, Chun-Hsi .
2018 IEEE 15TH INTERNATIONAL CONFERENCE ON MOBILE AD HOC AND SENSOR SYSTEMS (MASS), 2018, :166-174
[8]  
Finn Chelsea, 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA), P2786, DOI 10.1109/ICRA.2017.7989324
[9]  
Fortunato Meire, 2018, PROC INT C LEARN REP
[10]   Wi-Fi Fingerprint-Based Indoor Positioning: Recent Advances and Comparisons [J].
He, Suining ;
Chan, S. -H. Gary .
IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2016, 18 (01) :466-490