On-Device Indoor Positioning: A Federated Reinforcement Learning Approach With Heterogeneous Devices

被引：10

作者：

Dou, Fei ^{[1
]}

Lu, Jin ^{[1
]}

Zhu, Tan ^{[2
]}

Bi, Jinbo ^{[2
]}

机构：

[1] Univ Georgia, Sch Comp, Athens, GA 30602 USA

[2] Univ Connecticut, Dept Comp Sci & Engn, Storrs, CT 06269 USA

来源：

IEEE INTERNET OF THINGS JOURNAL | 2024年 / 11卷 / 03期

基金：

美国国家科学基金会;

关键词：

Data models; Location awareness; Task analysis; Servers; Mobile handsets; Training; Federated learning; Deep reinforcement learning (RL); device heterogeneity; federated learning (FL); few-shot learning (FSL); indoor localization; privacy preserving;

D O I：

10.1109/JIOT.2023.3299262

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The widespread deployment of machine learning techniques in ubiquitous computing environments has sparked interests in exploiting the vast amount of data stored on mobile devices. To preserve data privacy, federated learning (FL) has been proposed to learn a shared model by performing distributed training locally on participating devices and aggregating the local models into a global one. Reinforcement learning (RL) can improve indoor localization by accounting for environmental dynamics, but has been trained on centralized data. An FL version of RL can help train a global localization model using data from different user clients whereas keeping data on device without centralization. We propose a personalized federated RL for indoor localization that addresses two major challenges. Due to the limited network connectivity of mobile devices, under the federated computing setting, it is impractical to aggregate updates from all clients in any learning iteration. Data gathered on different devices are heterogeneous, imposing difficulty in training high accuracy models. In our approach, each client performs RL to learn an action policy that can quickly search for a target based on its own data (e.g., personalized) and then a central server communicates with clients only for their model updates and learns a global model that is in the proximity of all client models (e.g., federated). Empirical evaluations demonstrate superior performance of the proposed approach in terms of localization accuracy and steadiness over existing methods. We further extend our approach to few-shot learning that can quickly position a new user with sparse annotated location data.

引用

页码：3909 / 3926

页数：18

共 59 条

[1]

[Anonymous], 2008, P 2008 2 INT C SENS, DOI [10.1109/SENSORCOMM.2008.32, DOI 10.1109/SENSORCOMM.2008.32]

[2]

[Anonymous], 2004, Adv. Neural Inf. Process. Syst

[3]

Arik S., 2018, Adv. Neural Inf. Process. Syst, V31, P1

[4]

Bellver M., 2016, PROC DEEPREINFORCEME, P1

[5]

Bonawitz K, 2019, Arxiv, DOI arXiv:1902.01046

[6] Practical Secure Aggregation for Privacy-Preserving Machine Learning [J].

Bonawitz, Keith ;

Ivanov, Vladimir ;

Kreuter, Ben ;

Marcedone, Antonio ;

McMahan, H. Brendan ;

Patel, Sarvar ;

Ramage, Daniel ;

Segal, Aaron ;

Seth, Karn .

CCS'17: PROCEEDINGS OF THE 2017 ACM SIGSAC CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, 2017, :1175-1191

[7] Active Object Localization with Deep Reinforcement Learning [J].

Caicedo, Juan C. ;

Lazebnik, Svetlana .

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :2488-2496

[8]

Cheng H, 2012, INT C INDOOR POSIT

[9]

Ciftler BS, 2020, INT WIREL COMMUN, P2112, DOI 10.1109/IWCMC48107.2020.9148111

[10]

Deng YY, 2020, Arxiv, DOI [arXiv:2003.13461, DOI 10.48550/ARXIV.2003.13461]

← 1 2 3 4 5 6 →