Edge-Based Federated Deep Reinforcement Learning for IoT Traffic Management

被引：14

作者：

Jarwan, Abdallah ^{[1
]}

Ibnkahla, Mohamed ^{[2
]}

机构：

[1] Lytica Inc, Data Analyt Dept, Kanata, ON K2K 1Y6, Canada

[2] Carleton Univ, Dept Syst & Comp Engn, Ottawa, ON K1S 5B6, Canada

来源：

IEEE INTERNET OF THINGS JOURNAL | 2023年 / 10卷 / 05期

基金：

加拿大自然科学与工程研究理事会;

关键词：

Internet of Things; Quality of service; Performance evaluation; Wireless sensor networks; Training; Reinforcement learning; Delays; Advantage-actor-critic (A2C) methods; backhaul (BH) selection; deep reinforcement learning (DRL); distributed edge learning; federated learning (FL); Internet of Things (IoT); IoT traffic management; INTERNET; THINGS; OPTIMIZATION; 5G;

D O I：

10.1109/JIOT.2022.3174469

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The wide adoption of large-scale Internet of Things (IoT) systems has led to an unprecedented increase in backhaul (BH) traffic congestion, making it critical to optimize traffic management at the network edge. In IoT systems, the BH network is supported by various backhauling technologies that have different characteristics. Also, the characteristics of the BH links can be sometimes time varying and have an unknown state, due to external factors such as having the resources shared with other systems. It is the responsibility of the edge devices to be able to forward IoT traffic through the unknown-state BH network by selecting the suitable BH link for each collected data flow. To the best of our knowledge, this type of BH selection problem is not addressed in the literature. Therefore, there is a crucial need to develop intelligent approaches enabling edge devices to learn how to deal with unknown-state (partially observable) components of the BH network, which is the primary goal of this article. We propose an edge-based BH selection technique for improving traffic delivery by exploiting multiobjective feedback on delivery performance. The proposed approach relies on the advantage-actor-critic deep reinforcement learning (DRL) methods. Moreover, to improve the DRL training performance in large-scale deployments of distributed IoT systems, federated learning (FL) is applied to enable multiple edge devices to collaborate in training a shared BH selection policy. The proposed federated DRL (F-DRL) approach is able to solve the BH selection problem as verified and demonstrated through extensive simulations.

引用

页码：3799 / 3813

页数：15

共 35 条

[1] A Survey of Machine and Deep Learning Methods for Internet of Things (IoT) Security [J].

Al-Garadi, Mohammed Ali ;

Mohamed, Amr ;

Al-Ali, Abdulla Khalid ;

Du, Xiaojiang ;

Ali, Ihsan ;

Guizani, Mohsen .

IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2020, 22 (03) :1646-1685

[2] Deep Reinforcement Learning for Internet of Things: A Comprehensive Survey [J].

Chen, Wuhui ;

Qiu, Xiaoyu ;

Cai, Ting ;

Dai, Hong-Ning ;

Zheng, Zibin ;

Zhang, Yan .

IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2021, 23 (03) :1659-1692

[3] Channel Assignment for Throughput Optimization in Multichannel Multiradio Wireless Mesh Networks Using Network Coding [J].

Chieochan, Surachai ;

Hossain, Ekram .

IEEE TRANSACTIONS ON MOBILE COMPUTING, 2013, 12 (01) :118-135

[4]

Ciuonzo D., 2011, P 14 INT C INF FUS C, P1

[5] Distributed Detection in Wireless Sensor Networks Under Multiplicative Fading via Generalized Score Tests [J].

Ciuonzo, Domenico ;

Rossi, Pierluigi Salvo ;

Varshney, Pramod K. .

IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (11) :9059-9071

[6] Scalable Personalized IoT Networks [J].

El-Mougi, Amr ;

Al-Shiab, Ismael ;

Ibnkahla, Mohamed .

PROCEEDINGS OF THE IEEE, 2019, 107 (04) :695-710

[7] A Survey of Actor-Critic Reinforcement Learning: Standard and Natural Policy Gradients [J].

Grondman, Ivo ;

Busoniu, Lucian ;

Lopes, Gabriel A. D. ;

Babuska, Robert .

IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2012, 42 (06) :1291-1307

[8] Deep Neural Network Initialization With Decision Trees [J].

Humbird, Kelli D. ;

Peterson, J. Luc ;

McClarren, Ryan G. .

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2019, 30 (05) :1286-1295

[9] Soft-GORA: Soft Constrained Globally Optimal Resource Allocation for Critical Links in loT Backhaul Communication [J].

Iqbal, Saleem ;

Abdullah, Abdul Hanan ;

Qureshi, Kashif Naseer ;

Lloret, Jaime .

IEEE ACCESS, 2018, 6 :614-624

[10] Data Transmission Reduction Schemes in WSNs for Efficient IoT Systems [J].

Jarwan, Abdallah ;

Sabbah, Ayman ;

Ibnkahla, Mohamed .

IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2019, 37 (06) :1307-1324

← 1 2 3 4 →