A Dueling Deep Recurrent Q-Network Framework for Dynamic Multichannel Access in Heterogeneous Wireless Networks

被引：0

作者：

Chen, Haitao ^{[1
]}

Zhao, Haitao ^{[1
]}

Zhou, Li ^{[1
]}

Zhang, Jiao ^{[1
]}

Liu, Yan ^{[1
]}

Pan, Xiaoqian ^{[1
]}

Liu, Xingguang ^{[1
]}

Wei, Jibo ^{[1
]}

机构：

[1] Natl Univ Def Technol, Coll Elect Sci & Technol, Changsha 410073, Hunan, Peoples R China

来源：

WIRELESS COMMUNICATIONS & MOBILE COMPUTING | 2022年 / 2022卷

基金：

中国国家自然科学基金;

关键词：

SPECTRUM ACCESS; REINFORCEMENT; ALLOCATION; OPTIMALITY;

D O I：

10.1155/2022/9446418

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper investigates a deep reinforcement learning algorithm based on dueling deep recurrent Q-network (Dueling DRQN) for dynamic multichannel access in heterogeneous wireless networks. Specifically, we consider the scenario that multiple heterogeneous users with different MAC protocols share multiple independent channels. The goal of the intelligent node is to learn a channel access strategy that achieves high throughput by making full use of the underutilized channels. Two key challenges for the intelligent node are (i) there is no prior knowledge of spectrum environment or the other nodes' behaviors; (ii) the spectrum environment is partially observable, and the spectrum states have complex temporal dynamics. In order to overcome the aforementioned challenges, we first embed the long short-term memory layer (LSTM) into the deep Q-network (DQN) to aggregate historical observations and capture the underlying temporal feature in the heterogeneous networks. And second, we employ the dueling architecture to overcome the observability problem of dynamic environment in neural networks. Simulation results show that our approach can learn the optimal access policy in various heterogeneous networks and outperforms the state-of-the-art policies.

引用

页数：14

共 36 条

[1] Badran EF, 2019, CHINA COMMUN, V16, P34, DOI 10.23919/JCC.2019.12.002
[2] Multi-Armed Bandits for Spectrum Allocation in Multi-Agent Channel Bonding WLANs
Barrachina-Munoz, Sergio
Chiumento, Alessandro
Bellalta, Boris
[J]. IEEE ACCESS, 2021, 9 : 133472 - 133490
[3] Callejas-Molina RA, 2015, 2015 INTERNATIONAL CONFERENCE ON COMPUTING SYSTEMS AND TELEMATICS (ICCSAT)
[4] Deep Reinforcement Learning With Bidirectional Recurrent Neural Networks for Dynamic Spectrum Access
Chen, Peng
Quo, Shizeng
Gao, Yulong
[J]. 2021 IEEE 94TH VEHICULAR TECHNOLOGY CONFERENCE (VTC2021-FALL), 2021,
[5] Dhahri C, 2012, IEEE GLOB COMM CONF, P4975, DOI 10.1109/GLOCOM.2012.6503908
[6] Learning to forget: Continual prediction with LSTM
Gers, FA
Schmidhuber, J
Cummins, F
[J]. NEURAL COMPUTATION, 2000, 12 (10) : 2451 - 2471
[7] Janiar S. B., 2021, 2021 IEEE 18 ANN CON, DOI DOI 10.1109/CCNC49032.2021.9369536
[8] Kingma DP, 2014, ADV NEUR IN, V27
[9] Trading-Based Dynamic Spectrum Access and Allocation in Cognitive Internet of Things
Li, Feng
Lam, Kwok-Yan
Meng, Limin
Luo, Hao
Wang, Li
[J]. IEEE ACCESS, 2019, 7 (125952-125959) : 125952 - 125959
[10] Listiyarini M., 2021, 2021 International Symposium on Electronics and Smart Devices (ISESD), P1, DOI 10.1109/WAMICON47156.2021.9444294

← 1 2 3 4 →