Deep Reinforcement Learning for Dynamic Spectrum Access in the Multi-Channel Wireless Local Area Networks

被引:3
作者
Bhandari, Sovit [1 ]
Ranjan, Navin [1 ]
Kim, Yeong-Chan [1 ]
Kim, Hoon [1 ]
机构
[1] Incheon Natl Univ, IoT & Big Data Res Ctr, Dept Elect Engn, Incheon, South Korea
来源
2022 INTERNATIONAL CONFERENCE ON ELECTRONICS, INFORMATION, AND COMMUNICATION (ICEIC) | 2022年
基金
新加坡国家研究基金会;
关键词
WLANs; multi-channel; DSA; reinforcement learning; DQN; OPTIMALITY;
D O I
10.1109/ICEIC54506.2022.9748733
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In recent years, the rapid proliferation of wireless local area networks (WLANs) has led to a scarcity of radio spectrum. Dynamic Spectrum Access (DSA) is considered a promising technology to address the increasing shortage of radio spectrum and improve its utilization. DSA technique effectively utilizes the radio spectrum by switching between different networks. However, most conventional DSA techniques do not consider the correlation between multiple channels and require network information in advance to make decisions. Due to recent advances in reinforcement learning, a deep Q-network (DQN) based method is proposed in this paper to solve the problem of correlated multi-channel DSA with unknown system dynamics. The performance of the DQN-based method is quantified based on the successful packet transmission, packet collisions, and channel utilization.
引用
收藏
页数:4
相关论文
共 13 条
[1]   Optimality of Myopic Sensing in Multichannel Opportunistic Access [J].
Ahmad, Sahand Haji Ali ;
Liu, Mingyan ;
Javidi, Tara ;
Zhao, Qing ;
Krishnamachari, Bhaskar .
IEEE TRANSACTIONS ON INFORMATION THEORY, 2009, 55 (09) :4040-4050
[2]   Next generation IEEE 802.11 Wireless Local Area Networks: Current status, future directions and open challenges [J].
Bellalta, Boris ;
Bononi, Luciano ;
Bruno, Raffaele ;
Kassler, Andreas .
COMPUTER COMMUNICATIONS, 2016, 75 :1-25
[3]   Optimal Cache Resource Allocation Based on Deep Neural Networks for Fog Radio Access Networks [J].
Bhandari, Sovit ;
Kim, Hoon ;
Ranjan, Navin ;
Zhao, Hong Ping ;
Khan, Pervez .
JOURNAL OF INTERNET TECHNOLOGY, 2020, 21 (04) :967-975
[4]   Deep Reinforcement Learning for Dynamic Spectrum Sensing and Aggregation in Multi-Channel Wireless Networks [J].
Li, Yunzeng ;
Zhang, Wensheng ;
Wang, Cheng-Xiang ;
Sun, Jian ;
Liu, Yu .
IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2020, 6 (02) :464-475
[5]   Indexability of Restless Bandit Problems and Optimality of Whittle Index for Dynamic Multichannel Access [J].
Liu, Keqin ;
Zhao, Qing .
IEEE TRANSACTIONS ON INFORMATION THEORY, 2010, 56 (11) :5547-5567
[6]  
Mnih V, 2013, Arxiv, DOI arXiv:1312.5602
[7]   Deep Multi-User Reinforcement Learning for Distributed Dynamic Spectrum Access [J].
Naparstek, Oshri ;
Cohen, Kobi .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2019, 18 (01) :310-323
[8]   A Review on Deep Learning-Based Approaches for Automatic Sonar Target Recognition [J].
Neupane, Dhiraj ;
Seok, Jongwon .
ELECTRONICS, 2020, 9 (11) :1-30
[9]   Large-Scale Road Network Congestion Pattern Analysis and Prediction Using Deep Convolutional Autoencoder [J].
Ranjan, Navin ;
Bhandari, Sovit ;
Khan, Pervez ;
Hong, Youn-Sik ;
Kim, Hoon .
SUSTAINABILITY, 2021, 13 (09)
[10]   On Optimality of Myopic Policy for Restless Multi-Armed Bandit Problem: An Axiomatic Approach [J].
Wang, Kehao ;
Chen, Lin .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2012, 60 (01) :300-309