DeepRAT: A DRL-Based Framework for Multi-RAT Assignment and Power Allocation in HetNets

被引:11
作者
Alwarafy, Abdulmalik [1 ]
Ciftler, Bekir Sait [1 ]
Abdallah, Mohamed [1 ]
Hamdi, Mounir [1 ]
机构
[1] Hamad Bin Khalifa Univ, Div Informat & Comp Technol, Coll Sci & Engn, Doha, Qatar
来源
2021 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS (ICC WORKSHOPS) | 2021年
关键词
DRL; DQN; DDPG; Resource Allocation; Multi-RAT; Heterogenous Networks;
D O I
10.1109/ICCWorkshops50388.2021.9473488
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Wireless heterogeneous networks (HetNets), where several systems with multi-radio access technologies (multi-RATs) coexist for massive multi-connectivity networks, are in service nowadays and are expected to be a main enabling technology in the future wireless networks. The extensive heterogeneity of such networks in terms of RATs architectures and their supported applications poses extra challenges on efficiently managing networks' communication resources, such as power and spectrum. In this paper, we propose a non-cooperative multi-agent deep reinforcement learning (DRL)-based framework, called DeepRAT, to address the problem of joint multi-RAT assignment and dynamic power allocation in the downlink of multi-connectivity edge devices (EDs) in future HetNets. In particular, the problem is formulated as a partially observable Markov decision process (POMDP), and the DeepRAT algorithm solves this computationally-expensive problem hierarchically by decomposing it into two stages; a multi-RAT assignment stage and a power allocation stage. The first stage utilizes the Deep Q-Network (DQN) algorithm to learn the optimal policy for RAT assignment of EDs. The second stage employs the Deep Deterministic Policy Gradient (DDPG) algorithm to solve the power allocation problem for RATs' assigned EDs. Simulation results show that the DeepRAT algorithm's performance is about 98.1% and 95.6% compared to the state-of-the-art methods that assume perfect information of the HetNet dynamics.
引用
收藏
页数:6
相关论文
共 17 条
[1]  
Khan AA, 2020, Arxiv, DOI arXiv:2009.03033
[2]   An Autonomous Learning-Based Algorithm for Joint Channel and Power Level Selection by D2D Pairs in Heterogeneous Cellular Networks [J].
Asheralieva, Alia ;
Miyanaga, Yoshikazu .
IEEE TRANSACTIONS ON COMMUNICATIONS, 2016, 64 (09) :3996-4012
[3]  
Bi Z., 2020, P IEEE 91 VEH TECHN, P1
[4]   Joint Resource Allocation for Parallel Multi-Radio Access in Heterogeneous Wireless Networks [J].
Choi, Yonghoon ;
Kim, Hoon ;
Han, Sang-wook ;
Han, Youngnam .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2010, 9 (11) :3324-3329
[5]  
Diamond S, 2016, J MACH LEARN RES, V17
[6]   A deep reinforcement learning for user association and power control in heterogeneous networks [J].
Ding, Hui ;
Zhao, Feng ;
Tian, Jie ;
Li, Dongyang ;
Zhang, Haixia .
AD HOC NETWORKS, 2020, 102
[7]   Machine Learning for Resource Management in Cellular and IoT Networks: Potentials, Current Solutions, and Open Challenges [J].
Hussain, Fatima ;
Hassan, Syed Ali ;
Hussain, Rasheed ;
Hossain, Ekram .
IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2020, 22 (02) :1251-1275
[8]   Q-Learning Based Two-Timescale Power Allocation for Multi-Homing Hybrid RF/VLC Networks [J].
Kong, Justin ;
Wu, Zi-Yang ;
Ismail, Muhammad ;
Serpedin, Erchin ;
Qaraqe, Khalid A. .
IEEE WIRELESS COMMUNICATIONS LETTERS, 2020, 9 (04) :443-447
[9]  
Lillicrap T. P., 2015, arXiv
[10]  
Liu Q, 2020, Arxiv, DOI arXiv:2008.07614