Hierarchical Multi-Agent DRL-Based Framework for Joint Multi-RAT Assignment and Dynamic Resource Allocation in Next-Generation HetNets

被引:20
作者
Alwarafy, Abdulmalik [1 ]
Ciftler, Bekir Sait [1 ]
Abdallah, Mohamed [1 ]
Hamdi, Mounir [1 ]
Al-Dhahir, Naofal [2 ]
机构
[1] Hamad Bin Khalifa Univ, Coll Sci & Engn, Div Informat & Comp Technol, Doha 34110, Qatar
[2] Univ Texas Dallas, Erik Jonsson Sch Engn & Comp Sci, Elect & Comp Engn Dept, Richardson, TX 75080 USA
来源
IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING | 2022年 / 9卷 / 04期
关键词
Deep reinforcement learning; deep Q network; deep deterministic policy gradient; heterogeneous networks; multi-RAT assignment; power allocation; resource allocation; USER ASSOCIATION; REINFORCEMENT; NETWORKS;
D O I
10.1109/TNSE.2022.3164648
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
This article considers the problem of cost-aware downlink sum-rate maximization via joint optimal radio access technologies (RATs) assignment and power allocation in next-generation heterogeneous wireless networks (HetNets). We consider a future HetNet comprised of multi-RATs and serving multi-connectivity edge devices (EDs), and we formulate the problem as a mixed-integer non-linear programming (MINP) problem. Due to the high complexity and combinatorial nature of this problem and the difficulty to solve it using conventional methods, we propose a hierarchical multi-agent deep reinforcement learning (DRL)-based framework, called DeepRAT, to solve it efficiently and learn system dynamics. In particular, the DeepRAT framework decomposes the problem into two main stages; the RATs-EDs assignment stage, which implements a single-agent Deep Q Network (DQN) algorithm, and the power allocation stage, which utilizes a multi-agent Deep Deterministic Policy Gradient (DDPG) algorithm. Using simulations, we demonstrate how the various DRL agents efficiently interact to learn system dynamics and derive the global optimal policy. Furthermore, our simulation results show that the proposed DeepRAT algorithm outperforms existing state-of-the-art heuristic approaches in terms of network utility. Finally, we quantitatively show the ability of the DeepRAT model to quickly and dynamically adapt to abrupt changes in network dynamics, such as EDs' mobility.
引用
收藏
页码:2481 / 2494
页数:14
相关论文
共 33 条
[1]  
Abdelali A., 2021, P 6 ARABIC NATURAL L, P1
[2]  
Alwarafy A., 2021, ARXIV210600574
[3]   DeepRAT: A DRL-Based Framework for Multi-RAT Assignment and Power Allocation in HetNets [J].
Alwarafy, Abdulmalik ;
Ciftler, Bekir Sait ;
Abdallah, Mohamed ;
Hamdi, Mounir .
2021 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS (ICC WORKSHOPS), 2021,
[4]  
[Anonymous], 2012, P IEEE 75 VEH TECHN
[5]  
[Anonymous], 2010, 802212008 IEEE, DOI DOI 10.1109/IEEESTD.2009.5375704
[6]  
Bi Z., 2020, 2020 IEEE 91 VEHICUL, P1
[7]  
Boyd S., 2004, CONVEX OPTIMIZATION
[8]   Joint User Association and Resource Allocation in the Downlink of Heterogeneous Networks [J].
Chen, Youjia ;
Li, Jun ;
Chen, Wen ;
Lin, Zihuai ;
Vucetic, Branka .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2016, 65 (07) :5701-5706
[9]   Deep Reinforcement Learning for Network Selection Over Heterogeneous Health Systems [J].
Chkirbene, Zina ;
Abdellatif, Alaa Awad ;
Mohamed, Amr ;
Erbad, Aiman ;
Guizani, Mohsen .
IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2022, 9 (01) :258-270
[10]   Joint Resource Allocation for Parallel Multi-Radio Access in Heterogeneous Wireless Networks [J].
Choi, Yonghoon ;
Kim, Hoon ;
Han, Sang-wook ;
Han, Youngnam .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2010, 9 (11) :3324-3329