Hierarchical Multi-Agent DRL-Based Framework for Joint Multi-RAT Assignment and Dynamic Resource Allocation in Next-Generation HetNets

被引：20

作者：

Alwarafy, Abdulmalik ^{[1
]}

Ciftler, Bekir Sait ^{[1
]}

Abdallah, Mohamed ^{[1
]}

Hamdi, Mounir ^{[1
]}

Al-Dhahir, Naofal ^{[2
]}

机构：

[1] Hamad Bin Khalifa Univ, Coll Sci & Engn, Div Informat & Comp Technol, Doha 34110, Qatar

[2] Univ Texas Dallas, Erik Jonsson Sch Engn & Comp Sci, Elect & Comp Engn Dept, Richardson, TX 75080 USA

来源：

IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING | 2022年 / 9卷 / 04期

关键词：

Deep reinforcement learning; deep Q network; deep deterministic policy gradient; heterogeneous networks; multi-RAT assignment; power allocation; resource allocation; USER ASSOCIATION; REINFORCEMENT; NETWORKS;

D O I：

10.1109/TNSE.2022.3164648

中图分类号：

T [工业技术];

学科分类号：

08 ;

摘要：

This article considers the problem of cost-aware downlink sum-rate maximization via joint optimal radio access technologies (RATs) assignment and power allocation in next-generation heterogeneous wireless networks (HetNets). We consider a future HetNet comprised of multi-RATs and serving multi-connectivity edge devices (EDs), and we formulate the problem as a mixed-integer non-linear programming (MINP) problem. Due to the high complexity and combinatorial nature of this problem and the difficulty to solve it using conventional methods, we propose a hierarchical multi-agent deep reinforcement learning (DRL)-based framework, called DeepRAT, to solve it efficiently and learn system dynamics. In particular, the DeepRAT framework decomposes the problem into two main stages; the RATs-EDs assignment stage, which implements a single-agent Deep Q Network (DQN) algorithm, and the power allocation stage, which utilizes a multi-agent Deep Deterministic Policy Gradient (DDPG) algorithm. Using simulations, we demonstrate how the various DRL agents efficiently interact to learn system dynamics and derive the global optimal policy. Furthermore, our simulation results show that the proposed DeepRAT algorithm outperforms existing state-of-the-art heuristic approaches in terms of network utility. Finally, we quantitatively show the ability of the DeepRAT model to quickly and dynamically adapt to abrupt changes in network dynamics, such as EDs' mobility.

引用

页码：2481 / 2494

页数：14

共 33 条

[1]

Abdelali A., 2021, P 6 ARABIC NATURAL L, P1

[2]

Alwarafy A., 2021, ARXIV210600574

[3] DeepRAT: A DRL-Based Framework for Multi-RAT Assignment and Power Allocation in HetNets [J].