Hierarchical Reinforcement Learning Based Resource Allocation for RAN Slicing

被引:4
作者
Anil Akyildiz, Hasan [1 ,2 ]
Faruk Gemici, Omer [2 ]
Hokelek, Ibrahim [1 ,3 ]
Ali Cirpan, Hakan
机构
[1] Istanbul Tech Univ, Elect & Commun Dept, TR-34469 Istanbul, Turkiye
[2] Ericsson, Business Area Networks Engn Unit Cloud RAN CX CE, Ottawa, ON K2K 2V6, Canada
[3] TUBITAK BILGEM, Res Ctr Adv Technol Informat & Informat Secur, TR-41470 Izmit, Turkiye
关键词
Resource management; Ultra reliable low latency communication; Throughput; Delays; Task analysis; Signal to noise ratio; Network slicing; Radio access networks; Reinforcement learning; eMBB; network slicing; radio access networks; reinforcement-learning; resource allocation; URLLC; 5G; MANAGEMENT;
D O I
10.1109/ACCESS.2024.3406949
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As the complexity of wireless mobile networks increases significantly, artificial intelligence (AI) and machine learning (ML) have become key enablers for radio resource management and orchestration. In this paper, we propose a multi-agent reinforcement learning (RL) method for allocating radio resources to mobile users under random traffic arrivals, in which Ultra-Reliable Low-Latency Communications (URLLC) and enhanced Mobile Broad-Band (eMBB) services are jointly considered in the same radio access network (RAN). The proposed system includes hierarchically placed RL agents, where the main-agent residing on the upper hierarchy performs inter-slice resource allocation between the URLLC and eMBB slices. The URLLC and eMBB sub-agents are responsible for the resource allocation within their own slice, where the objective is to maximize the eMBB throughput while satisfying the latency requirements of the URLLC slice. In the RL algorithm, the state space includes the queue occupancy and the channel quality information of mobile users while the action space specifies the resource allocation to the users. For a computationally efficient RL training, the state space is significantly reduced by quantizing the queue occupancy and grouping the users according to their channel qualities. The numerical results for the URLLC show that the proposed RL-based approach provides the average delay results of lower than 1 ms for all experiments while the worst case eMBB throughput degradation is limited to 4%.
引用
收藏
页码:75818 / 75831
页数:14
相关论文
共 23 条
[1]   Flexible Resource Block Allocation to Multiple Slices for Radio Access Network Slicing Using Deep Reinforcement Learning [J].
Abiko, Yu ;
Saito, Takato ;
Ikeda, Daizo ;
Ohta, Ken ;
Mizuno, Tadanori ;
Mineno, Hiroshi .
IEEE ACCESS, 2020, 8 :68183-68198
[2]  
Abiko Y, 2020, 2020 34TH INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING (ICOIN 2020), P420, DOI [10.1109/ICOIN48656.2020.9016577, 10.1109/icoin48656.2020.9016577]
[3]  
Akyildiz HA, 2022, INT BLACK SEA CONF, P202, DOI [10.1109/BlackSeaCom54372.2022.9858135, 10.1109/BLACKSEACOM54372.2022.9858135]
[4]   Model-Based Reinforcement Learning With Kernels for Resource Allocation in RAN Slices [J].
Alcaraz, Juan J. ;
Losilla, Fernando ;
Zanella, Andrea ;
Zorzi, Michele .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2023, 22 (01) :486-501
[5]   Intelligent Resource Slicing for eMBB and URLLC Coexistence in 5G and Beyond: A Deep Reinforcement Learning Based Approach [J].
Alsenwi, Madyan ;
Tran, Nguyen H. ;
Bennis, Mehdi ;
Pandey, Shashi Raj ;
Bairagi, Anupam Kumar ;
Hong, Choong Seon .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2021, 20 (07) :4585-4600
[6]   QoS Guaranteed Network Slicing Orchestration for Internet of Vehicles [J].
Cui, Yaping ;
Huang, Xinyun ;
He, Peng ;
Wu, Dapeng ;
Wang, Ruyan .
IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (16) :15215-15227
[7]  
De Bast S, 2019, IEEE CONF COMPUT, P264, DOI [10.1109/INFCOMW.2019.8845211, 10.1109/infcomw.2019.8845211]
[8]   Reinforcement Learning-based Joint Power and Resource Allocation for URLLC in 5G [J].
Elsayed, Medhat ;
Erol-Kantarci, Melike .
2019 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2019,
[9]  
Elsayed M, 2019, 2019 IEEE 2ND 5G WORLD FORUM (5GWF), P590, DOI [10.1109/5GWF.2019.8911618, 10.1109/5gwf.2019.8911618]
[10]   Modeling Queuing Delay of 5G NR With NOMA Under SINR Outage Constraint [J].
Gemici, Omer Faruk ;
Hokelek, Ibrahim ;
Cirpan, Hakan Ali .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2021, 70 (03) :2389-2403