Dynamic SDN-Based Radio Access Network Slicing With Deep Reinforcement Learning for URLLC and eMBB Services

被引:61
作者
Filali, Abderrahime [1 ]
Mlika, Zoubeir [1 ]
Cherkaoui, Soumaya [1 ,2 ]
Kobbane, Abdellatif [3 ]
机构
[1] Univ Sherbrooke, INTERLAB Res Lab, Dept Elect & Comp Sci Engn, Sherbrooke, PQ J1K 2R1, Canada
[2] Polytech Montreal, Dept Comp & Software Engn, Montreal, PQ H3T 1J4, Canada
[3] Univ Mohammed V Rabat, ENSIAS, Rabat 10500, Morocco
来源
IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING | 2022年 / 9卷 / 04期
关键词
Resource management; Ultra reliable low latency communication; Quality of service; 5G mobile communication; Optimization; Radio access networks; Heuristic algorithms; Network slicing; software defined networking; URLLC; eMBB; deep reinforcement learning; 5G; INTERNET;
D O I
10.1109/TNSE.2022.3157274
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Radio access network (RAN) slicing is a key technology that enables 5G network to support heterogeneous requirements of generic services, namely ultra-reliable low-latency communication (URLLC) and enhanced mobile broadband (eMBB). In this paper, we propose a two time-scales RAN slicing mechanism to optimize the performance of URLLC and eMBB services. In a large time-scale, an SDN controller allocates radio resources to gNodeBs according to the requirements of the eMBB and URLLC services. In a short time-scale, each gNodeB allocates its available resources to its end-users and requests, if needed, additional resources from adjacent gNodeBs. We formulate this problem as a non-linear binary program and prove its NP-hardness. Next, for each time-scale, we model the problem as a Markov decision process (MDP), where the large-time scale is modeled as a single agent MDP whereas the shorter time-scale is modeled as a multi-agent MDP. We leverage the exponential-weight algorithm for exploration and exploitation (EXP3) to solve the single-agent MDP of the large time-scale MDP and the multi-agent deep Q-learning (DQL) algorithm to solve the multi-agent MDP of the short time-scale resource allocation. Extensive simulations show that our approach is efficient under different network parameters configuration and it outperforms recent benchmark solutions.
引用
收藏
页码:2174 / 2187
页数:14
相关论文
共 39 条
[1]  
[Anonymous], 2019, TS382115GNR 3GPP
[2]  
Auer P, 2003, SIAM J COMPUT, V32, P48, DOI 10.1137/S0097539701398375
[3]   Multi-Tenant Cross-Slice Resource Orchestration: A Deep Reinforcement Learning Approach [J].
Chen, Xianfu ;
Zhao, Zhifeng ;
Wu, Celimuge ;
Bennis, Mehdi ;
Liu, Hang ;
Ji, Yusheng ;
Zhang, Honggang .
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2019, 37 (10) :2377-2392
[4]  
Cherkaoui S., 2021, IEEE NETWORK, V35, P4
[5]   ARTIFICIAL INTELLIGENCE EMPOWERED EDGE COMPUTING AND CACHING FOR INTERNET OF VEHICLES [J].
Dai, Yueyue ;
Xu, Du ;
Maharjan, Sabita ;
Qiao, Guanhua ;
Zhang, Yan .
IEEE WIRELESS COMMUNICATIONS, 2019, 26 (03) :12-18
[6]   5G RAN Slicing for Verticals: Enablers and Challenges [J].
Elayoubi, Salah Eddine ;
Ben Jemaa, Sana ;
Altman, Zwi ;
Galindo-Serrano, Ana .
IEEE COMMUNICATIONS MAGAZINE, 2019, 57 (01) :28-34
[7]  
Filali A., 2022, ARXIV220206439
[8]   Preemptive SDN Load Balancing With Machine Learning for Delay Sensitive Applications [J].
Filali, Abderrahime ;
Mlika, Zoubeir ;
Cherkaoui, Soumaya ;
Kobbane, Abdellatif .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (12) :15947-15963
[9]   Multi-Access Edge Computing: A Survey [J].
Filali, Abderrahime ;
Abouaomar, Amine ;
Cherkaoui, Soumaya ;
Kobbane, Abdellatif ;
Guizani, Mohsen .
IEEE ACCESS, 2020, 8 :197017-197046
[10]   Energy-Efficient Scheduling for Real-Time Systems Based on Deep Q-Learning Mode [J].
Zhang, Qingchen ;
Lin, Man ;
Yang, Laurence T. ;
Chen, Zhikui ;
Li, Peng .
IEEE TRANSACTIONS ON SUSTAINABLE COMPUTING, 2019, 4 (01) :132-141