Deep Reinforcement Learning for Optimization of RAN Slicing Relying on Control- and User-Plane Separation

被引:1
作者
Tu, Haiyan [1 ]
Zhao, Liqiang [1 ,2 ]
Zhang, Yaoyuan [3 ]
Zheng, Gan [4 ]
Feng, Chen [5 ]
Song, Shenghui [6 ]
Liang, Kai [1 ]
机构
[1] Xidian Univ, State Key Lab Integrated Serv Networks, Xian 710071, Peoples R China
[2] Xidian Univ, Guangzhou Inst Technol, Guangzhou 510100, Peoples R China
[3] Hebei Univ, Coll Elect Informat Engn, Baoding 071002, Peoples R China
[4] Univ Warwick, Sch Engn, Coventry CV4 7AL, England
[5] Univ British Columbia, Sch Engn, Kelowna, BC V1V 1V7, Canada
[6] Hong Kong Univ Sci & Technol, Dept Elect & Comp Engn, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Optimization; Resource management; Radio access networks; Base stations; Reinforcement learning; Network slicing; Deep learning; Asynchronous advantage actor-critic (A3C); control- and user-plane separation (CUPS); Lyapunov optimization; radio access network (RAN) slicing; NETWORK; EFFICIENT; ORCHESTRATION; PERFORMANCE; INTERNET; EMBB; MEC;
D O I
10.1109/JIOT.2023.3320434
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The rapid development of radio access network (RAN) slicing and control- and user-plane separation (CUPS) has created a new paradigm for future networks, namely, CUPS-based RAN slicing. In this article, we formulate the utility optimization problems of the CUPS-based RAN slicing system and propose a Lyapunov-based deep reinforcement learning (L-DRL) framework to solve them. Specifically, we propose that the control plane (CP) and user plane (UP) slices should control their respective power and subcarrier resources. First, we provide coverage-driven slices in the CP for coverage control and data-driven slices in the UP for diverse user requests, where we consider the influence of coverage-driven slices on data-driven slices. Second, we define the system's utilities as income minus cost, and we formulate the utility maximization problem of the UP as a mixed-integer nonlinear programming (MINLP) problem, which is NP-hard because it considers both continuous actions (densities deployment and power allocation) and discrete action (subcarrier allocation). Furthermore, we design an alternating optimization method for the CP and UP based on the densities of deployment. Finally, we develop a novel L-DRL framework for mixed-action optimization problems and propose a specific Lyapunov-based asynchronous advantage actor-critic (L-A3C) algorithm. Simulation results demonstrate that our proposed Lyapunov-based A3C (L-A3C) algorithm outperforms the standard A3C algorithm in terms of the convergence while achieving higher performance than Lyapunov optimization. Moreover, our proposed CUPS-based RAN slicing scheme surpasses the benchmark RAN slicing schemes in terms of the achievable rate and delay.
引用
收藏
页码:8485 / 8498
页数:14
相关论文
共 50 条
[1]   Network Slicing and Softwarization: A Survey on Principles, Enabling Technologies, and Solutions [J].
Afolabi, Ibrahim ;
Taleb, Tarik ;
Samdanis, Konstantinos ;
Ksentini, Adlen ;
Flinck, Hannu .
IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2018, 20 (03) :2429-2453
[2]   DRL-Based Joint RAT Association, Power and Bandwidth Optimization for Future HetNets [J].
Alwarafy, Abdulmalik ;
Ciftler, Bekir Sait ;
Abdallah, Mohamed ;
Hamdi, Mounir ;
Al-Dhahir, Naofal .
IEEE WIRELESS COMMUNICATIONS LETTERS, 2022, 11 (07) :1503-1507
[3]  
[Anonymous], 1992, Data Networks
[4]   Deep Reinforcement Learning A brief survey [J].
Arulkumaran, Kai ;
Deisenroth, Marc Peter ;
Brundage, Miles ;
Bharath, Anil Anthony .
IEEE SIGNAL PROCESSING MAGAZINE, 2017, 34 (06) :26-38
[5]   Joint Optimization of Service Caching Placement and Computation Offloading in Mobile Edge Computing Systems [J].
Bi, Suzhi ;
Huang, Liang ;
Zhang, Ying-Jun Angela .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2020, 19 (07) :4947-4963
[6]  
Boyd S., 2004, Convex Optimization
[7]   Network Slicing Games: Enabling Customization in Multi-Tenant Mobile Networks [J].
Caballero, Pablo ;
Banchs, Albert ;
De Veciana, Gustavo ;
Costa-Perez, Xavier .
IEEE-ACM TRANSACTIONS ON NETWORKING, 2019, 27 (02) :662-675
[8]   QoS Guaranteed Network Slicing Orchestration for Internet of Vehicles [J].
Cui, Yaping ;
Huang, Xinyun ;
He, Peng ;
Wu, Dapeng ;
Wang, Ruyan .
IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (16) :15215-15227
[9]   MEC-Assisted Immersive VR Video Streaming Over Terahertz Wireless Networks: A Deep Reinforcement Learning Approach [J].
Du, Jianbo ;
Yu, F. Richard ;
Lu, Guangyue ;
Wang, Junxuan ;
Jiang, Jing ;
Chu, Xiaoli .
IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (10) :9517-9529
[10]   Service Characteristics-Oriented Joint Optimization of Radio and Computing Resource Allocation in Mobile-Edge Computing [J].
Feng, Jie ;
Liu, Lei ;
Pei, Qingqi ;
Hou, Fen ;
Yang, Tingting ;
Wu, Jinsong .
IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (11) :9407-9421