Deep Reinforcement Learning for Optimization of RAN Slicing Relying on Control- and User-Plane Separation
Cited by: 1
Authors:
Tu, Haiyan [1]
Zhao, Liqiang [1,2]
Zhang, Yaoyuan [3]
Zheng, Gan [4]
Feng, Chen [5]
Song, Shenghui [6]
Liang, Kai [1]
Affiliations:
[1] Xidian Univ, State Key Lab Integrated Serv Networks, Xian 710071, Peoples R China
[2] Xidian Univ, Guangzhou Inst Technol, Guangzhou 510100, Peoples R China
[3] Hebei Univ, Coll Elect Informat Engn, Baoding 071002, Peoples R China
[4] Univ Warwick, Sch Engn, Coventry CV4 7AL, England
[5] Univ British Columbia, Sch Engn, Kelowna, BC V1V 1V7, Canada
[6] Hong Kong Univ Sci & Technol, Dept Elect & Comp Engn, Hong Kong, Peoples R China
Keywords:
Optimization;
Resource management;
Radio access networks;
Base stations;
Reinforcement learning;
Network slicing;
Deep learning;
Asynchronous advantage actor-critic (A3C);
control- and user-plane separation (CUPS);
Lyapunov optimization;
radio access network (RAN) slicing;
NETWORK;
EFFICIENT;
ORCHESTRATION;
PERFORMANCE;
INTERNET;
EMBB;
MEC;
DOI: 10.1109/JIOT.2023.3320434
Chinese Library Classification (CLC) number:
TP [Automation technology, computer technology];
Discipline classification code:
0812;
Abstract:
The rapid development of radio access network (RAN) slicing and control- and user-plane separation (CUPS) has created a new paradigm for future networks, namely, CUPS-based RAN slicing. In this article, we formulate the utility optimization problems of the CUPS-based RAN slicing system and propose a Lyapunov-based deep reinforcement learning (L-DRL) framework to solve them. Specifically, we propose that the control plane (CP) and user plane (UP) slices control their respective power and subcarrier resources. First, we provide coverage-driven slices in the CP for coverage control and data-driven slices in the UP for diverse user requests, and we account for the influence of the coverage-driven slices on the data-driven slices. Second, we define the system's utility as income minus cost and formulate the utility maximization problem of the UP as a mixed-integer nonlinear programming (MINLP) problem, which is NP-hard because it involves both continuous actions (deployment density and power allocation) and a discrete action (subcarrier allocation). Furthermore, we design an alternating optimization method for the CP and UP based on the deployment densities. Finally, we develop a novel L-DRL framework for mixed-action optimization problems and propose a specific Lyapunov-based asynchronous advantage actor-critic (L-A3C) algorithm. Simulation results demonstrate that the proposed L-A3C algorithm outperforms the standard A3C algorithm in terms of convergence while achieving higher performance than Lyapunov optimization. Moreover, the proposed CUPS-based RAN slicing scheme surpasses the benchmark RAN slicing schemes in terms of achievable rate and delay.
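The abstract gives no formulas, so the following is only a minimal sketch of how a Lyapunov-style reward for a mixed action might look, and it is not the paper's L-A3C algorithm. It assumes a single user, a Shannon-rate link model, a utility of income minus power cost, and one virtual queue that tracks unmet rate demand. All function names, parameters, and default values (`shannon_rate`, `utility`, `step`, `price`, `cost`, `v`) are hypothetical.

```python
import math


def shannon_rate(power, gain, noise, bandwidth):
    """Achievable rate on one subcarrier (assumed Shannon model)."""
    return bandwidth * math.log2(1.0 + power * gain / noise)


def utility(rate, power, price, cost):
    """Utility as income minus cost (assumed linear in rate and power)."""
    return price * rate - cost * power


def update_virtual_queue(queue, demand, rate):
    """Virtual queue: backlog grows when the rate falls short of demand."""
    return max(queue + demand - rate, 0.0)


def step(queue, action, gains, demand,
         noise=1.0, bandwidth=1.0, price=1.0, cost=0.5, v=2.0):
    """Evaluate one mixed action and return (reward, next_queue).

    action = (subcarrier, power): a discrete index and a continuous value.
    reward = V * utility - Q * (demand - rate), a drift-plus-penalty style
    signal that a DRL agent could maximize (an assumption, not the paper's
    reward definition).
    """
    subcarrier, power = action
    rate = shannon_rate(power, gains[subcarrier], noise, bandwidth)
    reward = v * utility(rate, power, price, cost) - queue * (demand - rate)
    return reward, update_virtual_queue(queue, demand, rate)
```

In this sketch, a larger `v` weights utility more heavily, while a larger queue backlog penalizes actions whose rate falls short of demand; an actor-critic agent would sample `subcarrier` from a categorical head and `power` from a continuous head.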