Deep Reinforcement Learning-Based Policy for Baseband Function Placement and Routing of RAN in 5G and Beyond

被引：16

作者：

Gao, Zhengguang ^{[1
,2
]}

Yan, Shuangyi ^{[3
]}

Zhang, Jiawei ^{[4
]}

Han, Bingtao ^{[1
]}

Wang, Yongcheng ^{[1
]}

Xiao, Yuming

Simeonidou, Dimitra ^{[3
]}

Ji, Yuefeng

机构：

[1] State Key Lab Mobile Network & Mobile Multimedia, Shenzhen 518055, Peoples R China

[2] Beijing Univ Posts & Telecommun, State Key Lab Informat Photon & Opt Commun, Beijing 100876, Peoples R China

[3] Univ Bristol, High Performance Networks Grp, Smart Internet Lab, Bristol BS8 1TH, Avon, England

[4] Beijing Univ Posts & Telecommun, State Key Lab Informat Photon & Opt Commun, Beijing 100876, Peoples R China

来源：

JOURNAL OF LIGHTWAVE TECHNOLOGY | 2022年 / 40卷 / 02期

基金：

中国国家自然科学基金; 北京市自然科学基金; 欧盟地平线“2020”;

关键词：

Routing; 5G mobile communication; Heuristic algorithms; Bandwidth; Baseband; Computer architecture; Benchmark testing; 5G and beyond; baseband function placement and routing; deep reinforcement learning; C-RAN; NETWORKS; SERVICE; ARCHITECTURE; RADIO;

D O I：

10.1109/JLT.2021.3110788

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

In this paper, we propose a deep reinforcement learning (DRL)-based algorithm to generate policies of Baseband Function (BBF) placement and routing. In order to explore the performance of the proposed algorithm in practical systems, the online scenario with the completely random requests is used in the simulation considering C-RAN and NG-RAN architectures. Besides, an Integer Linear Programming (ILP) model is formulated to generate the optimal solution as the benchmark. The simulation results show that DRL-based algorithm converges in a short time, and its performance closes to the optimal benchmark obtained by ILP in terms of latency and bandwidth for the online scenarios. In addition, the performance of the generated policies based on DRL is compared with a classic heuristic algorithm, i.e., first-fit algorithm. The performance of DRL-based algorithm is superior to the first-fit algorithm from above two perspectives. The fast convergence and the near-optimal performance prove that the DRL-based algorithm is a promising approach for the BBF placement and routing of RAN in 5G and Beyond.

引用

页码：470 / 480

页数：11

共 35 条

[1] Next Generation 5G Wireless Networks: A Comprehensive Survey
Agiwal, Mamta
Roy, Abhishek
Saxena, Navrati
[J]. IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2016, 18 (03): : 1617 - 1655
[2] Alam Faiz, 2018, IEEE STANDARDS ASS T
[3] [Anonymous], 2017, 3GPP TS 38300 V041
[4] Multi-Tenant Provisioning for Quantum Key Distribution Networks With Heuristics and Reinforcement Learning: A Comparative Study
Cao, Yuan
Zhao, Yongli
Li, Jun
Lin, Rui
Zhang, Jie
Chen, Jiajia
[J]. IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2020, 17 (02): : 946 - 957
[5] Energy-Efficient Baseband Unit Placement in a Fixed/Mobile Converged WDM Aggregation Network
Carapellese, Nicola
Tornatore, Massimo
Pattavina, Achille
[J]. IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2014, 32 (08) : 1542 - 1551
[6] Building Autonomic Elastic Optical Networks with Deep Reinforcement Learning
Chen, Xiaoliang
Proietti, Roberto
Yoo, S. J. Ben
[J]. IEEE COMMUNICATIONS MAGAZINE, 2019, 57 (10) : 20 - 26
[7] GAO Z, 2019, PAPER W2A22
[8] HIRAYAMA H, 2019, PROC IEEE 90 VEH TEC, P1, DOI DOI 10.1109/VTCFALL.2019.8891584
[9] Artificial intelligence-driven autonomous optical networks: 3S architecture and key technologies
Ji, Yuefeng
Gu, Rentao
Yang, Zeyuan
Li, Jin
Li, Hui
Zhang, Min
[J]. SCIENCE CHINA-INFORMATION SCIENCES, 2020, 63 (06)
[10] Khorsandi BM, 2018, 22ND INTERNATIONAL CONFERENCE ON OPTICAL NETWORK DESIGN AND MODELING (ONDM 2018), P106, DOI 10.23919/ONDM.2018.8396115

← 1 2 3 4 →