Deep Reinforcement Learning-Based Policy for Baseband Function Placement and Routing of RAN in 5G and Beyond

被引:16
作者
Gao, Zhengguang [1 ,2 ]
Yan, Shuangyi [3 ]
Zhang, Jiawei [4 ]
Han, Bingtao [1 ]
Wang, Yongcheng [1 ]
Xiao, Yuming
Simeonidou, Dimitra [3 ]
Ji, Yuefeng
机构
[1] State Key Lab Mobile Network & Mobile Multimedia, Shenzhen 518055, Peoples R China
[2] Beijing Univ Posts & Telecommun, State Key Lab Informat Photon & Opt Commun, Beijing 100876, Peoples R China
[3] Univ Bristol, High Performance Networks Grp, Smart Internet Lab, Bristol BS8 1TH, Avon, England
[4] Beijing Univ Posts & Telecommun, State Key Lab Informat Photon & Opt Commun, Beijing 100876, Peoples R China
基金
中国国家自然科学基金; 北京市自然科学基金; 欧盟地平线“2020”;
关键词
Routing; 5G mobile communication; Heuristic algorithms; Bandwidth; Baseband; Computer architecture; Benchmark testing; 5G and beyond; baseband function placement and routing; deep reinforcement learning; C-RAN; NETWORKS; SERVICE; ARCHITECTURE; RADIO;
D O I
10.1109/JLT.2021.3110788
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we propose a deep reinforcement learning (DRL)-based algorithm to generate policies of Baseband Function (BBF) placement and routing. In order to explore the performance of the proposed algorithm in practical systems, the online scenario with the completely random requests is used in the simulation considering C-RAN and NG-RAN architectures. Besides, an Integer Linear Programming (ILP) model is formulated to generate the optimal solution as the benchmark. The simulation results show that DRL-based algorithm converges in a short time, and its performance closes to the optimal benchmark obtained by ILP in terms of latency and bandwidth for the online scenarios. In addition, the performance of the generated policies based on DRL is compared with a classic heuristic algorithm, i.e., first-fit algorithm. The performance of DRL-based algorithm is superior to the first-fit algorithm from above two perspectives. The fast convergence and the near-optimal performance prove that the DRL-based algorithm is a promising approach for the BBF placement and routing of RAN in 5G and Beyond.
引用
收藏
页码:470 / 480
页数:11
相关论文
共 35 条
  • [1] Next Generation 5G Wireless Networks: A Comprehensive Survey
    Agiwal, Mamta
    Roy, Abhishek
    Saxena, Navrati
    [J]. IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2016, 18 (03): : 1617 - 1655
  • [2] Alam Faiz, 2018, IEEE STANDARDS ASS T
  • [3] [Anonymous], 2017, 3GPP TS 38300 V041
  • [4] Multi-Tenant Provisioning for Quantum Key Distribution Networks With Heuristics and Reinforcement Learning: A Comparative Study
    Cao, Yuan
    Zhao, Yongli
    Li, Jun
    Lin, Rui
    Zhang, Jie
    Chen, Jiajia
    [J]. IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2020, 17 (02): : 946 - 957
  • [5] Energy-Efficient Baseband Unit Placement in a Fixed/Mobile Converged WDM Aggregation Network
    Carapellese, Nicola
    Tornatore, Massimo
    Pattavina, Achille
    [J]. IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2014, 32 (08) : 1542 - 1551
  • [6] Building Autonomic Elastic Optical Networks with Deep Reinforcement Learning
    Chen, Xiaoliang
    Proietti, Roberto
    Yoo, S. J. Ben
    [J]. IEEE COMMUNICATIONS MAGAZINE, 2019, 57 (10) : 20 - 26
  • [7] GAO Z, 2019, PAPER W2A22
  • [8] HIRAYAMA H, 2019, PROC IEEE 90 VEH TEC, P1, DOI DOI 10.1109/VTCFALL.2019.8891584
  • [9] Artificial intelligence-driven autonomous optical networks: 3S architecture and key technologies
    Ji, Yuefeng
    Gu, Rentao
    Yang, Zeyuan
    Li, Jin
    Li, Hui
    Zhang, Min
    [J]. SCIENCE CHINA-INFORMATION SCIENCES, 2020, 63 (06)
  • [10] Khorsandi BM, 2018, 22ND INTERNATIONAL CONFERENCE ON OPTICAL NETWORK DESIGN AND MODELING (ONDM 2018), P106, DOI 10.23919/ONDM.2018.8396115