A Deep-Reinforcement-Learning-Based Approach to Dynamic eMBB/URLLC Multiplexing in 5G NR

Citations: 55
Authors
Huang, Yan [1]
Li, Shaoran [1]
Li, Chengzhang [1]
Hou, Y. Thomas [1]
Lou, Wenjing [1]
Affiliations
[1] Virginia Polytech Inst & State Univ, Dept Elect & Comp Engn, Blacksburg, VA 24061 USA
Funding
National Science Foundation (USA)
Keywords
Multiplexing; Decoding; Optimization; Neural networks; Resource management; Approximation algorithms; 3GPP Standards; 5G NR; deep reinforcement learning (DRL); enhanced mobile broadband (eMBB); ultrareliable and low latency communication (URLLC) multiplexing; preemption; puncturing; resource allocation; DESIGN;
DOI
10.1109/JIOT.2020.2978692
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
This article investigates the dynamic multiplexing of enhanced mobile broadband (eMBB) and ultrareliable and low latency communications (URLLC) on the same channel in 5G NR. Because the two services operate on very different transmission time scales, URLLC uses a preemptive puncturing technique to multiplex its traffic onto eMBB traffic for transmission. The optimization problem is to minimize the adverse impact of such preemptive puncturing on eMBB users. We present DEMUX, a model-free deep reinforcement learning (DRL)-based solution to this problem. The essence of DEMUX is to use deep function approximators (neural networks) to learn an optimal algorithm for determining the preemption solution in each eMBB transmission time interval (TTI). Our novel contributions in the design of DEMUX include the first use of the DRL method with a large and continuous action domain for resource scheduling in NR, a mechanism to ensure fast and stable learning convergence by exploiting the intrinsic properties of the problem, and a mechanism to obtain a feasible preemption solution from the unconstrained output of a neural network while minimizing loss of information. The experimental results show that DEMUX significantly outperforms state-of-the-art algorithms proposed within the 3GPP standards body and in the literature.
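The abstract mentions turning the unconstrained output of a neural network into a feasible preemption solution but does not say how. The sketch below illustrates one generic way to do such a mapping. It is not the DEMUX mechanism. It assumes a simplified setting in which a network emits one real-valued score per eMBB user and a known integer number of resource units must be punctured for URLLC; the function name `project_to_feasible` and both parameters are hypothetical. A softmax turns the scores into shares, and largest-remainder rounding turns the shares into integers that sum to the demand.

```python
import math


def project_to_feasible(raw_scores, demand):
    """Map unconstrained real scores to non-negative integers summing to `demand`.

    Illustrative only: the names and the softmax + largest-remainder scheme are
    assumptions, not the mechanism used in the paper.
    """
    if demand < 0:
        raise ValueError("demand must be non-negative")
    if not raw_scores:
        raise ValueError("need at least one eMBB user")

    # Softmax, shifted by the maximum score for numerical stability.
    peak = max(raw_scores)
    weights = [math.exp(s - peak) for s in raw_scores]
    total = sum(weights)

    # Ideal (fractional) number of punctured units per user.
    ideal = [demand * w / total for w in weights]

    # Largest-remainder rounding: floor every share, then give the leftover
    # units to the users with the largest fractional parts.
    alloc = [math.floor(x) for x in ideal]
    leftover = demand - sum(alloc)
    order = sorted(range(len(ideal)), key=lambda i: ideal[i] - alloc[i], reverse=True)
    for i in order[:leftover]:
        alloc[i] += 1
    return alloc


if __name__ == "__main__":
    print(project_to_feasible([2.0, 0.0], 10))
```

Rounding by largest remainder keeps each integer allocation within one unit of its fractional share, which is one simple reading of "minimizing loss of information". A real scheduler would also need per-user caps and channel-dependent costs, which this sketch leaves out.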
Pages: 6439-6456
Page count: 18