A Deep-Reinforcement-Learning-Based Approach to Dynamic eMBB/URLLC Multiplexing in 5G NR

被引：58

作者：

Huang, Yan ^{[1
]}

Li, Shaoran ^{[1
]}

Li, Chengzhang ^{[1
]}

Hou, Y. Thomas ^{[1
]}

Lou, Wenjing ^{[1
]}

机构：

[1] Virginia Polytech Inst & State Univ, Dept Elect & Comp Engn, Blacksburg, VA 24061 USA

来源：

IEEE INTERNET OF THINGS JOURNAL | 2020年 / 7卷 / 07期

基金：

美国国家科学基金会;

关键词：

Multiplexing; Decoding; Optimization; Neural networks; Resource management; Approximation algorithms; 3GPP Standards; 5G NR; deep reinforcement learning (DRL); enhanced mobile broadband (eMBB); ultrareliable and low latency communication (URLLC) multiplexing; preemption; puncturing; resource allocation; DESIGN;

D O I：

10.1109/JIOT.2020.2978692

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This article investigates the dynamic multiplexing of enhanced mobile broadband (eMBB) and ultrareliable and low latency communications (URLLC) on the same channel in 5G NR. Due to significant difference in transmission time scale, URLLC employs a preemptive puncturing technique to multiplex its traffic onto eMBB traffic for transmission. The optimization problem to solve is to minimize the adverse impact of such preemptive puncturing on eMBB users. We present DEMUX-a model-free deep reinforcement learning (DRL)-based solution to this problem. The essence of DEMUX is to use deep function approximators (neural networks) to learn an optimal algorithm for determining the preemption solution in each eMBB transmission time interval (TTI). Our novel contributions in the design of DEMUX include the first use of the DRL method with a large and continuous action domain for resource scheduling in NR, a mechanism to ensure fast and stable learning convergence by exploiting the intrinsic properties of the problem, and a mechanism to obtain a feasible preemption solution from the unconstrained output of a neural network while minimizing loss of information. The experimental results show that DEMUX significantly outperforms state-of-the-art algorithms proposed in the 3GPP standards body and the literature.

引用

页码：6439 / 6456

页数：18

共 35 条

[1]

3GPP, 2019, TS382115GNR 3GPP

[2]

Abadi Martin, 2015, TENSORFLOW LARGE SCA

[3]

Anand A, 2018, IEEE INFOCOM SER, P1979

[4]

[Anonymous], 2017, 3GPP TSG RAN WG1 M H

[5]

[Anonymous], 2017, 3GPP TSG RAN WG1 M V

[6]

[Anonymous], 2018, Rel-16, v16.1.0, TR 38.901

[7]

[Anonymous], 2017, M24100 ITU

[8]

[Anonymous], 2017, Rep. TR 38.804

[9]

[Anonymous], 2017, FINAL REPORT 3GPP TS

[10]

[Anonymous], 2019, Technical Report 3GPP TR 38.824 version 16.0.0 Release 16

← 1 2 3 4 →