Deep Reinforcement Learning-Aided Optimization of Multi-Interface Allocation for Short-Packet Communications

被引:3
作者
De Oliveira, Hugo [1 ,2 ]
Kaneko, Megumi [1 ]
Boukhatem, Lila [2 ]
Fukuda, Ellen Hidemi [3 ]
机构
[1] Natl Inst Informat, Informat Syst Architecture Sci Res Div, Tokyo 1018430, Japan
[2] Univ Paris Saclay, CNRS, Lab Interdisciplinaire Sci Numer, F-91190 Gif Sur Yvette, France
[3] Kyoto Univ, Grad Sch Informat, Kyoto 6068501, Japan
关键词
Millimeter wave communication; Quality of service; Resource management; Ultra reliable low latency communication; Optimization; Interference; Delays; Beyond; 5G; sub-6; GHz; millimeter Wave; short-packet communications; deep reinforcement learning; resource allocation optimization; 5G; URLLC; EMBB; WAVE;
D O I
10.1109/TCCN.2023.3252661
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
The severe spectrum scarcity and the stringent requirements of Beyond 5G applications call for an integrated use of low frequency Sub-6 GHz and high frequency millimeter Wave bands. Focusing on future Internet of Things (IoT) Short-Packet Communications (SPC), this paper investigates the optimized usage of such diverse wireless interfaces. We propose an unifying framework devoted to SPC that jointly optimizes the user partitioning over each band, and the radio resource scheduling within each band. Leveraging Deep Reinforcement Learning (DRL) tools, the proposed method enables to better tackle the challenges imposed by dynamically varying mobile environments such as the Line-of-Sight situations of each link, and the heterogeneity of individual Quality of Service (QoS) requirements, such as rate, delay and reliability. Regarding the DRL-based user partitioning to each band, we have investigated three different types of partitioning actions to obtain a high network performance as well as a rapid convergence. Regarding the proposed sub-schedulers within each band, we designed two optimization methods, i.e., one that leverages Difference of Convex Programming (DCP) technique, and the second that accelerates convergence to a local optimum. Numerical evaluations show that the proposed methods outperform conventional approaches in terms of sum-rate and QoS outage probabilities.
引用
收藏
页码:738 / 753
页数:16
相关论文
共 23 条
[1]   6G and Beyond: The Future of Wireless Communications Systems [J].
Akyildiz, Ian F. ;
Kak, Ahan ;
Nie, Shuai .
IEEE ACCESS, 2020, 8 :133995-134030
[2]   Intelligent Resource Slicing for eMBB and URLLC Coexistence in 5G and Beyond: A Deep Reinforcement Learning Based Approach [J].
Alsenwi, Madyan ;
Tran, Nguyen H. ;
Bennis, Mehdi ;
Pandey, Shashi Raj ;
Bairagi, Anupam Kumar ;
Hong, Choong Seon .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2021, 20 (07) :4585-4600
[3]   On the Optimization of User Association and Resource Allocation in HetNets With mm-Wave Base Stations [J].
Chaieb, Cirine ;
Mlika, Zoubeir ;
Abdelkefi, Fatma ;
Ajib, Wessam .
IEEE SYSTEMS JOURNAL, 2020, 14 (03) :3957-3967
[4]  
Dinh Q. T., 2010, RECENT ADV OPTIMIZAT, P93
[5]   Toward Massive, Ultrareliable, and Low-Latency Wireless Communication With Short Packets [J].
Durisi, Giuseppe ;
Koch, Tobias ;
Popovski, Petar .
PROCEEDINGS OF THE IEEE, 2016, 104 (09) :1711-1726
[6]  
Ghanem W. R., 2019, IEEE INT CONF COMM, P1
[7]   A Deep-Reinforcement-Learning-Based Approach to Dynamic eMBB/URLLC Multiplexing in 5G NR [J].
Huang, Yan ;
Li, Shaoran ;
Li, Chengzhang ;
Hou, Y. Thomas ;
Lou, Wenjing .
IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (07) :6439-6456
[8]  
Ismael Rojgar Q., 2019, 2019 International Engineering Conference (IEC). Proceedings, P1, DOI 10.1109/IEC47844.2019.8950607
[9]  
Kasgari A. T. Z., 2019, ICC 2019 2019 IEEE I, P1
[10]  
Leblanc S., 2021, PROC IEEE INT C COMM, P1