Reinforcement Learning-Based Joint User Scheduling and Link Configuration in Millimeter-Wave Networks

被引：9

作者：

Zhang, Yi ^{[1
]}

Heath Jr, Robert W. W. ^{[2
]}

机构：

[1] Univ Texas Austin, Chandra Family Dept Elect & Comp Engn, Austin, TX 78731 USA

[2] North Carolina State Univ, Dept Elect & Comp Engn, Raleigh, NC 27695 USA

来源：

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS | 2023年 / 22卷 / 05期

关键词：

Millimeter wave communication; Relays; Training; Delays; Optimization; Dynamic scheduling; Wireless communication; Millimeter wave; mobility; user scheduling; relay selection; codebook selection; beam tracking; deep reinforcement learning; proximal policy optimization; multi-armed bandit; Thompson sampling; RELAY SELECTION; 5G; ALLOCATION; SYSTEMS;

D O I：

10.1109/TWC.2022.3215922

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

In this paper, we develop algorithms for joint user scheduling and three types of mmWave link configuration: relay selection, codebook optimization, and beam tracking in millimeter wave (mmWave) networks. Our goal is to design an online controller that dynamically schedules users and configures their links to minimize system delay. To solve this complex scheduling problem, we model it as a dynamic decision-making process and develop two reinforcement learning-based solutions. The first solution is based on deep reinforcement learning (DRL), which leverages the proximal policy optimization to train a neural network-based solution. Due to the potential high sample complexity of DRL, we also propose an empirical multi-armed bandit (MAB)-based solution, which decomposes the decision-making process into a sequential of sub-actions and exploits classic maxweight scheduling and Thompson sampling to decide those sub-actions. Our evaluation of the proposed solutions confirms their effectiveness in providing acceptable system delay. It also shows that the DRL-based solution has better delay performance while the MAB-based solution has a faster training process.

引用

页码：3038 / 3054

页数：17

共 55 条

[1]

Alrabeiah M, 2020, Arxiv, DOI [arXiv:2006.14501, 10.1109/TCOMM.2022.3168878]

[2]

[Anonymous], 2019, IEEE P802.11ay/D4.0, P1

[3]

[Anonymous], 2011, NIPS

[4]

[Anonymous], 2016, 802112012 IEEE, P1, DOI DOI 10.1109/IEEESTD.2016.7786995

[5]

[Anonymous], 2019, TS23501V1610 3GPP, P1

[6]

Aykin I, 2020, IEEE INFOCOM SER, P1469, DOI [10.1109/infocom41043.2020.9155408, 10.1109/INFOCOM41043.2020.9155408]

[7]

Balakrishnan R., 2019, PROC IEEE GLOBAL COM, P1

[8] Resource Allocation and Interference Management for Opportunistic Relaying in Integrated mmWave/sub-6 GHz 5G Networks [J].

Deng, Junquan ;

Tirkkonen, Olav ;

Freij-Hollanti, Ragnar ;

Chen, Tao ;

Nikaein, Navid .

IEEE COMMUNICATIONS MAGAZINE, 2017, 55 (06) :94-101

[9] Cooperative Beamforming With Predictive Relay Selection for Urban mmWave Communications [J].

Dimas, Anastasios ;

Kalogerias, Dionysios S. ;

Petropulu, Athina P. .

IEEE ACCESS, 2019, 7 :157057-157071

[10]

Elsayed M, 2018, IEEE GLOB COMM CONF

← 1 2 3 4 5 6 →