Millimeter-Wave Concurrent Beamforming: A Multi-Player Multi-Armed Bandit Approach

Cited by: 13

Authors:

Mohamed, Ehab Mahmoud [1,2]

Hashima, Sherief [3,4]

Hatano, Kohei [3,5]

Kasban, Hani [4]

Rihan, Mohamed [6]

Affiliations:

[1] Prince Sattam Bin Abdulaziz Univ, Coll Engn, Elect Engn Dept, Wadi Addwasir 11991, Saudi Arabia

[2] Aswan Univ, Fac Engn, Elect Engn Dept, Aswan 81542, Egypt

[3] RIKEN, Computat Learning Theory Team, Adv Intelligent Project, Fukuoka 8190395, Japan

[4] Egyptian Atom Energy Author, Engn Dept, Nucl Res Ctr, Cairo 13759, Egypt

[5] Kyushu Univ, Fac Arts & Sci, Fukuoka 8190395, Japan

[6] Menoufia Univ, Fac Elect Engn, Elect & Elect Commun Engn, Menoufia 32952, Egypt

Source:

CMC-COMPUTERS MATERIALS & CONTINUA | 2020 / Vol. 65 / No. 3

Keywords:

Millimeter wave (mmWave); concurrent transmissions; reinforcement learning; multiarmed bandit (MAB); beam alignment; wireless networks; communication

DOI:

10.32604/cmc.2020.011816

Chinese Library Classification:

TP [Automation Technology, Computer Technology]

Discipline classification code:

0812

Abstract:

Communication in the millimeter-wave (mmWave) band, i.e., 30-300 GHz, is characterized by short-range transmissions and the use of antenna beamforming (BF). Thus, multiple mmWave access points (APs) should be installed to fully cover a target environment with gigabit-per-second (Gbps) connectivity. However, inter-beam interference prevents maximizing the sum rates of the established concurrent links. In this paper, a reinforcement learning (RL) approach is proposed for enabling mmWave concurrent transmissions by finding beam directions that maximize the long-term average sum rate of the concurrent links. Specifically, the problem is formulated as a multiplayer multiarmed bandit (MAB), where the mmWave APs act as players aiming to maximize their achievable rewards, i.e., data rates, and the arms to play are the available beam directions. In this setup, a selfish concurrent multiplayer MAB strategy is advocated. Four MAB algorithms, namely epsilon-greedy, upper confidence bound (UCB), Thompson sampling (TS), and the exponential-weight algorithm for exploration and exploitation (EXP3), are examined by running one in each AP so that it selfishly improves its beam selection based only on its own previous observations. After a few rounds of interaction, the mmWave APs learn to select concurrent beams that enhance the overall system performance. The proposed MAB-based mmWave concurrent BF shows performance comparable to the optimal solution.
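
The selfish multiplayer setup described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: it assumes two APs with three beams each, uses the standard UCB1 index as the per-AP learner, and relies on a hypothetical toy reward model (`GAINS`, `INTERFERENCE_FACTOR`, and `toy_rates` are invented for illustration, with rates normalized to [0, 1]). Each AP picks a beam concurrently and updates only from its own observed rate.

```python
import math
import random


class UCB1Player:
    """One AP treating its beam indices as arms; it sees only its own rewards."""

    def __init__(self, n_beams):
        self.counts = [0] * n_beams    # times each beam was played
        self.means = [0.0] * n_beams   # running mean reward per beam
        self.t = 0                     # rounds played so far

    def select(self):
        self.t += 1
        # Play each beam once before applying the confidence bound.
        for beam, n in enumerate(self.counts):
            if n == 0:
                return beam
        scores = [m + math.sqrt(2.0 * math.log(self.t) / n)
                  for m, n in zip(self.means, self.counts)]
        return max(range(len(scores)), key=scores.__getitem__)

    def update(self, beam, reward):
        self.counts[beam] += 1
        self.means[beam] += (reward - self.means[beam]) / self.counts[beam]


# Hypothetical toy numbers, not taken from the paper: normalized rate of each
# AP's beam without interference, and a penalty when both APs pick beam 0.
GAINS = [[0.9, 0.6, 0.3], [0.9, 0.5, 0.2]]
INTERFERENCE_FACTOR = 0.2


def toy_rates(beams, rng):
    """Return one noisy normalized rate in [0, 1] per AP for the chosen beams."""
    collide = beams[0] == 0 and beams[1] == 0
    rates = []
    for ap, beam in enumerate(beams):
        rate = GAINS[ap][beam] * (INTERFERENCE_FACTOR if collide else 1.0)
        rate += rng.uniform(-0.05, 0.05)
        rates.append(min(1.0, max(0.0, rate)))
    return rates


def simulate(rounds=2000, seed=0):
    """Run the concurrent game and return the players and the mean sum rate."""
    rng = random.Random(seed)
    players = [UCB1Player(3), UCB1Player(3)]
    total = 0.0
    for _ in range(rounds):
        beams = [p.select() for p in players]     # concurrent, no coordination
        rates = toy_rates(beams, rng)
        for p, beam, rate in zip(players, beams, rates):
            p.update(beam, rate)                  # selfish: own reward only
        total += sum(rates)
    return players, total / rounds


if __name__ == "__main__":
    players, avg_sum_rate = simulate()
    print("average sum rate:", round(avg_sum_rate, 3))
    print("beam play counts per AP:", [p.counts for p in players])
```

Swapping `UCB1Player` for an epsilon-greedy, Thompson sampling, or EXP3 learner with the same `select`/`update` interface would leave the `simulate` loop unchanged, which is one way to compare the four algorithms named in the abstract.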

Pages: 1987-2007

Page count: 21

50 items in total

[1] Gateway Selection in Millimeter Wave UAV Wireless Networks Using Multi-Player Multi-Armed Bandit
Mohamed, Ehab Mahmoud
Hashima, Sherief
Aldosary, Abdallah
Hatano, Kohei
Abdelghany, Mahmoud Ahmed
SENSORS, 2020, 20 (14) : 1 - 22
[2] An Instance-Dependent Analysis for the Cooperative Multi-Player Multi-Armed Bandit
Pacchiano, Aldo
Bartlett, Peter
Jordan, Michael
INTERNATIONAL CONFERENCE ON ALGORITHMIC LEARNING THEORY, VOL 201, 2023, 201 : 1166 - 1215
[3] MAMBA: A Multi-armed Bandit Framework for Beam Tracking in Millimeter-wave Systems
Aykin, Irmak
Akgun, Berk
Feng, Mingjie
Krunz, Marwan
IEEE INFOCOM 2020 - IEEE CONFERENCE ON COMPUTER COMMUNICATIONS, 2020 : 1469 - 1478
[4] Cooperative Multi-player Multi-Armed Bandit: Computation Offloading in a Vehicular Cloud Network
Xu, Shilin
Guo, Caili
Hu, Rose Qingyang
Qian, Yi
IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2021), 2021
[5] Decentralized Learning for Multi-player Multi-armed Bandits
Kalathil, Dileep
Nayyar, Naumaan
Jain, Rahul
2012 IEEE 51ST ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2012 : 3960 - 3965
[6] Decentralized Stochastic Multi-Player Multi-Armed Walking Bandits
Xiong, Guojun
Li, Jian
THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 9, 2023 : 10528 - 10536
[7] Online Learning for Cooperative Multi-Player Multi-Armed Bandits
Chang, William
Jafarnia-Jahromi, Mehdi
Jain, Rahul
2022 IEEE 61ST CONFERENCE ON DECISION AND CONTROL (CDC), 2022 : 7248 - 7253
[8] Decentralized Multi-player Multi-armed Bandits with No Collision Information
Shi, Chengshuai
Xiong, Wei
Shen, Cong
Yang, Jing
INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108
[9] A Multi-armed Bandit Approach to Distributed Robust Beamforming in Multicell Networks
Zhang, Xinruo
Nakhai, Mohammad Reza
Ariffin, Wan Nur Suryani Firuz Wan
2016 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2016
[10] Multi-Armed Bandit for Link Configuration in Millimeter-Wave Networks: An Approach for Solving Sequential Decision-Making Problems
Zhang, Yi
Heath Jr, Robert W.
IEEE VEHICULAR TECHNOLOGY MAGAZINE, 2023, 18 (02) : 39 - 49