Millimeter-Wave Concurrent Beamforming: A Multi-Player Multi-Armed Bandit Approach

被引:13
|
作者
Mohamed, Ehab Mahmoud [1 ,2 ]
Hashima, Sherief [3 ,4 ]
Hatano, Kohei [3 ,5 ]
Kasban, Hani [4 ]
Rihan, Mohamed [6 ]
机构
[1] Prince Sattam Bin Abdulaziz Univ, Coll Engn, Elect Engn Dept, Wadi Addwasir 11991, Saudi Arabia
[2] Aswan Univ, Fac Engn, Elect Engn Dept, Aswan 81542, Egypt
[3] RIKEN, Computat Learning Theory Team, Adv Intelligent Project, Fukuoka 8190395, Japan
[4] Egyptian Atom Energy Author, Engn Dept, Nucl Res Ctr, Cairo 13759, Egypt
[5] Kyushu Univ, Fac Arts & Sci, Fukuok 8190395, Japan
[6] Menoufia Univ, Fac Elect Engn, Elect & Elect Commun Engn, Menoufia 32952, Egypt
来源
CMC-COMPUTERS MATERIALS & CONTINUA | 2020年 / 65卷 / 03期
关键词
Millimeter wave (mmWave); concurrent transmissions; reinforcement learning; multiarmed bandit (MAB); BEAM ALIGNMENT; WIRELESS NETWORKS; COMMUNICATION;
D O I
10.32604/cmc.2020.011816
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The communication in the Millimeter-wave (mmWave) band, i.e., 30-300 GHz, is characterized by short-range transmissions and the use of antenna beamforming (BF). Thus, multiple mmWave access points (APs) should be installed to fully cover a target environment with gigabits per second (Gbps) connectivity. However, inter-beam interference prevents maximizing the sum rates of the established concurrent links. In this paper, a reinforcement learning (RL) approach is proposed for enabling mmWave concurrent transmissions by finding out beam directions that maximize the long-term average sum rates of the concurrent links. Specifically, the problem is formulated as a multiplayer multiarmed bandit (MAB), where mmWave APs act as the players aiming to maximize their achievable rewards, i.e., data rates, and the arms to play are the available beam directions. In this setup, a selfish concurrent multiplayer MAB strategy is advocated. Four different MAB algorithms, namely, epsilon-greedy, upper confidence bound (UCB), Thompson sampling (TS), and exponential weight algorithm for exploration and exploitation (EXP3) are examined by employing them in each AP to selfishly enhance its beam selection based only on its previous observations. After a few rounds of interactions, mmWave APs learn how to select concurrent beams that enhance the overall system performance. The proposed MAB based mmWave concurrent BF shows comparable performance to the optimal solution.
引用
收藏
页码:1987 / 2007
页数:21
相关论文
共 50 条
  • [1] Gateway Selection in Millimeter Wave UAV Wireless Networks Using Multi-Player Multi-Armed Bandit
    Mohamed, Ehab Mahmoud
    Hashima, Sherief
    Aldosary, Abdallah
    Hatano, Kohei
    Abdelghany, Mahmoud Ahmed
    SENSORS, 2020, 20 (14) : 1 - 22
  • [2] An Instance-Dependent Analysis for the Cooperative Multi-Player Multi-Armed Bandit
    Pacchiano, Aldo
    Bartlett, Peter
    Jordan, Michael
    INTERNATIONAL CONFERENCE ON ALGORITHMIC LEARNING THEORY, VOL 201, 2023, 201 : 1166 - 1215
  • [3] MAMBA: A Multi-armed Bandit Framework for Beam Tracking in Millimeter-wave Systems
    Aykin, Irmak
    Akgun, Berk
    Feng, Mingjie
    Krunz, Marwan
    IEEE INFOCOM 2020 - IEEE CONFERENCE ON COMPUTER COMMUNICATIONS, 2020, : 1469 - 1478
  • [4] Cooperative Multi-player Multi-Armed Bandit: Computation Offloading in a Vehicular Cloud Network
    Xu, Shilin
    Guo, Caili
    Hu, Rose Qingyang
    Qian, Yi
    IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2021), 2021,
  • [5] Decentralized Learning for Multi-player Multi-armed Bandits
    Kalathil, Dileep
    Nayyar, Naumaan
    Jain, Rahul
    2012 IEEE 51ST ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2012, : 3960 - 3965
  • [6] Decentralized Stochastic Multi-Player Multi-Armed Walking Bandits
    Xiong, Guojun
    Li, Jian
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 9, 2023, : 10528 - 10536
  • [7] Online Learning for Cooperative Multi-Player Multi-Armed Bandits
    Chang, William
    Jafarnia-Jahromi, Mehdi
    Jain, Rahul
    2022 IEEE 61ST CONFERENCE ON DECISION AND CONTROL (CDC), 2022, : 7248 - 7253
  • [8] Decentralized Multi-player Multi-armed Bandits with No Collision Information
    Shi, Chengshuai
    Xiong, Wei
    Shen, Cong
    Yang, Jing
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108
  • [9] A Multi-armed Bandit Approach to Distributed Robust Beamforming in Multicell Networks
    Zhang, Xinruo
    Nakhai, Mohammad Reza
    Ariffin, Wan Nur Suryani Firuz Wan
    2016 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2016,
  • [10] Multi-Armed Bandit for Link Configuration in Millimeter-Wave Networks: An Approach for Solving Sequential Decision-Making Problems
    Zhang, Yi
    Heath Jr, Robert W.
    IEEE VEHICULAR TECHNOLOGY MAGAZINE, 2023, 18 (02): : 39 - 49