A Multi-Armed Bandit Algorithm for IRS-Aided VLC System Design With Device-to-Device Relays

被引:1
|
作者
Curry, Elam A. [1 ]
Borah, Deva K. [1 ]
机构
[1] New Mexico State Univ, Dept Elect & Comp Engn, Las Cruces, NM 88003 USA
关键词
Device-to-device communication; Mirrors; Radio frequency; Visible light communication; Relays; Downlink; Interference; intelligent reflecting surface; device-to-device communication; multi-armed bandit; VISIBLE-LIGHT COMMUNICATION; HETEROGENEOUS NETWORK; D2D; UPLINK; WIFI; OPTIMIZATION;
D O I
10.1109/ACCESS.2024.3354916
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a communications framework to overcome the connectivity constraints due to the nonavailability of the line-of-sight transmissions in indoor optical communication systems. This nonavailability can arise for various reasons, such as blockages due to physical objects, unfavorable device orientations or large distances between the transmitter and the receiving devices. The proposed system utilizes multiple intelligent reflecting surface (IRS) arrays and device-to-device (D2D) communications. The D2D communication is realized using infrared (IR) light-emitting diodes (LEDs) with limited output power for eye safety. The performance of this system depends significantly on the assignment of the mirrors in the IRS arrays to the appropriate user links and a direct combinatorial assignment search is too complex to implement. The proposed approach identifies the assignment of each mirror in the IRS arrays as a multi-armed bandit (MAB) problem, and the assignment of all the mirrors together as a combinatorial MAB (CMAB) problem. Since a simultaneous movement of all the IRS mirrors during the implementation of the CMAB algorithm could cause frequent link disruptions, a CMAB algorithm with low disruptions (CMAB-LD) is proposed to obtain the best mirror assignment with low link disruptions. Simulation results demonstrate that the proposed algorithm can provide significant improvement in reward performance and the total reward increases by more than 100% over random mirror assignments when the channels are blocked with high probabilities. In small size problems, the proposed CMAB-LD is found to achieve the global optimal solution in just a few rounds of full arm explore operations.
引用
收藏
页码:15764 / 15777
页数:14
相关论文
共 2 条
  • [1] A multi-armed bandit solver method for adaptive power allocation in device-to-device communication
    Khan, Muhidul Islam
    Alam, Muhammad Mahtab
    Le Moullec, Yannick
    9TH INTERNATIONAL CONFERENCE ON AMBIENT SYSTEMS, NETWORKS AND TECHNOLOGIES (ANT 2018) / THE 8TH INTERNATIONAL CONFERENCE ON SUSTAINABLE ENERGY INFORMATION TECHNOLOGY (SEIT-2018) / AFFILIATED WORKSHOPS, 2018, 130 : 1069 - 1076
  • [2] Robust Design of IRS-Aided Multi-Group Multicast System With Imperfect CSI
    Jiang, Weiheng
    Xiong, Peiyun
    Nie, Jiangtian
    Ding, Zhiguo
    Pan, Cunhua
    Xiong, Zehui
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2023, 22 (09) : 6314 - 6328