User Association in a VHetNet With Delayed CSI: A Deep Reinforcement Learning Approach

被引:10
作者
Khoshkbari, Hesam [1 ]
Sharifi, Sara [1 ]
Kaddoum, Georges [1 ,2 ]
机构
[1] Ecole Technol Super, Dept Elect Engn, Montreal, PQ H3C 1K3, Canada
[2] Lebanese Amer Univ, Cyber Secur Syst & Appl AI Res Ctr, Beirut 2801, Lebanon
关键词
Index Terms-HAPS; deep reinforcement learning; user association; Markov decision process; TERRESTRIAL NETWORKS;
D O I
10.1109/LCOMM.2023.3291613
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
Non-terrestrial base stations (NTBSs) must be employed for next-generation wireless networks to provide users with ubiquitous connectivity and a higher data rate. In vertical heterogeneous networks (VHetNets), associating users with either a terrestrial base station (TBS) or a NTBS to maximize the sum-rate of the network while accounting for the resource limitations that exist at the NTBS poses a challenge. Moreover, a practical user association method should be capable of working in a realistic situation in which instantaneous channel state information (CSI) is not available. To solve this problem, we propose a deep Q-learning (DQL) approach in which a satellite is our agent and schedules each user to a TBS or a high-altitude platform station (HAPS) in each time slot using the CSI of the previous time slot. The proposed method achieves nearly identical results as the exhaustive search action selection method. Furthermore, we investigate the effect of imperfect CSI and show our proposed method outperforms the convex optimization user association scheme in the presence of noisy CSI.
引用
收藏
页码:2257 / 2261
页数:5
相关论文
共 19 条
[1]   Binary adaptive coded pilot symbol assisted modulation over Rayleigh fading channels without feedback [J].
Abou-Fayçal, I ;
Médard, M ;
Madhow, U .
IEEE TRANSACTIONS ON COMMUNICATIONS, 2005, 53 (06) :1036-1046
[2]   Improvement of the Global Connectivity Using Integrated Satellite-Airborne-Terrestrial Networks With Resource Optimization [J].
Alsharoa, Ahmad ;
Alouini, Mohamed-Slim .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2020, 19 (08) :5088-5100
[3]  
Anicho O, 2019, 2019 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION, NETWORKS AND SATELLITE (COMNETSAT), P79, DOI [10.1109/COMNETSAT.2019.8844057, 10.1109/comnetsat.2019.8844057]
[4]  
[Anonymous], 2006, IEEE T WIRELESS COMM, V5, P2353
[5]   Service-Aware User Association and Resource Allocation in Integrated Terrestrial and Non-Terrestrial Networks: A Genetic Algorithm Approach [J].
Birabwa, Denise Joanitah ;
Ramotsoela, Daniel ;
Ventura, Neco .
IEEE ACCESS, 2022, 10 :104337-104357
[6]   Deep Reinforcement Learning For Multi-User Access Control in Non-Terrestrial Networks [J].
Cao, Yang ;
Lien, Shao-Yu ;
Liang, Ying-Chang .
IEEE TRANSACTIONS ON COMMUNICATIONS, 2021, 69 (03) :1605-1619
[7]   Integrated services from high-altitude platforms: A flexible communication system [J].
Falletti, E ;
Laddomada, M ;
Mondin, M ;
Sellone, F .
IEEE COMMUNICATIONS MAGAZINE, 2006, 44 (02) :124-133
[8]   Interference Management in Cellular-Connected Internet of Drones Networks With Drone-Pairing and Uplink Rate-Splitting Multiple Access [J].
Hassan, Md Zoheb ;
Kaddoum, Georges ;
Akhrif, Ouassima .
IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (17) :16060-16079
[9]   Using Lagrangian Relaxation for Radio Resource Allocation in High Altitude Platforms [J].
Ibrahim, Ahmed ;
Alfa, Attahiru S. .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2015, 14 (10) :5823-5835
[10]   Deep Q-Learning-Based Transmission Power Control of a High Altitude Platform Station with Spectrum Sharing [J].
Jo, Seongjun ;
Yang, Wooyeol ;
Choi, Haing Kun ;
Noh, Eonsu ;
Jo, Han-Shin ;
Park, Jaedon .
SENSORS, 2022, 22 (04)