Fairness-Aware Link Optimization for Space-Terrestrial Integrated Networks: A Reinforcement Learning Framework

被引:24
作者
Arani, Atefeh Hajijamali [1 ]
Hu, Peng [1 ,2 ]
Zhu, Yeying [1 ]
机构
[1] Univ Waterloo, Dept Stat & Actuarial Sci, Waterloo, ON N2L 3G1, Canada
[2] Natl Res Council Canada, Digital Technol Res Ctr, Ottawa, ON K1A 0R6, Canada
关键词
Resource management; Trajectory; Three-dimensional displays; Satellites; Internet; Throughput; Low earth orbit satellites; Space-terrestrial integrated networks; space-air-ground integrated networks; unmanned aerial vehicles; fairness; reinforcement learning; community networks; USER ASSOCIATION; BASE STATIONS; COVERAGE; ALLOCATION; DEPLOYMENT; PLACEMENT; LOCATION; ALTITUDE; DESIGN; NOMA;
D O I
10.1109/ACCESS.2021.3082862
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The integration of space and air components considering satellites and unmanned aerial vehicles (UAVs) into terrestrial networks in a space-terrestrial integrated network (STIN) has been envisioned as a promising solution to enhancing the terrestrial networks in terms of fairness, performance, and network resilience. However, employing UAVs introduces some key challenges, among which backhaul connectivity, resource management, and efficient three-dimensional (3D) trajectory designs of UAVs are very crucial. In this paper, low-Earth orbit (LEO) satellites are employed to alleviate the backhaul connectivity issues with UAV networks, where we address the problem of jointly determining backhaul-aware 3D trajectories of UAVs, resource management, and associations between users, satellites and base stations (BSs) in an STIN while satisfying ground users' quality-of-experience requirements and provisioning fairness concerning users' data rates. The proposed approach maximizes a novel objective function with joint consideration for BS's load and fairness, which can be categorized as a non-deterministic polynomial time hard (NP-hard) problem. To tackle this issue, we leverage a reinforcement learning framework, in which our problem is modeled as a multi-armed bandit problem. Accordingly, BSs learn the environment and its dynamics and then make decisions under an upper confidence bound based method. Simulation results show that our proposed approach outperforms the benchmark methods in terms of fairness, throughput, and load.
引用
收藏
页码:77624 / 77636
页数:13
相关论文
共 54 条
[1]   Trajectory Design and Power Allocation for Drone-Assisted NR-V2X Network With Dynamic NOMA/OMA [J].
Abbasi, Omid ;
Yanikomeroglu, Halim ;
Ebrahimi, Afshin ;
Yamchi, Nader Mokari .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2020, 19 (11) :7153-7168
[2]   Millimeter Wave Channel Modeling and Cellular Capacity Evaluation [J].
Akdeniz, Mustafa Riza ;
Liu, Yuanpeng ;
Samimi, Mathew K. ;
Sun, Shu ;
Rangan, Sundeep ;
Rappaport, Theodore S. ;
Erkip, Elza .
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2014, 32 (06) :1164-1179
[3]  
Alzenad M, 2018, IEEE GLOBE WORK
[4]  
[Anonymous], 2012, P14105 ITUR
[5]  
[Anonymous], 2019, ITU-T Y.3172
[6]  
[Anonymous], 2020, 28808 TR 3 GEN PARTN
[7]  
Arani A. H., 2020, P IEEE 31 ANN INT S, P1
[8]  
Arani A. H., 2021, P IEEE INT C COMM, P1
[9]   Minimizing Base Stations' ON/OFF Switchings in Self-Organizing Heterogeneous Networks: A Distributed Satisfactory Framework [J].
Arani, Atefeh Hajijamali ;
Omidi, Mohammad Javad ;
Mehbodniya, Abolfazl ;
Adachi, Fumiyuki .
IEEE ACCESS, 2017, 5 :26267-26278
[10]   Distributed Learning for Energy-Efficient Resource Management in Self-Organizing Heterogeneous Networks [J].
Arani, Atefeh Hajijamali ;
Mehbodniya, Abolfazl ;
Omidi, Mohammad Javad ;
Adachi, Fumiyuki ;
Saad, Walid ;
Guvenc, Ismail .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2017, 66 (10) :9287-9303