A Multi-Agent Deep Reinforcement Learning-Based Handover Scheme for Mega-Constellation Under Dynamic Propagation Conditions

被引：0

作者：

Liu, Haotian ^{[1
]}

Wang, Yichen ^{[1
]}

Li, Peixuan ^{[1
]}

Cheng, Julian ^{[2
]}

机构：

[1] Xi An Jiao Tong Univ, Sch Informat & Commun Engn, Xian 710049, Shaanxi, Peoples R China

[2] Univ British Columbia, Sch Engn, Kelowna, BC V1V 1V7, Canada

来源：

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS | 2024年 / 23卷 / 10期

基金：

中国国家自然科学基金;

关键词：

Satellites; Handover; Low earth orbit satellites; Satellite communications; Planetary orbits; Optimization; Heuristic algorithms; Satellite communication; low earth orbit (LEO) satellite; mega-constellation; satellite handover; multi-agent deep reinforcement learning; propagation condition; CHANNEL ALLOCATION TECHNIQUES; MOBILE SATELLITE CHANNELS; STATISTICAL-MODEL; POWER-CONTROL; MULTIBEAM; PERFORMANCE; SYSTEMS; STRATEGY; NETWORK; ACCESS;

D O I：

10.1109/TWC.2024.3407358

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

With the rapidly increasing number of satellites, the handover scheme design is critically important for the low Earth orbit (LEO) satellite networks, especially for the mega-constellations that include massive number of LEO satellites. However, the existing handover schemes for LEO satellite networks are designed based on the static propagation conditions, which cannot satisfy the dynamic feature of communication environment caused by the mobility of LEO satellites and users. To address this issue, a centralized adaptive intelligent handover scheme for mega-constellations is proposed, where the dynamics of the propagation conditions and limited LEO satellite capacity are taken into considerations. Specifically, we first use a three-state Markov model to characterize the dynamically varying propagation conditions between satellites and users. Then, the Loo model is employed to describe the dynamic land mobile satellite channels. By considering the user transmission rate requirement and the load-balancing demand of satellites, we design the user utility function and formulate an optimization problem that aims to maximize the overall long-term utility of the network. To reduce the handover decision-making complexity, a multi-agent successive hysteretic deep Q-learning algorithm is developed and it can efficiently solve the formulated problem by reducing the state and action space. To reduce the signaling overhead and the computation complexity of the proposed centralized handover scheme brought to the control center, a distributed intelligent handover scheme is further developed, where each user is enabled to independently make the handover decision only based on the local information. Simulation results show that both the proposed centralized and distributed approaches can efficiently improve the network performance over the existing schemes.

引用

页码：13579 / 13596

页数：18

共 56 条

[1] A new simple model for land mobile satellite channels: First- and second-order statistics
Abdi, A
Lau, WC
Alouini, MS
Kaveh, M
[J]. IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2003, 2 (03) : 519 - 528
[2] A multiagent approach to managing air traffic flow
Agogino, Adrian K.
Tumer, Kagan
[J]. AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2012, 24 (01) : 1 - 25
[3] Zero-Forcing Precoding With Partial CSI in Multibeam High Throughput Satellite Systems
Ahmad, Ishtiaq
Nguyen, Khoa D.
Letzepis, Nick
Lechner, Gottfried
Joroughi, Vahid
[J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2021, 70 (02) : 1410 - 1420
[4] Next Generation Mega Satellite Networks for Access Equality: Opportunities, Challenges, and Performance
Al Homssi, Bassel
Al-Hourani, Akram
Wang, Ke
Conder, Phillip
Kandeepan, Sithamparanathan
Choi, Jinho
Allen, Ben
Moores, Ben
[J]. IEEE COMMUNICATIONS MAGAZINE, 2022, 60 (04) : 18 - 24
[5] Session Duration Between Handovers in Dense LEO Satellite Networks
Al-Hourani, Akram
[J]. IEEE WIRELESS COMMUNICATIONS LETTERS, 2021, 10 (12) : 2810 - 2814
[6] Albulet M., 2017, Tech. Rep. SAT -LOA-20170301 -00027
[7] Predicting the visibility of LEO sate
Ali, I
Al-Dhahir, N
Hershey, JE
[J]. IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 1999, 35 (04) : 1183 - 1190
[8] Performance of the Multibeam Satellite Return Link With Correlated Rain Attenuation
Arnau, Jesus
Christopoulos, Dimitrios
Chatzinotas, Symeon
Mosquera, Carlos
Ottersten, Bjoern
[J]. IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2014, 13 (11) : 6286 - 6299
[9] Age-Oriented Access Control in GEO/LEO Heterogeneous Network for Marine IoRT: A Deep Reinforcement Learning Approach
Cai, Yi
Wu, Shaohua
Luo, Jiping
Jiao, Jian
Zhang, Ning
Zhang, Qinyu
[J]. IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (24) : 24919 - 24932
[10] Deep Reinforcement Learning For Multi-User Access Control in Non-Terrestrial Networks
Cao, Yang
Lien, Shao-Yu
Liang, Ying-Chang
[J]. IEEE TRANSACTIONS ON COMMUNICATIONS, 2021, 69 (03) : 1605 - 1619

← 1 2 3 4 5 6 →