A Multi-Agent Deep Reinforcement Learning-Based Handover Scheme for Mega-Constellation Under Dynamic Propagation Conditions

被引:0
作者
Liu, Haotian [1 ]
Wang, Yichen [1 ]
Li, Peixuan [1 ]
Cheng, Julian [2 ]
机构
[1] Xi An Jiao Tong Univ, Sch Informat & Commun Engn, Xian 710049, Shaanxi, Peoples R China
[2] Univ British Columbia, Sch Engn, Kelowna, BC V1V 1V7, Canada
基金
中国国家自然科学基金;
关键词
Satellites; Handover; Low earth orbit satellites; Satellite communications; Planetary orbits; Optimization; Heuristic algorithms; Satellite communication; low earth orbit (LEO) satellite; mega-constellation; satellite handover; multi-agent deep reinforcement learning; propagation condition; CHANNEL ALLOCATION TECHNIQUES; MOBILE SATELLITE CHANNELS; STATISTICAL-MODEL; POWER-CONTROL; MULTIBEAM; PERFORMANCE; SYSTEMS; STRATEGY; NETWORK; ACCESS;
D O I
10.1109/TWC.2024.3407358
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
With the rapidly increasing number of satellites, the handover scheme design is critically important for the low Earth orbit (LEO) satellite networks, especially for the mega-constellations that include massive number of LEO satellites. However, the existing handover schemes for LEO satellite networks are designed based on the static propagation conditions, which cannot satisfy the dynamic feature of communication environment caused by the mobility of LEO satellites and users. To address this issue, a centralized adaptive intelligent handover scheme for mega-constellations is proposed, where the dynamics of the propagation conditions and limited LEO satellite capacity are taken into considerations. Specifically, we first use a three-state Markov model to characterize the dynamically varying propagation conditions between satellites and users. Then, the Loo model is employed to describe the dynamic land mobile satellite channels. By considering the user transmission rate requirement and the load-balancing demand of satellites, we design the user utility function and formulate an optimization problem that aims to maximize the overall long-term utility of the network. To reduce the handover decision-making complexity, a multi-agent successive hysteretic deep Q-learning algorithm is developed and it can efficiently solve the formulated problem by reducing the state and action space. To reduce the signaling overhead and the computation complexity of the proposed centralized handover scheme brought to the control center, a distributed intelligent handover scheme is further developed, where each user is enabled to independently make the handover decision only based on the local information. Simulation results show that both the proposed centralized and distributed approaches can efficiently improve the network performance over the existing schemes.
引用
收藏
页码:13579 / 13596
页数:18
相关论文
共 56 条
  • [1] A new simple model for land mobile satellite channels: First- and second-order statistics
    Abdi, A
    Lau, WC
    Alouini, MS
    Kaveh, M
    [J]. IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2003, 2 (03) : 519 - 528
  • [2] A multiagent approach to managing air traffic flow
    Agogino, Adrian K.
    Tumer, Kagan
    [J]. AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2012, 24 (01) : 1 - 25
  • [3] Zero-Forcing Precoding With Partial CSI in Multibeam High Throughput Satellite Systems
    Ahmad, Ishtiaq
    Nguyen, Khoa D.
    Letzepis, Nick
    Lechner, Gottfried
    Joroughi, Vahid
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2021, 70 (02) : 1410 - 1420
  • [4] Next Generation Mega Satellite Networks for Access Equality: Opportunities, Challenges, and Performance
    Al Homssi, Bassel
    Al-Hourani, Akram
    Wang, Ke
    Conder, Phillip
    Kandeepan, Sithamparanathan
    Choi, Jinho
    Allen, Ben
    Moores, Ben
    [J]. IEEE COMMUNICATIONS MAGAZINE, 2022, 60 (04) : 18 - 24
  • [5] Session Duration Between Handovers in Dense LEO Satellite Networks
    Al-Hourani, Akram
    [J]. IEEE WIRELESS COMMUNICATIONS LETTERS, 2021, 10 (12) : 2810 - 2814
  • [6] Albulet M., 2017, Tech. Rep. SAT -LOA-20170301 -00027
  • [7] Predicting the visibility of LEO sate
    Ali, I
    Al-Dhahir, N
    Hershey, JE
    [J]. IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 1999, 35 (04) : 1183 - 1190
  • [8] Performance of the Multibeam Satellite Return Link With Correlated Rain Attenuation
    Arnau, Jesus
    Christopoulos, Dimitrios
    Chatzinotas, Symeon
    Mosquera, Carlos
    Ottersten, Bjoern
    [J]. IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2014, 13 (11) : 6286 - 6299
  • [9] Age-Oriented Access Control in GEO/LEO Heterogeneous Network for Marine IoRT: A Deep Reinforcement Learning Approach
    Cai, Yi
    Wu, Shaohua
    Luo, Jiping
    Jiao, Jian
    Zhang, Ning
    Zhang, Qinyu
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (24) : 24919 - 24932
  • [10] Deep Reinforcement Learning For Multi-User Access Control in Non-Terrestrial Networks
    Cao, Yang
    Lien, Shao-Yu
    Liang, Ying-Chang
    [J]. IEEE TRANSACTIONS ON COMMUNICATIONS, 2021, 69 (03) : 1605 - 1619