Double-Timescale Multi-Agent Deep Reinforcement Learning for Flexible Payload in VHTS Systems

被引:0
作者
Feng, Linqing [1 ]
Zhang, Cheng [2 ]
Zhang, Qiuyang [1 ]
Zeng, Lingchao [2 ]
Qin, Pengfei [2 ]
Wang, Ying [1 ]
机构
[1] Beijing Univ Posts & Telecommun, State Key Lab Networking & Switching Technol, Beijing 100876, Peoples R China
[2] China Acad Space Technol, Inst Telecommun & Nav Satellites, Beijing 100094, Peoples R China
关键词
bandwidth allocation; power allocation; double timescale; multi-agent deep reinforcement learning; very-high-throughput satellite; TERRESTRIAL NETWORKS; RESOURCE-MANAGEMENT; SATELLITE; ALLOCATION; VISION; REQUIREMENTS; OPTIMIZATION; CHALLENGES; POWER;
D O I
10.3390/electronics13142764
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the expansion of the very-high-throughput satellite (VHTS) system, the uneven distribution of traffic demands in time and space has become increasingly significant and cannot be ignored. It is a significant challenge to efficiently and dynamically allocate scarce on-board resources to ensure capacity and demand matching. The advancement of flexible payload technology provides the possibility to overcome this challenge. However, computational complexity is increasing due to the unsynchronized resource adjustment and the time-varying demands of the VHTS system. Therefore, we propose a double-timescale bandwidth and power allocation (DT-BPA) scheme to effectively manage the available resources in the flexible payload architecture. We use a multi-agent deep reinforcement learning (MADRL) algorithm aiming to meet the time-varying traffic demands of each beam and improve resource utilization. The simulation results demonstrate that the proposed DT-BPA algorithm enhanced the matching degree of capacity and demand as well as reduced the system's power consumption. Additionally, it can be trained offline and implemented online, providing a more cost-effective solution for the VHTS system.
引用
收藏
页数:22
相关论文
共 40 条
  • [1] Flexible Resource Optimization for GEO Multibeam Satellite Communication System
    Abdu, Tedros Salih
    Kisseleff, Steven
    Lagunas, Eva
    Chatzinotas, Symeon
    [J]. IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2021, 20 (12) : 7888 - 7902
  • [2] Al-Hraishawi H., 2020, P ADV SAT MULT SYST, P1
  • [3] Power Allocation in Multibeam Satellite Systems: A Two-Stage Multi-Objective Optimization
    Aravanis, Alexis I.
    Shankar, Bhavani M. R.
    Arapoglou, Pantelis-Daniel
    Danoy, Gregoire
    Cottis, Panayotis G.
    Ottersten, Bjoern
    [J]. IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2015, 14 (06) : 3171 - 3182
  • [4] Bachir A.F.B.A., 2019, P INT C OPT APPL ICO, P1
  • [5] Stable Online Computation Offloading via Lyapunov-guided Deep Reinforcement Learning
    Bi, Suzhi
    Huang, Liang
    Wang, Hui
    Zhang, Ying-Jun Angela
    [J]. IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2021), 2021,
  • [6] Radio Resource Management Optimization of Flexible Satellite Payloads for DVB-S2 Systems
    Cocco, Giuseppe
    de Cola, Tomaso
    Angelone, Martina
    Katona, Zoltan
    Erl, Stefan
    [J]. IEEE TRANSACTIONS ON BROADCASTING, 2018, 64 (02) : 266 - 280
  • [7] Power allocation in multibeam satellites based on particle swarm optimization
    Durand, Fabio Renan
    Abrao, Taufik
    [J]. AEU-INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATIONS, 2017, 78 : 124 - 133
  • [8] Fenech H., 2017, 2017 11th European Conference on Antennas and Propagation (EUCAP), P2409, DOI 10.23919/EuCAP.2017.7928175
  • [9] Future technologies for very high throughput satellite systems
    Gaudenzi, Riccardo
    Angeletti, Piero
    Petrolati, Daniele
    Re, Emiliano
    [J]. INTERNATIONAL JOURNAL OF SATELLITE COMMUNICATIONS AND NETWORKING, 2020, 38 (02) : 141 - 161
  • [10] Giordani Marco, 2020, 2020 International Conference on Computing, Networking and Communications (ICNC), P383, DOI 10.1109/ICNC47757.2020.9049651