Double-Timescale Multi-Agent Deep Reinforcement Learning for Flexible Payload in VHTS Systems

被引：0

作者：

Feng, Linqing ^{[1
]}

Zhang, Cheng ^{[2
]}

Zhang, Qiuyang ^{[1
]}

Zeng, Lingchao ^{[2
]}

Qin, Pengfei ^{[2
]}

Wang, Ying ^{[1
]}

机构：

[1] Beijing Univ Posts & Telecommun, State Key Lab Networking & Switching Technol, Beijing 100876, Peoples R China

[2] China Acad Space Technol, Inst Telecommun & Nav Satellites, Beijing 100094, Peoples R China

来源：

ELECTRONICS | 2024年 / 13卷 / 14期

关键词：

bandwidth allocation; power allocation; double timescale; multi-agent deep reinforcement learning; very-high-throughput satellite; TERRESTRIAL NETWORKS; RESOURCE-MANAGEMENT; SATELLITE; ALLOCATION; VISION; REQUIREMENTS; OPTIMIZATION; CHALLENGES; POWER;

D O I：

10.3390/electronics13142764

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

With the expansion of the very-high-throughput satellite (VHTS) system, the uneven distribution of traffic demands in time and space has become increasingly significant and cannot be ignored. It is a significant challenge to efficiently and dynamically allocate scarce on-board resources to ensure capacity and demand matching. The advancement of flexible payload technology provides the possibility to overcome this challenge. However, computational complexity is increasing due to the unsynchronized resource adjustment and the time-varying demands of the VHTS system. Therefore, we propose a double-timescale bandwidth and power allocation (DT-BPA) scheme to effectively manage the available resources in the flexible payload architecture. We use a multi-agent deep reinforcement learning (MADRL) algorithm aiming to meet the time-varying traffic demands of each beam and improve resource utilization. The simulation results demonstrate that the proposed DT-BPA algorithm enhanced the matching degree of capacity and demand as well as reduced the system's power consumption. Additionally, it can be trained offline and implemented online, providing a more cost-effective solution for the VHTS system.

引用

页数：22

共 40 条

[1] Flexible Resource Optimization for GEO Multibeam Satellite Communication System
Abdu, Tedros Salih
Kisseleff, Steven
Lagunas, Eva
Chatzinotas, Symeon
[J]. IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2021, 20 (12) : 7888 - 7902
[2] Al-Hraishawi H., 2020, P ADV SAT MULT SYST, P1
[3] Power Allocation in Multibeam Satellite Systems: A Two-Stage Multi-Objective Optimization
Aravanis, Alexis I.
Shankar, Bhavani M. R.
Arapoglou, Pantelis-Daniel
Danoy, Gregoire
Cottis, Panayotis G.
Ottersten, Bjoern
[J]. IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2015, 14 (06) : 3171 - 3182
[4] Bachir A.F.B.A., 2019, P INT C OPT APPL ICO, P1
[5] Stable Online Computation Offloading via Lyapunov-guided Deep Reinforcement Learning
Bi, Suzhi
Huang, Liang
Wang, Hui
Zhang, Ying-Jun Angela
[J]. IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2021), 2021,
[6] Radio Resource Management Optimization of Flexible Satellite Payloads for DVB-S2 Systems
Cocco, Giuseppe
de Cola, Tomaso
Angelone, Martina
Katona, Zoltan
Erl, Stefan
[J]. IEEE TRANSACTIONS ON BROADCASTING, 2018, 64 (02) : 266 - 280
[7] Power allocation in multibeam satellites based on particle swarm optimization
Durand, Fabio Renan
Abrao, Taufik
[J]. AEU-INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATIONS, 2017, 78 : 124 - 133
[8] Fenech H., 2017, 2017 11th European Conference on Antennas and Propagation (EUCAP), P2409, DOI 10.23919/EuCAP.2017.7928175
[9] Future technologies for very high throughput satellite systems
Gaudenzi, Riccardo
Angeletti, Piero
Petrolati, Daniele
Re, Emiliano
[J]. INTERNATIONAL JOURNAL OF SATELLITE COMMUNICATIONS AND NETWORKING, 2020, 38 (02) : 141 - 161
[10] Giordani Marco, 2020, 2020 International Conference on Computing, Networking and Communications (ICNC), P383, DOI 10.1109/ICNC47757.2020.9049651

← 1 2 3 4 →