Towards Beam Hopping and Power Allocation in Multi-Beam Satellite Systems With Parameterized Reinforcement Learning

被引：3

作者：

Ran, Yongyi ^{[1
]}

Tan, Feng ^{[1
]}

Chen, Shuangwu ^{[2
]}

Lei, Jizhao ^{[3
]}

Luo, Jiangtao ^{[1
]}

机构：

[1] Chongqing Univ Posts & Telecommun, Sch Commun & Informat Engn, Chongqing 404100, Peoples R China

[2] Univ Sci & Technol China, Sch Informat Sci & Technol, Hefei 230027, Peoples R China

[3] China Satellite Network Grp Co Ltd, Chongqing 401147, Peoples R China

来源：

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY | 2024年 / 73卷 / 09期

基金：

中国国家自然科学基金;

关键词：

Resource management; Satellites; Throughput; Optimization; Propagation losses; Downlink; Vectors; Multi-beam satellite; deep reinforcement learning; parameterized action space; beam hopping; power allocation;

D O I：

10.1109/TVT.2024.3395509

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

The simultaneous optimisation of beam hopping and power allocation is a crucial technique for enhancing the performance of Multi-Beam Satellite (MBS) systems. However, the previous joint optimisation approaches cannot well handle with the issues of high-dimensional state space and discrete-continuous hybrid action space. In this paper, we propose a joint optimization approach based on parameterized reinforcement learning to simultaneously regulate beam hopping and power allocation for MBS systems (called DeepMBS). In DeepMBS, a multi-objective problem is firstly formulated to optimize system throughput and energy efficiency. Then, the optimization problem is modelled as a Markov Decision Process (MDP), and the original deep Q-network is extended with a parameterized action space to simultaneously determine the beam hopping (discrete action) and power allocation (continuous action). In addition, we design an empirical filtering mechanism to enhance the performance of DeepMBS. Finally, the results of extensive experiments demonstrate that the proposed DeepMBS can gain a better performance in terms of throughput and energy efficiency compared to the baseline algorithms. Furthermore, the proposed DeepMBS (EFM) algorithm demonstrates superior accuracy and sensitivity in capturing changes of communication demands.

引用

页码：14050 / 14055

页数：6

共 12 条

[1] Dynamic Resource Allocation for Beam Hopping Satellites Communication System: An Exploration [J].

Du, Xinqing ;

Hu, Xin ;

Wang, Yin ;

Wang, Weidong .

2022 IEEE INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS, TRUSTCOM, 2022, :1296-1301

[2] Balancing Total Energy Consumption and Mean Makespan in Data Offloading for Space-Air-Ground Integrated Networks [J].

He, Lijun ;

Li, Jiandong ;

Wang, Yanting ;

Zheng, Jiangbin ;

He, Liang .

IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (01) :209-222

[3] Dynamic Beam Hopping Method Based on Multi-Objective Deep Reinforcement Learning for Next Generation Satellite Broadband Systems [J].

Hu, Xin ;

Zhang, Yuchen ;

Liao, Xianglai ;

Liu, Zhijun ;

Wang, Weidong ;

Ghannouchi, Fadhel M. .

IEEE TRANSACTIONS ON BROADCASTING, 2020, 66 (03) :630-646

[4] Joint HAP Access and LEO Satellite Backhaul in 6G: Matching Game-Based Approaches [J].

Jia, Ziye ;

Sheng, Min ;

Li, Jiandong ;

Zhou, Di ;

Han, Zhu .

IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2021, 39 (04) :1147-1159

[5]

Maral G., 2020, Satellite Communications Systems: Systems, Techniques and Technology, DOI DOI 10.1002/9780470834985

[6] Optimizing Data Center Energy Efficiency via Event-Driven Deep Reinforcement Learning [J].

Ran, Yongyi ;

Zhou, Xin ;

Hu, Han ;

Wen, Yonggang .

IEEE TRANSACTIONS ON SERVICES COMPUTING, 2023, 16 (02) :1296-1309

[7] Adaptive Power Resource Allocation With Multi-Beam Directivity Control in High-Throughput Satellite Communication Systems [J].

Takahashi, Masaki ;

Kawamoto, Yuichi ;

Kato, Nei ;

Miura, Amane ;

Toyoshima, Morio .

IEEE WIRELESS COMMUNICATIONS LETTERS, 2019, 8 (04) :1248-1251

[8] Deep Reinforcement Learning-Based Hierarchical Time Division Duplexing Control for Dense Wireless and Mobile Networks [J].

Van Dat Tuong ;

Nhu-Ngoc Dao ;

Noh, Wonjong ;

Cho, Sungrae .

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2021, 20 (11) :7135-7150

[9] Joint Beam-Hopping Scheduling and Power Allocation in NOMA-Assisted Satellite Systems [J].

Wang, Anyue ;

Lei, Lei ;

Lagunas, Eva ;

Chatzinotas, Symeon ;

Perez Neira, Ana Isabel ;

Ottersten, Bjorn .

2021 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2021,

[10] Dynamic Beam Hopping of Multi-beam Satellite Based on Genetic Algorithm [J].

Wang, Libing ;

Hu, Xin ;

Ma, Shijun ;

Xu, Sujie ;

Wang, Weidong .

2020 IEEE INTL SYMP ON PARALLEL & DISTRIBUTED PROCESSING WITH APPLICATIONS, INTL CONF ON BIG DATA & CLOUD COMPUTING, INTL SYMP SOCIAL COMPUTING & NETWORKING, INTL CONF ON SUSTAINABLE COMPUTING & COMMUNICATIONS (ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM 2020), 2020, :1364-1370

← 1 2 →