On Reward Shaping Methods in Deep Reinforcement Learning for Radio Resource Management in Wireless Networks

被引：1

作者：

Kopic, Amna ^{[1
]}

Turbic, Kenan ^{[1
]}

Gacanin, Haris ^{[1
]}

机构：

[1] Rhein Westfal TH Aachen, Inst Commun Technol & Embedded Syst, Aachen, Germany

来源：

2023 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS, ICC WORKSHOPS | 2023年

关键词：

Power allocation; reinforcement learning; multi-carrier systems; POWER ALLOCATION;

D O I：

10.1109/ICCWORKSHOPS57953.2023.10283540

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper provides a comprehensive study on the learning models' power violation, sum-rate performance while taking into consideration power constraint, and computational efficiency in terms of training and execution times over a dynamic wireless channel. We propose a reward shaping method and modify learning models with the output scaling strategy to enforce them to fully respect the power constraints while optimizing the sum-rate performance. The proposed approach reaches close-to-optimal accuracy, i.e., up to 99.15%, while satisfying the predefined power constraint of the base station. Moreover, learning models are shown to be more computationally efficient compared to the traditional algorithm. However, solving the power allocation problem within the Orthogonal Frequency Division Multiplexing (OFDM) symbol duration of 16.7 mu s is a remaining challenge.

引用

页码：1020 / 1025

页数：6

共 18 条

[1] Deep Learning for Radio Resource Allocation in Multi-Cell Networks [J].

Ahmed, K., I ;

Tabassum, H. ;

Hossain, E. .

IEEE NETWORK, 2019, 33 (06) :188-195

[2] AUTONOMOUS WIRELESS SYSTEMS WITH ARTIFICIAL INTELLIGENCE A Knowledge Management Perspective [J].

Gacanin, Haris .

IEEE VEHICULAR TECHNOLOGY MAGAZINE, 2019, 14 (03) :51-59

[3] WIP: Demand-Driven Power Allocation in Wireless Networks with Deep Q-Learning [J].

Giannopoulos, A. ;

Spantideas, S. ;

Capsalis, N. ;

Gkonis, P. ;

Karkazis, P. ;

Sarakis, L. ;

Trakadas, P. ;

Capsalis, C. .

2021 IEEE 22ND INTERNATIONAL SYMPOSIUM ON A WORLD OF WIRELESS, MOBILE AND MULTIMEDIA NETWORKS (WOWMOM 2021), 2021, :248-251

[4]

Huang L., 2020, arXiv

[5] Deep-Learning-Based Wireless Resource Allocation With Application to Vehicular Networks [J].

Liang, Le ;

Ye, Hao ;

Yu, Guanding ;

Li, Geoffrey Ye .

PROCEEDINGS OF THE IEEE, 2020, 108 (02) :341-356

[6]

Lillicrap T. P, 2016, P 4 INT C LEARN REPR, P1

[7] Power Allocation in Multi-User Cellular Networks: Deep Reinforcement Learning Approaches [J].

Meng, Fan ;

Chen, Peng ;

Wu, Lenan ;

Cheng, Julian .

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2020, 19 (10) :6255-6267

[8] Multi-Agent Deep Reinforcement Learning for Dynamic Power Allocation in Wireless Networks [J].

Nasir, Yasar Sinan ;

Guo, Dongning .

IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2019, 37 (10) :2239-2250

[9]

Parkvall Stefan, 2017, IEEE Communications Standards Magazine, V1, P24, DOI 10.1109/MCOMSTD.2017.1700042

[10]

Patzold M., 2007, P INT S WIR PERS MUL, P394

← 1 2 →