A Reinforcement Learning based Game Theoretic Approach for Distributed Power Control in Downlink NOMA

被引：1

作者：

Rauniyar, Ashish ^{[1
,2
]}

Yazidi, Anis ^{[2
,3
]}

Engelstad, Paal ^{[1
,2
]}

Osterbo, Olav N. ^{[4
]}

机构：

[1] Univ Oslo UiO, Dept Technol Syst, Oslo, Norway

[2] OsloMet Oslo Metropolitan Univ, Dept Comp Sci, Oslo, Norway

[3] Norwegian Univ Sci & Technol NTNU, Dept Comp Sci, Trondheim, Norway

[4] Telenor Res, Trondheim, Norway

来源：

2020 IEEE 19TH INTERNATIONAL SYMPOSIUM ON NETWORK COMPUTING AND APPLICATIONS (NCA) | 2020年

关键词：

Game Theory; NOMA; IoT; Power Allocation; Reinforcement Learning; Nash Equilibrium; NONORTHOGONAL MULTIPLE-ACCESS; OPPORTUNISTIC SPECTRUM ACCESS; 5G SYSTEMS; PERFORMANCE; NETWORKS; IOT;

D O I：

10.1109/nca51143.2020.9306737

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Optimal power allocation problem in wireless networks is known to be usually a complex optimization problem. In this paper, we present a simple and energy-efficient distributed power control in downlink Non-Orthogonal Multiple Access (NOMA) using a Reinforcement Learning (RL) based game theoretical approach. A scenario consisting of multiple Base Stations (BSs) serving their respective Near User(s) (NU) and Far User(s) (FU) is considered. The aim of the game is to optimize the achievable rate fairness of the BSs in a distributed manner by appropriately choosing the power levels of the BSs using trials and errors. By resorting to a subtle utility choice based on the concept of marginal price costing where a BS needs to pay a virtual tax offsetting the result of the interference its presence causes for the other BS, we design a potential game that meets the latter objective. As RL scheme, we adopt Learning Automata (LA) due to its simplicity and computational efficiency and derive analytical results showing the optimality and convergence of the game to a Nash Equilibrium (NE). Numerical results not only demonstrate the convergence of the proposed algorithm to a desirable equilibrium maximizing the fairness, but they also demonstrate the correctness of the proposal followed by thorough comparison with random and heuristic approaches.

引用

页数：10

共 39 条

[1] 3rd Generation Partnership Project (3GPP), 2015, TSG RAN M 67
[2] Abdulla AEAA, 2014, IEEE INFOCOM SER, P736, DOI 10.1109/INFOCOM.2014.6848000
[3] Game-theoretic power allocation algorithm for downlink NOMA cellular system
Aldebes, R.
Dimyati, K.
Hanafi, E.
[J]. ELECTRONICS LETTERS, 2019, 55 (25) : 1361 - 1363
[4] [Anonymous], 2003, P S THEOR COMP ASS
[5] [Anonymous], 2016, P 2016 IEEE INT C CO, DOI DOI 10.1139/CJM-2015-0350
[6] Choi J, 2018, EUR CONF NETW COMMUN, P54, DOI 10.1109/EuCNC.2018.8442662
[7] Daniel L., 2013, MATH MODELLING COMPU, P2
[8] On the Performance of Non-Orthogonal Multiple Access in 5G Systems with Randomly Deployed Users
Ding, Zhiguo
Yang, Zheng
Fan, Pingzhi
Poor, H. Vincent
[J]. IEEE SIGNAL PROCESSING LETTERS, 2014, 21 (12) : 1501 - 1505
[9] Learning automata-based solutions to the nonlinear fractional knapsack problem with applications to optimal resource allocation
Granmo, Ole-Christoffer
Oommen, B. John
Myrer, Svein Arild
Olsen, Morten Goodwin
[J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2007, 37 (01): : 166 - 175
[10] Han Z., 2019, Game Theory for Next Generation Wireless and Communication Networks: Modeling, Analysis, and Design

← 1 2 3 4 →