A Reinforcement Learning based Game Theoretic Approach for Distributed Power Control in Downlink NOMA

Cited by: 1
Authors
Rauniyar, Ashish [1, 2]
Yazidi, Anis [2, 3]
Engelstad, Paal [1, 2]
Osterbo, Olav N. [4]
Affiliations
[1] Univ Oslo UiO, Dept Technol Syst, Oslo, Norway
[2] OsloMet Oslo Metropolitan Univ, Dept Comp Sci, Oslo, Norway
[3] Norwegian Univ Sci & Technol NTNU, Dept Comp Sci, Trondheim, Norway
[4] Telenor Res, Trondheim, Norway
Source
2020 IEEE 19TH INTERNATIONAL SYMPOSIUM ON NETWORK COMPUTING AND APPLICATIONS (NCA) | 2020
Keywords
Game Theory; NOMA; IoT; Power Allocation; Reinforcement Learning; Nash Equilibrium; NONORTHOGONAL MULTIPLE-ACCESS; OPPORTUNISTIC SPECTRUM ACCESS; 5G SYSTEMS; PERFORMANCE; NETWORKS; IOT;
DOI
10.1109/nca51143.2020.9306737
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology];
Subject classification code
0812;
Abstract
Optimal power allocation in wireless networks is known to be a complex optimization problem. In this paper, we present a simple and energy-efficient distributed power control scheme for downlink Non-Orthogonal Multiple Access (NOMA) using a Reinforcement Learning (RL) based game-theoretic approach. We consider a scenario in which multiple Base Stations (BSs) serve their respective Near User(s) (NU) and Far User(s) (FU). The aim of the game is to optimize the achievable-rate fairness of the BSs in a distributed manner by appropriately choosing the power levels of the BSs through trial and error. Using a utility based on the concept of marginal price costing, where a BS pays a virtual tax offsetting the interference its presence causes for the other BS, we design a potential game that meets this objective. As the RL scheme, we adopt Learning Automata (LA) for its simplicity and computational efficiency, and we derive analytical results showing the optimality and convergence of the game to a Nash Equilibrium (NE). Numerical results demonstrate the convergence of the proposed algorithm to a desirable equilibrium that maximizes fairness, and they confirm the correctness of the proposal through a thorough comparison with random and heuristic approaches.
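The learning loop described in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm: it assumes exactly two BSs, a Linear Reward-Inaction automaton as the LA variant, a sum-rate welfare in place of the paper's fairness objective (which the abstract does not specify), and it ignores the per-BS NOMA power split between near and far users. All names and numeric values (POWER_LEVELS, DIRECT_GAIN, CROSS_GAIN, NOISE, play) are hypothetical.

```python
import math
import random


class LearningAutomaton:
    """Linear Reward-Inaction automaton over a finite action set (assumed LA variant)."""

    def __init__(self, n_actions, learning_rate=0.05, rng=None):
        self.p = [1.0 / n_actions] * n_actions  # start from a uniform action distribution
        self.lr = learning_rate
        self.rng = rng or random.Random()

    def choose(self):
        # Sample an action index according to the current probabilities.
        return self.rng.choices(range(len(self.p)), weights=self.p)[0]

    def update(self, action, reward):
        # reward must lie in [0, 1]; shift probability mass towards the chosen action.
        step = self.lr * reward
        self.p = [(1.0 - step) * q for q in self.p]
        self.p[action] += step


# Hypothetical toy parameters, not taken from the paper.
POWER_LEVELS = [0.2, 0.6, 1.0]
DIRECT_GAIN, CROSS_GAIN, NOISE = 1.0, 0.3, 0.1


def rate(own_power, other_power):
    """Shannon rate of one BS, treating the other BS as interference."""
    return math.log2(1.0 + DIRECT_GAIN * own_power / (NOISE + CROSS_GAIN * other_power))


def utility(own_power, other_power):
    """Own rate minus a virtual tax: the rate the other BS loses because this BS transmits."""
    tax = rate(other_power, 0.0) - rate(other_power, own_power)
    return rate(own_power, other_power) - tax


def play(rounds=5000, seed=0):
    """Let two automata repeatedly pick power levels and learn from normalized utilities."""
    rng = random.Random(seed)
    automata = [LearningAutomaton(len(POWER_LEVELS), rng=rng) for _ in range(2)]
    values = [utility(a, b) for a in POWER_LEVELS for b in POWER_LEVELS]
    lo, hi = min(values), max(values)
    for _ in range(rounds):
        actions = [la.choose() for la in automata]
        powers = [POWER_LEVELS[a] for a in actions]
        for i, la in enumerate(automata):
            u = utility(powers[i], powers[1 - i])
            la.update(actions[i], (u - lo) / (hi - lo))  # scale the reward into [0, 1]
    return [la.p for la in automata]
```

Calling play() returns each BS's final probability vector over the power levels. Because each utility is the BS's marginal contribution to the sum rate, this toy game has the sum rate as its potential function, which is the general idea behind the virtual tax mentioned in the abstract.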
Pages: 10